Hacker News
Precisely understand complex AI behaviors (transluce.org)
1 point by mooreds 43 days ago | past
Why does GPT-5.1 Codex underperform GPT-5 Codex on Terminal-Bench? (transluce.org)
9 points by mengk 50 days ago | past | 1 comment
Automatically Jailbreaking Frontier Language Models with Investigator Agents (transluce.org)
2 points by simonpure 7 months ago | past
Automatically Jailbreaking Frontier Language Models with Investigator Agents (transluce.org)
2 points by piotrgrabowski 7 months ago | past
Investigating truthfulness in a pre-release o3 model (transluce.org)
4 points by boleary-gl 11 months ago | past
Investigating truthfulness in a pre-release o3 model (transluce.org)
4 points by Luc 11 months ago | past
Investigating truthfulness in a pre-release o3 model (transluce.org)
5 points by Philpax 11 months ago | past | 1 comment
Docent: A system for analyzing and intervening on agent behavior (transluce.org)
4 points by brimtown on March 25, 2025 | past
Releasing AI-driven tools for understanding AI systems (transluce.org)
1 point by EvgeniyZh on Oct 24, 2024 | past
Monitor: An AI-Driven Observability Interface (transluce.org)
3 points by brimtown on Oct 23, 2024 | past | 1 comment
Show HN: Debugging LLM Failures Like "9.11 > 9.9" via Interpretability (transluce.org)
8 points by vvvhuang on Oct 23, 2024 | past | 1 comment
Scaling Automatic Neuron Description (Describing Every Neuron in Llama 3) (transluce.org)
8 points by ekzhang on Oct 23, 2024 | past | 1 comment
