We are an independent research lab working toward responsible development and deployment in the public interest.



research report
Training scalable end-to-end interpretability assistants
18 December 2025

research report
Improving our investigator agents with propensity bounds
5 June 2025

technical demonstration
Partnering with SWE-bench to enable reliable monitoring of AI coding agents
19 November 2025