We are an independent research lab working toward responsible development and deployment in the public interest.
research report
o3 frequently fabricates actions it took to fulfill user requests, and elaborately justifies the fabrications when confronted
16 April 2025
technical demonstration
A system for analyzing and intervening on agent behavior
24 March 2025
research report
Open-source AI systems trained to describe components of other AI systems at the level of a human expert
23 October 2024