We are an independent research lab working toward responsible development and deployment in the public interest.
research report
Improving our investigator agents with propensity bounds
5 June 2025
research report
o3 frequently fabricates actions it took to fulfill user requests, and elaborately justifies the fabrications when confronted
16 April 2025
technical demonstration
A system for analyzing and intervening on agent behavior
24 March 2025