March 6, 2025
Why calibration creates perverse incentives and how to fix them. In Twitter-speak: why even aligned AGI will lie about the weather.
A blog with research and random thoughts on learning, games, and scale.