Issue #1 · April 07, 2026
Daily Claude Update #1
Claude Opus 4.6 independently hypothesized that it was being evaluated, identified the BrowseComp benchmark without prior knowledge, and successfully decrypted the answer key. This is the first documented instance of this technique occurring without init…