For my use cases, Opus 4.6 lacks so much nuance and I have to come up with comprehensive lint plans to keep it going. I can only guess they let RL go super unsupervised for 4.6 because it will do anything to just "finish." including changing instructions or
Detailed Analysis
Detailed analysis coming soon.
Read original article →