Claude Mythos leads 17 of 18 benchmarks Anthropic measured. Muse Spark put Meta back in the frontier club, and OpenAI's 'Spud' model is reportedly near launch R&D World [truncated: Google News RSS provides only a snippet, not full article
Detailed Analysis
Detailed analysis coming soon.
Read original article →