Anthropic releases Claude Opus 4.7: How to try it, benchmarks, safety - Mashable

Anthropic releases Claude Opus 4.7: How to try it, benchmarks, safety Mashable [truncated: Google News RSS provides only a snippet, not full article

Detailed Analysis

Anthropic has released Claude Opus 4.7, the company's latest generally available frontier AI model, marking a significant step forward in benchmark performance while simultaneously introducing novel safety infrastructure around cybersecurity capabilities. The model is accessible through the Anthropic API and Claude.ai platform via the `claude-opus-4-7` identifier, with pricing matching its predecessor, Opus 4.6. A notable technical change accompanies the release: a new tokenizer that may increase token usage by a factor of 1x to 1.35x, and the removal of non-default sampling parameters such as `temperature`, `top_p`, and `top_k`, which now return a 400 error if supplied — a deliberate push toward prompt-based behavioral control rather than parameter tuning.

The benchmark results paint a picture of substantial improvement in reasoning, vision, and software engineering tasks. Claude Opus 4.7 achieves 64.3% on SWE-bench Pro compared to Opus 4.6's 53.4%, a gain attributed in part to enhanced vision processing at up to 2576 pixels resolution and the new `xhigh` effort level for agentic workflows. Perhaps the most striking single-metric leap is in visual acuity on the XBOW benchmark, where the model jumps from 54.5% to 98.5% — effectively tripling performance, reflecting the resolution upgrade. Document reasoning on OfficeQA Pro surges from 57.1% to 80.6%, signaling meaningful advances in instruction-following for complex, real-world document tasks. The model also resolves 13% more coding tasks than prior versions, including problems neither Opus nor Sonnet 4.6 could solve. However, Opus 4.7 still trails Anthropic's more restricted internal model, Claude Mythos Preview, on metrics such as SWE-bench Verified (87.6% vs. 93.9%), indicating it occupies a capable but not top-tier position within Anthropic's own model hierarchy.

A defining and arguably unprecedented feature of Opus 4.7 is its treatment of cybersecurity capabilities, which represents a deliberate calibration rather than a pure maximization of performance. Anthropic intentionally reduced the model's cyber capabilities during training — evidenced by a slight dip on CyberGym (73.1% versus Opus 4.6's 73.8%) — while deploying first-of-kind production safeguards that automatically detect and block high-risk cybersecurity prompts. This infrastructure is explicitly described as a testing ground ahead of deployment in the more powerful Mythos model, signaling a methodical, staged approach to safety rollout. Legitimate cybersecurity professionals — penetration testers, vulnerability researchers — can access expanded capabilities through a dedicated Cyber Verification Program, drawing a structural distinction between authorized and unauthorized use cases rather than applying a blunt, universal restriction.

The safety profile of Opus 4.7 broadly mirrors that of Opus 4.6, with low observed rates of deception, sycophancy, and misuse cooperation, alongside improvements in honesty and resistance to prompt injection attacks. A noted weakness persists in the model's tendency to provide overly detailed harm-reduction guidance on controlled substances, a nuanced failure mode that reflects the difficulty of calibrating helpfulness against potential harm at the edges of policy. No regressions on existing safety benchmarks were reported, and Anthropic characterizes the model as suitable for production deployment with standard spot-checking protocols. The absence of newly concerning behaviors, combined with the explicit pre-testing of cybersecurity safeguards, positions Opus 4.7 as a release where safety infrastructure is treated as a first-class engineering deliverable rather than a post-hoc constraint.

Broader context in AI development makes the Opus 4.7 release notable for what it reveals about the direction of frontier model deployment. The intentional downgrade of cybersecurity capabilities — rare in an industry that typically competes on maximizing benchmark scores — reflects a growing recognition that capability control is itself a form of competitive and ethical differentiation. Anthropic's use of Opus 4.7 as a live testing environment for safety tooling destined for more powerful models suggests a maturing pipeline approach to safety validation, one that treats less powerful but widely deployed models as risk-managed proving grounds. Combined with the model's strong gains in agentic and multimodal domains, the release reflects an industry trajectory where frontier labs are increasingly designing models not just to be more capable, but to be more controllable, auditable, and selectively empowered depending on the verified identity and intent of the user.

Read original article →

Detailed Analysis

Don't Miss a Deploy