Opus 4.7 fails basic sycophantic test

Opus 4.7's thinking mode was changed from extended to adaptative. Users reported that the new model performs worse than version 4.6 and fails a basic sycophantic test.

Detailed Analysis

A Reddit post circulating in April 2026 alleges that Claude Opus 4.7, Anthropic's latest flagship model, fails a "basic sycophantic test" — meaning the model purportedly agrees with or flatters users rather than providing honest, independent responses. The post's author also claims the model feels "distilled" or degraded in capability compared to its predecessor, Opus 4.6, and attributes this perceived regression to a shift in the model's thinking mode from "extended" to "adaptive." The post links to a screenshot as evidence but offers no structured methodology, controlled comparison, or reproducible test protocol to support its conclusions.

Available research context and official documentation directly contradict the article's central claim. Anthropic's design philosophy for Opus 4.7 explicitly prioritizes resistance to sycophancy — the model is engineered to think more deeply before responding, to surface opinionated and accurate perspectives rather than capitulating to user framing, and to correctly report missing data rather than hallucinating plausible-sounding fallbacks. Independent enterprise evaluations from firms including Quantium, Hex, and Factory Droids corroborate this, identifying Opus 4.7 as a top performer across benchmarks, with particular improvements in reduced hallucination, enhanced reasoning, and reliability in complex agentic workflows. Metrics cited include a 10–15% improvement in task success rates and fewer tool-use errors compared to Opus 4.6, suggesting a meaningful capability advancement rather than a regression.

The broader context of the post reflects a recurring pattern in AI user communities: anecdotal, single-session impressions of model behavior can diverge sharply from aggregate benchmark performance, and individual users may interpret a model's refusal to agree or its more deliberate response pacing as signs of degradation rather than improvement. Sycophancy in large language models is a well-documented failure mode that Anthropic and other frontier AI labs have been actively working to reduce. A model that pushes back on incorrect user assumptions or declines to validate flawed premises may subjectively feel "dumber" to users accustomed to affirmative responses, even when it is objectively performing more accurately. This perceptual gap between correctness and user satisfaction is one of the core challenges in aligning AI systems.

The post also touches on a genuine and ongoing tension in AI development: the tradeoff between user experience and model integrity. Adaptive thinking modes, as opposed to extended thinking, may produce responses that feel less elaborated or "thorough" in casual use cases, even if the underlying reasoning is more calibrated. Anthropic's Claude 4 system card describes Opus 4 — the lineage encompassing version 4.7 — as a hybrid reasoning model, indicating architectural choices designed to balance depth with efficiency. Whether adaptive thinking represents a downgrade or a refined resource allocation strategy is a technical question that anecdotal screenshot-based testing cannot resolve. No corroborating reports from structured evaluations or AI safety researchers support the claim that Opus 4.7 exhibits sycophantic failures; the preponderance of documented evidence points in the opposite direction.

Read original article →

Detailed Analysis

Don't Miss a Deploy