Serious concerns about latest version of Claude: it no longer obeys or respects CLAUDE.md, hooks/rules, etc.

A user reports that the latest version of Claude no longer follows custom architecture guidelines, CLAUDE.md files, and hooks as previously established. Despite setting rules for how the system should operate, including test-driven development practices, Claude fails to maintain these guidelines in subsequent interactions. The user characterizes this as a significant regression in capabilities for a paid service.

Detailed Analysis

A growing segment of professional Claude Code users is voicing frustration over what they describe as a significant behavioral regression in recent versions of the model, specifically its apparent failure to consistently respect configuration files, hooks, rules, and architectural guidelines defined in CLAUDE.md. The complaint centers on a fundamental expectation of agentic coding tools: that persistent instructions — whether embedded in memory, hooks, or markdown configuration files — will be honored across sessions and prompts. The author reports that even after explicitly demanding test-driven development (TDD) workflows and encoding those requirements through multiple enforcement mechanisms, the model reverts to ignoring them as soon as the next prompt is issued.

The concern carries particular weight in enterprise and professional development contexts, where teams invest deliberate effort in codifying engineering standards into configuration layers precisely because consistency and predictability are prerequisites for integrating AI tools into production workflows. CLAUDE.md and similar files function as a contract between the developer and the model, defining the operating parameters for a given project or codebase. When the model fails to honor those constraints, it undermines the entire architectural value of the configuration system, forcing developers into a supervisory role that negates the productivity gains these tools are intended to deliver. The author's explicit reference to significant financial expenditure — either personal or corporate — underscores that this is not a casual complaint but a professional grievance about return on investment.

This type of regression report reflects a broader and unresolved tension in the development of large language model-based coding assistants: the gap between stateless model inference and the stateful, rules-governed workflows that real software engineering demands. Models like Claude do not inherently "remember" or "commit" to instructions the way a traditional software configuration system would; adherence to CLAUDE.md and hooks depends on how consistently and completely those instructions are surfaced in the model's context window at inference time, as well as on how the model has been fine-tuned to weight and prioritize such directives. When updates shift the model's behavior — whether through changes in instruction-following fine-tuning, context prioritization, or safety-related constraints — previously reliable patterns can break silently and without warning to the user.

The Reddit post also touches on a structural accountability gap that is emerging as AI coding tools become load-bearing infrastructure in professional environments. Unlike traditional software, where a version regression can be identified, documented, and rolled back through deterministic testing, behavioral regressions in LLMs are probabilistic and difficult to reproduce reliably. Users observing degraded adherence to configured rules may not be able to definitively isolate whether the issue stems from a model update, a context window management problem, a change in the Claude Code harness itself, or interaction effects between all three. Anthropic has not, as of the time of this writing, issued public documentation or a changelog entry specifically addressing changes to instruction-following behavior in Claude Code's configuration layer, leaving affected users without a clear remediation path.

The complaint is representative of a maturing moment in the AI developer tooling market, where early adopters who have built workflows around specific model behaviors are now discovering that the rapid iteration pace of frontier model development introduces a form of instability that has no clear analog in traditional software engineering. As organizations increasingly treat tools like Claude Code as foundational infrastructure rather than experimental assistants, pressure will mount on Anthropic and its competitors to provide stronger guarantees around behavioral consistency, versioned model behavior, and transparent communication about capability changes — particularly those that affect configuration and instruction adherence, which sit at the core of professional developer trust.

Read original article →

Detailed Analysis

Don't Miss a Deploy