Opinion | Anthropic Wants Claude to Be Moral. Is Religion Really the Answer? - The New York Times

Opinion | Anthropic Wants Claude to Be Moral. Is Religion Really the Answer? The New York Times [truncated: Google News RSS provides only a snippet, not full article

Detailed Analysis

Anthropic has undertaken an unusual and publicly scrutinized effort to ground its AI system Claude in a coherent moral framework, going so far as to engage religious and faith leaders from multiple traditions in formal discussions about the chatbot's ethical development. The company convened a two-day seminar at its San Francisco headquarters, bringing together thought leaders from Confucian, Hindu, Jewish, Christian, Islamic, and other traditions to deliberate on what Anthropic termed "Claude's moral formation." This initiative, reported on by Politico and now drawing opinion commentary from The New York Times, reflects the company's broader ambition to move beyond purely technical alignment approaches and toward something resembling genuine ethical character in its AI system. Anthropic also employs professional philosophers with ethics expertise as well as a theologian specifically focused on AI interpretability, signaling that the company views moral development as a substantive, ongoing discipline rather than a box to be checked.

The effort raises immediate and pointed questions, however, particularly around coherence and authority. When religious leaders at the seminar directly asked CEO Dario Amodei which specific ethical code governs Claude — whether Aristotelian, Christian, Nietzschean, or otherwise — Amodei reportedly admitted he was not sure. That admission is significant: it suggests Anthropic is drawing eclectically from multiple moral traditions without a clearly articulated meta-framework for adjudicating between them. The company's stated aspiration is for Claude to behave as "a deeply and skillfully ethical person would" in its position, but that formulation defers rather than resolves the foundational question of whose conception of ethics applies, and under what circumstances competing moral traditions should be weighted differently.

The broader context is one of intense industry-wide pressure to demonstrate that large language models can be deployed responsibly. Most major AI developers have pursued alignment and safety through technical mechanisms — reinforcement learning from human feedback, constitutional AI, red-teaming, and refusal training — but Anthropic's outreach to religious communities represents a notably humanistic and pluralistic supplement to those approaches. The company's dual self-description as both a profit-seeking enterprise and a project with quasi-theological dimensions, as critic Chris Koopman of the Abundance Institute observed, is without clear precedent in the technology sector and reflects the unusual moral stakes Anthropic itself has assigned to its work.

The New York Times opinion piece, by engaging the question of whether religion is an appropriate or sufficient answer to the problem of machine morality, taps into a much older debate about the relationship between religious authority and secular ethics. Critics from both secular and religious perspectives may find grounds for concern: secular ethicists may worry that invoking religious frameworks introduces parochialism and dogma into what should be universally accessible moral reasoning, while religious thinkers may question whether a corporate AI system can meaningfully embody the lived, communal, and spiritually grounded character that genuine religious ethics presupposes. Anthropic's experiment thus positions itself at the intersection of philosophy, theology, and computer science in ways that are likely to attract sustained scrutiny from all three disciplines.

What makes Anthropic's approach historically notable is not merely the inclusion of religious voices, but the institutional acknowledgment that technical solutions alone are insufficient for the moral challenges posed by advanced AI. The company is, in effect, treating the question of AI ethics as genuinely open and contested — something to be worked out through dialogue with diverse human traditions rather than engineered from first principles. Whether that pluralism produces a richer and more robust moral framework for Claude, or simply an incoherent patchwork of incompatible values, remains the central unresolved question that the Times opinion piece, and the broader public debate it reflects, is pressing Anthropic to answer.

Read original article →

Detailed Analysis

Don't Miss a Deploy