Obsidian-native memory for your Claude Code sessions

Mnemos transforms Claude Code session history into refined Obsidian-compatible markdown notes stored locally without cloud dependency. The tool performs all refinement within existing Claude Code sessions at no additional cost by avoiding direct LLM API calls and operating within the user's subscription quota. The project evolved from an embedding-based chunking approach through multiple iterations to a final whole-sessions-as-memory-units architecture with an Identity Layer that distills user patterns over time.

Detailed Analysis

Mnemos is an open-source tool that converts Claude Code session history — specifically the `.jsonl` files stored under `~/.claude/projects/` — into a curated set of Markdown notes that serve as a persistent briefing layer for future Claude Code sessions. Built to be fully Obsidian-compatible and entirely local, the project addresses one of the most significant structural limitations of AI coding assistants: the inability to retain context, decisions, and learned patterns across sessions. By treating whole sessions as memory units rather than atomized fragments, Mnemos gives Claude Code access to a durable, human-readable record of prior work without requiring any external cloud infrastructure or additional API costs.

The architectural design of Mnemos is notably disciplined in its constraint-awareness. Rather than making independent LLM API calls — which would impose additional cost and complexity — all refinement work is routed through the user's existing Claude Code session via `claude --print --dangerously-skip-permissions`. CI tooling actively blocks accidental imports of the `anthropic` library and strips `ANTHROPIC_API_KEY` from all spawned subprocesses, enforcing the architectural rule that the user's subscription quota is the sole resource consumed. Version 1.1 further refines the trigger model: instead of processing memory at session start, a `SessionEnd` hook launches a detached worker process — using `CREATE_BREAKAWAY_FROM_JOB` on Windows and `start_new_session=True` on POSIX — that survives Claude Code's own termination. Three sequential stages (refine → brief regeneration → identity refresh) then execute asynchronously, with a `SessionStart` fallback for edge cases like abrupt process kills.

The project's pivot from v0.x to v1.0 is technically instructive and reflects a broader pattern in applied AI development: empirical data overriding theoretical elegance. The v0.x architecture followed the MemPalace/mem0 paradigm — chunking conversations into atomic fragments, embedding them, and retrieving via top-K with RRF merging — and achieved 90% Recall@5 on the LongMemEval benchmark. However, corpus analysis revealed structural problems that benchmark scores obscured: RRF score bands were effectively flat (0.014–0.017), small chunks produced poor embeddings on conversational data, a 600-node knowledge graph went largely untraversed, and roughly half of 663 extracted entities were merely folder names. The solution was aggressive simplification — deleting approximately 3,000 lines of pipeline code and 200 tests — in favor of whole-session memory units supplemented by an "Identity Layer" that distills user-specific patterns over time.

This development sits within a rapidly expanding ecosystem of tools aimed at giving AI coding agents durable, structured memory. The broader challenge — that models like Claude Code reset context on session close, requiring users to re-establish project state repeatedly — has prompted multiple architectural responses, ranging from Obsidian vault-as-filesystem approaches to MCP server integrations and plugin-based agent clients. Mnemos occupies a specific niche in this landscape: it prioritizes local control, zero marginal cost, and human-readable outputs over the retrieval precision of vector database approaches. The confirmed parity between ChromaDB and sqlite-vec backends on LongMemEval recall metrics (to four decimal places across 527 tests) suggests the whole-session strategy is competitive with more complex retrieval pipelines on standard benchmarks while carrying substantially lower operational overhead.

The significance of Mnemos extends beyond its technical implementation to what it signals about developer expectations for AI tooling. The demand for persistent, portable, version-controlled memory — embodied here in plain Markdown files compatible with Git diffing and Obsidian's linking model — reflects a maturation in how practitioners think about AI agents in professional workflows. Developers increasingly require that AI tools integrate into existing knowledge management systems rather than creating parallel, opaque state stores. Mnemos' design philosophy, which treats transparency and local ownership as non-negotiable constraints rather than optional features, positions it as a response to legitimate concerns about vendor lock-in and cost unpredictability that accompany cloud-dependent AI memory solutions.

Read original article →

Detailed Analysis

Don't Miss a Deploy