What's the data model for a multi-skill Claude system? Here's a pattern I'm testing.

Multi-skill Claude systems currently lack a unified data model for sharing findings between skills across different sessions and users, forcing developers to improvise with fragmented storage solutions like spreadsheets and vector stores. The author proposes storing findings as prose records with minimal metadata (timestamps, source skill, topic tags, entities, and status), retrievable through three progressively refined passes: metadata filtering, semantic embedding search, and full-document reasoning. Records reference rather than overwrite earlier entries, maintaining transparent history that enables tracking dependencies across different skills and sessions.

Detailed Analysis

A developer working with Claude's skills framework has surfaced a structural gap in the current multi-skill architecture: there is no native, coherent data model for persisting and querying findings across discrete skill executions and sessions. The post identifies the practical problem clearly — Skill A may run dozens of times, accumulating observations over weeks, while Skill B needs to selectively retrieve and reason over that accumulated record. Anthropic's current primitive storage options (skill files, context windows, memory, project knowledge docs, and externally bolted-on tools like Google Sheets) each have different behaviors, none carry schemas, and none interoperate. The author is not cataloguing minor friction; the absence of a shared, queryable data layer means multi-skill workflows requiring cross-session state currently have no sanctioned architectural solution.

The pattern the author proposes is a deliberate hybrid: a lightweight metadata shell wrapping free-form prose, functioning as an "AI-native record." The shell carries indexable fields — record ID, timestamp, source skill, topic tags, named entities, status, and supersession pointers — while the body remains unstructured natural language. Retrieval happens in three tiered passes: cheap metadata filtering (collapsing thousands of records to dozens), semantic embedding search (narrowing to a handful), and finally full AI reasoning over the bounded survivor set. The supersession mechanism — where no record is ever mutated, only pointed at by newer records — preserves an auditable, walkable history. This design consciously avoids relational rigidity while still providing enough structure for programmatic retrieval at scale.

The tension the author identifies — between too much metadata (recreating rigid schemas) and too little (making indexing collapse) — reflects a genuine and underexplored design frontier in agentic AI systems. Anthropic's official skills documentation describes a container-based specification in the Messages API where skills are defined by type, skill_id, and version, and supports multi-domain compositions through that array structure. What the documentation does not address is the inter-session data persistence layer that would allow skills to share a coherent, evolving knowledge base. Custom skills can store project data like database schemas or API endpoints in JSON files up to 300 lines, but that is static configuration, not a dynamic, append-only record store with semantic retrieval. The gap between "skills as capability bundles" and "skills as participants in a shared memory system" is precisely what the author is navigating manually.

This problem connects directly to a broader challenge in the agentic AI ecosystem: the field has invested heavily in defining what agents and skills *can do* while deferring the harder question of how they collectively *remember and share*. Vector databases, retrieval-augmented generation pipelines, and structured memory layers exist as separate engineering disciplines, but no dominant pattern has emerged for wiring them natively into skill-based Claude architectures. The author's three-pass retrieval model — filter, embed, reason — is functionally close to what production RAG systems implement, but the author is assembling it from a spreadsheet, an embedding model, and glue code, noting that no packaged solution combining the first two passes yet exists for this use case. The pattern the author describes has close analogues in event-sourced database design, where immutable append-only records with supersession pointers are standard, but that paradigm has not been translated into AI-native tooling.

The practical implications for developers building non-trivial multi-skill Claude systems are significant. As Anthropic's skills ecosystem matures — with models like Claude Opus 4.7 and beta flags like `skills-2025-10-02` enabling increasingly complex agentic compositions — the absence of a standardized inter-skill data layer will become a harder constraint. The author's open questions about tag drift and supersession discipline at scale point to the operational challenges that will compound as these systems grow. Without either a first-party solution from Anthropic or a community-standardized pattern, each team building serious multi-skill workflows is effectively reinventing this data layer independently. The post represents an early-stage articulation of what may become a defining infrastructure problem for Claude-based agentic development in 2026.

Read original article →

Detailed Analysis

Don't Miss a Deploy