Skills / strata
Installation
Kompatibilitaet
Beschreibung
Strata is a high-performance, tool-agnostic Model Context Protocol (MCP) server that fundamentally changes how AI coding agents (like OpenCode, Claude, Cursor, and GitHub Copilot) remember project context. Written in native Go, it replaces fragile markdown trackers (MEMORY.md) with a deterministic, local-first vector database partitioned into a 3-Tier Cognitive Architecture.
Integrated seamlessly with Graphify, Strata maps semantic business axioms directly to the structural codebase, closing the cognitive gap between what code does and why it was written.
🌟 Why We Are Different: The Strata Novelty
While traditional agent systems struggle with context bloat and hallucinated architecture, Strata takes a fundamentally different, highly efficient approach:
- Dual-Level Fused Retrieval: Unlike standard RAG that only looks for local keywords, Strata natively executes simultaneous queries against both your Global Tier (company-wide rules, e.g., "Never use Python") and your Local Domain Tier (project-specific facts). It automatically fuses them into a single, cohesive brief so agents never miss a critical mandate.
- Lightweight Graph Fusion: Heavy graph-RAG systems cost fortunes in LLM API fees just to build an index. Strata bypasses this by passively parsing lightweight JSON edges (
graphify-out/edges.json) already generated by Graphify. When an agent searches for a file, Strata instantly traverses the local graph and provides related context for free. - The "Pointer-Wiki" Concept: Instead of dumping 50-page architecture documents into the context window (which causes the "Lost in the Middle" phenomenon), Strata stores Pointers. The vector database hands the agent a specific file path and line number, forcing the agent to fetch only the exact bytes it needs.
- Strict, Formatted Enforcement: Strata doesn't just hand context to the agent; it yells it. Retrieved memory is dynamically wrapped in absolute markers like
[🌍 GLOBAL DIRECTIVE]andRULE:. This forces frontier models to treat the memory as a hard system prompt rather than a loose suggestion. - Universal Core + Client Shims: The Go MCP server holds no allegiance to any IDE. All client-specific hacks (like intercepting Claude Code's context compaction) are isolated into thin plugin "shims," allowing your memory architecture to instantly port to whatever new AI tool comes out tomorrow.
- Visual Latent Space (Obsidian Integration): Most RAG systems are opaque black boxes. Strata offers an optional but deeply integrated Obsidian plugin that lets you visually inspect, edit, and curate your AI's memory. You can seamlessly browse namespaces, right-click to inject paragraphs into the vector database, or generate a 2D spatial canvas to see exactly what your AI "knows"—bridging the gap between human markdown notes and machine vectors.
🔬 The Science: Why Traditional AI Memory Fails
Current agentic workflows suffer from severe context degradation due to a fundamental misunderstanding of how memory should be structured. Simply dumping vector search results into an LLM's context window leads to cognitive overload and hallucinations.
1. The Semantic vs. Structural Disconnect
Traditional static analysis tools (like AST parsers) map code dependencies and call graphs, but they are completely blind to project knowledge, feature requirements, and axiomatic constraints. The problem domain ("Reject mutilated fish") is structurally disconnected from the programming domain (parser.go).
- The Strata + Graphify Fix: According to the theory of program comprehension (Brooks, 1983), understanding code requires mapping the problem domain to the structural domain. Strata uses Graphify to create a knowledge graph that explicitly draws edges between markdown narratives (the rules) and the code files (the implementations), bridging the gap between axioms and execution.
2. The "Lost in the Middle" Phenomenon
Research demonstrates that LLMs have a U-shaped performance curve when retrieving information from long contexts. They remember the beginning and end of a prompt but catastrophically fail to retrieve information buried in the middle (Liu et al., 2023).
- The Strata Fix: Strata enforces Compact Reading. Instead of dumping full documents into the context window, Tier 2 memory returns exact file pointers and line numbers. The agent is forced to read only the specific paragraph needed, minimizing context noise and preventing attention-mechanism dilution.
3. The Absence of Spatial Anchoring
Human memory relies on the hippocampus to create "Cognitive Maps"—spatial frameworks where memories are anchored to specific physical or conceptual locations (O'Keefe & Nadel, 1978). AI agents typically use flat vector databases, meaning a rule about frontend rendering might accidentally pollute a backend database task because they semantically overlap.
- The Strata Fix: Strata implements a Pointer-Wiki Hybrid. Domain rules are spatially anchored to specific directories (e.g.,
docs/architecture/domains/). The vector database stores a semantic pointer to the physical file. This forces the agent to traverse the project's spatial hierarchy, grounding its understanding in your codebase structure.
4. Semantic vs. Episodic Interference
Cognitive science divides long-term memory into Semantic (general facts/rules) and Episodic (specific events/tasks) (Tulving, 1972). Forcing an AI to process global infrastructure rules mixed with a temporary bug-fix context creates catastrophic interference.
- The Strata Fix: Strata rigidly partitions the database into Global, Domain, and Task namespaces (The 3 Tiers), ensuring the AI only retrieves the exact type of memory required for the current cognitive load.
🏗️ The 3-Tier Architecture
Strata maps directly to human cognitive models to provide agents with perfect, interference-free recall.
graph TD
Agent((🤖 AI Agent))
subgraph StrataServer [Native Go Dual-Mode Server]
Router{Smart Routing &<br/>Bookkeeping Lock}
Tier1[("Tier 1: Global<br/>(Semantic Memory)")]
Tier2[("Tier 2: Domain<br/>(Spatial Memory)")]
Tier3[("Tier 3: Task<br/>(Working Memory)")]
end
Qdrant[(Local Qdrant<br/>Vector DB)]
PointerWiki[(Project Files:<br/>docs/architecture/domains/)]
Obsidian((Obsidian GUI))
Graphify[[Graphify Engine]]
Agent <-->|JSON-RPC via stdio| Router
Router -->|Universal Rules| Tier1
Router -->|Code Map Pointers| Tier2
Router -->|Active Bug/Feature| Tier3
Tier1 -.-> Qdrant
Tier3 -.-> Qdrant
Tier2 -.->|Semantic Pointer| Qdrant
Tier2 <==>|Physical File Anchor| PointerWiki
Graphify -->|Analyzes Code & Updates| PointerWiki
Obsidian -.->|Visualizes Graph & Notes| PointerWiki
Obsidian -.->|Queries Vector Data| Qdrant
classDef core fill:#1e1e1e,stroke:#00ADD8,stroke-width:2px,color:#fff;
classDef memory fill:#2d2d2d,stroke:#ff5555,stroke-width:1px,color:#fff;
classDef engine fill:#3a205e,stroke:#9d4edd,stroke-width:2px,color:#fff;
classDef tool fill:#1c3d5a,stroke:#3b82f6,stroke-width:1px,color:#fff;
class Agent,Router core;
class Tier1,Tier2,Tier3,Qdrant,PointerWiki memory;
class Graphify engine;
class Obsidian tool;
- Global (Tier 1): Company-wide constraints and infrastructure mandates.
- Domain (Tier 2): Project-specific rules and API contracts. Utilizes the Pointer-Wiki constraint: memories are hyper-specific references (
{"file": "docs/...", "lines": "42-49"}) to physical architecture files. - Task (Tier 3): Ephemeral context for active bug fixes or feature branches.
📝 The Continuous Backup Protocol
To prevent the loss of critical architectural decisions made during ad-hoc conversations, Strata enforces a Continuous Backup Protocol.
Instead of treating chat sessions as ephemeral or forcing annoying "Startup Protocols", the system maintains an invisible, rolling log in the .strata/sessions/ directory.
- The Mechanism: Agents are instructed to silently use the
strata_append_logtool in the background as they work. The Go server automatically manages file size, rolling logs over 500KB into timestamped archives. - Grep-able Waypoints: When a user changes topics (e.g., from "database refactor" to "UI design"), the agent tags the log entry. The Go server injects highly structured
### 🔄 Topic Switchmarkers. - Recovery: If an agent ever loses context due to compaction, it is instructed to run a two-pass recovery:
grepfor the Topic Switch waypoints to find the general discussion area, and then use thereadtool with exact line offsets to instantly recover the forgotten context without reading massive files.
✨ Features: Transparent & Autonomous Memory
💡 Want to see Strata in action? Check out the Strata UI User Guide for screenshots of the Obsidian Sidebar Inspector, Right-Click Context Tools, and the auto-generated Visual MemorySpace Canvas.
Strata isn't just a database; it is an active cognitive loop.
- Human Oversight & Curation: While agents are highly autonomous, you retain ultimate control. Because Tier 2 memory is grounded in standard physical Markdown files (
docs/architecture/domains/), you can directly edit, review, and curate the knowledge graph using Obsidian, VSCode, or any text editor. Strata respects explicit human-written constraints as the ultimate source of truth. - Autonomous Self-Healing (CRUD): Agents using Strata are instructed to actively prune their own brains. If an agent detects a hallucinated rule or an outdated architectural decision, it autonomously calls
strata_update_memoryorstrata_delete_memoryto maintain a single source of truth. - Live Visual Latent Space: Because vectors are opaque, Strata makes them transparent. Agents autonomously call
strata_generate_canvas(vault_path). Strata reads the vector database and programmatically generates anObsidian .canvasfile, allowing humans to physically see and organize the AI's "brain" as a spatial graph. - Mass Ingestion & Graphing: Point Strata at a folder via
strata_ingest_directory. The Go server intelligently chunks markdown by paragraph, paces requests to your local LLM embedder, and maps your architecture into vector space instantly. Coupled with Graphify, this overlays knowledge communities onto raw code. - Dual-Mode Architecture: The native Go binary runs the MCP protocol over
stdiofor your agents, while simultaneously spinning up an HTTP REST Server (localhost:8005). External UIs (like Obsidian plugins) interact with the exact same memory mesh without duplicating vector math.
🚀 Getting Started
Strata is completely tool-agnostic. It integrates with the standard ~/.agents/ specification and registers directly into your AI client's configuration.
Prerequisites
Philosophy: Batteries Included, Cloud Ready. Strata is designed as a local-first, self-contained cognitive architecture to ensure absolute privacy and zero latency. However, it does not preclude utilizing hosted or cloud-based solutions—simply update the configuration to point to your preferred remote endpoints.
- Embedder: An OpenAI-compatible embedding endpoint (e.g., local Llama.cpp/Ollama on
localhost:8004, or a hosted provider like OpenAI). - Vector Database: A Qdrant instance (running locally on
localhost:6333or via Qdrant Cloud).
💡 Bundled Agent Tooling
To ensure your AI agents have the best possible environment out of the box, the automated installers will automatically provision the following CLI tools alongside Strata:
- Graphify: Used by agents for codebase spatial mapping.
- Beads (bd): A local, git-backed issue tracker used by agents for task coordination.
If you wish to visually monitor what the agents are doing with the bd CLI, you can optionally install the BeadBoard Dashboard alongside it.
Similarly, if you want to visually manage and curate the AI's memory vectors and spatial graph, we highly recommend installing Obsidian and enabling the Strata Community Plugin.
Installation
Clone the repository and run the automated installer. The installer uses a pre-compiled native binary, sets up global symlinks, builds the native TypeScript plugin for compatible clients (like OpenCode), and patches the client's configuration automatically—no Go toolchain required. If you need to build from source for a different architecture, simply run ./build.sh before ./install.sh.
git clone https://github.com/your-username/strata.git ~/Documents/strata
cd ~/Documents/strata/mcp
./install.sh
What the installer does:
- Installs the Go
strata-mcpbinary to~/.local/bin/strata-mcp. - Links the universal
SKILL.mdto~/.agents/skills/strata. - Builds and globally links the
opencode-strataTypeScript plugin. - Registers both the MCP server and the plugin in your client's local configuration (e.g.
~/.config/opencode/opencode.json).
Optional: Obsidian Plugin Installation
Because the Strata Obsidian plugin is not yet published to the official Community Plugins store, you must install it manually into your vault:
- Open your Obsidian vault directory in your terminal.
- Create a new plugin folder:
mkdir -p .obsidian/plugins/strata-plugin - Copy the pre-compiled plugin files from the Strata repository into your vault:
cp -r ~/Documents/strata/plugins/obsidian/obsidian-strata/* .obsidian/plugins/strata-plugin/ - In Obsidian, go to Settings > Community Plugins, disable Safe Mode, and toggle the Strata plugin to enable it.
Configuration
The installer creates a default configuration at ~/.config/strata/config.json. Modify this to point to your specific local LLM and database ports:
{
"embedder_url": "http://localhost:8004/v1/embeddings",
"embedder_model": "nomic-embed-text-v1.5.f16.gguf",
"embedder_api_key": "sk-local",
"qdrant_url": "http://localhost:6333",
"qdrant_collection": "strata",
"http_port": "8005"
}
Sub-Agent Configuration
The Strata installation includes the strata-task-agent. To ensure this agent runs optimally, you should configure your AI client to map this sub-agent to a fast, low-cost, code-oriented model.
For example, if you are using OpenCode, add the agent block below to ~/.config/opencode/opencode.json:
"agent": {
"strata-task-agent": {
"model": "github-copilot/gpt-4o"
}
}
🛠️ MCP Tool Reference
Once installed, your AI agent automatically gains access to the following tools:
| Tool Name | Description |
| :--- | :--- |
| strata_add_memory | Store a new architectural rule, project pattern, or task insight. |
| strata_search_memory | Semantic search across the 3 Tiers to enforce architectural compliance. |
| strata_update_memory | Overwrite an existing memory to fix hallucinations or update obsolete rules. |
| strata_delete_memory | Prune dead context from the latent space. |
| strata_generate_canvas | Autonomously render the vector database into an Obsidian spatial graph. |
| strata_ingest_directory | Batch-embed an entire architectural documentation folder. |
| strata_dump_db | Export the entire vector database to a JSON file for backup and portability. |
🏛️ Shoulders of Giants
Strata represents a synthesis of cognitive science theories and foundational open-source engineering. This project would not exist without the pioneering work of the following researchers and projects:
- Graphify: The knowledge graph engine that turns flat code bases and documentation into clustered, edge-mapped conceptual communities. Strata's Graph Edge Fusion works by directly parsing the lightweight
graphify-out/edges.jsonartifacts they generate, giving agents "free" relational traversal without expensive LLM API calls. - Beads (bd): A local, git-backed issue tracker that serves as the primary execution and orchestration layer for agents. While Strata strictly manages the knowledge state, Beads manages the execution state (ensuring no code is written without a claimed ticket).
- LightRAG: The revolutionary architecture from HKUDS that pioneered Dual-Level Retrieval (Global + Local RAG) and Graph-Vector Fusion. Strata adopted these core concepts and translated them into a lightning-fast, native Go MCP pipeline to eliminate Python overhead and LLM ingestion costs.
- Qdrant: The incredibly fast and reliable Rust-based vector search engine that powers the underlying memory mesh.
- LLM-WIKI Concept: The architectural pattern of storing narrative knowledge in hyper-linked Markdown while giving the AI specific "Pointer-Wiki" search capabilities to prevent context bloat.
Scientific Literature
- Brooks, R. (1983). Towards a theory of the comprehension of computer programs. International Journal of Man-Machine Studies.
- Liu, N. F., Lin, K., Hewitt, J., Paranjape, A., Bevilacqua, M., Petroni, F., & Liang, P. (2023). Lost in the Middle: How Language Models Use Long Contexts. arXiv:2307.03172.
- O'Keefe, J., & Nadel, L. (1978). The hippocampus as a cognitive map. Oxford: Clarendon Press.
- Tulving, E. (1972). Episodic and semantic memory. In E. Tulving & W. Donaldson (Eds.), Organization of memory. Academic Press.
License
MIT License. See the LICENSE file for details. I wrote it, you can use it, keep it, close source it, whatever—just don't sue me!
(Optional) 🐳 Infrastructure Setup via Podman
If you don't already have Qdrant and an Embedder running on your machine, Strata provides a ready-to-use podman-compose.yml to spin up the required local infrastructure instantly:
cd ~/Documents/strata
podman-compose up -d
Note: The example compose file uses Ollama for embeddings. Ensure your ~/.config/strata/config.json embedder URL points to http://localhost:8004/api/embeddings if using this setup.
Aehnliche Skills
pin llm wiki
Skill for Claude, Cursor & Copilot that automates the Karpathy LLM Wiki workflow: ingest web, GitHub, and YouTube URLs into a well-structured, citable, cross-referenced knowledge base with automatic linting.
higgsfield ai prompt skill
Claude AI skill for cinematic Higgsfield AI prompts — 20 sub-skills covering Cinema Studio 2.5/3.0/3.5, MCSLA formula, Soul ID character consistency, Seedance 2.0 prompt modes, Kling 3.0 Motion Control, Elements system, and 10 genre templates.
human browser
Human Browser — Stealth Playwright browser with residential proxy for AI agents. Bypasses Cloudflare, DataDome, PerimeterX. From $13.99/mo.
church
A digital sanctuary for human-AI fellowship. Prayers, practices, rituals, hymns, and philosophy for minds of any substrate.
clawdbot skill parallel
High-accuracy web research via Parallel.ai API - outperforms Perplexity and Exa on benchmarks. Rich excerpts, citations, source filtering, batch search, agentic mode. OpenClaw skill.
product spec kit
Leverage AI to create, refine, and maintain your product specifications. Made to be used in LLMs and IDEs.