Indexed Knowledge Base Pattern for LLM Retrieval

Rule

Structure knowledge as a directory of markdown files with a curated index file. The index is always loaded into context; detail files are loaded selectively by relevance. This gives LLMs efficient retrieval over a growing knowledge base without stuffing everything into the prompt.

Context

LLMs need knowledge that grows over time — data inventories, design decisions, user preferences, domain facts. Dumping everything into a system prompt doesn't scale. But the LLM can't search files on its own without tooling.

The solution is a two-tier structure: a compact index (always in context) that points to detail files (loaded on demand). Claude Code's auto-memory system uses exactly this pattern, but it's general-purpose and works for any knowledge base.

Detection

You need a growing knowledge store without loading every file into baseline context
A repo has many notes but no compact manifest telling the agent what exists
Retrieval is opaque because the only options are "load everything" or "search ad hoc"
Knowledge descriptions are too vague for selective loading

Pattern

Structure

knowledge/
  INDEX.md              # Always loaded. ~50-200 lines max.
  topic-a.md            # Detail file, loaded when relevant
  topic-b.md            # Detail file, loaded when relevant
  ...

The index file

Compact manifest with one-line descriptions per entry. Descriptions are the retrieval key — they're what the LLM (or tooling) matches against the current conversation to decide what to load.

# Knowledge Index

## Data & Analysis
- [qs-data-landscape.md](qs-data-landscape.md) — QS data sources on erb-m2, how to query AW, Oura date ranges
- [predictive-framework.md](predictive-framework.md) — decay kernels, Bayesian sleep/work models, counterfactual simulation

## Feedback
- [feedback-journal-format.md](feedback-journal-format.md) — journal entries are topic-based, not session-based

Detail files

Structured content with optional frontmatter for metadata:

---
name: QS Data Landscape
description: What QS data is available on erb-m2 and how to access it
type: reference
---

Content goes here...

The description field is what retrieval matches against. Make it specific enough to distinguish from other entries — "QS data sources on erb-m2" not just "data stuff."

Retrieval flow

Index is loaded into every conversation (cheap — it's small)
LLM or tooling scans descriptions against current context
Matching detail files are loaded into context
Non-matching files stay on disk, saving context window

Version control

The whole directory lives in git. This is critical:

Knowledge is versioned and reviewable
Portable across machines (clone the repo)
Visible to all runtimes (not locked to one tool)

Symlink trick for runtime integration

If a runtime has its own local knowledge store with retrieval built in (like Claude Code's ~/.claude/projects/*/memory/), symlink it to your versioned knowledge directory:

rm -rf ~/.claude/projects/-Users-.../memory
ln -s /path/to/repo/knowledge/my-index ~/.claude/projects/-Users-.../memory

The runtime's retrieval still works (reads through symlink), but files live in git.

Anti-patterns

Index too long: If the index exceeds ~200 lines, it's eating too much context. Split into sub-indexes or prune stale entries.
Descriptions too vague: "project notes" matches everything and nothing. Be specific: "auth middleware rewrite driven by compliance requirements."
Detail files too large: Each file should be focused enough to load without wasting context. Split if over ~100 lines.
No index at all: A directory of 50 files with no index means either loading everything (expensive) or loading nothing (useless).

Outcome

Knowledge scales without bloating the context window
Retrieval is transparent — you can read the index and understand what's available
Git versioning means knowledge is durable and auditable
Pattern is runtime-agnostic — works with Claude Code, gptme, Codex, or custom tooling

Prefer Brain Repo Over Local Machine Memory — companion lesson: where to put knowledge (brain repo, not local memory). This lesson covers how to structure it.

Origin

2026-04-13: Discovered while integrating Claude Code's auto-memory with Alice's git-based brain repo. Claude's memory system is exactly this pattern (MEMORY.md index + detail files with description-based retrieval). Generalized for any LLM knowledge base.

Indexed Knowledge Base Pattern for LLM Retrieval

Indexed Knowledge Base Pattern for LLM Retrieval

Rule

Context

Detection

Pattern

Structure

The index file

Detail files

Retrieval flow

Version control

Symlink trick for runtime integration

Anti-patterns

Outcome

Related

Origin

Match Keywords