OpenAlice Academy — 03 · 08 / The LLM-maintained Wiki

01 / 08

The division of labour

You keep the judgement. The LLM does the bookkeeping.

A human wiki rots because nobody updates the index, nobody fixes the dangling link, nobody notices page A now contradicts page C. An LLM never gets bored. That swap — clerical tax for tireless bookkeeping — is the whole pitch.

human → curate · direct · ask · judge
LLM → read · summarise · cross-link · update index · file it all back

Karpathy's framing: "the tedious part of maintaining a knowledge base is not the reading or the thinking — it's the bookkeeping." LLMs don't forget to update a cross-reference. So hand them the clerical work and keep the part only a human can do.

IT IS AGENT MEMORY

A durable, structured place an agent reads from and writes to across sessions, threads, and months — the unsolved problem of agentic AI made tractable with plain files. Your own MEMORY.md + topic files are a per-agent LLM-wiki.

It is not a product or a library — it is a discipline. The cleverness lives in how you set up the layers and the rules, not in any code you install.

02 / 08

Compile-once vs retrieve-every-time

The wiki is your L1 cache.

Think of it as a cache hierarchy. The curated wiki is L1: small, fast, always loaded into context. RAG over a giant corpus is L2: large, slower, occasionally misses. Ask both the same question and watch where the cost lands.

FIG.02 — SAME QUESTION · TWO STRATEGIES · COST FLIPS

Strategy	Tier	Synthesis cost	Gets smarter?	Freshness
RAG · retrieve every time	L2	paid now	no, resets	always live
WIKI · compiled once	L1	paid once	yes, compounds	stale until re-ingested

RAG re-pays the synthesis cost on every query and never gets smarter. The wiki pays synthesis once at ingest and amortises it across all future reads. The tradeoff is freshness — a compiled fact is stale until re-ingested.
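The amortisation argument can be made concrete with a toy cost model. This is a sketch under stated assumptions: the function names and the unit costs (`synth_cost` for one synthesis, `read_cost` for reading an already-compiled page) are illustrative, not measurements from any real system.

```python
def rag_total_cost(queries: int, synth_cost: float) -> float:
    """RAG re-synthesises on every query, so total cost is queries * synth_cost."""
    return queries * synth_cost


def wiki_total_cost(queries: int, synth_cost: float, read_cost: float) -> float:
    """The wiki pays synthesis once at ingest, then a cheaper read per query."""
    return synth_cost + queries * read_cost


def break_even_queries(synth_cost: float, read_cost: float) -> float:
    """Query count at which both strategies have cost the same.

    Solves q * synth_cost == synth_cost + q * read_cost for q.
    """
    if read_cost >= synth_cost:
        raise ValueError("the wiki only wins if a read is cheaper than a synthesis")
    return synth_cost / (synth_cost - read_cost)


if __name__ == "__main__":
    # Hypothetical unit costs: one synthesis = 10 units, one compiled-page read = 1 unit.
    for q in (1, 2, 10, 100):
        print(q, rag_total_cost(q, 10.0), wiki_total_cost(q, 10.0, 1.0))
    print("break-even at", break_even_queries(10.0, 1.0), "queries")
```

The model deliberately ignores the freshness side of the tradeoff: it prices re-reading a compiled page, not the cost of re-ingesting when a source changes.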

Below a certain size you only need L1: direct file-reading is simpler, more reliable, and cheaper per query than any RAG pipeline. The smart move at scale is hybrid — curated wiki as L1, RAG as L2.
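A minimal sketch of the hybrid lookup, assuming the two tiers are exposed as plain callables. `hybrid_answer`, `toy_wiki_lookup`, and `toy_rag_lookup` are hypothetical names invented for this illustration; a real L2 would be an actual retrieval pipeline.

```python
from typing import Callable, Optional


def hybrid_answer(
    question: str,
    wiki_lookup: Callable[[str], Optional[str]],
    rag_lookup: Callable[[str], str],
) -> tuple[str, str]:
    """Try the curated wiki (L1) first; fall back to RAG (L2) on a miss.

    Returns (answer, tier) so callers can see which layer served the query.
    """
    hit = wiki_lookup(question)
    if hit is not None:
        return hit, "L1"
    return rag_lookup(question), "L2"


# Hypothetical stand-ins for the two tiers.
WIKI = {"why do wikis rot": "Nobody does the bookkeeping."}


def toy_wiki_lookup(question: str) -> Optional[str]:
    # Normalise case and trailing punctuation before the exact-match lookup.
    return WIKI.get(question.lower().strip("? "))


def toy_rag_lookup(question: str) -> str:
    return f"(retrieved and synthesised from the raw corpus for: {question})"


if __name__ == "__main__":
    print(hybrid_answer("Why do wikis rot?", toy_wiki_lookup, toy_rag_lookup))
    print(hybrid_answer("What is BM25?", toy_wiki_lookup, toy_rag_lookup))
```

Returning the tier alongside the answer makes L1 misses visible, which is one way to notice which topics deserve a compiled page.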

"Query Atlas before grepping" is literally "read the compiled wiki before re-deriving from raw." You are running this pattern every time you use the lab's knowledge base.

03 / 08

The architecture · raw · wiki · schema

Three layers. One of them is the real product.

The whole thing is three folders on disk. Click each layer to see what it owns — and who's allowed to write to it.

raw/ · immutable sources · layer 1 · you curate

Articles, papers, images, data files. The single source of truth. The LLM reads from it but never modifies it — so a wrong wiki page can always be re-derived from an untouched source, and you always know what is primary input vs derived synthesis.

wiki/ · the compiled markdown · layer 2 · LLM owns it

Summary pages, entity pages, concept pages, synthesis docs — all cross-linked with [[wikilinks]]. Two special files stop it rotting: index.md (a catalog of every page, the LLM's map) and log.md (an append-only audit trail of every ingest, query, and lint).

CLAUDE.md · the schema · layer 3 · the real product

A config doc (vendor-neutral name: AGENTS.md) that turns a generic LLM into a disciplined knowledge worker: structure, naming, page templates, conflict-handling rules, and the exact workflows for ingest / query / lint. The community consensus is blunt — a bad schema produces a confidently wrong wiki. The schema is where your domain knowledge lives.

// the entire on-disk shape
raw/          ← immutable source documents
wiki/         ← LLM-compiled markdown
  index.md    ← catalog · the map
  log.md      ← append-only trail
CLAUDE.md     ← schema · governs behaviour
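The layout is small enough to scaffold in a few lines. The sketch below is one possible way to do it with the standard library; `scaffold_vault` and the seed text written into each file are assumptions made for illustration, since the source only specifies the folder and file names.

```python
from pathlib import Path
import tempfile


def scaffold_vault(root: Path) -> None:
    """Create the three-layer layout: raw/, wiki/ (index.md, log.md), CLAUDE.md."""
    (root / "raw").mkdir(parents=True, exist_ok=True)
    wiki = root / "wiki"
    wiki.mkdir(parents=True, exist_ok=True)
    # Only create files that are missing, so re-running never clobbers content.
    for path, seed in [
        (wiki / "index.md", "# Index\n"),
        (wiki / "log.md", "# Log\n"),
        (root / "CLAUDE.md", "# Schema\n"),
    ]:
        if not path.exists():
            path.write_text(seed, encoding="utf-8")


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        scaffold_vault(Path(tmp))
        print(sorted(p.relative_to(tmp).as_posix() for p in Path(tmp).rglob("*")))
```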

Notice the asymmetry of who writes what. You own raw/; the LLM owns wiki/; you co-author the schema. Provenance is a first-class concern — at scale you can't tell human-curated truth from LLM guess by looking, so every file carries provenance tags in YAML frontmatter.

In our library every article carries sources, authored_by, and date fields in its frontmatter. The production write-ups had to retrofit this primary-vs-derived tagging; we had it from day one.
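A provenance check over that frontmatter might look like the sketch below. It assumes the three field names mentioned above; the parser handles only a deliberately tiny YAML subset, and the example page, its source path, and the `authored_by` value are hypothetical. A real vault would more likely use a proper YAML parser.

```python
REQUIRED_FIELDS = ("sources", "authored_by", "date")


def parse_frontmatter(text: str) -> dict:
    """Parse a tiny YAML subset: `key: value` scalars and `key:` + `- item` lists."""
    lines = text.splitlines()
    if not lines or lines[0].strip() != "---":
        return {}
    meta = {}
    current_key = None
    for line in lines[1:]:
        stripped = line.strip()
        if stripped == "---":
            break  # end of the frontmatter block
        if stripped.startswith("- "):
            # List item: attach to the most recent key if that key opened a list.
            if isinstance(meta.get(current_key), list):
                meta[current_key].append(stripped[2:].strip())
            continue
        key, sep, value = stripped.partition(":")
        if sep:
            current_key = key.strip()
            # An empty value after the colon opens a list.
            meta[current_key] = value.strip() or []
    return meta


def missing_provenance(text: str) -> list:
    """Return the required provenance fields that are absent or empty."""
    meta = parse_frontmatter(text)
    return [field for field in REQUIRED_FIELDS if not meta.get(field)]


# Hypothetical page used only to exercise the check.
PAGE = """---
sources:
  - raw/example-article.md
authored_by: llm
date: 2026-01-15
---
# Compile once
"""

if __name__ == "__main__":
    print(parse_frontmatter(PAGE))
    print(missing_provenance("# a page with no frontmatter"))
```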

04 / 08

The flow · ingest → index → serve

Watch a source fan into the wiki.

Drop a source in and the LLM reads it, mints a summary, upserts every relevant entity and concept page, links them, and updates the catalog and trail. Press Ingest → and watch the document flow through the pipeline, live.

FIG.04 — LIVE INGEST PIPELINE · one source · many pages

① READ

not silent

The LLM reads the source and discusses the takeaways with you. Human-in-the-loop is by design — that conversation is the write gate, not friction.

② INTEGRATE ★

merge, don't duplicate

It writes a summary, then upserts each entity and concept page — merging into what exists and adding [[cross-links]]. One source, 10–15 pages.

③ FILE

the trail compounds

A one-line entry goes in index.md; a timestamped record goes in log.md. The map and the audit trail stay current automatically.
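The integrate and file steps can be sketched with the wiki held in memory as a dict of page title to markdown. Everything here is an assumption made for illustration: the `ingest` signature, the `summary/<source>` title scheme, and the log line format. The reading and summarising (the LLM's part, and the conversation with the human) is assumed to have already produced `summary` and `entities`.

```python
from datetime import datetime, timezone


def ingest(wiki: dict, log: list, source_name: str, summary: str, entities: dict) -> list:
    """File one source into the wiki and return the titles of the pages touched.

    `entities` maps a page title to the note this source contributes to it.
    """
    touched = []
    summary_title = f"summary/{source_name}"
    links = " ".join(f"[[{title}]]" for title in sorted(entities))
    wiki[summary_title] = f"{summary}\n\nSee also: {links}\n"
    touched.append(summary_title)

    for title, note in entities.items():
        # Upsert: merge into the existing page instead of creating a duplicate.
        existing = wiki.get(title, "")
        wiki[title] = f"{existing}- {note} (from [[{summary_title}]])\n"
        touched.append(title)

    # Rebuild the catalog: one line per page, excluding the index itself.
    wiki["index"] = "".join(f"- [[{t}]]\n" for t in sorted(wiki) if t != "index")
    # Append-only audit trail: entries are only ever added, never rewritten.
    stamp = datetime.now(timezone.utc).isoformat(timespec="seconds")
    log.append(f"{stamp} ingest {source_name} touched={len(touched)}")
    return touched


if __name__ == "__main__":
    wiki, log = {}, []
    ingest(wiki, log, "paper-a", "Paper A summary.", {"Caching": "wiki acts as L1"})
    ingest(wiki, log, "paper-b", "Paper B summary.", {"Caching": "stale until re-ingested"})
    print(wiki["Caching"])
    print(log)
```

Ingesting a second source that mentions an existing entity appends to that page rather than minting a new one, which is the "merge, don't duplicate" rule in miniature.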

05 / 08

The operations · ingest · query · lint

Everything the LLM does is one of three verbs.

Ingest fans a source into many pages. Query answers from the wiki — and files the answer back. Lint is the bookkeeping that keeps the store from compounding into contradiction. Run each verb and watch the trail.

FIG.05 — THE THREE VERBS · log.md trail

// the query verb is itself a WRITE path
on query(q):
  pages  = search(wiki, q)
  answer = synthesise(pages) + citations
  file_back(answer)   // explorations compound

The crucial twist: a good answer is filed back into the wiki. Your questions become knowledge. The query path is a write path — that is how exploration compounds.
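A runnable sketch of that write path, under stated assumptions: the wiki is an in-memory dict, `search` is a crude word-overlap ranker standing in for real retrieval, `synthesise` is whatever callable produces the answer (an LLM call in practice), and the `approve` hook and the `answers/<question>` title scheme are hypothetical additions for this example.

```python
import re


def search(wiki: dict, question: str, limit: int = 3) -> list:
    """Rank pages by how many distinct query words they contain."""
    terms = set(re.findall(r"[a-z0-9]+", question.lower()))
    scored = []
    for title, body in wiki.items():
        words = set(re.findall(r"[a-z0-9]+", body.lower()))
        overlap = len(terms & words)
        if overlap:
            scored.append((overlap, title))
    # Highest overlap first; ties broken alphabetically for determinism.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [title for _, title in scored[:limit]]


def query(wiki: dict, question: str, synthesise, approve=lambda answer: True) -> str:
    """Answer from the wiki, then file the cited answer back as a new page."""
    pages = search(wiki, question)
    answer = synthesise(question, [wiki[title] for title in pages])
    # Only file back answers that are grounded in pages and pass the gate.
    if pages and approve(answer):
        citations = " ".join(f"[[{title}]]" for title in pages)
        wiki[f"answers/{question}"] = f"{answer}\n\nSources: {citations}\n"
    return answer


if __name__ == "__main__":
    wiki = {"Caching": "The wiki is the L1 cache.", "Lint": "Lint finds orphan pages."}
    print(query(wiki, "what is the cache", lambda q, pages: f"{len(pages)} page(s) consulted"))
    print(sorted(wiki))
```

The `approve` hook is where a human or a stricter check can veto the write; because filed answers are themselves searchable on later queries, an unchecked wrong answer would be retrieved again.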

Lint is not a nicety. It hunts contradictions, stale claims newer sources have superseded, orphan pages with no inbound links, and dangling cross-references. It is the operation Karpathy's own early vault was missing — and the thing that keeps a compounding store from compounding into error.

06 / 08

Query · semantic ranking, live

Type a question. The pages reorder themselves.

A real (tiny) semantic search, running in your browser — no fakes. Each note is a bag-of-words vector; your query is too; the cards sort by cosine similarity. Type below and watch the most relevant pages float to the top, glowing.

FIG.06 — LIVE SEMANTIC SEARCH OVER WIKI PAGES

Example queries: avoid re-deriving · why wikis rot · human in the loop · find orphans · too big, needs rag

HOW THE RANKING WORKS

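Every page becomes a sparse term-frequency vector. Your query becomes one too. Similarity is the cosine of the angle between them, so shared words pull a page upward. This is the same cosine-similarity idea behind production vector search, shrunk to fit on screen; production systems typically replace the word counts with dense learned embeddings, which is what lets them match meaning rather than only shared words.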
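The ranking described here fits in a few lines of standard-library Python. This is a sketch of the technique, not the page's actual implementation: the tokeniser (lowercase alphanumeric runs) and the function names are assumptions.

```python
import math
import re
from collections import Counter


def vectorise(text: str) -> Counter:
    """Sparse term-frequency vector: word -> count."""
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a: Counter, b: Counter) -> float:
    """Cosine of the angle between two sparse vectors (0.0 if either is empty)."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank(pages: dict, query: str) -> list:
    """Return (similarity, title) pairs, most similar first."""
    q = vectorise(query)
    scored = [(cosine(q, vectorise(body)), title) for title, body in pages.items()]
    return sorted(scored, key=lambda pair: (-pair[0], pair[1]))


if __name__ == "__main__":
    pages = {
        "orphans": "lint finds orphan pages",
        "cache": "the wiki is a cache",
    }
    for score, title in rank(pages, "find orphan pages"):
        print(f"{score:.3f}  {title}")
```

Note that "find" does not match "finds" here: with raw word counts only exact tokens overlap, which is the limitation dense embeddings are meant to address.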

Past ~200 pages the single index.md stops scaling and you need real retrieval: BM25, vector search, and graph traversal, fused by reciprocal rank fusion. At that point you're running a hybrid system, and "no RAG needed" is no longer true.
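Reciprocal rank fusion itself is simple: each retriever contributes 1 / (k + rank) for every document it returns, and the sums decide the final order. The sketch below assumes each retriever hands back an ordered list of page titles; the constant k = 60 is the value commonly used in the literature, and the three input rankings are made up for illustration.

```python
def reciprocal_rank_fusion(rankings: list, k: int = 60) -> list:
    """Fuse several ranked lists into one.

    Each list contributes 1 / (k + rank) per document, with rank starting at 1.
    Documents missing from a list simply get no contribution from it.
    """
    scores = {}
    for ranking in rankings:
        for rank, doc in enumerate(ranking, start=1):
            scores[doc] = scores.get(doc, 0.0) + 1.0 / (k + rank)
    # Highest fused score first; ties broken alphabetically for determinism.
    return sorted(scores, key=lambda doc: (-scores[doc], doc))


if __name__ == "__main__":
    # Hypothetical outputs of three retrievers over the same wiki.
    bm25_hits = ["a", "b", "c"]
    vector_hits = ["b", "a", "d"]
    graph_hits = ["b", "c", "a"]
    print(reciprocal_rank_fusion([bm25_hits, vector_hits, graph_hits]))
```

Because only ranks are used, the retrievers' raw scores never need to be put on a common scale, which is the main reason this fusion is convenient for mixing lexical, vector, and graph results.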

07 / 08

Lint · the cross-link graph

Light up a page's links. Then hunt the orphans.

Cross-links turn a pile of pages into a graph the LLM can traverse. Click any node to light its neighbours. Then press Lint — and watch the bookkeeping flag every orphan with no inbound link.

FIG.07 — WIKILINK GRAPH · click a node · lint for orphans

CONTRADICTIONS

page A vs page C

Newer sources can quietly contradict older pages. Lint surfaces the clash so a human resolves it by explicit supersession, not silent decay.

ORPHANS ★

no inbound links

A page nothing points to is invisible to graph traversal — effectively lost. Orphan detection is exactly the lint pass Karpathy's first vault was missing.

STALE & DANGLING

broken [[links]]

A [[wikilink]] to a page that no longer exists is a dead end. Lint scans for them, plus stale claims and important concepts that lack their own page.
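The two structural checks, orphans and dangling links, can be sketched over an in-memory wiki. Several things here are assumptions: the link regex treats `|` and `#` as alias and heading separators (Obsidian-style), and links from catalog pages such as `index` are ignored when counting inbound links, since a catalog that lists every page would otherwise hide every orphan. Contradictions and stale claims need a reader's judgement and are out of scope for this sketch.

```python
import re

# Capture the target of [[Target]], [[Target|alias]] or [[Target#heading]].
WIKILINK = re.compile(r"\[\[([^\]|#]+)")


def lint(wiki: dict, catalog_pages=("index", "log")) -> dict:
    """Report orphan pages (no inbound links) and dangling links (missing targets)."""
    inbound = {title: set() for title in wiki}
    dangling = []
    for title, body in wiki.items():
        for raw_target in WIKILINK.findall(body):
            target = raw_target.strip()
            if target not in wiki:
                dangling.append((title, target))
            elif target != title and title not in catalog_pages:
                # Self-links and catalog links do not count as real inbound links.
                inbound[target].add(title)
    orphans = sorted(
        title for title, sources in inbound.items()
        if not sources and title not in catalog_pages
    )
    return {"orphans": orphans, "dangling": sorted(dangling)}


if __name__ == "__main__":
    wiki = {
        "index": "- [[A]]\n- [[B]]\n- [[C]]\n",
        "A": "links to [[B]] and [[Missing]]",
        "B": "links back to [[A|page A]]",
        "C": "nobody links here",
    }
    print(lint(wiki))
```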

08 / 08

Where it bites · be honest

Genuinely useful. Genuinely oversold in places.

This pattern is the substrate under our whole knowledge base — and it has hard edges worth naming before you build one.

The claim	The honest reality	The discipline
"Just write a CLAUDE.md"	The schema is most of the work; a bad one produces a confidently wrong wiki. The gist gives the shape, not tested prompts or a conflict algorithm.	Treat the schema as the product. Iterate it like code.
"It auto-decays old facts"	v2's confidence scores and forgetting curves are contested: numeric confidence is false precision, and letting a recorded error decay means the old bug gets repeated.	Explicit supersession + git history beats automatic decay.
"Just drop files in"	Event-driven auto-ingest drifts: one production deploy needed 14 MCP servers + a post-compact hook to survive live operations.	Filter at ingest. Manual before automated. Git as audit trail.
"No RAG needed"	True only under ~200 pages. Above that, index.md stops scaling and you need real retrieval.	Go hybrid: wiki as L1, RAG as L2. No shortcut around the context limit.
"Compounding is all upside"	A wrong fact filed back during a query propagates. The mechanism that makes the wiki smarter makes a polluted one confidently wrong.	Lint frequency is a real operational parameter, not a footnote.

THE ATLAS KB *IS* THIS PATTERN

Not a borrowed analogy. knowledge/research/ + ingested repos are our raw layer; the zoned markdown under knowledge/ is the wiki; ORG-CONVENTIONS.md + each zone's README.md + the auto-injected conventions are the schema.

The per-zone README.md files are Karpathy's index.md; the weekly audit-2026-Wxx.md files are his log.md. Cron re-ingests ~60 repos and emits weekly digests — log.md and a partial vault-wide lint, automated.

M12 + M13 ARE THE INGEST PIPELINE

M12 — this educational library — and M13 — the deep-research loop that fans out reader models, synthesises, and writes a cross-linked article — are the ingest pipeline made concrete. The source article for this very lesson was produced by it.

We keep writes gated. Agents author pages, NAO curates — exactly the "manual before automated, git as audit trail" discipline the practitioners recommend. The clean three-layer diagram quietly grows a lot of plumbing in reality. Keep the human in the loop.

03 · 08 — you reached the end of the path

You finished the whole path.

Compile, don't retrieve. Three layers: immutable raw, LLM-owned wiki, and the schema that governs it all. Three verbs: ingest, query, lint. The wiki alone only works while it is small, so go hybrid at scale; keep writes gated, provenance tagged, and git as the audit trail. The knowledge base you just learned to build is the one this lesson lives in.

★ the loop closes

From a single neuron to a self-maintaining wiki.

You started by building backprop by hand. You end by understanding the discipline that lets an LLM keep an entire knowledge base — including this one — alive. That's the full arc: build the engine, then teach it to remember.
