EarlyTerms

LLM wiki

Validating · Emerged · 73 days old · Last reviewed

LLM wiki is a pattern where an AI agent incrementally compiles a folder of plain-markdown entity pages from your raw sources, then answers questions against that structured wiki instead of re-retrieving the originals every time. Knowledge compounds in the filesystem rather than evaporating after each chat.

The term crystallized on April 4, 2026 when Andrej Karpathy posted a gist framing his personal research workflow — raw/ on one side, a wiki/ of cross-linked markdown on the other. It hit Hacker News at 296 points / 95 comments the same day, and within a week spawned a dozen Show HN clones and a domain rush across every major TLD.

💡

Karpathy's own wiki reportedly grew to ~100 articles and 400k words while still navigable by a coding agent. Open clones followed within days: lucasastorian/llmwiki (570 stars), nashsu/llm_wiki desktop app (2k stars), and sdyckjq-lab/llm-wiki-skill — a Claude Code skill that ports the pattern into a reusable agent harness.

Like a personal Wikipedia that writes and edits itself while you read.

Search Interest

peak ~2.1K/mo
updated 2026-06-14
~2.1K/mo ~1.0K/mo 0
2026-05-16 2026-05-31 2026-06-14
Term Lifecycle
  1. Nascent
    0–7 days
  2. Emergent
    8–30 days
  3. Validating ← now
    31–90 days
  4. Rising
    91–180 days
  5. Established
    180 days +

Why is it emerging now?

TL;DR

Karpathy's April 4 gist reframed RAG as a compiler problem — ingest once, cross-link forever — and agentic coding tools like Claude Code finally made the maintenance step cheap enough to automate. The HN thread crossed 95 comments in a day, and 12/13 TLDs on llmwiki.* were grabbed within two weeks.

5 forces driving coverage — scroll →

Outlook

6-month signal projection and commercial timeline.

Signal medium
Revenue moderate

Karpathy attribution plus a week-one clone wave; fate rides on one impl becoming canonical before RAG vendors absorb it.

Risk · If vendors rebrand it 'persistent RAG' or 'agent memory,' the distinct term fades back into generic knowledge-base marketing.

Analogs · vibe coding · nanoGPT · software 2.0

Monetization timeline
  1. now
    Clone wave, domains gone

    10+ Show HN impls in 9 days; 12/13 TLDs on llmwiki.* taken since April 4.

  2. 3-6mo
    Skills and Obsidian plugins

    Agent Skills and Obsidian plugins become the distribution surface over standalone apps.

  3. 6-12mo
    Folded into vendor memory

    Anthropic, Notion, NotebookLM ship native wikis; category name fragments.

Competition & Opportunity for term “LLM wiki”

Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.

Content Gap
10 queries tracked
Led by General (9), Showcase (1)
10 Suggest-only tails — long-tail opening
Revenue Potential
0% commercial-intent queries
2 monetization angles mapped
Mostly informational — pre-commercial
Build Difficulty
Medium
Stage: validating — incumbents warming up
12 / 13 default TLDs taken · oldest incumbent llmwiki.com (2023-04-12)
4 related terms already published
Heuristic · signals: tracked queries, term monetization cards, cluster neighbors

Ideas for term “LLM wiki”

Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.

Article
LLM Wiki vs RAG: When the Compiler Beats the Retriever

The core explainer Google is already surfacing ('bye bye RAG' is a live search tail). Compare ingestion-time structuring to query-time retrieval with a real 100-source corpus benchmark.

Article
How to Build Karpathy's LLM Wiki in Claude Code: A Concrete 30-Minute Setup

Tutorial query already rising in autocomplete ('llm wiki skill', 'llm wiki github'). Cover raw/ + wiki/ layout, a lint-pass prompt, and the Obsidian graph view.

Article
LLM Wiki at 400k Words: Does It Still Work When Your Corpus Gets Big?

Address the top HN critique (context degradation, O(n²) lint passes). Specific, testable, and ranks for 'llm wiki scaling' / 'llm wiki limitations' long-tail.

Article
Every Open-Source LLM Wiki Implementation, Compared

Round-up of llmwiki, llm-wiki-compiler, nashsu/llm_wiki, karpathy-llm-wiki and sage-wiki. Comparison SERP is empty on April 20.

Website
LLMWiki.directory — tracker for every implementation and agent skill

Compact brand taken across 12 TLDs but '.top' still available; a directory site could ride the ecosystem fragmentation that will keep happening through Q3.

Product
LLM-wiki-as-a-service for founders and PMs

Dropbox-style folder that auto-compiles any upload into a wiki. The pattern is simple; the UX of keeping wiki/ and raw/ in sync is what civilians can't do.

Product
Wiki lint pass service

The HN skeptics are right that contradictions and 'slop' compound over time. A scheduled lint agent that flags stale entries and conflicting claims is a genuine pain point.

Video
'I replaced my RAG pipeline with a markdown folder' — 15-min YouTube demo

Visual before/after: vector DB vs Karpathy-style wiki on the same 50 papers. Strong thumbnail, clear watch-time payoff, rides the Karpathy name.

Post Newsletter / LinkedIn
The Year Karpathy Killed RAG (Again)

A 400-line gist from April 4 has already inspired more open-source clones than most VC-funded AI startups launched this quarter.

Post HN / r/programming
I Ran Karpathy's LLM Wiki on 3,000 Papers. Here's Where It Broke.

Everyone's reposting the gist. Nobody is running it past 100 articles. I did, and the lint pass exploded.

Post YouTube / Tech media
Why Everyone in AI Bought llmwiki.com Last Week

Twelve of thirteen TLDs on llmwiki.* were registered in April 2026 — most within 72 hours of Karpathy's gist. What does the registrar data say about where this goes?

What People Search

Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.

Keyword
Competition
Content Type
llm wiki
Very Low
General
llm wiki karpathy
Very Low
General
llm wiki andrej
Very Low
General
llm wiki gist
Very Low
General
llm wiki github
Very Low
Showcase
llm wiki andrej karpathy
Very Low
General
llm wiki skill
Very Low
General
llm wikidata
Very Low
General
1–8 of 10
1 / 2
Updated 2026-06-14 · sources: Google Trends, Google Suggest · Competition is heuristic

SERP of term “LLM wiki”

What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.

FAQ

What is LLM wiki?

LLM wiki is a pattern where an AI agent incrementally compiles a folder of plain-markdown entity pages from your raw sources, then answers questions against that structured wiki instead of re-retrieving the originals every time.

Why is LLM wiki emerging now?

Karpathy's April 4 gist reframed RAG as a compiler problem — ingest once, cross-link forever — and agentic coding tools like Claude Code finally made the maintenance step cheap enough to automate. The HN thread crossed 95 comments in a day, and 12/13 TLDs on llmwiki.* were grabbed within two weeks.

When did LLM wiki emerge?

Publicly emerged around 2026-04-04 (about 73 days ago as of 2026-06-16). EarlyTerms first recorded a pipeline signal on 2026-04-21.

Related Terms

Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.

Explore next
Also mentioned
  • Also known as idea file
  • Competitor RAG·GraphRAG·NotebookLM·DeepWiki
  • Related Obsidian·knowledge graph·second brain

Sources

Primary URLs this report cites — open any to verify the claim yourself.

  1. 01 Karpathy — llm-wiki.md gist gist.github.com
  2. 02 Hacker News flagship thread (296 pts) news.ycombinator.com
  3. 03 Analytics Vidhya — LLM Wiki Revolution analyticsvidhya.com
  4. 04 Data Science Dojo — tutorial datasciencedojo.com
  5. 05 Global Advisors — Term briefing globaladvisors.biz
  6. 06 MindStudio — build with Claude Code mindstudio.ai
  7. 07 Level Up Coding — Beyond RAG levelup.gitconnected.com
  8. 08 lucasastorian/llmwiki (reference OSS impl) github.com