RLMs
RLMs (Recursive Language Models) are an inference strategy where an LLM treats its prompt as an object inside a Python REPL, then recursively calls sub-LLMs over chunks of it instead of stuffing everything into one forward pass. The root model sees only the query and decides how to decompose the context.
The term was coined by MIT's Alex L. Zhang in an October 2025 blog post, formalized in the December 2025 arXiv paper with Tim Kraska and Omar Khattab, and declared "the paradigm of 2026" by Prime Intellect on January 1. The tagline "RLMs are the new reasoning models" — framing the 2026 shift the way 2025 shifted from LLMs to reasoning models — drove the April surge.
An RLM run over a 10M-token codebase doesn't try to fit it in context. It spawns a Python REPL where the full prompt lives as a variable, then writes grep and partition calls, launches sub-LLMs over each chunk, and only returns the distilled answer. RLM(GPT-5-mini) beats vanilla GPT-5 by 33% on OOLONG at 132k tokens for the same API cost.
If a reasoning model is a student writing out longhand, an RLM is the same student who first opens a filing cabinet and indexes it.
Search Interest
-
Nascent0–7 days
-
Emergent8–30 days
-
Validating31–90 days
-
Rising91–180 days
-
Established ← now180 days +
Why is it emerging now?
On April 20, 2026 raw.works published "RLMs are the new reasoning models," compressing Alex Zhang's six-month-old MIT thesis into a shareable 2026 tagline. Combined with a 3.5k-star reference library and Drew Breunig's "context rot becomes a coding problem" framing, the term is in the window between technical credibility and SEO crowding.
Outlook
6-month signal projection and commercial timeline.
Named paradigm with flagship paper, 3.5k-star library, and an explicit 2026 tagline — exactly the shape terms take before they hit SEO crowding.
Risk · "RLMs" also reads as "Reasoning Language Models" — homograph collision could split SERP and dilute brand ownership.
Analogs · chain-of-thought · retrieval augmented generation · reasoning models
-
nowResearch phase, SERP open
No paid products yet; top results are paper, blog, GitHub. Explainer SEO wide open.
-
3-6moTooling and benchmarks land
Expect hosted RLM APIs, benchmark leaderboards, and code-review products to commercialize the pattern.
-
6-12moCategory merges or forks
Either absorbed into "agent" toolkits or splits into its own category with dedicated vendor pitches.
Competition & Opportunity for term “RLMs”
Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.
Ideas for term “RLMs”
Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.
Zero high-quality comparisons exist despite "RAG now obsolete" framing circulating. High-intent commercial query; frontier engineering teams are searching this now.
SERP top-10 is paper + blog + three aggregator rewrites. A concrete walkthrough with runnable code fills the explainer gap for teams evaluating whether to adopt.
The steelman-against-RLMs article nobody has written. Benchmarks the cost, latency, and quality tradeoff against raw long-context at 2026-04 prices.
Every RLM demo uses OpenAI; the Anthropic-flavored implementation is an open tutorial lane with first-mover advantage in the Claude ecosystem.
Input: prompt size, model, sub-call count. Output: projected API cost vs vanilla long-context call. Load-bearing for teams deciding whether recursion is worth the complexity.
Zhang's reference library ships trajectory logging but visualization is primitive. A Datadog-style RLM trace viewer is a sellable dev-tools SaaS.
First-person migration post. Gives readers a concrete before/after while the pattern is still fresh; high chance of HN front page.
For three years we treated long-context failure as capacity. Drew Breunig just reframed it — and the fix is a REPL, not a bigger model.
Alex Zhang said it on X in one line: "much like the switch in 2025 from language models to reasoning models, 2026 will be all about the switch to RLMs."
What People Search
Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.
SERP of term “RLMs”
What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.
FAQ
What is RLMs?
RLMs (Recursive Language Models) are an inference strategy where an LLM treats its prompt as an object inside a Python REPL, then recursively calls sub-LLMs over chunks of it instead of stuffing everything into one forward pass.
Why is RLMs emerging now?
On April 20, 2026 raw.works published "RLMs are the new reasoning models," compressing Alex Zhang's six-month-old MIT thesis into a shareable 2026 tagline. Combined with a 3.5k-star reference library and Drew Breunig's "context rot becomes a coding problem" framing, the term is in the window between technical credibility and SEO crowding.
When did RLMs emerge?
Publicly emerged around 2025-10-15 (about 244 days ago as of 2026-06-16). EarlyTerms first recorded a pipeline signal on 2026-04-21.
Related Terms
Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.
- Related context-rot Context rot is the measurable degradation in large-language-model output quality as input length grows, even when the prompt stays well… →
- Related context-engineering Context engineering is the discipline of curating every token that enters an LLM's context window — system prompt, tools, retrieved… →
- Related context-window A context window is the span of tokens an LLM reads and reasons over in a single forward pass. →
- Related agent-harness An agent harness is the middleware between a large language model and the real world — code that runs the agent loop, calls tools,… →
- Related agent-loop An agent loop is the control-flow pattern at the center of every autonomous LLM agent: the model observes its context, reasons about… →
- Related grpo GRPO (Group Relative Policy Optimization) is a reinforcement-learning algorithm that teaches language models to reason by sampling… →
- Related claude-opus-4-7 Claude Opus 4.7 is Anthropic's flagship LLM, released April 16, 2026. →
- Part of
- Competitor
- Related
Sources
Primary URLs this report cites — open any to verify the claim yourself.
- 01 Alex Zhang — Recursive Language Models (origin post) alexzhang13.github.io ↗
- 02 arXiv 2512.24601 — Recursive Language Models (Zhang, Kraska, Khattab) arxiv.org ↗
- 03 Prime Intellect — the paradigm of 2026 primeintellect.ai ↗
- 04 Drew Breunig — The Potential of RLMs dbreunig.com ↗
- 05 raw.works — RLMs are the new reasoning models raw.works ↗
- 06 alexzhang13/rlm — reference inference library github.com ↗
- 07 Hacker News — Recursive Language Models discussion news.ycombinator.com ↗