EarlyTerms

AutoResearch

Rising · Emerged · 101 days old · Last reviewed

AutoResearch is an agent loop in which an LLM autonomously edits a single training file, runs a fixed 5-minute experiment, checks whether a chosen metric improved, and keeps or reverts the change — repeating overnight. It is a minimal "modify → verify → keep/discard → repeat" harness, not a heavyweight framework.

The name crystallized when Andrej Karpathy open-sourced karpathy/autoresearch on March 7, 2026, pointing it at his own nanochat GPT-2 codebase. Two days, ~700 experiments, ~20 real improvements, and an 11% end-to-end speedup later, the repo had 74k+ stars and the term had been adopted as a generic category for autonomous experiment loops.

💡

Karpathy's reference setup uses three files with strict ownership: `prepare.py` is immutable and handles data plus the `val_bpb` evaluator, `train.py` is the agent's sandbox, and `program.md` is the human-written research brief. The agent proposes a change (e.g. a QK-norm scaler, a banded-attention tweak, an AdamW beta), trains for exactly 5 minutes, and git-reverts anything that doesn't lower validation loss.

Like leaving a junior researcher alone overnight with a stopwatch and one dial — they try things, and only the wins survive.

Search Interest

peak ~1.3K/mo
updated 2026-06-12
~1.3K/mo ~647/mo 0
2026-05-14 2026-05-29 2026-06-12
Term Lifecycle
  1. Nascent
    0–7 days
  2. Emergent
    8–30 days
  3. Validating
    31–90 days
  4. Rising ← now
    91–180 days
  5. Established
    180 days +

Why is it emerging now?

TL;DR

Karpathy open-sourced AutoResearch on March 7, 2026 — 630 lines of Python that let an AI agent run ~100 training experiments a night on a single GPU. Two days of autorun shaved 11% off his already-tuned nanochat GPT-2 pipeline, and Fortune ran the "loopy era" thesis ten days later.

6 forces driving coverage — scroll →

Outlook

6-month signal projection and commercial timeline.

Signal high
Revenue moderate

Karpathy-sized mindshare plus a minimal spec means copycats and verticalizations (kernels, SAT, RL) keep compounding the term for 6+ months.

Risk · Skeptics frame it as fancy hyperparameter search; if that critique sticks, the term could fragment into vendor-specific "agent labs" branding.

Analogs · nanoGPT · AutoML · agent loop

Monetization timeline
  1. now
    OSS traction, zero SaaS

    Karpathy's repo plus copycats; no paid product owns the category yet.

  2. 3-6mo
    Managed loops land

    Expect hosted AutoResearch-as-a-service on SkyPilot, Modal, Together, RunPod.

  3. 6-12mo
    Verticals fragment term

    AutoKernel, auto-SAT, auto-RL siphon searches into domain-specific brands.

Competition & Opportunity for term “AutoResearch”

Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.

Content Gap
10 queries tracked
Led by General (9), Showcase (1)
10 Suggest-only tails — long-tail opening
Revenue Potential
0% commercial-intent queries
2 monetization angles mapped
Mostly informational — pre-commercial
Build Difficulty
High
Stage: rising — red-ocean, crowded
11 / 13 default TLDs taken · oldest incumbent autoresearch.com (1997-09-30)
7 related terms already published
Heuristic · signals: tracked queries, term monetization cards, cluster neighbors

Ideas for term “AutoResearch”

Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.

Article
AutoResearch vs AutoML: What Karpathy's Loop Actually Adds Over Bayesian Sweeps

The top HN critique is "this is just hyperparameter search." A clear, code-level explainer of the three differences (arbitrary code edits, sequential memory, full automation) ranks for the skeptics' queries.

Article
How to Run AutoResearch on a Single RTX 4090: A Step-by-Step Walkthrough

Tutorials for running nanochat on consumer GPUs are thin, and "autoresearch karpathy" is already a live Google autocomplete. First clean walkthrough owns the tutorial query.

Article
AutoResearch for Non-ML: Applying Karpathy's Loop to Scrapers, Solvers, and Kernels

AutoKernel and agent-SAT already exist. A category piece that collects the non-ML ports and names the pattern wins cross-domain backlinks.

Website
Directory of AutoResearch ports, skills, and hosted loops

yibie/awesome-autoresearch is a GitHub list, not a browsable site. A search-friendly directory indexed on "autoresearch <domain>" queries has clear SEO headroom.

Product
Managed AutoResearch runner with spot-GPU orchestration

Upload `train.py` + `program.md`, pick a budget, get a morning report. SkyPilot already has a scaling post; a productized wrapper with cost caps is a one-weekend SaaS.

Product
AutoResearch dashboard / experiment diff viewer

100 experiments a night means log overload. A web UI that shows kept diffs, reverted diffs, and metric curves is the natural complement to the CLI loop.

Post
I Ran Karpathy's AutoResearch on My Own Codebase for a Week. Here's What It Found.

First-person experiment posts with real diffs and real regressions travel on HN and X. Few exist outside Karpathy's own nanochat run.

Video
"AutoResearch Overnight: I Fell Asleep, the Agent Shipped 20 Improvements" — 12-min YouTube build

Time-lapse of the agent editing `train.py`, watching val_bpb drop, narrating the wins. Visual-first demo is under-served; Karpathy's own launch tweet is text.

Post Newsletter / LinkedIn
The Loopy Era Started March 7

In 630 lines of Python, Karpathy fired the starting gun on autonomous ML research — and the frontier labs are already copying the spec.

Post HN / r/MachineLearning
Is AutoResearch Actually New, or Just AutoML Wearing an LLM Costume?

The best HN comment under the launch thread wasn't "wow" — it was "isn't this just Bayesian optimization with extra steps?"

Post X / Tech media
Every Week Someone Ports AutoResearch to a New Domain

AutoKernel for GPU Triton. agent-SAT for solvers. pi-autoresearch for Raspberry Pi. The pattern is eating every optimization problem with a numeric verifier.

What People Search

Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.

Keyword
Competition
Content Type
autoresearch
Very Low
General
autoresearch github
Very Low
Showcase
autoresearch karpathy
Very Low
General
autoresearchclaw
Very Low
General
autoresearch skill
Very Low
General
autoresearch andrej karpathy
Very Low
General
autoresearcher
Very Low
General
autoresearch claude
Very Low
General
1–8 of 10
1 / 2
Updated 2026-06-12 · sources: Google Trends, Google Suggest · Competition is heuristic

SERP of term “AutoResearch”

What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.

FAQ

What is AutoResearch?

AutoResearch is an agent loop in which an LLM autonomously edits a single training file, runs a fixed 5-minute experiment, checks whether a chosen metric improved, and keeps or reverts the change — repeating overnight.

Why is AutoResearch emerging now?

Karpathy open-sourced AutoResearch on March 7, 2026 — 630 lines of Python that let an AI agent run ~100 training experiments a night on a single GPU. Two days of autorun shaved 11% off his already-tuned nanochat GPT-2 pipeline, and Fortune ran the "loopy era" thesis ten days later.

When did AutoResearch emerge?

Publicly emerged around 2026-03-07 (about 101 days ago as of 2026-06-16). EarlyTerms first recorded a pipeline signal on 2026-04-20.

Related Terms

Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.

Explore next
Also mentioned
  • Includes AutoResearchClaw·AutoKernel
  • Competitor AutoML
  • Related nanochat

Sources

Primary URLs this report cites — open any to verify the claim yourself.

  1. 01 karpathy/autoresearch — canonical repo github.com
  2. 02 VentureBeat — Karpathy's open-source autoresearch venturebeat.com
  3. 03 Fortune — Why everyone is talking about Karpathy's autonomous AI research agent fortune.com
  4. 04 DataCamp — Guide to AutoResearch datacamp.com
  5. 05 Hacker News — launch thread (208 pts, Mar 7) news.ycombinator.com
  6. 06 Hacker News — "Autoresearch on an old research idea" (428 pts, Mar 23) news.ycombinator.com
  7. 07 SkyPilot blog — Scaling Karpathy's Autoresearch to a GPU cluster blog.skypilot.co
  8. 08 awesome-autoresearch — ecosystem list github.com