Long-running Agents

Established · Emerged 2025-11-26 · 202 days old · Last reviewed 2026-04-30

Long-running agents are AI agents designed to sustain work across multiple context windows, persisting state through structured artifacts — progress files, git commits, feature specs — so each new session resumes where the last ended. The pattern addresses a hard constraint: every context window is amnesia.

Anthropic engineer Justin Young formalized the concept on November 26, 2025 with a two-agent harness (initializer + coding agent) that enabled Claude to build production web apps across sessions. By February 2026, Cursor launched a research preview with documented 36–52 hour autonomous coding runs; Anthropic followed with a three-agent planner-generator-evaluator architecture in March 2026.

Cursor's long-running agent completed an all-new chat platform integration in 36 hours and a mobile app port in 30 hours, producing PRs with merge rates comparable to short-session agents. The runs used multi-agent plan-and-verify loops to prevent context drift on tasks spanning thousands of files.

Think of it as a relay race for AI: each runner picks up the baton from a structured handoff note, not from memory.

Search Interest

[Chart: monthly search volume, 2026-05-14 to 2026-06-12 · peak ~518/mo · updated 2026-06-12]

Term Lifecycle

Nascent · 0–7 days
Emergent · 8–30 days
Validating · 31–90 days
Rising · 91–180 days
Established (now) · 180 days +
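
The stage bands above amount to a threshold lookup on a term's age in days. The sketch below is one way to express that, not the site's actual implementation. The function name `lifecycle_stage` is hypothetical, and because the bands list both "91–180 days" and "180 days +", it assumes day 180 itself still counts as Rising.

```python
# Upper bound (inclusive) of each band, in days, taken from the list above.
# Assumption: day 180 belongs to "Rising"; "Established" starts at day 181.
STAGE_BANDS = [
    (7, "Nascent"),
    (30, "Emergent"),
    (90, "Validating"),
    (180, "Rising"),
]


def lifecycle_stage(age_days: int) -> str:
    """Return the lifecycle stage name for a term that is age_days old."""
    if age_days < 0:
        raise ValueError("age_days must be non-negative")
    for upper_bound, stage in STAGE_BANDS:
        if age_days <= upper_bound:
            return stage
    return "Established"


print(lifecycle_stage(202))  # the age reported in the header of this page
```

With the 202-day age reported in the header, the lookup falls past every band and lands on Established, which matches the marker in the list.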

Why is it emerging now?

TL;DR

Frontier models can now sustain 25–52 hour autonomous coding sessions with proper harnesses. Anthropic's November 2025 engineering post established the canonical two-agent pattern; Cursor's February 2026 research preview demonstrated it at production scale (151k-line PRs). Three forces converge: models capable enough to stay coherent, harness patterns proven at scale, and token costs low enough to run for days.

6 forces driving coverage

Anthropic Engineering

Effective harnesses for long-running agents

Two-agent initializer + coding loop; claude-progress.txt + feature JSON track state across sessions. Each new session begins with no memory of what came before.
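
A minimal sketch of the handoff idea described in this card, assuming a much simpler setup than Anthropic's actual harness. Only the file name claude-progress.txt comes from the summary above; the `features.json` name, its schema, and the functions `initialize` and `run_session` are hypothetical, and the real model call is replaced by a placeholder.

```python
import json
import tempfile
from pathlib import Path
from typing import List, Optional

PROGRESS_FILE = "claude-progress.txt"  # name taken from the card above
FEATURES_FILE = "features.json"        # hypothetical name for the feature JSON


def initialize(workdir: Path, features: List[str]) -> None:
    """Initializer step: write the artifacts that every later session reads."""
    spec = [{"name": name, "done": False} for name in features]
    (workdir / FEATURES_FILE).write_text(json.dumps(spec, indent=2))
    (workdir / PROGRESS_FILE).write_text("session 0: initialized feature list\n")


def run_session(workdir: Path, session_id: int) -> Optional[str]:
    """One coding session: rebuild state from disk, do one unit of work, record it."""
    spec = json.loads((workdir / FEATURES_FILE).read_text())
    pending = next((f for f in spec if not f["done"]), None)
    if pending is None:
        return None  # every feature is marked done
    # Placeholder for the real work (model call, code edits, tests, git commit).
    pending["done"] = True
    (workdir / FEATURES_FILE).write_text(json.dumps(spec, indent=2))
    with (workdir / PROGRESS_FILE).open("a") as log:
        log.write(f"session {session_id}: completed {pending['name']}\n")
    return pending["name"]


def demo() -> List[str]:
    """Run sessions until no work is left; each one starts with no in-memory state."""
    with tempfile.TemporaryDirectory() as tmp:
        workdir = Path(tmp)
        initialize(workdir, ["login", "search"])
        completed = []
        session_id = 1
        while (name := run_session(workdir, session_id)) is not None:
            completed.append(name)
            session_id += 1
        return completed


print(demo())
```

The point of the design is that `run_session` receives nothing but a directory: everything it knows about earlier work comes from the two files, which is what lets a fresh context window resume where the last one stopped.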

Nov 26, 2025

Hacker News

Effective harnesses for long-running agents

Nov 28, 2025 125 points · 37 comments

Cursor

Expanding our long-running agents research preview

Runs of 36h (chat platform), 30h (mobile app), 25h (auth refactor); one user reported a 52-hour task with 151k lines of code.

Feb 12, 2026

Anthropic Engineering

Harness design for long-running application development

Three-agent planner-generator-evaluator; 5–15 cycles per session, up to 4 hours. Separating the working agent from the judging agent is the key lever.
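
The planner-generator-evaluator split can be sketched as a bounded loop in which the evaluator, not the generator, decides when the work is accepted. This is an illustrative outline only: `run_harness` and the three stub agents are hypothetical stand-ins for model calls, and the only figure taken from the card above is the 15-cycle upper bound.

```python
from typing import Callable, List, Tuple

MAX_CYCLES = 15  # upper end of the 5–15 cycles per session cited above


def run_harness(
    goal: str,
    plan: Callable[[str], List[str]],
    generate: Callable[[List[str], str, str], str],
    evaluate: Callable[[str, str], Tuple[bool, str]],
    max_cycles: int = MAX_CYCLES,
) -> Tuple[str, int]:
    """Plan once, then alternate generate and evaluate until accepted or out of cycles."""
    steps = plan(goal)
    draft, feedback = "", ""
    for cycle in range(1, max_cycles + 1):
        draft = generate(steps, draft, feedback)
        accepted, feedback = evaluate(goal, draft)
        if accepted:
            return draft, cycle
    return draft, max_cycles


# Stub agents (hypothetical): real ones would each be a separate model call.
def stub_plan(goal: str) -> List[str]:
    return goal.split(" and ")


def stub_generate(steps: List[str], draft: str, feedback: str) -> str:
    done = draft.split(";") if draft else []
    if len(done) < len(steps):
        done.append(steps[len(done)])  # complete one more planned step per cycle
    return ";".join(done)


def stub_evaluate(goal: str, draft: str) -> Tuple[bool, str]:
    finished = draft.split(";")
    missing = [s for s in goal.split(" and ") if s not in finished]
    return (not missing, "missing: " + ", ".join(missing) if missing else "")


result, cycles = run_harness(
    "build api and add tests and write docs", stub_plan, stub_generate, stub_evaluate
)
print(result, cycles)
```

Keeping `evaluate` as a separate callable mirrors the card's claim that separating the working agent from the judging agent is the key lever: the generator never gets to grade its own output.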

Mar 24, 2026

Cursor

Scaling long-running autonomous coding

1M+ lines of code across 1,000 files in a week; Planner/Worker split reduced inter-agent conflicts; GPT-5.2 outperforms others at sustained autonomous work.

Jan 14, 2026

Addy Osmani

Long-running Agents

Synthesizes three subtypes — long-horizon reasoning, long-running execution, persistent agency — and maps the dominant Brain/Hands/Session pattern across major labs.

Apr 28, 2026

Outlook

6-month signal projection and commercial timeline.

Signal high

Revenue strong

Both Anthropic and Cursor shipping production-grade harnesses signals rapid platform standardization over the next 6 months.

Risk · Longer context windows (>2M tokens) could shrink the 'cross-session' problem before harness patterns solidify.

Analogs · serverless · containers · CI/CD pipelines

Monetization timeline

now

Harness tooling gap open

Harness frameworks, context-management SDKs, and observability tools serve teams deploying agents now.

3-6mo

Platform and SaaS layer

Hosted harness-as-a-service products and per-session pricing models emerge as the category matures.

6-12mo

Enterprise evaluation and audit

Compliance, cost-control, and audit tools become essential as enterprises run 24h+ autonomous agent sessions.

Competition & Opportunity for term “Long-running Agents”

Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.

Content Gap

10 queries tracked

Led by General (9), Showcase (1)

10 Suggest-only tails — long-tail opening

Revenue Potential

0% commercial-intent queries

2 monetization angles mapped

Mostly informational — pre-commercial

Build Difficulty

Very High

Stage: established — category is settled

1 / 10 default TLDs taken · oldest incumbent longrunningagents.com (2025-11-22)

10 related terms already published

Heuristic · signals: tracked queries, term monetization cards, cluster neighbors

Ideas for term “Long-running Agents”

Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.

Article

Long-running agents vs. standard agents: when to use each

High-intent comparison query with clear task-decomposition angle. Ranks for developers evaluating multi-session architecture.

Article

How to build a long-running agent harness with Claude Code

Tutorial targeting Anthropic's engineering blog pattern; strong long-tail on 'claude progress.txt' and 'feature spec JSON' sub-queries.

Article

Long-running agents alternatives: LangGraph, CrewAI, and custom harnesses compared

Comparison article capturing 'what framework should I use' intent from teams starting new agentic projects.

Product

A drop-in harness SDK for cross-session state management in long-running Claude agents

Targets the exact gap Anthropic's blog exposed: teams hand-rolling progress files, feature specs, and git-commit logic.

Product

Observability dashboard for multi-session agent runs: cost, drift, and task-completion tracking

Every 30h+ run costs hundreds of dollars; teams need cost visibility and early warning on context drift before sessions fail.

Newsletter

This Week in Long-Running Agents — weekly 5-item briefing on harness patterns, new framework releases, and production case studies

Genuine niche with no existing newsletter: Anthropic + Cursor publishing technical harness posts weekly, builders hungry for synthesis.

Video

I ran a 24-hour Claude agent on my codebase. Here's what actually happened.

First-person experiment format; shareable for its specific runtime claim and honest failure/recovery narrative.

Course

Production Long-Running Agents: a weekend workshop on harness design, state persistence, and multi-agent evaluation loops

Teachable skill: harness architecture isn't obvious from docs alone. $149–299 workshop on Maven or Gumroad targets builders already deploying agents.

Post HN / r/programming

Long-running agents are economically infeasible by design — and here's why that's actually fine

A 52-hour Claude coding session costs north of $500. Most teams shipping long-running agents in 2026 know this and deploy anyway.

Post Newsletter / LinkedIn

The Year Agent Harnesses Ate the Stack

In November 2025 Anthropic published a 1,200-word engineering post about a 'claude-progress.txt' file. By April 2026 that file pattern had spawned three GitHub stars, two SaaS products, and an InfoQ feature.

Post YouTube / Tech media

Every new context window is amnesia. We built a cure.

The core problem of long-running agents is brutal in its simplicity: the model forgets everything after every session. We ran 15 experiments to find what actually works.

What People Search

Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.

Keyword · Competition · Content Type

long-running agents · Low · General
long running agents anthropic · Very Low · General
long running agents cursor · Very Low · General
long running agents claude · Very Low · General
long running agents claude code · Very Low · General
long running agents harness · Very Low · General
long running agents github · Very Low · Showcase
long running agents ai · Very Low · General

Updated 2026-06-12 · sources: Google Trends, Google Suggest · Competition is heuristic

SERP of term “Long-running Agents”

What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.

FAQ

What are long-running agents?

Why are long-running agents emerging now?

When did long-running agents emerge?

Publicly emerged around 2025-11-26 (about 202 days ago as of 2026-06-16). EarlyTerms first recorded a pipeline signal on 2026-04-30.
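
The age figure in this answer can be checked with plain date arithmetic. The snippet below is a small sketch using Python's standard `datetime` module; the helper name `age_in_days` is hypothetical, and the two dates are the ones stated in the answer.

```python
from datetime import date


def age_in_days(emerged: date, as_of: date) -> int:
    """Whole days elapsed between the emergence date and the as-of date."""
    return (as_of - emerged).days


print(age_in_days(date(2025, 11, 26), date(2026, 6, 16)))  # 202
```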

Related Terms

Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.

Also mentioned

Also known as: multi-session agents
Related: harness engineering

Sources

Primary URLs this report cites — open any to verify the claim yourself.

Domain Availability

longrunningagents.com
longrunningagents.ai
longrunningagents.net
longrunningagents.io
longrunningagents.co
longrunningagents.app
longrunningagents.pro
longrunningagents.top
longrunningagents.org
longrunningagents.info
longrunningagents.xyz
longrunningagents.run
longrunningagents.me
long-running-agents.com
long-running-agents.ai
long-running-agents.net
long-running-agents.io
long-running-agents.co
long-running-agents.app
long-running-agents.pro
long-running-agents.top
long-running-agents.org
long-running-agents.info
long-running-agents.xyz
long-running-agents.run
long-running-agents.me

Checked via RDAP — live from your browser.
