SubQ
SubQ is the brand name of the first large language model built on a fully sub-quadratic architecture, where attention compute scales linearly rather than quadratically with context length. The model is developed by Miami-based startup Subquadratic using a proprietary mechanism called Subquadratic Sparse Attention (SSA).
Subquadratic launched SubQ on May 5, 2026 alongside a $29M seed round. At 12 million tokens, the company claims SSA reduces attention compute by roughly 1,000x versus dense-attention frontier models, with a 95% score on the RULER 128K benchmark at a cost of $8 — compared to $2,600 for Claude Opus 4.6 at the same benchmark.
Think of it as sparse indexing for databases — skip scanning every row, jump only to the relevant ones.
Search Interest
-
Nascent0–7 days
-
Emergent8–30 days
-
Validating ← now31–90 days
-
Rising91–180 days
-
Established180 days +
Why is it emerging now?
On May 5, 2026, Subquadratic launched SubQ with $29M seed funding and a single provocative claim: the first frontier LLM that ditches quadratic attention entirely. At 12 million tokens, SSA architecture cuts compute 1,000x vs. Claude Opus — but researchers are immediately demanding independent proof.
Outlook
6-month signal projection and commercial timeline.
Architecture credibility hinges on an independent technical report; if verified, sub-quadratic attention becomes a category, not just a brand.
Risk · Benchmark cherry-picking allegations and closed weights may suppress adoption until third-party replication.
Analogs · Mamba · RWKV · FlashAttention
-
nowAPI early access open
Developer API available; enterprise deals for 12M-token codebase ingestion.
-
3-6moIndependent benchmarks land
Third-party evals determine adoption pace; SubQ Code CLI competes with Cursor and Copilot.
-
6-12moArchitecture licensed or forked
If SSA replicates, hyperscalers integrate or acquire; open-source competitors emerge.
Competition & Opportunity for term “SubQ”
Three heuristic signals derived from the tracked queries, the term's monetization cards, and its cluster neighbors. Directional, not audited.
Ideas for term “SubQ”
Buildable pitches — turn this term into an article, site, product, post, newsletter, video, or course. Steal any card and run with it.
Head-to-head at 128K and 1M tokens — the benchmark cherry-picking controversy is itself traffic. Comparison articles on new models rank fast when published day-one.
Zero educational content ranks for 'subquadratic sparse attention' yet. First explainer owns the query. Target: engineers curious about transformer alternatives.
Category explainer that positions SubQ inside the broader linear-attention landscape — evergreen as the space matures.
At 1M+ tokens the cost delta is dramatic; a simple input-token-count → cost comparison tool captures transactional intent with affiliate or API upsell.
SubQ Code targets teams loading entire repos. A wrapper that packages local codebases and sends them to SubQ API has obvious B2B SaaS surface.
SubQ's headline use case is codebase-at-once ingestion. A recorded walkthrough with a real repo is visual and shareable; publish within the first week while search competition is zero.
If SubQ's claims hold, sub-quadratic architectures become the next transformer moment. A technical briefing anchored to this space has 6+ months of tailwind.
The benchmarks look good. The weights are closed, the technical report hasn't dropped, and Will Depue says the scaling numbers don't line up.
The claim is 325x cheaper than Claude Opus at equivalent long-context accuracy. That would be the biggest cost reduction in model history.
Three times in the past two years, a startup has announced 1,000x efficiency gains over transformers. Two of them shipped nothing.
What People Search
Long-tail queries from Google Suggest + Trends. Volume and competition are heuristics — directional, not audited. Content Type comes from query shape.
SERP of term “SubQ”
What searchers see today — organic results on top, paid ads if anyone's bidding. Ad density is a real-time commercial signal.
FAQ
What is SubQ?
SubQ is the brand name of the first large language model built on a fully sub-quadratic architecture, where attention compute scales linearly rather than quadratically with context length.
Why is SubQ emerging now?
On May 5, 2026, Subquadratic launched SubQ with $29M seed funding and a single provocative claim: the first frontier LLM that ditches quadratic attention entirely. At 12 million tokens, SSA architecture cuts compute 1,000x vs. Claude Opus — but researchers are immediately demanding independent proof.
When did SubQ emerge?
Publicly emerged around 2026-05-05 (about 42 days ago as of 2026-06-16). EarlyTerms first recorded a pipeline signal on 2026-05-06.
Related Terms
Other terms in the same space — aliases, subtypes, competitors, and neighbors to explore next.
- Part of context-window A context window is the span of tokens an LLM reads and reasons over in a single forward pass. →
- Competitor claude-opus-4-6 Claude Opus 4.6 is Anthropic's flagship LLM released February 5, 2026. →
- Competitor gpt-5-4 GPT-5.4 is OpenAI's March 2026 frontier language model that unifies the Codex and GPT product lines into a single system, adding native… →
- Competitor gemini-3-1-pro Gemini 3.1 Pro is Google DeepMind's flagship reasoning model, released February 19, 2026. →
- Related context-engineering Context engineering is the discipline of curating every token that enters an LLM's context window — system prompt, tools, retrieved… →
- Related long-running-agents Long-running agents are AI agents designed to sustain work across multiple context windows, persisting state through structured… →
- Related managed-agents Managed Agents is an infrastructure paradigm where cloud platforms host, orchestrate, and operate AI agents as a service. →
- Includes ·
- Related ··
Sources
Primary URLs this report cites — open any to verify the claim yourself.
- 01 Subquadratic — Introducing SubQ (official launch post) subq.ai ↗
- 02 Subquadratic — How SSA Makes Long Context Practical subq.ai ↗
- 03 SiliconANGLE — Subquadratic launches with $29M to bring 12M-token context windows to AI siliconangle.com ↗
- 04 VentureBeat — Miami startup claims 1,000x AI efficiency gain; researchers demand independent proof venturebeat.com ↗
- 05 Hacker News — SubQ: Sub-quadratic LLM built for 12M-token context (discussion) news.ycombinator.com ↗
- 06 The Neuron Daily — SubQ ships 12M tokens at 1/5 the cost theneurondaily.com ↗