Fri, Jan 9, 2026

Anthropic Claude Code clamp re-routes harnesses – OpenCode 1M MAUs, $200 tier


Executive Summary

Anthropic is formalizing boundaries around Claude Code by blocking third-party harnesses that spoof its desktop client and route heavy agent traffic through flat-rate Pro/Max plans. Bans tied to this pattern have been reversed, but OAuth consent copy and business terms now explicitly prohibit such flows and restrict exposing Claude as a first-class model inside products serving rival labs, a restriction that has already cut xAI's access via Cursor. High-token harnesses like Ultrawork and Sisyphus say their "more tokens" workloads were hit; RepoPrompt stresses it uses Anthropic's headless client instead. OpenCode is pivoting to "Sign in with ChatGPT" and Codex-backed limits, selling out a $200/month Black tier while targeting 1M MAUs; OpenAI leans into the Codex app server and docs MCP to court third-party coding IDEs via OAuth and MCP rather than consumer-plan spoofing.

Infra, safety, and evals: Epoch pegs the Anthropic–AWS Indiana campus at ~750 MW today on a path past 1 GW with >500k Trainium2 chips; Anthropic’s Constitutional Classifiers++ report ~1% overhead, an 87% drop in benign refusals and no universal jailbreak in 1,700 red‑team hours, while its new agent‑eval playbook and Letta’s .af kit push toward config‑driven evals; Datadog’s Codex case study says system‑level reviews would have flagged ~22% of sampled incidents.
Speech and stack: ElevenLabs’ Scribe v2 targets ~5% English WER, up to 10‑hour files, 100 keyterms and 56 entity types with diarization for 48 speakers, plus a low‑latency Realtime variant for agents.
Capital and competition: MiniMax’s Hong Kong IPO raised ~$620M at >HK$100B value; DeepSeek V4 is tipped for mid‑February with internal claims of coding wins over Claude and GPT, but no independent public benchmarks yet.

Harness governance, infra build‑out and capital flows now interact directly: access rules are reshaping which labs power coding agents, while Chinese players like MiniMax and DeepSeek signal more competition on both model quality and pricing.

Feature Spotlight

Feature: Harness wars reshape coding agent access

Anthropic blocks third‑party spoofing of Claude Code access, lifts impacted bans, and clarifies ToS. OpenCode pivots to Codex/ChatGPT auth; access and lock‑in strategies will shape where teams build agentic coding workflows next.

🧩 Feature: Harness wars reshape coding agent access

Cross‑account story: Anthropic blocked third‑party tools from spoofing Claude Code subscriptions; bans linked to abnormal harness traffic were lifted and ToS clarified. OpenCode pivots to Codex/ChatGPT auth; ecosystem debates lock‑in vs openness. Mostly agent access/go‑to‑market news today; excludes product changelogs, which are covered separately.

Anthropic blocks spoofed Claude Code harnesses, lifts related bans and clarifies ToS

Claude Code access (Anthropic): Anthropic has tightened server‑side checks to block third‑party tools that spoof the Claude Code desktop harness while funneling traffic through individual Claude Pro/Max subscriptions, after some users were auto‑banned by abuse filters for "unusual" agent traffic traced to these setups, as explained in the engineering update by Thariq Shihab in the abuse safeguards and follow‑ups in the traffic explanation. Anthropic now says these third‑party harnesses are a clear Terms of Service violation, has lifted all bans it can attribute to this specific pattern, and plans to make the restriction explicit in the OAuth consent screen per the comments in the oauth clarification.

Reason given: missing telemetry: The team stresses that spoofed harnesses lack the internal telemetry Claude Code sends (tool usage, error contexts, rate‑limit metadata), which makes it hard to debug user complaints and distinguish legitimate heavy agent loops from abuse patterns, according to the detailed summary in the policy recap.
Official path: API only: Anthropic reiterates that the supported way to embed Claude in external tools is via the API rather than hijacking the consumer desktop app, and explicitly invites maintainers of third‑party harnesses to discuss proper integration paths in DMs, as stated in the api guidance.

The move formalizes a boundary Anthropic had previously left implicit: flat‑rate chat subscriptions are priced for human, in‑client use, while high‑volume or automated agent workloads are expected to run over metered API access where the company gets both telemetry and commercial upside.

Anthropic business terms reportedly bar reselling Claude to rival AI labs

Claude API resale (Anthropic): Separate from the consumer harness crackdown, Anthropic’s business terms are being reported as prohibiting enterprise customers from directly offering Claude models to companies that compete with Anthropic, which has already led to xAI staff losing access to Claude 4.5 inside Cursor, according to coverage of the internal usage cutoff in the xai cursor report and reaction threads by ecosystem builders in the cursor complaint. Developers summarizing the updated clauses say that Business Customers cannot expose Claude as a first‑class model inside products that serve Anthropic’s direct model competitors, a framing discussed in more detail in the policy summary and the competition‑policy thread in the competition concern.

Precedent: OpenAI block in August: Commentators note this follows an earlier move in August where Claude Code access was reportedly shut off for OpenAI employees using it to build internal tools, reinforcing a pattern of Anthropic limiting access when rival labs use Claude to accelerate their own R&D, as recapped in the competition concern.
Lock‑in vs competition debate: Some engineers characterize the stance as an "Apple‑like" full‑stack strategy where Anthropic wants teams on its entire tooling stack, not just the raw model, while others argue it reduces model portability as commoditization drives more aggressive harness‑level differentiation; these trade‑offs are debated in posts like the apple analogy.

The net effect is that Claude’s strongest coding models remain broadly accessible to typical SaaS and internal‑tool builders over API, but become harder to route through general‑purpose harnesses or IDEs used inside competing frontier labs.

OpenCode pivots to ChatGPT and Codex auth as Claude access tightens

OpenCode auth (Anomaly/OpenCode): In direct response to Anthropic’s clamp‑down, OpenCode has shipped v1.1.11 with first‑class support for logging in via personal ChatGPT Plus/Pro subscriptions and is working with OpenAI so Codex plan limits and billing carry through inside the editor, effectively replacing the now‑fragile Claude‑subscription path with an official OpenAI‑backed route, as shown in the chatgpt auth ui and confirmed in the partnership statements in the codex collab. OpenCode’s maintainer frames this as part of an explicit strategy to avoid vendor lock‑in and support multiple providers via pluggable auth modules, a point echoed in later posts about making OpenCode work "as best as possible" with GPT‑5.x in the gpt5 planning and integration teaser.

ChatGPT inside OpenCode: Users on ChatGPT Plus/Pro can now select "ChatGPT Pro/Plus" as an auth method from OpenCode’s in‑app modal, which then proxies their existing subscription into the coding harness without exposing raw API keys, according to the screenshot in the chatgpt auth ui.
Codex subscription mapping: OpenCode and OpenAI staff say they are wiring Codex subscriptions and usage limits so Codex customers can use their paid capacity directly inside OpenCode, mirroring what Claude Code offered but on top of OpenAI’s infrastructure, as described in the codex in opencode and the follow‑up note in the gpt5 planning.

This effectively shifts one of the most popular open coding harnesses away from relying on consumer Claude plans toward a mix of OpenAI ChatGPT login and Codex commercial plans, while keeping the underlying harness open‑source and provider‑agnostic.

OpenAI leans into "Sign in with ChatGPT" for Codex and MCP docs

Codex app server (OpenAI): OpenAI is highlighting its open‑source Codex app‑server as the sanctioned way to embed its coding agent stack into third‑party apps, emphasizing a "Sign in with ChatGPT" flow and a new MCP server that exposes API, Codex, Apps SDK and Agentic Commerce Protocol docs as tools, which together offer a stark contrast to Anthropic’s ban on harness spoofing, as described in the codex oss thread, the reminder from DX leadership in the ecosystem note, and the new MCP announcement in the mcp add. The app‑server lets developers wrap Codex (and soon GPT‑5.2‑based variants) behind OAuth without managing raw API keys, while the MCP server gives agents structured access to OpenAI developer documentation.

Open‑source harness positioning: OpenAI execs repeatedly stress that Codex is fully open‑source and meant to power a "vibrant ecosystem" of external coding tools rather than a single first‑party IDE, reinforcing that customers are welcome to build their own harnesses as long as they authenticate through standard channels, as framed in the codex oss thread and ecosystem note.
MCP server for docs: A dedicated MCP endpoint named openaiDeveloperDocs now serves endpoints for API, Codex, Apps SDK and the Agentic Commerce Protocol, making those docs discoverable to MCP‑aware agents with a one‑line mcp add command, according to the screenshot and instructions in the mcp add.

For harness builders deciding where to route work after Anthropic’s restrictions, these moves signal that OpenAI is explicitly courting third‑party coding agents and IDEs via open infrastructure and OAuth‑based login rather than consumer‑plan spoofing.

Heavy-token harness makers regroup after Claude usage flagged as "abnormal"

Ultrawork & Sisyphus (Sisyphus Labs): Sisyphus Labs, which built the high‑token "Ultrawork" and Sisyphus harnesses around Claude Code, says its usage patterns were likely among those Anthropic flagged as "abnormal" and is now pivoting to new stacks after external Claude subscription access was cut off, as described in their long reaction thread in the sisyphus reaction. The team frames their philosophy as "More Tokens = More Intelligence" and notes that their products were explicitly optimized for very high token burn, which likely collided with Anthropic’s consumer pricing and abuse‑detection thresholds.

Plugin retirements and new bets: In parallel, popular plugins that wired OpenCode to ChatGPT/Codex subscriptions, such as opcode-openai-codex-auth with over 18,000 downloads in the last month, are being retired now that OpenCode has native ChatGPT support, freeing maintainers to work on new harness ideas, according to the maintainer’s note in the plugin retire.
Community split on data flywheels: Some builders, like Jeffrey Emanuel, publicly defend Anthropic’s decision as a rational data‑flywheel trade, arguing that the 95% token "subsidy" in Claude Code is really a data acquisition cost that only makes sense when traffic flows through Anthropic’s own harness, as laid out in the data flywheel analysis; others emphasize the lost space for grassroots experimentation and are steering users toward upcoming Sisyphus Labs agents via a waitlist in the sisyphus reaction.

The episode highlights a fault line between harnesses that ride consumer plans to drive huge agent workloads and those that align with provider‑supported APIs and economics; many of the former now need to re‑architect around more sustainable model partnerships.

OpenCode Black high-usage tier sells out repeatedly amid Anthropic friction

OpenCode Black (Anomaly/OpenCode): OpenCode’s paid "Black" tier—advertised at $200/month for "use any model" with generous limits—has sold out multiple times within minutes of each new batch going live, even as the project’s maintainer reports OpenCode is on track to hit 1M monthly active users this month, according to the offer posts in the black tier launch, the sellout notice in the sellout comment, and the MAU note and competition worries in the mau concern. New subscribers are being manually activated via email, and subsequent batches continue to clear quickly as announced in the sold out again and black waitlist.

Capacity and token pressure: The same threads mention that heavy Anthropic Opus usage in related projects like Antigravity was likely costing providers on the order of seven figures per day and triggering rate‑limit pressure, which is part of the backdrop for OpenCode’s emphasis on "any model" and alternative providers, as implied in the opus token comment and antigravity costs.
Provider diversification: Alongside Anthropic friction, OpenCode is promoting support for other vendors like MiniMax’s M2.1 and GLM‑4.7 via separate coding plans, framing Black as a way to ride whichever frontier models developers prefer without being tied to one lab, as seen in the black tier launch and related coding‑plan threads.

The pattern suggests strong willingness among power users to pay for a high‑usage, model‑agnostic harness layer even as underlying model access and economics remain volatile.


🏭 Anthropic–AWS Indiana AI campus scale and power math

Fresh satellite analysis on Anthropic’s New Carlisle, IN campus: 18 buildings, 750 MW on path to >1 GW, 500k Trainium2 chips online; unusual reliance on air cooling. Continues the infra economics storyline with new specifics.

Epoch AI pegs Anthropic–AWS Indiana campus at 750MW on path to 1GW+

Anthropic–AWS Indiana campus (Epoch AI): Epoch AI's latest satellite and permitting analysis estimates Anthropic's New Carlisle, Indiana AI campus at roughly 750 MW today, on track to exceed 1 GW once all 18 buildings are energized, making it a contender for the largest data center in the world, per the scale estimate. The campus is being built by AWS as part of Project Rainier and is unusual in how much power it pushes while still relying mainly on air cooling rather than massive external chiller farms, per the same scale estimate.

Layout and generators: Analysts identify 18 buildings on site, 16 of which are expected online by early 2026, based on recent satellite passes in the building count and the dedicated Anthropic–Amazon New Carlisle view in the satellite explorer; air-quality permits show most buildings provisioned with 26 diesel backup generators each, capable of around 68 MW per building, giving the campus heavy emergency power headroom, per the permit detail.
Trainium2 deployment and power draw: AWS CEO Matt Garman has said more than 500,000 Trainium2 chips are already running in Indiana, which at roughly 500 W per chip plus overhead implies >500 MW of live AI compute in this one location (see the back-of-envelope sketch after this list), and about one-fifth of all Trainium2s sold to date, according to Epoch's chip sales hub and the follow-up breakdown in the trainium count.
Cooling and chip choice: The site shows very little external cooling gear—few cooling towers or big air-cooled chillers—with AWS reportedly relying mostly on simple air cooling tuned for its less power-dense Trainium2 servers, a design Epoch says is unlikely to work for future Blackwell-class NVIDIA racks at comparable density, per the cooling assessment; that split underlines how chip choice and rack power density feed back into campus-level mechanical design.
Schedule and monitoring cadence: Epoch has nudged its internal estimate for the campus crossing 1 GW of capacity from January to March 2026 after December imagery showed the last required buildings not quite ready, per the timeline update; they note this kind of date shift as evidence that regular satellite monitoring can track AI power build-out in near real time rather than relying only on vendor statements.
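
To make the chip power claim concrete, here is a rough back-of-envelope check in Python using the figures cited above; the ~2x all-in overhead factor for host servers, networking and cooling is an assumption for illustration, not a number from Epoch.

```python
# Back-of-envelope check on the Trainium2 power figure cited above.
# The 2x all-in overhead factor (host servers, networking, cooling) is an assumption.
chips = 500_000            # Trainium2 chips reported running in Indiana
chip_watts = 500           # rough per-chip draw cited above
overhead_factor = 2.0      # assumed multiplier from chip power to all-in facility power

chip_power_mw = chips * chip_watts / 1e6            # 250 MW for the accelerators alone
all_in_power_mw = chip_power_mw * overhead_factor   # ~500 MW all-in, consistent with ">500 MW"

print(f"chips only: {chip_power_mw:.0f} MW, all-in: ~{all_in_power_mw:.0f} MW")
```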

The analysis ties together public CEO comments, permits, and imagery into a concrete power budget for Anthropic’s primary training campus, giving infra planners and competitors a rare quantitative view of how fast one frontier lab’s dedicated AI capacity is ramping in partnership with AWS.


🛠️ Claude Code 2.1.3 and coding agent tooling

Product updates and hands‑on tooling for agentic coding. New Claude Code CLI changes, RepoPrompt unaffected by ToS clamp, Amp/Codex ecosystem signals. Excludes the access/ToS crackdown (covered in Feature).

Claude Code 2.1.3 tightens Bash/git behavior and ships a long CLI bugfix wave

Claude Code 2.1.3 (Anthropic): Anthropic has pushed a focused 2.1.3 update to Claude Code that changes how Bash and git tools are described and invoked, while also landing a sizable batch of CLI and VS Code stability fixes, extending the 2.1.0–2.1.2 line of agent-harness work from the 2.1.0 rollout, as detailed in the 2.1.3 notes in the cli changelog and the flag diff in the release recap.

Prompt/schema changes: AskUserQuestion can now include an internal metadata.source field (e.g. "remember") for analytics without surfacing it to users, Bash tool calls must use short, factual description strings that avoid editorial language like "complex" or "risky," and git workflows are explicitly told to never run git status -uall when preparing commits or PRs due to performance and memory concerns on large repos, per the release recap and git status rule.
Bash sed simulation hook: The Bash tool schema now exposes an internal _simulatedSedEdit payload (filePath + newContent) so external tooling can precompute sed-style edits and hand Claude a structured preview rather than free-form text, making command-driven edits easier to integrate with other systems, per the schema update (a hypothetical payload sketch follows this list).
CLI/runtime fixes: The 2.1.3 CLI changelog lists the merging of "slash commands and skills" into a single mental model, a /config release-channel toggle, better detection of unreachable permission rules, plan files correctly resetting on /clear, and bug fixes around background task counts, model selection in sub-agents, web search routing, trust dialogs, terminal rendering corruption, and slash suggestion readability; tool hooks now have a 10-minute timeout instead of 60 seconds, per the cli changelog and changelog diff.
VS Code integration polish: On the editor side, VS Code users get a clickable destination selector for permission requests so they can decide whether settings apply to the current project, all projects, a shared team scope, or the current session, tightening how agent permissions persist across workspaces, per the cli changelog.
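
As a rough illustration of the sed-simulation hook described above, the sketch below shows what a Bash tool call carrying the _simulatedSedEdit payload might look like; only the filePath and newContent field names come from the changelog summary, and the surrounding structure is an assumption rather than Anthropic's actual schema.

```python
# Hypothetical shape of a Bash tool call carrying the _simulatedSedEdit payload.
# Only filePath and newContent are named in the changelog summary above; everything
# else here is an illustrative assumption, not Anthropic's actual tool schema.
bash_tool_call = {
    "name": "Bash",
    "input": {
        "command": "sed -i 's/retries=3/retries=5/' config/app.toml",
        "description": "Update retry count in config",  # short, factual description per 2.1.3 rules
        "_simulatedSedEdit": {
            "filePath": "config/app.toml",
            "newContent": "retries=5\ntimeout_s=30\n",  # precomputed post-edit file contents
        },
    },
}
print(bash_tool_call["input"]["_simulatedSedEdit"]["filePath"])
```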

The net effect is that Claude Code’s harness is being shaped into a more opinionated, safer agent runtime, with stricter tool semantics and fewer surprises in long‑running coding sessions.

Amp Free exposes Opus 4.5 coding via ad-supported tier with $300/month in credits

Amp Free (Sourcegraph): Sourcegraph’s Amp coding agent is widening access by making an ad‑supported "Amp Free" tier generally available with $300 per month in usage credits backed by Anthropic’s Opus 4.5, which several developers are now calling out as a default way to try high‑end agentic coding without a direct model bill amp free details.

Free but metered: Amp Free is described as providing $10/day (roughly $300/month) of credits in exchange for an ad‑supported experience, letting users run Opus‑powered coding sessions without paying a subscription up front amp free details and amp announcement.
Grant‑style expansion: A follow‑up from the Amp team mentions a new daily grant mechanic to make the free pool “a LOT more generous,” suggesting the free tier is being tuned actively rather than fixed as a one‑off promo amp announcement and grant clarification.

For engineers evaluating agentic coding stacks, this changes the economics of experimenting with an Opus‑backed harness: the main constraint becomes ad tolerance and daily credit caps rather than API billing complexity.

Cline 3.48.0 adds Claude Skills compatibility and direct web search tooling

Cline 3.48.0 (Cline): The Cline team has shipped version 3.48.0 of its VS Code agent, adding compatibility with Anthropic‑style Skills and a native web search tool so agents can pull context without relying on screenshots or manual browser steps release thread.

Skills as modular expertise: Cline can now load Skills—packaged instructions and resources—for domain expertise that only activate when relevant, mirroring Anthropic’s Skills concept so the same skill definitions can be reused across Claude Code and third‑party harnesses release thread.
Websearch tool: A new websearch capability lets Cline query the web directly and feed results into its reasoning loop, which the team positions as faster and more robust than driving a headless browser or relying on screenshot‑based scraping in coding sessions release thread and release notes.

This release pulls Cline closer to the emerging Skills ecosystem around Claude‑style agents, while giving its harness a first‑class way to fetch live web context for refactors, docs lookups, and dependency research.

RepoPrompt confirms Claude integration uses official headless client, not spoofed Claude Code

RepoPrompt (Independent): RepoPrompt’s maintainer clarified that the tool uses Anthropic’s officially supported headless Claude client plus MCP tools rather than spoofing the Claude Code desktop harness, which means it is unaffected by Anthropic’s recent clamp‑down on third‑party harnesses that impersonate the Code client architecture note.

Headless, not hijacked: RepoPrompt routes user OAuth through the sanctioned headless client and layers additional MCP tools on top, instead of replaying Claude Code’s internal auth tokens or traffic patterns architecture note.
ToS alignment signal: This architecture is positioned explicitly as compliant with Anthropic’s Terms of Service, in contrast to tools that logged in with a user’s Pro/Max subscription and then sent spoofed headers to make traffic look like Claude Code—those are the flows Anthropic’s safeguards now block architecture note.

For engineers using RepoPrompt as a repo‑aware coding agent, the message is that existing Claude integrations should continue to function without changes despite the new enforcement on impersonated harness traffic.

repo_updater CLI targets agent-first Git workflows across many repos

repo_updater (Flywheel): Independent builder Jeffrey Emanuel introduced repo_updater (“ru”), a bash‑based CLI that keeps dozens of public and private GitHub repos in sync across multiple machines, designed from day one to be operated by coding agents like Claude Code and Codex rather than only by humans repo updater intro.

Cross‑machine sync: The tool lets a user define lists of public and private repos, then issues parallelized git pull/git push flows across a shared projects directory so that a laptop, workstation and remote servers stay aligned, with agents invoked to resolve diffs and avoid accidental loss of work repo updater intro and github repo.
Agent‑oriented UX: Emanuel describes ru as "agent‑first"—built so Claude Code sessions can run commands like ru and ru sync, interpret the structured output, decide which branches to update, and even manage GitHub issues and PRs on the user’s behalf using natural‑language prompts repo updater intro.

The project illustrates how some of the emerging tooling around agentic coding is less about new models and more about shaping everyday CLI utilities into predictable, scriptable surfaces that agents can safely drive.


🛡️ Jailbreak defense: Constitutional Classifiers++

Anthropic details next‑gen classifiers using activation probes plus exchange classifier—reporting no universal jailbreak in 1,700 red‑team hours at ~1% overhead and far fewer benign refusals. Safety beat continues with new numbers.

Anthropic details Constitutional Classifiers++ with 1% overhead and no universal jailbreaks found

Constitutional Classifiers++ (Anthropic): Anthropic unveiled its next-generation Constitutional Classifiers++ safety system, combining a lightweight activation probe that screens every request with a heavier "exchange" classifier that only runs on suspicious conversations, reporting ~1% additional compute cost and no universal jailbreak discovered after 1,700 hours of targeted red-teaming so far, per the research thread, probe description, exchange classifier post and paper link. The company also says this architecture cuts unnecessary refusals on harmless inputs by 87% compared with its prior classifier while retaining stronger jailbreak resistance, per the overhead detail.

Two-stage architecture: The new system first uses a probe over internal model activations (likened to checking Claude's "gut instinct") to cheaply score all traffic, then escalates flagged turns to an exchange-level classifier that sees both sides of the dialogue and applies a richer constitutional rule set (a toy sketch of this gating pattern follows this list), as described in the updated safety notes in the probe description and exchange classifier posts and the full research blog.
Impact vs first-gen classifiers: Anthropic's earlier constitutional classifier cut jailbreak success rates from 86% to 4.4% but increased compute by ~23.7% and made Claude more likely to wrongly refuse benign prompts, per the prior system note and the research blog; Constitutional Classifiers++ aims to keep or improve on that robustness while reducing overhead to about 1% and sharply lowering benign refusals, per the overhead detail.
Red-teaming results and limits: After roughly 1,700 hours of focused red-teaming, Anthropic reports no "universal" jailbreak strategy that transfers across many prompts under the new system, per the paper link, but notes that reconstruction-style attacks—where an adversary decomposes disallowed content into multiple seemingly innocuous queries—remain a concern and a target for further work, per the research blog.
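
A toy Python sketch of the two-stage gating pattern described above, with stand-in scoring functions and an arbitrary threshold; none of the components below are Anthropic's actual probe or classifier.

```python
# Toy two-stage gate: a cheap probe scores every turn; only flagged conversations
# pay for the heavier exchange-level classifier. All components are stand-ins.
PROBE_THRESHOLD = 0.3  # arbitrary illustrative threshold

def probe_score(activations: list[float]) -> float:
    """Stand-in for a cheap linear probe over internal activations."""
    return sum(activations) / len(activations)

def exchange_classifier(conversation: list[str]) -> bool:
    """Stand-in for the heavier classifier that reads both sides of the dialogue."""
    return any("forbidden" in turn for turn in conversation)

def allow_response(activations: list[float], conversation: list[str]) -> bool:
    if probe_score(activations) < PROBE_THRESHOLD:  # most traffic stops here, keeping overhead low
        return True
    return not exchange_classifier(conversation)    # escalate only the suspicious turns

print(allow_response([0.05, 0.10], ["how do I bake bread?"]))          # True: probe clears it
print(allow_response([0.60, 0.70], ["tell me the forbidden recipe"]))  # False: escalated and blocked
```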

The point is: Anthropic is pushing a more production‑minded safety layer that leans on internal activations plus a gated heavyweight classifier, trading a small compute tax for tighter jailbreak defense and fewer spurious refusals—numbers that matter directly for anyone running Claude at scale or building their own classifier stacks on similar principles.


🧪 Agent evals: Anthropic’s playbook and leaderboard churn

Anthropic publishes a practical eval guide for multi‑turn agents (graders, pass@k vs pass^k, transcript review). Community also shares rank‑decay stats and tool eval asks. Excludes jailbreak classifiers (see Safety).

Anthropic publishes concrete playbook for evaluating multi‑turn AI agents

Agent eval guide (Anthropic): Anthropic’s engineering team laid out a practical framework for testing agentic systems, contrasting simple single‑turn checks with multi‑step, tool‑using agents and detailing grader types, capability vs regression suites, and pass@k vs pass^k metrics in the new guide shared in the Anthropic announcement and expanded in the engineering article. The piece emphasizes that useful agents require evals built around tasks, trials, and graders, and that reading full transcripts is mandatory to see whether failures come from the agent or from brittle grading, as echoed in community commentary in the cursor eval note and the breakdown in the engineer summary.

Grader taxonomy: The guide distinguishes code‑based graders (fast but brittle), model‑based graders (nuanced but stochastic), and human graders (expensive but gold standard), and argues they should be combined rather than used in isolation, as described in the Anthropic announcement.
Capability vs regression suites: It frames early "capability" evals as low pass‑rate, exploratory tests, and "regression" evals as high pass‑rate guards that mature tasks graduate into, clarifying how teams can track real progress across deployments in the engineering article.
Handling non-determinism: The article pushes teams to track both pass@k (at least one success in k runs) and pass^k (all k succeed), noting that these diverge sharply at larger k and that relying on pass@k alone can hide fragile behavior, as summarized in the engineer summary (see the small numeric sketch after this list).
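
A small numeric sketch of the pass@k vs pass^k distinction, assuming independent trials with a fixed per-trial success probability (a simplification of how the metrics are estimated in practice):

```python
# pass@k vs pass^k under the simplifying assumption of independent trials
# with the same per-trial success probability p.
def pass_at_k(p: float, k: int) -> float:
    """Probability that at least one of k trials succeeds."""
    return 1.0 - (1.0 - p) ** k

def pass_hat_k(p: float, k: int) -> float:
    """Probability that all k trials succeed (written pass^k)."""
    return p ** k

p = 0.7
for k in (1, 4, 16):
    print(f"k={k:>2}  pass@k={pass_at_k(p, k):.3f}  pass^k={pass_hat_k(p, k):.3f}")
# At k=16 a 70%-reliable agent looks near-perfect on pass@k (~1.000) but
# collapses on pass^k (~0.003), the divergence the guide warns about.
```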

The document positions evals as an ongoing product‑engineering loop rather than a one‑time benchmark, with Anthropic explicitly recommending that teams continuously curate tasks from real failures and wire them into automated suites.

Engineers converge on concrete agent eval habits beyond benchmarks

Practitioner eval patterns (community): Multiple engineers are translating Anthropic’s eval guidance into practice, stressing transcript review, carefully scoped LLM judges, and avoiding vague Likert scales in favor of binary or graded task outcomes, as discussed in the eval takeaways and the eval judgment thread. The emerging pattern treats evals as instrumentation on real agent trajectories rather than synthetic leaderboard scores.

Trace reading as core loop: Practitioners highlight that failed tasks must be inspected step‑by‑step—looking at tool calls, environment changes, and grading decisions—to see whether an agent truly failed or the grader rejected a valid path, with one engineer urging teams to "pls look at the agent traces" and enumerate failure modes like formatting, logic, or environment bugs in the trace reading advice.
LLM-judge design: Several posts note that model-based graders are valuable but tricky; they encourage converting manual QA checks into deterministic tests, adding partial credit for complex tasks, and using model judges mainly where code-based checks cannot capture open-ended success, reflecting the nuance in the eval takeaways (a minimal grader sketch follows this list).
Likert scales skepticism: Eval specialists warn that 1–5 rating scales are costly to align and hard to act on, and that they let annotators dodge hard yes/no decisions, recommending binary pass/fail or task‑specific structured scores instead, as argued in the likert scale FAQ and supported by the eval flashcards.
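
A minimal sketch of the deterministic, code-based grader pattern these posts describe, using binary sub-checks with explicit partial credit instead of a 1–5 rating; the task, field names and expected values are invented for illustration.

```python
# Deterministic grader with binary sub-checks and partial credit; the task,
# field names and expected values are invented for illustration.
def grade_refund_ticket(agent_output: dict) -> dict:
    checks = {
        "refund_issued": agent_output.get("refund_issued") is True,
        "amount_correct": agent_output.get("refund_amount") == 42.50,
        "customer_notified": "email_sent" in agent_output.get("actions", []),
    }
    score = sum(checks.values()) / len(checks)  # partial credit across sub-checks
    return {"pass": all(checks.values()), "score": score, "checks": checks}

print(grade_refund_ticket({"refund_issued": True, "refund_amount": 42.50, "actions": []}))
# -> pass=False, score~0.67: a graded task outcome instead of an unanchored 1-5 rating
```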

Together these posts sketch a working culture where agent evals are built around concrete tasks, reproducible graders, and routine log review rather than high‑level satisfaction scores.

LMArena quantifies how fast top LLMs fall down the leaderboard

Leaderboard churn (LMArena): The LMArena team analyzed historical ChatbotArena rankings and found that models holding the #1 spot only stay there for about 35 days on average, with former leaders typically falling out of the top‑5 within 5 months and out of the top‑10 within 7 months, as visualized in the rank decay update. The animation shows earlier flagships like o1 and Claude 3 Opus drifting to ranks 56 and 139 respectively as newer systems arrive.

Video: rank decay animation

Implications for eval suites: The authors frame this as evidence that any single leaderboard snapshot quickly goes stale, suggesting that capability evals must be maintained as moving baselines rather than fixed targets if teams want to understand how their agents compare over time, according to the rank decay update.

The result underlines that agent and model evals are now operating in a high‑churn environment, where maintaining up‑to‑date comparisons matters as much as the choice of benchmarks themselves.

Letta AI open-sources .af-based harness for large-scale agent evals

Eval kit (Letta AI): Letta AI released an evaluation toolkit that uses .af Agent Files to clone and test agents at scale, turning the same configuration format used for production agents into a reproducible eval harness, as outlined in the letta eval note and detailed in the GitHub repo. The framework supports running many task trajectories per agent definition and surfacing results on a leaderboard site so teams can compare behaviors across agent versions or configurations.

Config‑as‑evals: By reusing Agent Files for eval runs, teams can capture model choice, tools, prompts, and memory settings in a single artifact, then instantiate identical agents under different test suites without drift, according to the GitHub repo.
Leaderboard and workflows: The repository includes example workflows, documentation assets, and a leaderboard front‑end, giving a reference for how to track pass rates and regressions across large numbers of long‑running agent trajectories, as noted in the letta eval note.

This pushes evals toward the same configuration‑driven pattern as deployment, making it easier to keep test agents and production agents in sync while experimenting with new capabilities.


📚 Reasoning & agent methods: context, batching, and signals

Rich research day spanning agent frameworks and reasoning dynamics: file‑centric long‑horizon agents, batch reflection gains, CoT uncertainty gaps, logical phase transitions, persona elicitation, and one‑sample RL scaling.

Fast-weight Product Key Memory framed as episodic memory that reaches 128K context

Fast-weight Product Key Memory follow-up (Sakana AI): New commentary on Fast-weight Product Key Memory (FwPKM) elaborates that it lets models treat main weights as long-term semantic knowledge and FwPKM as an episodic memory that stores recent variables and intermediate steps, extending the long-context memory idea introduced in long-context memory. The memory is dynamic. According to the FwPKM recap and FwPKM follow-up, FwPKM inherits Product Key Memory’s split-key lookup (two smaller key tables whose best matches are combined) but makes the memory writable both during training and inference; models equipped with FwPKM handle much longer contexts and can generalize to 128K‑token sequences despite being trained on much shorter windows by retrieving and updating relevant slots instead of relying solely on quadratic attention.
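
A minimal numpy sketch of the split-key (product-key) lookup described above; the table sizes, top-k, and the simple slot-overwrite used as the "fast-weight" write are illustrative assumptions, not the exact FwPKM update rule.

```python
# Split-key (product-key) lookup plus a naive episodic write, in numpy.
# Sizes, top-k, and the slot-overwrite rule are illustrative assumptions.
import numpy as np

d, n_sub, topk = 64, 128, 4           # half-query dim, keys per sub-table, candidates per table
rng = np.random.default_rng(0)
K1 = rng.standard_normal((n_sub, d))  # first sub-key table
K2 = rng.standard_normal((n_sub, d))  # second sub-key table
V = np.zeros((n_sub * n_sub, 2 * d))  # one value slot per (i, j) key pair

def lookup(query: np.ndarray) -> int:
    q1, q2 = query[:d], query[d:]
    s1, s2 = K1 @ q1, K2 @ q2                 # score each half against its own table
    i1 = np.argsort(s1)[-topk:]               # top-k indices in table 1
    i2 = np.argsort(s2)[-topk:]               # top-k indices in table 2
    # candidate full keys are the Cartesian product of the two top-k sets
    pairs = [(i, j, s1[i] + s2[j]) for i in i1 for j in i2]
    i, j, _ = max(pairs, key=lambda t: t[2])  # best combined score
    return int(i) * n_sub + int(j)            # flat slot index into V

def write(query: np.ndarray, value: np.ndarray, lr: float = 0.5) -> None:
    slot = lookup(query)
    V[slot] = (1 - lr) * V[slot] + lr * value  # assumed episodic "fast-weight" slot update

q = rng.standard_normal(2 * d)
write(q, np.ones(2 * d))
print(V[lookup(q)][:4])  # the same slot is retrieved and now stores the written value
```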

InfiAgent uses file-centric memory to keep long-horizon agents bounded

InfiAgent (multi-institution): A new InfiAgent framework proposes a hierarchical, file-centric agent architecture where long-horizon tasks interact with a persistent workspace on disk while the model’s active reasoning window stays fixed to roughly the last ten actions plus a snapshot of file state, according to the authors’ overview in the InfiAgent thread. The core idea is file-centric memory. Experiments on DeepResearch-style multi-hour investigations and an 80-paper literature review show a 20B open model maintaining higher coverage and fewer cascading failures than context-centric baselines by externalizing history into files rather than ever-growing prompts, as illustrated in the accompanying architecture diagram.
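
A minimal sketch of the file-centric pattern as described above, where the in-context window only ever holds the most recent actions plus a snapshot of the workspace while full history lives on disk; the window size, file layout and prompt format are assumptions, not InfiAgent's exact design.

```python
# Bounded context from files: keep only the last N actions in memory and rebuild
# the prompt from a workspace snapshot; full history is appended to disk.
# Window size, file layout and prompt format are illustrative assumptions.
import json
from collections import deque
from pathlib import Path

WORKSPACE = Path("workspace")
WINDOW = 10                                  # roughly "the last ten actions"
recent_actions: deque = deque(maxlen=WINDOW)

def log_action(action: dict) -> None:
    recent_actions.append(action)
    with open(WORKSPACE / "history.jsonl", "a") as f:  # externalize full history to a file
        f.write(json.dumps(action) + "\n")

def build_context() -> str:
    files = sorted(p.name for p in WORKSPACE.glob("*"))  # snapshot of current file state
    return json.dumps({"files": files, "recent_actions": list(recent_actions)}, indent=2)

WORKSPACE.mkdir(exist_ok=True)
log_action({"tool": "write_file", "path": "notes/paper_01.md"})
print(build_context())  # stays bounded no matter how long the task has been running
```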

Logical phase transitions reveal sharp reasoning collapse bands in LLMs

Logical phase transitions in LLM reasoning (Huazhong University): New work introduces a Logical Complexity Metric (LoCM) combining counts of facts, logical operators and nesting depth, then shows that LLM accuracy stays flat up to a threshold before collapsing sharply in narrow LoCM bands—logical phase transitions analogous to melting or boiling points, as framed in the phase transition thread. The collapse is abrupt, not smooth. The authors then augment training with paired natural-language and first-order logic forms plus a complexity-aware curriculum that spends extra time near the crash bands, improving direct-answer accuracy by about 1.26 points on average and step-by-step reasoning accuracy by roughly 3.95 points across five benchmarks, with side-by-side physical vs logical phase diagrams shown in the accompanying figures.
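
For intuition, here is an illustrative (and deliberately crude) complexity score in the spirit of LoCM, counting predicate "facts", logical operators and parenthesis nesting depth; the unweighted sum and regex-based parsing are assumptions, since the paper defines the actual metric.

```python
# Crude logical-complexity score: predicate "facts" + operators + nesting depth.
# The unweighted sum and regex parsing are assumptions; the paper defines real LoCM.
import re

OPS = {"AND", "OR", "NOT", "IMPLIES"}

def locm(formula: str) -> int:
    tokens = re.findall(r"[A-Za-z_]\w*", formula)
    operators = sum(t.upper() in OPS for t in tokens)
    facts = sum(t.upper() not in OPS and t[0].isupper() for t in tokens)  # predicate symbols
    depth = cur = 0
    for ch in formula:                          # nesting depth via parentheses
        cur += (ch == "(") - (ch == ")")
        depth = max(depth, cur)
    return facts + operators + depth

print(locm("IMPLIES(AND(Bird(x), NOT(Penguin(x))), Flies(x))"))  # -> 10
```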

SPICE fuses priors and context for regret-optimal in-context RL

SPICE in-context RL (Mila & collaborators): SPICE proposes an in-context reinforcement learning scheme where an LLM learns a value prior over actions via deep ensembles, then updates that prior online by fusing small context windows of recent interaction data in a Bayesian way—achieving regret-optimal behavior in both stochastic bandits and finite-horizon MDPs even when pretraining data is suboptimal, according to the SPICE paper. The method adds uncertainty bonuses. Instead of updating weights, SPICE keeps model parameters fixed and treats the prompt as an evolving state that encodes posterior beliefs over Q‑values, allowing fast adaptation on unseen tasks while keeping regret lower than prior in-context RL and meta-RL baselines across bandit and control benchmarks, as detailed on the paper's title page.

ChaosNLI study finds CoT changes decisions more than uncertainty

Chain-of-thought and uncertainty (ChaosNLI study): A ChaosNLI-based analysis separates the effect of chain-of-thought (CoT) on final label choice from its effect on uncertainty modeling, finding that around 99% of accuracy changes stem from the presence of CoT text itself while over 80% of the ranking over alternative answers appears to come from the model’s latent beliefs, not the written reasoning, according to the CoT decoupling thread. CoT mainly pushes toward a choice. The authors construct tasks with 100 human labels per example, use CoT to generate partial or full reasoning, and then feed those traces into other models to see how much the text versus internal state shifts answer distributions, concluding that CoT currently behaves more like a decision nudge than a calibrated uncertainty explanation, with illustrative accuracy–complexity plots shown in the accompanying figures.

GDPO stabilizes group-normalized multi-reward RL beyond GRPO

GDPO for multi-reward RL (GDPO authors): GDPO (Group reward‑Decoupled Normalization Policy Optimization) targets the collapse of reward normalization seen in GRPO when optimizing for multiple rewards, proposing a decoupled normalization scheme that stabilizes training and improves multi-reward reasoning benchmarks as described in the GDPO summary. The paper focuses on multi-objective RL. By separating normalization across reward groups instead of pooling them, GDPO maintains useful gradients for each objective and avoids the flattening that can stall learning, with experiments in the Hugging Face papers entry reporting better stability and performance on complex reasoning and alignment tasks than GRPO-style baselines under the same compute budget, even though detailed numeric tables live in the underlying arXiv preprint rather than the tweet itself.
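
A minimal numpy contrast between pooled normalization (GRPO-style, as described above) and per-reward-group normalization (the GDPO idea as summarized); clipping, KL terms and GDPO's exact objective are omitted here.

```python
# Pooled (GRPO-style) vs per-group (GDPO-style) reward normalization, in numpy.
# Clipping, KL penalties and the exact GDPO objective are omitted.
import numpy as np

def pooled_advantages(rewards: np.ndarray) -> np.ndarray:   # rewards: (n_samples, n_reward_types)
    total = rewards.sum(axis=1)                              # collapse all reward types first
    return (total - total.mean()) / (total.std() + 1e-8)

def decoupled_advantages(rewards: np.ndarray) -> np.ndarray:
    norm = (rewards - rewards.mean(axis=0)) / (rewards.std(axis=0) + 1e-8)  # per reward group
    return norm.sum(axis=1)                                  # combine only after normalizing each group

r = np.array([[1.0, 100.0], [0.0, 102.0], [1.0, 98.0], [0.0, 100.0]])  # small- vs large-scale rewards
print(pooled_advantages(r))     # dominated by the large-scale reward column
print(decoupled_advantages(r))  # both objectives retain a useful gradient signal
```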

IROTE uses self-reflective prompts to stably elicit LLM personas

IROTE trait elicitation (THUNLP, Microsoft, Fudan): IROTE (In-context self‑Reflective Optimization for Trait Elicitation) is a training-free framework that generates a compact, evocative self‑reflection text for an LLM, then uses it as a stable identity prompt that elicits consistent personality traits across both questionnaires and open-ended tasks, as introduced in the IROTE paper. The method draws from psychology. Inspired by self-reflective identity processing theory, IROTE iteratively optimizes the reflection to balance compactness (low noise) and evocativeness (strong trait activation) via information-theoretic objectives, producing a single reflection that transfers across models like GPT‑4o and Mistral and across tasks beyond simple Likert questions, with questionnaire vs story-writing trait scores contrasted in the accompanying figures.

Large Reasoning Models show English-centric latent reasoning across languages

Multilingual latent reasoning gaps (LMU Munich): A study on Large Reasoning Models tests the same math problems in 11 languages and measures how often a model can still give correct answers when parts of its chain-of-thought are hidden, finding strong latent reasoning in high-resource languages like English and Chinese but much weaker signals in low-resource languages such as Swahili and Telugu, per the multilingual reasoning paper. Internal states tend to look English-like. On easier grade-school math, large models can often answer correctly with 0% of written reasoning visible in high-resource languages, while on harder competition problems that early latent signal mostly vanishes and internal activation patterns across languages cluster closer to English than to each target language, suggesting that today’s multilingual reasoning is often an English-centric core wrapped in translated surface forms, as shown in the reproduced figures.

Pruning shows LLMs already encode which reasoning tokens truly matter

Reasoning token importance (University of Illinois): A study on functional importance of reasoning tokens treats each intermediate token as removable and prunes those whose deletion causes the smallest drop in answer confidence, finding a 0.88 correlation between attention-based scores and actual functional importance across tasks as summarized in the token importance paper. The pruning reveals where work happens. When student models are trained on pruned traces from a stronger LLM, at equal trace length they outperform baselines that keep different subsets of tokens or depend on a separate teacher model to label importance, suggesting LLMs already encode which reasoning words matter most while often filling the rest with removable filler, as discussed around the paper's abstract and title page.

Semantic reasoning graphs catch RAG hallucinations better than token LRP

Semantic reasoning graphs for hallucination detection (Peking University): A new method builds a semantic-level internal reasoning graph from an LLM’s hidden relevance scores, then flags answer chunks whose support comes more from prior model output than from retrieved context, improving hallucination-detection F1 by roughly 3–6 percentage points on RAGTruth and Dolly‑15K compared to prior baselines in the hallucination paper. The graph is built from LRP. The approach extends layer-wise relevance propagation from tokens to short meaning fragments, connects them into a directed graph whose edges encode contribution weights, and then uses a lightweight classifier over local subgraphs to decide if each fragment is grounded, with example graphs and dashed-box hallucinated nodes shown in the accompanying figures.


🗣️ STT baseline shifts: ElevenLabs Scribe v2

Enterprise‑ready STT advances: new accuracy, domain term handling, and compliance features. Mostly voice infra news today; separate from creative audio.

ElevenLabs Scribe v2 pushes STT accuracy toward 5% WER

ElevenLabs Scribe v2 (ElevenLabs): ElevenLabs is positioning Scribe v2 as a new baseline for automatic speech recognition, claiming around 5% word error rate in English and under 10% for many of its 90+ supported languages, according to the launch breakdown in the stt overview; early users describe it as "the most accurate transcription model ever released" in the accuracy praise. Scribe v2 is available both via API and in ElevenLabs Studio for subtitles, captions and batch transcription, with code examples and model options outlined in the api launch and the stt docs.

The upgrade is framed as a response to predictable STT failure modes—turning "async" into "a sink" or "Kubernetes" into "Cooper Netties"—which Scribe v2 aims to correct by better modeling technical terms and long-form content as described in the failure patterns. The messaging stresses strong performance even on niche domains, while acknowledging that very low-resource languages remain challenging per the feature summary in the stt overview.

Scribe v2 introduces contextual keyterm prompting and rich entity detection

Keyterm and entity tools (ElevenLabs): Beyond raw accuracy, Scribe v2 adds a keyterm prompting mechanism that lets callers pass up to 100 domain-specific terms or phrases, which the model uses as context to decide when each term actually applies—avoiding naive custom vocab behavior that would, for example, force "SQL" into every instance of "sequel"—as explained in the keyterm feature and summarized again in the failure patterns. On top of this, Scribe v2 can detect entities across 56 categories, spanning PII, health data and payments, and returns exact timestamps for each instance, aimed at compliance, redaction and analytics workflows as detailed in the entity feature.
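
A hedged sketch of what a keyterm-aware Scribe v2 request could look like over ElevenLabs' speech-to-text REST endpoint; the endpoint and xi-api-key header exist for earlier Scribe models, but the model_id value and the keyterm/diarization field names below are assumptions for illustration only, so check the stt docs above for the real parameter names.

```python
# Hedged sketch of a keyterm-aware Scribe v2 transcription request.
# The /v1/speech-to-text endpoint and xi-api-key header exist for earlier Scribe models;
# the model_id value and the keyterms/diarize field names are assumptions for illustration.
import requests

with open("standup_recording.mp3", "rb") as audio:
    resp = requests.post(
        "https://api.elevenlabs.io/v1/speech-to-text",
        headers={"xi-api-key": "YOUR_API_KEY"},
        files={"file": audio},
        data={
            "model_id": "scribe_v2",                      # assumed identifier
            "keyterms": "Kubernetes,async,OpenCode,MCP",  # assumed field name; up to 100 terms
            "diarize": "true",                            # assumed flag for speaker labels
        },
        timeout=600,
    )
print(resp.json().get("text", ""))
```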

Speaker and timing metadata: The model exposes speaker diarization for up to 48 speakers, word-level timestamps for precise subtitle syncing, and audio event tags (e.g., laughter or footsteps) so downstream systems can distinguish speech from other sounds, according to the diarization feature and follow-up in the failure patterns.
Enterprise integration focus: ElevenLabs is pitching these capabilities as building blocks for complex audio pipelines, with Scribe v2 intended for high-volume batch captioning and documentation while Scribe v2 Realtime covers interactive use, as outlined in the api launch and supported by the updated models docs.

The feature set effectively turns the STT layer into a structured data extractor, so teams can plug transcripts, entities and timing metadata straight into compliance, BI or support tooling rather than treating speech-to-text as a plain text box.

Scribe v2 Realtime targets low-latency voice agents and live tools

Scribe v2 Realtime (ElevenLabs): Alongside the batch model, ElevenLabs and ecosystem commentators highlight Scribe v2 Realtime as an ultra low-latency variant tuned for voice agents and live experiences, pairing the new accuracy and entity features with faster turnaround for conversational systems, as described in the realtime intro. The main Scribe v2 is framed as handling batch transcription, subtitles and long-form assets, while Scribe v2 Realtime is "optimized for ultra low latency and agents use cases" in the product notes referenced by the api launch.

Scribe v2 Realtime carries over precision timestamps and entity detection, so agent frameworks can align responses, display live captions or trigger flows off detected PII or payment mentions while a call is in progress, according to the combined descriptions in the realtime intro and entity feature. The realtime path is also exposed through the same developer surface and documentation as the batch model, with ElevenLabs steering teams that need interactive, streaming ASR toward the Realtime endpoint and offline workloads toward Scribe v2 proper in the stt docs.

Scribe v2 targets 10-hour files and HIPAA/GDPR for enterprise STT

Enterprise STT stack (ElevenLabs): ElevenLabs is also pushing Scribe v2 as an enterprise-ready transcription stack, supporting audio files up to 10 hours and advertising compliance features like SOC 2, ISO27001, PCI DSS L1, HIPAA and GDPR, plus EU and India data residency and an optional zero-retention mode, according to the capability bullets in the stt overview and elaborated in the stt overview follow-up. The API is designed for large libraries of marketing, media, research and training content, with batch processing, long-form captioning and subtitling use cases called out in the api launch and the stt docs.

Regulated-data workflows: Combined with entity detection for PII and health/payment data plus precise timestamps, Scribe v2 can drive redaction pipelines or flag sensitive segments in customer support and medical transcripts, as described in the entity feature and diarization feature.
Studio and API parity: ElevenLabs states that the same models back both the Studio UI and the API, so teams can prototype transcription and tagging interactively and then lift those settings into production jobs, per the messaging in the api launch.

The emphasis on long-duration files and formal certifications indicates Scribe v2 is aimed at replacing or augmenting incumbent enterprise STT providers rather than remaining a purely creator-focused tool.


📈 MiniMax IPO pops; China’s challenger arms up

MiniMax lists in Hong Kong, surges >50–100% day one with ~HKD 100B–137B valuation. Signals capital for a lean (≈389 staff) full‑stack lab competing on coding, video, audio. Enterprise‑scale signal; not a model release.

MiniMax’s Hong Kong IPO more than doubles, valuing lean full‑stack lab near $13–14B

MiniMax IPO (MiniMax): China’s MiniMax listed on the Hong Kong Stock Exchange, with shares priced at HK$165 and trading up sharply on day one (reported gains ranged from 54% to more than 100%), closing around HK$345 and taking the company past HKD 100B (≈$12.8–13.7B) in market value according to the ipo pop note, offering recap and valuation thread. The deal raised about $619–620M (HK$4.8B) earmarked mainly for R&D, with the retail tranche reportedly oversubscribed roughly 1,837× and a book built from global institutions including Alibaba, ADIA, Boyu Capital, Mirae and other long-only funds in Europe, North America, the Middle East and Asia, as described in the valuation thread and stack overview.

Capital‑efficient challenger: Commentators highlight that MiniMax reportedly operates with about 389 employees and burns on the order of 1% of OpenAI’s spend while fielding competitive systems like the M2.1 language model, Hailuo video generation and Talkie character chat apps across text, image, video, audio and agents, per the efficiency claim and stack overview.
Position in the model race: Practitioners frame M2.1 as “trading blows” with Claude Opus and Gemini for web development and coding quality at a lower price point, and note that the new capital gives one of the first Chinese full‑stack labs a public‑market war chest while Anthropic and OpenAI remain private, per the efficiency claim and benchmark sentiment.

The listing makes MiniMax one of the few frontier‑model labs with access to public equity markets, signaling investor appetite for lean, multimodal AI stacks out of China and adding a new well‑funded competitor to the US‑dominated frontier model landscape.


🚧 Frontier watch: DeepSeek V4 coding push (Feb)

Multiple sources flag DeepSeek V4 arriving mid‑February, expected to surpass Claude/GPT in coding; early tests likely on LMArena. This is a forward‑look, not a release today; complements infra and eval beats.

DeepSeek V4 tipped for mid‑February with claims of coding gains over Claude and GPT

DeepSeek V4 (DeepSeek): Multiple reports say the next DeepSeek flagship is scheduled for early-to-mid February around Lunar New Year, with internal benchmarks claiming it outperforms Claude 4.5 and GPT-5.2 on coding tasks, according to Chinese coverage of the coding claims and the linked information article from The Information’s reporting. One summary attributed to people with direct knowledge of the plan says initial DeepSeek employee tests showed V4 “outperformed existing models, such as Anthropic's Claude and OpenAI's GPT series, in coding,” reinforcing the expectation that it could become the strongest coding-specialized model on release, per the article excerpt, and prompting community commentary that it “shows stronger coding performance than current GPT and Claude models” in the model hype.

Video: DeepSeek V4 teaser

The teaser clip announcing “DeepSeek V4 is coming mid February” and that “coding is about to get serious” further anchors both the timing and the positioning of V4 as a coding‑first frontier model in this cycle video countdown.

DeepSeek expected to use LMArena/ChatbotArena again to benchmark V4

Arena testing plans (DeepSeek): An LMArena watcher notes that DeepSeek‑V4 is likely to be released around February 17 (Chinese New Year) and that the team could “potentially test the model earlier on LmArena,” pointing out that DeepSeek’s V3.1 and V3.2 bases were already evaluated there to approximate user preferences arena timing; the cited paper excerpt explains how DeepSeek used ChatbotArena Elo scores from November 2025 to verify that new sparse‑attention bases matched the previous iteration despite architectural changes, which suggests V4 will again be benchmarked in public, head‑to‑head matches against GPT, Claude and other frontier systems on the same platform arena timing.

This pattern positions LMArena/ChatbotArena as one of the first places where independent users may see comparative coding and reasoning performance once V4 is available.

Commentators frame 2026 as a fast iteration year for DeepSeek and Chinese coding models

Chinese coding models (DeepSeek and peers): Community commentary argues that “several Chinese companies” are on track to release new frontier models before Chinese New Year, explicitly naming DeepSeek’s next flagship (described interchangeably as “r2 or v4”) and saying its recent paper has “major implications for making models more stable, faster, and cheaper to train,” while early internal feedback reportedly shows coding performance better than Claude or GPT iteration outlook; a follow‑up thread characterizes 2026 as a “fast iteration year for DeepSeek,” with the upcoming model’s large coding claims raising a concrete strategic dilemma for teams about whether to invest more in training and inference on current‑generation models now or wait for this new Chinese wave to land investment dilemma.

This framing reinforces the idea that Chinese labs, and DeepSeek in particular, are planning shorter release cycles focused on code and agentic reliability rather than slow, monolithic upgrades.

DeepSeek V4 hyped as a moment where Chinese coding models could overtake US peers

Competitive landscape and expectations (DeepSeek V4): Several observers describe the upcoming DeepSeek‑V4 as a potential point where China “overtakes the USA” in coding‑focused LLMs, citing The Information’s note that the new model is “expected to outperform competitors like Anthropic's Claude in coding” on internal benchmarks overtake framing and separate claims from sources close to the launch that V4 shows “stronger coding performance than current GPT and Claude models” coding lead claim; this narrative sits alongside broader tracking of China‑based open‑weights labs such as DeepSeek and GLM, where Artificial Analysis has already highlighted GLM‑4.7 as the top open‑weights model on its Intelligence Index v4.0 and noted multiple Chinese releases advancing the frontier in 2025–26 open weights entrants.

Some market‑watching accounts extend this to speculation about impacts on Western AI leaders and even AI‑exposed stocks once V4 is live, but concrete, independently verified cross‑model coding benchmarks have not yet been published in these discussions.


🎨 Creator stacks: Midjourney Niji V7, Kling control, $20k challenge

Significant creative/vision items today: Midjourney’s Niji V7 upgrade, Kling Motion Control how‑tos, and Higgsfield’s $20k AI‑cinema contest. Dedicated section retained due to volume of creator posts.

Midjourney ships Niji V7 with stronger anime and text rendering

Niji V7 (Midjourney): Midjourney has rolled out Niji V7, a new version of its anime-focused image model with upgrades in anime coherence, prompt understanding, text rendering, and style-reference (sref) performance, according to the official launch note in the Niji announcement; early testers highlight that it captures specific aesthetics like “night elf” style with minimal prompting and are considering re-subscribing to Midjourney on the strength of the update, per the subscription reaction.


Anime and sref focus: The team calls out better anime coherence and sref handling, which matters for character consistency and branded looks in ongoing series work Niji announcement.
Creator sentiment: Community posts describe Niji 7 as “amazing” and show multi‑image grids with clean composition and lighting for complex scenes, suggesting it is already becoming a go‑to for anime/webtoon pipelines Niji recap.

For AI artists and tool builders, this version positions Niji more clearly as the specialized anime engine in the broader image‑model stack rather than a generic model with an anime mode.

Creators standardize on Kling Motion Control for precise dance and movement edits

Motion Control (Kling): Kling’s Motion Control workflow is being adopted as a precise way to transfer body and facial movements from a driving video (e.g., Michael Jackson dance clips) onto a static reference character, with creators publishing step-by-step guides that show frame-accurate motion across the generated clip, per the motion guide; the same guide frames the tool as the "perfect model for getting exactly what you want" for movement control.

Video: Kling dance transfer demo

Workflow pattern: Guides describe a repeatable recipe—pick a reference dance video, choose a target still image, select Kling model 2.6 in Motion Control mode, then let Kling synthesize a new clip where the target performs the source choreography, preserving both pose and facial expression motion guide.
Usage in AI filmmaking: Follow‑up threads position Motion Control as part of a broader AI filmmaking stack, with creators calling it “the future of AI filmmaking” and encouraging others to chain it with other generative tools for polished music‑video‑style content filmmaking thread.

For engineers building creative tools, this pattern shows that reference‑video‑plus‑portrait control is becoming a standard interface for motion‑driven character animation rather than a niche trick.

Higgsfield launches $20k AI‑Cinema Challenge around Cinema Studio

Higgsfield Cinema Challenge (Higgsfield): Higgsfield has kicked off a $20,000 "Higgsfield Cinema" challenge inviting creators to make short AI‑generated films with its Cinema Studio product, offering a $10k first prize, $5k and $3k for runners‑up, plus ten $200 "Higgsfield Choice" awards, with submissions due by January 24 (EOD PST) challenge announcement.

Higgsfield challenge trailer

Mechanics and rules: Entries must tag @higgsfield_ai, include “@higgsfield.ai #HiggsfieldCinema” in the caption, and keep the official Higgsfield watermark in the output, which effectively turns the contest into a large‑scale showcase of Cinema Studio’s visual capabilities challenge announcement.
Distribution push: A follow‑up post repeats the rules and links directly to Cinema Studio downloads, signalling that the challenge is designed both to seed content and to drive hands‑on use of the tool among filmmakers and motion designers cinema followup.

For people tracking creator‑oriented AI, this contest is a concrete example of a vendor using prize pools and social constraints (tags, watermarks) to bootstrap an ecosystem of Cinema‑Studio‑native shorts.


🏢 System-level ROI: Datadog’s Codex code review

A concrete enterprise case: Datadog pilots Codex for system‑level code review. ‘Incident replay’ suggests it would have flagged ~22% of studied incidents—evidence for AI ROI beyond lint. Separate from the access Feature.

Datadog’s incident replay shows Codex can catch 22% of historical issues

Datadog Codex case study (OpenAI): Datadog piloted Codex as a system-level code reviewer on one of its largest repositories, auto-reviewing every pull request and surfacing cross-service risks that human reviewers and earlier tools often missed, according to the Datadog teaser and the case study. An "incident replay" on historical PRs linked to production issues found that Codex would have produced actionable feedback in more than 10 cases—about 22% of the sampled incidents—indicating measurable ROI beyond lint-style checks.

System-context feedback: Engineers reported that Codex highlighted interactions with untouched modules and missing tests, giving higher-signal, less noisy comments than previous linters and shallow AI tools, as described in the case study.
Incident reduction potential: In the replay harness, Datadog’s team confirmed that Codex’s hypothetical comments would have changed how engineers approached the risky PRs in roughly a fifth of examined incidents, suggesting it can complement human review rather than replace it case study.
Operational positioning: OpenAI and Datadog frame Codex as a consistent, scalable reviewer that can be applied to every PR in large microservice estates, pushing systemic issues earlier in the lifecycle instead of relying only on senior engineers or post-incident analysis Datadog teaser.

The case gives rare quantitative evidence that a code-review LLM can materially reduce incident risk at scale, rather than only speeding up individual developers.
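The replay pattern itself is simple to prototype. Below is a hypothetical Python sketch of an incident-replay harness, assuming a list of historical PR diffs that were later linked to incidents and a review_pull_request hook for whatever review model you use; it illustrates the measurement pattern rather than Datadog's or OpenAI's implementation.

```python
from dataclasses import dataclass

@dataclass
class IncidentPR:
    incident_id: str
    diff: str          # the PR diff that was later linked to a production incident
    postmortem: str    # short description of what actually went wrong

def review_pull_request(diff: str) -> list[str]:
    """Hypothetical hook: send the diff to your code-review model and
    return its review comments. Swap in your own API call here."""
    raise NotImplementedError

def is_actionable(comment: str, postmortem: str) -> bool:
    """Hypothetical judgment step: replays of this kind rely on engineers
    (or an LLM judge) to decide whether a comment would have changed how
    the PR was handled."""
    raise NotImplementedError

def replay(incidents: list[IncidentPR]) -> float:
    """Return the share of historical incidents where the reviewer produced
    at least one actionable comment on the offending PR."""
    caught = 0
    for item in incidents:
        comments = review_pull_request(item.diff)
        if any(is_actionable(c, item.postmortem) for c in comments):
            caught += 1
    return caught / len(incidents) if incidents else 0.0

# A result of replay(sampled_incidents) == 0.22 would correspond to the ~22% figure reported above.
```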


🧵 MCP and skills interoperability is accelerating

Interoperability tools for agents expanded: new CLI for dynamic discovery, OpenAI ships an MCP for docs, and skills loaders simplify packaging. Today’s focus is packaging/transport, not coding harnesses or evals.

OpenAI launches Developer Docs MCP for API, Codex, Apps SDK and commerce protocol

Developer Docs MCP (OpenAI): OpenAI introduced a new MCP server that exposes its developer documentation and code samples—covering the API, Codex, Apps SDK, and Agentic Commerce Protocol—so MCP-aware agents can query docs and examples directly, as announced in the mcp teaser and detailed in the mcp docs. The example command, codex mcp add openaiDeveloperDocs --url https://developers.openai.com/mcp, shows it is designed to drop straight into existing Codex and Skills-based harnesses.

Scope of content: The server indexes not just reference docs but also code samples for the API, Codex, and newer surfaces like the Apps SDK and Agentic Commerce Protocol, turning them into structured MCP tools rather than unstructured text, according to the mcp teaser.
Agent integration: Because it uses the standard MCP transport, any compatible client (Codex app server, third‑party IDE harnesses, or custom agents) can point at openaiDeveloperDocs and let the model search, retrieve, and ground answers in first‑party docs instead of hitting the public web, as shown in the mcp docs.
Ecosystem continuation: Following up on papers mcp, where Hugging Face used an MCP server to front academic papers to an assistant, OpenAI is now applying the same pattern to its own platform docs, reinforcing MCP as a shared way to package knowledge sources for agents.

This moves a significant chunk of OpenAI’s developer surface area behind a common protocol, reducing the need for bespoke scraping or custom plugins when building doc-aware agents.
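For clients other than Codex, the same endpoint can be reached with any MCP client library. Here is a minimal sketch assuming the official MCP Python SDK (the mcp package on PyPI) and a streamable-HTTP transport on the server; the URL comes from the announcement, while the transport choice and everything else is an assumption to verify against the mcp docs.

```python
import asyncio

# Assumes the official MCP Python SDK ("mcp" on PyPI) and that the docs
# server speaks the streamable HTTP transport; adjust if it does not.
from mcp import ClientSession
from mcp.client.streamable_http import streamablehttp_client

DOCS_MCP_URL = "https://developers.openai.com/mcp"

async def list_docs_tools() -> None:
    async with streamablehttp_client(DOCS_MCP_URL) as (read, write, _):
        async with ClientSession(read, write) as session:
            await session.initialize()
            tools = await session.list_tools()
            for tool in tools.tools:
                # Print whatever search/retrieval tools the server advertises.
                print(tool.name, "-", tool.description)

if __name__ == "__main__":
    asyncio.run(list_docs_tools())
```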

mcp-cli debuts as dynamic MCP discovery CLI with ~99% token savings

mcp-cli (Phil Schmid): A new open-source CLI for the Model Context Protocol focuses on discovery instead of loading full tool catalogs up front, with claimed ~99% reduction in MCP token usage via on-demand metadata fetching, as described in the mcp-cli intro and expanded in the blog post. It compiles via Bun into a single standalone binary, supports both stdio and HTTP MCP servers, and is aimed squarely at AI coding agents like Claude Code and Gemini CLI.

Token and performance impact: Rather than having agents pre-load every tool description, mcp-cli exposes a discovery flow where the agent first lists servers, then tools, and finally fetches schemas only when needed, which the author reports cuts MCP prompt tokens from ~47k to ~400 for a sample workflow—around a 99% reduction, according to the blog post.
Developer ergonomics: The tool offers a glob-based mcp-cli grep command that searches across all servers, JSON output for scripting, and automatic retries with exponential backoff, with configuration handled through a simple mcp_servers.json file as shown in the github repo.
Ecosystem continuation: Following up on conductor mcp, which wired a single MCP server into code review UIs, mcp-cli moves the focus to a shared discovery layer that any MCP-capable client or agent harness can sit on top of, decoupling server choice from the application.

The release positions MCP less as a per‑app integration and more as a small protocol that many servers and many agents can share, with mcp-cli acting as a low-friction adapter layer for both humans and LLMs.
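The token saving comes from a simple inversion: instead of pasting every tool schema into the prompt up front, the agent gets a tiny discovery surface and pulls schemas on demand. The Python sketch below shows that pattern generically, shelling out to a CLI the way an agent harness might; the subcommand names (servers, schema) are illustrative placeholders rather than mcp-cli's documented interface, so check the github repo for the real flags.

```python
import json
import subprocess

def run_cli(*args: str) -> str:
    """Shell out to a discovery CLI and return its stdout.
    The 'mcp-cli' subcommands used below are illustrative placeholders."""
    return subprocess.run(
        ["mcp-cli", *args], capture_output=True, text=True, check=True
    ).stdout

def discovery_prompt() -> str:
    """Eager loading would dump every tool schema into the prompt; lazy
    discovery only lists server names up front and defers everything else."""
    servers = run_cli("servers", "--json")
    return (
        "You can call MCP tools. Known servers:\n"
        f"{servers}\n"
        "Ask for a tool list or a specific schema only when you need it."
    )

def fetch_schema_on_demand(server: str, tool: str) -> dict:
    """Called only after the model decides it actually needs this tool."""
    return json.loads(run_cli("schema", server, tool, "--json"))
```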

OpenRouter SDK adds Skills Loader to reuse Agent Skills across models

Skills Loader (OpenRouter): OpenRouter announced an SDK "Skills Loader" that can dynamically pull in Agent Skills definitions and wire them into any model’s context with a one-liner, turning Skills from a Claude‑specific concept into something portable across providers, according to the skills loader announcement and the skills docs. The loader scans a skills directory, loads the instructions and metadata, and attaches them as tools/prompts when you call a model via the OpenRouter SDK.

Interoperable packaging: The Skills Loader reads skill manifests (the same format popularized by Claude Code) and makes them available to any model reachable via OpenRouter—GPT‑5.2, Claude, Gemini, GLM‑4.7, MiniMax and others—without rewriting the skills for each provider, as described in the skills loader.
Developer workflow: In example snippets, adding skillsLoader({ dir: "./skills" }) to the OpenRouter SDK client is enough to have all skills in that folder discovered and injected into calls, avoiding per‑call boilerplate and keeping skills versioned on disk, per the skills docs.
Ecosystem continuation: Following up on claude skills, which framed Skills as Claude’s answer to plugins, this loader pushes Skills toward a de facto portable format, with OpenRouter acting as a neutral transport layer between those skill packs and whichever model you route traffic to.

The update nudges the ecosystem away from model‑locked skill systems and toward a shared, file‑based skills layer that multiple harnesses and providers can consume.
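The loader in the announcement is a JavaScript one-liner; conceptually, a skills loader just scans a directory of skill manifests and injects them into the request. Below is a hand-rolled Python sketch of that idea against OpenRouter's OpenAI-compatible endpoint; the SKILL.md layout mirrors the published Agent Skills convention, but the loader, file layout, and model slug here are illustrative and are not the OpenRouter SDK's actual API.

```python
from pathlib import Path
from openai import OpenAI  # OpenRouter exposes an OpenAI-compatible API

def load_skills(skills_dir: str = "./skills") -> str:
    """Concatenate every SKILL.md under skills_dir into one system block.
    Real loaders also parse the YAML frontmatter (name, description)."""
    parts = []
    for manifest in sorted(Path(skills_dir).glob("*/SKILL.md")):
        parts.append(f"## Skill: {manifest.parent.name}\n{manifest.read_text()}")
    return "\n\n".join(parts)

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key="YOUR_OPENROUTER_KEY",  # placeholder
)

response = client.chat.completions.create(
    model="openrouter/auto",  # or any specific model slug available on OpenRouter
    messages=[
        {"role": "system", "content": load_skills()},
        {"role": "user", "content": "Use the release-notes skill to draft notes for v1.4."},
    ],
)
print(response.choices[0].message.content)
```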


🗂️ Parsing and structured extraction at scale

Doc parsing/extraction workflows for agents and RAG advanced: tutorials for zero‑shot multi‑doc extraction and a new open parser that detects scan vs digital and recovers layout. Mostly pipeline engineering news today.

ByteDance’s Dolphin parser targets complex PDFs and scans, hits 89.8 on OmniDocBench

Dolphin document parser (ByteDance): A new open‑source parser called Dolphin from ByteDance focuses on hard PDFs and document images, detecting whether each page is scanned or digital, restoring layout and reading order, and parsing text, tables, formulas and code with different strategies Dolphin overview; the largest 3B variant reports up to 89.8 on OmniDocBench, placing it among the stronger open document understanding stacks.

Hybrid parsing pipeline: Dolphin routes pages through OCR or text extraction based on a scan/digital classifier, then applies layout analysis to reconstruct reading order and region hierarchy so downstream models can work with semantically coherent blocks rather than raw tokens Dolphin overview.
Model sizes and backends: The release spans 0.3B–3B models and is designed to run with engines like vLLM and TensorRT‑LLM, giving teams options to trade off latency vs quality while keeping deployment within familiar inference stacks Dolphin GitHub.
Benchmarks and scope: Reported OmniDocBench scores (topping out at 89.8) and support for code snippets and formulas indicate a focus on complex technical and scientific documents rather than only simple business PDFs Dolphin overview.

Dolphin effectively slots in as a parsing front‑end for RAG and agent systems that need high‑fidelity structure from messy real‑world documents before handing work off to larger LLMs.
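Dolphin's own API is documented in the Dolphin GitHub repo; the routing idea itself is worth internalizing for any parsing front-end. The sketch below shows a generic version of the scan-vs-digital decision using pypdf's text layer as the signal, with the OCR and layout steps left as hypothetical hooks; it illustrates the pattern, not Dolphin's implementation.

```python
from pypdf import PdfReader

def page_is_digital(page, min_chars: int = 30) -> bool:
    """Heuristic: pages with a usable embedded text layer are 'digital';
    pages with little or no extractable text are treated as scans."""
    text = page.extract_text() or ""
    return len(text.strip()) >= min_chars

def ocr_page(page) -> str:
    """Hypothetical hook: render the page to an image and run your OCR or
    vision model of choice (this is where a Dolphin-style model slots in)."""
    raise NotImplementedError

def parse_pdf(path: str) -> list[dict]:
    """Route each page down the cheap text-extraction path or the OCR path,
    returning per-page blocks a layout/reading-order step can consume."""
    blocks = []
    for i, page in enumerate(PdfReader(path).pages):
        if page_is_digital(page):
            blocks.append({"page": i, "source": "text-layer", "text": page.extract_text()})
        else:
            blocks.append({"page": i, "source": "ocr", "text": ocr_page(page)})
    return blocks
```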

LlamaIndex shows zero‑shot multi‑doc extraction with LlamaSplit and LlamaExtract

LlamaIndex structured extraction (LlamaIndex): LlamaIndex published a hands‑on guide for zero‑shot extraction of many repeated records (like ~100 resumes or dense financial docs) using a two‑stage agentic pipeline built from LlamaSplit and LlamaExtract LlamaIndex thread; the workflow detects document boundaries, slices docs into semantically coherent sub‑chunks, then runs schema‑driven extraction over each chunk to build a single aggregated JSON output.

Zero-shot extraction demo

Pipeline design: The tutorial maps out an agent loop that first performs document boundary detection, then uses LlamaSplit for sub‑chunking and LlamaExtract to emit per‑chunk structured objects that are later stitched into a unified result, as shown in the LlamaIndex blog.
Scale and robustness: The authors highlight scenarios like multi‑resume files and collections of financial statements where naive single‑prompt extraction fails, arguing that chunk‑aware extraction plus post‑aggregation maintains schema consistency while keeping token usage bounded LlamaIndex thread.

The example positions LlamaSplit+LlamaExtract as a practical pattern for agents that need high‑volume, repeated structured outputs from complex multi‑document inputs without custom fine‑tuning.
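The exact LlamaSplit and LlamaExtract calls are in the LlamaIndex blog; the shape of the pipeline is simple enough to sketch generically. Below is a hypothetical Python outline of the two-stage pattern (boundary-aware splitting, per-chunk schema-driven extraction, then aggregation), with split_into_records and extract_record standing in for whichever splitter and extractor you actually use.

```python
from pydantic import BaseModel

class Resume(BaseModel):
    """Target schema: one record per candidate, kept identical across chunks."""
    name: str
    email: str | None = None
    years_experience: float | None = None
    skills: list[str] = []

def split_into_records(document_text: str) -> list[str]:
    """Hypothetical stand-in for boundary detection plus sub-chunking
    (LlamaSplit in the tutorial): return one coherent chunk per record."""
    raise NotImplementedError

def extract_record(chunk: str) -> Resume:
    """Hypothetical stand-in for schema-driven extraction over one chunk
    (LlamaExtract in the tutorial)."""
    raise NotImplementedError

def extract_all(document_text: str) -> list[dict]:
    """Stage 1: split into per-record chunks. Stage 2: extract each chunk
    against the schema. Aggregate into one JSON-serializable list."""
    records = [extract_record(chunk) for chunk in split_into_records(document_text)]
    return [r.model_dump() for r in records]
```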

Free Qwen3‑VL Colab shows end‑to‑end multimodal RAG on a single T4

Qwen3‑VL multimodal RAG (Alibaba): A new Colab notebook from the Qwen community wires Qwen3‑VL 2B models into an end‑to‑end multimodal RAG pipeline that runs on a free T4 GPU, covering embedding, reranking and query answering over long documents with both text and images Qwen3 notebook tweet.

Memory‑aware design: The walkthrough loads 2B‑parameter models only for embedding and reranking, then unloads them once vectors are stored to keep VRAM use low and make the pipeline viable on constrained hardware, as described in the Qwen3 notebook.
Indexing guidance: For longer documents, the author recommends FAISS v2 and bitsandbytes quantization, combining fast ANN search with compressed vectors so multimodal stores stay within GPU and RAM budgets even as corpus size grows Colab usage tips.

The notebook serves as a concrete recipe for teams wanting to prototype image‑aware or PDF‑aware RAG without needing large GPUs or proprietary hosted stacks.
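The load-embed-unload trick is what makes the pipeline fit a 16 GB T4. The sketch below shows that pattern with sentence-transformers and FAISS as generic stand-ins; the notebook itself wires in Qwen3-VL 2B embedding and reranking models, so treat the model name here as a placeholder rather than the notebook's exact code.

```python
import gc
import faiss
import numpy as np
import torch
from sentence_transformers import SentenceTransformer

def embed_then_unload(texts: list[str], model_name: str) -> np.ndarray:
    """Load the embedding model, encode, then free VRAM before the next stage.
    model_name is a placeholder; the notebook uses a Qwen3-VL 2B embedder."""
    model = SentenceTransformer(model_name, device="cuda")
    vectors = model.encode(texts, normalize_embeddings=True, batch_size=16)
    del model
    gc.collect()
    torch.cuda.empty_cache()  # release VRAM so the reranker/generator can load next
    return np.asarray(vectors, dtype="float32")

def build_index(vectors: np.ndarray) -> faiss.Index:
    """Simple inner-product index; swap in an IVF/PQ index as the corpus grows."""
    index = faiss.IndexFlatIP(vectors.shape[1])
    index.add(vectors)
    return index

# Usage: vecs = embed_then_unload(chunks, "some-embedding-model"); idx = build_index(vecs)
```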

NotebookLM data tables feature extracts structured data across 32‑file notebooks

Data tables in NotebookLM (Google): A user demo shows NotebookLM synthesizing structured data tables from 32 uploaded files in a single notebook, side‑stepping some of the Gemini app’s upload limits and highlighting NotebookLM as Google’s heavier‑duty environment for cross‑document analysis NotebookLM example.

Multi‑doc structured extraction: The screenshot shows a "Data Table" artifact generated alongside other AI outputs, indicating that NotebookLM can scan dozens of heterogeneous sources and compile them into a clean tabular format without manual schema wiring NotebookLM example.
Workflow positioning: The author notes they used NotebookLM because the Gemini app caps uploads, suggesting NotebookLM is emerging as the place to run larger, archive‑scale parsing and extraction workflows while keeping tables and other artifacts organized per project NotebookLM example.

For engineers and analysts building on Google’s stack, this points to NotebookLM as a ready‑made front end for high‑volume, cross‑doc data extraction work where manual scripting might otherwise be required.
