Anthropic alleges 24,000 fake accounts and 16M Claude exchanges – distillation crackdown

Stay in the loop

Free daily newsletter & Telegram daily report

Executive Summary

Anthropic publicly alleged “industrial-scale distillation attacks” against Claude attributed to DeepSeek, Moonshot AI, and MiniMax; the company claims 24,000+ fraudulent accounts generated 16M+ Claude exchanges to copy agentic tool use, coding, and even reasoning-trace reconstruction prompts. Anthropic paired the disclosure with tighter auth language: OAuth tokens from Claude Free/Pro/Max are now stated as for Claude.ai and Claude Code only, with third-party tooling directed to API keys; it also argues this kind of output-extraction strengthens the case for export controls, though attribution and impact are tweet-level and not independently benchmarked.

• DoD access pressure: Reuters/Axios-style reporting says the Pentagon is pushing Anthropic to run Claude on classified networks “without safety filters,” with talks reportedly near collapse.
• OpenAI Responses API: WebSockets mode ships for long-running, tool-heavy agents; OpenAI claims 20%–40% speedups; Cline reports ~39% on complex workflows; Cursor cites up to ~30%.
• Coding eval norms: OpenAI deprecates SWE-bench Verified reporting; community audit chatter cites 16.4% “technically unsolvable” tasks and ID leakage concerns.

Distillation attacks go public: fraud at scale, ToS tightening, and national‑security stakes

Anthropic publicly attributes large-scale Claude distillation via fraud (24k accounts, 16M chats), triggering immediate ToS lock-downs and policy escalation. For builders, this foreshadows stricter auth, monitoring, and data-exfil defenses across AI APIs.

Anthropic alleges industrial-scale distillation campaigns (24k fake accounts, 16M+ Claude exchanges) attributed to DeepSeek, Moonshot, and MiniMax—plus follow-on debate about IP, safeguards removal, export controls, and enforcement. Includes new Claude OAuth token restrictions and US DoD pressure reports; excludes unrelated API/perf updates covered elsewhere.

Jump to Distillation attacks go public: fraud at scale, ToS tightening, and national‑security stakes topics

🛡️ Distillation attacks go public: fraud at scale, ToS tightening, and national‑security stakes

Anthropic says DeepSeek, Moonshot, MiniMax ran industrial-scale Claude distillation

Claude distillation attacks (Anthropic): Anthropic says it identified “industrial-scale distillation attacks” attributed to DeepSeek, Moonshot AI, and MiniMax—claiming 24,000+ fraudulent accounts and 16M+ exchanges used to extract Claude capabilities for training competing models, as stated in the opening allegation.

The company frames “distillation” as sometimes legitimate (e.g., compressing models) but argues these campaigns were illicit and risk removing safeguards—raising concern about downstream military/intelligence/surveillance uses per the distillation framing and the opening allegation.

Anthropic alleges 24,000 fake accounts and 16M Claude exchanges – distillation crackdown

Executive Summary

Top links today

Distillation attacks go public: fraud at scale, ToS tightening, and national‑security stakes

Table of Contents

🛡️ Distillation attacks go public: fraud at scale, ToS tightening, and national‑security stakes

Anthropic says DeepSeek, Moonshot, MiniMax ran industrial-scale Claude distillation

Anthropic updates terms: bans Claude subscription OAuth tokens in third-party tools

Anthropic: distillers targeted tool use, coding, and reasoning-trace extraction

Anthropic argues distillation attacks strengthen the case for export controls

Report: Pentagon presses Anthropic to run Claude on classified networks without filters

Backlash theme: “you trained on everyone’s data—why complain about distillation?”

Distillation allegations hit mainstream: WSJ frames it as “siphoning data from Claude”

Legal framing: pretraining fair use claims vs API distillation as ToS breach

🔌 Agents get faster: WebSockets land in OpenAI Responses API (persistent context)

Responses API gets WebSockets for persistent, low-latency agent runs

Cline reports sizable speedups from Responses API WebSockets on multi-file work

Cursor upgrades OpenAI traffic to WebSockets and advertises up to 30% speedups

OpenAI points to early team usage patterns for WebSockets in Responses API

Developers start treating the transport layer as first-class for agent speed

📏 Coding eval reset: SWE-bench Verified deprecated, Pro recommended

OpenAI stops reporting SWE-bench Verified, citing saturation and contamination

SWE-bench Verified audit highlights task-ID leakage and unsolvable-by-spec problems

SWE-bench Pro gets the nod: structured tasks and private codebases to curb leakage

Meridian Labs forms to build open evaluation tooling around Inspect and Petri

🎙️ Realtime voice stacks: gpt-realtime-1.5 bumps tool-calling + multilingual reliability

OpenAI releases gpt-realtime-1.5 for speech-to-speech agents in the Realtime API

gpt-realtime-1.5 internal evals show gains in audio reasoning and transcription

Dictation quality becomes the practical UX bottleneck for everyday AI use

Partner implementations start to standardize around gpt-realtime-1.5 voice workflows

Builders call voice models an under-covered surface area

🧠 Claude Code churn: CLI prompt diffs, worktree tooling, and speed experiments

Claude Code 2.1.51 adds remote-control, stream-json I/O, and TodoWrite planning

AGENTS.md guidance: keep only non-discoverable landmines; use a directory hierarchy

Claude Code 2.1.51 hardens hooks: allowedEnvVars, workspace trust gating, sandbox proxy

Claude Code 2.1.51 prompt changes: EnterWorktree tool added; skill invocation guidance removed

Claude in Chrome adds Quick Mode experiment for faster responses; pairs with Opus 4.6 fast

Workflow pattern: enforce max 500 lines per file to keep agent edits reviewable

Claude Code 2.1.52 ships request-path behavior tweaks; no CLI surface deltas detected

User sentiment: Claude Code with Opus 4.6 is “thinking WAY TOO long”

Pattern: automated reminders to re-read AGENTS.md at the start of Claude Code sessions

🧑‍💻 Codex/Cursor in practice: multi-agent toggles, harness co-design, and user switching

Codex CLI multi-agent mode can be enabled via config.toml and /experimental

Codex leadership frames a shift from code review to plan review

GPT-5.3 Codex shows up inside Cursor

Builders report switching to Codex for precision and instruction following

Codex compaction is being described as “near infinite context”

Codex used for a full infra migration (Railway to Hetzner) in about an hour

Cursor founder hints at an upcoming product change

Some teams are splitting work: Codex for code, Opus for browser/planning

🧪 Agentic engineering patterns: TDD guardrails, context hygiene, and planning artifacts

Red→Green→Refactor becomes the go-to “anti-cheat” loop for coding agents

AGENTS.md as “landmines only,” and split it by directory

Agentic engineering: when code is cheap, specs and review discipline matter more

Compaction edge-cases are becoming a first-class worry in long agent sessions

Max-lines linting as an “agent constraint” for maintainable diffs

“Understand your tools” becomes the lightweight safety rail for agent work

ADRs with coding agents: are teams actually using them, and what changes?

Automated “read AGENTS.md” reminders as a low-effort context hygiene hack

🦞 OpenClaw ops & security posture: personal-assistant threat model, providers, and local runners

OpenClaw threat model: treat it as a personal assistant, not shared multi-user infra

Ollama 0.17 adds a one-command path to run OpenClaw with local open models

OpenClaw beta ships security+bugfix work plus Kilo provider and Kimi vision/video

NanoClaw positions “auditable minimalism” as a security response to Claw complexity

Together models become selectable in OpenClaw (Kimi K2.5, MiniMax M2.5, more)

OpenClaw expands “stop/abort” trigger coverage to reduce runaway sessions

OpenRouter shares a drop-in skill for “Sign in with OpenRouter” OAuth wiring

🧰 Agent SDK plumbing: tracing, interaction retention, and delegation frameworks

DeepMind proposes a formal framework for delegating tasks to AI agents

LangSmith adds native tracing for Google ADK agents

Gemini Interactions API adds include_input=True with tiered retention

LangSmith updates trace filtering for faster debugging

🧩 Antigravity ↔ AI Studio ↔ OpenClaw: access enforcement and reliability fallout

Antigravity bans after abuse detection ripple into OpenClaw OAuth Gemini routing

Google AI Studio Build upgrade caused slowness; Google says fixes underway

AI Studio shows Antigravity integration UI for full-stack app building

Gemini 3.1 Pro “not available on this version” blocks Antigravity use

Antigravity users report “Waiting” stalls and approval friction despite settings

🛠️ Dev tools shipping around agents: dashboards, review surfaces, and browser sandboxes

Devin Review adds one-click inline code fixes for issues it flags in PRs