AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Breaking

Codex raises weekly and hourly limits to 100% after 5 million users

OpenAI restored Codex weekly and hourly quotas across paid ChatGPT plans after Tibo Sottiaux said the product hit 5 million users. Watch for long-running QA loops, migration PRs, and remote desktop sessions that can still burn through quotas fast.

New
Codex·31st May·5 min read
Opus 4.8 users report token burn, failed tool calls, and DeepSWE gaps
New

Three days after Opus 4.8 launched, new tests and field reports added failed tool calls, Bash-specific breakdowns, and higher token burn to the complaint list. Users report materially worse cost and stability in long coding sessions, while DeepSWE and GBA Eval point in different directions.

Benchmarks·31st May·6 min read
Breaking

CopilotKit integrates Claude Agent SDK with AG-UI for React and mobile frontends

CopilotKit shipped an AG-UI integration that streams Claude Agent SDK agents into web and mobile frontends with generative UI and approval checkpoints. The adapter lets teams embed terminal-first Claude agents in React, Vue, Angular, and React Native without rewriting transport or state plumbing.

New
DX Tooling·31st May·3 min read
🤖 Agentic Engineering (21)
🧩 Agent Development (6)
Inference & Infrastructure (4)
🔒 Security & Reliability (1)
🔬 Research & Benchmarks (2)
📊 Business & Policy (1)

Top stories this week

Opus 4.8 users report write failures, sycophancy, and 58% DeepSWE

Two days after launch, users and benchmarks pointed to write failures, sycophancy, lower security recall, and a 58% DeepSWE result. GPT-5.5 still leads on cost, output tokens, and pass@1 in shared coding-agent tests, so compare both before switching.

Claude Code·30th May·6 min read
AI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.