Skip to content
AI Primer

Explore what's new in AI

Where people deep in AI come to stay current.

Filters

Category

Tags

Release

DeepSeek V4 reports CSA/HCA attention and 10% KV cache at 1M context

Engineers unpacked DeepSeek V4's hybrid CSA/HCA attention a day after launch; it claims 27% of V3.2 FLOPs and 10% of its KV cache at 1M tokens. External tests pushed V4 Pro near the top of open-model indexes, but users also reported rate limits and mixed third-party results.

DeepSeek V4 reports CSA/HCA attention and 10% KV cache at 1M context
New
Inference Optimization·24th April·8 min read
Release

OpenAI opens GPT-5.5 API with 1M context and Responses support

OpenAI added GPT-5.5 and GPT-5.5 Pro to the API and Playground with 1M context and Responses support. Partners including OpenRouter, Perplexity, GitHub Copilot, Vercel, Warp, and Devin rolled it out the same day, widening access beyond Codex.

OpenAI opens GPT-5.5 API with 1M context and Responses support
New
Agent Product Launch·24th April·7 min read
Breaking

BidirLM-Omni-2.5B-Embedding launches 2048-dim text-image-audio vectors

BidirLM released a 2.5B multilingual encoder that embeds text, images, and audio into one shared 2048-dimensional space and works directly with Sentence Transformers. It tops several open-data embedding leaderboards and can run locally on GPU.

BidirLM-Omni-2.5B-Embedding launches 2048-dim text-image-audio vectors
New
Multimodal·24th April·4 min read
See all stories →
🤖Agentic Engineering(20)
🧩Agent Development(5)
Inference & Infrastructure(9)
🔒Security & Reliability(1)

Top storiesthis week

Breaking

DeepSeek releases V4-Pro and V4-Flash with 1M context and $0.14/M input

DeepSeek open-sourced V4-Pro and V4-Flash under MIT, with 1M context and aggressive Flash pricing. Day-one support in SGLang, vLLM, and OpenRouter pushes open-weight agentic coding closer to closed frontier models.

DeepSeek releases V4-Pro and V4-Flash with 1M context and $0.14/M input
New
LLM Serving·23rd April·6 min read
See all stories →
AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.