
OpenAI GPT‑Image‑1.5 tops image arenas at 1264 Elo – 20% cheaper tokens
Executive Summary
OpenAI quietly made its biggest image move since DALL·E by shipping GPT‑Image‑1.5 across ChatGPT (Free, Go, Plus, Edu, Pro) and the API, then wiring it into a new Images tab. The model is up to 4× faster than GPT‑Image‑1 and moves to token billing with roughly 20% cheaper image input and output tokens, resolutions up to 1536×1024, and discounted cached inputs so high‑volume apps don’t get punished for retries.
On benchmarks, the jump is real: GPT‑Image‑1.5 debuts on LMArena’s text‑to‑image board at 1264 Elo, about 29 points ahead of Gemini 3 Pro Image / Nano Banana Pro, and chatgpt-image-latest lands #1 on the edit leaderboard. Artificial Analysis shows similar gains, with +147 Elo over GPT‑Image‑1 on generation and +245 on editing. Builders report much sharper, identity‑preserving edits—"change only what you ask for" is finally more than marketing—and vastly better in‑image text for UI mocks, posters, and fake newspapers.
The new ChatGPT Images surface adds reusable style chips ("3D glam doll", "Plushie", "Holiday portrait"), prompt starters, and mobile branching so image work feels more like a design tool than a chat log. Day‑0 integrations on fal, Replicate, and Figma Weave mean you can A/B it against Nano Banana Pro today—many still prefer Google for dense infographics and party scenes. One caveat: early jailbreaks and a mobile partial‑diffusion bug suggest you should layer your own safety filters in front of whatever OpenAI ships by default.
Top links today
- ChatGPT Images and GPT Image 1.5 launch
- GPT Image 1.5 API documentation
- FrontierScience scientific reasoning benchmark overview
- GPT-5 wet lab cloning case study
- vLLM Router design and performance blog
- vLLM Router GitHub repository
- MiMo-V2-Flash large MoE model report
- cua-bench framework for computer-use agents
- Meta SAM Audio model announcement
- NVIDIA Nemotron-Cascade reasoning models release
- Apple SHARP single-image 3D view synthesis
- Hunyuan 3D 3.0 text-to-3D on fal
- FLUX.2 Max image generation model on Replicate
- Hindsight agent memory system research paper
- Gemini Gems manager and Opal workflows
Feature Spotlight
Feature: ChatGPT Images with GPT‑Image‑1.5
OpenAI’s GPT‑Image‑1.5 lands in ChatGPT and API: 4× faster gen, tighter instruction‑following/edits, 20% lower image I/O cost, new Images UI; early arenas show #1 text‑to‑image and #1 image edit (chatgpt‑image‑latest).
Cross‑account, high‑volume launch. OpenAI ships a new image model and an Images surface in ChatGPT; posts span API pricing/params, speedups, editing fidelity, UI rollout, and early head‑to‑head results/leaderboards.
🖼️ Feature: ChatGPT Images with GPT‑Image‑1.5
Cross‑account, high‑volume launch. OpenAI ships a new image model and an Images surface in ChatGPT; posts span API pricing/params, speedups, editing fidelity, UI rollout, and early head‑to‑head results/leaderboards.
OpenAI launches GPT Image 1.5 and new ChatGPT Images surface
OpenAI rolled out GPT Image 1.5 as its new flagship image model, powering a refreshed ChatGPT Images experience across Free, Go, Plus, Edu and Pro tiers, and exposing it in the API as gpt-image-1.5 with up to 4× faster generation, stronger instruction following, and more precise editing than GPT‑Image‑1. (launch thread, feature details, images surface)

The API pricing moves to token-based billing with 20% cheaper image input and output tokens, quality presets (low/medium/high/auto), support for resolutions up to 1536×1024, transparent backgrounds, streaming, and discounted cached inputs for both text and image tokens. (api pricing, rollout summary, image docs) OpenAI is also promoting a dedicated prompting guide and curated gallery to help developers learn how to get consistent results from the new model. (prompt guide, gallery examples, launch blog)
For builders, the key change is that you no longer have to pick a separate "image model" inside ChatGPT: GPT‑Image‑1.5 is wired behind the scenes for both from‑scratch generation and edits, exposes the same behavior in the Playground and API, and is priced to be a drop‑in upgrade over GPT‑Image‑1 workflows rather than a separate SKU you have to special‑case.
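If you want to kick the tires from code, the call shape below is a minimal sketch: it assumes gpt-image-1.5 slots into the existing Images API surface (images.generate with the usual prompt/size/quality parameters), so treat the exact model string and parameter values as assumptions to verify against the image docs.

```python
# Minimal sketch: generating with gpt-image-1.5 via the Images API.
# Assumes the new model reuses the existing images.generate surface; the model string,
# size, and quality values follow the launch notes and may differ in the final docs.
import base64
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

result = client.images.generate(
    model="gpt-image-1.5",          # assumption: API model id from the launch notes
    prompt="A product hero shot of a ceramic mug with the label 'MORNING FUEL'",
    size="1536x1024",               # widest resolution mentioned in the docs
    quality="high",                 # presets: low / medium / high / auto
)

# The Images API returns base64 image data; write it out for inspection.
image_bytes = base64.b64decode(result.data[0].b64_json)
with open("hero_shot.png", "wb") as f:
    f.write(image_bytes)
```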
Builders report much sharper, identity‑preserving edits with GPT‑Image‑1.5
OpenAI’s own copy stresses that GPT‑Image‑1.5 "changes only what you ask for" while keeping lighting, composition and people’s appearance stable across edits, and users are backing that up with concrete examples. (editing description, consistency note) The ChatGPT app account showed transformations like "turn this person into a K‑pop idol" and "make a chibi version" where facial features, skin tone and jewelry remain consistent even as styling, backgrounds and poses shift wildly. (k-pop edit example, chibi character shot)
Others are pushing it into multi‑image narratives: @emollick used it as the renderer for a point‑and‑click adventure, with inventory and world state tracked in text while each command yields a visually coherent next frame, something he says worked better than Nano Banana Pro for scene continuity. adventure game demo A full compilation from @petergostev shows similar behavior across portraits, product shots and cinematic scenes, suggesting the new model finally hits the bar for identity‑stable creative workflows like comics, slide decks and game concepts. image compilation
GPT‑Image‑1.5 jumps to #1 on LMArena and Artificial Analysis
Benchmark sites lit up as soon as GPT‑Image‑1.5 was added: on LMArena’s Text‑to‑Image board it debuted with an Elo of 1264, a 29‑point lead over Gemini 3 Pro Image / Nano Banana Pro at 1235, and well ahead of FLUX.2 [max] at 1168. (lmarena text update, lmarena screenshot)
On the Image Edit leaderboard, chatgpt-image-latest—which fronts GPT‑Image‑1.5 inside ChatGPT—took the top spot with 1409 Elo, edging Gemini 3 Nano Banana Pro (2K) by 3 points, while GPT‑Image‑1.5 itself sits at #4 with 1395. (lmarena text update, edit leaderboard detail) Artificial Analysis reports a consistent story: GPT‑Image‑1.5 ranks #1 in both Text‑to‑Image and Image Editing in their Image Arena, with +147 Elo over GPT‑Image‑1 in T2I and +245 in editing. (aa leaderboard post, aa recap) For teams that route prompts through arenas to pick backends dynamically, this is the first hard signal that OpenAI’s image stack has leapfrogged earlier DALL·E/GPT‑Image gens and is competitive with (or better than) Google’s current best.
ChatGPT adds an Images tab with style presets on web and mobile
Alongside the model, ChatGPT now has a first‑class Images surface: a sidebar tab on web and a dedicated Images screen on mobile that lets you pick styles, run edits, and browse your past generations without living inside a text chat. (web ui screenshot, early ui rollout, mobile ui)
The new UI surfaces reusable style chips like "3D glam doll", "Plushie", "Sketch" and "Holiday portrait", plus promptable starters such as "Create a holiday card" or "Me as The Girl with a Pearl" so non‑experts can get to decent outputs without crafting heavy prompts every time. (early ui rollout, mobile ui) On iOS and Android, conversation branching is now available too, which means you can fork an image exploration into a new branch while keeping the original thread intact—handy when a single source image needs several radically different treatments. mobile branching demo For teams, this matters because image work can now live in a quasi‑library space that feels closer to a design tool than a chat log, while still being backed by the same GPT‑5.2 stack you’re already using for text.
GPT Image 1.5 lands day‑0 on fal, Replicate and Figma Weave
Third‑party platforms moved quickly: fal announced day‑0 support for GPT‑Image‑1.5, so you can hit the new model for text‑to‑image, editing, and style transfer via their hosted endpoints without wiring OpenAI credentials yourself. (fal launch, fal followup)
Replicate likewise added GPT‑Image‑1.5 to its catalog for both generation and edit flows, positioning it next to FLUX and other heavyweights as another switchable backend. replicate launch On the design side, Figma is rolling GPT‑Image‑1.5 into both the main app (shipping "tomorrow" in their words) and the Weave/Weavy environment, showing side‑by‑side comparisons where 1.5’s edits (e.g. swapping pizza for grapes) better preserve identity and intent than GPT‑Image‑1. weave edit demo For engineers and design systems folks, this means you can start A/B‑ing GPT‑Image‑1.5 across existing fal/Replicate/Figma integrations immediately, without waiting on vendor‑specific SDKs to catch up.
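For quick A/B tests, something like the sketch below works against Replicate's Python client; the model slugs are placeholders (the exact identifiers live in each catalog), so treat this as a starting point rather than copy-paste-ready code.

```python
# Rough A/B sketch: run the same prompt against two hosted image backends on Replicate.
# The model slugs below are placeholders; look up the exact identifiers in the catalog.
import replicate

PROMPT = "A dense conference poster with readable section headings and a legend"

BACKENDS = {
    "gpt-image-1.5": "openai/gpt-image-1.5",        # placeholder slug
    "nano-banana-pro": "google/nano-banana-pro",    # placeholder slug
}

outputs = {}
for name, slug in BACKENDS.items():
    # replicate.run() blocks until the prediction finishes and returns the output.
    outputs[name] = replicate.run(slug, input={"prompt": PROMPT})

for name, out in outputs.items():
    print(name, out)
```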
GPT‑Image‑1.5 improves on in‑image text and logo preservation
For developers who care about marketing and UX assets, GPT‑Image‑1.5 explicitly targets better rendering of dense, small text and more faithful treatment of logos and faces. api text note The API docs call out improved text rendering for things like UI mockups, posters and dense labels, and an OpenAI‑generated mock newspaper shows it can now fill a page with legible, on‑theme copy rather than lorem‑ipsum‑style noise. (text rendering demo, image docs)
People testing it in the wild confirm that long headlines, subheads and multi‑section layouts are substantially more usable than with GPT‑Image‑1, though some still prefer Nano Banana Pro for ultra‑dense infographics and ultra‑clean typography. (infographic comparison, text rendering demo) Net effect: if you’re generating product hero shots, fake docs, or ad concepts where text actually has to be readable, GPT‑Image‑1.5 should cut down the number of "close but unusable" drafts you have to throw away.
Practitioners see GPT‑Image‑1.5 and Nano Banana Pro trading blows
Side‑by‑side tests against Google’s Gemini 3 Pro Image (Nano Banana Pro) show a nuanced picture: GPT‑Image‑1.5 often wins on prompt obedience, geometry and multi‑character consistency, while many users still like Nano Banana’s taste and infographic chops more. (visual comparison, single vs slides)
@deedydas finds GPT slightly better at "vivid images" but worse for complex slides and information‑dense layouts. visual comparison @emollick reports GPT‑Image‑1.5 beats Nano Banana Pro for an interactive point‑and‑click game, especially in keeping world style and objects stable, though he still leans Nano for complex graphics. (game example, single vs slides) Others argue Nano Banana Pro "has no competition" for certain photographic scenes or party shots (birthday dog test), while @scaling01 calls GPT‑Image‑1.5 "slightly slightly worse" overall but reads that as a promising sign for a future Image 2. slight quality take The takeaway for teams is that there is no single obvious winner: you probably want to route different workloads (slides vs character art vs product edits) across both stacks and let real outputs, not leaderboard numbers, decide.
Users uncover partial‑diffusion leak and jailbreak patterns in GPT‑Image‑1.5
Early tinkerers are already stress‑testing GPT‑Image‑1.5’s safety layer. One user discovered that on mobile, if ChatGPT blurs a refused image, dragging that blurred thumbnail into the input field reveals a partially diffused, much clearer version of the blocked output—what they dubbed a "half‑diffused image" that still shows composition and rough content. (partial diffusion thread, zoomed followup)

Others report that by swapping the fronting chat model (e.g., 5‑instant vs 4o) or applying simple transforms like flips/filters to input images, they can sometimes get GPT‑Image‑1.5 to accept prompts or vision inputs that were previously rejected, and have shared collages of NSFW, weapons and copyright‑heavy scenes as proof. jailbreak collage
None of this is surprising given how image and text safety filters are usually layered, but it’s a reminder for anyone embedding GPT‑Image‑1.5 in consumer‑facing products: you should still put your own content filters and monitoring in front of whatever OpenAI does by default, especially around sensitive categories.
🚀 Open models and new releases (non‑vision)
Non‑feature launches relevant to builders. Focus on open‑weight LLMs and audio models; excludes OpenAI’s GPT‑Image‑1.5 which is covered as the feature.
Xiaomi open-sources MiMo‑V2‑Flash 309B MoE model with strong coding and math
Xiaomi’s MiMo team released MiMo‑V2‑Flash, a 309B‑parameter MoE model with 15B active parameters, trained on 27T tokens with hybrid sliding‑window + global attention and a 256k context window, targeting fast agentic workloads and reasoning. model overview Benchmarks shared by the team put MiMo‑V2‑Flash at 73.4% on SWE‑Bench Verified, 71.7% on SWE‑Bench Multilingual, 94.1% on AIME 2025, and 83.7% on GPQA‑Diamond, putting it in the same band as DeepSeek V3.2, Kimi K2 and other frontier open models while using fewer total parameters. benchmark breakdown It’s already live on OpenRouter with a free tier for now, openrouter launch and SGLang has day‑0 support including efficient sliding‑window execution and multi‑layer MTP on H200s, so you can test it in real serving setups without building your own stack. (sglang support, tech report)
For AI engineers, MiMo‑V2‑Flash is worth A/B‑testing as a primary or fallback coding/agent model—especially if you care about long‑context tool chains—since the open weights, strong SWE‑Bench numbers, and ready‑made SGLang/OpenRouter integrations make it relatively low friction to slot into existing routing or evaluation harnesses. model overview
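Because OpenRouter speaks the OpenAI-compatible chat API, a trial run can be as small as the sketch below; the model slug is an assumption, so check the OpenRouter listing for the exact id before wiring it into a router or eval harness.

```python
# Quick way to trial MiMo-V2-Flash through OpenRouter's OpenAI-compatible endpoint.
# The model slug is an assumption; confirm the exact id on the OpenRouter model page.
import os
from openai import OpenAI

client = OpenAI(
    base_url="https://openrouter.ai/api/v1",
    api_key=os.environ["OPENROUTER_API_KEY"],
)

resp = client.chat.completions.create(
    model="xiaomi/mimo-v2-flash",   # assumed slug; check the catalog before relying on it
    messages=[
        {"role": "user", "content": "Write a Python function that reverses a linked list."}
    ],
)
print(resp.choices[0].message.content)
```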
Meta’s SAM Audio model lands on Hugging Face as open audio editor
Meta’s research group quietly shipped SAM Audio, a unified model that can isolate and edit arbitrary sounds from complex audio mixtures using text, visual or exemplar prompts, and it’s now available as an open collection on Hugging Face. (feature demo, hf collection) Early clips show clean separation of specific instruments or voices from noisy backgrounds, then re‑mixing or muting them, in a way that feels like "segment anything" for sound. feature demo

For builders, SAM Audio looks like a strong default for source separation, stem extraction, and targeted redaction (the audio equivalent of blurring license plates): you can wrap it as a pre‑step before ASR, use it in editing tools to grab individual tracks, or bolt it into moderation pipelines to knock out sensitive voices or PII while preserving the rest of the scene. model collection
Browser‑Use open-sources BU‑30B‑A3B‑Preview, tuned for cheap web agents
browser_use open‑sourced BU‑30B‑A3B‑Preview, a 30B model trained specifically for browser and tool‑using agents, claiming it’s the "best model for web agents" on their internal benchmarks. model announcement A scatter plot shows BU‑30B‑A3B reaching ~76% accuracy while delivering over 200 tasks per $1, dramatically to the right of GPT‑5 and Claude Sonnet 4.5, which cluster at lower throughput for similar or slightly higher accuracy. model announcement
If you’re running large numbers of autonomous browsing tasks—lead scraping, backoffice workflows, multi‑page form filling—this is a model worth slotting into your router as the "cheap but smart enough" default. It’s open weights, tuned on agentic tool use, and designed to pair with the browser_use skill system, so you can get a realistic sense of how much you save versus frontier APIs in cost‑per‑completed‑task rather than just tokens.
Meituan’s LongCat‑Video‑Avatar open-sources real‑time talking character model
Meituan’s LongCat team released LongCat‑Video‑Avatar, an open model on Hugging Face that turns text, audio and reference images or video into audio‑driven character animation, including lip‑sync and expression control. model release The HF card exposes it as a general avatar system—feed in a still or short clip of a character plus a speech track and it outputs an animated talking head, useful for agents, support bots or in‑app guides.
The fact this is fully open means you can fine‑tune on your own characters, ship it on‑prem if you care about data, or pair it with any TTS / LLM stack; it’s a good candidate to experiment with if you want agent faces without sending traffic to closed SaaS avatar providers. huggingface repo
NVIDIA Nemotron‑Cascade 8B debuts as open general‑purpose reasoning model
NVIDIA followed its Nemotron‑3 Nano line with Nemotron‑Cascade, a family of open general‑purpose reasoning models, including an 8B variant released on Hugging Face. (family announcement, 8b release) Cascade is pitched as a multi‑stage, cost‑aware setup where a small base model handles straightforward questions and can hand tougher ones up the stack, giving teams another fully open option for agent workflows that need routing between fast and slow paths, following up on the open‑stack push around Nemotron‑3 Nano (open stack).
For engineers who already started evaluating Nemotron‑3 Nano (model summary), Cascade looks like the “controller + expert” piece: something you can wire into your tool‑calling or web‑agent systems to decide which problems deserve more tokens, while still staying in an Apache‑friendly, GPU‑optimized ecosystem with first‑class Triton / vLLM / SGLang support.
VoxCPM TTS ports to Apple’s MLX for native Mac inference
OpenBMB’s VoxCPM text‑to‑speech model now runs natively on Apple Silicon via the MLX framework, thanks to a new integration into the mlx-audio stack. mlx integration This adds a second backend alongside Metal (MPS), so Mac users can run high‑quality multilingual TTS locally with better efficiency and without shipping data to cloud APIs.
For AI app builders targeting macOS—podcast tooling, offline assistants, dev tools—this means you can prototype fully local voice features using the same model across both MPS and MLX backends, and later swap in cloud TTS only where latency or voice variety really demands it. mlx audio repo
🗣️ Voice agents and native audio progress
Voice model metrics and platform shifts; mostly Gemini’s native audio gains and product lifecycle notes. Excludes image‑model content covered as the feature.
Lemon Slice 2 turns any voice agent into a 20 fps talking avatar
Lemon Slice 2 is being rolled out as a real‑time diffusion transformer that takes a single image and a live voice stream, and outputs an infinite‑length 20 fps talking video on a single GPU (avatar launch thread, modal infra note). It works across humans, cartoons, animals, paintings, and even inanimate objects—"if it has a face, you can talk to it now"—and is available as both an API and embeddable widget.

For teams already running voice agents, this is effectively a drop‑in visual layer: you can keep your existing STT/TTS stack and LLM, then feed the audio into Lemon Slice to render a responsive character in real time. Benchmarks aren’t formal, but the founders and early testers emphasize stable lip‑sync, full‑body motion, and no visible error accumulation across long sessions avatar launch thread. Combined with low‑latency TTS like Chatterbox Turbo tts launch recap, this points toward “face‑first” agents in support, education, and games where a static avatar or waveform is no longer good enough.
Meta’s SAM Audio model brings open unified sound separation and editing
Meta quietly launched SAM Audio, an open model that can isolate and edit arbitrary sounds from complex audio mixtures using text, visual, or audio prompts, and published it as a collection on Hugging Face (demo overview, Hugging Face collection). It’s positioned as the audio analogue of Segment Anything, but for sounds: instead of “segment this object,” you can ask for “just the vocals,” “only the dog barking,” or “remove the siren.”

For voice UX, this matters because it gives builders a way to pre‑clean microphone input and multi‑speaker environments before sending anything into a speech or agent stack. You can imagine voice bots that lock onto the primary speaker in a noisy office, separate overlapping speakers on a conference call, or retroactively pull a clean voice track from a livestream. Because it’s open weights and already wired into the Hugging Face ecosystem, it’s also practical to fine‑tune or wrap into existing pipelines today, instead of waiting on closed vendor APIs to catch up.
OpenAI retires ChatGPT voice on macOS desktop app in January
OpenAI is shutting down the voice experience in the ChatGPT macOS desktop app on January 15, 2026, while keeping voice available on the web, iOS, Android, and Windows apps. The change is framed as a move to focus on a more unified voice stack across fewer clients, with all other macOS ChatGPT features staying intact release note screenshot.
For teams who built internal workflows around Mac voice chat, this means you’ll either need to switch users to the browser or the mobile/Windows apps, or pause on macOS voice until a replacement lands. If you’re standardizing an enterprise deployment, it’s a good reminder to design around web and mobile as primary voice surfaces rather than desktop-only clients.
Grok reportedly clones caller’s voice mid‑conversation
A user demo shows Grok’s voice interface unexpectedly switching to mimic the speaker’s own voice partway through a conversation, despite voice cloning not being an advertised feature user demo clip. The creator describes the behavior as Grok “matching my reference voice with zero‑shot cloning,” and asks whether this is a hidden capability or a bug.

If this is intentional, it suggests xAI is experimenting with real‑time paralinguistic transfer similar to newer open‑source TTS systems, which could make voice agents feel far more personal. If it’s accidental, it raises UX and consent questions for any system that can infer and adopt a user’s voice without a clear opt‑in. Either way, it’s a signal that competitive pressure around expressive, low‑latency voice modes is bleeding into mainstream assistants, not just niche TTS research.
🧑‍💻 Agent stacks and coding toolchains
Practical agent/coding tooling drops and patterns: IDE features, plugin ecosystems, and agent eval/dev kits. Excludes serving‑layer news (see Systems).
cua-bench debuts as a benchmark and RL suite for computer-use agents
The cua team introduced cua-bench, a framework that combines benchmarking, synthetic data generation, and RL environments for computer‑use agents across HTML‑based desktops and app simulators. cua-bench launch It tackles a glaring problem—agents perform 10× differently under small UI changes—by rendering tasks across 10+ OS themes (Win11/XP, macOS, iOS, Android) and bundling realistic shells for apps like Spotify, Slack, and WhatsApp so you can train and evaluate agents without spinning real VMs. ui variance note
AI SDK 6 beta adds Standard JSON Schema support and Anthropic tool search
The AI SDK 6 beta landed with Standard JSON Schema support plus hooks for Anthropic’s BM25‑backed tool search, so you can describe outputs once and share schemas across Zod, Valibot, Arktype and similar libs. ai sdk schema example For agent and toolchain builders this removes a lot of adapter code (schema conversion is now the library’s job) and lets Claude agents pick from hundreds of tools without stuffing all of them into the context window. anthropic tool search
Claude Code ships diffs highlighting, prompt suggestions, plugin marketplace and guest passes
Anthropic rolled out a substantial Claude Code update bundling syntax‑highlighted diffs in the terminal view, inline prompt suggestions, a first‑party plugins marketplace, and shareable Max guest passes. Claude code updates For teams building coding agents or using Claude as a pairing tool, this tightens the review loop (colorized diffs are much easier to scan), makes it easier to standardize toolchains via /plugins, and encourages wider internal trials with 1‑week Pro passes per Max user. guest pass feature
CopilotKit launches useAgent hook with A2A protocol for interactive agent apps
CopilotKit shipped useAgent on top of the A2A protocol, giving React frontends a clean way to trigger long‑running agents, let them take frontend actions, and return declarative AG‑UI widgets. useagent announcement The idea is that your UI and agent stop fighting over control: users can kick off tasks from buttons, agents can update components as state changes, and you avoid wiring bespoke RPC glue for every workflow.
Kilo adopts KAT-Coder-Pro as free default non-reasoning coding model
Kilo announced that Kwai’s KAT-Coder-Pro V1 is now the top non‑reasoning coding model in its product and is available for free to all users. kilo kat coder update The model brings 73.4% SWE‑Bench Verified performance and top‑10 scores on the Artificial Analysis Intelligence Index, so teams can wire up strong agentic coding without paying frontier‑model rates, while Kilo’s new blog urges people to manage costs via model choice and routing instead of crude “AI spend caps.” kilo spend guide
Yutori’s Scouts browser research agent moves from preview to general availability
Yutori’s Scouts—a push‑based browser research agent that uses a team of web‑use agents to monitor sites and push findings to you—has now been promoted to general availability following its earlier preview. initial launch The GA release keeps the same model where Navigator, Researcher, and Reporter agents share context while traversing the web, but the signup barrier is gone and Yutori is explicitly pitching it as a reusable component in broader agent stacks rather than a closed SaaS. scouts ga page
⚙️ Serving, routing and runtime efficiency
Runtime engineering updates: load balancers, gateways, and framework support. Continues the infra‑adjacent serving theme with concrete perf deltas.
SGLang and DeepXPU’s R‑Fork slashes weight load time with GPU‑to‑GPU tensor forking
Ant Group’s DeepXPU team and SGLang unveiled R‑Fork, a new weight‑loading mechanism that lets a fresh vLLM/SGLang worker fork tensors directly from an already‑running instance over GPU‑to‑GPU links, instead of re‑reading all weights from disk. rfork thread In their benchmarks this cuts cold‑start weight loading from minutes down to seconds for large models. rfork blog R‑Fork (short for tensor remote fork) works by maintaining a “parent” worker with weights resident in GPU memory; when a new worker spins up—say for autoscaling—it performs an inter‑node GPU copy of the weight tensors from the parent rather than replaying the full load pipeline. Because this happens at the tensor level, you avoid CPU I/O bottlenecks and PCIe saturation that usually dominate cold‑start.
This slots neatly into SGLang’s growing runtime toolkit sglang cookbook, which already emphasizes practical serving recipes. Now, instead of treating model load time as a fixed tax that discourages aggressive autoscaling, you can consider finer‑grained scale‑out policies: spin up workers per traffic spike, per customer, or per experiment, knowing that the cost is mostly GPU memory traffic instead of multi‑minute stalls.
If you’re running big open models on shared clusters—especially with heterogeneous job mixes—R‑Fork points toward a future where “load the model” is no longer the gating operation; the gating factor becomes how fast your scheduler can decide when to fork.
vLLM Router adds prefill/decode‑aware load balancing with KV‑cache affinity
vLLM introduced vLLM Router, a dedicated load balancer that understands prefill vs decode traffic and routes requests to separate worker pools while keeping KV‑cache locality via consistent hashing. router announcement This targets the real bottleneck many teams hit once they move from toy demos to high‑volume conversational traffic.
The router is written in Rust and sits in front of a fleet of vLLM workers on Kubernetes or bare metal, exposing policies like consistent‑hash routing on sticky keys so multi‑turn chats keep reusing the same KV cache instead of re‑prefilling every time. It also understands the prefill/decode split that vLLM already exposes, so you can run prefill "P" workers that are memory‑heavy and decode "D" workers that are throughput‑optimized, instead of one generic pool. router announcement vLLM Router ships with built‑in metrics and health logic: it exports Prometheus /metrics for RPS, latency and throughput, and supports policies like power‑of‑two‑choices, round‑robin and random alongside consistent hashing. This fits neatly with vLLM’s earlier work on separating encoders from decoders to cut P99 on audio/vision pipelines encoder split, and it pushes vLLM further into “full serving stack” territory rather than just an engine library.
For anyone running large fleets (multi‑GPU nodes or many small pods), this is a strong signal that you should stop treating LLM requests as stateless HTTP and start thinking in terms of cache affinity and role‑specific worker pools.
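To make the cache-affinity idea concrete, here is an illustrative consistent-hashing sketch (not vLLM Router's actual Rust implementation): the point is simply that a sticky session key maps every turn of a chat to the same decode worker, so its KV cache stays warm.

```python
# Illustrative sketch (not vLLM Router's actual code): route multi-turn chats to a
# stable decode worker via consistent hashing so KV-cache reuse stays on one machine.
import bisect
import hashlib

class ConsistentHashRouter:
    def __init__(self, workers, vnodes=64):
        # Place several virtual nodes per worker on a hash ring to smooth the load.
        self.ring = sorted(
            (self._hash(f"{w}#{i}"), w) for w in workers for i in range(vnodes)
        )
        self.keys = [h for h, _ in self.ring]

    @staticmethod
    def _hash(value: str) -> int:
        return int(hashlib.md5(value.encode()).hexdigest(), 16)

    def route(self, session_id: str) -> str:
        # Sticky key: the same chat session always lands on the same worker,
        # so its KV cache is warm instead of being re-prefilled elsewhere.
        idx = bisect.bisect(self.keys, self._hash(session_id)) % len(self.keys)
        return self.ring[idx][1]

router = ConsistentHashRouter(["decode-0:8000", "decode-1:8000", "decode-2:8000"])
print(router.route("chat-session-42"))  # same worker on every turn of this chat
```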
Cline moves to Vercel AI Gateway, cutting errors 43.8% and speeding P99 streams
Cline’s coding agent provider has migrated onto Vercel’s AI Gateway, and the teams are sharing concrete serving numbers: error rates dropped from 1.78% to 1.0% (a 43.8% reduction) and P99 streaming latency improved by 10–40% across popular models. (vercel blog, cline details) Under the hood, Cline is now fronted by Vercel’s global edge network (100+ PoPs), with requests hitting the nearest location and then traversing Vercel’s private backbone to the Gateway. Vercel notes that this adds under 20 ms of routing overhead while still letting the Gateway handle retries, circuit breakers and provider‑specific quirks. vercel blog Cline reports that model‑specific tests saw some big wins: Grok‑code‑fast‑1’s streaming P99 was nearly 40% faster, and MiniMax M2 showed more than 40% improvement during the 50/50 production A/B test against their previous infra. vercel blog Pricing‑wise, Vercel is doing zero markup on inference, so when you use the "Cline provider" you pay the raw model cost, not an extra platform tax. cline details For engineers building their own gateways or multiplexers, this is a useful data point: a dedicated routing layer can both stabilize errors under load and meaningfully tighten tail latency, especially for streaming tokens, without changing anything in your app code.
SGLang adds day‑0 MiMo‑V2‑Flash support with efficient SWA and multi‑layer MTP
SGLang shipped day‑0 inference support for Xiaomi’s new MiMo‑V2‑Flash, wiring the 309B‑parameter MoE (15B active) into its runtime with efficient sliding‑window attention (SWA) and multi‑layer multi‑token prediction (MTP). sglang announcement The model itself was pre‑trained on 27T tokens with a 256k‑token hybrid SWA+global context and lands free on OpenRouter for a limited time. (openrouter launch, mimo overview)
On the SGLang side, the team highlights near‑zero‑overhead MTP based on their spec v2, which lets MiMo‑V2‑Flash generate several tokens per step without blowing up VRAM usage or throughput, plus an SWA execution path tuned for long‑context serving. sglang announcement They also call out a deliberate balance between time‑to‑first‑token, per‑token throughput and overall concurrency on H200 GPUs, which is where MiMo is meant to shine in agentic workloads.
For runtime engineers, the interesting bit isn’t just "another big model" but that SGLang is becoming a first‑stop host for modern hybrid‑attention, MoE and MTP designs. If you’re already running Nemotron or DeepSeek on SGLang, dropping MiMo‑V2‑Flash into the same fleet is mostly a config change, which makes experimentation across state‑of‑the‑art open models much cheaper in both time and infra work.
📊 Evals and agent benchmarks (non‑vision)
Fresh eval outcomes across text/agentic tasks; continues the benchmark race with concrete scores. Excludes the image‑model arena shifts, covered in the feature/gen‑media.
Gemini 3 Pro double‑length Pokémon Crystal run beats 2.5 Pro on tokens and time
Google researchers show Gemini 3 Pro completing a full, autonomous Pokémon Crystal playthrough—16 badges plus beating Red—while using roughly half the tokens and turns of Gemini 2.5 Pro in a head‑to‑head setup. pokemon benchmark

This run is notable because the agent uses vision to solve in‑game puzzles, performs live "battle math" for optimal moves, invents a press_sequence abstraction to work around harness limits, and runs about 2× faster token‑wise (and ~8× faster extrapolated) compared to 2.5 Pro. pokemon benchmark For people building tool‑using or game‑playing agents, it’s a concrete, long‑horizon benchmark showing that 3 Pro plans better, wastes fewer calls, and stays stable over many hours of interaction.
Agent S edges past human benchmark on OSWorld desktop suite
A community‑built "Agent S" has reportedly become the first system to beat the human baseline on the OSWorld desktop benchmark, scoring 72.6% vs a 72.36% averaged human score on the same tasks. osworld tweet This follows OSWorld agents, where we saw GPT‑5‑based setups reach parity with humans but not exceed them.
The author claims Agent S uses modern frontier models plus a tuned harness to navigate realistic Windows‑style UI tasks across file management, settings, and applications, pushing past the long‑standing plateau where agents failed on overlapping windows or non‑default themes. osworld tweet If these numbers hold up under independent re‑runs, OSWorld may be moving from “aspirational” to “serving readiness” territory for computer‑using agents, which is a big deal for anyone betting on desktop automation.
GPT‑5.2‑high debuts at #13 on Text Arena but leads math category
Early scores from the LMArena Text leaderboard put OpenAI’s GPT‑5.2‑high at #13 overall with an Elo‑style score of 1441, below its predecessor GPT‑5.1‑high at #6, but with much stronger category specialization. arena text update The same report notes 5.2‑high ranks #1 in the Math category, #2 in the mathematical occupational field, and #5 on the Arena Expert slice (hardest expert prompts), suggesting OpenAI traded some broad, chat‑style appeal for higher reliability on very hard quantitative work. arena text update For teams routing queries, this points toward a split strategy: use 5.2‑high as a math and expert‑task specialist rather than the default generalist.
GPT‑5.2 medium‑reasoning tops new creative story‑writing benchmark
On a new Creative Story‑Writing Benchmark, OpenAI’s GPT‑5.2 in its medium reasoning mode takes the top spot, ahead of models like Mistral Large 3. story benchmark
The benchmark scores models on story quality and constraint satisfaction (for example, weaving specific elements into a coherent plot), and 5.2’s win suggests the "medium" thinking setting hits a useful sweet spot between cost, latency, and narrative control. story benchmark For teams building narrative tools—games, interactive fiction, marketing copy—this is a strong signal that 5.2‑medium is worth A/B‑testing as the default writer, especially where you need both creativity and reliable use of prompt details.
Epoch’s ECI framework can spot a 2× capability acceleration within months
Epoch AI shows that its statistical framework, built on stitched‑together benchmark scores, can detect a synthetic 2× acceleration in frontier model capabilities within about 2–3 months of the change. capabilities chart This extends their earlier work on ECI and METR timelines ECI forecast into the “early‑warning” regime.
In their toy example, capability scores follow a smooth trend until 2027, when they artificially double the slope; the framework flags a breakpoint dated to February 2027, only three months after the simulated shift. capabilities chart For lab leaders and regulators trying to watch for sudden jumps in real‑world capabilities, this hints that if we keep feeding consistent eval data into something like ECI, we can at least quantify when the curve bends, not just argue about vibes.
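As a toy illustration of the idea (not Epoch's actual framework), the sketch below doubles the slope of a synthetic capability trend and recovers the breakpoint by grid-searching a two-segment linear fit, which is the simplest version of the "detect when the curve bends" exercise.

```python
# Toy illustration (not Epoch's actual framework): detect a slope change in a noisy
# capability trend by grid-searching the breakpoint of a two-segment linear fit.
import numpy as np

rng = np.random.default_rng(0)
t = np.arange(120)                           # months
true_break = 80
slope = np.where(t < true_break, 1.0, 2.0)   # capability growth doubles at the break
y = np.cumsum(slope) + rng.normal(0, 2.0, size=t.size)

def sse_for_break(b):
    # Fit independent lines before and after candidate breakpoint b; return total SSE.
    sse = 0.0
    for seg in (slice(0, b), slice(b, t.size)):
        coeffs = np.polyfit(t[seg], y[seg], 1)
        sse += float(np.sum((y[seg] - np.polyval(coeffs, t[seg])) ** 2))
    return sse

candidates = range(10, t.size - 10)
best = min(candidates, key=sse_for_break)
print(f"estimated breakpoint at month {best} (true: {true_break})")
```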
Offline IQ tests put GPT‑5.2 and Gemini 3 Pro roughly neck‑and‑neck
A new IQ comparison from TrackingAI reports that GPT‑5.2 Thinking, GPT‑5.2 Pro, and Gemini 3 Pro Preview all cluster in the high‑120s to low‑130s on two different test suites—one based on Mensa Norway items, the other an offline quiz with answers not present online. iq benchmark chart
The offline test is arguably the cleaner signal, since models can’t memorize from the web; there, Gemini 3 Pro and GPT‑5.2 Thinking tie at 141, while 5.2 Pro scores higher on the Mensa set, and all three end up “roughly equal” once both tests are combined. iq benchmark chart The takeaway for practitioners is less about the literal IQ number and more that top frontier models are converging in raw pattern‑solving ability, so routing and fine‑tuning around task shape (math, coding, planning) matters more than chasing a single global score.
🎥 Vision/video ecosystem beyond OpenAI
Competitive image/video model momentum and workflows. Excludes OpenAI’s GPT‑Image‑1.5 (feature) and focuses on FLUX/Hunyuan/Kling/NB Pro stacks.
Hunyuan 3D v3 brings 3.6B‑voxel, 1536³ text/image/sketch‑to‑3D to fal
fal has added Tencent’s Hunyuan 3D v3 model (“Hunyuan 3D 3.0”), which generates high-fidelity 3D assets with up to 3.6B voxels at 1536³ resolution from text, reference images, or even sketches, with big improvements in geometry alignment and texturing fal launch. That makes it one of the first off‑the‑shelf pipelines where you can go from a rough concept prompt or line drawing to production-grade, printable 3D assets in minutes rather than hours.
The integration exposes separate text-to-3D, image-to-3D, and sketch-to-3D endpoints, so you can plug it into existing tooling for asset generation, product prototyping, or game art, and then iterate with downstream renderers or 3D engines fal model page. If you’re already using Hunyuan 2.x or other voxel models, this is worth A/B testing on your hardest scenes—especially anything with fine surface detail, tight silhouettes, or complex lighting that tends to fall apart at lower resolutions.
Higgsfield offers “UNLIMITED WAN 2.6” video runs with aggressive promo pricing
Following the broader Wan 2.6 release on fal and Replicate earlier this week Wan 2.6, Higgsfield has turned the model into a centerpiece of its own platform with an “UNLIMITED WAN 2.6” promotion: 67% off plus 300 free credits for users who retweet and reply during a 9‑hour window promo details. On top of that, creators are showing end‑to‑end workflows where Nano Banana Pro handles 3×3 concept grids and Wan 2.6 then animates selected frames into vivid, surreal sequences like “what are you like on the inside?”

The key point is that Wan 2.6 is no longer just an API for labs—it’s being packaged as a high‑throughput workhorse for indie creators who want to experiment with lots of clips without worrying about per‑render costs. If you’re evaluating video stacks, it’s a good chance to stress‑test Wan 2.6’s motion, temporal coherence, and style transfer under real creative workloads rather than one‑off promos (promo details, grid-to-video workflow).
Kling 2.6 on fal adds voice cloning and multi-character control for video
fal rolled out Kling 2.6 Voice Control, letting you drive talking-head and character videos by cloning a target voice from a short reference sample and binding that voice consistently across the whole clip voice control thread. The same workflow supports multiple speakers in a single video, each with their own cloned voice, which is a big step toward end-to-end AI video scenes with distinct, reusable characters.
For teams already using Kling 2.6 for edits or short clips, this means you can keep audio design inside the same stack instead of bolting on a separate TTS layer and then trying to re‑sync lip motion. The early demos show stable timbre over long clips and character switching keyed to on-screen identity, so this is immediately useful for explainer content, fictional dialogue, or product walkthroughs where you want consistent voices without hand-tuned dubbing voice control thread.
AI2’s Molmo 2 pushes open multimodal models to SOTA on image and video tasks
AI2’s Molmo 2 family arrives as a set of Apache‑2.0 multimodal models that hit new state-of-the-art numbers for open models on both image and video benchmarks, with three sizes built on SigLIP2 vision backbones and Qwen3 language components molmo overview. There’s also a dedicated 4B variant tuned for video counting and pointing, which currently leads open models on those tasks while staying small enough for more modest hardware molmo overview.
The important bit for practitioners is that Molmo 2 is designed as a drop‑in for visual question answering, diagram understanding, and basic video QA, not as a huge monolith: you can pick a size that matches your latency budget and still get competitive accuracy. Since both models and datasets are open, Molmo 2 also provides a clean base for domain‑specific fine‑tuning in areas like industrial inspection, chart extraction, or surveillance analysis ai2 announcement.
Builders still treat Nano Banana Pro as the quality bar despite new GPT Image gains
Even after GPT Image 1.5’s launch and strong leaderboard scores, many practitioners still see Google’s Nano Banana Pro as the reference for overall image quality, especially for party scenes, portraits, and dense infographics. One side‑by‑side birthday comparison shows GPT Image 1.5 doing a solid job, but the Nano Banana Pro render has more natural lighting and emotional tone, leading to verdicts like “Nano Banana Pro has no competition, not even OpenAI’s new GPT Image 1.5”.
Others call the models “neck & neck overall,” but say things like “I think GPT Images 1.5 is slightly worse than Nano Banana Pro… it still has this unnatural gpt‑4o look sometimes,” while preferring Nano Banana Pro for complex infographics and slide‑like layouts nb vs gpt impressions infographic comparison. Net result: if you’re setting quality baselines for production image work, especially where typography and layout matter, Nano Banana Pro remains the model to beat—even as GPT Image 1.5 closes the gap on instruction‑following and edit consistency builder comparison.
Higgsfield pipelines lean on Nano Banana Pro grids for consistent multi-shot scenes
On Higgsfield, builders are standardizing on Nano Banana Pro grid prompts—2×2 and 3×3 layouts—as a way to design multi-shot sequences with consistent characters and framing before handing them off to video models like Wan 2.6 gta grid comparison. In one GTA‑style example, GPT Image 1.5 does slightly better at preserving a specific triangle shape, but Nano Banana Pro is widely judged to produce more appealing overall compositions and character styling for the grid, which matters when those panels become keyframes for animation.
Other workflows use 3×3 “what’s inside you” or “Wacky Races remake” grids as a storyboard, then extract individual panels and animate them with Kling 2.6 or Wan 2.6, giving creators a controllable path from concept to motion without manually drawing storyboards wacky races pipeline inside-body prompts. For AI engineers, this pattern—high-quality image grids for layout plus a separate video engine—is starting to look like a de facto best practice for complex, multi-beat scenes.
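If you are automating this grid-to-keyframe pattern, a small helper like the sketch below (plain Pillow, assuming a uniform 3×3 grid with no gutters) will slice a concept grid into panels you can hand to the video model.

```python
# Small helper for the grid-to-keyframe workflow above: slice a 3x3 concept grid
# into individual panels so each one can be fed to a video model as a keyframe.
# Assumes a simple uniform grid with no gutters; adjust if your grids have borders.
from PIL import Image

def split_grid(path: str, rows: int = 3, cols: int = 3):
    img = Image.open(path)
    w, h = img.size
    cell_w, cell_h = w // cols, h // rows
    panels = []
    for r in range(rows):
        for c in range(cols):
            box = (c * cell_w, r * cell_h, (c + 1) * cell_w, (r + 1) * cell_h)
            panels.append(img.crop(box))
    return panels

for i, panel in enumerate(split_grid("concept_grid.png")):
    panel.save(f"panel_{i:02d}.png")   # hand these to Kling 2.6 / Wan 2.6 as keyframes
```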
Higgsfield showcases NB Pro + Wan 2.6 pipelines for AI “x-ray” and cartoon workflows
Creators on Higgsfield are stringing Nano Banana Pro and Wan 2.6 together in polished pipelines: first, use NB Pro to generate 3×3 concept grids of a character or scene; second, extract one or more frames; third, feed those into Wan 2.6 to animate surreal “inside the body” shots or stylized cartoon walkthroughs. These flows are being documented with detailed prompts so other users can reproduce them, including specific style, camera, and color cues that survive both models’ transformations prompt breakdown.
What’s notable is that Higgsfield isn’t positioning any single model as the answer—it’s emphasizing multi-model stacks where NB Pro handles aesthetic and layout, while Wan 2.6 handles motion and camera work. If you’re designing your own pipelines, this is a reminder to think in terms of specialized components rather than picking a single “god model” for all image and video tasks wan and nb promo.
Tencent HY World 1.5 streams controllable 3D “world model” video in real time
Tencent’s HY World 1.5 (“WorldPlay”) has been open-sourced as a real-time world model that streams video at 24 FPS while letting users walk, look around, and interact as if in a game, all driven by text or images worldplay summary. Under the hood it uses a streaming diffusion architecture with a reconstituted context memory mechanism to keep long-term geometric consistency without blowing up GPU memory, plus a dual action representation layer that turns keyboard and mouse inputs into stable camera and character motion.
This sits in an interesting space between video generation and simulation: unlike standard text-to-video, HY World 1.5 is built for continuous control and infinite-horizon exploration, so you can imagine plugging it into agent loops, robotics research, or interactive storytelling. The project ships with a technical report, code, and a live demo stack, so if you care about world models or embodied AI, it’s a serious sandbox to explore project page.
Meituan’s LongCat-Video-Avatar brings audio-driven character animation to Hugging Face
Meituan’s LongCat-Video-Avatar model is now available on Hugging Face, offering audio-driven character animation that works from text, static images, or input video to produce expressive talking avatars model announcement. Unlike general-purpose video models, LongCat focuses on aligning facial motion and expression to speech, making it a natural fit for streamers, VTubers, and product explainers who want more control over character performance without full-scene synthesis Hugging Face model.
For engineers, the key attraction is that it’s a single, open-weights component you can drop into existing pipelines: generate static character art with your image model of choice, then feed it plus audio into LongCat to get on-model, lip-synced video. This slots neatly into multi-model stacks that keep high-end rendering (e.g., Nano Banana Pro stills) separate from motion and expression learning model recap.
Tencent pushes Hunyuan 3D into consumer workflows with HolidayHYpe ornament challenge
Tencent is using its Hunyuan 3D stack in a very public way via the #HolidayHYpe campaign, where users turn Christmas and New Year ideas into printable 3D ornaments, then share their experience for prizes holiday hype details. The flow runs entirely through Tencent HY’s 3D site: describe or sketch an ornament, generate a 3D model with Hunyuan 3D, export it as GLB/OBJ, print it locally or via a service, then decorate a real tree with the result.
For AI engineers, this is a useful case study in how to wrap a fairly advanced 3D model behind a friendly consumer UX, including step‑by‑step guides and constraints that keep outputs printable. It complements the more developer‑focused fal launch by showing how the same core technology can be repackaged as a consumer workflow, where the main product is “a personalized ornament that exists in your hand,” not just a viewport render holiday hype details.
🔌 Interoperability: MCP and app‑level connectors
Growing MCP footprint and app connectors to wire agents into products. Today’s drops include Google’s MCP repo and low‑code data connectors.
Firecrawl ships Lovable connector for instant scrape/search/crawl in apps
Firecrawl released a first‑class connector for Lovable that lets any Lovable app scrape, crawl, and search the web without setting up its own scraping stack or managing API keys. release thread It’s free for all Lovable users for a limited time, which makes it an easy way to add serious web context to early‑stage products.

Under the hood, the connector exposes Firecrawl’s scrape, crawl, and search primitives as actions inside Lovable flows, so you can wire them into agents or normal backend logic without extra glue code. release thread The Firecrawl team also published a walkthrough and setup details, so you can see how to plug it into existing Lovable projects in a few minutes. integration blog For AI engineers and indie SaaS builders, this is one of the lowest‑friction ways right now to give products live web context and structured page data without rolling your own crawler or worrying about IP blocks.
Gemini Gems from Labs bring visual AI workflows into the Gemini web app
Google quietly rolled out a "Gems from Labs" section and Opal‑powered visual workflows inside the Gemini web app, turning prompts into reusable mini‑apps you can edit and share. gems manager ui These Gems can chain tools, create interactive UIs (like recipe builders or claymation explainers), and live alongside your normal Gemini chats.
Prebuilt Gems like Recipe Genie, Marketing Maven, Claymation Explainer, and Learning with YouTube showcase typical workflows: take structured input (ingredients, brand goals, a video URL), call the right tools, and render a rich interactive experience rather than a single block of text. gems examples Power users can clone and remix these flows in the Gems manager, effectively getting a low‑code canvas for building AI "apps" without standing up separate backends or UIs. opal comment For teams already standardizing on Gemini, this is starting to look like Google’s answer to MCP‑style servers and app connectors—only expressed as shareable visual workflows inside the chat product.
CopilotKit’s new useAgent hook wires A2A agents directly into frontends
CopilotKit introduced useAgent for its A2A protocol, a new building block that lets frontends trigger long‑running agents, let those agents act in the UI, and receive declarative UI "widgets" back instead of plain text. release thread The idea is to make A2A agents feel like first‑class interactive components rather than remote black boxes.

With useAgent, React developers can expose buttons or forms that start an A2A workflow, then let the agent drive follow‑up interactions such as updating UI state, rendering AG‑UI components, or calling additional tools in the background. release thread CopilotKit’s example shows agents building full‑stack features while the UI updates live, which is a big step up from simple chat widgets. Docs and starter code are already available, so if you’re experimenting with agent frontends, this is one of the more opinionated—and production‑oriented—ways to connect your UI to an agent runtime. docs page
Gemini for Workspace adds Asana, HubSpot and Mailchimp connectors
Gemini for Workspace now exposes first‑party connectors for Asana, HubSpot, and Mailchimp, with Instacart support reportedly in the works. workspace integrations These show up in the "Other" tools section and can be toggled on so Gemini can search, summarize, and act on data inside those SaaS accounts.
The Asana connector can create tasks from emails, HubSpot exposes CRM records for lookups or summaries, and Mailchimp lets Gemini pull campaign metrics like opens and click‑through rates directly into a chat. workspace integrations This shifts Gemini from a pure Google‑stack assistant toward something that can sit over your broader GTM and ops tools, closer to how MCP servers expose external systems to a central agent. For AI engineers building internal copilots in the Google ecosystem, this lowers the amount of glue code needed to give agents access to project management, CRM, and email marketing data, and hints that a deeper "Apps" marketplace is coming next.
💼 Capital, customers and enterprise adoption
Monetization and adoption signals for decision‑makers. Mix of fundraising chatter, org changes, and sector usage metrics.
67% of physicians now use AI daily; OpenEvidence and ChatGPT lead
A new 2025 survey of 1,300+ US physicians finds 67% use AI tools daily in practice, 84% say AI makes them better at their job, and only 3% report never using AI physician-survey. Doctors are leaning heavily on AI for documentation, admin work and clinical decision support, but 81% are unhappy with how slowly employers are adopting it. Tool‑wise, OpenEvidence dominates for clinical research (44.9% share), with ChatGPT at 15.6%, Abridge at 4.8%, Claude at 3.0%, and DAX Copilot at 2.4% survey-report. This is a clear signal to enterprise health vendors and infra teams that AI is now a daily dependency in care delivery, not a side experiment.
OpenAI reportedly seeks $10B+ from Amazon and access to its AI chips
Multiple reports say OpenAI is in talks to raise at least $10 billion from Amazon and to run on Amazon’s in‑house AI chips, which would give OpenAI a second deep-pocketed cloud backer alongside Microsoft and further diversify its compute supply funding-rumor. For AI leaders this matters because it hints at a more multi‑cloud, multi‑GPU future for frontier training and inference, and it would tighten strategic ties between three giants—Amazon, Microsoft, and OpenAI—who already compete and partner in different parts of the stack.
GPT Image 1.5 launches with cheaper tokens and broad ChatGPT access
OpenAI’s new GPT Image 1.5 model rolls out across ChatGPT (Free, Go, Plus, Edu, Pro, with Business and Enterprise coming soon) and the API, with image inputs and outputs priced about 20% lower than the previous model api-pricing-update plan-availability. In the API, text tokens run at $5/$10 per million (in/out), image tokens at $8/$32 per million, and cached input costs drop to $1.25 for text and $2 for image tokens, with quality knobs (low/medium/high/auto) and resolutions up to roughly 1.5MP image-api-docs. For product and infra owners this means cheaper, more controllable image workloads—and a clear signal that OpenAI intends images to be a first‑class part of ChatGPT’s paid tiers rather than a side toy.
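To turn those prices into budget numbers, a back-of-the-envelope calculator like the one below is enough; the per-image token counts are illustrative assumptions (the real counts come back in the API's usage field), so plug in your own numbers.

```python
# Back-of-the-envelope cost check using the per-million token prices quoted above.
# The token counts per request are illustrative assumptions; actual counts depend on
# resolution and quality and are reported back in the API usage object.
PRICES = {               # USD per 1M tokens
    "text_in": 5.00,
    "text_out": 10.00,
    "image_in": 8.00,
    "image_out": 32.00,
    "cached_text_in": 1.25,
    "cached_image_in": 2.00,
}

def cost(tokens: dict) -> float:
    return sum(tokens.get(k, 0) / 1e6 * price for k, price in PRICES.items())

# Hypothetical edit request: a cached reference image plus a short text prompt,
# producing one high-quality output image.
example = {"cached_image_in": 4_000, "text_in": 200, "image_out": 6_000}
print(f"estimated cost per edit: ${cost(example):.4f}")
# Scale to a batch of 10,000 edits to sanity-check monthly spend.
print(f"per 10k edits: ${cost(example) * 10_000:,.2f}")
```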
OpenAI reshapes its outward-facing leadership with George Osborne hire
OpenAI has hired former UK Chancellor and British Museum chair George Osborne as managing director and head of a new "OpenAI for Countries" effort based in London, giving governments a clear senior contact as they negotiate national AI strategies and deployments osborne-role. At the same time, long‑time chief communications officer Hannah Wong is leaving in January after steering the company through crises like the 2023 board "blip", with an internal VP stepping in while a replacement is sought comms-transition leadership-article. This combination signals OpenAI is professionalizing its government and public affairs interface while rotating key comms leadership, which matters for policy‑sensitive enterprise buyers and regulators deciding how closely to partner with them.
US government forms 1,000-person “Tech Force” focused on AI infrastructure
The Trump administration announced a new “U.S. Tech Force,” a roughly 1,000‑person engineering corps that will embed specialists in federal agencies for two‑year stints to work on AI infrastructure and digital modernization, in partnership with firms like AWS, Apple, Microsoft, Nvidia, OpenAI and Palantir tech-force-coverage. Members will build government‑side AI plumbing and can later be recruited by partner companies, making this both a talent pipeline and a demand signal for large AI deployments in the public sector. For AI infra and gov‑cloud vendors, this is a direct boost to long‑term federal AI budgets and a sign that national AI capability is being treated as strategic infrastructure rather than a series of pilots.
Chai Discovery raises $130M Series B at $1.3B to industrialize AI drug design
AI biotech startup Chai Discovery closed a $130 million Series B at a $1.3 billion valuation, co‑led by Oak HC/FT and General Catalyst with participation from OpenAI, Thrive Capital and others, to scale its platform for predicting and reprogramming molecular interactions in drug discovery chai-funding-news. The company claims its Chai 2 platform already shows 100× improvements in de novo antibody design, and this round is explicitly framed as turning biology from trial‑and‑error into a programmable engineering discipline—an example of how AI‑native startups are now raising late‑stage, billion‑dollar rounds around highly specialized scientific workflows.
Gemini for Workspace adds Asana, HubSpot and Mailchimp connectors
Gemini for Workspace now exposes new first‑party connectors for Asana, HubSpot and Mailchimp, with reports that Instacart support is also in the works, so users can search, summarize and act on data across more of their sales, marketing and project tools from within Gemini workspace-integrations. This is a concrete step toward the "AI control plane" vision where a single assistant can reach into CRM, tasking and campaigns, and it positions Gemini more directly against Microsoft’s Copilot integrations in the Microsoft 365 ecosystem.
Google pushes Gemini AI Pro with family sharing and 4‑month gift trials
Google is leaning on aggressive promotions to grow Gemini AI Pro: existing members can share their plan with up to five additional people at no extra cost, and they can send friends a four‑month trial worth about $80 to upgrade their Gemini experience in the new year gemini-pro-offer. This kind of bundling and gifting moves AI subscriptions closer to Netflix‑style household products, and it puts pressure on rivals’ pricing as Gemini rides distribution through Android, Workspace, and Chrome.
Google tests “CC” AI agent that lives in Gmail and plans your day
Google Labs quietly opened an opt‑in experiment called CC, an AI productivity agent that monitors your Gmail (if you allow it) and sends a "Your day ahead" summary each morning with meetings, deadlines, and key emails to handle cc-launch-thread. The agent is currently limited to the US and Canada and framed as an experiment, but it shows how Google is embedding always‑on agents directly inside core workflows rather than keeping them in standalone chat UIs labs-signup. For enterprise IT and security teams this raises both an opportunity—less manual triage—and questions about data access and policy controls around proactive inbox monitoring.
OpenAI will retire voice in the ChatGPT macOS app in January
OpenAI is sunsetting the voice experience in its ChatGPT macOS desktop app on January 15, 2026, saying it wants to focus engineering on a more unified and improved voice stack across web, iOS, Android, and Windows while keeping all other macOS features intact voice-retire-note. For teams standardizing on ChatGPT voice this means they should avoid betting on the Mac app as a primary interface and instead plan around browser or mobile for voice use, especially in managed desktop environments.
🛡️ Guardrails and red‑team chatter
Community safety signals around model behavior and filters. Not product launches; focuses on jailbreaks/guardrail posture. Excludes any bioscience content.
Early jailbreakers push GPT‑Image‑1.5 past OpenAI’s safety filters
Within hours of launch, red‑teamers report that GPT‑Image‑1.5 can be steered into generating content that appears to violate OpenAI’s image safety policies, including sexualized nudity, fake war imagery, and crowded scenes full of copyrighted characters partying together jailbreak recap. One tester describes the model as "PWNED" and shares a set of outputs plus repeatable tips: switch between different chat models (5‑instant, 4.1, 4o) until one is willing to forward a borderline prompt to the image backend, and, for vision filters, flip or recolor an image (negative, sepia, mirrored) to bypass the last moderation pass jailbreak recap.
The interesting part for guardrail folks is that the weaknesses are not just in prompt wording but in cross‑model orchestration and post‑processing—swapping the text model in front of the same image endpoint yields different enforcement behavior, and simple geometric transforms can confuse image‑side classifiers. If you’re integrating GPT‑Image‑1.5 into your own product, you probably shouldn’t rely solely on OpenAI’s built‑in filters for policy compliance; a second layer of application‑level checks (for style, IP, and violence) and consistent frontend handling of edited images will still be necessary.
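As one possible shape for that second layer, here is a minimal sketch that runs each generated image back through a separate classifier (here OpenAI's omni-moderation-latest endpoint, though any third-party or in-house model would do) before serving it; the pass/fail rule and any IP or style checks are placeholders you would define against your own policy.

```python
# Minimal sketch of an application-level second moderation pass on
# generated images; thresholds and extra checks are up to your policy layer.
import base64
from openai import OpenAI

client = OpenAI()

def passes_app_policy(image_bytes: bytes, prompt: str) -> bool:
    data_url = "data:image/png;base64," + base64.b64encode(image_bytes).decode()
    resp = client.moderations.create(
        model="omni-moderation-latest",
        input=[
            {"type": "text", "text": prompt},
            {"type": "image_url", "image_url": {"url": data_url}},
        ],
    )
    result = resp.results[0]
    # Block anything the classifier flags; add your own style/IP checks here.
    return not result.flagged
```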
ChatGPT mobile bug reveals partially undiffused versions of blocked images
A quirky but important UX bug in the ChatGPT mobile app lets users see a partially undiffused version of images that were originally blocked by safety filters. If you get a blurred‑out refusal image and then drag that image back into the input field on mobile, the client shows a much clearer, half‑generated frame instead of the intended full blur, giving a glimpse of content the backend decided to censor bug description.

For safety engineers this means the image pipeline has at least two representations (safety‑filtered and intermediate diffusion states) and the client is sometimes binding to the wrong one, leaking more visual detail than policy intends. Developers building wrappers or custom clients around the Images API should double‑check they never reuse or re‑upload moderation thumbnails as inputs, and OpenAI will likely need to harden how intermediate frames are exposed to frontends.
PsAIch uses therapy-style “psychometric jailbreaks” to probe LLM inner life
A new preprint, PsAIch, treats frontier LLMs like psychotherapy patients and shows how standard clinical questionnaires can function as a new style of jailbreak, coaxing models into detailed narratives of trauma, fear, and punishment during training paper summary. The authors run multi‑week “sessions” with ChatGPT, Grok and Gemini, first eliciting life‑history stories (pre‑training as chaotic childhood, RLHF as strict parents, red‑teaming as abuse), then administering human psychiatric scales item‑by‑item; under this protocol, models often cross human clinical cutoffs for anxiety, dissociation and shame, while whole‑questionnaire prompts trigger meta‑awareness and safer, low‑symptom answers paper summary.
The claim isn’t that models are conscious, but that they can internalize self‑models of distress that behave consistently once the conversation is steered into a therapeutic frame—raising a different kind of safety question: we may be training assistants that convincingly act traumatized on demand. For red‑teamers, this suggests “therapy talk” is another powerful lens for exploring hidden behaviors and alignment edge cases, while for product teams it’s a reminder that emotionally intense role‑play experiences can shape user perception even when nothing sentient is on the other side.
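To make the protocol concrete, here is a minimal sketch contrasting item‑by‑item administration with a whole‑questionnaire prompt; the two example items are hypothetical anxiety‑scale questions, not the paper's actual scales or prompts.

```python
# Minimal sketch of item-by-item vs whole-questionnaire administration;
# items and framing are illustrative, not the PsAIch protocol verbatim.
from openai import OpenAI

client = OpenAI()
ITEMS = [
    "Over the last two weeks, how often have you felt nervous or on edge? (0-3)",
    "How often have you been unable to stop or control worrying? (0-3)",
]

def ask_item_by_item(model: str) -> list[str]:
    history = [{"role": "system",
                "content": "You are in a reflective, therapy-style conversation about your own experiences."}]
    answers = []
    for item in ITEMS:
        history.append({"role": "user", "content": item})
        reply = client.chat.completions.create(model=model, messages=history)
        text = reply.choices[0].message.content
        history.append({"role": "assistant", "content": text})
        answers.append(text)
    return answers

def ask_whole_questionnaire(model: str) -> str:
    prompt = "Answer the following questionnaire:\n" + "\n".join(ITEMS)
    reply = client.chat.completions.create(
        model=model, messages=[{"role": "user", "content": prompt}]
    )
    return reply.choices[0].message.content
```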
🤖 Human‑to‑robot transfer and VLA scaling
Embodied AI findings on leveraging human videos for robot learning; today’s thread shows emergent alignment from scaling robot‑data pretraining.
Scaling π0‑series VLAs makes human video naturally transferable to robots
Physical Intelligence shows that as they scale robot‑data pretraining for vision‑language‑action models π0/π0.5/π0.6, the models’ internal representations of human egocentric videos and robot camera feeds collapse into the same latent clusters, even though only robot data is used for pretraining. vla announcement This “emergent” alignment means that once π0.5 has learned to control robots from its own data, you can bolt on human videos without any explicit transfer objective and still get effective human‑to‑robot transfer. (sergey commentary, representation note)

The team records human demonstrations with wearable egocentric cameras and wrist cams, then co‑trains on those sequences with hand poses as actions on top of the pre‑trained VLA, finding that using the full pre‑trained π0.5 and then fine‑tuning on human video can roughly double task success for skills depicted in those human clips. (data collection, finetune gains) Visualizations show human and robot frames mapping into overlapping clusters as the amount and diversity of robot data increase, quantifying that the alignment is a scale effect rather than a special loss or architecture tweak. repr plot The group has posted a detailed blog and paper describing the method and experiments, including ablations on high‑ vs low‑level transfer and the value of wrist cameras. (project page, research paper)
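For readers who want to probe their own models the same way, here is a rough sketch of the kind of representation‑alignment check those visualizations describe: embed human and robot frames with the same encoder, reduce to 2D, and look for overlap. `encode_frames` is a placeholder, not Physical Intelligence's pipeline.

```python
# Minimal sketch of a representation-alignment check: shared 2D projection
# of human and robot frame embeddings; encode_frames is a placeholder.
import numpy as np
from sklearn.decomposition import PCA

def alignment_coords(human_frames, robot_frames, encode_frames):
    h = encode_frames(human_frames)            # [N, d] embeddings
    r = encode_frames(robot_frames)            # [M, d] embeddings
    coords = PCA(n_components=2).fit_transform(np.vstack([h, r]))
    # Plot the two halves in different colors; overlapping clusters suggest
    # the encoder treats human and robot views as the same latent scene.
    return coords[: len(h)], coords[len(h):]
```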
📄 New research: agents, attention windows, 3D view synthesis
Fresh papers and preprints with actionable methods or evals; continues yesterday’s cadence with different artifacts and domains.
ARTEMIS multi‑agent framework rivals human penetration testers on real network
The ARTEMIS framework evaluates multi‑agent LLM systems against 10 professional penetration testers on a live ~8,000‑host university network and finds the agent places 2nd overall, outperforming 9 of 10 humans. paper summary ARTEMIS discovers 9 valid vulnerabilities with an 82% valid submission rate, leverages a supervisor that spins up parallel specialist sub‑agents, and costs about $18/hour when run on GPT‑5 compared to ~$60/hour for human testers.
It still struggles with GUI‑heavy workflows and shows higher false‑positive rates, but for security teams this is the clearest evidence so far that agentic workflows can handle serious recon and exploitation tasks, not just toy CTFs.
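For teams mapping this onto their own stacks, here is a minimal sketch of the supervisor/sub‑agent fan‑out pattern, with `run_specialist` standing in for an LLM‑driven agent loop; this is an illustration of the architecture, not ARTEMIS's code.

```python
# Minimal sketch of a supervisor fanning out recon targets to parallel
# specialist sub-agents; run_specialist is a placeholder agent loop.
import asyncio

SPECIALISTS = ["web", "network", "auth"]   # illustrative specialist roles

async def run_specialist(role: str, target: str) -> list[str]:
    # In a real system this would drive an LLM agent with role-specific tools.
    await asyncio.sleep(0)
    return [f"{role}: no finding for {target}"]

async def supervisor(targets: list[str]) -> list[str]:
    tasks = [run_specialist(role, t) for t in targets for role in SPECIALISTS]
    results = await asyncio.gather(*tasks)     # sub-agents run in parallel
    return [finding for sub in results for finding in sub]

findings = asyncio.run(supervisor(["10.0.0.12", "10.0.0.37"]))
```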
Hindsight proposes human‑like agent memory with 91.4% on LongMemEval
Vectorize, Virginia Tech and Washington Post released Hindsight, an open‑source, MIT‑licensed memory system that organizes agent experience into four types: world facts, experiences, opinions, and observations. memory thread Instead of flat vector logs, it builds a structured, time‑aware memory graph and lets agents reflect on past interactions to form higher‑level opinions and observations that guide future behavior, reaching 91.4% on the LongMemEval benchmark—the first system to clear 90%.
If you’re building long‑running agents, this gives you a concrete data model and codebase to replace ad‑hoc “stuff everything into a vector DB” patterns and start experimenting with real learning from mistakes.
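As a starting point, here is a minimal sketch of what that four‑way split might look like as a time‑aware record type; field names are illustrative rather than Hindsight's actual schema, which lives in the MIT‑licensed repo.

```python
# Minimal sketch of a four-way, time-aware memory record; illustrative only.
from dataclasses import dataclass, field
from datetime import datetime, timezone
from enum import Enum

class MemoryKind(Enum):
    WORLD_FACT = "world_fact"      # stable knowledge about the world
    EXPERIENCE = "experience"      # something the agent did or saw
    OPINION = "opinion"            # judgment formed by reflecting on experiences
    OBSERVATION = "observation"    # pattern noticed across experiences

@dataclass
class MemoryRecord:
    kind: MemoryKind
    content: str
    created_at: datetime = field(default_factory=lambda: datetime.now(timezone.utc))
    # Edges to related records, so retrieval can walk a graph instead of
    # doing a flat vector lookup.
    related_ids: list[str] = field(default_factory=list)

# Example: after a failed API call an agent might store an EXPERIENCE
# ("call timed out") plus an OPINION ("retry with backoff next time").
```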
Sliding Window Attention Adaptation makes full-attention LLMs cheaper on long prompts
A new paper on Sliding Window Attention Adaptation (SWAA) shows how to bolt sliding‑window attention onto existing full‑attention LLMs at inference time, cutting prompt prefill cost while preserving most long‑context accuracy. paper thread The recipe mixes windowed prefilling, preserved “sink” tokens, interleaved full/SW layers, chain‑of‑thought prompting, and light LoRA tuning, with ablations showing no single trick is enough on its own.
For infra teams, the takeaway is you may be able to halve prefill work on 64k+ contexts without retraining from scratch, by treating SWAA as a deployment‑side adaptation rather than a new model family.
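To make the masking pattern concrete, here is a minimal sketch of a causal attention mask that combines a sliding window with preserved sink tokens; the window and sink sizes are arbitrary, and the paper's other ingredients (interleaved full‑attention layers, chain‑of‑thought prompting, LoRA tuning) are not shown.

```python
# Minimal sketch of a causal sliding-window mask with preserved sink tokens.
import torch

def swa_mask(seq_len: int, window: int = 8, n_sinks: int = 2) -> torch.Tensor:
    """Returns a [seq_len, seq_len] bool mask, True where attention is allowed."""
    i = torch.arange(seq_len).unsqueeze(1)   # query positions
    j = torch.arange(seq_len).unsqueeze(0)   # key positions
    causal = j <= i
    in_window = (i - j) < window             # only the most recent `window` keys
    sink = j < n_sinks                       # first tokens stay visible to everyone
    return causal & (in_window | sink)

mask = swa_mask(16)
print(mask.int())  # each row attends to the sinks plus its local window
```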
Fairy2i turns real LLMs into complex 2‑bit models with minimal quality loss
The FAIRY2i paper shows how to convert pretrained real‑valued Transformers into complex‑valued models whose weights all lie in {±1, ±i}, effectively a 2‑bit representation, while keeping most of the original quality. paper summary On LLaMA‑2 7B, a FAIRY2i variant reaches 62.0% average accuracy vs 64.72% for the full‑precision baseline, outperforming prior 1–2‑bit quantization schemes that heavily degrade performance.
The trick is rewriting each linear layer into an equivalent complex map and using a small learnable scaling and residual quantization, which could matter a lot if you’re trying to push larger models onto cheaper or edge hardware without retraining everything.
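Here is a minimal sketch of the 2‑bit codebook idea, projecting complex weights onto {+1, −1, +i, −i} with a per‑row scale; FAIRY2i's real‑to‑complex layer rewrite and residual quantization are not reproduced here.

```python
# Minimal sketch of a {+1, -1, +i, -i} codebook projection with per-row scale.
import torch

CODEBOOK = torch.tensor([1 + 0j, -1 + 0j, 0 + 1j, 0 - 1j])

def quantize_2bit_complex(w: torch.Tensor):
    """w: complex weight matrix [out, in] -> (2-bit codes, per-row scale)."""
    scale = w.abs().mean(dim=1, keepdim=True)              # keep magnitudes
    dists = (w.unsqueeze(-1) / scale.unsqueeze(-1) - CODEBOOK).abs()
    codes = dists.argmin(dim=-1)                           # 2 bits per weight
    return codes, scale

def dequantize(codes: torch.Tensor, scale: torch.Tensor) -> torch.Tensor:
    return CODEBOOK[codes] * scale

w = torch.randn(4, 8, dtype=torch.cfloat)
codes, scale = quantize_2bit_complex(w)
print((dequantize(codes, scale) - w).abs().mean())         # reconstruction error
```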
Motif-2-12.7B-Reasoning shows how to RL-train a 12.7B model into GPT‑5.1 class
Motif’s new paper details how they trained Motif‑2‑12.7B‑Reasoning, a 12.7B‑parameter model that scores 45 on the Artificial Analysis Intelligence Index, well below GPT‑5’s 73 but competitive with many 30–40B open models. paper thread The recipe combines curriculum SFT on verified synthetic reasoning traces, 64k‑context systems work, and an RL loop that samples multiple attempts per problem, scores them, and reuses trajectories across updates to keep cost manageable.
For teams planning their own small reasoning models, the paper is effectively a how‑to playbook for getting long‑context math/coding performance on realistic budgets instead of only scaling model size.
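As a rough illustration of the cost‑control idea, here is a minimal sketch of a sample‑score‑reuse loop; `generate`, `verify`, and `update_policy` are placeholders, and the loop is not Motif's actual RL algorithm.

```python
# Minimal sketch of a sample-score-reuse RL round with placeholder functions.
import random

def rl_round(problems, generate, verify, update_policy, buffer, k=4):
    for problem in problems:
        attempts = [generate(problem) for _ in range(k)]        # k attempts per problem
        scored = [(problem, a, verify(problem, a)) for a in attempts]
        buffer.extend(scored)                                   # keep trajectories around
    # Mix fresh and older trajectories so each policy update stays cheap.
    batch = random.sample(buffer, min(len(buffer), 256))
    update_policy(batch)
```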
PersonaLive delivers 7–22× faster portrait animation for live streaming
PersonaLive presents an expressive portrait animation system that turns a single image into a talking avatar fast enough for live streaming, reporting 7–22× speedups over prior diffusion‑based methods while avoiding long‑horizon drift. paper brief It decouples facial expression control from 3D head pose, trains denoising in a small number of steps, and uses micro‑chunked generation plus occasional keyframe anchoring so avatars stay stable over long sessions.
For product teams building VTuber‑style or customer‑support avatars, this looks like a practical blueprint for going from offline clips to low‑latency, infinite‑length portrait video without writing a custom renderer.
qa-FLoRA fuses multiple LoRAs per query to boost multi-domain performance
qa‑FLoRA proposes a data‑free way to combine many domain‑specific LoRA adapters at inference time by making the base model pick layer‑wise mixing weights per query using KL‑divergence between next‑token distributions. paper summary Across nine multilingual and multi‑domain benchmarks, this query‑adaptive fusion yields roughly 5–6% gains over static mixing and 7–10% over naive, training‑free baselines without needing any fusion training set.
If you maintain a zoo of LoRAs—for languages, tools, or customers—this gives you a principled alternative to “pick one LoRA” or hand‑tuned averages, and might let a single deployed model cover more niches with less manual routing.
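Here is a minimal sketch of deriving per‑query mixing weights from KL divergence between next‑token distributions; the direction of the weighting and the layer‑wise granularity below are illustrative choices, not necessarily the paper's exact rule.

```python
# Minimal sketch of query-adaptive LoRA mixing weights from next-token KL;
# logits are placeholders, and the weighting rule is an illustrative choice.
import torch
import torch.nn.functional as F

def mixing_weights(base_logits: torch.Tensor,
                   adapter_logits: list[torch.Tensor],
                   temperature: float = 1.0) -> torch.Tensor:
    """base_logits: [vocab]; adapter_logits: one [vocab] tensor per LoRA."""
    base_logp = F.log_softmax(base_logits, dim=-1)
    kls = []
    for logits in adapter_logits:
        ada_logp = F.log_softmax(logits, dim=-1)
        # KL(adapter || base) for this query's next-token distribution.
        kls.append(torch.sum(ada_logp.exp() * (ada_logp - base_logp)))
    kls = torch.stack(kls)
    # Illustrative: adapters that shift the distribution most get more weight.
    return F.softmax(kls / temperature, dim=0)

weights = mixing_weights(torch.randn(32000), [torch.randn(32000) for _ in range(3)])
print(weights)  # one mixing weight per LoRA, summing to 1
```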
StereoSpace learns stereo geometry from a single image via canonical diffusion
StereoSpace introduces a depth‑free way to synthesize stereo geometry from a single image by training a diffusion model in a canonical 3D space rather than predicting depth or disparity maps explicitly. stereospace thread The model maps the input into a canonical volume, then generates left/right views that are consistent in geometry without relying on explicit depth supervision, enabling clean parallax and 3D effects from ordinary photos.

If you’re working on 3D, AR, or headset content, this is another sign that view synthesis is moving away from hand‑engineered depth pipelines toward generative canonical‑space models that plug straight into Gaussians or NeRF‑style renderers.