OpenAI GPT‑Image‑1.5 tops image arenas at 1264 Elo – 20% cheaper tokens

Tue, Dec 16, 2025

Executive Summary

OpenAI quietly made its biggest image move since DALL·E by shipping GPT‑Image‑1.5 across ChatGPT (Free, Go, Plus, Edu, Pro) and the API, then wiring it into a new Images tab. The model is up to 4× faster than GPT‑Image‑1 and moves to token billing with roughly 20% cheaper image input and output tokens, resolutions up to 1536×1024, and discounted cached inputs so high‑volume apps don’t get punished for retries.

On benchmarks, the jump is real: GPT‑Image‑1.5 debuts on LMArena’s text‑to‑image board at 1264 Elo, about 29 points ahead of Gemini 3 Pro Image (Nano Banana Pro), and chatgpt-image-latest lands #1 on the edit leaderboard. Artificial Analysis shows similar gains, with +147 Elo over GPT‑Image‑1 on generation and +245 on editing. Builders report much sharper, identity‑preserving edits, so "change only what you ask for" is finally more than marketing, and vastly better in‑image text for UI mocks, posters, and fake newspapers.
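
To put those Elo gaps in perspective, here is a minimal sketch that converts a rating difference into an expected head-to-head win rate. It assumes the standard logistic Elo model with a 400-point scale; the arenas may fit their scores differently, so treat the percentages as a rough guide rather than the leaderboards' own numbers. The function name `expected_win_rate` and the labels in `gaps` are illustrative, not taken from either leaderboard.

```python
def expected_win_rate(elo_gap: float, scale: float = 400.0) -> float:
    """Expected win rate of the higher-rated model in a pairwise vote.

    Assumes the standard logistic Elo model; arena scoring may differ.
    """
    return 1.0 / (1.0 + 10.0 ** (-elo_gap / scale))


# Rating gaps quoted in the summary above.
gaps = {
    "LMArena text-to-image lead over Gemini 3 Pro Image": 29,
    "Artificial Analysis generation gain over GPT-Image-1": 147,
    "Artificial Analysis editing gain over GPT-Image-1": 245,
}

for label, gap in gaps.items():
    rate = expected_win_rate(gap)
    print(f"{label}: +{gap} Elo -> {rate:.1%} expected win rate")
```

Under this assumption a 29-point lead works out to winning roughly 54% of pairwise votes, which is a real but narrow edge, while the gains over GPT‑Image‑1 imply a much more lopsided matchup.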

The new ChatGPT Images surface adds reusable style chips ("3D glam doll", "Plushie", "Holiday portrait"), prompt starters, and mobile branching so image work feels more like a design tool than a chat log. Day‑0 integrations on fal, Replicate, and Figma Weave mean you can A/B it against Nano Banana Pro today, which is worth doing because many builders still prefer Google's model for dense infographics and party scenes. One caveat: early jailbreaks and a mobile partial‑diffusion bug suggest you should layer your own safety filters in front of whatever OpenAI ships by default.

Feature Spotlight

Feature: ChatGPT Images with GPT‑Image‑1.5

OpenAI’s GPT‑Image‑1.5 lands in ChatGPT and the API: up to 4× faster generation, tighter instruction‑following and edits, roughly 20% lower image I/O token cost, and a new Images UI. Early arenas show #1 text‑to‑image and #1 image edit (chatgpt‑image‑latest).

Cross‑account, high‑volume launch. OpenAI ships a new image model and an Images surface in ChatGPT; posts span API pricing/params, speedups, editing fidelity, UI rollout, and early head‑to‑head results/leaderboards.

Table of Contents

🖼️ Feature: ChatGPT Images with GPT‑Image‑1.5

OpenAI launches GPT Image 1.5 and new ChatGPT Images surface

Builders report much sharper, identity‑preserving edits with GPT‑Image‑1.5

GPT‑Image‑1.5 jumps to #1 on LMArena and Artificial Analysis

ChatGPT adds an Images tab with style presets on web and mobile

GPT Image 1.5 lands day‑0 on fal, Replicate and Figma Weave

GPT‑Image‑1.5 improves on in‑image text and logo preservation

Practitioners see GPT‑Image‑1.5 and Nano Banana Pro trading blows

Users uncover partial‑diffusion leak and jailbreak patterns in GPT‑Image‑1.5


🚀 Open models and new releases (non‑vision)

Xiaomi open-sources MiMo‑V2‑Flash 309B MoE model with strong coding and math

Meta’s SAM Audio model lands on Hugging Face as open audio editor

Browser‑Use open-sources BU‑30B‑A3B‑Preview, tuned for cheap web agents

Meituan’s LongCat‑Video‑Avatar open-sources real‑time talking character model

NVIDIA Nemotron‑Cascade 8B debuts as open general‑purpose reasoning model

VoxCPM TTS ports to Apple’s MLX for native Mac inference


🗣️ Voice agents and native audio progress

Lemon Slice 2 turns any voice agent into a 20 fps talking avatar

Meta’s SAM Audio model brings open unified sound separation and editing

OpenAI retires ChatGPT voice on macOS desktop app in January

Grok reportedly clones caller’s voice mid‑conversation


🧑‍💻 Agent stacks and coding toolchains

cua-bench debuts as a benchmark and RL suite for computer-use agents

AI SDK 6 beta adds Standard JSON Schema support and Anthropic tool search

Claude Code ships diff highlighting, prompt suggestions, plugin marketplace and guest passes

CopilotKit launches useAgent hook with A2A protocol for interactive agent apps

Kilo adopts KAT-Coder-Pro as free default non-reasoning coding model

Yutori’s Scouts browser research agent moves from preview to general availability


⚙️ Serving, routing and runtime efficiency

SGLang and DeepXPU R‑Fork slashes weight load time with GPU‑to‑GPU tensor fork

vLLM Router adds prefill/decode‑aware load balancing with KV‑cache affinity

Cline moves to Vercel AI Gateway, cutting errors 43.8% and speeding P99 streams

SGLang adds day‑0 MiMo‑V2‑Flash support with efficient SWA and multi‑layer MTP


📊 Evals and agent benchmarks (non‑vision)

Gemini 3 Pro double‑length Pokémon Crystal run beats 2.5 Pro on tokens and time

Agent S edges past human benchmark on OSWorld desktop suite

GPT‑5.2‑high debuts at #13 on Text Arena but leads math category

GPT‑5.2 medium‑reasoning tops new creative story‑writing benchmark

Epoch’s ECI framework can spot a 2× capability acceleration within months

Offline IQ tests put GPT‑5.2 and Gemini 3 Pro roughly neck‑and‑neck


🎥 Vision/video ecosystem beyond OpenAI

Hunyuan 3D v3 brings 3.6B‑voxel, 1536³ text/image/sketch‑to‑3D to fal

Higgsfield offers “UNLIMITED WAN 2.6” video runs with aggressive promo pricing

Kling 2.6 on fal adds voice cloning and multi-character control for video

AI2’s Molmo 2 pushes open multimodal models to SOTA on image and video tasks

Builders still treat Nano Banana Pro as the quality bar despite new GPT Image gains

Higgsfield pipelines lean on Nano Banana Pro grids for consistent multi-shot scenes

Higgsfield showcases NB Pro + Wan 2.6 pipelines for AI “x-ray” and cartoon workflows

Tencent HY World 1.5 streams controllable 3D “world model” video in real time

Meituan’s LongCat-Video-Avatar brings audio-driven character animation to Hugging Face

Tencent pushes Hunyuan 3D into consumer workflows with HolidayHYpe ornament challenge


🔌 Interoperability: MCP and app‑level connectors

Firecrawl ships Lovable connector for instant scrape/search/crawl in apps

Gemini Gems from Labs bring visual AI workflows into the Gemini web app

CopilotKit’s new useAgent hook wires A2A agents directly into frontends

Gemini for Workspace adds Asana, HubSpot and Mailchimp connectors


💼 Capital, customers and enterprise adoption

67% of physicians now use AI daily; OpenEvidence and ChatGPT lead

OpenAI reportedly seeks $10B+ from Amazon and access to its AI chips

GPT Image 1.5 launches with cheaper tokens and broad ChatGPT access

OpenAI reshapes its outward-facing leadership with George Osborne hire

US government forms 1,000-person “Tech Force” focused on AI infrastructure

Chai Discovery raises $130M Series B at $1.3B to industrialize AI drug design

Gemini for Workspace adds Asana, HubSpot and Mailchimp connectors

Google pushes Gemini AI Pro with family sharing and 4‑month gift trials

Google tests “CC” AI agent that lives in Gmail and plans your day

OpenAI will retire voice in the ChatGPT macOS app in January


🛡️ Guardrails and red‑team chatter

Early jailbreakers push GPT‑Image‑1.5 past OpenAI’s safety filters

ChatGPT mobile bug reveals partially undiffused versions of blocked images

PsAIch uses therapy-style “psychometric jailbreaks” to probe LLM inner life


🤖 Human‑to‑robot transfer and VLA scaling

Scaling π0‑series VLAs makes human video naturally transferable to robots


📄 New research: agents, attention windows, 3D view synthesis

ARTEMIS multi‑agent framework rivals human penetration testers on real network

Hindsight proposes human‑like agent memory with 91.4% on LongMemEval

Sliding Window Attention Adaptation makes full-attention LLMs cheaper on long prompts

Fairy2i turns real LLMs into complex 2‑bit models with minimal quality loss

Motif-2-12.7B-Reasoning shows how to RL-train a 12.7B model into GPT‑5.1 class

PersonaLive delivers 7–22× faster portrait animation for live streaming

qa-FLoRA fuses multiple LoRAs per query to boost multi-domain performance

StereoSpace learns stereo geometry from a single image via canonical diffusion
