FLUX.2 [klein] 9B-KV adds KV caching – 2× faster multi-reference edits


Executive Summary

Black Forest Labs shipped FLUX.2 [klein] 9B-KV, a KV-cached variant aimed at multi-reference image editing; the company claims 2×+ faster iterations with no price increase, with gains increasing as more references are added because ref-encoding compute is amortized across steps. It’s positioned as a free API upgrade for existing users; FP8 weights also drop for lower-VRAM local runs, expanding self-hosting viability while keeping the same “reference-heavy edit loop” focus.

Gemini API billing: Google adds experimental spend caps; enforcement can lag by ~10 minutes, with overages during the window still billed; email notifications are “shortly” rolling out.
Grok Imagine refs: xAI demos explicit reference binding via @Image tags with up to 7 references; creators are treating prompts as slot-filled shot lists, but scale/perspective consistency complaints persist.
Velma (Modulate): transcription API pitched as 10–90× cheaper than common STT vendors on a cost-vs-WER chart; no raw eval artifact is linked in the threads.

Net: caching, caps, and reference binding are all pushing toward longer, denser “asset stacks” per project; third-party benchmarks and reproducible artifacts remain the weak link across most claims.



Feature Spotlight

Photoshop’s new object rotation + “Harmonize” makes compositing feel 3D (beta)

Photoshop’s Rotate Object + Harmonize (beta) turns flat cutouts into re-poseable, relightable elements—speeding up believable composites without leaving PS.

Biggest hands-on creative tooling story today: Photoshop (beta) adds Rotate Object for 2D layers plus Harmonize for relighting/shadow blending, with multiple creators demoing “turntable-like” control. This category centers on practical Photoshop workflows and excludes other platforms’ releases.



🧩 Photoshop’s new object rotation + “Harmonize” makes compositing feel 3D (beta)

Biggest hands-on creative tooling story today: Photoshop (beta) adds Rotate Object for 2D layers plus Harmonize for relighting/shadow blending, with multiple creators demoing “turntable-like” control. This category centers on practical Photoshop workflows and excludes other platforms’ releases.

Photoshop (beta) ships Rotate Object for turntable-like control over 2D layers

Rotate Object (Adobe Photoshop beta): The new Rotate Object tool lets you rotate a cutout/isolated pixel layer to a different viewpoint (effectively “turntable for pixels”), as announced in the feature release and demonstrated in the 2D to 3D-ish demo.

Video: Rotate Object demo

The practical creative impact is faster compositing iterations: instead of hunting for a new source photo angle (or re-rendering in 3D), you can push the same asset to a better-facing pose and keep moving.

Rotate then Harmonize: the new Photoshop compositing loop

Harmonize (Adobe Photoshop beta): The strongest “do this right after rotating” workflow is to run Harmonize to auto-blend lighting and shadowing into the destination scene, as shown alongside Rotate Object in the workflow demo and in a step-by-step tulips composite in the Harmonize example.

This reads like a new default loop for ad comps: rotate your product/prop to the angle you want, then harmonize so it stops looking pasted-on.

Photoshop beta “2D to 3D-ish” placement experiments are emerging

2D asset placement (Photoshop beta): Creators are already treating Rotate Object as a lightweight way to make flat assets feel spatial—e.g., taking a simple 2D cloud illustration and manipulating it as a 3D-ish object you can rotate and position, per the cloud experiment.

Video: Cloud 2D to 3D rotation

This is less about perfect geometry and more about getting “good enough” perspective shifts for posters, key art, and quick story frames without leaving Photoshop.

“Cat rotation” becomes the shorthand demo for Photoshop’s new rotate tool

Community signal (Photoshop beta): The feature is spreading with a simple meme demo—rotate a cat silhouette 180°—as a fast way to show the capability without a complex composite, per the cat rotation clip riffing on the same Rotate Object release in the feature post.

Video: Cat rotation clip

It’s a small thing, but it’s the kind of “one-gesture” proof that tends to travel quickly inside creative teams and group chats.


🧷 Reference-driven creation gets real: Grok Imagine image refs + “Omni” mashups

Heavy focus today on keeping characters/scenes consistent by tagging multiple reference images inside Grok Imagine (and related ‘Omni’ update posts). New here vs prior days: more concrete, step-by-step UI examples for using @Image tags and combining disparate refs.

Grok Imagine adds image references for video prompts with @Image tagging

Grok Imagine (xAI): Grok Imagine now lets you upload multiple image references and explicitly bind them inside the prompt using "@Image 1 / @Image 2"—with creators calling out support for up to 7 reference images in a single generation flow, as shown in a UI example in Reference prompt screenshot and discussed as a new update in Reference-only workflow post.

What this unlocks for creators: Reference binding turns “character + prop + location” into controllable slots (instead of hoping the model infers intent), which is why posts like Reference prompt screenshot phrase prompts as role assignments (e.g., “@Image 2 holding @Image 1 in @Image 3”).
How people are using it immediately: The same release wave is being used for reference-only video construction—generate characters, then generate scenes, then compose action prompts that call specific references, per the walkthrough framing in Reference-only workflow post.

Grok Imagine “Omni” blends unrelated references into one video concept

Grok Imagine (xAI): A new “Omni” update is being demoed as the ability to mash completely different reference sources into one video output, with examples emphasizing cross-domain combinations rather than style-matched inputs, per the Omni callout in Omni update post.

Video: Omni mashup demo

The notable creative shift here is that the “inputs don’t need to match” pitch implies reference sets can be treated as a moodboard of incompatible elements, then fused into a single sequence—exact quality limits aren’t quantified in the tweets, but the intent is clear from the mixed-source demo in Omni update post.

Reference-only video building in Grok Imagine: separate assets, then direct the shot

Grok Imagine (xAI): A hands-on workflow is emerging for making short sequences using only references: create/collect separate character images and separate environment images, upload them individually, then write prompts that “call” each asset and specify action + camera, as demonstrated in Step-by-step prompt UI.

Video: Separate uploads and @Image prompt

Prompt structure that’s actually being used: The example in Step-by-step prompt UI uses an explicit template—“Action: … Camera: … Lighting: … Sound: … setting: @location”—which functions like a mini shot list rather than a vibes-only paragraph.
Why separate uploads matter: The same post explicitly says to upload images separately and use “@” callouts, which preserves per-element control (character vs prop vs location) instead of collapsing everything into one fused reference image, per Step-by-step prompt UI.

Grok Imagine compositing pattern: character + product + location via references

Grok Imagine (xAI): Creators are using reference tagging as a compositing primitive—combine a character reference, a product reference, and a location reference, then describe the interaction (e.g., “takes a sip”) and camera move, as shown in the example prompt in Reference prompt screenshot and reinforced by a “combined 3 elements” demo in Three-element video test.

Video: Three-element reference demo

This is less about generating “a new thing” and more about directing a specific staged shot from existing ingredients, which is the key difference between generic text-to-video and reference-bound shot construction in Reference prompt screenshot.

Grok Imagine reference mixing still shows scale/perspective failures

Grok Imagine (xAI): Alongside the new multi-reference capability, at least one creator reports persistent failures in scale and perspective consistency—specifically, repeated attempts producing a subject that’s proportionally too large for the street scene—per the critique in Scale/perspective note.

The practical implication for reference-driven storytelling is that “binding” references doesn’t guarantee physically plausible composition; the complaint in Scale/perspective note frames this as a current limitation even when the wardrobe + prop + background references are clearly specified.

Stop-motion aesthetic tests are landing well in Grok Imagine

Grok Imagine (xAI): A stop-motion aesthetic experiment—miniature clay-like character motion and handcrafted set feel—is being highlighted as a strong fit for Grok Imagine’s generation style, per the showcase in Stop-motion demo.

Video: Stop-motion style clip

This matters because stop-motion is one of the clearer “style wins” for short-form storytelling: the movement reads as intentionally imperfect, and the output in Stop-motion demo is presented as closer to handmade animation language than glossy cinematic realism.


🎬 AI video craft: Kling 3.0 multi-shot prompts, horror beats, and Seedance ad power

Video posts today skew toward ‘how it looks in motion’ across Kling and Seedance, including multi-shot prompting and micro-genre experiments. Excludes Grok Imagine reference mechanics (covered under identity) and post/upscale tooling (covered under post).

Kling Motion Control 3.0 Challenge sets $30K + 300M credits with Mar 18 deadline

Kling Motion Control 3.0 (KlingAI): Kling is running a Motion Control 3.0 creator challenge with a stated reward pool of $30K USD + 300 million credits, requiring entries to keep the Kling watermark, use #KlingMotionControl3, include “Created by KlingAI,” tag @kling_ai, and DM a Kling UID, as listed in the Challenge rules post; the submission window shown there runs Mar 5–Mar 18, 2026 (UTC‑8).

Timeline clarity: The same rules graphic calls out reward distribution as Mar 19–Apr 1, 2026 (UTC‑8), and splits prize criteria by platform (TikTok/Instagram vs X), per the Challenge rules image.
Reminder + example motion: Kling also pushed a “still on” reminder clip in the Challenge reminder, reinforcing that Motion Control is the feature focus.

Video: Motion control promo clip

Seedance 2.0 commercial workflow: generate, extract assets, and re-prompt with references

Seedance 2.0: Creators are demonstrating an ad-style workflow that treats Seedance outputs as a source of reusable assets—generate a first pass, then extract frames/clips/audio and feed them back as new references—positioned as a path to “premium commercials for any product” in the Workflow walkthrough.

Video: Seedance ad example

Reference-heavy approach: The breakdown claims Seedance 2.0 supports up to 9 references across images/video/audio, and shows a practical tip to stitch a 3×3 grid into one image to conserve slots, as explained in the Asset accumulation note (a minimal stitching sketch follows this list).
Prompt scaffold included: The example prompt includes subject, action beats (macro zooms, page turns), lighting style (chiaroscuro), and VO beats, all laid out in the Asset accumulation note thread.
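As a concrete version of the grid tip above, here’s a minimal Pillow sketch (file names are placeholders, and this is an assumption about how creators are prepping the grid, not an official Seedance feature) that packs up to nine references into one 3×3 image so they occupy a single slot:

```python
# Stitch up to 9 reference images into a single 3x3 grid image.
# Assumes Pillow is installed; the ref_*.png file names are placeholders.
from PIL import Image

def make_grid(paths, tile=512, cols=3):
    rows = -(-len(paths) // cols)  # ceil division
    grid = Image.new("RGB", (cols * tile, rows * tile), "white")
    for i, path in enumerate(paths):
        img = Image.open(path).convert("RGB").resize((tile, tile))
        grid.paste(img, ((i % cols) * tile, (i // cols) * tile))
    return grid

# Usage: nine refs in, one reference slot out.
make_grid([f"ref_{n}.png" for n in range(9)]).save("refs_3x3.png")
```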

A copy-paste Kling 3.0 multi-shot prompt uses timestamps to lock a 5-shot sequence

Kling 3.0 Multi-shot prompting: A creator shared a copy-paste structure that scripts a full 00:00–00:15 sequence as five timestamped shots (wide establishing → car arrival → tracking walk → beauty close-up → hero reveal), using a Y2K rooftop-party “entrance” as the template in the Multi-shot prompt text.

Why this format works: The prompt spells out shot-by-shot camera language (“low-angle tracking,” “close-up beauty shot”) plus wardrobe and lighting notes, keeping the model’s job closer to executing a shot list than inventing one, as written in the Multi-shot prompt text.
Reusable scaffold: The same structure can be swapped to other genres by replacing location + wardrobe + action verbs while keeping the timestamp cadence, based on the way the Multi-shot prompt text is written.
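A generic reconstruction of that scaffold, expressed as data you can swap per genre (the shot wording below is illustrative, not the creator’s exact prompt text):

```python
# Timestamped 5-shot scaffold: keep the cadence, swap location/wardrobe/action verbs.
multi_shot_prompt = "\n".join([
    "00:00-00:03  Wide establishing shot: rooftop party at dusk, Y2K styling.",
    "00:03-00:06  Car arrival: low-angle as the car pulls up, neon reflections.",
    "00:06-00:10  Low-angle tracking walk toward the entrance, crowd parts.",
    "00:10-00:13  Close-up beauty shot: face lit by string lights, wind in hair.",
    "00:13-00:15  Hero reveal at the door, slow push-in, hold for title.",
])
print(multi_shot_prompt)
```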

Kling 3.0 horror micro-scene: “closet opens → shadow reveal” jump-scare beat

Kling 3.0 (KlingAI): A new horror micro-scene template is circulating: a dark bedroom where a closet door slowly opens, revealing a shadowy figure before a fast lunge/cut-to-black, as shown in the Horror clip.

Video: Closet jump-scare beat

Beat design: It’s a tight, single-location sequence that leans on pacing (slow open) and a single “reveal” moment, matching the structure visible in the Horror clip video.
Micro-genre iteration: The post frames it as “still playing with horror in Kling 3.0,” implying this beat is one of several repeatable scare patterns being tried, per the Horror clip.

LTX Studio 2.3 Fast clip frames “same-day idea to video” as the new baseline

LTX Studio 2.3 Fast (LTX Studio): A short narrative clip titled “Mara The Sky Swan” was shared as an example of producing a story the same day the idea appears, made with LTX 2.3 Fast plus BeatBandit per the Creation note.

Video: Sky swan short clip

Story-first framing: The post explicitly ties the tool choice to compressing the time from “idea at night” to “video output,” which is the core claim in the Creation note.


🛠️ End-to-end creator automation: agents that pitch, plan, and produce while you sleep

Multi-tool recipes dominate this slice: automating agency deliverables, creator ops, and production loops using agents + multiple SaaS/tools. New today is the concrete “ad audit → deck” pipeline and recurring-job style content agents.

Automated ad-audit sales deck: Firecrawl + Apify + Gemini 2.5 Pro + Gamma in ~5 minutes

Ad-audit deck automation (n8n-style pipeline): A creator documented a prospecting workflow that turns “free ad audits” (typically 2–3 hours per prospect) into an automated branded sales deck in about 5 minutes, using a URL + Meta Ads Library link as inputs per the pipeline walkthrough.

Video: Form to deck automation demo

Data collection: Firecrawl pulls branding/messaging from the client site while Apify scrapes active Meta ads (images, videos, carousels), as described in the pipeline walkthrough.
Analysis + packaging: Gemini 2.5 Pro runs the creative audit and Gamma assembles a branded deck with screenshots, according to the pipeline walkthrough.

The post frames the operational value as scaling from a few audits per week to roughly 10 personalized pitches per day, using the same pipeline shown in the pipeline walkthrough.
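For orientation, the pipeline shape reduces to four calls. Everything below is a stand-in stub: the real workflow wires Firecrawl, Apify, Gemini 2.5 Pro, and Gamma together (n8n-style), and these function names, signatures, and return values are assumptions, not any vendor’s API.

```python
# Hedged sketch of the audit-to-deck pipeline shape; each step is a stub.
def scrape_site_branding(site_url: str) -> dict:
    return {"url": site_url, "tone": "placeholder", "palette": "placeholder"}   # Firecrawl stand-in

def scrape_active_meta_ads(library_url: str) -> list[dict]:
    return [{"format": "video", "hook": "placeholder"}]                         # Apify stand-in

def analyze_creatives(branding: dict, ads: list[dict]) -> str:
    return "audit findings"                                                     # Gemini 2.5 Pro stand-in

def build_branded_deck(audit: str, branding: dict) -> str:
    return "https://example.com/deck"                                           # Gamma stand-in

def run_ad_audit(site_url: str, meta_ads_url: str) -> str:
    branding = scrape_site_branding(site_url)
    ads = scrape_active_meta_ads(meta_ads_url)
    audit = analyze_creatives(branding, ads)
    return build_branded_deck(audit, branding)

print(run_ad_audit("https://brand.example", "https://example.com/meta-ads-library-link"))
```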

OpenClaw “no‑AI‑slop” content ops system: ideas→research→scripts→schedule→analytics loop

OpenClaw content ops (creator system): A creator shared a full “no‑AI‑slop” production loop built around OpenClaw, covering idea discovery, research, script drafting, scheduling, and analytics feedback; they claim it saves 12+ hours per week in the system overview.

Video: Dashboard walkthrough of system

The visible structure in the demo emphasizes an end-to-end pipeline rather than single prompts—analytics gets fed back into planning, as shown in the system overview. The post positions it as an operating system for repeatable output (not one-off generation), with the “improve over time” loop being the core differentiator described in the system overview.

Raelume pipeline: one image→camera sheet→shot selects→Kling 3.0→CapCut edit

Raelume (shot planning from a single image): A creator breakdown shows Raelume generating a “camera sheet” from one source image (up to 9 angles), then using selected shots as inputs to an image model, handing frames to Kling 3.0 for motion, and finishing pacing in CapCut per the workflow breakdown.

Video: Heist sequence example

The supporting screenshot illustrates the intermediate artifact (camera sheet → multiple image blocks) and the “select shots, then generate” structure, matching the sequencing described in the pipeline detail.

AVA agent runs recurring content jobs via Notion/Drive/Discord/Gmail integrations

AVA (AI Video Assistant): A recurring-job agent was presented as a “prompt once, run 24/7” system that connects to Notion, Discord, Google Drive/Docs, Gmail, and web search, with an emphasis on “no setup” per the product description.

Video: Recurring job sends to Discord

Examples shown/described include scheduling daily trend digests, “most viral posts” retrieval, market summaries, and even a gym routine that posts into a Discord channel, as laid out in the workflow examples thread. The core product claim is that these tasks can be saved as jobs with a frequency and run continuously, matching the behavior shown in the product description.

Operational agent task: collecting 3 months of invoices across email + portals (login friction)

Back-office automation with a browser agent: A creator reports getting an agent ("clawdeez_bot") to gather three months of invoices and send them to an accountant, pulling from both email and billing portals, as described in the invoice collection report.

They note the main blocker was browser-use friction—manual logins across services—and explicitly call out wanting a 1Password-style credential handoff for agents, per the invoice collection report. The reported constraint is less about extraction logic and more about authentication/hand-offs in real workflows, as framed in the same invoice collection report.


🧪 Copy‑paste prompt kit: Midjourney SREF styles + Nano Banana UI/brand templates

Today’s prompt content is largely design-system and illustration style references (SREFs) plus reusable web/UI mockup prompts. This is more template-heavy than pure art sharing, and it excludes general model capability upgrades (covered elsewhere).

Nano Banana smart prompt for bento-grid e-commerce UI concepts with one variable swap

Nano Banana 2 (Prompt template): A copy-paste “smart prompt” is shared for generating high-end minimalist e-commerce interface mockups in a rounded “bento box” grid; it’s built around swapping just [BRAND NAME] and [CATEGORY], with specific layout requirements (top nav; multiple tiles; large “+” icon; promo tag like “20% OFF”; “SHOP ALL” button), as written in the prompt template screenshot and reiterated via the concept example.

Layout constraints: The prompt calls for deeply rounded inner corners (radius ~40px) and a black-bordered device frame, as specified in the prompt template screenshot.
Content slots: It explicitly requests a hero product shot, a vertical detail shot, and a texture close-up tile, as shown in the prompt template screenshot.
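An illustrative reconstruction of that slot structure (not the shared prompt verbatim) shows how little you actually swap per run:

```python
# Illustrative template: only the two bracketed variables change between runs.
def bento_ui_prompt(brand_name: str, category: str) -> str:
    return (
        f"High-end minimalist e-commerce interface for {brand_name}, a {category} brand. "
        "Bento-box grid with deeply rounded inner corners (~40px radius) inside a "
        "black-bordered device frame. Include a top navigation bar, a large '+' icon tile, "
        "a promo tag tile reading '20% OFF', and a 'SHOP ALL' button. "
        "Tiles: hero product shot, vertical detail shot, texture close-up."
    )

print(bento_ui_prompt("[BRAND NAME]", "[CATEGORY]"))
```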

Brand-to-era website prompt reskins modern e-commerce into Art Deco, Y2K, grunge, and more

Recraft V4 + Nano Banana 2 (Prompt template): A reusable prompt formula is shared for turning a modern brand’s e-commerce homepage into a specific historical design era (examples listed include 1920s Art Deco, 1950s retro, 1970s psychedelic, 1980s neon, 1990s grunge, Y2K), while keeping products “modern,” according to the prompt formula.

The prompt formula keeps the same bento-grid structure (hero panel, model shot, texture close-up, “NEW COLLECTION,” “SHOP ALL”) but forces typography, colors, and textures to match the chosen era, which makes it more like a consistent art-direction constraint than a one-off style filter.

Midjourney SREF 790656295 targets New Yorker-style editorial fashion sketching

Midjourney (SREF): A style reference—--sref 790656295—is positioned for editorial illustration and fashion sketching: loose expressive ink lines, minimal color accents, and a “magazine illustration” feel, per the editorial sketch notes.

In the examples attached to the editorial sketch notes, the look stays intentionally unfinished (visible line energy, selective color pops), which maps well to brand illustration systems, lookbooks, and storyboards that need readability more than rendering polish.

One prompt generates a full fintech campaign board grid (hero, UI, merch, billboard)

Campaign board prompt (Fintech branding): A long-form prompt is shared to generate a multi-tile campaign grid that looks like a launch-ready brand presentation for a fintech startup—specifying an asymmetric tile layout (hero landscape tile + square; then 3 squares; then square + landscape billboard), plus required tile content (UI screens, card/product visuals, merch, lifestyle, palette, outdoor ads), as written in the full prompt text.

Swap variables: The template is structured around placeholders for brand name, primary/secondary/accent colors, app name, and optional slogan, as shown in the full prompt text.
Asset variety baked in: It explicitly calls for a hero abstract texture tile, multiple UI tiles, and a billboard tile, matching the example grid shown in the full prompt text.

Midjourney SREF 1003439084 channels French 80s/90s animation with Don Bluth influence

Midjourney (SREF): Another shared style reference—--sref 1003439084—is framed as a blend of classic European animation with French 80s/90s influences, plus some Don Bluth DNA, as described in the cartoon style notes.

The sample frames shown in the cartoon style notes read like hand-drawn animation closeups (expressive eyes, textured shading, slightly softened backgrounds), which fits character design sheets and pitch frames for an animated short.

Midjourney SREF 1301865671 maps to classic European children’s book ink + watercolor

Midjourney (SREF): A shared style reference—--sref 1301865671—targets classic European children’s-book illustration with ink linework + watercolor washes, with specific lineage callouts (Beatrix Potter, E.H. Shepard, Quentin Blake, Ronald Searle) in the style reference notes.

The examples in the style reference notes skew toward storybook staging (characters in environments, readable gestures, gentle palette), which makes it useful for consistent “picture book” pages and editorial spot illustrations where you want warmth without going fully cartoon-flat.


🖼️ Image models & design generators: faster FLUX editing and “web→editable UI” tools

Image-generation news today is about capability and speed (especially editing workflows), plus tools that convert existing web assets into editable design. Excludes pure prompt/style dumps (prompt category) and Photoshop’s Rotate Object (feature).

FLUX.2 [klein] 9B gets up to 2× faster for multi-reference edits

FLUX.2 [klein] 9B (Black Forest Labs): Multi-reference image editing is now reported as 2×+ faster with no price increase, by adding KV caching so the model can skip redundant compute on reference images; the speedup grows as you add more references, per the Speedup details update.

What actually changes in a creator workflow: Reference-heavy edits (multiple product shots, character refs, style refs) should iterate faster because the expensive “re-read the refs” work is amortized across steps, as described in the Speedup details.
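To make the amortization claim concrete, here’s a toy sketch; it is not BFL’s API or internals, and the function names and timings are invented. The point is that with caching, reference encoding is paid once per edit instead of once per denoising step, so the savings compound as you add references:

```python
# Toy model of why caching reference encodings speeds up multi-reference edits.
import time

def encode_reference(image):
    """Stand-in for the expensive per-reference encoding pass."""
    time.sleep(0.1)  # pretend encoding costs 100 ms per reference
    return f"kv({image})"

def denoise_step(latent, ref_kv):
    return latent  # placeholder for one diffusion step

def edit(references, steps=28, use_kv_cache=True):
    latent, cache = "noise", None
    for _ in range(steps):
        if use_kv_cache:
            cache = cache or [encode_reference(r) for r in references]  # encode once, reuse
            ref_kv = cache
        else:
            ref_kv = [encode_reference(r) for r in references]          # re-encode every step
        latent = denoise_step(latent, ref_kv)
    return latent

edit(["product.png", "character.png", "style.png"], steps=4)
```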

FLUX.2 [klein] 9B-KV rolls out: free API upgrade and FP8 quantized weights

FLUX.2 [klein] 9B-KV (Black Forest Labs): Black Forest Labs is positioning the faster KV-cached variant as a free upgrade for existing FLUX.2 [klein] 9B API users, while also publishing FP8 quantized weights for easier local/self-host runs, according to the Upgrade + weights note post.

API + self-hosting surfaces: The company points to updated endpoints in the API docs and a downloadable package on the Model card, with the same “multi-reference gets faster” story echoed in the Upgrade + weights note.

Web to Design: paste a URL to convert a live site into editable UI

Web to Design: A new “paste a URL” flow is being teased as a way to turn an existing website into an editable UI layout, per the Web to Design intro announcement.

It’s a direct pitch for speeding up redesign and iteration loops by starting from a real site instead of recreating components manually; the tweet doesn’t include file-format details or export targets yet, so treat capabilities as provisional until docs land.


🪄 Finishing passes: Wonder 2 upscaling + human matting for cleaner composites

Post tools today emphasize rescuing low-res sources and cleaner cutouts for comp/VFX. New vs recent reports: Wonder 2 examples get highlighted alongside continued interest in matting quality on real-world footage.

Wonder 2 upscaling gets spotlighted for rescuing ultra low-res sources

Wonder 2 (Topaz Labs): Topaz is pushing more concrete “where it shines” guidance—using Wonder 2 as a finishing pass for ultra low-res inputs where you care about legible text, believable fabric, and tight surface texture, as shown across 7 example comparisons in the example set and follow-up visuals in the scarf texture demo.

Best-fit footage: The strongest use cases called out are assets that normally fall apart under upscalers—logos/lettering, monogram patterns, knit/weave detail, and repeated textures—per the example set.
Where it lands in a pipeline: The product framing is “rescue then comp”—Wonder 2 as the last-mile detail rebuild before you matte/composite or do marketing crops, with availability noted in Topaz apps (Bloom, Gigapixel, Topaz Photo) in the scarf texture demo.

MatAnyone 2 targets cleaner human mattes on messy real-world footage

MatAnyone 2 (NTU S-Lab + SenseTime Research): A CVPR 2026-accepted release pitches more reliable human cutouts under real-world mess—fine hair/clothing boundaries plus robustness in difficult conditions—backed by a new VMReal dataset with 28K clips / 2.4M frames, as summarized in the paper overview.

Video: Matting results comparisons

Quality scoring without ground truth: The system adds a Matting Quality Evaluator (MQE) that estimates semantic/boundary quality and produces pixel-wise “reliable vs erroneous” maps, according to the paper overview.
Training/data angle: It uses online quality feedback to suppress bad regions during training and an offline selection module to improve annotation quality, per the paper overview.


🗣️ Voice economics shift: cheap transcription + small-GPU voice cloning

Voice-related posts today are about cost collapse and accessibility: drastically cheaper transcription APIs and voice cloning on lightweight hardware. This category is about standalone voice stacks (not voices embedded in video generators).

Modulate claims a 10–90× cheaper transcription API, reshaping voice app unit economics

Velma transcription API (Modulate): Modulate is pitching its Velma speech-to-text as a step-change in recurring infra cost—positioned as 10–90× cheaper than common STT vendors for voice agents, meetings, and call-center products, per the cost vs WER chart (which plots cost per 1,000 minutes against average word error rate) and the launch quote.

What creatives feel immediately: If the price claims hold, “always-on” workflows (daily podcast transcription, dailies logging, searchable interview archives, docu-series stringouts) move from “budget line item” to “default feature,” which is why the cost vs WER chart frames it as a hidden cost getting “nuked.”
Benchmark framing to watch: The posted chart compares Velma against services like Deepgram, AssemblyAI, ElevenLabs Scribe, Google, and OpenAI Whisper-family baselines, as shown in the cost vs WER chart; there’s no raw eval artifact in the tweets, so treat the exact deltas as promotional until you can reproduce on your own audio.
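A back-of-envelope sketch with made-up rates (the tweets share a chart, not list prices) shows why a 10–90× gap changes which transcription features you leave on by default:

```python
# Hypothetical unit economics; rates below are invented for illustration only.
minutes_per_month = 20_000            # e.g. always-on transcription for a small studio
incumbent_rate = 0.0060               # hypothetical $/minute for a typical STT vendor
claimed_rate = incumbent_rate / 10    # low end of the claimed 10-90x price gap

print(f"incumbent: ${minutes_per_month * incumbent_rate:,.0f}/mo")
print(f"claimed:   ${minutes_per_month * claimed_rate:,.0f}/mo")
```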

LuxTTS claims voice cloning from 3 seconds of audio on a 4GB GPU

LuxTTS (voice cloning): A LuxTTS release is being shared as removing the “you need hosted voice” constraint by claiming you can clone a voice from ~3 seconds of audio on a 4GB GPU, as stated in the LuxTTS claim.

Why it matters for storytellers: This points toward local or edge-friendly voice pipelines where character-voice iteration happens on modest hardware (or cheap GPU instances) instead of per-minute hosted pricing.

The tweets don’t include a demo clip, model card, or quality samples, so fidelity, accent robustness, and safety/consent guardrails aren’t evidenced here beyond the headline claim in the LuxTTS claim.


🧱 3D + game-ready assets: text→character pipelines and AI-native game tooling

3D-related tweets today cluster around turning prompts/images into usable 3D assets and game characters, plus creator-facing game tooling visibility at GDC. Excludes pure video generation (video category).

Text-to-playable character in UE5 using fal + Hunyuan 3D v3.1 + auto-rig

fal + Hunyuan 3D v3.1 (workflow): A shared pipeline shows an end-to-end “prompt to playable character” loop for Unreal Engine 5—text → 3D generation with Hunyuan 3D v3.1 → automatic rigging/animation—framed as being built on fal, per the UE5 character pipeline.

This is a practical template for game-ready output because it explicitly includes the painful step most demos skip (rigging/animation), not only mesh generation.

Meshy caps its GDC week with “AI-native games” positioning and Booth 941 demos

Meshy (Meshy AI): Following up on GDC talk (AI-native games vision), Meshy posted a high-attendance session recap and flagged that tomorrow is the last day to catch them at Booth 941, per the GDC booth reminder.

Video: GDC crowd and stage

The crowd shots and “Part II: Building an AI-Native Game That’s Actually Fun” slide in the GDC booth reminder read like a visibility signal: 3D-gen isn’t only “asset creation” anymore; teams are pitching full game loops as the next proving ground.

Wonder 3D inside Autodesk Flow Studio as a “starter model” generator

Autodesk Flow Studio + Wonder 3D: Flow Studio highlighted a studio workflow where Wonder 3D generates an initial 3D model from a prompt to accelerate early asset concepting—shifting time from “build from scratch” to refinement, as described in the podcast workflow note.

The key creative angle is the intended role: not final meshes, but fast first-pass geometry you can iterate on in a normal DCC/game pipeline.


💻 Creator infra knobs: spend caps and on-device LLM runtimes for mobile apps

This bucket is for practical infrastructure changes creators/builders feel immediately: keeping API bills bounded and running models locally on consumer devices. New today: Gemini API spend caps and a React Native on-device LLM binding aimed at 4GB-class phones.

Gemini API ships spend caps to bound runaway bills

Gemini API (Google): Spend caps are rolling out “starting today,” giving builders a hard budget guardrail inside Gemini API billing controls, as announced in the Spend caps launch.

Video: Spend cap setting UI

Operational detail: Caps can take “up to a 10 minute delay” to take effect, and you’re still responsible for any overage during that window, per the Spend caps launch.
What’s next: Google says email notifications when you hit caps are “shortly rolling out,” and the feature is labeled experimental with a direct request for feedback in the Spend caps launch.

For creator tooling that runs unattended (batch image/video jobs, agents, background renders), this is a concrete new knob for cost containment—though the delayed enforcement means you still need some app-side throttles if spikes matter.
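A minimal sketch of such an app-side guard, assuming you estimate per-request cost yourself; this is separate from, and in addition to, the Gemini API’s own spend caps, and none of it is Google’s API:

```python
# Simple in-process budget guard to stop spending before the provider-side cap kicks in.
import threading

class BudgetGuard:
    def __init__(self, daily_budget_usd: float):
        self.remaining = daily_budget_usd
        self._lock = threading.Lock()

    def charge(self, estimated_cost_usd: float) -> bool:
        """Reserve budget for one request; return False if it would overshoot."""
        with self._lock:
            if estimated_cost_usd > self.remaining:
                return False
            self.remaining -= estimated_cost_usd
            return True

guard = BudgetGuard(daily_budget_usd=25.0)
if guard.charge(0.04):
    pass  # safe to call the Gemini API here
else:
    pass  # queue, drop, or alert instead of spending past your own cap
```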

llama.rn runs Llama/Qwen/Mistral locally inside React Native apps

llama.rn (open source): A React Native binding around llama.cpp is being shared as a way to run LLMs on-device on “a 4GB phone,” including Llama/Qwen/Mistral, according to the On-device React Native claim.

Acceleration + offline: The post calls out GPU acceleration via Metal on iOS and Hexagon NPU on Android (experimental), plus fully offline operation, as described in the On-device React Native claim.
App-builder features: It explicitly mentions built-ins for vision/audio understanding models, tool calling, structured JSON, and parallel decoding in the On-device React Native claim.

This is one of the clearer “ship it in a mobile app” paths for local inference when you don’t want per-request API costs or network dependency.


🧭 Gemini expands into everyday apps: Maps “Ask Gemini” and creator-facing Google surfaces

Platform/hub news today is mostly Google shipping Gemini into consumer and creator-facing touchpoints. New vs prior days: Google Maps is positioned as a decade-scale upgrade with conversational ‘ask while driving’ behavior and Immersive Navigation visuals.

Google Maps rolls out Gemini-powered “Ask” plus Immersive Navigation driving

Google Maps (Google): Google is framing this as its biggest Maps upgrade in “over a decade,” centered on Ask Gemini (including personalization) and a new Immersive Navigation driving layer with clearer, more cinematic 3D guidance, as announced in the Google retweet and demoed in the feature walkthrough.

Video: Ask Gemini and immersive view demo

The practical shift for creatives on-the-go is that Maps is being positioned less like a directions app and more like a context-aware assistant for scouting, recs, and planning inside the same surface—see the Turkish breakdown’s “prompt-based search” and “review synthesis” framing in feature list. Google’s own driving UI emphasis shows up in the Immersive Navigation post, which spotlights readability and “intuitive guidance” rather than only route lines.

The new “ask while driving” Maps pattern: natural language → contextual POI shortlist

In-car prompting (Google Maps): The most repeatable demo pattern is conversational queries mid-drive (not keyword search) that return an in-context shortlist—e.g., “I’m thinking about finding some really good tacos,” shown in the tacos while driving demo.

Video: Natural-language search demo

Creators can treat it like lightweight location scouting: ask for food/activities with constraints and get map-native suggestions instead of tab-hopping, which is the core behavior highlighted in the family restaurant query demo. The longer Turkish explainer in complex query example suggests multi-constraint prompts (charging + coffee + meal timing) as the intended “complex query” use, which is a clearer prompt template than generic “near me” searches.

Google AI Studio seeks Android early-access testers ahead of Google I/O

Google AI Studio (Google): A Google PM is openly recruiting users to help ship Google AI Studio on Android ahead of Google I/O, with an application link shared in the Android invite post.

This is a creator-relevant distribution signal: AI Studio is where a lot of Gemini experimentation happens (prompting, model selection, prototyping), and moving it onto phones suggests faster “idea to test” loops away from the desktop, especially for capture-heavy workflows (location scouting, reference gathering, quick iterations). No feature list or pricing changes are stated in the tweet, so treat this as access/rollout logistics rather than a capabilities announcement.

DeepMind’s Platform 37 includes a public AI Exchange for exhibitions and education

Platform 37 (Google DeepMind): DeepMind announced its new London building name—Platform 37—and said it will open The AI Exchange, a dedicated public space for free exhibitions, events, and educational programming, as described in the building announcement and detailed in the blog post.

For creators, this is less about a product surface and more about an on-the-ground ecosystem node (talks, demos, workshops, exhibitions) where DeepMind is explicitly trying to make AI exploration public-facing rather than conference-only, per the same building announcement.


🧰 Open-source builder drops for creatives: learn LLMs, diagram from screenshots, better code review

Coding/tooling threads today skew toward repos and plugins that let creators ship their own AI tools faster (and cheaper). New vs prior reports: a surge of ‘replace paid course’ repos + a screenshot→editable diagram tool getting traction.

Edit Banana: screenshot → editable Draw.io XML via segmentation + OCR passes

Edit Banana: A repo called Edit Banana is being shared as a “diagram rescue” workflow: upload a screenshot of a flowchart/architecture diagram and get back editable .drawio XML, per the Feature description. The pitch is eliminating redraw work when you only have a raster image.

The thread claims a pipeline of SAM 3 shape segmentation plus multimodal “multi-pass” parsing and OCR-to-text/LaTeX extraction, as described in the Feature description. The README screenshot also shows some metadata worth double-checking (it displays an “AGPL-3.0 license” header while also showing an “Apache 2.0” badge), which may matter for anyone planning commercial use.
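For a sense of the output side, here’s a hedged sketch that emits a minimal .drawio file from already-detected boxes. The schema is written from memory of Draw.io’s mxGraph XML (verify against a file exported from Draw.io), and it is not Edit Banana’s actual code; the segmentation/OCR steps are assumed to have already produced the labeled boxes.

```python
# Emit a minimal Draw.io (mxGraph XML) file from (label, x, y, width, height) boxes.
from xml.sax.saxutils import escape

def boxes_to_drawio(boxes):
    cells = ['<mxCell id="0"/>', '<mxCell id="1" parent="0"/>']
    for i, (label, x, y, w, h) in enumerate(boxes, start=2):
        cells.append(
            f'<mxCell id="{i}" value="{escape(label)}" '
            f'style="rounded=1;whiteSpace=wrap;html=1;" vertex="1" parent="1">'
            f'<mxGeometry x="{x}" y="{y}" width="{w}" height="{h}" as="geometry"/></mxCell>'
        )
    model = f'<mxGraphModel><root>{"".join(cells)}</root></mxGraphModel>'
    return f'<mxfile><diagram name="Page-1">{model}</diagram></mxfile>'

with open("recovered.drawio", "w") as f:
    f.write(boxes_to_drawio([("Start", 40, 40, 120, 60), ("Process", 240, 40, 120, 60)]))
```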

Maxime Labonne’s “LLM Course” repo packages a full, free LLM curriculum plus one-click Colab notebooks

LLM Course (Maxime Labonne): A GitHub “LLM Course” repo is being shared as a free, structured path from fundamentals through fine-tuning, quantization, and deployment, with “one-click” Colab notebooks called out in the Course breakdown. It’s framed as practical builder education, not just theory. Short version: it’s a curriculum plus runnable recipes.

The post highlights three tracks—fundamentals, “LLM Scientist,” and “LLM Engineer”—and name-drops workflows creators actually use: fine-tuning Llama with Unsloth, merging models with MergeKit without a GPU, and exporting common quant formats (GGUF/GPTQ/EXL2/AWQ) in Colab, as described in the Course breakdown. It also mentions an Apache 2.0 license and “no local GPU required,” per the same Course breakdown.

rasbt’s “LLMs-from-scratch” repo resurfaces as the build-a-GPT-by-hand reference

LLMs-from-scratch (rasbt): The “LLMs-from-scratch” repo is getting resurfaced as a concrete learn-by-implementation path for building a GPT-style model in PyTorch, with the post citing 85.2K stars and 12.9K forks in the Repo popularity note. It’s positioned as a way to understand internals (tokenization, attention, pretraining, finetuning) by writing the pieces.

The same thread claims the repo extends beyond a basic GPT build into modern components (for example KV cache, MoE, and preference/alignment variants), as listed in the Repo popularity note, with the canonical reference being the repo itself in the GitHub repo.
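As a taste of the “modern components” the repo covers, here’s a minimal NumPy sketch of attention with a growing KV cache during decoding; shapes and naming are illustrative, not the repo’s code.

```python
# Single-head attention where past keys/values are appended instead of recomputed.
import numpy as np

def attend(q, K, V):
    scores = q @ K.T / np.sqrt(q.shape[-1])   # (t,) attention scores over cached tokens
    weights = np.exp(scores - scores.max())
    weights /= weights.sum()
    return weights @ V                         # weighted sum of cached values

d = 8
K_cache = np.empty((0, d))
V_cache = np.empty((0, d))

for step in range(4):                          # stand-in autoregressive decode loop
    q = np.random.randn(d)                     # query for the newest token
    k, v = np.random.randn(d), np.random.randn(d)
    K_cache = np.vstack([K_cache, k])          # append to cache instead of re-encoding history
    V_cache = np.vstack([V_cache, v])
    out = attend(q, K_cache, V_cache)
```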

Hugging Face plugin for Cursor brings datasets, evals, and training into the IDE

Cursor × Hugging Face plugin: A post claims a Hugging Face plugin for Cursor enables creating datasets, running evals, and even training models directly inside the IDE, per the Plugin capability note. It’s pitched as collapsing the loop between code, data, and evaluation into one workspace.

No setup details or supported backends are shown in the tweet. That’s the missing piece.

Qodo AI claims higher-recall code reviews than Claude at a lower price point

Qodo AI (Qodo): A thread claims Qodo AI outperforms Claude on code review, citing 19% better recall, 12% higher F1, and being 10× cheaper, as stated in the Metric claims. It’s framed as “PR hygiene” tooling: catch more real issues without generating noisy false alarms.

A follow-up post emphasizes “extended mode” as the reason it finds more issues while staying precise, according to the Extended mode note. There’s no public benchmark artifact linked in the tweets. That’s notable.


📚 LLMs as research & thinking partners: Claude prompts, interactive charts, and ideation hacks

Storytellers/designers are using LLMs less for ‘writing for you’ and more for structured thinking: literature review prompts, interactive diagrams, and rapid market/idea reconnaissance. New today: concrete Claude prompt packs and in-chat interactive chart creation demos.

Claude prompt pack to turn paper dumps into a structured lit review workflow

Claude (Anthropic): A widely shared prompt pack frames Claude as a “research assistant” by turning a folder of papers into a repeatable workflow—catalog each paper’s core claim, cluster by assumptions, surface contradictions, extract methods, and end with a future research agenda, as laid out in the Prompt pack intro and expanded through the Future agenda prompt.

Intake + landscape mapping: The pack starts with “Don’t summarize. Map the landscape.” to force paper-by-paper extraction before synthesis, per the Prompt pack intro.
Gap + contradiction mining: Dedicated prompts ask for direct conflicts and “missing variables” across the set (useful for positioning your own thesis/story bible as “the next step”), as shown in the Future agenda prompt.

It’s framed as technique, not a Claude feature drop—value comes from the sequencing and constraints in the prompts.

Claude chats can generate interactive charts inside the thread

Claude (Anthropic): A Turkish demo shows generating an interactive chart directly in the Claude conversation—prompt → chart appears and animates/updates in place, as demonstrated in the Interactive chart demo.

Video: Interactive chart generated in chat

The practical creative angle is that “thinking artifacts” (charts/diagrams) can live next to the text you’re iterating on—so a research outline, budget, audience segments, or story structure can be inspected and revised without exporting to a separate tool, per the Interactive chart demo.

Use LLMs to find the specific workflow users gave up fixing

Idea validation with ChatGPT: A founder-style tactic compresses discovery by feeding an LLM a corpus of negatives—“150 one‑star reviews,” category complaint threads, and “6 failed startup post‑mortems”—then asking for the specific workflow people have “silently given up on fixing,” plus the real reasons prior founders failed, as described in the Validation workflow story.

It also includes a concrete outreach loop—generate “40 versions” of a cold email whose first line nails a customer’s day and reportedly gets “31 people” to reply, leading to “3 paying pilots” in month one per the Validation workflow story.


📈 Marketing & growth tactics with AI: synthetic influencers, pitch automation, and strategy memos

Marketing-focused posts today center on AI-native customer acquisition patterns and operational leverage (not just pretty outputs). New today: a detailed synthetic influencer psychology breakdown plus strategy notes for ‘pre‑AI’ companies transitioning.

Agencies are automating the classic “free ad audit” into a scalable pitch deck

Ad-audit pitch automation (Firecrawl + Apify + Gemini 2.5 Pro + Gamma): An agency-style outbound play got productized into a pipeline that takes a brand URL plus a Meta Ads Library link and outputs a branded sales deck; the creator claims it replaces a 2–3 hour manual audit with ~5 minutes of automation, enabling “10 prospects a day instead of 2 a week,” as described in the Workflow breakdown.

Video: Audit-to-deck pipeline

Data capture: Firecrawl pulls positioning/messaging from the site while Apify scrapes active Meta ads (images, videos, carousels), per the Workflow breakdown.
Analysis and packaging: Gemini 2.5 Pro runs the creative audit and Gamma formats it into a deck with real screenshots, as shown in the Workflow breakdown.

This is positioned as a way to turn “free audits” from a time sink into a repeatable acquisition funnel, with the deck acting as the deliverable.

LLM-assisted market research: turn competitor complaints into a problem map and outreach copy

Customer discovery with ChatGPT: A founder-style workflow uses an LLM as a synthesis engine by ingesting “150 one-star reviews,” Reddit complaint threads, and “6 failed startup post-mortems,” then asking for the workflow users have “silently given up on fixing,” followed by prompts that generate cold emails; the post claims 40 email variants sent with 31 replies and 3 paying pilots by month one in the Workflow story.

Question design: The key move is framing around a specific day-in-the-life pain (“their Tuesday afternoon”) rather than market sizing, as written in the Workflow story.

It’s a sales/validation tactic built around better inputs and sharper prompts, not new model capability.

Synthetic influencer tactic: engineer an identity gap that mirrors the viewer

Tom Rhoe synthetic influencer pattern: A thread breaks down an AI fitness persona where the profile photo is “who you could become,” while the actual videos show “where you are right now”; the post claims 2.7M likes on TikTok and cites a 16M-view example, framing the emotional targeting as the main conversion lever in the Persona breakdown.

Video: Bathroom flex clip

Creative mechanism: The “aspirational avatar” vs “relatable struggle” contrast is presented as the hook that keeps middle-aged scrollers watching, per the Persona breakdown.
Funnel placement: The post explicitly calls out a fitness app link in the bio as the monetization capture point, as noted in the Persona breakdown.

No tooling details are provided, but the tactic is described as a repeatable template for synthetic influencer design.

Strategy memo for pre‑AI companies: stop over-engineering around models and restructure teams

Pre‑AI → AI transition memo: A post argues companies should treat models as the “point of economic diffusion” (if models get 3× better, the customer should get 3×+ value) and warns against over-engineering compensations that may have a short shelf life as context windows expand, as outlined in the Transition notes.

Org design claim: Instead of “AI improves each person,” it suggests putting a model in charge of an entire business unit and having humans handle exceptions, per the Transition notes.
Product surface split: It predicts two surfaces—a traditional UI plus a “terminal-like” self-modifying surface for cross-functional ambiguity—described in the Transition notes.

This reads as a north-star framing for how AI-native operations could change go-to-market pace and internal execution, rather than a tool release.


🔬 Research radar that affects creative tools: agents, spatial QA, and efficiency primitives

A light but relevant set of papers and research posts today: agent training loops, multi-agent video QA, and GPU efficiency work that could roll into creative stacks. This category stays paper-focused (not product how-tos).

OpenClaw-RL trains agents from “next-state signals” across chat, tools, and GUIs

OpenClaw-RL (Princeton AI Lab): A new RL framework proposes training an agent “simply by talking,” using next-state signals (user replies, tool outputs, terminal/GUI state changes) as the online learning source, as shown in the paper card and described on the paper page. This targets agents that learn from real interactions rather than only prebuilt reward functions.

Training loop design: The abstract highlights asynchronous training (serve requests while updating policy) plus PRM-judge rewards and hindsight-guided distillation, per the paper card.
Creative-tool relevance: If the approach holds up, it maps cleanly onto “teach-by-correction” workflows for tool-using agents (editors, compositors, DAW assistants) where the most available supervision is what changed in the app after a command.

Flash-KMeans claims exact GPU k-means with up to 17.9× speedups on H200

Flash-KMeans (UC Berkeley / Song Han lab): A new paper argues k-means can become an online GPU primitive via kernel-level optimizations that reduce I/O bottlenecks and atomic contention; it reports up to 17.9× speedup on NVIDIA H200, as summarized in the paper card and detailed on the paper page.

Why this shows up in creative stacks: Faster exact clustering matters anywhere you bucket high-dimensional embeddings—reference image sets, stock libraries, sound-effect catalogs, or “find similar shots” pipelines.
Claims to treat as provisional: The tweet cites comparisons vs cuML and FAISS in the paper card, but the dataset/task specifics aren’t surfaced in these tweets.
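For orientation, this is the computation being accelerated: exact Lloyd’s-style k-means over embedding vectors, shown here as a plain NumPy CPU sketch (not the paper’s GPU kernels; the embedding data is synthetic):

```python
# Exact k-means (Lloyd's iterations) over high-dimensional embeddings.
import numpy as np

def kmeans(X, k, iters=20, seed=0):
    rng = np.random.default_rng(seed)
    centers = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(iters):
        # exact nearest-center assignment via squared Euclidean distance
        d = (X ** 2).sum(1, keepdims=True) - 2 * X @ centers.T + (centers ** 2).sum(1)
        labels = d.argmin(1)
        # recompute each center as the mean of its assigned points
        centers = np.stack([X[labels == j].mean(0) if (labels == j).any() else centers[j]
                            for j in range(k)])
    return labels, centers

embeddings = np.random.randn(5_000, 512).astype(np.float32)  # stand-in for CLIP-style vectors
labels, centers = kmeans(embeddings, k=8)
```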

“Intelligent AI Delegation” gets framed as a new direction for agent coordination

Agent delegation (Google DeepMind): A circulating claim says DeepMind released a paper titled Intelligent AI Delegation that could reshape how agents coordinate and hand off work, per the paper mention RT. The tweet contains no abstract, link, or eval artifact in this dataset, so the concrete mechanism and scope can’t be verified from today’s sources alone.

This lands as a coordination signal: creators are watching for systems where one agent can reliably break work into subtasks, assign them, and integrate outputs without constant manual supervision.

MA-EgoQA benchmarks question answering across multiple agents’ POV videos

MA-EgoQA (KAIST AI): A new benchmark targets question answering over multiple long-horizon egocentric video streams from several embodied agents at once—positioned as “system-level comprehension,” according to the paper card and the linked paper page. It reports 1.7k questions spanning coordination and multi-perspective reasoning categories.

What it measures: The paper card lists categories including social interaction, task coordination, theory-of-mind, temporal reasoning, and environmental interaction, as shown in the paper card.
Why creatives might care: Multi-POV understanding is adjacent to problems like tracking continuity across cameras, characters, and timelines—especially when “who did what, when” matters more than single-clip captioning.

DeepMind marks 10 years of AlphaGo with a podcast on Move 37 and algorithmic discovery

AlphaGo 10-year podcast (Google DeepMind): DeepMind published a long-form conversation with Thore Graepel and Pushmeet Kohli reflecting on the original match, “Move 37,” and how game-playing research translated into broader algorithmic discovery work, as outlined in the podcast thread. This follows up on AlphaGo 10-year note, which framed AlphaGo as a foundation for scientific-method style AI.

Video: Podcast clip with chapters

What’s actually in the episode: The on-post chapter list includes segments on community reaction, “never before seen footage,” matrix multiplication, and how to verify discoveries, per the podcast thread.

The tweet doesn’t include a paper link; it’s positioned as a research-to-real-world methods retrospective.


📅 Deadlines & programs: Kling challenge countdown, hackathons, and GDC booth windows

Time-sensitive items today are mainly creator challenges and conference presences. This includes clear timelines (submission windows) rather than general platform news.

Kling Motion Control 3.0 Challenge deadline is Mar 18 (UTC-8)

Kling Motion Control 3.0 Challenge (KlingAI): Kling is running a creator challenge with submissions due by Mar 18, 2026 at 23:59 UTC-8, with rewards advertised as $30K USD + 300 million credits, as laid out in the Challenge rules poster and reiterated in the Challenge still on reminder.

Entry requirements: Posts must be made with Motion Control 3.0, keep the KlingAI watermark, include #KlingMotionControl3, include “Created by KlingAI,” tag @kling_ai, and DM your Kling UID, per the Challenge rules poster.
Timeline & payout window: Submission and like-count deadline both land on Mar 18, with reward distribution slated for Mar 19–Apr 1, per the Challenge rules poster.
Like thresholds & platform split: The poster distinguishes TikTok/Instagram vs X thresholds (including minimum-like requirements for top prizes), as shown in the Challenge rules poster.

Hugging Face is sponsoring a Gemini hackathon with Cerebral Valley

Gemini hackathon (Hugging Face x Cerebral Valley): Hugging Face says it’s sponsoring a Gemini hackathon with Cerebral Valley “this weekend,” positioning it as an in-person/real-time build window for Gemini-based prototypes, according to the Sponsor announcement.

No schedule, prize details, or submission rules are included in the tweet, so the operational details (themes, judging, and whether it’s model/API-specific) remain unspecified beyond the weekend callout in the Sponsor announcement.

Meshy flags GDC’s last day: catch demos at Booth 941

Meshy at GDC 2026 (MeshyAI): Meshy posted a last-day reminder to “catch us at Booth 941 before we pack up,” following a packed session where they shared their “AI-native games” vision, as stated in the Booth 941 last day post.

Video: Crowded talk recap

On-floor context: The post frames this as the final chance during the show to see whatever they’re demoing live (plus whatever pipeline they’re pitching for AI-native game creation), per the Booth 941 last day post.


🛡️ Policy & trust: Oscars AI eligibility, responsible release talk, and privacy-first creative platforms

Trust/policy posts today focus on how AI use is judged and disclosed in mainstream media plus platform positioning around data ownership. New today: explicit Oscars guidance that treats AI as a tool, and renewed emphasis on creator privacy models.

Academy CEO clarifies how AI affects eligibility for the 2026 Oscars

Oscars eligibility (Academy): The Academy’s CEO Bill Kramer reiterated that films using AI can still be eligible as long as human creative authorship remains the dominant factor, per a March 12 interview recap in the eligibility summary.

Branch-by-branch reality: Kramer reportedly points to uneven consensus inside the Academy—VFX being more accepting of AI as a production tool, while writers/actors remain more wary—according to the eligibility summary.
What’s actually changing now: The guidance is framed as continuity with rules established in April 2025, with AI treated as a tool and eligibility reviewed through the normal branch process, as described in the eligibility summary.

The practical signal is that “disclosure + authorship dominance” is still the evaluative lens, rather than a blanket ban.

STAGES claims a privacy-first stance: import LLM histories into encrypted storage

CUE / STAGES (STAGES.ai): STAGES is explicitly arguing that platforms “want to make money off your AI data,” and counters by highlighting an encrypted workflow that imports and stores chat histories from other LLMs under user control, as described in the privacy stance thread.

Concrete product surface: The screenshot shows a “MEMORX • External LLM Data” area with language about chunking, analyzing, encrypting, and storing uploaded exports, per the privacy stance thread.

This is less about model capability and more about data ownership positioning as creative stacks get more multi-model and history-dependent.

Runway frames Characters as immersive interaction—and flags misuse risk

Runway Characters (Runway): Following the release of Runway Characters, Runway is positioning the feature as a new kind of real-time, immersive interaction with AI, while explicitly acknowledging misuse risk and describing its “build responsibly” approach in the responsible release note.

The post is light on concrete policy mechanics (no specific gating details in-text), but it’s a clear signal that Runway expects Characters to be used for more than short clips—i.e., interactive experiences that feel closer to simulated environments than traditional generation pipelines.


🎞️ What creators shipped: trailers, shorts, and AI-native series worlds

Finished-work posts today include trailers, game/world updates, and longer AI-generated shorts—useful as benchmarks for pacing, tone, and what’s shippable now. Excludes tool challenges/events (events category).

BUNNYNJA: The Final Hunt claims ~9 minutes at Grok Premium-level spend

BUNNYNJA: The Final Hunt (Grok): Creator posts a ~9-minute short film and claims it’s “90% generated with Grok,” framing cost as roughly “two months of Grok Premium” (stated as $60) versus higher per-credit platforms, according to the project breakdown.

Video: Nine-minute action short

What’s useful as a benchmark: It’s a long-form pacing sample (minutes, not seconds) that shows whether your toolchain can sustain character/action continuity across many beats, as shown in the project breakdown.
Content note: The creator flags “graphic violence” and positions it as a sequel to an earlier BUNNYNJA short, per the project breakdown.

Route 47 Archive drops a “recovered tape” episode for ARG-style worldbuilding

Route 47 Archive (Series world): A new “RECOVERED” entry leans hard into faux-documentary/ARG language—redactions, warnings, and in-universe institutional voice—while delivering a short trespass/horror scene that advances lore without exposition dumps, as shown in the recovered tape post.

Found-footage trespass episode
Video loads on view

The format is a solid reference for serial storytelling with minimal on-screen complexity (few actors, one location), where the written framing does a lot of the narrative work, per the recovered tape post.

DISCLOSURE DAY trailer is a clean reference cut for sci‑fi teaser pacing

DISCLOSURE DAY (Trailer cut): A new DISCLOSURE DAY trailer dropped and reads like a tight “mystery object + keycard + escalating reveals” teaser structure—useful as a reference for how to pace a 60–150s sci‑fi promo with fast inserts, title-card timing, and a single recurring prop, as shown in the trailer post.

Keycard-to-title teaser cut
Video loads on view

The cut is also a practical benchmark for how much perceived production value you can get from controlled lighting, macro inserts, and a final “sparking artifact” beat without over-explaining the premise, as seen in the trailer post.

DUNE: AWAKENING clip highlights the classic sandworm scale reveal beat

DUNE: AWAKENING (Clip): A short DUNE: AWAKENING clip posted as “DUNE 3” centers on the sandworm eruption/reveal—one of the clearest “scale sells” beats to study when you’re trying to land threat, distance, and impact in a few seconds, as shown in the sandworm clip.

Sandworm eruption reveal
Video loads on view

The moment is a compact reference for blocking a lone figure vs. a massive creature, then cutting quickly to a title card, per the sandworm clip.

