SAMA is a new 14B open model for instruction-guided video editing that separates semantic anchoring from motion alignment and claims state-of-the-art results among open models. Track it if you need edits that change objects or style without wrecking motion.

SAMA is pitched as a general video editor for object replacement, addition, removal, and style transfer. The core idea is sparse anchor frames: the model predicts semantic tokens and video latents at key frames, then uses a separate motion-focused module to carry those edits through time without the usual flicker or drift.
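To make the factorization concrete, here is a minimal sketch of the anchor-then-propagate idea as described above. The module names, shapes, and internals are assumptions for illustration, not SAMA's actual architecture: a semantic stage edits only the sparse anchor-frame latents given the instruction, and a separate motion stage conditions the full-sequence latents on those edited anchors.

```python
# Illustrative sketch only; names and shapes are assumptions, not SAMA's code.
import torch
import torch.nn as nn

class AnchorEditor(nn.Module):
    """Stand-in for the semantic-anchoring stage: edits a few key frames."""
    def __init__(self, dim=64):
        super().__init__()
        self.proj = nn.Linear(dim, dim)

    def forward(self, anchor_latents, instruction_emb):
        # Apply the instruction only to the anchor-frame latents.
        return self.proj(anchor_latents + instruction_emb)

class MotionAligner(nn.Module):
    """Stand-in for the motion-alignment stage: carries edits across frames."""
    def __init__(self, dim=64):
        super().__init__()
        self.temporal = nn.GRU(dim, dim, batch_first=True)

    def forward(self, edited_anchors, source_latents):
        # Condition the full-sequence latents on the edited anchors so the
        # original dynamics drive how the edit spreads through time.
        ctx = edited_anchors.mean(dim=1, keepdim=True)
        out, _ = self.temporal(source_latents + ctx)
        return out

# Toy usage: a 16-frame clip with anchor frames every 8th frame.
video_latents = torch.randn(1, 16, 64)
anchor_idx = torch.arange(0, 16, 8)
instruction = torch.randn(1, 1, 64)

edited = AnchorEditor()(video_latents[:, anchor_idx], instruction)
result = MotionAligner()(edited, video_latents)  # edited clip latents
```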
The paper describes a two-stage training setup. First, SAMA learns motion from raw video with restoration-style pretext tasks including cube inpainting, speed perturbation, and tube shuffling. Then it is fine-tuned on paired editing data for instruction following. That separation is the creative hook: it is designed for prompts that swap subjects or restyle scenes while keeping camera movement and scene dynamics intact, which is also what the supporting writeup highlights in its demo overview.
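The three named pretext tasks are restoration-style corruptions of raw video. The sketch below shows one plausible reading of each; the exact crop sizes, sampling strategy, and loss targets in the paper may differ. Each function takes a clip tensor (frames, height, width, channels) and returns a corrupted copy the model would learn to restore.

```python
# Rough, assumed interpretations of the named pretext tasks, not the paper's
# exact definitions.
import torch

def cube_inpainting(clip, t=4, s=32):
    """Zero out a random spatio-temporal cube for the model to fill back in."""
    out = clip.clone()
    T, H, W, _ = clip.shape
    t0 = torch.randint(0, T - t + 1, (1,)).item()
    y0 = torch.randint(0, H - s + 1, (1,)).item()
    x0 = torch.randint(0, W - s + 1, (1,)).item()
    out[t0:t0 + t, y0:y0 + s, x0:x0 + s] = 0
    return out

def speed_perturbation(clip, factor=2):
    """Subsample frames to alter playback speed; restoring recovers timing."""
    return clip[::factor]

def tube_shuffling(clip, s=32):
    """Shuffle one spatial tube along time, breaking its motion continuity."""
    out = clip.clone()
    T, H, W, _ = clip.shape
    y0 = torch.randint(0, H - s + 1, (1,)).item()
    x0 = torch.randint(0, W - s + 1, (1,)).item()
    perm = torch.randperm(T)
    out[:, y0:y0 + s, x0:x0 + s] = clip[perm, y0:y0 + s, x0:x0 + s]
    return out

clip = torch.rand(16, 128, 128, 3)  # toy 16-frame clip
corrupted = tube_shuffling(speed_perturbation(cube_inpainting(clip)))
```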
Topview added Seedance 2.0 to Agent V2, pairing multi-scene generation with a storyboard timeline and Business Annual access billed as 365 days of unlimited generations. That moves longform video workflows toward editable sequences instead of stitched clips.
Creators are moving from V8 calibration complaints to darker film-still scenes, fashion shots, and worldbuilding tests, with ECLIPTIC remakes showing stronger depth and lighting. Retest saved SREF recipes if you rely on V8 for cinematic ideation.
A shared workflow converts GTA-style stills into photoreal images with Nano Banana 2, then animates them in LTX-2.3 Pro 4K using detailed material, skin, vehicle, and camera prompts. Try it for trailer-style previsualization if you want more control at lower cost.
Shared Nano Banana 2 workflows now cover turnaround sheets, distinctive facial traits, and photoreal rerenders that keep the framing of a reference image. Use one prompt grammar for concept art, editorial portraits, and animation prep.
Baidu, Tsinghua, and Zhejiang University release SAMA: factorized semantic anchoring and motion alignment for instruction-guided video editing. It balances semantic modification and motion preservation through sparse anchor frames and motion-centric pretext tasks, trained in two stages.
SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing. Paper: huggingface.co/papers/2603.19…