§ Research notes18 essays

What the field already knows.

Long-form essays on multi-agent film production, character consistency, cinematic camera control, and the agent-skill layer beneath AI-directed video. Written for practitioners, not reviewers.

№ 017 min
The AI Film Crew: How Multi-Agent Systems Are Replacing Solo Prompting
64.4% to 79.2%.
multi-agent film productionMulti-agent
№ 026 min
Camera Templates Beat Prose: The Shot Vocabulary That Stops Framing Drift
Write "medium shot, low angle, warm key light" in ten consecutive prompts and you'll get ten different framings. The model interprets "medium" differently each time. "Low angle" might mean 15 degrees or 45. "Warm" could be golden hour or tungsten. You know this already if you've tried to maintain visual consistency across a multi-shot AI video. Every shot re-rolls the dice on what your camera language means.
camera template library AICamera & previz
№ 037 min
The Identity Drift Problem: 7 Architectures for Keeping Characters Consistent
Ask 100 AI filmmakers what's broken and they'll tell you the same thing. A CVPR 2025 workshop survey (arXiv 2504.08296, Zhang et al.) did exactly that. Character movement consistency ranked first. Camera control second. Overall character consistency third. Not generation quality. Not resolution. Not speed. Consistency.
character consistency AI videoConsistency
№ 046 min
Pre-Render Validation: The Cheapest Quality Gate You're Not Using
58% vs 25%.
AI previsualizationCamera & previz
№ 057 min
Multi-Shot Video Generation: The Technical Landscape 2024-2026
Thirty-three papers in eighteen months. That's how fast multi-shot video generation went from "interesting research direction" to "crowded field with five competing paradigms, seven benchmarks, and a $3.24 billion market projected to hit $23.54 billion by 2033" (Grand View Research, 25.4% CAGR).
multi-shot video generationSequences
№ 065 min
Discuss-Revise-Judge vs Debate-Judge-Validation: Picking Your Collaboration Pattern
Mind-of-Director uses both patterns — and uses them for different stages. That's the tell. If one pattern were universally better, they'd use it everywhere. They don't. The choice of collaboration pattern is an engineering decision with measurable tradeoffs, and the paper's own architecture is the clearest evidence for when each one fits.
multi-agent film productionMulti-agent
№ 075 min
RAG Over 440,000 Film Clips: How FilMaster Learns Camera Language
Most approaches to the camera language problem work top-down. Someone defines a vocabulary — 21 templates, a fine-tuned model, a structured prompt schema — and the system generates shots within that vocabulary. FilMaster (arXiv 2506.18899, Huang et al., KwaiVGI/Kuaishou, June 2025) works bottom-up. It built a retrieval system over 440,000 real film clips and asks: how did actual films handle this kind of scene?
cinematic language AICamera & previz
№ 084 min
The Simulated Audience: Using AI Viewers to Judge Your AI Film's Pacing
You've got four critic roles in your multi-agent pipeline — continuity, DP, performance, comprehension. They check whether the shots match, whether the camera works, whether the acting reads, whether the scene makes sense. They're all evaluating from the production side. Nobody's watching from the audience side.
simulated audience feedback AISequences
№ 095 min
Scene-Level Style Lock: Why Your Establishing Shot Should Anchor Every Frame
Background consistency improves by 21.6% when you explicitly plan for it. Character consistency improves by 9.6%. Props by 7.6%. Those numbers are from CANVAS (arXiv 2604.13452, Mondal et al., April 2026), comparing the same generation models with and without explicit continuity planning.
cross-shot consistency AIConsistency
№ 105 min
MCP for Video: The Agent Skill Layer That's Quietly Emerging
While the research papers debate multi-agent architectures for AI filmmaking, a parallel stack is assembling in the open. MCP servers — Model Context Protocol endpoints that give AI agents tool access — are showing up for video editing. Clipping. Captioning. Dubbing. Assembly. The pieces of an agentic video editing pipeline are becoming available as callable tools.
MCP video editingTooling
№ 115 min
From Single Clips to Full Sequences: The 5 Paradigms of Multi-Shot Generation
Every AI video tool generates great 10-second clips. String twelve of them together and you get a slideshow. The gap between "one good shot" and "twelve shots that feel like a film" is where the entire multi-shot generation field lives, and it's split into five fundamentally different approaches. Each makes different tradeoffs on consistency, flexibility, compute cost, and output quality.
multi-shot video generationSequences
№ 124 min
What 100 AI Filmmakers Actually Want: The CVPR Artist Survey Decoded
A hundred AI filmmakers walked into a survey and the researchers actually listened. The results (arXiv 2504.08296, Zhang et al., CVPR 2025 Workshop) are buried in an academic paper, which means the people who most need to read them — tool builders — probably haven't.
AI filmmakingMulti-agent
№ 136 min
Memory Banks for Video: How AI Remembers Characters Across Scenes
AI video models have no memory. Each shot starts fresh. The model doesn't know what your character looked like in shot 1 when it generates shot 5. Every consistency mechanism is a hack to inject memory into a memoryless system.
entity memory bank videoConsistency
№ 144 min
The $23B AI Filmmaking Market: Where Research Points and Money Flows
$3.24 billion in 2024. $23.54 billion by 2033. That's Grand View Research's estimate for the AI filmmaking market, growing at 25.4% CAGR. North America holds 40.1% revenue share. Production applications lead at 38.8%. Feature films dominate by production type.
AI film productionMulti-agent
№ 154 min
Generative Expansion: Starting from Footage, Not a Blank Prompt
The entire AI filmmaking conversation assumes you start with nothing. Type a prompt, get a video. Blank canvas to finished film.
AI video storytellingSequences
№ 165 min
MCTS for Shot Selection: Why Monte Carlo Tree Search Beats Single-Pass
Generate shot 1. Looks good. Generate shot 2. Looks good. Generate shot 3. Looks good. Assemble them. Shot 3's color grade clashes with shot 1. Shot 2's character is facing the wrong direction for the cut from shot 1 to work. The sequence fails even though every individual shot passes quality inspection.
multi-shot video generationSequences
№ 177 min
Building an AI Director Skill: From Paper to Pipeline
Thirty-three papers say multi-agent beats single-agent for film. Zero of them ship a product. This article bridges the gap — the implementation patterns for building a multi-agent AI director pipeline from the components the research describes.
AI director agentMulti-agent
№ 187 min
Cinematic Transitions Are Solved — If You Train on Film Data
Watch any AI-generated multi-shot video and you'll notice the cuts before you notice anything else. The shots might be beautiful individually. But the transitions between them feel like a slideshow — hard cuts with no editorial logic, no rhythm, no awareness of how the previous shot ended or how the next one begins. The camera doesn't "hand off" from one framing to the next. It just stops and restarts.
multi-shot video generationSequences

What the field already knows.

The AI Film Crew: How Multi-Agent Systems Are Replacing Solo Prompting

Camera Templates Beat Prose: The Shot Vocabulary That Stops Framing Drift

The Identity Drift Problem: 7 Architectures for Keeping Characters Consistent

Pre-Render Validation: The Cheapest Quality Gate You're Not Using

Multi-Shot Video Generation: The Technical Landscape 2024-2026

Discuss-Revise-Judge vs Debate-Judge-Validation: Picking Your Collaboration Pattern

RAG Over 440,000 Film Clips: How FilMaster Learns Camera Language

The Simulated Audience: Using AI Viewers to Judge Your AI Film's Pacing

Scene-Level Style Lock: Why Your Establishing Shot Should Anchor Every Frame

MCP for Video: The Agent Skill Layer That's Quietly Emerging

From Single Clips to Full Sequences: The 5 Paradigms of Multi-Shot Generation

What 100 AI Filmmakers Actually Want: The CVPR Artist Survey Decoded

Memory Banks for Video: How AI Remembers Characters Across Scenes

The $23B AI Filmmaking Market: Where Research Points and Money Flows

Generative Expansion: Starting from Footage, Not a Blank Prompt

MCTS for Shot Selection: Why Monte Carlo Tree Search Beats Single-Pass

Building an AI Director Skill: From Paper to Pipeline

Cinematic Transitions Are Solved — If You Train on Film Data