Module f-persona · Video · 8 min
Persona persistence
across frames.
Keeping a character's face, costume, and lighting consistent across hundreds of frames.
Reading time8 min
Audio-
Prerequisitesf-video-diff, 18
SourceTrack A · Gemini
§ 1
What this lesson covers.
This module is one of 42 in the curriculum. Below is the canonical interactive lesson — tabs, cards, and diagrams from the source repo, rendered inside the course shell. There is no audio narration for this module - it ships as text + interactive lesson only.
If you prefer to read first and play with the demos after, the interactive lesson sits below this section. If you'd rather hear it narrated while you scroll, hit play on the sticky audio bar at the top — or just let it autoplay.
Interactive lesson · ported from Gemini track
Click tabs to navigate · hover cards for details
THE PERSONA PROBLEMEach generation forgets
An AI music idol or a brand mascot needs to look the same in every video. The model has no memory of "this is my character" between generations. Naive text prompts ("a woman with red hair, green eyes, freckles") drift — her face changes from clip to clip, her freckles disappear, her hair turns auburn. Persona persistence is the engineering problem of locking identity across hundreds of frames and dozens of generations.
VISUAL DNAA persona is structured data, not a prompt
Treat the persona as a versioned record: face descriptors (eye color, jaw shape, freckle pattern), wardrobe rules, lighting preferences, posture defaults. Store as JSON. Every generation pulls from the same record. This is what separates a real character from a generated face — without it, every shoot starts at zero.
REFERENCE INJECTIONIP-Adapter + LoRA + DreamBooth, all at once
For maximum persistence: train a DreamBooth on 100+ shots of the persona (one-time), train a LoRA on the persona's specific lighting/aesthetic (recurring), and use IP-Adapter at inference with 3-5 reference frames pulled from the persona's library. The three techniques compound: DreamBooth locks the face shape, LoRA captures the look, IP-Adapter handles per-shot variations.
CROSS-CLIP CONSISTENCYSame persona, different shots, same identity
For multi-shot videos: render all shots in one batch with the same seed offset, use the same persona reference, and post-process with a face-consistency model that detects drift and re-conditions. Practical heuristic: if you can't tell the persona is the same across two random frames, audiences won't either — reshoot.
The canonical references for this module. External links open in a new tab.
§ NEXT
What to read next.
Use the pager below to move sequentially through the curriculum, or jump to any module from the course index. Each track has a "Prereq: ↑ foundation" callout so you can backfill anything that wasn't clear.