LocalGhost

🎬 Guide to Writing Multi-Shot Text-to-Video Prompts for Seedance 2.0

Hello Everyone! 👋If you're just getting started with Seedance 2.0 or want to improve the quality of your AI-generated videos, this guide is for you. One of the most effective ways to create cinematic, consistent, and engaging videos is by using multi-shot prompts. Instead of describing everything in a single scene, you'll learn how to structure your prompts like a filmmaker, breaking your story into clear shots that flow naturally from beginning to end.Let's dive in! 🎬Why Use Multi-Shot Prompts? Many beginners write prompts like this:"A girl sits in a cafe drinking coffee, the camera slowly moves closer."The result can be unpredictable. The camera may move oddly, character consistency may break, and the video often feels flat.The secret is simple:Think like a film director, not an image generator user.Seedance 2.0 performs much better when scenes are organized into clear, sequential shots.🎥 Think Like a DirectorBefore writing your prompt, answer these three questions:1. Who is the main character?Examples:A Japanese schoolgirlAn office workerAn astronaut2. What is happening?Examples:Waiting for someoneRunning late for schoolDriving a vintage car3. What is the sequence of events?Example:Walking down the streetChecking the timePanicking and runningThis sequence becomes your multi-shot structure.Basic Multi-Shot StructureThe simplest format:Shot 1: [Scene description] Shot 2: [Scene description] Shot 3: [Scene description]Avoid cramming everything into one scene.Let the story flow naturally from shot to shot.The Simple Seedance FormulaEvery shot should ideally contain:Location Character Action Camera Movement MoodExample:Shot 1: A young Japanese schoolgirl stands beside a quiet railway crossing on a bright morning. She gently adjusts her school bag while looking into the distance. Medium shot. Slow camera push-in. Peaceful atmosphere.The Most Reliable Shot StructureShot 1 = Establishing ShotIntroduce the location and character.Example:Shot 1: A beautiful Japanese schoolgirl stands beside a quiet railway crossing on a sunny morning. The wind gently moves her long hair. Wide shot. Slow cinematic push-in.Purpose:✅ Establish the setting✅ Introduce the character✅ Define the moodShot 2 = Action ShotThe character does something.Example:Shot 2: The girl hears a distant train horn and turns her head. She smiles softly and takes a step forward. Medium shot. The camera slowly orbits around her.Purpose:✅ Add movement✅ Increase engagementShot 3 = Payoff ShotThe key moment or ending.Example:Shot 3: The train passes behind her. She looks toward the camera and smiles warmly. Close-up shot. Gentle lens compression. Bright cinematic atmosphere.Purpose:✅ Deliver the climax✅ Leave a memorable impressionCommon Camera MovementsPush InThe camera moves closer to the subject.Slow camera push-in.Best for:Emotional momentsDramaRomancePull BackThe camera moves away from the subject.Camera slowly pulls back.Best for:EndingsRevealing environmentsOrbitThe camera circles around the subject.Camera slowly orbits around the subject.Best for:Character-focused scenesFashion shotsEmotional momentsTracking ShotThe camera follows the character.Tracking shot following her movement.Best for:WalkingRunningDrivingCrane UpThe camera rises upward.Camera cranes upward.Best for:Cinematic endingsLandscape revealsThe 15-Second Story FormulaThis structure works exceptionally well for short videos:Shot 1 = Setup Shot 2 = Development Shot 3 = PayoffExample:SetupThe character discovers something.DevelopmentThe character reacts.PayoffA surprise or conclusion occurs.ExampleTheme: Running Late for SchoolShot 1: A beautiful Japanese schoolgirl runs along a sunny residential street while looking at her wristwatch. Wide tracking shot. Bright morning atmosphere. Shot 2: She notices the time is already 9:00 AM. Her eyes widen in panic. Medium shot. The camera quickly pushes in. Shot 3: She finally reaches the school gate, breathing heavily. Then she notices a sign that says "Sunday." She freezes in confusion. Close-up shot. Comedic atmosphere.This type of structured storytelling is usually much more engaging than a single disconnected scene.Common Beginner Mistakes❌ Too Many Actions in One ShotBad:She runs, jumps, laughs, turns around, opens a door, sits down, drinks coffee, and reads a book.The model is trying to do too much at once.Better:Shot 1: Running Shot 2: Opening the door Shot 3: Drinking coffee❌ Random Location ChangesBad:Shot 1: Beach Shot 2: Space station Shot 3: Medieval castleSudden environment changes often cause character inconsistency.❌ Excessive Unimportant DetailsBad:The chair is made of imported oak wood from...Seedance cares much more about:CharacterActionCameraVisual environmentThe Golden Rules of Seedance 2.0 ✨Use 3 to 5 ShotsThis range is usually the most stable for short videos.Repeat Key Character DescriptionsDon't be afraid to mention the main character repeatedly in each shot.This helps maintain consistency.Keep Camera Instructions SimpleChoose one primary movement:Push-inOrbitTrackingPull-backAvoid stacking multiple camera movements in the same shot.Focus on One StoryA 15-second video is not a two-hour movie.One strong idea almost always performs better than ten ideas squeezed together.🎵 Audio in Seedance 2.0Besides visuals and camera movements, Seedance 2.0 can also interpret audio descriptions within your prompt. Adding audio helps create a more immersive and cinematic experience by telling the model not only what the audience should see, but also what they should hear.Basic Audio StructureA simple format looks like this:Shot 1: [Visual Description] Audio: [Sound Description]Example:Shot 1: A young woman stands beside a railway crossing on a sunny afternoon. The wind gently moves her hair. Wide shot. Slow camera push-in. Audio: Gentle wind blowing, distant train crossing bell, soft ambient city sounds.Types of Audio You Can UseAmbient SoundsAmbient sounds establish the atmosphere of the scene.Examples:Audio: Birds chirping, gentle wind, distant traffic.Audio: Ocean waves, seagulls, light sea breeze.Audio: Rainfall, distant thunder, dripping water.Sound Effects (SFX)These sounds are directly related to actions or objects in the scene.Examples:Audio: Footsteps on wet pavement.Audio: Car engine idling softly.Audio: Door creaking open.Audio: Paper rustling.DialogueDialogue can be included when a character speaks.Examples:Audio: She softly says, "What are you looking at?"Audio: He whispers, "I finally found it."Background MusicMusic helps reinforce the mood and emotion of a scene.Examples:Audio: Soft emotional piano music.Audio: Upbeat pop music.Audio: Epic orchestral soundtrack.Audio: Lo-fi chill background music.Recommended Multi-Shot FormatFor modern video generation workflows, a complete prompt often follows this structure:Shot 1: [Visual Description] Shot 2: [Visual Description] Shot 3: [Visual Description] Audio: [Audio Description]Complete ExampleShot 1: Wide establishing shot of an empty train platform at dusk after rainfall, wet ground scattered with shallow puddles reflecting dim station lights, soft blue-gray evening sky with lingering clouds, faint mist drifting in the air, subtle ambient motion from dripping water and gentle wind, atmosphere calm and slightly melancholic. Shot 2: Medium shot—A Japanese schoolgirl with long straight hair and full bangs sits alone on a bench under a dim station light, posture slightly slouched, loosely holding her phone without using it, her gaze unfocused toward the tracks, distant train headlights begin to emerge behind her, soft wind brushing her hair and uniform. Shot 3: Close-up—Her phone screen lights up, displaying a simple message: “Did you get home safely?” The soft glow illuminates her face in the dim environment, her thumb hovers above the screen, she pauses briefly, her eyes soften as a subtle emotional shift begins. Shot 4: Cutaway—A train rushes past the platform with a low rumble, wind flows through the station, her hair and skirt gently sway, reflections ripple across the puddles, streaks of moving light glide across her face, the moment feels quietly transitional. Shot 5: Close-up—She exhales slowly, her shoulders relax, a faint and fragile smile appears, she begins typing a reply, warmth subtly replaces the earlier emptiness. Shot 6: Wide shot from behind—She lowers her phone, holding that small smile, she gazes ahead for a brief moment as if gathering strength, then stands up, adjusts her bag, and walks away along the platform, her figure gradually fading into the distance. Audio: Distant train rumble, soft and low, Gentle evening wind and subtle station ambience, Light water dripping in the background, Soft notification sound when the message appears, Very light emotional piano entering toward the latter half, carrying through the ending, No dialogueAudio Tips for Seedance 2.0 ✨Keep Audio Descriptions SimpleGood:Audio: Gentle rain, distant thunder.Less Effective:Audio: The rain should sound as if it is falling at approximately...Short and clear descriptions are usually interpreted more reliably.Match Audio to the ActionIf a character is running:Audio: Fast footsteps.If a character is driving:Audio: Engine hum, road noise.If a character is speaking:Audio: Dialogue.The audio should naturally support what is happening on screen.Layer Sounds NaturallyA good cinematic soundscape often combines:Ambient + SFX + MusicExample:Audio: Ocean waves, seagulls, soft piano music.This combination creates a richer and more immersive viewing experience.That's all for this guide! 🎥Hopefully this guide gives you a solid foundation for creating better multi-shot prompts in Seedance 2.0. Don't be afraid to experiment with different shot sequences, camera movements, and storytelling styles. The more you practice, the more natural it becomes.Thank you for reading, and happy prompting! ✨

LocalGhost

Guide to creating full body images using Flux

Hi everyone, in this guide I will try to explain the correct way to create a full body image using Flux (other models will most likely be able to do this too). You may have experienced the difficulty of creating full-body images, as the results are usually cropped, usually with a 2:3 image ratio (768x1152) and other variations. One effective way is to change the image ratio to be larger in height and smaller in width (e.g., 800x2048). But actually, that's unnecessary because Flux is very "stubborn."Why is Flux so “stubborn” about full body?1️⃣ Data bias: Flux “likes face & torso”The Flux model is heavily biased towards the face, chest and waist because:The training dataset is full of portrait, half-body, fashion cropFace = high detail = high aesthetic value according to the modelWhen you write:full body shot, wide angle shotThe model reads it, but its internal priority remains the face. So the camera “zooms in” on its own.2️⃣ The term “wide angle” ≠ camera distanceThis is a classic trap.Prompt → What the model understandswide angle → lens distortionfull body → intention, not a guaranteecinematic → lighting & mood❌ It doesn't mean: the camera is far away✔️ Models can still use wide lenses but close up3️⃣ Human = main object → auto cropFor human subjects, Flux automatically:Zoom in on the subjectSacrifice leg firstfocus on expression & top clothingThat's why the background is rarely "used".The RIGHT way to force full body in FluxThis isn't one trick. It's a combination of techniques.🧠 Main principlesDon't just say "full body" Force it with physical context and framing.✅ Technique 1: Use a physical anchor (THIS IS IMPORTANT)The model adheres more to object relations than to camera terms.❌ Bad:full body shot, wide angle✅ Good:standing on the ground, feet visible, head to toe visible, shoes touching the floorModels cannot crop the feet if "feet touching the ground".✅ Technique 2: Use “distance language”Replace camera language with physical distance.Effective example:camera placed far away, subject small in frame, full height visibleOr:long distance shot, entire body visible within the frame✅ Technique 3: Use “environment dominance”Make the background more important than the human.large environment, vast background, subject occupying small portion of the frameFlux will “move the camera away”.✅ Technique 4: Add anti-crop instructionFlux is quite responsive to explicit prohibitions.no cropped body, no half body, no close upThis isn't an official negative prompt, but it's still influential.🔥 Examples of prompts that are PROVEN to be more compliantSimple example:a young woman standing on a street, full height visible from head to toe, feet clearly visible touching the ground, camera placed far away, subject small in frame, wide environment background, entire body inside the frame, no cropped body, natural daylightOr a more “Flux-friendly” version:long distance shot of a woman, standing upright, head to toe visible, shoes visible on the ground, wide environment, camera far away, subject centered but small in frame⚠️ Things to AVOID🚫 Don't just rely on:full body shotwide anglecinematicfashion photographyThat's a cosmetic prompt, not a structural instruction.The summary 🎯Flux naturally cuts through the human body“full body” is not enoughObject + camera distance + environment relation is keyForce models with physical logic, not photographic terms.👩 FULL BODY TEMPLATE IMAGE OF A WOMAN (Or Man if you want to change it)Flux is more aggressive in zooming in on women, so this template is tighter on the legs & framing. Example:a young adult woman, standing upright on the ground, full height visible from head to toe, legs fully visible, feet clearly visible touching the ground, entire body fully inside the frame, no cropped body, no half body, no close up, camera placed far away, long distance shot, subject small in frame, wide environment surrounding the subject, wearing a summer dress and flat sandals, full outfit visible including footwear, located in a wide park with trees and open space, background dominates the scene, large scale background, natural body proportions, realistic female anatomy, balanced natural standing pose, soft daylight, high detail, realistic, flux dev style🧠 Why this template works“feet touching the ground” = Flux dare not cut“subject small in frame” = camera moves away slightlyEnvironment dominance = background priority > faceExplicit crop ban = suppress portrait biasThis isn't magic. It's stubborn negotiation with a model. 😄🔧 Additional tips (optional but powerful)If it's still cut off → add:full body framed with extra space above head and below feetFor an establishing backdrop:vast environment, large scale sceneFor safe poses:standing straight, arms relaxed at sidesWill "camera placed far away" make the impression that the human object in the photo is too far away / zoomed-out?Short answer:Maybe yes, maybe not.It depends on the combination, not the phrase alone.Now we dissect it with the Flux version of camera logic.Why does “camera placed far away” feel scary?Because if it stands alone, Flux reads it as:small subjectdominant environmentextreme distanceThe result could be something like:🧍‍♂️ ➝ human dot in the background oceanBut… in the template above, why is it safe?Because the phrase is CLIPPED by another instruction.Safe combination:camera placed far away, subject centered, full height visible, natural body proportionsIt means:camera back away ➝ so that the feet are inbut the subject remains the center of the framenot an extreme wide establishing shotHow to set a “safe distance” (sweet spot)🔧 Opt 1: Safe version (RECOMMENDED)Change far away ➝ moderate distancecamera placed at a moderate distance, full body visible, subject clearly visibleThis is the most stable for Flux.🔧 Opt 2: Stay “far away” but lockedcamera placed far away, subject clearly visible, subject occupying medium portion of the frameThe prompt medium portion holds back excessive zoom-out.🔧 Opt 3: Framing language, not distanceFlux is more compliant to this:full body framed with space above the head and below the feetWithout mentioning the distance at all.Quick table: distance word effects in Fluxcamera placed far away ➝ safe for feet, risky too smallmoderate distance ➝ most balancedlong distance shot ➝ dominant environmentsmall subject in frame ➝ very far awaymedium subject in frame ➝ clear full bodyMy final recommendation 🎯For 90% of human full body cases:camera placed at a moderate distance, full body framed from head to toe, subject centered and clearly visibleIt's far enough for feet, close enough for details.Conclusion:❌ “far away” is not poison⚠️ But it must be clamped✅ “moderate distance + framing language” = most stable🧠 Think of Flux like a stabborn photographer, not a passive camera.Below are 3 camera distances templates.I wrote the core template and then a ready-to-use example so it's easy to copy.🧍‍♀️ CLOSE FULL BODYImpression: dominant subject, clear clothing details, secure feetfull body framed tightly from head to toe, subject occupying large portion of the frame, camera placed at a close but full-body distance, space visible above the head and below the feet, entire body inside the frame, feet clearly visible touching the ground, no cropped body, no close up🧍 MEDIUM FULL BODYImpression: most balanced, safe for 80% of use casesfull body clearly visible from head to toe, subject occupying medium portion of the frame, camera placed at a moderate distance, balanced framing, wide environment visible but not dominant, feet clearly visible touching the ground, no cropped body🌆 WIDE / ESTABLISHING SHOT (BUT FULL BODY IS SAFE)Impression: strong location, small but intact humanestablishing shot, wide environment dominating the scene, subject occupying small portion of the frame, full body visible from head to toe, entire body inside the frame, feet visible touching the ground, subject clearly identifiableImportant for Flux:Don't mix small subjects in frame with close full body shots.Always repeat head to toe + feet touching groundFor women, repeating the leg 2× is normalDoes the prompt fit at the beginning of the prompt before the subject and details or at the end of the prompt after the subject and other details?Answer:Place it at the BEGINNING of the prompt.NOT at the end. NOT in the middle. IN FRONT.Now a really useful explanation for FluxHow Flux “reads” prompts (practical version)Flux doesn't read like a human reads a sentence. It works like this:Beginning = framing & compositionMiddle = subject & poseEnd = cosmetics (style, lighting, mood)It means:What you write in the first 25–30% of the prompt determines the camera.If “full body” and “feet visible” appear later, Flux has often already “decided” on the portrait framing.IDEAL sequence for full body (must follow)[1] Framing & Camera (FRONT) [2] Main subject (human) [3] Pose & physical [4] Clothing & visual details [5] Environment & background [6] Lighting & styleHard rules (brief but important)✅ MANDATORY at the beginningfull body / head to toefeet touching groundframing (close / medium / wide)subject portion in frameno cropped body⚠️ CAN be in the middleposegenderageexpression🎨 AT THE ENDcinematicfashionstreetlightingmoodrealismWhy does it often fail if it is placed at the end?Because Flux:Determine the crop earlyDon't “repeat” framing unless you force it.Trust initial instructions more than final revisionsConclusion 🎯Framing = frontDetail = backFull body fails → almost always because the framing comes too lateFinal note: This is just a guide, you don't need to 100% copy the prompt I made, you can modify it according to your own taste as long as you stick to the prompt placement rules. ✌👍👌That's all, folks. Hopefully, this guide helps you create text2image with Flux, especially full-body images, with satisfying results. 😁✨🤩

LocalGhost

🎬 Guide to Writing Multi-Shot Text-to-Video Prompts for Seedance 2.0

🎬 Mastering Multi-Character Prompts in Text-to-Image Generation

🎥 Mastering Camera Angles & Perspective in Text-to-Image

A quick guide in the sequences/orders of using LoRA

Guide to creating full body images using Flux