Video Generation

Seedance-2.0-mini

Lower-cost, high-volume text-to-video and image-to-video generation with multimodal references and native, synchronized audio at up to 720p.

About This Template

Seedance 2.0 mini is a budget-friendly variant of the Seedance 2.0 family designed for scalable video generation. It produces high-quality clips with synchronized audio while keeping costs low, making it ideal for prototyping, iteration, and large content pipelines. Core capabilities - Text to video: Turn natural-language prompts into cinematic clips with optional dialogue, sound effects, and music. - Image to video: Animate a still image as the first frame; optionally lock a last frame for precise in/out control. - Multimodal references: Guide motion, style, and identity using up to 9 images, 3 videos (max 15s total), and 3 audio files (max 15s total). Reference them in your prompt as [Image1], [Video1], [Audio1], etc. - Video editing and extension: Edit a reference video or continue it with a text-described follow-up. - Native audio: Generates synchronized speech (use double quotes in your prompt), SFX, and background music. - Intelligent duration and adaptive framing: Set duration to -1 to let the model choose the best length, or aspect_ratio to "adaptive" for automatic composition. When to choose Mini vs other variants - Choose Seedance-2.0-mini for cost-sensitive, high-volume workflows or rapid prototyping (outputs at 480p and 720p). - Switch to Seedance 2.0 for 1080p or 4K delivery, or Seedance 2.0 Fast for a faster mid-tier option. Inputs and constraints - prompt (required): Up to 4000 characters. For best results, keep prompts under ~600 English words. - image (optional): First frame for image-to-video. Cannot be combined with reference_images. - last_frame_image (optional): Only valid if a first-frame image is provided. Cannot be combined with reference_images. - reference_images (optional): Up to 9. Use for character/style consistency and blocking. Do not mix with first/last frame images. - reference_videos (optional): Up to 3; total duration ≤ 15s. Use for motion transfer, style, and editing. - reference_audios (optional): Up to 3; total duration ≤ 15s. Requires at least one reference image or video. - duration (optional): Seconds; use -1 for intelligent duration. - resolution (optional): 480p or 720p. - aspect_ratio (optional): 16:9, 4:3, 1:1, 3:4, 9:16, 21:9, 9:21, or "adaptive". - generate_audio (optional): Enable to synthesize dialogue, SFX, and BGM. Put spoken lines in double quotes. - seed (optional): Set for reproducibility across runs. Output - A URI to the generated MP4 video file, including audio if enabled. Best-practice tips - Be specific: Describe camera moves, lensing, lighting, mood, pacing, and key actions. - Use dialogue formatting: "She turns and whispers: \"Follow me.\"" - Label references clearly in the prompt: "The character from [Image1] performs the motion from [Video1]." - Prototype efficiently: Start with 5-second clips at 480p, then scale duration/resolution for finals. - Control continuity: Provide first and last frame images to stabilize openings and endings. - For lip-sync: Supply reference audio and include quoted lines for precise timing. - For consistency: Use multiple reference images (varied angles/lighting) of the same subject.

Video Generation
Quick to set upFully customizableReady to use

How to Use This Template

1

Step 1: Enter your text in 'Prompt' Node

Fill the 'Prompt' node with the required text.

Example :
Hyper-realistic cinematic street racing shot. Audio: High-pitched engine revving, aggressive tire screech, and rain hitting metal. Camera starts low to the ground on a wet asphalt hairpin curve at night. A matte-black vintage sports car drifts aggressively into frame. The camera executes a fast whip-pan to the right, perfectly tracking the car's speed. The car slides out of frame, kicking up a massive rooster tail of neon-lit water droplets. The camera abruptly stops panning and immediately rack-focuses to a wet, crushed soda can resting on the asphalt in the extreme foreground. Perfect water physics, 1080p, 24fps.
2

Step 2: Upload your file

In the 'Image' node, upload the file you want to process.

Example :
No example available.
3

Step 3: Upload your file

In the 'Last Frame Image' node, upload the file you want to process.

Example :
No example available.
4

Step 4: Upload your files

In the 'Reference Images' node, upload the files you want to use in the workflow.

Example :
No example available.
5

Step 5: Run the Flow

Click the 'Run' button to execute the flow and get the final output.

Who is this for?

Perfect for professionals and creators looking to streamline their workflow

Marketing and social teams

Produce large volumes of short-form ads, promos, and social videos with consistent characters and styles at a lower cost.

Content studios and agencies

Rapidly prototype concepts, mood pieces, and story beats before committing to higher-resolution final renders.

Product and growth teams

Generate A/B test variants for landing pages and app stores with native audio and quick iteration cycles.

Game and XR developers

Create animated teasers, in-world vignettes, and motion studies using multimodal references and motion transfer.

Ecommerce and brand creators

Maintain visual identity and character consistency across seasonal campaigns using reference images and videos.

Researchers and hobbyists

Explore text-to-video and audio-visual generation affordably while retaining strong control over inputs.

Ready to build?

Start using this template

Open it directly in AI-Flow and start creating in minutes

Frequently Asked Questions

What is Seedance-2.0-mini best for?

It’s optimized for cost-efficient, high-volume video generation and rapid prototyping. Use it when you need frequent iterations or scaled output at 480p or 720p with synchronized audio.

How does Seedance-2.0-mini differ from Seedance 2.0 and Seedance 2.0 Fast?

Mini costs about half as much per second as Seedance 2.0 and outputs up to 720p. Seedance 2.0 supports 1080p and 4K for higher fidelity, while Seedance 2.0 Fast is a quicker mid-tier option.

Which input modes are supported?

Text-to-video, image-to-video (with optional last frame), multimodal reference (images, videos, audios), video editing, and video extension. Audio is generated natively for dialogue, SFX, and music.

Can I combine first/last frame images with reference images?

No. First/last frame images cannot be used together with reference_images. If you provide a last_frame_image, you must also provide a first-frame image.

How do I add dialogue and sound effects?

Enable generate_audio and put spoken lines in double quotes inside your prompt. You can also include SFX and music descriptions, or provide reference audio for timing and style.

What are the limits for reference media?

Up to 9 reference images, up to 3 reference videos with a combined maximum of 15 seconds, and up to 3 reference audios with a combined maximum of 15 seconds. Reference audio requires at least one reference image or video.

What resolutions and aspect ratios are supported?

Seedance-2.0-mini outputs 480p or 720p across common aspect ratios (16:9, 4:3, 1:1, 3:4, 9:16, 21:9, 9:21). Use aspect_ratio="adaptive" to let the model choose.

What is intelligent duration and when should I use it?

Set duration to -1 to enable intelligent duration. The model will select a suitable clip length based on your inputs and scene complexity.

How can I improve character and style consistency?

Provide multiple reference_images of the same subject from different angles and lighting, label them in your prompt, and describe the target style clearly. You can also include a short reference video for motion consistency.

How do I get reproducible results?

Set a seed and keep all inputs (prompt, references, settings) unchanged between runs. Small variations may still occur, but a fixed seed improves consistency.

What does the API return?

A URI pointing to the generated MP4 file. If generate_audio is enabled, the video will include synchronized audio.

Any tips to control cost during prototyping?

Start with 5-second clips at 480p, avoid video references unless needed (they’re costlier than image-only), and only increase duration or resolution for final passes.

How long can prompts be and what makes a strong prompt?

Prompts can be up to 4000 characters. For best results, keep under ~600 words and specify camera moves, lighting, mood, pacing, subject actions, and any dialogue in quotes.

What is AI-FLOW and how can it help me?

AI-FLOW is an all-in-one AI platform that allows you to build, integrate, and automate AI-powered workflows using an intuitive drag-and-drop interface. Whether you're a beginner or an expert, you can leverage multiple AI models to create innovative solutions without any coding required.

Is there a free trial available?

Yes, AI-FLOW offers a free trial to get you started. After that, you can purchase credits as needed—no subscription or long-term commitment required.

Can I integrate my API keys from providers like OpenAI and Replicate with AI-FLOW Cloud Version ?

Yes, you can easily integrate your existing API keys with AI-FLOW. If specified, nodes related to the API Key provided will use your API key, significantly reducing your platform credit usage.