Your First Project
import { Steps } from ‘@astrojs/starlight/components’;
This guide walks you through creating your first dubbed video. It assumes UltiVoice is installed and activated — if not, start with Installation.
The whole process takes about 5 minutes of hands-on time (plus processing time depending on video length and your hardware).
What you’ll need
Section titled “What you’ll need”- A video file: MP4, MKV, MOV, or AVI — up to ~2 hours. For your first test, use a short clip (1–3 minutes) to keep processing time low.
- The source language of the video (the language being spoken).
- Your target language (the language you want to dub into).
Step-by-step
Section titled “Step-by-step”-
Create a new project
Click New Project on the home screen or from File → New Project. Give it a name and choose a folder where output files will be saved.
-
Import your source video
Click Add Source Video and select your file. UltiVoice reads the video metadata and displays duration, resolution, and detected audio tracks.
-
Set the source language
In the Source dropdown, select the language spoken in the video. If you’re unsure, leave it on Auto-detect — Whisper will identify the language during transcription.
-
Set the target language
In the Target dropdown, pick the language you want to dub into. You can add multiple target languages and process them in one run.
-
Choose a voice
Under Voice, select a preset voice for the target language, or click Clone voice to use a reference audio clip from the original speaker. For your first run, a preset voice is recommended.
See Voice Selection & Cloning for full detail.
-
Run the pipeline
Click Start Dubbing. UltiVoice runs the pipeline in order:
- Transcribe — Whisper reads the audio and produces a timed transcript.
- Translate — Each segment is translated into the target language.
- Synthesise — TTS generates dubbed audio for each segment.
- Mix & render — FFmpeg assembles the final video.
Progress is shown per-stage. On a mid-range machine with a GPU, a 3-minute video typically completes in 5–10 minutes.
-
Review and export
When the pipeline finishes, the output video is previewed in the player. If any segments sound off, you can edit the translated text and re-synthesise individual segments without rerunning the full pipeline.
Click Export to save the final video. See Export & Download for output format options.
Next steps
Section titled “Next steps”Now that you’ve done your first dub, explore the full workflow guides:
- App Overview — understand the interface.
- Voice Selection & Cloning — get better-sounding output with voice cloning.
- Subtitles — edit timing and style before export.
- Dubbing Workflow — dive into per-segment review and editing.