Media and standalone tools
Beyond the project pipeline, the sidebar offers standalone tools for audio, video, and document conversion jobs.
Transcribe
The entry point for video translation: upload a video or audio file and configure the transcription:
- Source Language (or auto-detect) and Speaker Separation for multi-speaker content.
- Background Separation — isolate speech from music/noise before transcribing.
- Subtitle Length / CPL (characters per line) — readability constraints applied to the generated subtitles.
- Translation Length Limit and Dubbing Length Tolerance — how much translated subtitles may deviate from source timing, which matters once you dub.
The result opens in the video editor: review the transcription, translate to target languages, edit subtitles and timings, then export subtitles or generate dubs.
Batch Transcription
Transcribe many audio/video files in one job — built for podcast back-catalogues and e-learning libraries:
- Speaker Diarization and Tag Audio Events (laughter, music, etc.).
- Text output: individual TXT files (one per audio) or a single combined file — or both.
- Subtitle format: SRT only, VTT only (for LMS/web), both, or none.
Batch TTS
The reverse direction: generate speech from text at scale. Pick the target language, a voice, and an accent (American, British, Australian, Indian, …), and optionally generate subtitles (SRT/VTT) alongside the audio — useful for e-learning voice-overs in many languages at once.
Lip Sync
Re-syncs a speaker's mouth movements to match dubbed audio, taking a dubbed video from "voice-over" to natural-looking. Processing is GPU-intensive, so jobs run in the background.
OCR
The OCR dashboard converts scanned documents into translatable text outside the normal upload flow: upload files or a whole ZIP against a project, choose the OCR model, and optionally provide a custom prompt to steer extraction (e.g. "preserve table structure"). Jobs run in the background; outputs land on the project as translatable files.
Alignment
Builds a translation memory from existing translated document pairs — covered in TM & termbase.