Media and standalone tools

Beyond the project pipeline, the sidebar offers standalone tools for audio, video, and document conversion jobs.

Transcribe

The entry point for video translation: upload a video or audio file and configure the transcription:

Source Language (or auto-detect) and Speaker Separation for multi-speaker content.
Background Separation — isolate speech from music/noise before transcribing.
Subtitle Length / CPL (characters per line) — readability constraints applied to the generated subtitles.
Translation Length Limit and Dubbing Length Tolerance — how much translated subtitles may deviate from source timing, which matters once you dub.

The result opens in the video editor: review the transcription, translate to target languages, edit subtitles and timings, then export subtitles or generate dubs.

Batch Transcription

Transcribe many audio/video files in one job — built for podcast back-catalogues and e-learning libraries:

Speaker Diarization and Tag Audio Events (laughter, music, etc.).
Text output: individual TXT files (one per audio) or a single combined file — or both.
Subtitle format: SRT only, VTT only (for LMS/web), both, or none.

Batch TTS

The reverse direction: generate speech from text at scale. Pick the target language, a voice, and an accent (American, British, Australian, Indian, …), and optionally generate subtitles (SRT/VTT) alongside the audio — useful for e-learning voice-overs in many languages at once.

Lip Sync

Re-syncs a speaker's mouth movements to match dubbed audio, taking a dubbed video from "voice-over" to natural-looking. Processing is GPU-intensive, so jobs run in the background.

OCR

The OCR dashboard converts scanned documents into translatable text outside the normal upload flow: upload files or a whole ZIP against a project, choose the OCR model, and optionally provide a custom prompt to steer extraction (e.g. "preserve table structure"). Jobs run in the background; outputs land on the project as translatable files.

Alignment

Builds a translation memory from existing translated document pairs — covered in TM & termbase.

Transcribe​

Batch Transcription​

Batch TTS​

Lip Sync​

OCR​

Alignment​

Transcribe

Batch Transcription

Batch TTS

Lip Sync

OCR

Alignment