From script to aligned dub track
Generate TTS audio and get word-level alignment timestamps from the same workflow. One bundle, billed once upfront at £0.04/min (SD) or £0.05/min (HD) on estimated minutes (input characters / 900). Built for dubbing studios prepping ADR tracks and voice-direction sessions.
Two stages, one bundle
Each stage's project remains independently inspectable on your dashboard.
Synthesize
Text-to-speech generates the audio track from your script using the voice and quality you select.
- Standard or High Fidelity voice quality
- Voice and language picked at workflow creation
- Synthesis artifacts remain available on the linked project
Align
True forced alignment matches the generated audio back to the source script at phoneme level.
- Word-level timing under VocaSync's alignment guarantees
- Same-source linkage: no transcript drift between stages
- Ready to drive ADR sessions or feed into a DAW timeline
Per-minute pricing, charged once upfront.
Because the input is text, audio duration isn't known until synthesis runs, so estimated minutes are derived from the input character count (chars / 900) at reservation time. The bundle covers both stages, and if a workflow fails partway the unconsumed portion is refunded automatically.
£0.04 / min SD
Standard-quality TTS plus alignment, billed on estimated minutes from input chars.
£0.05 / min HD
High Fidelity TTS plus alignment, same per-minute basis.
Auto-refund
Terminal failures release the unconsumed portion back to your balance.
30-day expiry
Workflows left idle for 30 days expire and auto-refund, so a reservation never sits on your balance indefinitely.
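For a rough sense of what a reservation will cost, the estimate described above (input characters / 900, times the per-minute rate) can be sketched in a few lines of Python. This is an illustration only, not VocaSync code: the function name `estimate_reservation` is hypothetical, and the sketch assumes every character counts (whitespace included) and that minutes are not rounded, neither of which is stated on this page.

```python
from decimal import Decimal

# Rates and divisor as quoted on this page (GBP per estimated minute).
RATE_PER_MIN = {"SD": Decimal("0.04"), "HD": Decimal("0.05")}
CHARS_PER_MINUTE = 900


def estimate_reservation(script: str, quality: str = "SD") -> tuple[Decimal, Decimal]:
    """Return (estimated_minutes, estimated_cost_gbp) for a script.

    Assumptions (not confirmed by the page): all characters are counted,
    and fractional minutes are billed as-is with no rounding.
    """
    minutes = Decimal(len(script)) / CHARS_PER_MINUTE
    cost = minutes * RATE_PER_MIN[quality]
    return minutes, cost


# A 4,500-character script at HD quality.
minutes, cost = estimate_reservation("x" * 4500, "HD")
print(f"{minutes} estimated minutes, £{cost}")
```

Under those assumptions, a 4,500-character script reserves 5 estimated minutes, which comes to £0.25 at HD or £0.20 at SD. The amount actually reserved may differ if the service rounds minutes or counts characters differently.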