Pay As You Go

Simple, transparent pricing

No subscriptions. No commitments. Top up your balance and only pay for what you actually use. Add a payment method to unlock our generous free tier.

Speech Synthesis

Convert text to natural speech

  • Standard quality: £0.03/1k chars
  • High Fidelity quality: £0.05/1k chars
  • 57 languages supported
  • 9 unique AI voices
  • 5 output formats (MP3, AAC, OPUS, FLAC, WAV)
  • Sub-300ms latency
£0.03 per 1,000 characters
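To illustrate how the per-character rates above translate into a charge, here is a minimal sketch. The function name is hypothetical, and it assumes billing is strictly proportional to character count (this page does not say whether usage is rounded up to whole 1k blocks).

```python
from decimal import Decimal

# Listed rates in GBP per 1,000 characters (taken from the list above).
SYNTHESIS_RATES = {
    "standard": Decimal("0.03"),
    "high_fidelity": Decimal("0.05"),
}

def estimate_synthesis_cost(char_count: int, quality: str = "standard") -> Decimal:
    """Estimate the synthesis charge in GBP.

    Assumption: cost scales linearly with characters, with no block rounding.
    """
    if char_count < 0:
        raise ValueError("char_count must be non-negative")
    return SYNTHESIS_RATES[quality] * Decimal(char_count) / Decimal(1000)

print(estimate_synthesis_cost(12_500))                   # 0.375
print(estimate_synthesis_cost(12_500, "high_fidelity"))  # 0.625
```

Using `Decimal` rather than floats keeps the pence arithmetic exact.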

True Forced Alignment

Your transcript in = your transcript out

  • Not ASR—deterministic acoustic alignment
  • Complete bundle: JSON, SRT & WebVTT included
  • Word-level & segment timestamps
  • 15 supported languages
  • Large file support (100MB)
  • Enterprise pricing available for bulk orders
£0.02 per minute of audio

Transcription (ASR)

Convert audio to text with AI

  • State-of-the-art WhisperX engine
  • 99 languages supported
  • Auto-detect language
  • Segment-level timestamps
  • Large file support (100MB)
  • JSON & TXT output formats
  • Review transcript before linked alignment
£0.01 per minute of audio

Translation

Neural translation, timestamp-aware

  • 30+ target languages
  • Cue timestamps preserved verbatim
  • SRT, VTT, and JSON inputs supported
  • Premium adds register-aware tone polish (Sonnet 4.6)
  • Best for tonal content — anime, drama, dialogue
  • Switch tiers per project from the wizard
Standard: £0.080 per 1k chars
Premium: £0.100 per 1k chars

Includes Claude Sonnet 4.6 refinement pass

Workflow Bundles

Stages chained, billed as one

Workflows orchestrate multiple services into a single bundled job with upfront pricing and refund-on-failure protection. Subtitling and Dub-Prep are live today; more workflows are on the way.

Subtitling

Live

Transcription ➔ Alignment ➔ Translation

Audio in, translated subtitles out. Single upfront charge per minute of source audio (1-min minimum). Refunds are issued automatically on partial failure or after 30 days idle.

  • Standard: £0.08 per minute
  • Premium: £0.09 per minute (+ Sonnet refinement)
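The Subtitling charge (per minute of source audio, with a 1-minute minimum) can be sketched as follows. This is an illustration only: the function name is hypothetical, and it assumes fractional minutes above the minimum are billed proportionally, which this page does not confirm.

```python
from decimal import Decimal

# Listed Subtitling bundle rates in GBP per minute of source audio.
SUBTITLING_RATES = {"standard": Decimal("0.08"), "premium": Decimal("0.09")}
MINIMUM_MINUTES = Decimal(1)  # 1-min minimum stated for this workflow

def estimate_subtitling_cost(audio_seconds: int, tier: str = "standard") -> Decimal:
    """Estimate the upfront Subtitling charge in GBP.

    Assumption: durations above the minimum are billed pro rata (no rounding).
    """
    minutes = Decimal(audio_seconds) / Decimal(60)
    billable_minutes = max(minutes, MINIMUM_MINUTES)
    return SUBTITLING_RATES[tier] * billable_minutes

print(estimate_subtitling_cost(30))              # 0.08 (minimum applies)
print(estimate_subtitling_cost(600, "premium"))  # 0.90
```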

Dub-Prep

Live

Synthesis ➔ Alignment

Generate TTS audio and word-level alignment in one bundle. Estimated minutes derive from input character count (chars / 900) — no audio duration needed at submit time.

  • Standard: £0.04 per minute
  • High Fidelity: £0.05 per minute
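As a worked example of the Dub-Prep estimate (chars / 900), the sketch below derives estimated minutes from the character count and multiplies by the per-minute rate. The function name is hypothetical, and the sketch assumes no rounding and no minimum charge, neither of which is stated for this workflow.

```python
from decimal import Decimal

CHARS_PER_MINUTE = Decimal(900)  # divisor stated for Dub-Prep estimates
DUB_PREP_RATES = {"standard": Decimal("0.04"), "high_fidelity": Decimal("0.05")}

def estimate_dub_prep(char_count: int, quality: str = "standard"):
    """Return (estimated_minutes, estimated_cost_gbp) for a Dub-Prep job.

    Assumption: the estimate is used as-is, without rounding or a minimum.
    """
    minutes = Decimal(char_count) / CHARS_PER_MINUTE
    return minutes, DUB_PREP_RATES[quality] * minutes

minutes, cost = estimate_dub_prep(4_500)
print(minutes, cost)  # 5 0.20
```

For instance, a 9,000-character script would be estimated at 10 minutes under this formula.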

More workflows

Coming soon

Subtitle Base · Quick Translate

  • Subtitle Base — Transcription + Alignment (no translation). £0.03 / min
  • Quick Translate — Transcription + Translation (no alignment). £0.06 / min Standard · £0.08 / min Premium (Premium bundles Sonnet refinement).

Free Tier Included

Every account gets a generous monthly free allowance. Perfect for testing and small projects. Free tier applies to Standard quality synthesis and alignment only.

  • Alignment: 10 min / month
  • Synthesis (Standard): 1,000 chars / month

Transcription is pay-as-you-go only (no free tier)
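One way to picture the free allowance is as usage deducted before billing starts. The sketch below applies that idea to alignment; the function name is hypothetical, and the assumption that the monthly allowance is consumed first and the remainder billed at the standard £0.02 per minute is not spelled out on this page.

```python
from decimal import Decimal

FREE_ALIGNMENT_MINUTES = Decimal(10)  # monthly free allowance for alignment
ALIGNMENT_RATE = Decimal("0.02")      # GBP per minute of audio

def billable_alignment_cost(minutes_used_this_month: Decimal) -> Decimal:
    """Estimate the monthly alignment charge after the free allowance.

    Assumption: the free minutes are used up first, then the listed rate applies.
    """
    billable = max(minutes_used_this_month - FREE_ALIGNMENT_MINUTES, Decimal(0))
    return ALIGNMENT_RATE * billable

print(billable_alignment_cost(Decimal(8)))   # 0.00 (within the free tier)
print(billable_alignment_cost(Decimal(25)))  # 0.30
```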

Balance Top-Up

Add credits when you need them

No auto-renewal, no surprise charges. Just straightforward prepaid credits.

  • £5: 500 credits
  • £10: 1,000 credits (Popular)
  • £25: 2,500 credits
  • £50: 5,000 credits
  • £100: 10,000 credits

Secure payments via Stripe. Credits never expire.
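Every pack listed above works out to 100 credits per £1. The sketch below uses that ratio to estimate how far a top-up stretches; the helper names are hypothetical, and it assumes usage is debited at the listed GBP prices with 1 credit = £0.01.

```python
from decimal import Decimal

CREDITS_PER_GBP = Decimal(100)  # implied by the packs, e.g. £5 for 500 credits

def gbp_to_credits(amount_gbp: Decimal) -> Decimal:
    """Convert a GBP amount to credits at the ratio implied by the packs."""
    return amount_gbp * CREDITS_PER_GBP

def units_affordable(top_up_gbp: Decimal, unit_price_gbp: Decimal) -> int:
    """Whole units (minutes or 1k-char blocks) a top-up covers at a given price."""
    return int(top_up_gbp // unit_price_gbp)

print(gbp_to_credits(Decimal("5")))                     # 500
print(units_affordable(Decimal("10"), Decimal("0.02")))  # 500 minutes of alignment
```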

Enterprise & Bulk Pricing

Volume Discounts

Processing thousands of hours? We offer significant volume discounts for bulk alignment and synthesis jobs. Perfect for podcast networks, audiobook publishers, subtitle houses, and content platforms.

  • Custom volume pricing
  • Priority processing
  • Dedicated support
  • SLA guarantees

What's Included

Everything you need, nothing you don't

Full API Access

RESTful endpoints with comprehensive documentation

Unlimited API Keys

Create as many keys as your workflow needs

Unlimited Projects

Organize your work with no project limits

Persistent Storage

Files stored until you delete them

Email Support

Get help when you need it

Usage Dashboard

Track spending and usage in real-time

Included Voices

9 AI voices at no extra cost

All voices are included in your usage—no premium voice tiers.

alloy
ash
coral
echo
fable
onyx
nova
sage
shimmer

Languages

Global language support

57 languages for synthesis. 15 languages for alignment:

🇺🇸 English (US)
🇬🇧 English (UK)
🇫🇷 French
🇩🇪 German
🇪🇸 Spanish
🇵🇹 Portuguese (PT)
🇸🇪 Swedish
🇨🇿 Czech
🇵🇱 Polish
🇹🇷 Turkish
🇷🇺 Russian
🇺🇦 Ukrainian
🇯🇵 Japanese
🇰🇷 Korean
🇨🇳 Mandarin (CN)

Output Formats

5 audio formats

Export in the format that fits your workflow.

  • mp3: Universal compatibility
  • aac: Apple optimized
  • opus: Web streaming
  • flac: Lossless audio
  • wav: Raw quality

Ready to get started?

Create your free account in seconds. Add a payment method to unlock the free tier.