Simple, transparent pricing
No subscriptions. No commitments. Top up your balance and only pay for what you actually use. Add a payment method to unlock our generous free tier.
Speech Synthesis
Convert text to natural speech
- Standard quality: £0.03/1k chars
- High Fidelity quality: £0.05/1k chars
- 57 languages supported
- 9 unique AI voices
- 5 output formats (MP3, AAC, OPUS, FLAC, WAV)
- Sub-300ms latency
True Forced Alignment
Your transcript in = your transcript out
- Not ASR—deterministic acoustic alignment
- Complete bundle: JSON, SRT & WebVTT included
- Word-level & segment timestamps
- 15 supported languages
- Large file support (100MB)
- Enterprise pricing available for bulk orders
Transcription (ASR)
Convert audio to text with AI
- State-of-the-art WhisperX engine
- 99 languages supported
- Auto-detect language
- Segment-level timestamps
- Large file support (100MB)
- JSON & TXT output formats
- Review transcript before linked alignment
Free Tier Included
Every account gets a generous monthly free allowance. Perfect for testing and small projects. Free tier applies to Standard quality synthesis and alignment only.
10 min
Alignment
1,000
Characters
(Standard only)
Transcription is pay-as-you-go only (no free tier)
Balance Top-Up
Add credits when you need them
No auto-renewal, no surprise charges. Just straightforward prepaid credits.
£5
500 credits
£10
1000 credits
£25
2500 credits
£50
5000 credits
£100
10000 credits
Secure payments via Stripe. Credits never expire.
Enterprise & Bulk Pricing
Volume DiscountsProcessing thousands of hours? We offer significant volume discounts for bulk alignment and synthesis jobs. Perfect for podcast networks, audiobook publishers, subtitle houses, and content platforms.
What's Included
Everything you need, nothing you don't
Full API Access
RESTful endpoints with comprehensive documentation
Unlimited API Keys
Create as many keys as your workflow needs
Unlimited Projects
Organize your work with no project limits
Persistent Storage
Files stored until you delete them
Email Support
Get help when you need it
Usage Dashboard
Track spending and usage in real-time
Included Voices
9 AI voices at no extra cost
All voices are included in your usage—no premium voice tiers.
Languages
Global language support
57 languages for synthesis. 15 languages for alignment:
Output Formats
5 audio formats
Export in the format that fits your workflow.
FAQ
Common questions
You top up your account balance with credits, then each API call deducts from your balance based on usage. For synthesis, you're charged per 1,000 characters. For alignment and transcription, you're charged per minute of audio (rounded up to the nearest 15 seconds, minimum 1 minute). No recurring charges—just pay for what you use.
If you run both services, each job is billed independently: transcription is charged at £0.01/min for the ASR job, and alignment is charged at £0.02/min for the alignment job. The transcription project and linked alignment project are separate units of work.
No, your credits never expire. Once you top up, the balance stays in your account until you use it.
Yes! Every account receives a monthly free allowance: 10 minutes of forced alignment and 1,000 characters of speech synthesis. This resets every 30 days. Note: Transcription and High Fidelity synthesis are not included in the free tier.
We accept all major credit and debit cards through our secure Stripe integration. This includes Visa, Mastercard, American Express, and more.
Yes, invoices are automatically generated for each top-up and are available in your billing dashboard.
Absolutely! We offer significant volume discounts for enterprise customers processing bulk alignment and synthesis jobs. Whether you're a podcast network, audiobook publisher, or content platform, we can tailor pricing to your needs. Contact us at sales@vocasync.io to discuss your requirements.
Ready to get started?
Create your free account in seconds. Add a payment method to unlock the free tier.