If your product needs voice generation at runtime (a learning platform that reads questions out loud, an IVR system that personalizes greetings, a chatbot that talks back, a render farm that produces ad variants overnight), the API tier is built for you. Same voices as the web app, same billing, no extra contract to negotiate.
What you get
- REST API, JSON in / audio URL out
- Idempotency keys to prevent double-billing on retries
- Same voice library and clones as the web app
- Available on Pro, Scale, Business and Enterprise
- Per-organization rate limits (configurable)
- Credits shared with your web workspace, with no separate billing
One endpoint, no surprises.
POST /api/generations with text and voice name. Response includes a generation ID, status, audio URL, and the credit debit. Status starts as PROCESSING and flips to COMPLETED or FAILED. Pass an Idempotency-Key header to safely retry network failures without double-billing.
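As a rough illustration of the flow described above, this Python sketch builds the POST request and retries network failures while reusing one idempotency key. The host `api.example.com`, the Bearer-token `Authorization` header, and the JSON field names `text` and `voice` are assumptions made for the example, not confirmed details of the API.

```python
import json
import time
import uuid
import urllib.error
import urllib.request

BASE_URL = "https://api.example.com"  # hypothetical host, replace with the real one


def build_generation_request(text, voice, api_key, idempotency_key=None):
    """Build a POST /api/generations request carrying an Idempotency-Key header."""
    key = idempotency_key or str(uuid.uuid4())
    # Field names "text" and "voice" are assumed for illustration.
    body = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        BASE_URL + "/api/generations",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
            "Idempotency-Key": key,
        },
    )


def create_generation(request, attempts=3, backoff_seconds=1.0,
                      opener=urllib.request.urlopen):
    """Send the request, retrying network errors with the same request object.

    Because every retry reuses the same Idempotency-Key header, a request that
    reached the server before the connection dropped is not billed twice.
    """
    last_error = None
    for attempt in range(attempts):
        try:
            with opener(request, timeout=30) as response:
                return json.load(response)
        except urllib.error.HTTPError:
            # The server answered with an error status; do not retry blindly.
            raise
        except urllib.error.URLError as exc:
            last_error = exc
            if attempt < attempts - 1:
                time.sleep(backoff_seconds * (attempt + 1))
    raise last_error
```

The key is generated once, when the request is built, and never regenerated inside the retry loop. A fresh key per attempt would defeat the purpose of the header.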
Built-in safety for production.
We enforce a max characters-per-generation limit based on your plan (10k on Pro, 50k on Business, 100k on Enterprise). If a request would put your org over its credit balance, it's rejected before any work runs. If a generation fails mid-way, credits are refunded. Errors come back with stable codes you can branch on.
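To stay under the per-generation limit on the client side, one option is to split long text before sending it. The sketch below uses only the three limits quoted above and assumes that "characters" means the Python string length and that 10k means 10,000. The helper names are hypothetical.

```python
# Limits quoted in the text above; plans without a stated limit are omitted.
PLAN_CHAR_LIMITS = {
    "Pro": 10_000,
    "Business": 50_000,
    "Enterprise": 100_000,
}


def limit_for_plan(plan):
    """Return the max characters per generation for a plan listed above."""
    return PLAN_CHAR_LIMITS[plan]  # raises KeyError for plans not listed


def split_text(text, limit):
    """Split text on whitespace into chunks of at most `limit` characters.

    Runs of whitespace are collapsed to single spaces. A single word longer
    than the limit cannot be placed in any chunk, so it raises ValueError.
    """
    chunks = []
    current = ""
    for word in text.split():
        if len(word) > limit:
            raise ValueError("a single word exceeds the character limit")
        candidate = word if not current else current + " " + word
        if len(candidate) <= limit:
            current = candidate
        else:
            chunks.append(current)
            current = word
    if current:
        chunks.append(current)
    return chunks
```

Calling `split_text(script, limit_for_plan("Pro"))` would yield chunks that can each be sent as a separate generation. Splitting on sentence boundaries instead of whitespace would probably sound more natural when the audio is stitched together.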
Same workspace, same trail.
Every API generation shows up in your Workspace History tab next to manual ones. You can see who or what produced each render, when, and at what cost. Combine that with team seats and you've got a billing model your finance team will actually like.
Quick answers
Which plans include API access?
Pro, Scale, Business and Enterprise. The Free, Starter and Creator plans don't include API access; they're focused on web-app usage.
What about webhooks?
Webhooks for generation status events are on the roadmap. For now, polling the generation ID is reliable and cheap.
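A polling loop might look like the Python sketch below. The text here does not give the path for reading a generation, so the lookup is passed in as a `fetch` callable. That callable, the `status` key name, and the default interval and timeout are assumptions for illustration.

```python
import time

# Terminal states named in the endpoint description.
TERMINAL_STATUSES = {"COMPLETED", "FAILED"}


def wait_for_generation(fetch, generation_id, interval=2.0, timeout=300.0,
                        sleep=time.sleep, clock=time.monotonic):
    """Poll until the generation reaches COMPLETED or FAILED.

    `fetch` is any callable that takes a generation ID and returns a dict
    with a "status" key (an assumed response shape).
    """
    deadline = clock() + timeout
    while True:
        generation = fetch(generation_id)
        if generation["status"] in TERMINAL_STATUSES:
            return generation
        if clock() >= deadline:
            raise TimeoutError(f"generation {generation_id} still processing")
        sleep(interval)
```

The caller still has to check whether the returned status is COMPLETED or FAILED. The function only guarantees that the generation has stopped processing or that the timeout was hit.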
Can I use the API for voice cloning, dubbing, and image gen too?
Voice generation and image generation are GA. Cloning and dubbing endpoints are in private beta; talk to support if you need them.

