Integration with alternative AI voice generators
A
Abe Dearmer
Hi Sarah, thanks for the request. To make sure we scope this correctly, can you share a bit more about what you are trying to achieve and what “good” looks like for your use case?
1) Which AI voice provider(s) would you want to use (for example ElevenLabs, PlayHT, Azure, Google, Amazon Polly, etc.)?
2) Is the main need a New Zealand accent, or broader control like region, gender, tone, and speaking style?
3) Where are you using AI voice today in Sendspark: voiceover for a full video, short intro/outro, or something else?
4) Do you need the voice to be generated from a script (text to speech), or do you also want voice cloning from a sample?
5) How should this work in the product: a simple “Voice provider” dropdown, or per video and per scene controls?
6) Any requirements around pronunciation controls (custom dictionary), SSML support, or speed and pitch adjustments?
7) Are there compliance or data handling requirements we should consider (for example where audio is processed or stored)?
If you can also share 1 to 2 example videos where the current US accent feels mismatched, plus the target accent and style you want, that will help us evaluate the best approach and which providers to prioritize.