Integration with alternative AI voice generators | Feature Requests

Sarah

I would like to integrate with another AI voice generator because the US accent is jarring in contrast to the rest of the video. It would be beneficial to have the option to choose different voice providers for the AI voice, especially for accents like New Zealand.

Created by Autopilot

March 14, 2026

Abe Dearmer

Hi Sarah, thanks for the request. To make sure we scope this correctly, can you share a bit more about what you are trying to achieve and what “good” looks like for your use case?
1) Which AI voice provider(s) would you want to use (for example ElevenLabs, PlayHT, Azure, Google, or Amazon Polly)?
2) Is the main need a New Zealand accent, or broader control like region, gender, tone, and speaking style?
3) Where are you using AI voice today in Sendspark: voiceover for a full video, short intro/outro, or something else?
4) Do you need the voice to be generated from a script (text to speech), or do you also want voice cloning from a sample?
5) How should this work in the product: a simple “Voice provider” dropdown, or per video and per scene controls?
6) Any requirements around pronunciation controls (custom dictionary), SSML support, or speed and pitch adjustments?
7) Are there compliance or data handling requirements we should consider (for example where audio is processed or stored)?
If you can also share one or two example videos where the current US accent feels mismatched, along with the target accent and style you want, that will help us evaluate the best approach and decide which providers to prioritize.
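
To make questions 5 and 6 more concrete, here is a rough TypeScript sketch of what per-video voice settings and SSML output could look like. It is only an illustration: `VoiceSettings`, `buildSsml`, and the `provider` values are hypothetical names and not part of Sendspark or any provider's API, and support for SSML `prosody` attributes and locales such as `en-NZ` varies by provider.

```typescript
// Hypothetical settings shape for illustration; not an existing Sendspark type.
interface VoiceSettings {
  provider: string; // illustrative identifier, e.g. "example-tts"
  locale: string;   // BCP 47 language tag, e.g. "en-NZ"
  rate?: string;    // SSML prosody rate, e.g. "slow" or "95%"
  pitch?: string;   // SSML prosody pitch, e.g. "low" or "+2st"
}

// Escape XML special characters so script text cannot break the markup.
function escapeXml(text: string): string {
  return text
    .replace(/&/g, "&amp;")
    .replace(/</g, "&lt;")
    .replace(/>/g, "&gt;")
    .replace(/"/g, "&quot;")
    .replace(/'/g, "&apos;");
}

// Wrap a script in a minimal SSML document, adding <prosody> only when
// a rate or pitch adjustment was requested.
function buildSsml(script: string, settings: VoiceSettings): string {
  const attrs: string[] = [];
  if (settings.rate) attrs.push(`rate="${escapeXml(settings.rate)}"`);
  if (settings.pitch) attrs.push(`pitch="${escapeXml(settings.pitch)}"`);

  const body = escapeXml(script);
  const inner =
    attrs.length > 0 ? `<prosody ${attrs.join(" ")}>${body}</prosody>` : body;

  return (
    `<speak version="1.1" xmlns="http://www.w3.org/2001/10/synthesis" ` +
    `xml:lang="${escapeXml(settings.locale)}">${inner}</speak>`
  );
}
```

A "Voice provider" dropdown would then mostly come down to choosing the `provider` and `locale` values, while per-scene controls would mean storing one such settings object per scene instead of one per video.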