Standalone Dynamic Voice Note Generation
A
Abe Dearmer
Hi Sumayya, thanks for the request. To make sure we scope this correctly, can you share a bit more about how you’d want “voice note only” to work?
1) Output format and delivery
- What file formats do you need (MP3, WAV, M4A)?
- Should the result be downloadable, shareable via a link, or both?
- Do you need an embeddable audio player page (similar to our video share pages), or just the raw file?
2) Personalization inputs
- Would this use the same dynamic variables you use for Dynamic Videos today (for example, first name, company, custom fields)?
- Are you expecting text-to-speech from a script template, or uploading a base recording and having us stitch in dynamic segments?
3) Voice and generation options
- Should it support the same voice options as Dynamic Videos (voice selection, tone/style, speed), or a simpler set?
- Any requirements around pronunciation controls (phonetics, custom dictionary) or multiple languages?
4) Scale and workflow
- Roughly how many voice notes would you generate at a time (one-off, tens, hundreds, thousands)?
- Do you want this available via API/Zapier, CSV upload, or only in-app?
5) Use case and constraints
- What’s the primary use case (sales outreach, support follow-ups, order updates, internal comms)?
- Any length targets per voice note, and do you need background music or silence trimming?
If you can answer the above (even roughly), we can propose the right UX (for example, a “Create Audio” flow parallel to Dynamic Video) and confirm what’s feasible based on the current dynamic voice pipeline.