App: AI Voices

AI Voices App Image

AI Voices provides text-to-speech, transcription, and lip-sync data generation powered by AWS Polly neural voice engine. Convert any text into natural-sounding spoken audio across multiple voices and languages, with output delivered as MP3 or other audio formats ready to serve directly to web clients or store for reuse.


Beyond basic audio, AI Voices generates lip-sync tween data alongside speech - a frame-by-frame animation dataset that drives avatar mouth movements in sync with the generated audio. This enables realistic talking avatars and animated characters built directly from your content without any external animation software.


AI Voices is designed as a service layer for the rest of the platform. The AI Chatbot can route its responses directly through Voices to produce spoken replies, combining conversational AI with audio output in a single flow. Chain Commands can call Voices as a node in any automation sequence. Custom apps can use it as a building block for any product that requires spoken output or transcription.

Apps only charge for actual usage for web requests, data lookups / storage and automatic background processing. Credits are charged when you use features or make any api requests from your website. Credits are charged for server, database and AI Model fees.