Explore
All Tools
Price
Enterprise
News
Login
Sign up
Login
Sign up
Menu
Close
All Models
Audio Models
Image Models
Text Models
Tool Models
Video Models
Lyria-3
Audio model
Google's AI music generation model — create original, high-fidelity instrumental and vocal music from text descriptions in seconds
Lyria-3-pro
Audio model
Professional-grade AI music composition — studio-quality audio generation with fine-grained control over genre, instrumentation, tempo, and mood
Gemini-3.1-flash-tts
Audio model
Natural-sounding AI text-to-speech — fast, expressive voice synthesis with multilingual support for apps, podcasts, and accessibility tools
Whisper
Audio model
OpenAI's open-source speech recognition model — accurate multilingual transcription and translation from audio files for apps, podcasts, and accessibility
Stable audio 2.5
Audio model
Stability AI's advanced audio generation model — create high-fidelity music, sound effects, and ambient audio from text prompts for games, films, podcasts, and creative projects
Minimax music-2.6
Audio model
Minimax's AI music generation model — compose original, high-fidelity instrumental and vocal tracks from text descriptions for content creators, filmmakers, and game developers
Speech 2.6 turbo
Audio model
Upgraded fast-track AI speech model — improved voice naturalness and emotional expressiveness at high speed for scalable, responsive voice-enabled application development
Speech 02 hd
Audio model
Fast AI voice synthesis for real-time apps — low-latency text-to-speech with natural-sounding output, built for chatbots, virtual assistants, and live interaction use cases
Speech 02 turbo
Audio model
High-definition AI text-to-speech — natural, expressive voice synthesis with studio-grade audio clarity for audiobooks, voiceovers, and professional narration projects
Speech 2.8 turbo
Audio model
Minimax's fastest AI speech model — ultra-low latency voice generation with refined prosody and multilingual support for demanding real-time voice interfaces and API integrations
Speech 2.8 hd
Audio model
Minimax's highest-quality AI voice model — broadcast-grade speech synthesis with the most natural intonation, nuanced emotion, and pristine audio fidelity across multiple languages