Audio Models | AI Model Details

All Tools

Image to Text Image Generator Image Variation Image Relight Multi-angle Camera
Image Upscaler Image Crop Image Retouch Image Color adjust

Price Enterprise News

Menu

Close

Google's AI music generation model — create original, high-fidelity instrumental and vocal music from text descriptions in seconds

Professional-grade AI music composition — studio-quality audio generation with fine-grained control over genre, instrumentation, tempo, and mood

Gemini-3.1-flash-tts

Natural-sounding AI text-to-speech — fast, expressive voice synthesis with multilingual support for apps, podcasts, and accessibility tools

OpenAI's open-source speech recognition model — accurate multilingual transcription and translation from audio files for apps, podcasts, and accessibility

Stable audio 2.5

Stability AI's advanced audio generation model — create high-fidelity music, sound effects, and ambient audio from text prompts for games, films, podcasts, and creative projects

Minimax music-2.6

Minimax's AI music generation model — compose original, high-fidelity instrumental and vocal tracks from text descriptions for content creators, filmmakers, and game developers

Speech 2.6 turbo

Upgraded fast-track AI speech model — improved voice naturalness and emotional expressiveness at high speed for scalable, responsive voice-enabled application development

Fast AI voice synthesis for real-time apps — low-latency text-to-speech with natural-sounding output, built for chatbots, virtual assistants, and live interaction use cases

Speech 02 turbo

High-definition AI text-to-speech — natural, expressive voice synthesis with studio-grade audio clarity for audiobooks, voiceovers, and professional narration projects

Speech 2.8 turbo

Minimax's fastest AI speech model — ultra-low latency voice generation with refined prosody and multilingual support for demanding real-time voice interfaces and API integrations

Minimax's highest-quality AI voice model — broadcast-grade speech synthesis with the most natural intonation, nuanced emotion, and pristine audio fidelity across multiple languages

Image Tools

Image to Text Image Generator Image Variation Image Relight Multi-angle Camera
Image Upscaler Image Crop Image Retouch Image Color adjust

AI Text Models

All Claude version All Gemini version All GPT version

AI Audio Models

All Eleven labs version All Stability.ai version All Minimax version All Google Model All OpenAI Model

Business

Our Price Enterprise

Programs

Become a Experts Become a Creators Become a Ambassador

Resources

Contact Marketplace Hire Experts Support News

Video Tools

Video Generator Video Upscaler

AI Image Models

All Google models All Seedream version All Flux Max version All Ideogram version All Qwen version All Z-image version All Runway version All Datacte version All Chenxwh version All Claude version All Luma AI version All Stability.ai version

AI Video Models

All Seedream version All Wan version All Kling version All Veo version All Minimax version All ByteDance |... All Alibaba version All Grok version All Lightricks version All Luma AI All Pruna ai version All PixVerse version All Fictions ai version All Sync version All Mirelo version All Veed version All lucataco version

© linkfilm 2026

Terms of Service Privacy Policy