Generate speech

Use AI for text-to-speech or to clone your voice via API

Назад к коллекциям

Модели в коллекции

Сортировка: по популярности (run_count)
jaaari/kokoro-82m

Kokoro v1.0 - text-to-speech (82M params, based on StyleTTS2)

79 023 245 запусков
minimax/speech-02-turbo

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Designed for real-time applications with low latency

9 406 796 запусков
lucataco/xtts-v2

Coqui XTTS-v2: Multilingual Text To Speech Voice Cloning

4 991 657 запусков
minimax/speech-02-hd

Text-to-Audio (T2A) that offers voice synthesis, emotional expression, and multilingual capabilities. Optimized for high-fidelity applications like voiceovers and audiobooks.

1 680 614 запусков
zsxkib/realistic-voice-cloning

Create song covers with any RVC v2 trained AI voice from audio files.

1 525 006 запусков
suno-ai/bark

🔊 Text-Prompted Generative Audio Model

305 044 запусков
resemble-ai/chatterbox

Generate expressive, natural speech. Features unique emotion control, instant voice cloning from short audio, and built-in watermarking.

225 900 запусков
awerks/neon-tts

NeonAI Coqui AI TTS Plugin.

186 054 запусков
afiaka87/tortoise-tts

Generate speech from text, clone voices from mp3 files. From James Betker AKA "neonbjb".

173 307 запусков
adirik/styletts2

Generates speech from text

132 452 запусков
resemble-ai/chatterbox-turbo

The fastest open source TTS model without sacrificing quality.

123 646 запусков
cjwbw/seamless_communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

99 552 запусков
chenxwh/openvoice

Updated to OpenVoice v2: Versatile Instant Voice Cloning

83 751 запусков
minimax/voice-cloning

Clone voices to use with Minimax's speech-02-hd and speech-02-turbo

42 792 запусков
x-lance/f5-tts

F5-TTS, the new state-of-the-art in open source voice cloning

40 063 запусков
lucataco/orpheus-3b-0.1-ft

Orpheus 3B - high quality, emotive Text to Speech

33 601 запусков
resemble-ai/chatterbox-multilingual

Generate expressive, natural speech in 23 languages. Features instant voice cloning from short audio, emotion control, and seamless cross-language voice transfer.

22 235 запусков
resemble-ai/chatterbox-pro

Generate expressive, natural speech with Resemble AI's Chatterbox.

18 045 запусков
camenduru/metavoice

MetaVoice-1B: 1.2B parameter base model trained on 100K hours of speech

13 499 запусков
zsxkib/dia

Dia 1.6B by Nari Labs, Generates realistic dialogue audio from text, including non-verbal cues and voice cloning

12 779 запусков
cjwbw/voicecraft

Zero-Shot Speech Editing and Text-to-Speech in the Wild

10 839 запусков
minimax/speech-2.8-turbo

Minimax Speech 2.8 Turbo: Turn text into natural, expressive speech with voice cloning, emotion control, and support for 40+ languages

8 326 запусков
cjwbw/parler-tts

lightweight text-to-speech (TTS) model, trained on 10.5K hours of audio data

2 765 запусков
fermatresearch/spanish-f5-tts

A F5-TTS fine-tuned for Spanish

1 487 запусков
lucataco/csm-1b

CSM (Conversational Speech Model) is a speech generation model from Sesame that generates RVQ audio codes from text and audio inputs

1 133 запусков
lucataco/pheme

Pheme generates a variety of conversational voices in 16 kHz for phone-call applications

573 запусков
platform-kit/mars5-tts

A novel speech model for insane prosody.

541 запусков