Transcribe speech to text

Use AI to transcribe speech to text via API

Назад к коллекциям

Модели в коллекции

Сортировка: по популярности (run_count)
openai/whisper

Convert speech in audio to text

143 425 819 запусков
vaibhavs10/incredibly-fast-whisper

whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗

25 840 053 запусков
victor-upmeet/whisperx

Accelerated transcription, word-level timestamps and diarization with whisperX large-v3

6 246 067 запусков
thomasmol/whisper-diarization

⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt

5 791 475 запусков
google/gemini-3-pro

Google's most advanced reasoning Gemini model

836 895 запусков
cjwbw/seamless_communication

SeamlessM4T—Massively Multilingual & Multimodal Machine Translation

99 552 запусков
daanelson/whisperx

Accelerated transcription of audio using WhisperX

94 354 запусков
m1guelpf/whisper-subtitles

Generate subtitles from an audio file, using OpenAI's Whisper model.

73 930 запусков
openai/gpt-4o-transcribe

A speech-to-text model that uses GPT-4o to transcribe audio

35 504 запусков
nvidia/parakeet-rnnt-1.1b

🗣️ Nvidia + Suno.ai's speech-to-text conversion with high accuracy and efficiency 📝

24 453 запусков
adidoes/whisperx-video-transcribe

ASR from video URL based on whisperx using large-v2 model

19 647 запусков
openai/gpt-4o-mini-transcribe

A speech-to-text model that uses GPT-4o mini to transcribe audio

11 331 запусков
nicknaskida/whisper-diarization

⚡️ Insanely Fast audio transcription | whisper large-v3 | speaker diarization | word & sentence level timestamps | prompt | hotwords. Fork of thomasmol/whisper-diarization. Added batched whisper, 3x-4x speedup 🚀

451 запусков
cjwbw/canary-1b

Nvidia Automatic speech-to-text recognition (ASR) in 4 languages (English, German, French, Spanish)

277 запусков