Speaker diarization
Identify speakers from audio and video inputs
Models in this collection
Sorted by popularity (run_count)
whisper-large-v3, incredibly fast, powered by Hugging Face Transformers! 🤗
Accelerated transcription, word-level timestamps and diarization with whisperX large-v3
⚡️ Blazing fast audio transcription with speaker diarization | Whisper Large V3 Turbo | word & sentence level timestamps | prompt
Ultra-fast, customizable speech-to-text and speaker diarization for noisy, multi-speaker audio. Includes advanced noise reduction, stereo channel support, and flexible audio preprocessing—ideal for call centers, meetings, and podcasts.
Whisper transcription plus speaker diarization
Whisper AI with channel separation and speaker diarization
Fast automatic speech recognition (70x realtime with large-v2) with word-level timestamps and speaker diarization.
Segments an audio recording based on who is speaking (on A100)
Segments an audio recording based on who is speaking
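
Several of the entries above pair word-level timestamps with speaker diarization. As a rough illustration of how those two outputs can be combined, the sketch below assigns each transcribed word to the speaker whose segment overlaps it the most. The dictionary layout, the sample data, and the function names `overlap` and `assign_speakers` are assumptions made for this example. They are not the output format or API of any model listed here.

```python
def overlap(a_start, a_end, b_start, b_end):
    """Return the length in seconds shared by two time intervals (0.0 if none)."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(words, segments):
    """Label each word with the speaker whose segment overlaps it most.

    words:    list of dicts with "word", "start", "end" (hypothetical layout)
    segments: list of dicts with "start", "end", "speaker" (hypothetical layout)
    Words that overlap no segment get speaker None.
    """
    labeled = []
    for word in words:
        best_speaker = None
        best_overlap = 0.0
        for seg in segments:
            shared = overlap(word["start"], word["end"], seg["start"], seg["end"])
            if shared > best_overlap:
                best_overlap = shared
                best_speaker = seg["speaker"]
        labeled.append({**word, "speaker": best_speaker})
    return labeled


# Made-up sample data: two diarization segments and four timed words.
segments = [
    {"start": 0.0, "end": 2.0, "speaker": "SPEAKER_00"},
    {"start": 2.0, "end": 4.0, "speaker": "SPEAKER_01"},
]
words = [
    {"word": "hello", "start": 0.2, "end": 0.6},
    {"word": "there", "start": 1.8, "end": 2.1},  # straddles the speaker change
    {"word": "hi", "start": 2.5, "end": 2.8},
    {"word": "bye", "start": 5.0, "end": 5.3},    # outside every segment
]

labeled = assign_speakers(words, segments)
for item in labeled:
    print(item["speaker"], item["word"])
```

Picking the segment with the largest overlap is one simple way to handle words that straddle a speaker change. Real pipelines may resolve these cases differently, for example by using the word midpoint or by handling overlapping speech explicitly.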