🎙️ Speech Recognition

voice-activity-detection

pyannote/voice-activity-detection

Get AI Model →
3.5M
Downloads
❤️
233
Likes
🏷️
14
Tags
📦
pyannote-audio
Library
Model Details
Full Model IDpyannote/voice-activity-detection
Pipeline / Taskautomatic-speech-recognition
Librarypyannote-audio
Downloads (all-time)3.5M
Likes233
Last Modified5/10/2024
Author / Orgpyannote
PrivateNo — public
⚡ Quick Usage (Python)

Using the 🤗 Transformers library. Install with pip install transformers

from transformers import pipeline

# Load the model
pipe = pipeline("automatic-speech-recognition", model="pyannote/voice-activity-detection")

# Run inference
result = pipe("Your input here")
print(result)
🏷️ Tags
pyannote-audiopyannotepyannote-audio-pipelineaudiovoicespeechspeakervoice-activity-detectionautomatic-speech-recognitiondataset:amidataset:diharddataset:voxconverselicense:mitregion:us
More Speech Recognition Models
See all →
whisperkit-coreml

argmaxinc/whisperkit-coreml

8.5M❤️ 193
Get AI Model →
speaker-diarization-3.1

pyannote/speaker-diarization-3.1

8.5M❤️ 2.5K
Get AI Model →
whisper-large-v3-turbo

openai/whisper-large-v3-turbo

7.7M❤️ 3.1K
Get AI Model →
🚀 Use This Model

Access model files, inference API, and full documentation on Hugging Face.

Open on Hugging Face →Browse Model Files ↗← Browse All Models
🎙️ Task: Speech Recognition

This model is designed for the Speech Recognition task. Explore more models for this use case.

All Speech Recognition Models →
📊 Popularity
Downloads3.5M
❤️ Community Likes233
🛠️ Requirements
  • Install: pip install pyannote-audio
  • Python 3.8+ recommended for Transformers.
  • GPU (CUDA) speeds up inference significantly.
  • Use model.half() for fp16 on limited VRAM.
👋 Need help with code?