🎙️ Speech Recognition

voice-activity-detection

pyannote/voice-activity-detection

Get AI Model →
2.2M
Downloads
❤️
230
Likes
🏷️
14
Tags
📦
pyannote-audio
Library
Model Details
Full Model IDpyannote/voice-activity-detection
Pipeline / Taskautomatic-speech-recognition
Librarypyannote-audio
Downloads (all-time)2.2M
Likes230
Last Modified5/10/2024
Author / Orgpyannote
PrivateNo — public
⚡ Quick Usage (Python)

Using the 🤗 Transformers library. Install with pip install transformers

from transformers import pipeline

# Load the model
pipe = pipeline("automatic-speech-recognition", model="pyannote/voice-activity-detection")

# Run inference
result = pipe("Your input here")
print(result)
🏷️ Tags
pyannote-audiopyannotepyannote-audio-pipelineaudiovoicespeechspeakervoice-activity-detectionautomatic-speech-recognitiondataset:amidataset:diharddataset:voxconverselicense:mitregion:us
More Speech Recognition Models
See all →
speaker-diarization-3.1

pyannote/speaker-diarization-3.1

10.2M❤️ 1.8K
Get AI Model →
whisperkit-coreml

argmaxinc/whisperkit-coreml

7.0M❤️ 169
Get AI Model →
whisper-large-v3-turbo

openai/whisper-large-v3-turbo

6.7M❤️ 3.0K
Get AI Model →
🚀 Use This Model

Access model files, inference API, and full documentation on Hugging Face.

Open on Hugging Face →Browse Model Files ↗← Browse All Models
🎙️ Task: Speech Recognition

This model is designed for the Speech Recognition task. Explore more models for this use case.

All Speech Recognition Models →
📊 Popularity
Downloads2.2M
❤️ Community Likes230
🛠️ Requirements
  • Install: pip install pyannote-audio
  • Python 3.8+ recommended for Transformers.
  • GPU (CUDA) speeds up inference significantly.
  • Use model.half() for fp16 on limited VRAM.
👋 Need help with code?