Model Details
Full Model IDantirez/deepseek-v4-gguf
Pipeline / Tasktext-generation
Librarygguf
Downloads (all-time)5.7M
Likes270
Last Modified5/31/2026
Author / Organtirez
PrivateNo — public
⚡ Quick Usage (Python)
Using the 🤗 Transformers library. Install with pip install transformers
from transformers import pipeline
# Load the model
pipe = pipeline("text-generation", model="antirez/deepseek-v4-gguf")
# Run inference
result = pipe("Your input here")
print(result)🏷️ Tags
ggufquantizeddeepseekdeepseek-v4deepseek-v4-flashmoemixture-of-experts2-bit4-bitiq2_xxsq2_kq4_kds4apple-siliconmetaltext-generationenbase_model:deepseek-ai/DeepSeek-V4-Flashbase_model:quantized:deepseek-ai/DeepSeek-V4-Flashlicense:mitendpoints_compatibleregion:usconversational
More Text Generation Models
See all →🚀 Use This Model
Access model files, inference API, and full documentation on Hugging Face.
Open on Hugging Face →Browse Model Files ↗← Browse All Models✍️ Task: Text Generation
This model is designed for the Text Generation task. Explore more models for this use case.
All Text Generation Models →📊 Popularity
⬇ Downloads5.7M
❤️ Community Likes270
🛠️ Requirements
- →Install: pip install gguf
- →Python 3.8+ recommended for Transformers.
- →GPU (CUDA) speeds up inference significantly.
- →Use model.half() for fp16 on limited VRAM.