🤖 any-to-any

gemma-4-12B-it-qat-GGUF

unsloth/gemma-4-12B-it-qat-GGUF

Get AI Model →
439.0K
Downloads
❤️
284
Likes
🏷️
13
Tags
📦
transformers
Library
Model Details
Full Model IDunsloth/gemma-4-12B-it-qat-GGUF
Pipeline / Taskany-to-any
Librarytransformers
Downloads (all-time)439.0K
Likes284
Last Modified6/10/2026
Author / Orgunsloth
PrivateNo — public
⚡ Quick Usage (Python)

Using the 🤗 Transformers library. Install with pip install transformers

from transformers import pipeline

# Load the model
pipe = pipeline("any-to-any", model="unsloth/gemma-4-12B-it-qat-GGUF")

# Run inference
result = pipe("Your input here")
print(result)
🏷️ Tags
transformersggufgemma4unslothgemmagoogleany-to-anybase_model:google/gemma-4-12B-it-qat-q4_0-unquantizedbase_model:quantized:google/gemma-4-12B-it-qat-q4_0-unquantizedlicense:apache-2.0endpoints_compatibleregion:usconversational
More any-to-any Models
See all →
gemma-4-E4B-it

google/gemma-4-E4B-it

6.0M❤️ 1.3K
Get AI Model →
gemma-4-E2B-it

google/gemma-4-E2B-it

2.3M❤️ 777
Get AI Model →
gemma-4-12B-it

google/gemma-4-12B-it

2.2M❤️ 1.2K
Get AI Model →
🚀 Use This Model

Access model files, inference API, and full documentation on Hugging Face.

Open on Hugging Face →Browse Model Files ↗← Browse All Models
🤖 Task: any-to-any

This model is designed for the any-to-any task. Explore more models for this use case.

All any-to-any Models →
📊 Popularity
Downloads439.0K
❤️ Community Likes284
🛠️ Requirements
  • Install: pip install transformers
  • Python 3.8+ recommended for Transformers.
  • GPU (CUDA) speeds up inference significantly.
  • Use model.half() for fp16 on limited VRAM.
👋 Need help with code?