🤖 text-ranking

llama-nemotron-rerank-1b-v2

nvidia/llama-nemotron-rerank-1b-v2

⬇ Downloads: 536.4K
❤️ Likes: 50
🏷️ Tags: 17
📦 Library: transformers

Model Details

Full Model ID: nvidia/llama-nemotron-rerank-1b-v2
Pipeline / Task: text-ranking
Library: transformers
Downloads (all-time): 536.4K
Likes: 50
Last Modified: 4/10/2026
Author / Org: nvidia
Private: No (public)

⚡ Quick Usage (Python)

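Uses the 🤗 Transformers library with PyTorch. Install with: pip install transformers torch

The pipeline() helper in Transformers has no "text-ranking" task, so the snippet below loads the model as a sequence classifier instead. This assumes the repository exposes a sequence-classification head (it is tagged text-classification); the exact query/passage input template is defined on the model card.

import torch
from transformers import AutoModelForSequenceClassification, AutoTokenizer

# Load the model; the repo is tagged custom_code, so remote code must be trusted explicitly
model_id = "nvidia/llama-nemotron-rerank-1b-v2"
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)
model = AutoModelForSequenceClassification.from_pretrained(model_id, trust_remote_code=True).eval()

# Run inference: score one (query, passage) pair; a higher logit means more relevant
inputs = tokenizer("Your query here", "A candidate passage here", truncation=True, return_tensors="pt")
with torch.no_grad():
    result = model(**inputs).logits
print(result)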
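
A reranker typically returns one raw relevance score (a logit) per query/passage pair, and the caller is left to turn those scores into an ordered list. The sketch below shows one way to do that in plain Python. The function names rank_passages and sigmoid and the example logits are hypothetical, not part of the model or the Transformers API; the sigmoid step is optional, since it does not change the ordering and only maps scores into the 0 to 1 range.

```python
import math

def sigmoid(z):
    """Numerically stable logistic function."""
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def rank_passages(passages, logits, top_k=None):
    """Pair each passage with its squashed score and sort best-first."""
    if len(passages) != len(logits):
        raise ValueError("need exactly one logit per passage")
    scored = [(p, sigmoid(z)) for p, z in zip(passages, logits)]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored if top_k is None else scored[:top_k]

# Hypothetical logits, as if a reranker had scored three passages for one query
passages = [
    "Paris is the capital of France.",
    "Bananas are yellow.",
    "France borders Spain.",
]
logits = [4.2, -3.1, 0.7]

for text, score in rank_passages(passages, logits, top_k=2):
    print(f"{score:.3f}  {text}")
```

Keeping the ranking step separate from model inference makes it easy to swap in a different scorer or to batch the model calls.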

🏷️ Tags

transformers, pytorch, safetensors, llama_bidirec, text-classification, text, reranker, cross-encoder, retrieval, semantic-search, text-ranking, custom_code, multilingual, license:other, text-embeddings-inference, endpoints_compatible, region:us

More text-ranking Models

cross-encoder/ms-marco-MiniLM-L6-v2 (⬇ 80.6M, ❤️ 269)
cross-encoder/ms-marco-MiniLM-L4-v2 (⬇ 3.8M, ❤️ 16)
cross-encoder/ms-marco-MiniLM-L12-v2 (⬇ 2.5M, ❤️ 106)

🤖 Task: text-ranking

This model is designed for the text-ranking task: given a query and a set of candidate passages, it scores each passage for relevance to the query so the candidates can be sorted from most to least relevant.
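
To make the task concrete, here is a minimal sketch of the reranking step of a retrieve-then-rerank setup. The scorer is a deliberately simple word-overlap function standing in for a real model; rerank and overlap_score are hypothetical names used only for illustration. In practice score_fn would wrap a call to a cross-encoder such as this one.

```python
from typing import Callable, List, Tuple

def rerank(
    query: str,
    candidates: List[str],
    score_fn: Callable[[str, str], float],
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Score every (query, candidate) pair and return the top_k, best first."""
    scored = [(c, score_fn(query, c)) for c in candidates]
    return sorted(scored, key=lambda item: item[1], reverse=True)[:top_k]

def overlap_score(query: str, passage: str) -> float:
    """Toy stand-in for a model: fraction of query words found in the passage."""
    q_words = set(query.lower().split())
    p_words = set(passage.lower().split())
    return len(q_words & p_words) / len(q_words) if q_words else 0.0

candidates = [
    "bananas are yellow",
    "france is in europe",
    "the capital of france is paris",
]
for passage, score in rerank("capital of france", candidates, overlap_score, top_k=2):
    print(f"{score:.2f}  {passage}")
```

Unlike an embedding model, a cross-encoder reads the query and the passage together, which is why the scoring function takes both arguments at once.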

🛠️ Requirements

→ Install: pip install transformers torch
→ Python 3.9+ is required by recent Transformers releases.
→ A GPU (CUDA) speeds up inference significantly.
→ Use model.half() to run in fp16 on limited VRAM; this roughly halves the memory needed for the weights.
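
As a rough illustration of the fp16 saving, the arithmetic below estimates the memory taken by the weights alone. The parameter count of about one billion is an assumption read off the "1b" in the model name, and the helper weight_memory_gib is hypothetical; activations, the CUDA context, and batch size add to the real footprint.

```python
def weight_memory_gib(num_params: int, bytes_per_param: int) -> float:
    """Memory for the weights alone, in GiB (excludes activations and overhead)."""
    return num_params * bytes_per_param / 1024**3

# Assumption: roughly 1 billion parameters, inferred from "1b" in the model name
params = 1_000_000_000

fp32 = weight_memory_gib(params, 4)  # 4 bytes per float32 weight
fp16 = weight_memory_gib(params, 2)  # 2 bytes per float16 weight
print(f"fp32: {fp32:.2f} GiB, fp16: {fp16:.2f} GiB")
```

Under that assumption the weights need roughly 3.7 GiB in fp32 and about 1.9 GiB in fp16.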
