Skip to content
AI Primer

Granite Embedding R2

Enterprise embedding models for dense retrieval and reranking.

IBM’s Granite Embedding R2 is a family of English encoder-based embedding models for enterprise-scale dense retrieval, including both bi-encoder and cross-encoder architectures.

Pricing

Official site · May 2, 2026, 6:30 AM
Input / 1M
$0.10
Output / 1M
$0.10

IBM publishes a single embedding-model rate of USD 0.10 per million tokens; the pricing page does not break out separate input and output token prices for this model.

IBM's official watsonx.ai pricing page states that all embedding models are available for USD 0.10 per million tokens. IBM's Granite Embedding English Reranker r2 model card identifies the r2 model as a text-embedding/reranking model in the Granite Embeddings collection. No separate public r2-specific price line was found, so the published embedding-model rate applies.

View source

Model Intelligence

Context window
8,192 tokens
Benchmarkable
No
Model level
family

Recent stories

1 linked story
AI PrimerAI Primer

Your daily guide to AI tools, workflows, and creative inspiration.

© 2026 AI Primer. All rights reserved.