IBM’s Granite Embedding R2 is a family of English encoder-based embedding models for enterprise-scale dense retrieval, including both bi-encoder and cross-encoder architectures.
Pricing
Official site · May 2, 2026, 6:30 AM
Input / 1M
$0.10
Output / 1M
$0.10
IBM publishes a single embedding-model rate of USD 0.10 per million tokens; the pricing page does not break out separate input and output token prices for this model.
IBM's official watsonx.ai pricing page states that all embedding models are available for USD 0.10 per million tokens. IBM's Granite Embedding English Reranker r2 model card identifies the r2 model as a text-embedding/reranking model in the Granite Embeddings collection. No separate public r2-specific price line was found, so the published embedding-model rate applies.
Model Intelligence
Context window
8,192 tokens
Benchmarkable
No
Model level
family
Recent stories
1 linked story