Pull down to refresh stories

Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context

The AI subscription race is moving out of demo mode and into practical use. When a vendor adds more storage, unlocks stronger models, or folds research and creation into the same plan without blowing up the price, readers have a reason to rethink what they are paying for. This piece sits on 1 source layers, but the real value is showing why the story should not be skimmed past too quickly. Try The Models TL;DR: Two new Apache 2.0 multilingual embedding models built on ModernBERT — a 97M-parameter compact model that beats every open sub-100M multilingual embedder on MTEB Multilingual Retrieval (60.3), and a 311M full-size model that scores 65.2 on MTEB Multilingual Retrieval (#2 among open models under 500M parameters) with Matryoshka support.

Models Datasets Spaces Buckets new Docs Enterprise Pricing Website Tasks HuggingChat Collections Languages Organizations Community Blog Posts Daily Papers Learn Discord Forum GitHub Solutions Team & Enterprise Hugging Face PRO Enterprise Support Inference Providers Inference Endpoints Storage Buckets --[0--> --]--> Back to Articles Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote 30 +24 Radu Florian hansolosan Follow ibm-granite Parul Awasthy pawasthy Follow ibm-granite Aashka Trivedi aashkaaa Follow ibm-granite Madison Lee kristunlee Follow ibm-granite Enterprise-Ready by Design A Strong Sub-100M Multilingual Model What Changed from R1 Training the Full-Size 311M Model Building the compact 97M Multilingual model Benchmark Results Multilingual Retrieval Speed and Throughput Matryoshka Embeddings (311M) Cross-lingual Retrieval Deployment Options For Framework Integrators Which Model Should You Use? The useful read is not just the monthly price or storage number, but which model tier gets unlocked, which tools are bundled, how the data is protected, and whether the plan actually removes the need for extra side subscriptions. Even when the core is settled, the next useful read is still the rollout speed, the real impact, and the switching cost for users or teams. Try The Models TL;DR: Two new Apache 2.0 multilingual embedding models built on ModernBERT — a 97M-parameter compact model that beats every open sub-100M multilingual embedder on MTEB Multilingual Retrieval (60.3), and a 311M full-size model that scores 65.2 on MTEB Multilingual Retrieval (#2 among open models under 500M parameters) with Matryoshka support.

Verified The story is backed by strong or official sources.
Reference image for: Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context
Reference image from Hugging Face Blog. Hugging Face Blog

Models Datasets Spaces Buckets new Docs Enterprise Pricing Website Tasks HuggingChat Collections Languages Organizations Community Blog Posts Daily Papers Learn Discord Forum GitHub Solutions Team & Enterprise Hugging Face PRO Enterprise Support Inference Providers Inference Endpoints Storage Buckets --[0--> --]--> Back to Articles Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote 30 +24 Radu Florian hansolosan Follow ibm-granite Parul Awasthy pawasthy Follow ibm-granite Aashka Trivedi aashkaaa Follow ibm-granite Madison Lee kristunlee Follow ibm-granite Enterprise-Ready by Design A Strong Sub-100M Multilingual Model What Changed from R1 Training the Full-Size 311M Model Building the compact 97M Multilingual model Benchmark Results Multilingual Retrieval Speed and Throughput Matryoshka Embeddings (311M) Cross-lingual Retrieval Deployment Options For Framework Integrators Which Model Should You Use? major AI vendors are pulling the AI plan race into practical use: price, storage, stronger models, and bundle rights that land in everyday work. Hugging Face Blog is strong enough to treat the story as verified, but the useful part still lies in the context and practical impact.

Featured offer

Patrick Tech Store Open the AI plans, tools, and software currently getting the push Jump straight into the store to see what Patrick Tech is pushing right now.

The upgrade worth noting

Models Datasets Spaces Buckets new Docs Enterprise Pricing Website Tasks HuggingChat Collections Languages Organizations Community Blog Posts Daily Papers Learn Discord Forum GitHub Solutions Team & Enterprise Hugging Face PRO Enterprise Support Inference Providers Inference Endpoints Storage Buckets --[0--> --]--> Back to Articles Granite Embedding Multilingual R2: Open Apache 2. 0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote 30 +24 Radu Florian hansolosan Follow ibm-granite Parul Awasthy pawasthy Follow ibm-granite Aashka Trivedi aashkaaa Follow ibm-granite Madison Lee kristunlee Follow ibm-granite Enterprise-Ready by Design A Strong Sub-100M Multilingual Model What Changed from R1 Training the Full-Size 311M Model Building the compact 97M Multilingual model Benchmark Results Multilingual Retrieval Speed and Throughput Matryoshka Embeddings (311M) Cross-lingual Retrieval Deployment Options For Framework Integrators Which Model Should You Use? Try The Models TL;DR: Two new Apache 2. 0 multilingual embedding models built on ModernBERT — a 97M-parameter compact model that beats every open sub-100M multilingual embedder on MTEB Multilingual Retrieval (60. 3), and a 311M full-size model that scores 65. 2 on MTEB Multilingual Retrieval (#2 among open models under 500M parameters) with Matryoshka support. Both cover 200+ languages, are tuned on 52 languages, handle 32K-token context (64x R1), and add code retrieval across 9 programming languages. Hugging Face Blog is strong enough to treat the story as verified, but the useful part still lies in the context and practical impact.

Where to look at price and bundle value

Models Datasets Spaces Buckets new Docs Enterprise Pricing Website Tasks HuggingChat Collections Languages Organizations Community Blog Posts Daily Papers Learn Discord Forum GitHub Solutions Team & Enterprise Hugging Face PRO Enterprise Support Inference Providers Inference Endpoints Storage Buckets --[0--> --]--> Back to Articles Granite Embedding Multilingual R2: Open Apache 2. 0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote 30 +24 Radu Florian hansolosan Follow ibm-granite Parul Awasthy pawasthy Follow ibm-granite Aashka Trivedi aashkaaa Follow ibm-granite Madison Lee kristunlee Follow ibm-granite Enterprise-Ready by Design A Strong Sub-100M Multilingual Model What Changed from R1 Training the Full-Size 311M Model Building the compact 97M Multilingual model Benchmark Results Multilingual Retrieval Speed and Throughput Matryoshka Embeddings (311M) Cross-lingual Retrieval Deployment Options For Framework Integrators Which Model Should You Use? On AI plans, the critical read is not just the extra terabytes on paper, but whether pricing stays stable, which model tier is actually unlocked, how tight the regional limits remain, and how clearly data privacy is promised.

Featured offer

Patrick Tech Store Open the AI plans, tools, and software currently getting the push Jump straight into the store to see what Patrick Tech is pushing right now.

Which AI layers are lifting the plan

Try The Models TL;DR: Two new Apache 2. 0 multilingual embedding models built on ModernBERT — a 97M-parameter compact model that beats every open sub-100M multilingual embedder on MTEB Multilingual Retrieval (60. 3), and a 311M full-size model that scores 65. 2 on MTEB Multilingual Retrieval (#2 among open models under 500M parameters) with Matryoshka support. Both cover 200+ languages, are tuned on 52 languages, handle 32K-token context (64x R1), and add code retrieval across 9 programming languages. What makes this worth opening is that the bundled AI touches real tools like mail, docs, research, image generation, video, or note-taking instead of sitting as a standalone demo.

Who should pay attention

The readers who should watch most closely are the ones already paying for storage, docs, meetings, content creation, and AI at the same time. If one plan truly bundles those layers, the value will surface quickly. Readers using AI only for occasional prompts may still be fine on lighter or free tiers.

Patrick Tech Media take

Patrick Tech Media reads moves like this as a race for practical value. The plan that removes the need for extra side services, reduces switching between tools, and keeps AI quality stable will hold an advantage longer than the launch buzz. From 1 early signals, the piece keeps 1 references that are useful for locking the main details in place.

Context Worth Keeping

Models Datasets Spaces Buckets new Docs Enterprise Pricing Website Tasks HuggingChat Collections Languages Organizations Community Blog Posts Daily Papers Learn Discord Forum GitHub Solutions Team & Enterprise Hugging Face PRO Enterprise Support Inference Providers Inference Endpoints Storage Buckets --[0--> --]--> Back to Articles Granite Embedding Multilingual R2: Open Apache 2. 0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality Enterprise Article Published May 14, 2026 Upvote 30 +24 Radu Florian hansolosan Follow ibm-granite Parul Awasthy pawasthy Follow ibm-granite Aashka Trivedi aashkaaa Follow ibm-granite Madison Lee kristunlee Follow ibm-granite Enterprise-Ready by Design A Strong Sub-100M Multilingual Model What Changed from R1 Training the Full-Size 311M Model Building the compact 97M Multilingual model Benchmark Results Multilingual Retrieval Speed and Throughput Matryoshka Embeddings (311M) Cross-lingual Retrieval Deployment Options For Framework Integrators Which Model Should You Use? major AI vendors are pulling the AI plan race into practical use: price, storage, stronger models, and bundle rights that land in everyday work. Hugging Face Blog is strong enough to treat the story as verified, but the useful part still lies in the context and practical impact. The important thing to keep in view is that the AI race is no longer only about model bragging rights; it is about practical value in daily work. The floor is firmer here because the story is anchored by an official source, not only by second-hand reaction.

Source notes

From Patrick Tech

Contextual tools

Related stories