EmbeddingGemma.NET provides .NET bindings for Google DeepMind's EmbeddingGemma-300m model, enabling fully local, offline text embedding with no API key, no cloud dependency, and no data egress.
A single NuGet package supports both plain IServiceCollection (ASP.NET Core / generic host) and Microsoft Semantic Kernel's IKernelBuilder.
```shell
dotnet add package EmbeddingGemma.NET
```

Further installation guidance for the NuGet package can be found on the NuGet Gallery.
| Feature | Description |
|---|---|
| No runtime cost | Runs on-device; no API calls or external services required |
| Privacy first | All inference is performed locally; no data leaves the machine |
| Top-class accuracy | EmbeddingGemma is the highest-ranked open multilingual embedding model under 500M parameters on MTEB |
| High efficiency | Runs on low-end devices without a GPU and with as little as 4 GB of RAM |
| Multilingual | Supports 100+ languages out of the box |
| Task-aware embeddings | 15 built-in task types automatically apply the correct prompt prefix |
The ONNX model and tokenizer files must be present on disk before the service can be used. They are hosted at onnx-community/embeddinggemma-300m-ONNX on Hugging Face.
Run the included script once from the repository root. It downloads all required files into a .embedding_resources folder by default; pass -OutputPath to use a different location.
```powershell
.\Initialize-Embedding-Resources.ps1
```

Alternatively, download the following files manually and place them together in a single directory:
| File | Download link | Size |
|---|---|---|
| model.onnx | onnx/model.onnx | ~480 KB |
| model.onnx_data | onnx/model.onnx_data | ~1.23 GB |
| tokenizer.json | tokenizer.json | ~20 MB |
| tokenizer.model | tokenizer.model | ~4.7 MB |
| tokenizer_config.json | tokenizer_config.json | ~1.2 MB |
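The files above can also be fetched from the command line. The sketch below assumes the standard Hugging Face `resolve/main` download-URL pattern for the onnx-community/embeddinggemma-300m-ONNX repository; adjust the output folder as needed:

```shell
# Sketch: download the five required files into .embedding_resources.
# Assumes Hugging Face's "resolve/main" URL pattern for direct file downloads.
BASE="https://huggingface.co/onnx-community/embeddinggemma-300m-ONNX/resolve/main"
mkdir -p .embedding_resources
for f in onnx/model.onnx onnx/model.onnx_data tokenizer.json tokenizer.model tokenizer_config.json; do
  curl -L --fail "$BASE/$f" -o ".embedding_resources/$(basename "$f")"
done
```

Note that model.onnx_data is roughly 1.23 GB, so the download can take a while on slow connections.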
The resulting directory must have the following structure:
```
<model-directory>/
├── model.onnx
├── model.onnx_data
├── tokenizer.json
├── tokenizer.model
└── tokenizer_config.json
```
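Since a missing file only surfaces at first inference, it can help to fail fast at startup. The helper below is a hypothetical sketch of such a check; it is not part of EmbeddingGemma.NET:

```csharp
// Hypothetical helper (not an EmbeddingGemma.NET API): verify the model
// directory contains every required file before registering the service.
using System;
using System.IO;
using System.Linq;

static class ModelDirectoryCheck
{
    static readonly string[] RequiredFiles =
    {
        "model.onnx", "model.onnx_data",
        "tokenizer.json", "tokenizer.model", "tokenizer_config.json"
    };

    public static void EnsureValid(string modelDirectory)
    {
        // Collect every expected file that is absent and report them all at once.
        var missing = RequiredFiles
            .Where(f => !File.Exists(Path.Combine(modelDirectory, f)))
            .ToArray();
        if (missing.Length > 0)
            throw new FileNotFoundException(
                $"Model directory '{modelDirectory}' is missing: {string.Join(", ", missing)}");
    }
}
```

Call `ModelDirectoryCheck.EnsureValid(...)` before the registration calls shown below so a bad path produces one clear error instead of a deferred ONNX load failure.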
Via IServiceCollection (ASP.NET Core / generic host)
```csharp
using PhanXuanQuang.AI.LocalEmbeddings.EmbeddingGemma;

// Registers IEmbeddingGenerator<string, Embedding<float>> as a keyed singleton.
builder.Services.AddGemmaOnnxEmbeddingGenerator(options => options.ModelDirectory = @"C:\path\to\model-directory");

// Optional: keyed registration for multi-service scenarios.
builder.Services.AddGemmaOnnxEmbeddingGenerator(options => options.ModelDirectory = @"C:\path\to\model-directory", serviceId: "gemma");
```

Via IKernelBuilder (Microsoft Semantic Kernel)
```csharp
using Microsoft.SemanticKernel;
using PhanXuanQuang.AI.LocalEmbeddings.EmbeddingGemma;

var builder = Kernel.CreateBuilder();
builder.AddGemmaOnnxEmbeddingGenerator(options => options.ModelDirectory = @"C:\path\to\model-directory");
var kernel = builder.Build();
```

Resolve IEmbeddingGenerator<string, Embedding<float>> from the DI container and call GenerateAsync.
```csharp
using Microsoft.Extensions.AI;
using PhanXuanQuang.AI.LocalEmbeddings.EmbeddingGemma.Enums;
using PhanXuanQuang.AI.LocalEmbeddings.EmbeddingGemma.Services.Options;

var generator = serviceProvider.GetRequiredService<IEmbeddingGenerator<string, Embedding<float>>>();

// Without a task type: no prompt prefix is added.
var embeddings = await generator.GenerateAsync(["Hello, world!"]);

// With a task type: the appropriate prompt prefix is applied automatically.
var options = new EmbeddingGemmaEmbeddingGenerationOptions
{
    TaskType = EmbeddingGemmaTaskType.RetrievalQuery
};
var queryEmbeddings = await generator.GenerateAsync(["What is semantic search?"], options);
```

For document embeddings, supply an optional DocumentTitle to improve retrieval quality:
```csharp
var docOptions = new EmbeddingGemmaEmbeddingGenerationOptions
{
    TaskType = EmbeddingGemmaTaskType.RetrievalDocument,
    DocumentTitle = "Introduction to Semantic Search"
};
var docEmbeddings = await generator.GenerateAsync(["Semantic search ranks results by meaning..."], docOptions);
```

Set EmbeddingGemmaEmbeddingGenerationOptions.TaskType to have the service automatically prepend the correct prompt prefix for your scenario. When TaskType is null, no prefix is added.
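Once query and document vectors are generated, they are typically compared with cosine similarity. The helper below is a minimal local sketch, not a package API; the usage comment assumes the queryEmbeddings and docEmbeddings variables from the snippets above:

```csharp
// Sketch: compare two embedding vectors by cosine similarity.
// Similarity.Cosine is a local helper, not an EmbeddingGemma.NET API.
using System;

static class Similarity
{
    public static double Cosine(ReadOnlySpan<float> a, ReadOnlySpan<float> b)
    {
        // Accumulate the dot product and both squared norms in one pass.
        double dot = 0, na = 0, nb = 0;
        for (int i = 0; i < a.Length; i++)
        {
            dot += a[i] * b[i];
            na += a[i] * a[i];
            nb += b[i] * b[i];
        }
        return dot / (Math.Sqrt(na) * Math.Sqrt(nb));
    }
}

// Usage, assuming the embeddings generated above:
// double score = Similarity.Cosine(
//     queryEmbeddings[0].Vector.Span, docEmbeddings[0].Vector.Span);
```

Scores closer to 1.0 indicate closer semantic meaning, so ranking candidate documents by this score yields a basic semantic search.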
| EmbeddingGemmaTaskType | Intended use |
|---|---|
| RetrievalQuery | User-supplied search queries |
| RetrievalDocument | Documents or passages being indexed (no title) |
| Document | Documents or passages being indexed (no title, alias) |
| Query / Retrieval | General-purpose retrieval |
| QuestionAnswering | Questions in a QA pipeline |
| FactVerification | Claims requiring evidence lookup |
| Classification / MultilabelClassification | Sentiment, spam detection, labelling |
| Clustering | Grouping documents by topic |
| SentenceSimilarity / PairClassification | Direct text-to-text similarity comparison |
| Summarization | Texts intended for summarization |
| InstructionRetrieval | Natural-language-to-code retrieval |
| Reranking | Re-scoring a candidate result set |
| BitextMining | Parallel sentence alignment across languages |
For detailed guidance on prompt formatting, refer to:
The repository also includes a Windows application (EmbeddingGemma.DemoApp) that demonstrates real-world semantic search over your browser history.
Features:
- Reads history from Google Chrome, Microsoft Edge, and Mozilla Firefox automatically
- Embeds all history entries into an in-memory vector store on startup
- Performs semantic search and returns the top results ranked by similarity
- Displays execution time and memory consumption per query
Quick Start:
- Complete the Model Setup step so the .embedding_resources folder is present at the repository root.
- Open the solution and run EmbeddingGemma.DemoApp.
- Select a browser, choose a date range, and click Load to index your history.
- Type a query in natural language and click Search.