🔑 vector-kv

A vector key-value store with semantic search. Stores text content with vector embeddings and retrieves entries by cosine similarity. 🧠

Built with Go, ONNX Runtime (all-MiniLM-L6-v2), and PostgreSQL with pgvector.

📡 API

📝 Store content

POST /:key
Body: plain text content
Header: X-Metadata (optional) - searchable metadata string
Header: X-Chunk (optional)    - chunk number (integer, default 0)

Embeds the body text and stores it under the given key. Optionally attach metadata via X-Metadata for later filtering and a chunk number via X-Chunk for ordering. Returns 201 Created.

🔍 Query by similarity

GET /:key?q=<query>&m=<metadata>&k=<limit>

Search entries by semantic query (q), metadata filter (m), or both. At least one of q or m is required. Metadata filtering uses case-insensitive substring matching. Results are ordered by chunk number, then by distance. k defaults to 10 if not provided.

[
  {"content": "some stored text", "distance": 0.25, "chunk": 1, "metadata": "source=docs"},
  {"content": "another entry", "distance": 0.61, "chunk": 2, "metadata": "source=docs"}
]

🗑️ Delete entries

DELETE /:key

Deletes all entries for the given key. Returns 204 No Content.

🗂️ List keys

GET /keys

Returns a JSON array of all distinct keys that have stored entries.

["my-docs", "notes", "recipes"]

🚀 Build and Deploy

Requires Docker and a local Kubernetes cluster with Helm.

# Build and push the Docker image to the local registry
bin/build

# Deploy to the vector-kv namespace
bin/deploy

The service is exposed via NodePort on port 30080.

# Store content
curl -X POST http://<node-ip>:30080/my-key -d "Some text to store"

# Store content with metadata
curl -X POST http://<node-ip>:30080/my-key -H "X-Metadata: source=docs" -d "Some text to store"

# Query by similarity (top 5 results)
curl "http://<node-ip>:30080/my-key?q=search+terms&k=5"

# Query by metadata only
curl "http://<node-ip>:30080/my-key?m=source=docs&k=5"

# Query by both similarity and metadata
curl "http://<node-ip>:30080/my-key?q=search+terms&m=source=docs&k=5"

# List all keys
curl http://<node-ip>:30080/keys

# Delete a key
curl -X DELETE http://<node-ip>:30080/my-key

💻 CLI

A command-line client for interacting with the vector-kv server.

Install

bin/install-cli

Builds and installs vector-kv to ~/.local/bin/.

Setup

vector-kv config set-url http://<node-ip>:30080

Commands

# List all keys
vector-kv keys

# Store a value (inline or from stdin)
vector-kv set my-key "Some text to store"
cat file.txt | vector-kv set my-key

# Store with metadata
vector-kv set my-key "Some text" --meta "source=docs"
cat file.txt | vector-kv set my-key --meta "source=docs"

# Semantic search (top 5 results)
vector-kv get my-key -q "search terms" -k 5

# Search by metadata only
vector-kv get my-key -m "source=docs"

# Search by both query and metadata
vector-kv get my-key -q "search terms" -m "source=docs" -k 5

# Delete a key
vector-kv delete my-key

# Index a folder (with optional glob filter, metadata, and dry-run)
vector-kv index my-key ./docs --glob "*.md" --dry-run
vector-kv index my-key ./docs --glob "*.md"
vector-kv index my-key ./docs --glob "*.md" --meta "project=myapp"

# Configure chunking (applies to set and index)
vector-kv config set-chunk-size 800
vector-kv config set-chunk-overlap 200

# Show current config
vector-kv config show

All HTTP requests retry up to 3 times with exponential backoff on network errors and 5xx server responses.

🏗️ Architecture

🖥️ Go HTTP service — handles API requests, runs embeddings in-process via ONNX Runtime, stores/queries vectors in PostgreSQL
🤖 all-MiniLM-L6-v2 — sentence transformer model (384-dimensional embeddings), loaded as ONNX and executed with yalue/onnxruntime_go
🐘 PostgreSQL + pgvector — vector storage with HNSW index for fast approximate nearest neighbor search (50GB storage)
✂️ Pure Go tokenizer — WordPiece tokenizer for BERT-style tokenization, no CGO dependency

⚙️ Configuration

Environment variables (with defaults):

Variable	Default	Description
`DATABASE_URL`	`postgres://vectorkv:vectorkv@localhost:5432/vectorkv?sslmode=disable`	PostgreSQL connection string
`MODEL_PATH`	`/model/model.onnx`	Path to ONNX model file
`VOCAB_PATH`	`/model/vocab.txt`	Path to WordPiece vocabulary
`ORT_LIB_PATH`	`/usr/lib/libonnxruntime.so`	Path to ONNX Runtime shared library
`LISTEN_ADDR`	`:8080`	HTTP listen address

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
bin		bin
chart		chart
cmd		cmd
internal		internal
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🔑 vector-kv

📡 API

📝 Store content

🔍 Query by similarity

🗑️ Delete entries

🗂️ List keys

🚀 Build and Deploy

💻 CLI

Install

Setup

Commands

🏗️ Architecture

⚙️ Configuration

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🔑 vector-kv

📡 API

📝 Store content

🔍 Query by similarity

🗑️ Delete entries

🗂️ List keys

🚀 Build and Deploy

💻 CLI

Install

Setup

Commands

🏗️ Architecture

⚙️ Configuration

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages