Rust-annie

A lightning-fast, Rust-powered Approximate Nearest Neighbor library for Python with multiple backends, thread-safety, and GPU acceleration.

Features

Multiple Backends:
- Brute-force (exact) with SIMD acceleration for guaranteed accuracy
- HNSW (approximate) for large-scale datasets with near-constant memory
Multiple Distance Metrics: Euclidean, Cosine, Manhattan, Chebyshev
Batch Queries for efficient processing of multiple vectors
Thread-safe indexes with concurrent access patterns
Zero-copy NumPy integration for minimal memory overhead
On-disk Persistence with serialization/deserialization
Filtered Search with custom Python callbacks and metadata
GPU Acceleration for brute-force calculations on NVIDIA GPUs
Multi-platform support (Linux, Windows, macOS)
Automated CI/CD with benchmarking and performance tracking

Installation

From PyPI (Recommended)

# Stable release
pip install rust-annie

# With GPU support (requires CUDA Toolkit)
pip install rust-annie[gpu]

From Source

git clone https://github.com/arnavk23/Annie.git
cd Annie
pip install maturin
maturin develop --release

Quick Start

Brute-Force Index (Exact Search)

import numpy as np
from rust_annie import AnnIndex, Distance

# Create index
index = AnnIndex(128, Distance.EUCLIDEAN)

# Add data
data = np.random.rand(1000, 128).astype(np.float32)
ids = np.arange(1000, dtype=np.int64)
index.add(data, ids)

# Search
query = np.random.rand(128).astype(np.float32)
neighbor_ids, distances = index.search(query, k=5)
print(f"Top 5 neighbors: {neighbor_ids}")
print(f"Distances: {distances}")

HNSW Index (Approximate, Scalable)

from rust_annie import PyHnswIndex
import numpy as np

# Create index
index = PyHnswIndex(dims=128)

# Add data
data = np.random.rand(10000, 128).astype(np.float32)
ids = np.arange(10000, dtype=np.int64)
index.add(data, ids)

# Search
query = np.random.rand(128).astype(np.float32)
neighbor_ids, distances = index.search(query, k=10)
print(f"Approximate neighbors: {neighbor_ids}")

Examples

Batch Queries

from rust_annie import AnnIndex, Distance
import numpy as np

index = AnnIndex(16, Distance.EUCLIDEAN)
data = np.random.rand(1000, 16).astype(np.float32)
ids = np.arange(1000, dtype=np.int64)
index.add(data, ids)

# Batch search (32 queries at once)
queries = data[:32]
labels_batch, dists_batch = index.search_batch(queries, k=10)
print(labels_batch.shape)  # (32, 10)

Thread-Safe Index

from rust_annie import ThreadSafeAnnIndex, Distance
import numpy as np
from concurrent.futures import ThreadPoolExecutor

# Create thread-safe index
index = ThreadSafeAnnIndex(32, Distance.EUCLIDEAN)
data = np.random.rand(500, 32).astype(np.float32)
ids = np.arange(500, dtype=np.int64)
index.add(data, ids)

# Concurrent searches
def search_task(q):
    return index.search(q, k=5)

with ThreadPoolExecutor(max_workers=8) as executor:
    futures = [executor.submit(search_task, data[i]) for i in range(8)]
    results = [f.result() for f in futures]

Filtered Search

from rust_annie import AnnIndex, Distance
import numpy as np

index = AnnIndex(3, Distance.EUCLIDEAN)
data = np.array([
    [1.0, 2.0, 3.0],
    [4.0, 5.0, 6.0],
    [7.0, 8.0, 9.0]
], dtype=np.float32)
ids = np.array([10, 20, 30], dtype=np.int64)
index.add(data, ids)

# Filter function
def even_ids(id: int) -> bool:
    return id % 2 == 0

# Filtered search
query = np.array([1.0, 2.0, 3.0], dtype=np.float32)
filtered_ids, filtered_dists = index.search_filter_py(query, k=3, filter_fn=even_ids)
print(filtered_ids)  # [10, 30] (20 is filtered out)

Persistence (Save/Load)

from rust_annie import AnnIndex, Distance
import numpy as np

# Create and populate index
index = AnnIndex(64, Distance.COSINE)
data = np.random.rand(5000, 64).astype(np.float32)
ids = np.arange(5000, dtype=np.int64)
index.add(data, ids)

# Save to disk
index.save("my_index.bin")

# Load later
loaded_index = AnnIndex.load("my_index.bin")
query = np.random.rand(64).astype(np.float32)
neighbors, distances = loaded_index.search(query, k=5)

Benchmark Results

Single Query Performance

Operation	Dataset	Time	Speedup
Single Query	10k × 64	0.7 ms	4× vs NumPy
Batch Query (64)	10k × 64	0.23 ms per query	12× vs NumPy
HNSW Query	100k × 128	0.05 ms	56× vs NumPy

See the Live Benchmark Dashboard for continuous performance tracking across versions.

API Reference

AnnIndex

Brute-force exact nearest neighbor search.

AnnIndex(dim: int, metric: Distance)

Methods:

add(data: np.ndarray[N×D], ids: np.ndarray[N]) -> None - Add vectors to index
search(query: np.ndarray[D], k: int) -> (ids, distances) - Single query search
search_batch(queries: np.ndarray[N×D], k: int) -> (ids, distances) - Batch search
search_filter_py(query: np.ndarray[D], k: int, filter_fn: Callable) -> (ids, distances) - Filtered search
remove(ids: Sequence[int]) -> None - Remove vectors by ID
save(path: str) -> None - Serialize to disk
load(path: str) -> AnnIndex - Load from disk (static method)

PyHnswIndex

Hierarchical Navigable Small World (HNSW) approximate search.

PyHnswIndex(dims: int, ef_construction: int = 200, M: int = 5)

Methods:

add(data: np.ndarray[N×D], ids: np.ndarray[N]) -> None - Add vectors
search(query: np.ndarray[D], k: int, ef: int = 200) -> (ids, distances) - Search
save(path: str) -> None - Serialize
load(path: str) -> PyHnswIndex - Load (static method)

ThreadSafeAnnIndex

Thread-safe wrapper for concurrent access.

ThreadSafeAnnIndex(dim: int, metric: Distance)

Same API as AnnIndex, safe for multi-threaded use.

Distance

Enum for distance metrics:

Distance.EUCLIDEAN - L2 distance
Distance.COSINE - Cosine similarity
Distance.MANHATTAN - L1 distance
Distance.CHEBYSHEV - L∞ distance

Performance

Single-Query Overhead

For small queries, Python function call overhead dominates. Use .search_batch() for multiple vectors.

Batch Query Efficiency

Process 64+ queries together for near-optimal throughput:

# ✓ Efficient: amortizes overhead
ids, dists = index.search_batch(queries_1000, k=5)

# ✗ Inefficient: repeats overhead
for q in queries_1000:
    ids, dists = index.search(q, k=5)

Memory Usage

Brute-force: O(N·D) where N=vectors, D=dimensions
HNSW: ~O(N·D + N·M) where M=connectivity parameter (~5-15)

GPU Acceleration

Requirements

NVIDIA GPU with CUDA compute capability 5.0+
CUDA Toolkit 11.0+ installed
cuBLAS libraries available

Build with GPU Support

# From source with GPU
maturin develop --release --features gpu

# Or install pre-built wheels with GPU
pip install rust-annie[gpu]

GPU Usage

Automatically used for:

Batch L2 distance calculations
High-dimensional searches (D > 256)

from rust_annie import AnnIndex, Distance

# GPU acceleration is automatic for large batches
index = AnnIndex(512, Distance.EUCLIDEAN)
data = np.random.rand(100000, 512).astype(np.float32)
index.add(data, np.arange(len(data), dtype=np.int64))

# This will use GPU if available and beneficial
neighbors, distances = index.search_batch(queries, k=10)

Development

Local Setup

git clone https://github.com/arnavk23/Annie.git
cd Annie

# Install development dependencies
pip install maturin
cargo install cargo-watch

# Build and test
maturin develop
pytest tests/

Running Tests

# Rust tests
cargo test --all

# Python tests
pytest tests/ -v

# Benchmarks
python scripts/benchmark.py --dataset medium
python scripts/dashboard.py

License

This project is licensed under the MIT License. See LICENSE for details.

Name		Name	Last commit message	Last commit date
Latest commit History 1,026 Commits
.github		.github
benches		benches
benchmarks		benchmarks
docs		docs
fuzz		fuzz
rust_annie_macros/foo		rust_annie_macros/foo
scripts		scripts
src		src
tests		tests
.gitignore		.gitignore
Cargo.lock		Cargo.lock
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
pyproject.toml		pyproject.toml

License

Programmers-Paradise/Annie

Folders and files

Latest commit

History

Repository files navigation

Rust-annie

Table of Contents

Features

Installation

From PyPI (Recommended)

From Source

Quick Start

Brute-Force Index (Exact Search)

HNSW Index (Approximate, Scalable)

Examples

Batch Queries

Thread-Safe Index

Filtered Search

Persistence (Save/Load)

Benchmark Results

Single Query Performance

API Reference

AnnIndex

PyHnswIndex

ThreadSafeAnnIndex

Distance

Performance

Single-Query Overhead

Batch Query Efficiency

Memory Usage

GPU Acceleration

Requirements

Build with GPU Support

GPU Usage

Development

Local Setup

Running Tests

License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases 8

Uh oh!

Contributors 19

Uh oh!

Languages