API Reference#

Index#

class usearch.index.BatchMatches(keys: ndarray, distances: ndarray, counts: ndarray, visited_members: int = 0, computed_distances: int = 0)#

This class contains information about multiple retrieved vectors for multiple queries, i.e. it is a set of Matches instances.

computed_distances: int = 0#
count_matches(expected: ndarray, count: int | None = None) int#

Measures recall in the [0, len(expected)] range as the number of Matches that contain the corresponding expected entry anywhere among the results.

counts: ndarray#
distances: ndarray#
keys: ndarray#
mean_recall(expected: ndarray, count: int | None = None) float#

Measures recall in the [0, 1] range as the share of Matches that contain the corresponding expected entry anywhere among the results.

to_list() List[List[tuple]]#

Convert the result for each query to the list of tuples with information about its matches.

visited_members: int = 0#
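
A minimal sketch of evaluating a batch search with these helpers, assuming a small random dataset where every stored vector is also used as a query:

    import numpy as np
    from usearch.index import Index

    ndim, count = 96, 1_000
    keys = np.arange(count, dtype=np.uint64)
    vectors = np.random.rand(count, ndim).astype(np.float32)

    index = Index(ndim=ndim)
    index.add(keys, vectors)

    # Searching with a matrix of queries yields a BatchMatches
    batch_matches = index.search(vectors, 10)
    print(batch_matches.mean_recall(keys))    # share of queries whose own key was found
    print(batch_matches.count_matches(keys))  # absolute number of such queries
    print(batch_matches.to_list()[0])         # first query's matches as (key, distance) tuples
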
class usearch.index.Clustering(index: 'Index', matches: 'BatchMatches', queries: 'Optional[np.ndarray]' = None)#
property centroids_popularity: Tuple[ndarray, ndarray]#
members_of(centroid: uint64) ndarray#
property network#
plot_centroids_popularity()#
subcluster(centroid: uint64, **clustering_kwards) Clustering#
class usearch.index.CompiledMetric(pointer, kind, signature)#
kind: MetricKind#

Alias for field number 1

pointer: int#

Alias for field number 0

signature: MetricSignature#

Alias for field number 2
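
A sketch of building a custom metric with Numba and wrapping it in a CompiledMetric; it assumes the optional numba dependency is installed and that ndim matches the index it is attached to:

    from numba import cfunc, types, carray
    from usearch.index import Index, CompiledMetric, MetricKind, MetricSignature

    ndim = 256
    signature = types.float32(
        types.CPointer(types.float32),
        types.CPointer(types.float32))

    # Distance callback receiving raw pointers to two f32 vectors of known length
    @cfunc(signature)
    def inner_product_distance(a, b):
        a_array = carray(a, ndim)
        b_array = carray(b, ndim)
        c = 0.0
        for i in range(ndim):
            c += a_array[i] * b_array[i]
        return 1 - c

    metric = CompiledMetric(
        pointer=inner_product_distance.address,
        kind=MetricKind.IP,
        signature=MetricSignature.ArrayArray,
    )
    index = Index(ndim=ndim, metric=metric)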

class usearch.index.Index(*, ndim: int = 0, metric: str | ~usearch.compiled.MetricKind | ~usearch.index.CompiledMetric = <MetricKind.Cos: 99>, dtype: str | ~usearch.compiled.ScalarKind | None = None, connectivity: int | None = None, expansion_add: int | None = None, expansion_search: int | None = None, multi: bool = False, path: ~os.PathLike | None = None, view: bool = False, enable_key_lookups: bool = True)#

Fast vector-search engine for dense equi-dimensional embeddings.

Vector keys must be integers. Vectors must have the same number of dimensions within the index. Supports Inner Product, Cosine Distance, L^n measures like the Euclidean metric, as well as automatic downcasting to low-precision floating-point and integral representations.
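
A minimal construction sketch; the parameter values below are illustrative rather than defaults:

    from usearch.index import Index

    index = Index(
        ndim=256,             # dimensions per vector
        metric='cos',         # 'ip', 'l2sq', 'haversine', ... or a CompiledMetric
        dtype='f32',          # downcast storage to 'f16' or 'i8' if desired
        connectivity=16,      # optional: graph connectivity
        expansion_add=128,    # optional: indexing-time expansion factor
        expansion_search=64,  # optional: search-time expansion factor
    )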

add(keys: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview, vectors: ndarray | Iterable[ndarray] | memoryview, *, copy: bool = True, threads: int = 0, log: str | bool = False, progress: Callable[[int, int], bool] | None = None) int | ndarray#

Inserts one or more vectors into the index.

For maximal performance, the keys and vectors should conform to Python’s “buffer protocol” spec.

To index a single entry:

keys: int, vectors: np.ndarray.

To index many entries:

keys: np.ndarray, vectors: np.ndarray.

When working with extremely large indexes, you may want to pass copy=False, if you can guarantee the lifetime of the primary vector store for the duration of index construction.

Parameters:
  • keys (Optional[KeyOrKeysLike], can be None) – Unique identifier(s) for passed vectors

  • vectors (VectorOrVectorsLike) – Vector or a row-major matrix

  • copy (bool, defaults to True) – Should the index store a copy of vectors

  • threads (int, defaults to 0) – Optimal number of cores to use

  • log (Union[str, bool], defaults to False) – Whether to print the progress bar

  • progress (Optional[ProgressCallback], defaults to None) – Callback to report stats of the progress and control it

Returns:

Inserted key or keys

Type:

Union[int, np.ndarray]
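
A sketch of both calling conventions, continuing the 256-dimensional index constructed above:

    import numpy as np

    # Single entry: one integer key, one vector
    index.add(42, np.random.rand(256).astype(np.float32))

    # Many entries: one key per row of a row-major matrix
    keys = np.arange(1_000, 2_000, dtype=np.uint64)
    vectors = np.random.rand(1_000, 256).astype(np.float32)
    index.add(keys, vectors, copy=True, threads=0)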

property capacity: int#
clear()#

Erases all the vectors from the index, preserving the space for future insertions.

cluster(*, vectors: ndarray | None = None, keys: ndarray | None = None, min_count: int | None = None, max_count: int | None = None, threads: int = 0, log: str | bool = False, progress: Callable[[int, int], bool] | None = None) Clustering#

Clusters already indexed or provided vectors, mapping them to various centroids.

Parameters:
  • vectors (Optional[VectorOrVectorsLike]) – Externally provided vectors to cluster; if None, the already indexed vectors are clustered

  • max_count (Optional[int], defaults to None) – Upper bound on the number of clusters to produce

  • threads (int, defaults to 0) – Optimal number of cores to use

  • log (Union[str, bool], defaults to False) – Whether to print the progress bar

  • progress (Optional[ProgressCallback], defaults to None) – Callback to report stats of the progress and control it

Returns:

Clustering of the selected or indexed vectors

Return type:

Clustering
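
A sketch of clustering the members indexed above; the min_count and max_count values are illustrative:

    clustering = index.cluster(min_count=10, max_count=15)

    # Keys of the centroids and the number of members assigned to each
    centroid_keys, sizes = clustering.centroids_popularity

    # Keys of all members assigned to one of the centroids
    members = clustering.members_of(centroid_keys[0])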

property connectivity: int#
contains(keys: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview) bool | ndarray#
copy() Index#
count(keys: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview) int | ndarray#
property dtype: ScalarKind#
property expansion_add: int#
get(keys: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview, dtype: str | ScalarKind | None = None) ndarray | None | Tuple[ndarray | None]#

Looks up one or more keys from the Index, retrieving corresponding vectors.

Returns None, if a single key is requested and is not present. Returns a (row) vector, if the key maps to a single vector. Returns a (row-major) matrix, if the key maps to multiple vectors. If multiple keys are requested, composes many such responses into a tuple.

Parameters:

keys (KeyOrKeysLike) – One or more keys to lookup

Returns:

One or more keys lookup results

Return type:

Union[Optional[np.ndarray], Tuple[Optional[np.ndarray]]]
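
A short sketch, continuing the index populated above:

    import numpy as np

    vector = index.get(42)  # np.ndarray, or None if the key is absent
    several = index.get(np.array([42, 1_042, 1_043], dtype=np.uint64))  # tuple of arrays and/or None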

property hardware_acceleration: str#

Describes the kind of hardware-acceleration support used in that exact instance of the Index, for that metric kind, and the given number of dimensions.

Returns:

“auto”, if nothing is available; the ISA subset name otherwise

Return type:

str

property jit: bool#

True, if the provided metric was JIT-compiled.

Return type:

bool

join(other: Index, max_proposals: int = 0, exact: bool = False, progress: Callable[[int, int], bool] | None = None) Dict[uint64, uint64]#

Performs a “Semantic Join”, or pairwise matching, between self and the other index. Unlike search, no collisions are allowed in the resulting pairs. Uses the concept of “Stable Marriage” from combinatorics, famous for the 2012 Nobel Memorial Prize in Economic Sciences.

Parameters:
  • other (Index) – Another index.

  • max_proposals (int, optional) – Limit on candidates evaluated per vector, defaults to 0

  • exact (bool, optional) – Controls if underlying search should be exact, defaults to False

  • progress (Optional[ProgressCallback], defaults to None) – Callback to report stats of the progress and control it

Returns:

Mapping from keys of self to keys of other

Return type:

Dict[Key, Key]
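
A self-contained sketch joining two hypothetical indexes over the same embedding space; the tiny perturbation only makes the example non-trivial:

    import numpy as np
    from usearch.index import Index

    ndim = 32
    keys = np.arange(100, dtype=np.uint64)
    vectors = np.random.rand(100, ndim).astype(np.float32)

    first, second = Index(ndim=ndim), Index(ndim=ndim)
    first.add(keys, vectors)
    second.add(keys, vectors + 1e-3)  # slightly perturbed copies

    # Collision-free mapping from keys of `first` to keys of `second`
    mapping = first.join(second)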

property keys: IndexedKeys#
level_stats(level: int) IndexStats#

Get statistics for one level of the index - one graph.

Returns:

Statistics for one level of the index - one graph.

Return type:

_CompiledIndexStats

Statistics:
  • nodes (int): The number of nodes in that level.

  • edges (int): The number of edges in that level.

  • max_edges (int): The maximum possible number of edges in that level.

  • allocated_bytes (int): The amount of allocated memory for that level.

property levels_stats: List[IndexStats]#

Get the accumulated statistics for every level graph.

Returns:

Statistics for every level graph.

Return type:

List[_CompiledIndexStats]

Statistics:
  • nodes (int): The number of nodes in that level.

  • edges (int): The number of edges in that level.

  • max_edges (int): The maximum possible number of edges in that level.

  • allocated_bytes (int): The amount of allocated memory for that level.

load(path_or_buffer: str | PathLike | bytes | None = None, progress: Callable[[int, int], bool] | None = None)#
property max_level: int#
property memory_usage: int#
static metadata(path_or_buffer: str | PathLike | bytes) dict | None#
property metric: MetricKind | CompiledMetric#
property metric_kind: MetricKind | CompiledMetric#
property multi: bool#
property ndim: int#
property nlevels: int#
pairwise_distance(left: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview, right: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview) ndarray | float#
remove(keys: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview, *, compact: bool = False, threads: int = 0) int | ndarray#

Removes one or more vectors from the index.

When working with extremely large indexes, you may want to mark some entries as deleted instead of rebuilding a filtered index. In other cases, rebuilding is the recommended approach.

Parameters:
  • keys (KeyOrKeysLike) – Unique identifier for passed vectors, optional

  • compact (bool, optional) – Removes links to removed nodes (expensive), defaults to False

  • threads (int, optional) – Optimal number of cores to use, defaults to 0

Returns:

Number of removed vectors per key: an integer for a single key, an array of integers for many keys

Type:

Union[int, np.ndarray]

rename(from_: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview, to: uint64 | Iterable[uint64] | int | Iterable[int] | ndarray | memoryview) int | ndarray#

Rename existing member vector or vectors.

May be used in iterative clustering procedures, where one would iteratively relabel every vector with the name of the cluster an entry belongs to, until the system converges.

Parameters:
  • from_ (KeyOrKeysLike) – One or more keys to be renamed

  • to (KeyOrKeysLike) – New key or keys (of the same length as from_)

Returns:

Number of vectors that were found and renamed

Return type:

int
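
A short sketch of relabeling and then removing a member, continuing the index above:

    renamed = index.rename(42, 100_042)  # move the vector under a new key
    removed = index.remove(100_042)      # then drop it from the index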

reset()#

Erases all members from the index, closing files, and returning RAM to the OS.

static restore(path_or_buffer: str | PathLike | bytes, view: bool = False) Index | None#
save(path_or_buffer: str | PathLike | None = None, progress: Callable[[int, int], bool] | None = None) bytes | None#
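
A sketch of the serialization round-trip; the file name is arbitrary:

    index.save("index.usearch")                          # or `index.save()` to get `bytes`
    meta = Index.metadata("index.usearch")               # inspect a file without loading it
    copy = Index.restore("index.usearch")                # load into RAM
    mapped = Index.restore("index.usearch", view=True)   # or memory-map without copying
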
search(vectors: ndarray | Iterable[ndarray] | memoryview, count: int = 10, radius: float = inf, *, threads: int = 0, exact: bool = False, log: str | bool = False, progress: Callable[[int, int], bool] | None = None) Matches | BatchMatches#

Performs approximate nearest neighbors search for one or more queries.

Parameters:
  • vectors (VectorOrVectorsLike) – Query vector or vectors.

  • count (int, defaults to 10) – Upper limit on the number of matches to find

  • threads (int, defaults to 0) – Optimal number of cores to use

  • exact (bool, defaults to False) – Perform exhaustive linear-time exact search

  • log (Union[str, bool], optional) – Whether to print the progress bar, defaults to False

  • progress (Optional[ProgressCallback], defaults to None) – Callback to report stats of the progress and control it

Returns:

Matches for one or more queries

Return type:

Union[Matches, BatchMatches]
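
A sketch of single and batched queries, continuing the index above:

    import numpy as np

    query = np.random.rand(256).astype(np.float32)
    matches = index.search(query, 10)           # Matches
    print(matches[0].key, matches[0].distance)  # closest match
    print(matches.to_list())                    # all matches as (key, distance) tuples

    queries = np.random.rand(5, 256).astype(np.float32)
    batch = index.search(queries, 10)           # BatchMatches, one Matches per query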

property serialized_length: int#
property size: int#
property specs: Dict[str, str | int | bool]#
property stats: IndexStats#

Get the accumulated statistics for the entire multi-level graph.

Returns:

Statistics for the entire multi-level graph.

Return type:

_CompiledIndexStats

Statistics:
  • nodes (int): The number of nodes in that level.

  • edges (int): The number of edges in that level.

  • max_edges (int): The maximum possible number of edges in that level.

  • allocated_bytes (int): The amount of allocated memory for that level.
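
A short sketch of inspecting the graph, assuming a populated index:

    stats = index.stats  # the whole multi-level graph
    print(stats.nodes, stats.edges, stats.max_edges, stats.allocated_bytes)

    for level, level_stats in enumerate(index.levels_stats):
        print(level, level_stats.nodes, level_stats.edges)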

property vectors: ndarray#
view(path_or_buffer: str | PathLike | bytes | bytearray | None = None, progress: Callable[[int, int], bool] | None = None)#
class usearch.index.IndexedKeys(index: Index)#

Smart-reference for the range of keys present in a specific Index

class usearch.index.Indexes(indexes: Iterable[Index] = [], paths: Iterable[PathLike] = [], view: bool = False, threads: int = 0)#
merge(index: Index)#
merge_path(path: PathLike)#
search(vectors, count: int = 10, *, threads: int = 0, exact: bool = False, progress: Callable[[int, int], bool] | None = None)#
class usearch.index.Match(key: int, distance: float)#

This class contains information about a single retrieved vector.

distance: float#
key: int#
to_tuple() tuple#
class usearch.index.Matches(keys: ndarray, distances: ndarray, visited_members: int = 0, computed_distances: int = 0)#

This class contains information about multiple retrieved vectors for a single query, i.e. it is a set of Match instances.

computed_distances: int = 0#
distances: ndarray#
keys: ndarray#
to_list() List[tuple]#

Convert matches to a list of tuples, each containing a match’s key and the distance to it.

visited_members: int = 0#
usearch.index.search(dataset: ~numpy.ndarray, query: ~numpy.ndarray, count: int = 10, metric: str | ~usearch.compiled.MetricKind | ~usearch.index.CompiledMetric = <MetricKind.Cos: 99>, *, exact: bool = False, threads: int = 0, log: str | bool = False, progress: ~typing.Callable[[int, int], bool] | None = None) Matches | BatchMatches#

Shortcut for search that avoids index construction. Particularly useful for tiny datasets, where brute-force exact search is fast enough.

Parameters:
  • dataset (np.ndarray) – Row-major matrix.

  • query (np.ndarray) – Query vector or vectors (also row-major), to find in dataset.

  • count (int, optional) – Upper limit on the number of matches to find, defaults to 10

  • metric (MetricLike, defaults to MetricKind.Cos) – Kind of the distance function, or a Numba cfunc JIT-compiled object. Possible MetricKind values: IP, Cos, L2sq, Haversine, Pearson, Hamming, Tanimoto, Sorensen.

  • threads (int, optional) – Optimal number of cores to use, defaults to 0

  • exact (bool, optional) – Perform exhaustive linear-time exact search, defaults to False

  • log (Union[str, bool], optional) – Whether to print the progress bar, defaults to False

  • progress (Optional[ProgressCallback], defaults to None) – Callback to report stats of the progress and control it

Returns:

Matches for one or more queries

Return type:

Union[Matches, BatchMatches]
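
A self-contained sketch of index-free exact search over a random dataset:

    import numpy as np
    from usearch.index import search, MetricKind

    vectors = np.random.rand(10_000, 256).astype(np.float32)
    query = np.random.rand(256).astype(np.float32)

    one_in_many = search(vectors, query, 50, MetricKind.L2sq, exact=True)          # Matches
    many_in_many = search(vectors, vectors[:10], 50, MetricKind.L2sq, exact=True)  # BatchMatches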

IO#

usearch.io.guess_numpy_dtype_from_filename(filename) type | None#
usearch.io.load_matrix(filename: str, start_row: int = 0, count_rows: int | None = None, view: bool = False, dtype: type | None = None) ndarray | None#

Read *.ibin, *.bbin, *.hbin, *.fbin, *.dbin files with matrices.

Parameters:
  • filename – path to the matrix file

  • start_row – start reading vectors from this index

  • count_rows – number of vectors to read. If None, read all vectors

  • view – set to True to memory-map the file instead of loading to RAM

Returns:

parsed matrix

Return type:

numpy.ndarray

usearch.io.numpy_scalar_size(dtype) int#
usearch.io.save_matrix(vectors: ndarray, filename: str)#

Write *.ibin, *.bbin, *.hbin, *.fbin, *.dbin files with matrices.

Parameters:
  • vectors (numpy.ndarray) – the matrix to serialize

  • filename (str) – path to the matrix file
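
A round-trip sketch; by convention the file extension encodes the scalar type, e.g. *.fbin for float32:

    import numpy as np
    from usearch.io import save_matrix, load_matrix

    vectors = np.random.rand(1_000, 256).astype(np.float32)
    save_matrix(vectors, "vectors.fbin")

    reloaded = load_matrix("vectors.fbin")                              # copy into RAM
    window = load_matrix("vectors.fbin", start_row=100, count_rows=10)  # partial read
    mapped = load_matrix("vectors.fbin", view=True)                     # memory-map instead of loading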

Evaluation#

class usearch.eval.AddTask(keys: 'np.ndarray', vectors: 'np.ndarray')#
clusters(number_of_clusters: int) List[AddTask]#

Splits this dataset into smaller chunks.

property count#
inplace_shuffle()#

Reorders the vectors and keys. Often used for robustness benchmarks.

keys: ndarray#
property ndim#
slices(batch_size: int) List[AddTask]#

Splits this dataset into smaller chunks.

vectors: ndarray#
class usearch.eval.Dataset(keys: 'np.ndarray', vectors: 'np.ndarray', queries: 'np.ndarray', neighbors: 'np.ndarray')#
static build(vectors: str | None = None, queries: str | None = None, neighbors: str | None = None, count: int | None = None, ndim: int | None = None, k: int | None = None)#

Either loads an existing dataset from disk, or generates one on the fly.

Parameters:
  • vectors (Optional[str], optional) – Path to a file with dataset vectors, defaults to None

  • queries (Optional[str], optional) – Path to a file with query vectors, defaults to None

  • neighbors (Optional[str], optional) – Path to a file with ground-truth neighbor keys, defaults to None

  • count (Optional[int], optional) – Number of vectors to generate, if no files are given, defaults to None

  • ndim (Optional[int], optional) – Number of dimensions per generated vector, defaults to None

  • k (Optional[int], optional) – Number of neighbors to generate per query, defaults to None

crop_neighbors(k: int)#
keys: ndarray#
property ndim#
neighbors: ndarray#
queries: ndarray#
vectors: ndarray#
class usearch.eval.Evaluation(tasks: 'List[Union[AddTask, SearchTask]]', count: 'int', ndim: 'int')#
count: int#
static for_dataset(dataset: Dataset, batch_size: int = 0, clusters: int = 1) Evaluation#
ndim: int#
tasks: List[AddTask | SearchTask]#
class usearch.eval.SearchStats(index_size: int, count_queries: int, count_matches: int, visited_members: int, computed_distances: int)#

Contains statistics for one or more search runs, including the number of internal nodes that were fetched (visited_members) and the number of times the distance metric was invoked (computed_distances).

Other derivative metrics include mean_recall and mean_efficiency. Recall is the share of queried vectors that were successfully found. Efficiency describes the number of distances that had to be computed per query, normalized by the size of the index. The highest efficiency approaches one, the lowest is zero. The highest is achieved when the distance metric is computed just once per query; the lowest happens during exact search, when the distance to every present vector has to be computed.
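
The following sketch only illustrates that relationship; the formula is an assumption for illustration, not necessarily the library's exact internal computation:

    # Assumed reconstruction of mean efficiency: one minus the average number
    # of distance computations per query, normalized by the index size.
    def approximate_mean_efficiency(stats: "SearchStats") -> float:
        distances_per_query = stats.computed_distances / stats.count_queries
        return 1.0 - distances_per_query / stats.index_size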

computed_distances: int#
count_matches: int#
count_queries: int#
index_size: int#
property mean_efficiency: float#
property mean_recall: float#
visited_members: int#
class usearch.eval.SearchTask(queries: 'np.ndarray', neighbors: 'np.ndarray')#
neighbors: ndarray#
queries: ndarray#
slices(batch_size: int) List[SearchTask]#

Splits this dataset into smaller chunks.

class usearch.eval.TaskResult(add_operations: 'Optional[int]' = None, add_per_second: 'Optional[float]' = None, search_operations: 'Optional[int]' = None, search_per_second: 'Optional[float]' = None, recall_at_one: 'Optional[float]' = None)#
add_operations: int | None = None#
add_per_second: float | None = None#
property add_seconds: float#
recall_at_one: float | None = None#
search_operations: int | None = None#
search_per_second: float | None = None#
property search_seconds: float#
usearch.eval.dcg(relevances: ndarray, k: int | None = None) ndarray#

Calculate DCG (Discounted Cumulative Gain) up to position k.

Parameters:
  • relevances (np.ndarray) – True relevance scores, in the order in which they are ranked

  • k (int) – Position up to which DCG is computed

Returns:

The DCG score at position k

Return type:

float

usearch.eval.measure_seconds(f: Callable) Tuple[float, Any]#

Simple function-profiling helper.

Parameters:

f (Callable) – Function to be profiled

Returns:

Time elapsed in seconds and the result of the execution

Return type:

Tuple[float, Any]
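
A usage sketch; the timed callable here is arbitrary:

    from usearch.eval import measure_seconds

    seconds, result = measure_seconds(lambda: sum(range(1_000_000)))
    print(f"took {seconds:.4f}s, got {result}")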

usearch.eval.ndcg(relevances: ndarray, k: int | None = None) ndarray#

Calculate NDCG (Normalized Discounted Cumulative Gain) at position k.

Parameters:
  • relevances (np.ndarray) – True relevance scores, in the order in which they are ranked

  • k (int) – Position up to which NDCG is computed

Returns:

The NDCG score at position k

Return type:

float
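
A small worked example with hand-picked graded relevance scores:

    import numpy as np
    from usearch.eval import dcg, ndcg

    # Relevance of the top returned items, in ranked order
    relevances = np.array([3, 2, 3, 0, 1, 2], dtype=float)
    print(dcg(relevances, k=5))   # discounted cumulative gain over the first 5 positions
    print(ndcg(relevances, k=5))  # the same, normalized by the ideal ordering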

usearch.eval.random_vectors(count: int, metric: ~usearch.compiled.MetricKind = <MetricKind.IP: 105>, dtype: ~usearch.compiled.ScalarKind = <ScalarKind.F32: 11>, ndim: int | None = None, index: ~usearch.index.Index | None = None) ndarray#

Produces a collection of random vectors, normalized for the provided metric and matching the wanted dtype, both of which can be inferred from an existing index.

usearch.eval.relevance(expected: ndarray, predicted: ndarray, k: int | None = None) ndarray#

Calculate binary relevance scores for predicted keys against the expected ground-truth keys.

Parameters:
  • expected (np.ndarray) – ground-truth keys

  • predicted (np.ndarray) – predicted keys

usearch.eval.self_recall(index: Index, sample: float | int = 1.0, **kwargs) SearchStats#

Simplest benchmark of search quality: queries every existing member of the index to check that approximate search finds the point itself.

Parameters:
  • index (Index) – Non-empty pre-constructed index

  • sample (Union[float, int]) – Share (or number) of vectors to search, defaults to 1.0

Returns:

Evaluation report with key metrics

Return type:

SearchStats
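
A self-contained sketch over random vectors:

    import numpy as np
    from usearch.index import Index
    from usearch.eval import random_vectors, self_recall

    index = Index(ndim=96)
    vectors = random_vectors(count=10_000, ndim=96)
    index.add(np.arange(10_000, dtype=np.uint64), vectors)

    stats = self_recall(index, sample=1_000)  # query 1,000 of the present members
    print(stats.mean_recall, stats.mean_efficiency)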

Client#

Server#