Skip to content

Add HSNW-IVF index options #49

@mtbarta

Description

@mtbarta

My system is currently under pressure while training a new model. Benchmarking the search critical path, it's the coarse quantizer that
takes the most time at a whopping 80%.

graph

HSNW-IVF will use HSNW to find the closest centroids and cut down on the latency.


Note: This work depends on removing the fully constructed distance matrix used in KnnNearestNeighbors.

Context: KnnNearestNeighbors helps PLAID by containing a knn distance matrix with all of the centroids. PLAID relies on this to take each document's code and lookup that code's distance.

We can always use a sparse matrix for this instead, or perhaps XTR's imputing missing distances method would work well.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions