RyanCodrai/turbovec — TurboQuant 向量检索库（9.9k Stars）

A vector index built on TurboQuant, written in Rust with Python bindings. 10M documents in 31GB RAM → 4GB. Faster than FAISS.

核心痛点解决

RAG 系统本地部署时内存瓶颈。TurboQuant 思路通过底层数据压缩技术，保证检索精度的同时大幅降低高维向量存储与 RAM 占用。

核心数据

1000万文档语料：FP32 需要 31GB RAM → turbovec 仅需 4GB（16x 压缩）
ARM（Apple M3 Max）：比 FAISS IndexPQFastScan 快 12-20%
x86（Intel Xeon）：4-bit 配置全面胜出，2-bit 几乎追平

工作原理（6步）

Normalize — 去掉向量长度，存为单独 float
Random rotation — 乘同一随机正交矩阵，使每坐标独立服从已知分布
Per-coordinate calibration (TQ+) — 首步 add 时拟合两个标量（shift + scale）
Lloyd-Max scalar quantization — 预计算最优分桶（2-bit=4桶，4-bit=16桶）
Bit-pack — 1536 维向量：6144 bytes → 384 bytes（2-bit）
Length-renormalized scoring — 修正量化导致的内积低估

Python 示例

python

from turbovec import TurboQuantIndex
index = TurboQuantIndex(dim=1536, bit_width=4)
index.add(vectors)
scores, indices = index.search(query, k=10)
index.write("my_index.tq")

框架集成

LangChain（pip install turbovec[langchain]）
LlamaIndex（pip install turbovec[llama-index]）
Haystack
Agno

数据

9.9k Stars · 849 Forks · 148 Commits
MIT License
基于 Google Research TurboQuant（ICLR 2026）

RyanCodrai/turbovec — TurboQuant 向量检索库（9.9k Stars） ​

核心痛点解决 ​

核心数据 ​

工作原理（6步） ​

Python 示例 ​

框架集成 ​

数据 ​

RyanCodrai/turbovec — TurboQuant 向量检索库（9.9k Stars）

核心痛点解决

核心数据

工作原理（6步）

Python 示例

框架集成

数据