cc-turboquant-index Sidecar Report

Date: `2026-04-22`

Crate:

```text
Desktop/Comp-Core/core/retrieval/cc-turboquant-index/
```

Purpose

This sidecar moves TurboQuant from the Python validation path into Rust packed-code candidate generation. The Python prototype validated distortion and recall, but it pre-dequantized the corpus into fp16 for search. The sidecar keeps rows bit-packed and estimates inner products directly from packed codes.
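To make "estimates inner products directly from packed codes" concrete, here is a minimal sketch. It assumes a plain uniform 4-bit scalar quantizer with a per-row scale and offset, which is a stand-in for illustration and not the crate's actual TurboQuant codebook or rotation. The names `PackedRow`, `pack_row`, and `estimate_dot` are hypothetical.

```rust
/// Hypothetical 4-bit row: two codes per byte plus a per-row scale and offset.
/// Assumption: a uniform scalar quantizer, not the crate's real codebook.
struct PackedRow {
    codes: Vec<u8>, // dim / 2 bytes, low nibble holds the even-indexed code
    scale: f32,
    offset: f32,
}

/// Quantize to 16 uniform levels spanning the row's [min, max], two codes per byte.
fn pack_row(v: &[f32]) -> PackedRow {
    assert!(v.len() % 2 == 0, "sketch assumes an even dimension");
    let min = v.iter().copied().fold(f32::INFINITY, f32::min);
    let max = v.iter().copied().fold(f32::NEG_INFINITY, f32::max);
    let scale = if max > min { (max - min) / 15.0 } else { 1.0 };
    let q = |x: f32| (((x - min) / scale).round() as u8).min(15);
    let codes = v.chunks_exact(2).map(|p| q(p[0]) | (q(p[1]) << 4)).collect();
    PackedRow { codes, scale, offset: min }
}

/// Estimate <query, row> without materializing the dequantized row:
/// sum_i q_i * (offset + scale * c_i) = offset * sum(q) + scale * sum(q_i * c_i).
fn estimate_dot(query: &[f32], query_sum: f32, row: &PackedRow) -> f32 {
    let mut acc = 0.0f32;
    for (byte, q) in row.codes.iter().zip(query.chunks_exact(2)) {
        acc += q[0] * (byte & 0x0F) as f32 + q[1] * (byte >> 4) as f32;
    }
    row.offset * query_sum + row.scale * acc
}
```

The query sum is computed once per query and reused for every row, so the per-row work is a single pass over the packed bytes. The corpus is never expanded to fp16 or fp32.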

Verification

```bash
cd Desktop/Comp-Core/core/retrieval/cc-turboquant-index
cargo test
```

Result: `4` library tests passed, along with the binary and doc-test harnesses.

Release Smoke Results

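| vectors | dim | bits | queries | recall@10 | per query ms | build ms | ratio fp32 | packed bytes |
|---|---|---|---|---|---|---|---|---|
| 4096 | 768 | 4 | 32 | 0.878125 | 4.802 | 21.643 | 5.953x | 2113536 |
| 4096 | 768 | 8 | 32 | 0.981250 | 4.321 | 22.658 | 2.988x | 4210688 |
| 32768 | 768 | 4 | 16 | 0.837500 | 37.417 | 168.610 | 5.953x | 16908288 |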
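The derived columns can be rechecked from the raw ones. The sketch below assumes that `ratio fp32` is raw fp32 storage (`vectors * dim * 4` bytes) divided by `packed bytes`, and that `recall@10` is the mean overlap between the approximate and exact top-10 lists. Both are readings of the column names, not definitions taken from the crate, and the function names are hypothetical.

```rust
/// Bytes needed to store `vectors x dim` as raw fp32.
fn fp32_bytes(vectors: u64, dim: u64) -> u64 {
    vectors * dim * 4
}

/// Compression ratio relative to fp32 (assumed meaning of the "ratio fp32" column).
fn ratio_vs_fp32(vectors: u64, dim: u64, packed_bytes: u64) -> f64 {
    fp32_bytes(vectors, dim) as f64 / packed_bytes as f64
}

/// Average stored bytes per row, including any per-row metadata.
fn bytes_per_row(vectors: u64, packed_bytes: u64) -> f64 {
    packed_bytes as f64 / vectors as f64
}

/// Mean fraction of the exact top-k ids that also appear in the approximate top-k.
/// Assumes every result list holds at least k ids.
fn recall_at_k(approx: &[Vec<usize>], exact: &[Vec<usize>], k: usize) -> f64 {
    let hits: usize = approx
        .iter()
        .zip(exact)
        .map(|(a, e)| a[..k].iter().filter(|id| e[..k].contains(id)).count())
        .sum();
    hits as f64 / (approx.len() * k) as f64
}
```

Under these assumptions the first row gives `12582912 / 2113536`, which is about 5.953, and 516 stored bytes per row. Of those, 384 bytes are the 4-bit codes for 768 dimensions; the report does not break down the remainder.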

JSON artifacts:

```text
Desktop/Comp-Core/benchmarks/agp-turboquant-ane/reports/cc-turboquant-index-4096x768-b4-20260422.json
Desktop/Comp-Core/benchmarks/agp-turboquant-ane/reports/cc-turboquant-index-4096x768-b8-20260422.json
Desktop/Comp-Core/benchmarks/agp-turboquant-ane/reports/cc-turboquant-index-32768x768-b4-20260422.json
```

Interpretation

Sidecar v0 supports the implementation direction: packed-code Rust scanning is faster than the Python/NumPy prototype on the same small synthetic shape while staying in the same quality regime. It also exposes the remaining performance wall. A scalar packed scan at `32768 x 768` already costs `37.417 ms/query`, so the final RAG++ target of hundreds of thousands of vectors requires blocked scanning, memory-mapped snapshots, exact rerank, and Apple Silicon SIMD.
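A back-of-envelope check of that wall, assuming scan latency grows linearly with corpus size (a simplification that ignores cache effects) and using a hypothetical target of 500,000 vectors:

```rust
/// Per-vector scan cost in microseconds implied by one smoke-run row.
fn us_per_vector(per_query_ms: f64, vectors: u64) -> f64 {
    per_query_ms * 1000.0 / vectors as f64
}

/// Linear extrapolation of per-query latency to a larger corpus.
/// Assumption: cost scales proportionally with the number of scanned rows.
fn projected_ms(per_query_ms: f64, vectors: u64, target_vectors: u64) -> f64 {
    per_query_ms * target_vectors as f64 / vectors as f64
}
```

With the 4-bit rows from the smoke results, `us_per_vector(37.417, 32768)` is about 1.14 µs and `us_per_vector(4.802, 4096)` is about 1.17 µs, so the scalar scan is already close to linear. `projected_ms(37.417, 32768, 500_000)` comes out near 571 ms per query, which is why the scalar path alone cannot reach the final target.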

This does not directly improve N'Ko ASR CER. It improves the AGP control plane around ASR by making retrieval candidates, semantic state packets, and transfer bottlenecks cheaper to move and search. CER improvement still comes from better ASR checkpoints plus bounded AGP correction decisions admitted by the Rust/Graph Kernel gate.

Promotion Decision

Attach run IDs, datasets, metrics, and reproduction commands.

Source Anchor

Comp-Core/benchmarks/agp-turboquant-ane/reports/cc-turboquant-index-sidecar-20260422.md
