Struct LazyFlatVectorData 

pub struct LazyFlatVectorData {
    pub dim: usize,
    pub num_vectors: usize,
    pub quantization: DenseVectorQuantization,
    pub doc_ids: Vec<(u32, u16)>,
    /* private fields */
}

Lazy flat vector data: doc_ids are held in memory and vectors are read on demand via range reads.

Only the doc_id index (~6 bytes/vector) is loaded into memory. Vector data stays on disk and is accessed via mmap-backed range reads. The element size depends on the quantization: 4 bytes per dimension for f32, 2 for f16, and 1 for uint8.
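
The per-vector storage size follows directly from the element sizes above. A minimal sketch of that arithmetic is shown here; the `Quantization` enum is a hypothetical stand-in, since the variant names of the real `DenseVectorQuantization` are not shown on this page.

```rust
/// Hypothetical stand-in for the crate's `DenseVectorQuantization`.
/// The real enum's variant names are an assumption here.
#[derive(Clone, Copy, Debug, PartialEq, Eq)]
pub enum Quantization {
    F32,
    F16,
    UInt8,
}

/// Bytes used to store one element (one dimension) of a vector.
pub fn element_size(q: Quantization) -> usize {
    match q {
        Quantization::F32 => 4,
        Quantization::F16 => 2,
        Quantization::UInt8 => 1,
    }
}

/// Bytes used to store one whole vector of `dim` elements.
pub fn vector_byte_size(dim: usize, q: Quantization) -> usize {
    dim * element_size(q)
}
```

For example, under these assumptions a 768-dimensional vector occupies 3072 bytes as f32, 1536 bytes as f16, and 768 bytes as uint8.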

Used for:

  • Brute-force search (batched scoring with native-precision SIMD)
  • Reranking (read individual vectors by doc_id via binary search)
  • doc() hydration (dequantize to f32 for stored documents)
  • Merge streaming (chunked raw vector bytes + doc_id iteration)

Fields§

§dim: usize

Vector dimension

§num_vectors: usize

Total number of vectors

§quantization: DenseVectorQuantization

Storage quantization type

§doc_ids: Vec<(u32, u16)>

In-memory doc_id index: (doc_id, ordinal) per vector

Implementations§

impl LazyFlatVectorData

pub async fn open(handle: LazyFileSlice) -> Result<Self>

Open from a lazy file slice pointing to the flat binary data region.

Reads the 16-byte header and the doc_id index (~6 bytes/vector) into memory. Vector data stays on disk and is read lazily.

pub async fn read_vector_into(&self, idx: usize, out: &mut [f32]) -> Result<()>

Read a single vector by index, dequantized to f32.

out must have length >= self.dim. Returns Ok(()) on success. Used for ANN training and doc() hydration where f32 is needed.
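
The buffer contract above can be illustrated with a small standalone sketch. It covers only the f32 case and assumes the stored bytes are little-endian, which this page does not state; `decode_f32_le_into` is a hypothetical helper, not part of the crate.

```rust
/// Decode `dim` f32 values from `raw` into the front of `out`.
/// Assumption: values are stored little-endian (not confirmed by the docs).
/// Returns Err if either buffer is too small; extra space in `out` is left untouched.
pub fn decode_f32_le_into(raw: &[u8], dim: usize, out: &mut [f32]) -> Result<(), String> {
    if out.len() < dim {
        return Err(format!("out has length {}, need at least {}", out.len(), dim));
    }
    if raw.len() < dim * 4 {
        return Err(format!("raw has {} bytes, need at least {}", raw.len(), dim * 4));
    }
    // Each f32 occupies exactly 4 bytes; `take(dim)` ignores trailing bytes.
    for (dst, chunk) in out.iter_mut().zip(raw.chunks_exact(4)).take(dim) {
        *dst = f32::from_le_bytes([chunk[0], chunk[1], chunk[2], chunk[3]]);
    }
    Ok(())
}
```

Writing into a caller-provided buffer lets a loop over many vectors reuse one allocation, which is the usual reason to prefer this shape over a method that returns a new Vec.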

pub async fn get_vector(&self, idx: usize) -> Result<Vec<f32>>

Read a single vector by index, dequantized to f32 (allocates a new Vec).

pub async fn read_vector_raw_into(&self, idx: usize, out: &mut [u8]) -> Result<()>

Read a single vector’s raw bytes (no dequantization) into a caller-provided buffer.

out must have length >= self.vector_byte_size(). Used for native-precision reranking where raw quantized bytes are scored directly.

pub async fn read_vectors_batch(&self, start_idx: usize, count: usize) -> Result<OwnedBytes>

Read a contiguous batch of raw quantized bytes by index range.

Returns raw bytes for the vectors in the half-open index range [start_idx, start_idx + count). Bytes are in the native quantized format; pass them to batch_cosine_scores_f16/u8 or batch_cosine_scores (for f32) for scoring.
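
A contiguous batch read reduces to a single byte range. The sketch below shows that computation under two assumptions not confirmed on this page: vectors are stored back to back in flat-index order, and each occupies exactly `vector_byte_size` bytes. `batch_byte_range` is a hypothetical helper.

```rust
use std::ops::Range;

/// Byte range covering vectors [start_idx, start_idx + count).
/// Returns None if the batch runs past `num_vectors` or the arithmetic overflows.
pub fn batch_byte_range(
    vectors_byte_offset: u64,
    vector_byte_size: usize,
    num_vectors: usize,
    start_idx: usize,
    count: usize,
) -> Option<Range<u64>> {
    let end_idx = start_idx.checked_add(count)?;
    if end_idx > num_vectors {
        return None;
    }
    let size = vector_byte_size as u64;
    // Offsets are relative to the start of the file slice.
    let start = vectors_byte_offset.checked_add((start_idx as u64).checked_mul(size)?)?;
    let end = vectors_byte_offset.checked_add((end_idx as u64).checked_mul(size)?)?;
    Some(start..end)
}
```

Because the whole batch is one range, a brute-force scan can score many vectors per read instead of issuing one read per vector.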

pub fn flat_indexes_for_doc(&self, doc_id: u32) -> (usize, &[(u32, u16)])

Find flat indexes for a given doc_id via binary search on sorted doc_ids.

doc_ids are sorted by (doc_id, ordinal) because the segment builder adds docs sequentially.

Returns (start_index, slice), where slice holds the (doc_id, ordinal) entries for doc_id and start_index is the position of the first entry in self.doc_ids. The flat vector index of the i-th entry in slice is therefore start_index + i.
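
The lookup can be sketched with the standard library's `partition_point`, which performs a binary search on a partitioned slice. This is an illustration of the technique, not the crate's actual implementation; `find_doc_range` is a hypothetical free function operating on a slice shaped like `doc_ids`.

```rust
/// Find all entries for `doc_id` in a slice sorted by (doc_id, ordinal).
/// Returns (start_index, entries); `entries` is empty if the doc has no vectors.
pub fn find_doc_range(doc_ids: &[(u32, u16)], doc_id: u32) -> (usize, &[(u32, u16)]) {
    // First position whose doc_id is >= the target.
    let start = doc_ids.partition_point(|&(id, _)| id < doc_id);
    // Length of the run of matching entries starting at `start`.
    let len = doc_ids[start..].partition_point(|&(id, _)| id == doc_id);
    (start, &doc_ids[start..start + len])
}
```

With this shape, the flat index of the i-th returned entry is `start_index + i`, so a reranker can go straight from a doc_id to the vector indexes it needs to read.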

pub fn get_doc_id(&self, idx: usize) -> (u32, u16)

Get doc_id and ordinal at index (from in-memory index).

pub fn vector_byte_size(&self) -> usize

Bytes per vector in storage (dim times the quantized element size).

pub fn vector_bytes_len(&self) -> u64

Total byte length of raw vector data (for chunked merger streaming).

pub fn vectors_byte_offset(&self) -> u64

Byte offset where vector data starts (for direct handle access in merger).

pub fn handle(&self) -> &LazyFileSlice

Access the underlying lazy file handle (for chunked byte-range reads in merger).
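
A merger that streams the vector region in chunks needs a sequence of byte ranges starting at vectors_byte_offset() and spanning vector_bytes_len() bytes. The sketch below generates such ranges; `chunk_ranges` is a hypothetical helper, and the read call on LazyFileSlice is omitted because its API is not shown on this page.

```rust
use std::ops::Range;

/// Split the region [start, start + total_len) into consecutive ranges
/// of at most `chunk_size` bytes. The last range may be shorter.
pub fn chunk_ranges(start: u64, total_len: u64, chunk_size: u64) -> impl Iterator<Item = Range<u64>> {
    assert!(chunk_size > 0, "chunk_size must be non-zero");
    let end = start.saturating_add(total_len);
    let mut pos = start;
    std::iter::from_fn(move || {
        if pos >= end {
            return None;
        }
        let next = end.min(pos.saturating_add(chunk_size));
        let range = pos..next;
        pos = next;
        Some(range)
    })
}
```

Choosing a chunk size that is a multiple of vector_byte_size() keeps every chunk aligned to whole vectors, which may simplify pairing the bytes with doc_id iteration.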

pub fn estimated_memory_bytes(&self) -> usize

Estimated memory usage (only doc_ids are in memory).
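
One plausible way to produce such an estimate is to multiply the capacity of the doc_ids Vec by the in-memory size of one entry. This is a sketch under that assumption, since how the crate actually computes the figure is not shown here; `doc_id_index_bytes` is a hypothetical helper.

```rust
use std::mem::size_of;

/// Approximate heap bytes held by a doc_id index.
/// Uses capacity rather than length, since the allocation covers the full capacity.
pub fn doc_id_index_bytes(doc_ids: &Vec<(u32, u16)>) -> usize {
    doc_ids.capacity() * size_of::<(u32, u16)>()
}
```

Note that a (u32, u16) tuple carries 6 bytes of payload, but the compiler may pad it for alignment (commonly to 8 bytes), so an in-memory estimate can come out somewhat higher than 6 bytes per vector.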

Trait Implementations§

impl Clone for LazyFlatVectorData

fn clone(&self) -> LazyFlatVectorData

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for LazyFlatVectorData

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.
