Struct lance_encoding::format::pb::List   
source · pub struct List {
    pub offsets: Option<Box<ArrayEncoding>>,
    pub null_offset_adjustment: u64,
    pub num_items: u64,
}Expand description
An array encoding for variable-length list fields
Fields§
§offsets: Option<Box<ArrayEncoding>>An array containing the offsets into an items array.
This array will have num_rows items and will never have nulls.
If the list at index i is not null then offsets[i] will
contain base + len(list) where base is defined as:
i == 0: 0
i >  0: (offsets[i-1] % null_offset_adjustment)
To help understand we can consider the following example list: [ [A, B], null, [], [C, D, E] ]
The offsets will be [2, ?, 2, 5]
If the incoming list at index i IS null then offsets[i] will
contain base + len(list) + null_offset_adjustment where base
is defined the same as above.
To complete the above example let’s assume that null_offset_adjustment
is 7.  Then the offsets will be [2, 9, 2, 5]
If there are no nulls then the offsets we write here are exactly the same as the offsets in an Arrow list array (except we omit the leading 0 which is redundant)
The reason we do this is so that reading a single list at index i only requires us to load the indices at i and i-1.
If the offset at index i is greater than `null_offset_adjustment`` then the list at index i is null.
Otherwise the length of the list is offsets\[i\] - base where
base is defined the same as above.
Let’s consider our example offsets: [2, 9, 2, 5]
We can take any range of lists and determine how many list items are referenced by the sublist.
0..3: [, 5] -> items 0..5 (base = 0* and end is 5) 0..2: [, 2] -> items 0..2 (base = 0* and end is 2) 0..1: [_, 9] -> items 0..2 (base = 0* and end is 9 % 7) 1..3: [2, 5] -> items 2..5 (base = 2 and end is 5) 1..2: [2, 2] -> items 2..2 (base = 2 and end is 2) 2..3: [9, 5] -> items 2..5 (base = 9 % 7 and end is 5)
- When the start of our range is the 0th item the base is always 0 and we only need to load a single index from disk to determine the range.
The data type of the offsets array is flexible and does not need to match the data type of the destination array. Please note that the offsets array is very likely to be efficiently encoded by bit packing deltas.
null_offset_adjustment: u64If a list is null then we add this value to the offset
This value must be greater than the length of the items so that (offset + null_offset_adjustment) is never used by a non-null list.
Note that this value cannot be equal to the length of the items because then a page with a single list would store [ X ] and we couldn’t know if that is a null list or a list with X items.
Therefore, the best choice for this value is 1 + # of items. Choosing this will maximize the bit packing that we can apply to the offsets.
num_items: u64How many items are referenced by these offsets. This is needed in order to determine which items pages map to this offsets page.
Trait Implementations§
source§impl Message for List
 
impl Message for List
source§fn encoded_len(&self) -> usize
 
fn encoded_len(&self) -> usize
source§fn encode<B>(&self, buf: &mut B) -> Result<(), EncodeError>
 
fn encode<B>(&self, buf: &mut B) -> Result<(), EncodeError>
source§fn encode_to_vec(&self) -> Vec<u8>where
    Self: Sized,
 
fn encode_to_vec(&self) -> Vec<u8>where
    Self: Sized,
source§fn encode_length_delimited<B>(&self, buf: &mut B) -> Result<(), EncodeError>
 
fn encode_length_delimited<B>(&self, buf: &mut B) -> Result<(), EncodeError>
source§fn encode_length_delimited_to_vec(&self) -> Vec<u8>where
    Self: Sized,
 
fn encode_length_delimited_to_vec(&self) -> Vec<u8>where
    Self: Sized,
source§fn decode<B>(buf: B) -> Result<Self, DecodeError>
 
fn decode<B>(buf: B) -> Result<Self, DecodeError>
source§fn decode_length_delimited<B>(buf: B) -> Result<Self, DecodeError>
 
fn decode_length_delimited<B>(buf: B) -> Result<Self, DecodeError>
source§fn merge<B>(&mut self, buf: B) -> Result<(), DecodeError>
 
fn merge<B>(&mut self, buf: B) -> Result<(), DecodeError>
self. Read moresource§fn merge_length_delimited<B>(&mut self, buf: B) -> Result<(), DecodeError>
 
fn merge_length_delimited<B>(&mut self, buf: B) -> Result<(), DecodeError>
self.source§impl Name for List
 
impl Name for List
source§const NAME: &'static str = "List"
 
const NAME: &'static str = "List"
Message.
This name is the same as it appears in the source .proto file, e.g. FooBar.source§const PACKAGE: &'static str = "lance.encodings"
 
const PACKAGE: &'static str = "lance.encodings"
., e.g. google.protobuf.source§fn full_name() -> String
 
fn full_name() -> String
Message.
It’s prefixed with the package name and names of any parent messages,
e.g. google.rpc.BadRequest.FieldViolation.
By default, this is the package name followed by the message name.
Fully-qualified names must be unique within a domain of Type URLs.source§impl PartialEq for List
 
impl PartialEq for List
impl StructuralPartialEq for List
Auto Trait Implementations§
impl Freeze for List
impl RefUnwindSafe for List
impl Send for List
impl Sync for List
impl Unpin for List
impl UnwindSafe for List
Blanket Implementations§
source§impl<T> BorrowMut<T> for Twhere
    T: ?Sized,
 
impl<T> BorrowMut<T> for Twhere
    T: ?Sized,
source§fn borrow_mut(&mut self) -> &mut T
 
fn borrow_mut(&mut self) -> &mut T
source§impl<T> Instrument for T
 
impl<T> Instrument for T
source§fn instrument(self, span: Span) -> Instrumented<Self>
 
fn instrument(self, span: Span) -> Instrumented<Self>
source§fn in_current_span(self) -> Instrumented<Self>
 
fn in_current_span(self) -> Instrumented<Self>
source§impl<T> IntoEither for T
 
impl<T> IntoEither for T
source§fn into_either(self, into_left: bool) -> Either<Self, Self>
 
fn into_either(self, into_left: bool) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left is true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read moresource§fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
 
fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
self into a Left variant of Either<Self, Self>
if into_left(&self) returns true.
Converts self into a Right variant of Either<Self, Self>
otherwise. Read more