Skip to main content

SamkhyaTableProvider

Struct SamkhyaTableProvider 

Source
pub struct SamkhyaTableProvider { /* private fields */ }
Expand description

A TableProvider wrapper that overrides statistics() with samkhya-corrected column statistics while delegating every other method to the inner provider.

§Builder

use std::sync::Arc;
use samkhya_datafusion::SamkhyaTableProvider;
use samkhya_core::stats::ColumnStats;

let wrapped = SamkhyaTableProvider::new(Arc::new(inner))
    .with_column_stats(0, ColumnStats::new().with_row_count(999).with_distinct_count(42));

§Stats fold semantics

statistics() builds a Statistics whose per-column entries come from the samkhya override map where present, falling back to the inner provider’s stats (or ColumnStatistics::new_unknown() if the inner provider returns None). Table-level num_rows is taken from the override map’s most authoritative row_count: the maximum across all override entries, since samkhya’s per-column stats describe the same underlying relation. If no override carries a row count, the inner provider’s num_rows is preserved.

Implementations§

Source§

impl SamkhyaTableProvider

Source

pub fn new(inner: Arc<dyn TableProvider>) -> Self

Wrap an existing provider. No overrides are installed until Self::with_column_stats is called.

Source

pub fn with_column_stats(self, col_idx: usize, stats: ColumnStats) -> Self

Install a samkhya override for the column at col_idx.

Indices refer to positions in the inner provider’s SchemaRef. Existing overrides for the same index are replaced.

Source

pub fn stats_call_count(&self) -> usize

Number of times statistics() has been called on this wrapper.

Useful for assertions in integration tests that verify the planner actually consulted the corrected stats.

Source

pub fn overrides(&self) -> &HashMap<usize, ColumnStats>

Borrow the override map. Read-only access for diagnostics.

Trait Implementations§

Source§

impl Debug for SamkhyaTableProvider

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl TableProvider for SamkhyaTableProvider

Source§

fn statistics(&self) -> Option<Statistics>

Fold samkhya overrides into the inner provider’s Statistics.

Schema order is preserved: column i in the returned column_statistics corresponds to field i of self.schema().

Source§

fn as_any(&self) -> &dyn Any

Returns the table provider as Any so that it can be downcast to a specific implementation.
Source§

fn schema(&self) -> SchemaRef

Get a reference to the schema for this table
Source§

fn constraints(&self) -> Option<&Constraints>

Get a reference to the constraints of the table. Returns: Read more
Source§

fn table_type(&self) -> TableType

Get the type of this table for metadata/catalog purposes.
Source§

fn get_table_definition(&self) -> Option<&str>

Get the create statement used to create this table, if available.
Source§

fn get_logical_plan(&self) -> Option<Cow<'_, LogicalPlan>>

Get the LogicalPlan of this table, if available.
Source§

fn get_column_default(&self, column: &str) -> Option<&Expr>

Get the default value for a column, if available.
Source§

fn scan<'life0, 'life1, 'life2, 'life3, 'async_trait>( &'life0 self, state: &'life1 dyn Session, projection: Option<&'life2 Vec<usize>>, filters: &'life3 [Expr], limit: Option<usize>, ) -> Pin<Box<dyn Future<Output = Result<Arc<dyn ExecutionPlan>>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait, 'life1: 'async_trait, 'life2: 'async_trait, 'life3: 'async_trait,

Create an ExecutionPlan for scanning the table with optionally specified projection, filter and limit, described below. Read more
Source§

fn supports_filters_pushdown( &self, filters: &[&Expr], ) -> Result<Vec<TableProviderFilterPushDown>>

Specify if DataFusion should provide filter expressions to the TableProvider to apply during the scan. Read more
Source§

fn insert_into<'life0, 'life1, 'async_trait>( &'life0 self, state: &'life1 dyn Session, input: Arc<dyn ExecutionPlan>, insert_op: InsertOp, ) -> Pin<Box<dyn Future<Output = Result<Arc<dyn ExecutionPlan>>> + Send + 'async_trait>>
where Self: 'async_trait, 'life0: 'async_trait, 'life1: 'async_trait,

Return an ExecutionPlan to insert data into this table, if supported. Read more

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> Same for T

Source§

type Output = T

Should always be Self
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V