Struct Aggregate 

#[non_exhaustive]
pub struct Aggregate {
    pub input: Arc<LogicalPlan>,
    pub group_expr: Vec<Expr>,
    pub aggr_expr: Vec<Expr>,
    pub schema: DFSchemaRef,
}
Aggregates its input based on a set of grouping and aggregate expressions (e.g. SUM).

§Output Schema

The output schema is the group expressions followed by the aggregate expressions in order.

For example, given the input schema "A", "B", "C" and the aggregate SUM(A) GROUP BY C+B, the output schema will be "C+B", "SUM(A)", where "C+B" and "SUM(A)" are the names of the output columns. Note that "C+B" is a single new column.
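
The ordering rule can be illustrated with a small standalone sketch. It models only the column names, not a real DFSchema, and the helper `output_column_names` is hypothetical (it is not part of DataFusion).

```rust
/// Hypothetical helper (not a DataFusion API): models only the *names* of the
/// output columns, i.e. group expressions first, then aggregate expressions.
fn output_column_names(group_exprs: &[&str], aggr_exprs: &[&str]) -> Vec<String> {
    group_exprs
        .iter()
        .chain(aggr_exprs.iter())
        .map(|name| name.to_string())
        .collect()
}

fn main() {
    // SUM(A) GROUP BY C+B over an input with columns "A", "B", "C".
    let names = output_column_names(&["C+B"], &["SUM(A)"]);
    // "C+B" is one output column, followed by the aggregate column.
    assert_eq!(names, vec!["C+B", "SUM(A)"]);
    println!("{names:?}");
}
```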

Fields (Non-exhaustive)§

This struct is marked as non-exhaustive
Non-exhaustive structs could have additional fields added in future. Therefore, non-exhaustive structs cannot be constructed in external crates using the traditional Struct { .. } syntax; cannot be matched against without a wildcard ..; and struct update syntax will not work.
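
As a rough illustration of what this means for pattern matching, the sketch below uses a hypothetical stand-in struct, `AggregateLike`, with plain `String` fields instead of the real DataFusion types. Inside the defining crate the attribute has no effect, so the struct literal in `main` compiles here; from an external crate the literal would be rejected and the `..` in the pattern would be mandatory.

```rust
/// Hypothetical stand-in for a non-exhaustive struct; the field types are
/// simplified to strings and do not match the real `Aggregate`.
#[non_exhaustive]
#[derive(Debug)]
#[allow(dead_code)]
struct AggregateLike {
    group_expr: Vec<String>,
    aggr_expr: Vec<String>,
    schema: String,
}

/// Destructures with a trailing `..`, which keeps compiling if fields are
/// added later. External crates must always write the `..`.
fn describe(agg: &AggregateLike) -> String {
    let AggregateLike { group_expr, aggr_expr, .. } = agg;
    format!(
        "{} group / {} aggregate expressions",
        group_expr.len(),
        aggr_expr.len()
    )
}

fn main() {
    // Allowed only because this code lives in the crate that defines the struct.
    let agg = AggregateLike {
        group_expr: vec!["C+B".to_string()],
        aggr_expr: vec!["SUM(A)".to_string()],
        schema: "C+B, SUM(A)".to_string(),
    };
    println!("{}", describe(&agg));
}
```

For the real Aggregate, external code would go through the constructors documented below rather than a struct literal.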
§input: Arc<LogicalPlan>

The incoming logical plan

§group_expr: Vec<Expr>

Grouping expressions

§aggr_expr: Vec<Expr>

Aggregate expressions.

Note that these must be either Expr::AggregateFunction or Expr::Alias.

§schema: DFSchemaRef

The schema description of the aggregate output

Implementations§

impl Aggregate

pub const INTERNAL_GROUPING_ID: &'static str = "__grouping_id"

Internal column used when the aggregation is a grouping set.

This column packs two values into a single unsigned integer:

  • Low bits (positions 0 .. n-1): a semantic bitmask where each bit represents one of the n grouping expressions. The least significant bit corresponds to the rightmost grouping expression. A 1 bit means the corresponding column is replaced with NULL for this grouping set; a 0 bit means it is included.
  • High bits (positions n and above): a duplicate ordinal that distinguishes multiple occurrences of the same semantic grouping set pattern within a single query. The ordinal is 0 for the first occurrence, 1 for the second, and so on.

The integer type is chosen by Self::grouping_id_type to be the smallest UInt8 / UInt16 / UInt32 / UInt64 that can represent both parts.

For example, for the grouping expressions CUBE(a, b) (no duplicates), the grouping ID column will have the following values:

  • 0b00: Both a and b are included
  • 0b01: b is excluded
  • 0b10: a is excluded
  • 0b11: Both a and b are excluded

When the same set appears twice and n = 2, the duplicate ordinal is packed into bit 2:

  • first occurrence: 0b0_01 (ordinal = 0, mask = 0b01)
  • second occurrence: 0b1_01 (ordinal = 1, mask = 0b01)

The GROUPING function always masks the value with (1 << n) - 1 before interpreting it, so the ordinal bits are invisible to user-facing SQL.
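
The bit layout described above can be sketched in a few lines of standalone Rust. The helpers `pack_grouping_id` and `semantic_mask` are hypothetical names used only for illustration; they are not DataFusion functions, and the sketch assumes n < 64 and uses a fixed u64 rather than the type chosen by Self::grouping_id_type.

```rust
/// Hypothetical helper: places the duplicate ordinal above the n-bit semantic mask.
/// Assumes n < 64 and that `mask` fits in the low n bits.
fn pack_grouping_id(ordinal: u64, mask: u64, n: u32) -> u64 {
    (ordinal << n) | mask
}

/// Hypothetical helper: recovers the semantic bitmask by applying (1 << n) - 1,
/// which hides the ordinal bits.
fn semantic_mask(grouping_id: u64, n: u32) -> u64 {
    grouping_id & ((1u64 << n) - 1)
}

fn main() {
    let n = 2; // two grouping expressions, e.g. CUBE(a, b)

    // The same grouping set pattern (mask = 0b01) appearing twice.
    let first = pack_grouping_id(0, 0b01, n);
    let second = pack_grouping_id(1, 0b01, n);
    assert_eq!(first, 0b0_01);
    assert_eq!(second, 0b1_01);

    // After masking, both occurrences look identical.
    assert_eq!(semantic_mask(first, n), 0b01);
    assert_eq!(semantic_mask(second, n), 0b01);
    println!("first = {first:#05b}, second = {second:#05b}");
}
```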

pub fn try_new( input: Arc<LogicalPlan>, group_expr: Vec<Expr>, aggr_expr: Vec<Expr>, ) -> Result<Self>

Create a new aggregate operator.

pub fn try_new_with_schema( input: Arc<LogicalPlan>, group_expr: Vec<Expr>, aggr_expr: Vec<Expr>, schema: DFSchemaRef, ) -> Result<Self>

Create a new aggregate operator using the provided schema to avoid the overhead of building the schema again when the schema is already known.

This method should only be called when you are absolutely sure that the schema being provided is correct for the aggregate. If in doubt, call try_new instead.

pub fn group_expr_len(&self) -> Result<usize>

Get the length of the group by expression in the output schema. This is not simply the group by expression length: an expression may be a GroupingSet, etc. In these cases the lengths of the inner expressions must be counted.

pub fn grouping_id_type(group_exprs: usize, max_ordinal: usize) -> DataType

Returns the data type of the grouping id.

The grouping ID packs two pieces of information into a single integer:

  • The low group_exprs bits are the semantic bitmask (a set bit means the corresponding grouping expression is NULL for this grouping set).
  • The bits above position group_exprs encode a duplicate ordinal that distinguishes multiple occurrences of the same grouping set pattern.

max_ordinal is the highest ordinal value that will appear (0 when there are no duplicate grouping sets). The type is chosen to be the smallest unsigned integer that can represent both parts.
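
One way to picture the selection is the standalone sketch below. It is not the DataFusion implementation: the enum `UIntWidth` and the functions `bits_for` and `smallest_width` are hypothetical, the real method returns a DataType, and returning None when more than 64 bits would be needed is an assumption of this sketch only.

```rust
/// Hypothetical stand-in for the unsigned integer DataType variants.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
enum UIntWidth {
    UInt8,
    UInt16,
    UInt32,
    UInt64,
}

/// Number of bits needed to represent `value` (0 needs no bits).
fn bits_for(value: usize) -> usize {
    (usize::BITS - value.leading_zeros()) as usize
}

/// Hypothetical sketch: smallest width that holds `group_exprs` mask bits
/// plus enough bits for ordinals up to `max_ordinal`.
/// Returning None above 64 bits is an assumption, not documented behavior.
fn smallest_width(group_exprs: usize, max_ordinal: usize) -> Option<UIntWidth> {
    let total_bits = group_exprs + bits_for(max_ordinal);
    match total_bits {
        0..=8 => Some(UIntWidth::UInt8),
        9..=16 => Some(UIntWidth::UInt16),
        17..=32 => Some(UIntWidth::UInt32),
        33..=64 => Some(UIntWidth::UInt64),
        _ => None,
    }
}

fn main() {
    // CUBE(a, b) without duplicates: 2 mask bits, no ordinal bits.
    assert_eq!(smallest_width(2, 0), Some(UIntWidth::UInt8));
    // 8 mask bits plus 1 ordinal bit no longer fit in 8 bits.
    assert_eq!(smallest_width(8, 1), Some(UIntWidth::UInt16));
    println!("{:?}", smallest_width(16, 3));
}
```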

Trait Implementations§

impl Clone for Aggregate

fn clone(&self) -> Aggregate

Returns a duplicate of the value.

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source.

impl Debug for Aggregate

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter.

impl Eq for Aggregate

impl Hash for Aggregate

fn hash<__H: Hasher>(&self, state: &mut __H)

Feeds this value into the given Hasher.

fn hash_slice<H>(data: &[Self], state: &mut H)
where H: Hasher, Self: Sized,

Feeds a slice of this type into the given Hasher.

impl PartialEq for Aggregate

fn eq(&self, other: &Aggregate) -> bool

Tests for self and other values to be equal, and is used by ==.

fn ne(&self, other: &Rhs) -> bool

Tests for !=. The default implementation is almost always sufficient, and should not be overridden without very good reason.

impl PartialOrd for Aggregate

fn partial_cmp(&self, other: &Self) -> Option<Ordering>

This method returns an ordering between self and other values if one exists.

fn lt(&self, other: &Rhs) -> bool

Tests less than (for self and other) and is used by the < operator.

fn le(&self, other: &Rhs) -> bool

Tests less than or equal to (for self and other) and is used by the <= operator.

fn gt(&self, other: &Rhs) -> bool

Tests greater than (for self and other) and is used by the > operator.

fn ge(&self, other: &Rhs) -> bool

Tests greater than or equal to (for self and other) and is used by the >= operator.

impl StructuralPartialEq for Aggregate

Auto Trait Implementations§

Blanket Implementations§

impl<T> Any for T
where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

Gets the TypeId of self.

impl<T> Borrow<T> for T
where T: ?Sized,

fn borrow(&self) -> &T

Immutably borrows from an owned value.

impl<T> BorrowMut<T> for T
where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value.

impl<T> CloneToUninit for T
where T: Clone,

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest.

impl<T> DynEq for T
where T: Eq + Any,

fn dyn_eq(&self, other: &(dyn Any + 'static)) -> bool

impl<T> DynHash for T
where T: Hash + Any,

fn dyn_hash(&self, state: &mut dyn Hasher)

impl<Q, K> Equivalent<K> for Q
where Q: Eq + ?Sized, K: Borrow<Q> + ?Sized,

fn equivalent(&self, key: &K) -> bool

Compare self to key and return true if they are equal.

impl<T> From<T> for T

fn from(t: T) -> T

Returns the argument unchanged.

impl<T, U> Into<U> for T
where U: From<T>,

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

impl<T> IntoEither for T

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise.

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise.

impl<T> ToOwned for T
where T: Clone,

type Owned = T

The resulting type after obtaining ownership.

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning.

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning.

impl<T, U> TryFrom<U> for T
where U: Into<T>,

type Error = Infallible

The type returned in the event of a conversion error.

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.