Skip to main content

NetworkBroadcastExec

Struct NetworkBroadcastExec 

Source
pub struct NetworkBroadcastExec { /* private fields */ }
Expand description

Network boundary for broadcasting data to all consumer tasks.

This operator works with BroadcastExec which scales up partitions so each consumer task fetches a unique set of partition numbers. Each partition request is sent to all stage tasks because each task’s leaf node is specialized to serve a different slice of the data for the same logical partition number.

Here are some examples of how NetworkBroadcastExec distributes data:

§1 to many

┌────────────────────────┐                        ┌────────────────────────┐           ■
│  NetworkBroadcastExec  │                        │  NetworkBroadcastExec  │           │
│        (task 1)        │           ...          │        (task M)        │           │
│                        │                        │                        │        Stage N
│    Populates Caches    │                        │    Populates Caches    │           │
└────────┬─┬┬─┬┬─┬───────┘                        └────────┬─┬┬─┬┬─┬───────┘           │
         │0││1││2│                                         │0││1││2│                   │
         └▲┘└▲┘└▲┘                                         └▲┘└▲┘└▲┘                   ■
          │  │  │                                           │  │  │
          │  │  │                                           │  │  │
          │  │  │                                           │  │  │
          │  │  └─────────────┐          ┌──────────────────┘  │  │
          │  └─────────────┐  │          │     ┌───────────────┘  │
          └─────────────┐  │  │          │     │    ┌─────────────┘
                        │  │  │          │     │    │
                       ┌┴┐┌┴┐┌┴┐ ... ┌───┴┐┌───┴┐┌──┴─┐
                       │1││2││3│     │NM-3││NM-2││NM-1│                                ■
                      ┌┴─┴┴─┴┴─┴─────┴────┴┴────┴┴────┴─┐                              │
                      │          BroadcastExec          │                              │
                      │        ┌───────────────┐        │                          Stage N-1
                      │        │  Batch Cache  │        │                              │
                      │        │  ┌─┐ ┌─┐ ┌─┐  │        │                              │
                      │        │  │0│ │1│ │2│  │        │                              │
                      │        │  └─┘ └─┘ └─┘  │        │                              │
                      │        └───────────────┘        │                              │
                      └───────────┬─┬─┬─┬─┬─┬───────────┘                              │
                                  │0│ │1│ │2│                                          │
                                  └▲┘ └▲┘ └▲┘                                          ■
                                   │   │   │
                                   │   │   │
                                   │   │   │
                                  ┌┴┐ ┌┴┐ ┌┴┐                                          ■
                                  │0│ │1│ │2│                                          │
                           ┌──────┴─┴─┴─┴─┴─┴──────┐                               Stage N-2
                           │Arc<dyn ExecutionPlan> │                                   │
                           │       (task 1)        │                                   │
                           └───────────────────────┘                                   ■

§Many to many

   ┌────────────────────────┐                        ┌────────────────────────┐          ■
   │  NetworkBroadcastExec  │                        │  NetworkBroadcastExec  │          │
   │        (task 1)        │                        │        (task M)        │          │
   │                        │           ...          │                        │       Stage N
   │    Populates Caches    │                        │       Cache Hits       │          │
   └────────┬─┬┬─┬┬─┬───────┘                        └────────┬─┬┬─┬┬─┬───────┘          │
            │0││1││2│                                         │0││1││2│                  │
            └▲┘└▲┘└▲┘                                         └▲┘└▲┘└▲┘                  ■
             │  │  │                                           │  │  │
  ┌──────────┴──┼──┼────────────────────────────────┐          │  │  │
  │  ┌──────────┴──┼────────────────────────────────┼──┐       │  │  │
  │  │  ┌──────────┴────────────────────────────────┼──┼──┐    │  │  │
  │  │  │                                           │  │  │    │  │  │
  │  │  │         ┌─────────────────────────────────┼──┼──┼────┴──┼─┐│
  │  │  │         │     ┌───────────────────────────┼──┼──┼───────┴─┼┼─────┐
  │  │  │         │     │     ┌─────────────────────┼──┼──┼─────────┼┴─────┼────┐
  │  │  │         │     │     │                     │  │  │         │      │    │
 ┌┴┐┌┴┐┌┴┐ ... ┌──┴─┐┌──┴─┐┌──┴─┐                  ┌┴┐┌┴┐┌┴┐ ... ┌──┴─┐┌───┴┐┌──┴─┐      ■
 │0││1││2│     │3M-3││3M-2││3M-1│                  │0││1││2│     │3M-3││3M-2││3M-1│      │
┌┴─┴┴─┴┴─┴─────┴────┴┴────┴┴────┴┐                ┌┴─┴┴─┴┴─┴─────┴────┴┴────┴┴────┴┐     │
│         BroadcastExec          │                │         BroadcastExec          │     │
│        ┌───────────────┐       │                │        ┌───────────────┐       │     │
│        │  Batch Cache  │       │                │        │  Batch Cache  │       │     │
│        │  ┌─┐ ┌─┐ ┌─┐  │       │      ...       │        │  ┌─┐ ┌─┐ ┌─┐  │       │ Stage N-1
│        │  │0│ │1│ │2│  │       │                │        │  │0│ │1│ │2│  │       │     │
│        │  └─┘ └─┘ └─┘  │       │                │        │  └─┘ └─┘ └─┘  │       │     │
│        └───────────────┘       │                │        └───────────────┘       │     │
└───────────┬─┬─┬─┬─┬─┬──────────┘                └───────────┬─┬─┬─┬─┬─┬──────────┘     │
            │0│ │1│ │2│                                       │0│ │1│ │2│                │
            └▲┘ └▲┘ └▲┘                                       └▲┘ └▲┘ └▲┘                ■
             │   │   │                                         │   │   │
             │   │   │                                         │   │   │
             │   │   │                                         │   │   │
            ┌┴┐ ┌┴┐ ┌┴┐                                       ┌┴┐ ┌┴┐ ┌┴┐                ■
            │0│ │1│ │2│                                       │0│ │1│ │2│                │
     ┌──────┴─┴─┴─┴─┴─┴──────┐                         ┌──────┴─┴─┴─┴─┴─┴──────┐     Stage N-2
     │Arc<dyn ExecutionPlan> │          ...            │Arc<dyn ExecutionPlan> │         │
     │       (task 1)        │                         │       (task N)        │         │
     └───────────────────────┘                         └───────────────────────┘         ■

Notice in this diagram that each NetworkBroadcastExec sends a request to fetch data from each BroadcastExec in the stage below per partition. This is because each BroadcastExec has its own cache which contains partial results for the partition. It is the NetworkBroadcastExec’s job to merge these partial partitions to then broadcast complete data to the consumers.

Implementations§

Source§

impl NetworkBroadcastExec

Source

pub fn try_new( input: Arc<dyn ExecutionPlan>, producer_tasks: usize, ) -> Result<Self>

Creates a new NetworkBroadcastExec fed by the provided BroadcastExec. The input plan will be executed in a remote worker in producer_tasks number of tasks.

Trait Implementations§

Source§

impl Clone for NetworkBroadcastExec

Source§

fn clone(&self) -> NetworkBroadcastExec

Returns a duplicate of the value. Read more
1.0.0 (const: unstable) · Source§

fn clone_from(&mut self, source: &Self)

Performs copy-assignment from source. Read more
Source§

impl Debug for NetworkBroadcastExec

Source§

fn fmt(&self, f: &mut Formatter<'_>) -> Result

Formats the value using the given formatter. Read more
Source§

impl DisplayAs for NetworkBroadcastExec

Source§

fn fmt_as(&self, _t: DisplayFormatType, f: &mut Formatter<'_>) -> Result

Format according to DisplayFormatType, used when verbose representation looks different from the default one Read more
Source§

impl ExecutionPlan for NetworkBroadcastExec

Source§

fn name(&self) -> &str

Short name for the ExecutionPlan, such as ‘DataSourceExec’. Read more
Source§

fn properties(&self) -> &Arc<PlanProperties>

Return properties of the output of the ExecutionPlan, such as output ordering(s), partitioning information etc. Read more
Source§

fn children(&self) -> Vec<&Arc<dyn ExecutionPlan>>

Get a list of children ExecutionPlans that act as inputs to this plan. The returned list will be empty for leaf nodes such as scans, will contain a single value for unary nodes, or two values for binary nodes (such as joins).
Source§

fn with_new_children( self: Arc<Self>, children: Vec<Arc<dyn ExecutionPlan>>, ) -> Result<Arc<dyn ExecutionPlan>, DataFusionError>

Returns a new ExecutionPlan where all existing children were replaced by the children, in order
Source§

fn execute( &self, partition: usize, context: Arc<TaskContext>, ) -> Result<SendableRecordBatchStream, DataFusionError>

Begin execution of partition, returning a Stream of RecordBatches. Read more
Source§

fn metrics(&self) -> Option<MetricsSet>

Return a snapshot of the set of Metrics for this ExecutionPlan. If no Metrics are available, return None. Read more
Source§

fn static_name() -> &'static str
where Self: Sized,

Short name for the ExecutionPlan, such as ‘DataSourceExec’. Like name but can be called without an instance.
Source§

fn downcast_delegate(&self) -> Option<&(dyn ExecutionPlan + 'static)>

Returns the plan that provides this plan’s public ExecutionPlan downcast identity. Read more
Source§

fn schema(&self) -> Arc<Schema>

Get the schema for this execution plan
Source§

fn check_invariants(&self, check: InvariantLevel) -> Result<(), DataFusionError>

Returns an error if this individual node does not conform to its invariants. These invariants are typically only checked in debug mode. Read more
Source§

fn required_input_distribution(&self) -> Vec<Distribution>

Specifies the data distribution requirements for all the children for this ExecutionPlan, By default it’s [Distribution::UnspecifiedDistribution] for each child,
Source§

fn required_input_ordering(&self) -> Vec<Option<OrderingRequirements>>

Specifies the ordering required for all of the children of this ExecutionPlan. Read more
Source§

fn maintains_input_order(&self) -> Vec<bool>

Returns false if this ExecutionPlan’s implementation may reorder rows within or between partitions. Read more
Source§

fn benefits_from_input_partitioning(&self) -> Vec<bool>

Specifies whether the ExecutionPlan benefits from increased parallelization at its input for each child. Read more
Source§

fn reset_state( self: Arc<Self>, ) -> Result<Arc<dyn ExecutionPlan>, DataFusionError>

Reset any internal state within this ExecutionPlan. Read more
Source§

fn repartitioned( &self, _target_partitions: usize, _config: &ConfigOptions, ) -> Result<Option<Arc<dyn ExecutionPlan>>, DataFusionError>

If supported, attempt to increase the partitioning of this ExecutionPlan to produce target_partitions partitions. Read more
Source§

fn partition_statistics( &self, partition: Option<usize>, ) -> Result<Arc<Statistics>, DataFusionError>

Returns statistics for a specific partition of this ExecutionPlan node. If statistics are not available, should return Statistics::new_unknown (the default), not an error. If partition is None, it returns statistics for the entire plan.
Source§

fn supports_limit_pushdown(&self) -> bool

Returns true if a limit can be safely pushed down through this ExecutionPlan node. Read more
Source§

fn with_fetch(&self, _limit: Option<usize>) -> Option<Arc<dyn ExecutionPlan>>

Returns a fetching variant of this ExecutionPlan node, if it supports fetch limits. Returns None otherwise. Read more
Source§

fn fetch(&self) -> Option<usize>

Gets the fetch count for the operator, None means there is no fetch.
Source§

fn cardinality_effect(&self) -> CardinalityEffect

Gets the effect on cardinality, if known
Source§

fn try_swapping_with_projection( &self, _projection: &ProjectionExec, ) -> Result<Option<Arc<dyn ExecutionPlan>>, DataFusionError>

Attempts to push down the given projection into the input of this ExecutionPlan. Read more
Source§

fn gather_filters_for_pushdown( &self, _phase: FilterPushdownPhase, parent_filters: Vec<Arc<dyn PhysicalExpr>>, _config: &ConfigOptions, ) -> Result<FilterDescription, DataFusionError>

Collect filters that this node can push down to its children. Filters that are being pushed down from parents are passed in, and the node may generate additional filters to push down. For example, given the plan FilterExec -> HashJoinExec -> DataSourceExec, what will happen is that we recurse down the plan calling ExecutionPlan::gather_filters_for_pushdown: Read more
Source§

fn handle_child_pushdown_result( &self, _phase: FilterPushdownPhase, child_pushdown_result: ChildPushdownResult, _config: &ConfigOptions, ) -> Result<FilterPushdownPropagation<Arc<dyn ExecutionPlan>>, DataFusionError>

Handle the result of a child pushdown. Read more
Source§

fn with_new_state( &self, _state: Arc<dyn Any + Send + Sync>, ) -> Option<Arc<dyn ExecutionPlan>>

Injects arbitrary run-time state into this execution plan, returning a new plan instance that incorporates that state if it is relevant to the concrete node implementation. Read more
Source§

fn try_pushdown_sort( &self, _order: &[PhysicalSortExpr], ) -> Result<SortOrderPushdownResult<Arc<dyn ExecutionPlan>>, DataFusionError>

Try to push down sort ordering requirements to this node. Read more
Source§

fn with_preserve_order( &self, _preserve_order: bool, ) -> Option<Arc<dyn ExecutionPlan>>

Returns a variant of this ExecutionPlan that is aware of order-sensitivity. Read more
Source§

impl NetworkBoundary for NetworkBroadcastExec

Source§

fn with_input_stage(&self, input_stage: Stage) -> Result<Arc<dyn ExecutionPlan>>

Called when a Stage is correctly formed. The NetworkBoundary can use this information to perform any internal transformations necessary for distributed execution. Read more
Source§

fn input_stage(&self) -> &Stage

Returns the assigned input Stage, if any.
Source§

fn producer_head(&self, consumer_task_count: usize) -> ProducerHead

Defines what head node should the producer stage feeding this NetworkBoundary implementation have. This information is used during planning an executing for ensuring the head of a stage has the appropriate shape for consumption.

Auto Trait Implementations§

Blanket Implementations§

Source§

impl<T> Any for T
where T: 'static + ?Sized,

Source§

fn type_id(&self) -> TypeId

Gets the TypeId of self. Read more
Source§

impl<T> Borrow<T> for T
where T: ?Sized,

Source§

fn borrow(&self) -> &T

Immutably borrows from an owned value. Read more
Source§

impl<T> BorrowMut<T> for T
where T: ?Sized,

Source§

fn borrow_mut(&mut self) -> &mut T

Mutably borrows from an owned value. Read more
Source§

impl<T> CloneToUninit for T
where T: Clone,

Source§

unsafe fn clone_to_uninit(&self, dest: *mut u8)

🔬This is a nightly-only experimental API. (clone_to_uninit)
Performs copy-assignment from self to dest. Read more
Source§

impl<T> From<T> for T

Source§

fn from(t: T) -> T

Returns the argument unchanged.

Source§

impl<T> FromRef<T> for T
where T: Clone,

Source§

fn from_ref(input: &T) -> T

Converts to this type from a reference to the input type.
Source§

impl<T> Instrument for T

Source§

fn instrument(self, span: Span) -> Instrumented<Self>

Instruments this type with the provided Span, returning an Instrumented wrapper. Read more
Source§

fn in_current_span(self) -> Instrumented<Self>

Instruments this type with the current Span, returning an Instrumented wrapper. Read more
Source§

impl<T, U> Into<U> for T
where U: From<T>,

Source§

fn into(self) -> U

Calls U::from(self).

That is, this conversion is whatever the implementation of From<T> for U chooses to do.

Source§

impl<T> IntoEither for T

Source§

fn into_either(self, into_left: bool) -> Either<Self, Self>

Converts self into a Left variant of Either<Self, Self> if into_left is true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

fn into_either_with<F>(self, into_left: F) -> Either<Self, Self>
where F: FnOnce(&Self) -> bool,

Converts self into a Left variant of Either<Self, Self> if into_left(&self) returns true. Converts self into a Right variant of Either<Self, Self> otherwise. Read more
Source§

impl<T> IntoRequest<T> for T

Source§

fn into_request(self) -> Request<T>

Wrap the input message T in a tonic::Request
Source§

impl<L> LayerExt<L> for L

Source§

fn named_layer<S>(&self, service: S) -> Layered<<L as Layer<S>>::Service, S>
where L: Layer<S>,

Applies the layer to a service and wraps it in Layered.
Source§

impl<T> Pointable for T

Source§

const ALIGN: usize

The alignment of pointer.
Source§

type Init = T

The type for initializers.
Source§

unsafe fn init(init: <T as Pointable>::Init) -> usize

Initializes a with the given initializer. Read more
Source§

unsafe fn deref<'a>(ptr: usize) -> &'a T

Dereferences the given pointer. Read more
Source§

unsafe fn deref_mut<'a>(ptr: usize) -> &'a mut T

Mutably dereferences the given pointer. Read more
Source§

unsafe fn drop(ptr: usize)

Drops the object pointed to by the given pointer. Read more
Source§

impl<T> ToOwned for T
where T: Clone,

Source§

type Owned = T

The resulting type after obtaining ownership.
Source§

fn to_owned(&self) -> T

Creates owned data from borrowed data, usually by cloning. Read more
Source§

fn clone_into(&self, target: &mut T)

Uses borrowed data to replace owned data, usually by cloning. Read more
Source§

impl<T, U> TryFrom<U> for T
where U: Into<T>,

Source§

type Error = Infallible

The type returned in the event of a conversion error.
Source§

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

Performs the conversion.
Source§

impl<T, U> TryInto<U> for T
where U: TryFrom<T>,

Source§

type Error = <U as TryFrom<T>>::Error

The type returned in the event of a conversion error.
Source§

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

Performs the conversion.
Source§

impl<V, T> VZip<V> for T
where V: MultiLane<T>,

Source§

fn vzip(self) -> V

Source§

impl<T> WithSubscriber for T

Source§

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self>
where S: Into<Dispatch>,

Attaches the provided Subscriber to this type, returning a WithDispatch wrapper. Read more
Source§

fn with_current_subscriber(self) -> WithDispatch<Self>

Attaches the current default Subscriber to this type, returning a WithDispatch wrapper. Read more