Struct datafusion::physical_plan::joins::SymmetricHashJoinExec

pub struct SymmetricHashJoinExec { /* private fields */ }

A symmetric hash join with range conditions hashes both input streams on the join key and uses the resulting hash tables to join the streams. The join is considered symmetric because a hash table is built on the join keys of both streams, and rows are matched based on the values of the join keys in both streams. This type of join is efficient in a streaming context because it allows fast lookups in the hash tables, rather than having to scan through one or both of the streams to find matching rows. With range conditions, it also only considers the elements of each stream that fall within a certain sliding window, which makes it less likely to store stale data. This makes it possible to operate on unbounded streaming data without the buffers growing indefinitely.

For each input stream, create a hash table.

For each new RecordBatch on the build side, hash its rows and insert them into that input's hash table. Update offsets.
Test whether the new rows are equal, on the join keys, to rows already buffered from the other input.
If so, record the visited rows. If matched rows must be produced (INNER, LEFT), output the RecordBatch.
Try to prune the other (probe) side using the new RecordBatch.
If the join type requires unmatched rows to be produced (LEFT, FULL, etc.), output the RecordBatch when pruning happens or at the end of the data.
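
The insert-then-probe steps above can be sketched in plain Rust. This is a minimal illustration and not DataFusion's implementation: the `Row` layout, `SideBuffer`, `on_left_batch`, and `on_right_batch` are hypothetical names, rows are plain tuples instead of RecordBatches, only INNER join output is produced, and pruning is left out.

```rust
use std::collections::HashMap;

/// Hypothetical row layout for this sketch: (join key, payload value).
type Row = (i64, i64);

/// Buffered rows of one input plus a hash table from join key to row positions.
#[derive(Default)]
struct SideBuffer {
    rows: Vec<Row>,
    table: HashMap<i64, Vec<usize>>,
}

impl SideBuffer {
    /// Hash every row of the new batch and insert it into this side's table.
    fn insert_batch(&mut self, batch: &[Row]) {
        for &row in batch {
            self.table.entry(row.0).or_default().push(self.rows.len());
            self.rows.push(row);
        }
    }

    /// Look up all buffered rows that share the given join key.
    fn matches(&self, key: i64) -> impl Iterator<Item = Row> + '_ {
        self.table
            .get(&key)
            .into_iter()
            .flatten()
            .map(move |&i| self.rows[i])
    }
}

/// A batch arrives on the left: buffer it, then probe the right side's table.
/// Output pairs are always ordered (left row, right row).
fn on_left_batch(left: &mut SideBuffer, right: &SideBuffer, batch: &[Row]) -> Vec<(Row, Row)> {
    left.insert_batch(batch);
    batch
        .iter()
        .flat_map(|&l| right.matches(l.0).map(move |r| (l, r)))
        .collect()
}

/// The mirror image: a batch arrives on the right and probes the left side's table.
fn on_right_batch(left: &SideBuffer, right: &mut SideBuffer, batch: &[Row]) -> Vec<(Row, Row)> {
    right.insert_batch(batch);
    batch
        .iter()
        .flat_map(|&r| left.matches(r.0).map(move |l| (l, r)))
        .collect()
}
```

Because each side both buffers its own rows and probes the other side's table, a matching pair is emitted exactly once, by whichever row arrives second.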

                       +-------------------------+
                       |                         |
  left stream ---------|  Left OneSideHashJoiner |---+
                       |                         |   |
                       +-------------------------+   |
                                                     |
                                                     |--------- Joined output
                                                     |
                       +-------------------------+   |
                       |                         |   |
 right stream ---------| Right OneSideHashJoiner |---+
                       |                         |
                       +-------------------------+

Prune the build side when a new RecordBatch arrives on the probe side. We use interval arithmetic
on the JoinFilter's sorted PhysicalExprs to calculate the joinable range.


              PROBE SIDE          BUILD SIDE
                BUFFER              BUFFER
            +-------------+     +------------+
            |             |     |            |    Unjoinable
            |             |     |            |    Range
            |             |     |            |
            |             |  |---------------------------------
            |             |  |  |            |
            |             |  |  |            |
            |             | /   |            |
            |             | |   |            |
            |             | |   |            |
            |             | |   |            |
            |             | |   |            |
            |             | |   |            |    Joinable
            |             |/    |            |    Range
            |             ||    |            |
            |+-----------+||    |            |
            || Record    ||     |            |
            || Batch     ||     |            |
            |+-----------+||    |            |
            +-------------+\    +------------+
                            |
                            \
                             |---------------------------------

 Pruning happens when range conditions are provided on sorted columns. E.g.

       SELECT * FROM left_table JOIN right_table
       ON
         left_key = right_key AND
         left_time > right_time - INTERVAL '12' MINUTE AND left_time < right_time + INTERVAL '2' HOUR

or

       SELECT * FROM left_table JOIN right_table
       ON
         left_key = right_key AND
         left_sorted > right_sorted - 3 AND left_sorted < right_sorted + 10

More generally, as in the second scenario, when new data arrives on the probe side, the conditions can be used to
determine a specific threshold for discarding rows from the inner buffer. For example, suppose the
two columns ("left_sorted" and "right_sorted") are both sorted in ascending order (this can differ in other scenarios),
the join condition is "left_sorted > right_sorted - 3", and the latest value on the right input is 1234. Later right
values can only be larger, so the left side buffer must only keep rows where
"left_sorted > right_sorted - 3 >= 1234 - 3 = 1231". This makes 1231 the lower bound for 'left_sorted', and any rows
below it (since the order is ascending) can be dropped from the inner buffer.
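
The threshold arithmetic in this example can be sketched as follows. This is an illustration under simplifying assumptions, not the interval arithmetic DataFusion applies to the JoinFilter: values are plain `i64`s, the buffer holds only the sorted column, and `left_lower_bound` and `prune_ascending` are hypothetical helper names.

```rust
use std::collections::VecDeque;

/// Lower bound of the joinable range for the condition
/// `left_sorted > right_sorted - offset`, given the latest right value
/// and ascending sort order on both sides.
fn left_lower_bound(latest_right: i64, offset: i64) -> i64 {
    latest_right - offset
}

/// Drop rows below `bound` from the front of an ascending buffer.
/// Returns how many rows were pruned.
///
/// Rows equal to the bound are kept, mirroring the text above. For a strict
/// inequality they could be dropped as well, so this is conservative.
fn prune_ascending(buffer: &mut VecDeque<i64>, bound: i64) -> usize {
    // The buffer is sorted, so all prunable rows form a prefix.
    let keep_from = buffer.partition_point(|&v| v < bound);
    buffer.drain(..keep_from).count()
}
```

With a latest right value of 1234 and an offset of 3, `left_lower_bound` yields 1231, and pruning a left buffer holding 1229, 1230, 1231, 1232, 1240 removes the first two values.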

Implementations

impl SymmetricHashJoinExec

pub fn try_new( left: Arc<dyn ExecutionPlan>, right: Arc<dyn ExecutionPlan>, on: JoinOn, filter: Option<JoinFilter>, join_type: &JoinType, null_equals_null: bool ) -> Result<Self>

Error

pub fn left(&self) -> &Arc<dyn ExecutionPlan>

pub fn right(&self) -> &Arc<dyn ExecutionPlan>

pub fn on(&self) -> &[(Column, Column)]

pub fn filter(&self) -> Option<&JoinFilter>

pub fn join_type(&self) -> &JoinType

pub fn null_equals_null(&self) -> bool

pub fn check_if_order_information_available(&self) -> Result<bool>

Trait Implementations

impl Debug for SymmetricHashJoinExec

fn fmt(&self, f: &mut Formatter<'_>) -> Result

impl ExecutionPlan for SymmetricHashJoinExec

fn as_any(&self) -> &dyn Any

fn schema(&self) -> SchemaRef

fn unbounded_output(&self, children: &[bool]) -> Result<bool>

fn benefits_from_input_partitioning(&self) -> bool

fn required_input_distribution(&self) -> Vec<Distribution>

fn output_partitioning(&self) -> Partitioning

fn output_ordering(&self) -> Option<&[PhysicalSortExpr]>

fn equivalence_properties(&self) -> EquivalenceProperties

fn children(&self) -> Vec<Arc<dyn ExecutionPlan>>

fn with_new_children( self: Arc<Self>, children: Vec<Arc<dyn ExecutionPlan>> ) -> Result<Arc<dyn ExecutionPlan>>

fn fmt_as(&self, t: DisplayFormatType, f: &mut Formatter<'_>) -> Result

fn metrics(&self) -> Option<MetricsSet>

fn statistics(&self) -> Statistics

fn execute( &self, partition: usize, context: Arc<TaskContext> ) -> Result<SendableRecordBatchStream>

fn required_input_ordering(&self) -> Vec<Option<Vec<PhysicalSortRequirement>>>

fn maintains_input_order(&self) -> Vec<bool>

fn ordering_equivalence_properties(&self) -> OrderingEquivalenceProperties

Auto Trait Implementations

impl !RefUnwindSafe for SymmetricHashJoinExec

impl Send for SymmetricHashJoinExec

impl Sync for SymmetricHashJoinExec

impl Unpin for SymmetricHashJoinExec

impl !UnwindSafe for SymmetricHashJoinExec

Blanket Implementations

impl<T> Any for T where T: 'static + ?Sized,

fn type_id(&self) -> TypeId

impl<T> Borrow<T> for T where T: ?Sized,

fn borrow(&self) -> &T

impl<T> BorrowMut<T> for T where T: ?Sized,

fn borrow_mut(&mut self) -> &mut T

impl<T> From<T> for T

fn from(t: T) -> T

impl<T> Instrument for T

fn instrument(self, span: Span) -> Instrumented<Self>

fn in_current_span(self) -> Instrumented<Self>

impl<T, U> Into<U> for T where U: From<T>,

fn into(self) -> U

impl<T> Same<T> for T

type Output = T

impl<T, U> TryFrom<U> for T where U: Into<T>,

type Error = Infallible

fn try_from(value: U) -> Result<T, <T as TryFrom<U>>::Error>

impl<T, U> TryInto<U> for T where U: TryFrom<T>,

type Error = <U as TryFrom<T>>::Error

fn try_into(self) -> Result<U, <U as TryFrom<T>>::Error>

impl<V, T> VZip<V> for T where V: MultiLane<T>,

fn vzip(self) -> V

impl<T> WithSubscriber for T

fn with_subscriber<S>(self, subscriber: S) -> WithDispatch<Self> where S: Into<Dispatch>,

fn with_current_subscriber(self) -> WithDispatch<Self>