Struct differential_dataflow::collection::Collection [] [src]

pub struct Collection<G: Scope, D, R: Diff = isize> {
    pub inner: Stream<G, (D, G::Timestamp, R)>,
}

A mutable collection of values of type D

The Collection type is the core abstraction in differential dataflow programs. As you write your differential dataflow computation, you write as if the collection is a static dataset to which you apply functional transformations, creating new collections. Once your computation is written, you are able to mutate the collection (by inserting and removing elements); differential dataflow will propagate changes through your functional computation and report the corresponding changes to the output collections.

Each collection has three generic parameters. The parameter G is for the scope in which the collection exists; as you write more complicated programs you may wish to introduce nested scopes (e.g. for iteration) and this parameter tracks the scope (for timely dataflow's benefit). The D parameter is the type of data in your collection, for example String, or (u32, Vec<Option<()>>). The R parameter represents the types of changes that the data undergo, and is most commonly (and defaults to) isize, representing changes to the occurrence count of each record.

Fields

The underlying timely dataflow stream.

This field is exposed to support direct timely dataflow manipulation when required, but it is not intended to be the idiomatic way to work with the collection.

Methods

impl<G: Scope, D: Data, R: Diff> Collection<G, D, R> where
    G::Timestamp: Data
[src]

[src]

Creates a new Collection from a timely dataflow stream.

This method seems to be rarely used, with the as_collection method on streams being a more idiomatic approach to convert timely streams to collections. Also, the input::Input trait provides a new_collection method which will create a new collection for you without exposing the underlying timely stream at all.

[src]

Creates a new collection by applying the supplied function to each input element.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map(|x| x * 2)
             .filter(|x| x % 2 == 1)
             .assert_empty();
    });
}

[src]

Creates a new collection by applying the supplied function to each input element.

Although the name suggests in-place mutation, this function does not change the source collection, but rather re-uses the underlying allocations in its implementation. The method is semantically equivalent to map, but can be more efficient.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map_in_place(|x| *x *= 2)
             .filter(|x| x % 2 == 1)
             .assert_empty();
    });
}

[src]

Creates a new collection by applying the supplied function to each input element and accumulating the results.

This method extracts an iterator from each input element, and extracts the full contents of the iterator. Be warned that if the iterators produce substantial amounts of data, they are currently fully drained before attempting to consolidate the results.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .flat_map(|x| 0 .. x);
    });
}

[src]

Creates a new collection whose counts are the negation of those in the input.

This method is most commonly used with concat to get those element in one collection but not another. However, differential dataflow computations are still defined for all values of the difference type R, including negative counts.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let odds = data.filter(|x| x % 2 == 1);
        let evens = data.filter(|x| x % 2 == 0);

        odds.negate()
            .concat(&data)
            .assert_eq(&evens);
    });
}

[src]

Creates a new collection containing those input records satisfying the supplied predicate.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map(|x| x * 2)
             .filter(|x| x % 2 == 1)
             .assert_empty();
    });
}

[src]

Creates a new collection accumulating the contents of the two collections.

Despite the name, differential dataflow collections are unordered. This method is so named because the implementation is the concatenation of the stream of updates, but it corresponds to the addition of the two collections.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let odds = data.filter(|x| x % 2 == 1);
        let evens = data.filter(|x| x % 2 == 0);

        odds.concat(&evens)
            .assert_eq(&data);
    });
}

[src]

Replaces each record with another, with a new difference type.

This method is most commonly used to take records containing aggregatable data (e.g. numbers to be summed) and move the data into the difference component. This will allow differential dataflow to update in-place.

#Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let nums = scope.new_collection_from(0 .. 10).1;
        let x1 = nums.flat_map(|x| 0 .. x);
        let x2 = nums.map(|x| (x, 9 - x))
                     .explode(|(x,y)| Some((x,y)));

        x1.assert_eq(&x2);
    });
}

[src]

Brings a Collection into a nested scope.

Examples

extern crate timely;
extern crate differential_dataflow;

use timely::dataflow::Scope;
use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let result = scope.scoped::<(),_,_>(|child| {
            data.enter(child)
                .leave()
        });

        data.assert_eq(&result);
    });
}

[src]

Brings a Collection into a nested scope, at varying times.

The initial function indicates the time at which each element of the Collection should appear.

Examples

extern crate timely;
extern crate differential_dataflow;

use timely::dataflow::Scope;
use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let result = scope.scoped(|child| {
            data.enter_at(child, |x| *x)
                .leave()
        });

        data.assert_eq(&result);
    });
}

[src]

Applies a supplied function to each update.

This method is most commonly used to report information back to the user, often for debugging purposes. Any function can be used here, but be warned that the incremental nature of differential dataflow does not guarantee that it will be called as many times as you might expect.

The (data, time, diff) triples indicate a change diff to the frequency of data which takes effect at the logical time time. When times are totally ordered (for example, usize), these updates reflect the changes along the sequence of collections. For partially ordered times, the mathematics are more interesting and less intuitive, unfortunately.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map_in_place(|x| *x *= 2)
             .filter(|x| x % 2 == 1)
             .inspect(|x| println!("error: {:?}", x));
    });
}

[src]

Applies a supplied function to each batch of updates.

This method is analogous to inspect, but operates on batches and reveals the timestamp of the timely dataflow capability associated with the batch of updates. The observed batching depends on how the system executes, and may vary run to run.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map_in_place(|x| *x *= 2)
             .filter(|x| x % 2 == 1)
             .inspect_batch(|t,xs| println!("errors @ {:?}: {:?}", t, xs));
    });
}

[src]

Attaches a timely dataflow probe to the output of a Collection.

This probe is used to determine when the state of the Collection has stabilized and can be read out.

[src]

Attaches a timely dataflow probe to the output of a Collection.

This probe is used to determine when the state of the Collection has stabilized and all updates observed. In addition, a probe is also often use to limit the number of rounds of input in flight at any moment; a computation can wait until the probe has caught up to the input before introducing more rounds of data, to avoid swamping the system.

[src]

Assert if the collections are ever different.

Because this is a dataflow fragment, the test is only applied as the computation is run. If the computation is not run, or not run to completion, there may be un-exercised times at which the collections could vary. Typically, a timely dataflow computation runs to completion on drop, and so clean exit from a program should indicate that this assertion never found cause to complain.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let odds = data.filter(|x| x % 2 == 1);
        let evens = data.filter(|x| x % 2 == 0);

        odds.concat(&evens)
            .assert_eq(&data);
    });
}

[src]

Assert if the collection is ever non-empty.

Because this is a dataflow fragment, the test is only applied as the computation is run. If the computation is not run, or not run to completion, there may be un-exercised times at which the collection could be non-empty. Typically, a timely dataflow computation runs to completion on drop, and so clean exit from a program should indicate that this assertion never found cause to complain.

Examples

extern crate timely;
extern crate differential_dataflow;

use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {
        scope.new_collection_from(1 .. 10).1
             .map(|x| x * 2)
             .filter(|x| x % 2 == 1)
             .assert_empty();
    });
}

[src]

The scope containing the underlying timely dataflow stream.

impl<'a, G: Scope, T: Timestamp, D: Data, R: Diff> Collection<Child<'a, G, T>, D, R>
[src]

[src]

Returns the final value of a Collection from a nested scope to its containing scope.

Examples

extern crate timely;
extern crate differential_dataflow;

use timely::dataflow::Scope;
use differential_dataflow::input::Input;
use differential_dataflow::operators::*;

fn main() {
    ::timely::example(|scope| {

        let data = scope.new_collection_from(1 .. 10).1;

        let result = scope.scoped::<(),_,_>(|child| {
            data.enter(child)
                .leave()
        });

        data.assert_eq(&result);
    });
}

Trait Implementations

impl<G: Scope, K: Data + Hashable, V: Data, R: Diff, T> Arrange<G, K, V, R, T> for Collection<G, (K, V), R> where
    G::Timestamp: Lattice + Ord,
    T: Trace<K, V, G::Timestamp, R> + 'static,
    T::Batch: Batch<K, V, G::Timestamp, R>, 
[src]

[src]

Arranges a stream of (Key, Val) updates by Key. Accepts an empty instance of the trace type. Read more

impl<G: Scope, K: Data + Hashable, R: Diff, T> Arrange<G, K, (), R, T> for Collection<G, K, R> where
    G::Timestamp: Lattice + Ord,
    T: Trace<K, (), G::Timestamp, R> + 'static,
    T::Batch: Batch<K, (), G::Timestamp, R>, 
[src]

[src]

Arranges a stream of (Key, Val) updates by Key. Accepts an empty instance of the trace type. Read more

impl<G: Scope, K: Data + Hashable, V: Data, R: Diff> ArrangeByKey<G, K, V, R> for Collection<G, (K, V), R> where
    G::Timestamp: Lattice + Ord
[src]

[src]

Arranges a collection of (Key, Val) records by Key. Read more

impl<G: Scope, K: Data + Hashable, R: Diff> ArrangeBySelf<G, K, R> for Collection<G, K, R> where
    G::Timestamp: Lattice + Ord
[src]

[src]

Arranges a collection of Key records by Key. Read more

impl<G: Scope, K: Data + Hashable, V: Data, R: Diff> Group<G, K, V, R> for Collection<G, (K, V), R> where
    G::Timestamp: Lattice + Ord + Debug,
    <K as Hashable>::Output: Data
[src]

[src]

Groups records by their first field, and applies reduction logic to the associated values. Read more

impl<G: Scope, K: Data + Hashable, R1: Diff> Threshold<G, K, R1> for Collection<G, K, R1> where
    G::Timestamp: Lattice + Ord + Debug
[src]

[src]

Transforms the multiplicity of records. Read more

[src]

Reduces the collection to one occurrence of each distinct element. Read more

impl<G: Scope, K: Data + Hashable, R: Diff> Count<G, K, R> for Collection<G, K, R> where
    G::Timestamp: Lattice + Ord + Debug
[src]

[src]

Counts the number of occurrences of each element. Read more

impl<G, K, V, R> GroupArranged<G, K, V, R> for Collection<G, (K, V), R> where
    G: Scope,
    G::Timestamp: Lattice + Ord,
    K: Data + Hashable,
    V: Data,
    R: Diff
[src]

[src]

Applies group to arranged data, and returns an arrangement of output data. Read more

impl<G: Scope, D, R> Consolidate<D> for Collection<G, D, R> where
    D: Data + Hashable,
    R: Diff,
    G::Timestamp: Lattice + Ord
[src]

[src]

Aggregates the weights of equal records into at most one record. Read more

impl<G: Scope, D: Ord + Data + Debug, R: Diff> Iterate<G, D, R> for Collection<G, D, R>
[src]

[src]

Iteratively apply logic to the source collection until convergence. Read more

impl<G, K, V, R> Join<G, K, V, R> for Collection<G, (K, V), R> where
    G: Scope,
    K: Data + Hashable,
    V: Data,
    R: Diff,
    G::Timestamp: Lattice + Ord
[src]

[src]

Matches pairs (key,val1) and (key,val2) based on key and then applies a function. Read more

[src]

Matches pairs (key, val) and key based on key, producing the former with frequencies multiplied. Read more

[src]

Matches pairs (key, val) and key based on key, discarding values in the first collection if their key is present in the second. Read more

[src]

Matches pairs (key,val1) and (key,val2) based on key and then applies a function. Read more

impl<G, K, V, R> JoinCore<G, K, V, R> for Collection<G, (K, V), R> where
    G: Scope,
    K: Data + Hashable,
    V: Data,
    R: Diff,
    G::Timestamp: Lattice + Ord
[src]

[src]

Joins two arranged collections with the same key type. Read more

impl<G: Scope, K: Data + Hashable, R: Diff> CountTotal<G, K, R> for Collection<G, K, R> where
    G::Timestamp: TotalOrder + Lattice + Ord
[src]

[src]

Counts the number of occurrences of each element. Read more

impl<G: Scope, K: Data + Hashable, R: Diff> ThresholdTotal<G, K, R> for Collection<G, K, R> where
    G::Timestamp: TotalOrder + Lattice + Ord
[src]

[src]

Reduces the collection to one occurrence of each distinct element. Read more

[src]

Reduces the collection to one occurrence of each distinct element. Read more

impl<G, K, D> PrefixSum<G, K, D> for Collection<G, ((usize, K), D)> where
    G: Scope,
    G::Timestamp: Lattice,
    K: Data + Hash,
    D: Data + Hash
[src]

[src]

Computes the prefix sum for each element in the collection. Read more

[src]

Determine the prefix sum at each element of location.

impl<G: Clone + Scope, D: Clone, R: Clone + Diff> Clone for Collection<G, D, R> where
    G::Timestamp: Clone
[src]

[src]

Returns a copy of the value. Read more

1.0.0
[src]

Performs copy-assignment from source. Read more

Auto Trait Implementations

impl<G, D, R = isize> !Send for Collection<G, D, R>

impl<G, D, R = isize> !Sync for Collection<G, D, R>