Expand description
Various transformation constructors.
The different crate::core::Transformation
implementations in this module are accessed by calling the appropriate constructor function.
Constructors are named in the form make_xxx()
, where xxx
indicates what the resulting Transformation
does.
Re-exportsยง
pub use quantile_score_candidates::*;
Modulesยง
- b_
ary_ ๐tree - cast ๐
- cast_
metric ๐ - clamp ๐
- count ๐
- count_
cdf ๐ - covariance ๐
- dataframe ๐
- impute ๐
- index ๐
- lipschitz_
mul ๐ - make_
stable_ ๐expr - make_
stable_ ๐lazyframe - manipulation ๐
- mean ๐
- quantile_
score_ candidates - resize ๐
- sum ๐
- sum_
of_ ๐squared_ deviations - variance ๐
Structsยง
- Data
Frame Domain - Pairwise
- Marker type to represent pairwise, or cascading summation
- Sequential
- Marker type to represent sequential, or recursive summation
Traitsยง
- BAry
Tree Metric - Dataset
Domain - A
Domain
representing a dataset. - Dataset
Metric - Drop
Null Domain - Utility trait to drop null values from a dataset, regardless of the representation of nullity.
- Impute
Constant Domain - Utility trait to impute with a constant, regardless of the representation of nullity.
- RowBy
RowDomain - Stable
DslPlan - Stable
Expr
Functionsยง
- choose_
branching_ factor - Returns an approximation to the ideal
branching_factor
for a dataset of a given size, that minimizes error in cdf and quantile estimates based on b-ary trees. - make_
b_ ary_ tree - Expand a vector of counts into a b-ary tree of counts,
where each branch is the sum of its
b
immediate children. - make_
bounded_ float_ checked_ sum - Make a Transformation that computes the sum of bounded data with known dataset size.
- make_
bounded_ float_ ordered_ sum - Make a Transformation that computes the sum of bounded floats with known ordering.
- make_
bounded_ int_ monotonic_ sum - Make a Transformation that computes the sum of bounded ints, where all values share the same sign.
- make_
bounded_ int_ ordered_ sum - Make a Transformation that computes the sum of bounded ints.
You may need to use
make_ordered_random
to impose an ordering on the data. - make_
bounded_ int_ split_ sum - Make a Transformation that computes the sum of bounded ints. Adds the saturating sum of the positives to the saturating sum of the negatives.
- make_
cast - Make a Transformation that casts a vector of data from type
TIA
to typeTOA
. For each element, failure to parse results inNone
, elseSome(out)
. - make_
cast_ default - Make a Transformation that casts a vector of data from type
TIA
to typeTOA
. Any element that fails to cast is filled with default. - make_
cast_ inherent - Make a Transformation that casts a vector of data from type
TIA
to a type that can represent nullityTOA
. If cast fails, fill withTOA
โs null value. - make_
cdf - Postprocess a noisy array of float summary counts into a cumulative distribution.
- make_
clamp - Make a Transformation that clamps numeric data in
Vec<TA>
tobounds
. - make_
consistent_ b_ ary_ tree - Postprocessor that makes a noisy b-ary tree internally consistent, and returns the leaf layer.
- make_
count - Make a Transformation that computes a count of the number of records in data.
- make_
count_ by - Make a Transformation that computes the count of each unique value in data. This assumes that the category set is unknown.
- make_
count_ by_ categories - Make a Transformation that computes the number of times each category appears in the data. This assumes that the category set is known.
- make_
count_ distinct - Make a Transformation that computes a count of the number of unique, distinct records in data.
- make_
create_ dataframe Deprecated - Make a Transformation that constructs a dataframe from a
Vec<Vec<String>>
(a vector of records). - make_
df_ cast_ default Deprecated - Make a Transformation that casts the elements in a column in a dataframe from type
TIA
to typeTOA
. If cast fails, fill with default. - make_
df_ is_ equal Deprecated - Make a Transformation that checks if each element in a column in a dataframe is equivalent to
value
. - make_
drop_ null - Make a Transformation that drops null values.
- make_
find - Find the index of a data value in a set of categories.
- make_
find_ bin - Make a transformation that finds the bin index in a monotonically increasing vector of edges.
- make_
identity - Make a Transformation representing the identity function.
- make_
impute_ constant - Make a Transformation that replaces null/None data with
constant
. - make_
impute_ uniform_ float - Make a Transformation that replaces NaN values in
Vec<TA>
with uniformly distributed floats withinbounds
. - make_
index - Make a transformation that treats each element as an index into a vector of categories.
- make_
is_ equal - Make a Transformation that checks if each element is equal to
value
. - make_
is_ null - Make a Transformation that checks if each element in a vector is null or nan.
- make_
lipschitz_ float_ mul - Make a transformation that multiplies an aggregate by a constant.
- make_
mean - Make a Transformation that computes the mean of bounded data.
- make_
metric_ bounded - Make a Transformation that converts the unbounded dataset metric
MI
to the respective bounded dataset metric with a no-op. - make_
metric_ unbounded - Make a Transformation that converts the bounded dataset metric
MI
to the respective unbounded dataset metric with a no-op. - make_
ordered_ random - Make a Transformation that converts the unordered dataset metric
SymmetricDistance
to the respective ordered dataset metricInsertDeleteDistance
by assigning a random permutation. - make_
quantiles_ from_ counts - Postprocess a noisy array of summary counts into quantiles.
- make_
resize - Make a Transformation that either truncates or imputes records
with
constant
to match a providedsize
. - make_
select_ column Deprecated - Make a Transformation that retrieves the column
key
from a dataframe asVec<TOA>
. - make_
sized_ bounded_ covariance - make_
sized_ bounded_ float_ checked_ sum - Make a Transformation that computes the sum of bounded floats with known dataset size.
- make_
sized_ bounded_ float_ ordered_ sum - Make a Transformation that computes the sum of bounded floats with known ordering and dataset size.
- make_
sized_ bounded_ int_ checked_ sum - Make a Transformation that computes the sum of bounded ints. The effective range is reduced, as (bounds * size) must not overflow.
- make_
sized_ bounded_ int_ monotonic_ sum - Make a Transformation that computes the sum of bounded ints, where all values share the same sign.
- make_
sized_ bounded_ int_ ordered_ sum - Make a Transformation that computes the sum of bounded ints with known dataset size.
- make_
sized_ bounded_ int_ split_ sum - Make a Transformation that computes the sum of bounded ints with known dataset size.
- make_
split_ dataframe Deprecated - Make a Transformation that splits each record in a String into a
Vec<Vec<String>>
, and loads the resulting table into a dataframe keyed bycol_names
. - make_
split_ lines - Make a Transformation that takes a string and splits it into a
Vec<String>
of its lines. - make_
split_ records - Make a Transformation that splits each record in a
Vec<String>
into aVec<Vec<String>>
. - make_
stable_ expr - Create a stable transformation from an
Expr
. - make_
stable_ lazyframe - Create a stable transformation from a
LazyFrame
. - make_
subset_ by Deprecated - Make a Transformation that subsets a dataframe by a boolean column.
- make_
sum - Make a Transformation that computes the sum of bounded data.
Use
make_clamp
to bound data. - make_
sum_ of_ squared_ deviations - Make a Transformation that computes the sum of squared deviations of bounded data.
- make_
unordered - Make a Transformation that converts the ordered dataset metric
MI
to the respective ordered dataset metric with a no-op. - make_
variance - Make a Transformation that computes the variance of bounded data.