Function compute_statistics_with_options 

pub fn compute_statistics_with_options(
    lf: &LazyFrame,
    sample_size: Option<usize>,
    seed: u64,
    options: ComputeOptions,
) -> Result<AnalysisResults>
Computes comprehensive statistics for a LazyFrame.

Main entry point for statistical analysis. Computes:

  • Basic statistics (count, nulls, min, max, mean) for all columns
  • Numeric statistics (percentiles, skewness, kurtosis, outliers) for numeric columns
  • Categorical statistics (unique count, mode, top values) for categorical columns
  • Distribution detection and analysis for numeric columns (if enabled)
  • Correlation matrix for numeric columns (if enabled)
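To make the first bullet concrete, here is a minimal, std-only sketch of the basic statistics for a single nullable numeric column. It is not this crate's implementation: `BasicStats` and `basic_stats` are hypothetical names, the column is modelled as a plain slice of `Option<f64>` rather than a `LazyFrame`, and counting nulls as part of `count` is an assumption.

```rust
/// Hypothetical summary type mirroring the "basic statistics" bullet.
#[derive(Debug, PartialEq)]
pub struct BasicStats {
    /// Total number of rows, nulls included (an assumption of this sketch).
    pub count: usize,
    /// Number of null (`None`) entries.
    pub nulls: usize,
    /// Smallest non-null value, or `None` if every entry is null.
    pub min: Option<f64>,
    /// Largest non-null value, or `None` if every entry is null.
    pub max: Option<f64>,
    /// Mean of the non-null values, or `None` if every entry is null.
    pub mean: Option<f64>,
}

/// Computes count, nulls, min, max and mean in a single pass.
pub fn basic_stats(column: &[Option<f64>]) -> BasicStats {
    let mut nulls = 0usize;
    let mut non_null = 0usize;
    let mut sum = 0.0_f64;
    let mut min: Option<f64> = None;
    let mut max: Option<f64> = None;

    for value in column {
        match value {
            None => nulls += 1,
            Some(v) => {
                non_null += 1;
                sum += *v;
                // Seed min/max with the first non-null value, then fold.
                min = Some(min.map_or(*v, |m| m.min(*v)));
                max = Some(max.map_or(*v, |m| m.max(*v)));
            }
        }
    }

    BasicStats {
        count: column.len(),
        nulls,
        min,
        max,
        // Avoid dividing by zero when the column has no non-null values.
        mean: if non_null > 0 { Some(sum / non_null as f64) } else { None },
    }
}
```

Calling `basic_stats(&[Some(1.0), None, Some(3.0)])` under these assumptions gives a count of 3, one null, a minimum of 1.0, a maximum of 3.0 and a mean of 2.0.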

Datasets that exceed the sampling threshold are sampled automatically.
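The role of `seed` in reproducible sampling can be illustrated with a small std-only sketch. Everything here is an assumption made for illustration: `sample_indices` and `splitmix64` are hypothetical helpers, the sketch treats `sample_size` as both the threshold and the number of rows to keep, and the crate's actual sampling strategy may differ.

```rust
/// SplitMix64 step: a small deterministic generator used here only so the
/// sketch needs no external crates.
fn splitmix64(state: &mut u64) -> u64 {
    *state = state.wrapping_add(0x9E37_79B9_7F4A_7C15);
    let mut z = *state;
    z = (z ^ (z >> 30)).wrapping_mul(0xBF58_476D_1CE4_E5B9);
    z = (z ^ (z >> 27)).wrapping_mul(0x94D0_49BB_1331_11EB);
    z ^ (z >> 31)
}

/// Hypothetical helper: returns the sorted row indices to analyze.
///
/// Assumption: when `sample_size` is `None`, or the data has no more rows
/// than `sample_size`, every row is kept; otherwise exactly `sample_size`
/// distinct rows are drawn.
pub fn sample_indices(n_rows: usize, sample_size: Option<usize>, seed: u64) -> Vec<usize> {
    let k = match sample_size {
        Some(k) if n_rows > k => k,
        _ => return (0..n_rows).collect(),
    };

    // Partial Fisher-Yates shuffle: only the first `k` positions are settled.
    let mut indices: Vec<usize> = (0..n_rows).collect();
    let mut state = seed;
    for i in 0..k {
        // The modulo introduces a slight bias, which is acceptable for a sketch.
        let offset = (splitmix64(&mut state) % (n_rows - i) as u64) as usize;
        indices.swap(i, i + offset);
    }
    indices.truncate(k);
    indices.sort_unstable(); // keep the original row order within the sample
    indices
}
```

Because the generator is driven only by `seed`, two calls with the same arguments select the same rows, which is what makes statistics computed on a sample repeatable across runs.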