Skip to main content

Crate robin_sparkless

Crate robin_sparkless 

Source
Expand description

Robin Sparkless - A Rust DataFrame library with PySpark-like API

This library provides a PySpark-compatible API built on top of Polars, offering high-performance data processing in pure Rust.

§Getting started and embedding

For application code and embedding, use the prelude: use robin_sparkless::prelude::*. For a minimal FFI surface, use prelude::embed.

§Panics and errors

Some functions panic when used with invalid or empty inputs (e.g. calling when(cond).otherwise(val) without .then(), or passing no columns to format_string, elt, concat, coalesce, or named_struct in Rust). In Rust, create_map and array return Result for empty input instead of panicking. From Python, empty columns for coalesce, format_string, printf, and named_struct raise ValueError. See the documentation for each function for details.

§API stability

While the crate is in the 0.x series, we follow semver but may introduce breaking changes in minor releases (e.g. 0.1 → 0.2) until 1.0. For behavioral caveats and intentional differences from PySpark, see the repository documentation.

Re-exports§

pub use column::Column;
pub use config::SparklessConfig;
pub use dataframe::CubeRollupData;
pub use dataframe::DataFrame;
pub use dataframe::GroupedData;
pub use dataframe::JoinType;
pub use dataframe::PivotedGroupedData;
pub use dataframe::SaveMode;
pub use dataframe::WriteFormat;
pub use dataframe::WriteMode;
pub use error::EngineError;
pub use functions::SortOrder;
pub use schema::DataType;
pub use schema::StructField;
pub use schema::StructType;
pub use schema::schema_from_json;
pub use session::DataFrameReader;
pub use session::SparkSession;
pub use session::SparkSessionBuilder;
pub use traits::FromRobinDf;
pub use traits::IntoRobinDf;
pub use functions::*;

Modules§

column
config
Configuration for robin-sparkless sessions.
dataframe
DataFrame module: main tabular type and submodules for transformations, aggregations, joins, stats.
error
Engine error type for embedders.
expression
functions
plan
Plan interpreter: execute a serialized logical plan (list of ops) using the existing DataFrame API.
prelude
One-stop prelude for application code and embedding.
schema
session
sql
SQL parsing and translation to DataFrame operations. Compiled only when the sql feature is enabled.
traits
Traits for idiomatic conversion to and from robin-sparkless DataFrames.
type_coercion

Type Aliases§

Expr
Re-export the underlying expression and literal types so downstream bindings can depend on robin_sparkless::Expr / LiteralValue instead of importing Polars directly.
LiteralValue