Modules
arrow_ndjson (io_json): APIs to read from and write to NDJSON.
binary (dtype-binary)
cat (dtype-categorical)
Traits and utilities for temporal data.
Data types supported by Polars.
json (io_json): Convert data between the Arrow memory format and JSON line-delimited records.
nan_propagating_aggregate (propagate_nans)
string (strings)
utils (private)
zip (zip_with)
Macros
Structs
Arc: A thread-safe reference-counting pointer. ‘Arc’ stands for ‘Atomically Reference Counted’.
Represents Arrow’s metadata of a “column”.
BinaryTakeRandom (dtype-binary)
BinaryTakeRandomSingleChunk (dtype-binary)
BinaryType (dtype-binary)
Specialized expressions for Categorical dtypes.
ChunkedArray
CsvReader: Create a new DataFrame by reading a csv file.
CsvWriter: Write a DataFrame to csv.
DataFrame: A contiguous growable collection of Series that have the same length.
GroupBy: Returned by a groupby operation on a DataFrame. This struct supports several aggregations.
GroupsIdx: Indexes of the groups; the first index is stored separately. This makes sorting fast.
Int128Type (dtype-i128)
IpcReader: Read Arrow’s IPC format into a DataFrame.
IpcStreamReader: Read Arrow’s Stream IPC format into a DataFrame.
IpcStreamWriter: Write a DataFrame to Arrow’s Streaming IPC format.
IpcWriter: Write a DataFrame to Arrow’s IPC format.
LazyCsvReader (csv-file)
LazyFrame: Lazy abstraction over an eager DataFrame. It really is an abstraction over a logical plan. The methods of this struct will incrementally modify a logical plan until output is requested (via collect).
LazyGroupBy: Utility struct for lazy groupby operation.
ListBinaryChunkedBuilder (dtype-binary)
Specialized expressions for Series of DataType::List.
Logical: Maps a logical type to a chunked array implementation of the physical type. This saves a lot of compiler bloat and allows us to reuse functionality.
MeltArgs: Arguments for the DataFrame::melt function.
NoNull: Just a wrapper structure, useful for certain impl specializations. It is for instance used to implement impl<T> FromIterator<T::Native> for NoNull<ChunkedArray<T>>, as Option<T::Native> was already covered by impl<T> FromIterator<Option<T::Native>> for ChunkedArray<T>.
Null: The literal Null.
ObjectTakeRandom (object)
ObjectType (object)
OptState: State of the allowed optimizations.
OwnedObject (object)
ParquetReader: Read Apache parquet format into a DataFrame.
ParquetWriteOptions (parquet)
ParquetWriter: Write a DataFrame to parquet format.
Wrapper struct that allows us to use a PhysicalExpr in polars-io.
RollingOptions (rolling_window)
RollingOptionsImpl (rolling_window)
Series
Wrapper type that has special equality properties depending on the inner type specialization.
StructArray: A StructArray is a nested Array with an optional validity representing multiple Arrays with the same number of rows.
StructChunked: This is the logical type StructChunked, which dispatches most logic to the fields’ implementations.
When: Intermediate state of a when(..).then(..).otherwise(..) expr.
WhenThen: Intermediate state of a when(..).then(..).otherwise(..) expr.
WhenThenThen: Intermediate state of chained when-then exprs.
Window: Represents a window in time.
Represents a valid zstd compression level.
Enums
DataType: The set of supported logical types in this crate.
TimeUnit: The time units defined in Arrow.
Expr: Queries consist of multiple expressions.
Compression codec.
TakeIdx: One of the three arguments allowed in unchecked_take.
Constants
Traits
ArgAgg: Argmin / Argmax.
ChunkAgg: Aggregation operations.
ChunkAggSeries: Aggregations that return Series of unit length. Those can be used in broadcasting operations.
ChunkApply: Fastest way to do elementwise operations on a ChunkedArray when the operation is cheaper than branching due to null checking.
ChunkApplyKernel: Apply kernels on the arrow array chunks in a ChunkedArray.
ChunkCast: Cast ChunkedArray<T> to ChunkedArray<N>.
ChunkCumAgg (cum_agg)
ChunkExpandAtIndex: Create a new ChunkedArray filled with values at that index.
ChunkExplode: Explode / flatten a List or Utf8 Series.
ChunkFillNullValue: Replace None values with a value.
ChunkFilter: Filter values by a boolean mask.
ChunkFull: Fill a ChunkedArray with one value.
ChunkPeaks: Find local minima / maxima.
ChunkQuantile: Quantile and median aggregation.
ChunkReverse: Reverse a ChunkedArray.
ChunkRollApply (rolling_window): This differs from ChunkWindowCustom and ChunkWindow by not using a fold aggregator, but reusing a Series wrapper and calling Series aggregators. This is likely a bit slower than ChunkWindow.
ChunkSet: Create a ChunkedArray with new values by index or by boolean mask. Note that these operations clone data. This is however the only way we can modify at mask or index level, as the underlying Arrow arrays are immutable.
ChunkShift: Shift the values of a ChunkedArray by a number of periods.
ChunkSort: Sort operations on ChunkedArray.
ChunkTake: Fast access by index.
ChunkTakeEvery: Traverse and collect every nth element.
ChunkUnique: Get unique values in a ChunkedArray.
ChunkVar: Variance and standard deviation aggregation.
ChunkZip: Combine 2 ChunkedArrays based on some predicate.
IndexOfSchema (private): This trait exists to unify the API of polars’ Schema and arrow’s Schema.
IntoGroupsProxy: Used to create the tuples for a groupby operation.
IntoTakeRandom: Create a type that implements a faster TakeRandom.
IsFirst (is_first): Mask the first unique values as true.
Safety
IsIn (is_in): Check if an element is a member of a list array.
IsLast (is_first): Mask the last unique values as true.
PhysicalExpr: Take a DataFrame and evaluate the expressions.
Implement this for Column, lt, eq, etc.
PolarsIterator: A PolarsIterator is an iterator over a ChunkedArray which contains polars types. A PolarsIterator must implement ExactSizeIterator and DoubleEndedIterator.
Values need to implement this so that they can be stored into a Series and DataFrame.
Any type that is not nested.
RepeatBy (repeat_by): Repeat the values n times.
RollingAgg (rolling_window)
A wrapper trait for any binary closure Fn(Series, Series) -> PolarsResult<Series>.
A wrapper trait for any closure Fn(Vec<Series>) -> PolarsResult<Series>.
StrConcat (concat_str): Concat the values into a string array.
TakeRandom: Random access.
Ensure that the same hash is used as with VecHash.
Functions
all: Selects all columns.
all_exprs: Evaluate all the expressions with a bitwise and.
any_exprs: Evaluate all the expressions with a bitwise or.
apply_multiple: Apply a function/closure over the groups of multiple columns. This should only be used in a groupby aggregation.
arange (arange): Create list entries that are range arrays.
arg_where (arg_where): Get the indices where condition evaluates true.
Find the indexes that would sort these series in order of appearance. That means that the first Series will be used to determine the ordering until duplicates are found. Once duplicates are found, the next Series will be used, and so on.
as_struct (dtype-struct): Take several expressions and collect them into a StructChunked.
avg: Find the mean of all the values in this Expression.
coalesce: Folds the expressions from left to right, keeping the first non-null values.
col: Create a Column Expression based on a column name.
collect_all: Collect all LazyFrame computations.
cols: Select multiple columns by name.
Concat multiple
Concat lists entries.
concat_str (concat_str and strings): Horizontally concat string columns in linear time.
count: Count expression.
cov: Compute the covariance between two columns.
cumfold_exprs (dtype-struct): Accumulate over multiple columns horizontally / row wise.
cumreduce_exprs (dtype-struct): Accumulate over multiple columns horizontally / row wise.
datetime (temporal)
datetime_to_timestamp_ms (private)
datetime_to_timestamp_ns (private)
datetime_to_timestamp_us (private)
diag_concat_lf (diagonal_concat): Concat LazyFrames diagonally. Calls [concat] internally.
dtype_col: Select multiple columns by dtype.
dtype_cols: Select multiple columns by dtype.
duration (temporal)
first: First column in DataFrame.
fold_exprs: Accumulate over multiple columns horizontally / row wise.
format_str (concat_str and strings): Format the results of an array of expressions using a format string.
Different from groupby_windows, which defines window buckets and searches which values fit each pre-defined bucket, this function defines every window based on:
- timestamp (lower bound)
- timestamp + period (upper bound)
where timestamps are the individual values in the array time.
Based on the given Window, which has an
is_not_null: IsNotNull expression.
last: Last column in DataFrame.
lit: Create a Literal Expression from L.
map_binary: Apply a closure on the two columns that are evaluated from Expr a and Expr b.
Apply a function/closure over multiple columns once the logical plan gets executed.
Apply a function/closure over multiple columns once the logical plan gets executed.
max: Find the maximum of all the values in this Expression.
max_exprs: Get the maximum value per row.
mean: Find the mean of all the values in this Expression.
median: Find the median of all the values in this Expression.
min: Find the minimum of all the values in this Expression.
pearson_corr: Compute the Pearson correlation between two columns.
quantile: Find a specific quantile of all the values in this Expression.
range: Create a range literal.
repeat: Repeat a literal value n times.
spearman_rank_corr (rank and propagate_nans): Compute the Spearman rank correlation between two columns. Missing data will be excluded from the computation.
sum: Sum all the values in this Expression.
sum_exprs: Get the sum of the values per row.
when: Start a when-then-otherwise expression.
Type Definitions
AllowedOptimizations
PolarsResult: Typedef for a std::result::Result of an Error.
BinaryChunked (dtype-binary)
Dummy type; we need to instantiate all generic types, so we fill one with a dummy.
Every group is indicated by an array where the
IdxArr (Non-bigidx)
IdxCa (Non-bigidx)
IdxSize (Non-bigidx): The type used by polars to index data.
IdxType (Non-bigidx)
Int128Chunked (dtype-i128)
ObjectChunked (object)
PlHashMap (private)
PlHashSet (private)
PlIdHashMap (private): This hashmap uses an IdHasher.
PlIndexMap (private)
PlIndexSet (private)