Expand description
Schema inference engine for columnar JSON/NDJSON data.
Takes the output of ndjson::preprocess (columns separated by \x00,
values within columns separated by \x01) and infers the type of each
column by examining its values.
The inferred schema can be serialized into compact binary metadata for storage in .dcx transform metadata, and deserialized by the decoder.
Structs§
- Column
Schema - Schema for a single column: type + null statistics.
- Inferred
Schema - Inferred schema for an entire columnar dataset.
Enums§
- Column
Type - Inferred type of a single column.
- Timestamp
Format - Detected timestamp format.
Functions§
- deserialize_
schema - Deserialize schema from transform metadata bytes.
- infer_
schema - Infer schema from columnar data (post
ndjson::preprocessoutput). - serialize_
schema - Serialize schema to compact binary bytes for transform metadata.