docs.rs failed to build data-transform-0.1.4
Please check the build logs for more information.
See Builds for ideas on how to fix a failed build, or Metadata for how to configure docs.rs builds.
If you believe this is docs.rs' fault, open an issue.
Please check the build logs for more information.
See Builds for ideas on how to fix a failed build, or Metadata for how to configure docs.rs builds.
If you believe this is docs.rs' fault, open an issue.
Visit the last successful build:
data-transform-0.1.2
Data Transform (dt)
A fast, readable data transformation tool for working with tabular data. Built with Rust and Polars.
Why dt?
Clearer than pandas, faster than awk, zero setup.
- Readable syntax: Named operations and clear pipelines
- Polars-powered: Parallel processing, optimized queries, columnar operations
- Single binary: No Python environments, no dependencies to manage
- Interactive REPL: See results after each transformation
- Multi-file operations: Clean syntax for joins and lookups
Installation
# Shell installer (macOS/Linux)
|
# PowerShell installer (Windows)
# Cargo (any platform with Rust installed)
Quick Start
Interactive REPL
>> data = )
)
)
>> data = | )
)
)
>> data = | )
)
)
>> data = | )
)
)
>> data | )
)
The REPL shows the table after each operation, letting you verify transformations before saving.
One-liner
From script file
Example: Multi-file lookup
# Load reference data
)
)
)
# Filter and join
| )
| ) | )
# Save results
| ) # .tsv extension auto-uses tab delimiter
Core Features
Column Selection
) # By position (1-based)
) # Range (inclusive)
) # By name
) # With renaming
) # Remove columns
Filtering & Sorting
)
)
)
)
)
Transformations
)
)
))
))
String Operations
))
))
)
Renaming
)
))
) # PC1, PC2, ..., PC50
Multi-file Operations
# Load reference table
)
# Lookup values (single-line)
) | )))
# Or split at pipe boundaries for readability
) |
)))
Documentation
See REFERENCE for complete syntax and examples.
Supported Formats
- JSON (
.json) - Structured JSON data - Parquet (
.parquet) - Columnar format
Delimited text files - Delimiter auto-detected for any file:
.csv- Defaults to comma, auto-detects if ambiguous.tsv- Defaults to tab, auto-detects if ambiguous- Any other extension (
.txt,.dat,.psv, etc.) - Auto-detects delimiter
Auto-detection analyzes file content and identifies: comma, tab, pipe, semicolon, or space.
Override auto-detection if needed:
)
)
Performance
Built on Polars, dt provides:
- Parallel processing using all CPU cores
- Columnar memory layout for cache efficiency
- Query optimization and lazy evaluation
- Type-aware operations
For typical data transformation tasks, dt is 5-10x faster than awk while being significantly more readable.
REPL Commands
.help- Show help.schema- Show current table schema.vars- Show stored variables.history- Show operation history.undo [n]- Undo operations.clear- Clear current state.exit- Exit REPL
License
MIT