datafusion 0.1.7

DataFusion is a datasource-agnostic distributed query processing framework for Rust inspired by Apache Spark

DataFusion: Big Data Platform for Rust

DataFusion is a distributed data processing platform implemented in Rust. It is heavily inspired by Apache Spark and offers a similar programming style based on DataFrames and SQL.

DataFusion can also be used as a crate dependency in your project if you want to run SQL queries and DataFrame-style data manipulation in-process against your own data sources. In that respect, DataFusion is inspired by Apache Calcite in the Java world.
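
For in-process use, the crate has to be declared in your project's manifest. A minimal sketch of the Cargo.toml entry, assuming you want the 0.1.7 release named at the top of this page (pin whichever version you actually target):

```toml
[dependencies]
# Assumption: 0.1.7 is the version listed on this page; adjust as needed.
datafusion = "0.1.7"
```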

Project Home Page

The project home page is now at https://datafusion.rs and contains the roadmap as well as documentation for using this crate or running DataFusion as a distributed cluster. I am using GitHub issues to track development tasks and feedback.

Gitter

There is a Gitter channel where you can ask questions about the project or suggest features.

Supported Rust versions

DataFusion uses Travis CI to build against Rust stable and nightly, and the goal is to keep it compatible with both.

Contributing

Contributors are welcome! Please see CONTRIBUTING.md for details.