DataFusion is an extensible query execution framework that uses Apache Arrow as the memory model.
DataFusion supports both SQL and a Table/DataFrame-style API for building logical query plans and also provides a query optimizer and execution engine capable of parallel execution against partitioned data sources (CSV and Parquet) using threads.
DataFusion currently supports simple projection, selection, and aggregate queries.