hawk ๐ฆ
Modern data analysis tool for JSON, YAML, CSV, and text files
hawk combines the simplicity of awk with the power of pandas, bringing unified data processing to your command line. Process any data format with the same intuitive syntax.
โก Quick Start
Installation
# Homebrew (macOS/Linux)
# Cargo (Rust)
# Verify installation
30-Second Demo
# JSON/CSV analysis - same syntax!
# Text/log processing with slicing (NEW!)
# Advanced string operations with multiple fields
๐ Why hawk?
| Feature | hawk | jq | awk | pandas |
|---|---|---|---|---|
| Multi-format | โ JSON, YAML, CSV, Text | โ JSON only | โ Text only | โ Python required |
| Unified syntax | โ Same queries everywhere | โ JSON-specific | โ Line-based | โ Complex setup |
| String operations | โ 14 built-in + slicing | โ ๏ธ Limited | โ ๏ธ Basic | โ Extensive |
| Statistical analysis | โ Built-in median, stddev | โ None | โ None | โ Full suite |
| Learning curve | ๐ข Familiar pandas-like | ๐ก Steep | ๐ข Simple | ๐ด High |
๐ฏ Key Features
Universal Data Processing
Process any format with identical syntax:
Advanced Text Processing (NEW in v0.2.3!)
# Split with slicing - extract exactly what you need
|
# โ ["2024-01-15", "10:30:45"]
# OR conditions for flexible filtering
# Powerful slicing for any operation result
Statistical Analysis Made Simple
# Instant insights from your data
๐ Documentation
Get Started in 5 Minutes
- ๐ Quick Start Guide - Essential basics
- ๐ Query Language Reference - Complete syntax
- ๐งต String Operations - Text processing guide
Master Advanced Features
- ๐ Data Analysis - Statistical workflows
- ๐ Text Processing - Log analysis and text manipulation
- ๐ผ Real-world Examples - Industry-specific use cases
Use Case Guides(In progress)
- ๐ Log Analysis - Docker, nginx, application logs
- โ๏ธ DevOps Workflows - Kubernetes, CI/CD, monitoring
- ๐ Data Science - CSV analysis, statistics, ML prep
๐ Popular Workflows
Log Analysis
# Find error patterns in application logs
# Analyze Docker container performance
Data Exploration
# Quick dataset overview
# Statistical analysis
DevOps Automation
# Kubernetes resource analysis
# Performance monitoring
โญ What's New in v0.2.3
- ๐ฏ Advanced Slicing:
.[0:10],.[-5:],group_by(.field) | .[0:3] - โ๏ธ Split with Slicing:
split(" ")[0:3],split(",")[-2:] - ๐ OR Conditions:
contains("GET|POST"),starts_with("ERROR|WARN") - ๐ Stratified Sampling: Sample from each group for unbiased analysis
- โก Performance: Optimized for large datasets with efficient memory usage
๐ค Contributing
We welcome contributions! See our Contributing Guide.
๐ License
MIT License - see LICENSE for details.
Ready to transform your data workflows? Start with our 5-minute tutorial ๐