xerg - High-Performance Parallel Grep Tool
Published on crates.io as xerg
An ultra-fast, parallel grep implementation in Rust with syntax highlighting and detailed search statistics. Built for performance with multi-core processing and optimized dependencies.
This repository is part of the Pragmatic AI Labs Rust Bootcamp.
Features
- ✅ Parallel Processing: Multi-core file processing with intelligent thread pool management
- ✅ Pattern Matching: Regular expression engine with optimized performance
- ✅ Structured Streaming: Streams structured matches per file with headers and statistics
- ✅ Directory Traversal: Recursive scanning with symlink support
- ✅ Colorized Output: Customizable syntax highlighting (red, green, blue, bold)
- ✅ Search Statistics: Optional detailed metrics with the --stats flag
- ✅ Quality Assurance: Comprehensive test suite and optimized dependencies
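The recursive scanning listed above can be sketched with the standard library alone. This is a minimal illustration, not the project's crawler (the dependency list names walkdir for traversal): the function name collect_files is hypothetical, symlinks are followed through fs::metadata, and no symlink-cycle detection is attempted.

```rust
use std::fs;
use std::io;
use std::path::{Path, PathBuf};

/// Recursively collect regular files under `root` (hypothetical helper).
/// `fs::metadata` follows symlinks, so a link to a file counts as a file.
/// Assumption: no cycle detection; a production crawler should add it.
fn collect_files(root: &Path, out: &mut Vec<PathBuf>) -> io::Result<()> {
    let meta = fs::metadata(root)?;
    if meta.is_file() {
        out.push(root.to_path_buf());
        return Ok(());
    }
    if meta.is_dir() {
        for entry in fs::read_dir(root)? {
            let path = entry?.path();
            // Report unreadable entries and keep walking instead of aborting.
            if let Err(err) = collect_files(&path, out) {
                eprintln!("skipping {}: {}", path.display(), err);
            }
        }
    }
    Ok(())
}
```

Calling collect_files(Path::new("src"), &mut files) fills files with every regular file below src, which a search stage can then process in parallel.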
Quick Start
Installation
Install from crates.io (Recommended)
cargo install xerg
Build from Source
- Clone the repository:
- Build the project:
Development Setup
Use the included Makefile for common development tasks:
Usage
Using the installed binary:
# Basic search in current directory
xerg "use"
# Search with colored output, statistics, and specific path
xerg "use" src/ --color blue --stats
For development (from source):
# Basic search in current directory
cargo run -- "use"
# Search with colored output, statistics, and specific path
cargo run -- "use" src/ --color blue --stats
# Or use the built binary directly
./target/release/xerg "use"
Command-Line Options
| Option | Description | Example |
|---|---|---|
| pattern | Search pattern (required) | "use" |
| path | File or directory to search (optional, defaults to current directory) | src/ |
| --color <COLOR> | Highlight color: red, green, blue, bold | --color blue |
| --stats | Show detailed search statistics | --stats |
| --help | Display help information | --help |
| --version | Show version information | --version |
Search Statistics
[Example --stats output: a semicolon-delimited summary with the fields files, lines, matches, skipped, errors, and time]
Structured Result Format: Machine-readable summary with semicolon delimiters and millisecond-precision timing. Perfect for performance analysis and automated testing.
Metrics: files = processed files, lines = total lines read, matches = pattern occurrences, skipped = unreadable lines, errors = access failures, time = execution time
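To make the summary format concrete, here is a minimal sketch of how the six metrics could be rendered and read back by a test script. The struct name SearchStats, the key=value layout, and the seconds-based time field are assumptions for illustration; the exact output xerg prints may differ.

```rust
use std::fmt;
use std::time::Duration;

/// Hypothetical counters mirroring the six metrics described above.
#[derive(Debug, Default, Clone, PartialEq)]
struct SearchStats {
    files: usize,   // processed files
    lines: usize,   // total lines read
    matches: usize, // pattern occurrences
    skipped: usize, // unreadable lines
    errors: usize,  // access failures
    time: Duration, // execution time
}

impl fmt::Display for SearchStats {
    fn fmt(&self, f: &mut fmt::Formatter<'_>) -> fmt::Result {
        // Assumed layout: key=value pairs joined by "; ", time in seconds
        // printed with three decimals (millisecond precision).
        write!(
            f,
            "files={}; lines={}; matches={}; skipped={}; errors={}; time={:.3}s",
            self.files,
            self.lines,
            self.matches,
            self.skipped,
            self.errors,
            self.time.as_secs_f64()
        )
    }
}

/// Split a summary line back into (key, value) pairs, as a script might.
fn parse_summary(line: &str) -> Vec<(String, String)> {
    line.split(';')
        .filter_map(|field| field.trim().split_once('='))
        .map(|(key, value)| (key.to_string(), value.to_string()))
        .collect()
}
```

Because every field is delimited the same way, a benchmark or CI script can split on semicolons and compare individual counters without a dedicated parser.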
Architecture
The project follows a modular architecture with clear separation of concerns:
Core Modules
- main.rs: CLI entry point and argument parsing
- lib.rs: Core integration layer connecting all modules
- search.rs: Parallel file processing with Rayon
- crawler.rs: Directory traversal with symlink support
- highlighter.rs: Regex-based text highlighting
- colors.rs: ANSI color management
- result.rs: Message handling and statistics formatting
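As a rough picture of how the highlighting and color modules might cooperate, the sketch below maps the four documented color names to standard ANSI escape sequences and wraps each match in them. It is not the project's code: the names Color, highlight, and ANSI_RESET are hypothetical, and plain substring matching stands in for the regex engine to keep the example dependency-free.

```rust
/// Highlight colors named in the command-line options (hypothetical enum).
#[derive(Debug, Clone, Copy, PartialEq)]
enum Color {
    Red,
    Green,
    Blue,
    Bold,
}

impl Color {
    /// Parse a lowercase color name; unknown names yield None.
    fn parse(name: &str) -> Option<Color> {
        match name {
            "red" => Some(Color::Red),
            "green" => Some(Color::Green),
            "blue" => Some(Color::Blue),
            "bold" => Some(Color::Bold),
            _ => None,
        }
    }

    /// Standard ANSI SGR sequence that switches the style on.
    fn ansi_start(self) -> &'static str {
        match self {
            Color::Red => "\x1b[31m",
            Color::Green => "\x1b[32m",
            Color::Blue => "\x1b[34m",
            Color::Bold => "\x1b[1m",
        }
    }
}

/// ANSI sequence that resets all styles.
const ANSI_RESET: &str = "\x1b[0m";

/// Wrap every non-overlapping occurrence of `needle` in `line` with the
/// color's escape codes. Assumption: literal matching instead of regex.
fn highlight(line: &str, needle: &str, color: Color) -> String {
    if needle.is_empty() {
        return line.to_string();
    }
    let mut out = String::with_capacity(line.len());
    let mut last = 0;
    for (start, matched) in line.match_indices(needle) {
        out.push_str(&line[last..start]); // text before the match
        out.push_str(color.ansi_start());
        out.push_str(matched);
        out.push_str(ANSI_RESET);
        last = start + matched.len();
    }
    out.push_str(&line[last..]); // remaining tail
    out
}
```

Keeping the escape codes in one place means the matching logic never needs to know which color was requested.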
Dependencies
| Crate | Purpose |
|---|---|
| clap | CLI argument parsing |
| num_cpus | Thread optimization |
| rayon | Parallel processing |
| regex | Pattern matching |
| walkdir | Directory traversal |
Binary Size: 2.2 MB optimized release build with minimal feature flags enabled
Performance
Multi-core Processing: Uses one fewer thread than the number of available cores (cores - 1), leaving one core free so the system stays responsive
Memory Efficient: Line-by-line processing handles files of any size
Structured Streaming: Streams structured matches per file as processing completes
Optimized I/O: Buffered reading and compiled regex reuse
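The points above can be sketched with the standard library only. This is an illustration under stated assumptions, not the project's implementation: xerg uses Rayon and num_cpus, whereas this sketch swaps in std::thread::scope and available_parallelism, matches literal substrings instead of a compiled regex, and uses hypothetical function names throughout.

```rust
use std::io::{self, BufRead, BufReader, Read};
use std::thread;

/// One fewer worker than the available cores, but never zero.
fn worker_count(cores: usize) -> usize {
    cores.saturating_sub(1).max(1)
}

/// Ask the OS for the available parallelism and derive the worker count.
fn default_worker_count() -> usize {
    let cores = thread::available_parallelism()
        .map(|n| n.get())
        .unwrap_or(1);
    worker_count(cores)
}

/// Count matching lines in one input, reading line by line through a
/// buffer so the whole input is never held in memory at once.
/// An I/O or UTF-8 error aborts this input and is returned to the caller.
fn count_matches<R: Read>(input: R, needle: &str) -> io::Result<usize> {
    let mut count = 0;
    for line in BufReader::new(input).lines() {
        if line?.contains(needle) {
            count += 1;
        }
    }
    Ok(count)
}

/// Split the inputs into chunks and search each chunk on a scoped thread.
/// In-memory byte slices stand in for files to keep the sketch testable.
fn parallel_count(inputs: &[&[u8]], needle: &str, workers: usize) -> usize {
    if inputs.is_empty() {
        return 0;
    }
    let workers = workers.max(1);
    let chunk_size = (inputs.len() + workers - 1) / workers; // ceiling division
    thread::scope(|scope| {
        let handles: Vec<_> = inputs
            .chunks(chunk_size)
            .map(|chunk| {
                scope.spawn(move || {
                    chunk
                        .iter()
                        // Inputs that fail to read contribute zero matches.
                        .map(|bytes| count_matches(*bytes, needle).unwrap_or(0))
                        .sum::<usize>()
                })
            })
            .collect();
        handles
            .into_iter()
            .map(|handle| handle.join().expect("worker panicked"))
            .sum::<usize>()
    })
}
```

A work-stealing pool such as Rayon balances uneven file sizes better than the fixed chunks used here; the sketch only shows the shape of the idea.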
Planned Features
- Auto-color detection (--color=auto)
- Silent mode (-s, --silent)
- Case-insensitive search (-i, --ignore-case)
- Invert matching (-v, --invert-match)
- Multi-pattern support
- File type filtering
- Line number display (-n, --line-number)
Contributing
See CONTRIBUTING.md for guidelines on how to contribute to this project.
This is a learning-focused project demonstrating comprehensive Rust development practices. Contributions that enhance the educational value are especially welcome.
License
This project is open source and available under the MIT License.
Built during the Pragmatic AI Labs Rust Bootcamp