Data Matching Module
High-performance data matching, CPF/CNPJ validation, and cross-source consolidation for Brazilian data sources.
Features
- CPF Matcher: Normalize and validate Brazilian CPF numbers
- CNPJ Matcher: Normalize and validate Brazilian CNPJ numbers
- Name Matcher: Fuzzy matching with Brazilian name conventions
- Data Matcher: Cross-source entity resolution and consolidation
- Data Pipeline: Async processing with LRU caching
- Parallel Pipeline: High-throughput concurrent processing with DashMap
- Metrics: Observability counters and exponential moving average (EMA) processing times
- SQL Extractor: PostgreSQL data extraction with dynamic schema (requires the postgres feature)
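To illustrate what the CPF and CNPJ matchers have to do, here is a minimal sketch of the standard mod-11 check-digit validation for both identifiers. It is not this module's implementation: the function names (normalize_digits, check_digit, is_valid_cpf, is_valid_cnpj) are hypothetical and chosen only for this example, and the sketch assumes the purely numeric CNPJ format.

```rust
// Illustrative sketch only: these helpers are hypothetical and are not
// part of this module's public API.

/// Keep only the decimal digits, dropping punctuation such as '.', '-' and '/'.
/// Note: any other non-digit character is silently dropped as well.
fn normalize_digits(input: &str) -> Vec<u32> {
    input.chars().filter_map(|c| c.to_digit(10)).collect()
}

/// Mod-11 check digit: weighted sum of the digits, then
/// remainder 0 or 1 maps to 0, anything else maps to 11 - remainder.
fn check_digit(digits: &[u32], weights: &[u32]) -> u32 {
    let sum: u32 = digits.iter().zip(weights).map(|(d, w)| d * w).sum();
    match sum % 11 {
        0 | 1 => 0,
        r => 11 - r,
    }
}

/// True when every digit equals the first one (e.g. 111.111.111-11).
/// Such sequences pass the arithmetic check but are not valid identifiers.
fn all_same(digits: &[u32]) -> bool {
    digits.iter().all(|&d| d == digits[0])
}

/// Validate a CPF (11 digits, the last two are check digits).
pub fn is_valid_cpf(input: &str) -> bool {
    let d = normalize_digits(input);
    if d.len() != 11 || all_same(&d) {
        return false;
    }
    const W1: [u32; 9] = [10, 9, 8, 7, 6, 5, 4, 3, 2];
    const W2: [u32; 10] = [11, 10, 9, 8, 7, 6, 5, 4, 3, 2];
    check_digit(&d[..9], &W1) == d[9] && check_digit(&d[..10], &W2) == d[10]
}

/// Validate a numeric CNPJ (14 digits, the last two are check digits).
pub fn is_valid_cnpj(input: &str) -> bool {
    let d = normalize_digits(input);
    if d.len() != 14 || all_same(&d) {
        return false;
    }
    const W1: [u32; 12] = [5, 4, 3, 2, 9, 8, 7, 6, 5, 4, 3, 2];
    const W2: [u32; 13] = [6, 5, 4, 3, 2, 9, 8, 7, 6, 5, 4, 3, 2];
    check_digit(&d[..12], &W1) == d[12] && check_digit(&d[..13], &W2) == d[13]
}
```

With these helpers, is_valid_cpf accepts both "529.982.247-25" and "52998224725", since normalization strips the punctuation before the check digits are computed. Rejecting repeated-digit sequences is a separate rule layered on top of the arithmetic check.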
Example
use ;
let matcher = new;
let results = matcher.match_across_sources;
SQL Extraction (with postgres feature)
use ;
let config = new;
let pool = config.create_pool.await?;
let extractor = new
.with_query;
let data_source = extractor.extract.await?;