File filtering using layered blocklist/allowlist/heuristics.
Determines which files should be processed for codemap extraction.
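As a rough illustration of what "layered" filtering can look like, the sketch below checks a blocklist first, then an allowlist, then a content heuristic, and returns at the first layer that rejects. Everything in it is an assumption made for illustration: the `Decision` type, the `decide` function, the extension lists, and the rejection strings are hypothetical and are not this module's actual API.

```rust
use std::path::Path;

/// Hypothetical outcome type; the module's real `FilterResult` may look different.
#[derive(Debug, PartialEq)]
enum Decision {
    Accept,
    Reject(&'static str),
}

// Assumed lists, for illustration only.
const BLOCKED_EXTENSIONS: &[&str] = &["png", "lock", "exe"];
const ALLOWED_EXTENSIONS: &[&str] = &["rs", "py", "ts"];

/// Hypothetical layered check: blocklist, then allowlist, then a content heuristic.
fn decide(path: &Path, content: &[u8]) -> Decision {
    // Lowercased extension, or an empty string when the path has none.
    let ext = path
        .extension()
        .and_then(|e| e.to_str())
        .map(|e| e.to_ascii_lowercase())
        .unwrap_or_default();

    // Layer 1: cheap rejection of known-bad extensions.
    if BLOCKED_EXTENSIONS.iter().any(|&b| b == ext.as_str()) {
        return Decision::Reject("blocklisted extension");
    }
    // Layer 2: only extensions on the allowlist continue.
    if !ALLOWED_EXTENSIONS.iter().any(|&a| a == ext.as_str()) {
        return Decision::Reject("unsupported extension");
    }
    // Layer 3: content heuristic, here a null-byte check for binary data.
    if content.contains(&0u8) {
        return Decision::Reject("binary content");
    }
    Decision::Accept
}
```

Ordering the layers from cheapest to most expensive means most files are rejected on their path alone, before any content is inspected.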
Enums§
- FilterError - Errors that can occur during file filtering.
- FilterResult - Result of filtering a file.
- Language - Supported programming languages for codemap extraction.
- RejectReason - Reason why a file was rejected.
Functions§
- detect_language - Detect language from file path based on extension.
- is_binary - Check if content contains binary data (null bytes).
- is_blocklisted - Check if an extension is blocklisted.
- is_blocklisted_filename - Check if a filename is blocklisted.
- is_generated - Check if content appears to be generated.
- is_minified - Check if content appears to be minified (very long lines). Uses early exit: returns as soon as a long line is found.
- passes_extension_filter - Check if a path passes basic extension filtering (no content check).
- should_process - Determine if a file should be processed for codemap extraction.
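The two content heuristics described above (null bytes for binary data, very long lines for minified code) can be sketched as follows. This is a minimal sketch, not the module's implementation: the function names `looks_binary` and `looks_minified`, their signatures, and the `MAX_LINE_LEN` threshold are assumptions, since the listing does not state the real signatures or limits.

```rust
/// Assumed threshold; the module's actual line-length limit is not documented here.
const MAX_LINE_LEN: usize = 1_000;

/// Hypothetical binary check: treats any null byte as a sign of binary data.
fn looks_binary(content: &[u8]) -> bool {
    content.contains(&0)
}

/// Hypothetical minification check with early exit:
/// `Iterator::any` stops scanning at the first line over the threshold.
fn looks_minified(content: &str) -> bool {
    content.lines().any(|line| line.len() > MAX_LINE_LEN)
}
```

Because `any` short-circuits, a file whose first line is already too long costs only a single line scan, which matches the early-exit behavior described for `is_minified`.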