Expand description
Copyright detection module.
Detects copyright statements, holder names, and author information from source code files using a four-stage pipeline:
- Text preparation (normalization)
- Candidate line selection
- Lexing (POS tagging) and parsing (grammar rules)
- Refinement and junk filtering
Modules§
Structs§
Functions§
- detect_
copyrights - detect_
credits_ authors - Detect authors from a CREDITS-formatted file’s content.
- is_
credits_ file - Check if a file path is a CREDITS/AUTHORS file by its filename.
- refine_
copyright - Refine a detected copyright string. Returns
Noneif the result is empty.