Constants§
- KNOWN_DOMAINS - Auto-generated known domains list.
- KNOWN_DOMAIN_COUNT
- OCR_CONFUSIONS - Auto-generated OCR confusion table. Sorted by confusable length descending for longest-match-first normalization.
- OCR_CONFUSION_COUNT
- POPULAR_REPOS - Auto-generated popular repos list.
- PUBLIC_SUFFIXES - Auto-generated public suffix list.
- PUBLIC_SUFFIX_COUNT
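
To illustrate why a confusion table sorted by confusable length descending gives longest-match-first normalization, here is a minimal sketch. The table contents, the `(confusable, canonical)` pair layout, and the `normalize` function are all assumptions made for this example; the actual element type of OCR_CONFUSIONS is not shown on this page.

```rust
/// Hypothetical stand-in for an OCR confusion table: (confusable, canonical)
/// pairs, sorted by confusable length descending. Not the crate's real data.
const SAMPLE_CONFUSIONS: &[(&str, &str)] = &[
    ("rn", "m"),
    ("vv", "w"),
    ("0", "o"),
    ("1", "l"),
];

/// Scan the input left to right. At each position, try the table entries in
/// order; because longer confusables come first, the first hit is the longest
/// match. Unmatched characters are copied through unchanged.
fn normalize(input: &str, table: &[(&str, &str)]) -> String {
    let mut out = String::with_capacity(input.len());
    let mut rest = input;
    'outer: while !rest.is_empty() {
        for (confusable, canonical) in table {
            // Skip empty entries so the scan always makes progress.
            if !confusable.is_empty() && rest.starts_with(*confusable) {
                out.push_str(canonical);
                rest = &rest[confusable.len()..];
                continue 'outer;
            }
        }
        // No table entry matched here: copy one character and move on.
        if let Some(ch) = rest.chars().next() {
            out.push(ch);
            rest = &rest[ch.len_utf8()..];
        }
    }
    out
}

fn main() {
    // "rn" is matched as a unit before any single-character entry is tried.
    println!("{}", normalize("exarnple.com", SAMPLE_CONFUSIONS)); // prints "example.com"
}
```

If the table were not sorted longest-first, a shorter entry could consume the start of a longer confusable and the longer rule would never fire.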
Functions§
- is_known_domain - Check if a domain is in the known high-value targets list.
- is_popular_repo - Check if a repo (owner/name) is in the popular repos list.
- is_public_suffix - Check if a suffix is in the public suffix list.
- known_domains - Get all known domains for confusable checking.
- ocr_confusions - Get the OCR confusion table for confusable domain normalization.
- registrable_domain - Extract the registrable domain (eTLD+1) from a hostname. Returns None if the entire hostname is a public suffix or has no suffix match.
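
The eTLD+1 behavior described for registrable_domain can be sketched as follows. This is a self-contained illustration, not the crate's implementation: the suffix set, `is_sample_suffix`, and `registrable_domain_sketch` are hypothetical names, the real function signatures are not shown on this page, and the sketch assumes a lowercase hostname and ignores the wildcard and exception rules found in the full public suffix list.

```rust
/// Hypothetical, tiny stand-in for a public suffix list.
const SAMPLE_SUFFIXES: &[&str] = &["co.uk", "com", "github.io", "io", "uk"];

/// Stand-in membership check against the sample list above.
fn is_sample_suffix(s: &str) -> bool {
    SAMPLE_SUFFIXES.iter().any(|&x| x == s)
}

/// Return the registrable domain (public suffix plus one label), or None if
/// the whole hostname is a public suffix or no suffix matches.
fn registrable_domain_sketch(hostname: &str) -> Option<&str> {
    let mut start = 0; // start of the current candidate suffix
    let mut prev_start: Option<usize> = None; // start of the candidate one label longer
    loop {
        let candidate = &hostname[start..];
        if is_sample_suffix(candidate) {
            // Candidates shrink from the left, so the first hit is the longest
            // matching suffix. The registrable domain is one label longer; if
            // there is no longer candidate, the hostname itself is a suffix.
            return prev_start.map(|p| &hostname[p..]);
        }
        match candidate.find('.') {
            Some(dot) => {
                prev_start = Some(start);
                start += dot + 1; // drop the leftmost label and retry
            }
            None => return None, // ran out of labels without a suffix match
        }
    }
}

fn main() {
    println!("{:?}", registrable_domain_sketch("www.example.co.uk")); // prints Some("example.co.uk")
    println!("{:?}", registrable_domain_sketch("co.uk")); // prints None
    println!("{:?}", registrable_domain_sketch("localhost")); // prints None
}
```

Checking candidates from longest to shortest matters for multi-label suffixes: "www.example.co.uk" has to match "co.uk" before "uk", otherwise the result would be "co.uk" rather than "example.co.uk".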