Skip to main content

Module data

Module data 

Source

Constants§

KNOWN_DOMAINS
Auto-generated known domains list.
KNOWN_DOMAIN_COUNT
OCR_CONFUSIONS
Auto-generated OCR confusion table. Sorted by confusable length descending for longest-match-first normalization.
OCR_CONFUSION_COUNT
POPULAR_REPOS
Auto-generated popular repos list.
PUBLIC_SUFFIXES
Auto-generated public suffix list.
PUBLIC_SUFFIX_COUNT

Functions§

is_known_domain
Check if a domain is in the known high-value targets list.
is_popular_repo
Check if a repo (owner/name) is in the popular repos list.
is_public_suffix
Check if a suffix is in the public suffix list.
known_domains
Get all known domains for confusable checking.
ocr_confusions
Get the OCR confusion table for confusable domain normalization.
registrable_domain
Extract the registrable domain (eTLD+1) from a hostname. Returns None if the entire hostname is a public suffix or has no suffix match.