Skip to main content

Module normalize

Module normalize 

Source

Functions§

extract_surname_token
Return the last whitespace-delimited token of a name, stripping tussenvoegsel from single-field full-name representations.
normalize_digits_only
normalize_plate
Strip hyphens and spaces from a license plate and uppercase it. “25-XKL-9” becomes “25XKL9”.
normalize_text
strip_tussenvoegsel
Strip common Dutch tussenvoegsel prefixes so that “VAN DEN BERG” and “BERG” produce the same phonetic key.
transliterate_and_normalize
Transliterate non-Latin Unicode to ASCII via any_ascii, then apply standard normalization (NFKD diacritic stripping + uppercase + collapse whitespace). Useful for Arabic/Cyrillic name input before phonetic encoding.