Crate ironcore_search_helpers

Functions

Make an index for the string s, considering all tri-grams. The string will be latinised, lowercased and stripped of special characters before being broken into tri-grams. Each tri-gram will be prefixed with partition_id and salt before being hashed. Each entry in the HashSet will be truncated to 32 bits and encoded as a big-endian number. If the string is longer than 200 characters, this will return an error.
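The pipeline described above (normalise, split into tri-grams, prefix, hash, truncate) can be sketched roughly as follows. This is an illustration only and not the crate's implementation: the names `sketch_index`, `normalize` and `trigrams` are hypothetical, the standard library's `DefaultHasher` stands in for whatever hash the crate actually uses, latinisation is omitted, and sliding the tri-gram window over the whole normalised string (spaces included) is an assumption.

```rust
use std::collections::hash_map::DefaultHasher;
use std::collections::HashSet;
use std::hash::{Hash, Hasher};

// Length limit taken from the description above.
const MAX_LEN: usize = 200;

// Hypothetical helper: lowercase and drop everything except letters,
// digits and whitespace. Latinisation is NOT performed in this sketch.
fn normalize(s: &str) -> String {
    s.chars()
        .filter(|c| c.is_alphanumeric() || c.is_whitespace())
        .flat_map(|c| c.to_lowercase())
        .collect()
}

// Hypothetical helper: every window of three consecutive characters.
fn trigrams(s: &str) -> HashSet<String> {
    let chars: Vec<char> = s.chars().collect();
    chars
        .windows(3)
        .map(|w| w.iter().collect::<String>())
        .collect()
}

// Sketch of the indexing step. DefaultHasher is a stand-in (assumption),
// not the hash function used by the crate.
fn sketch_index(
    s: &str,
    partition_id: Option<&str>,
    salt: &[u8],
) -> Result<HashSet<u32>, String> {
    if s.chars().count() > MAX_LEN {
        return Err(format!("input is longer than {} characters", MAX_LEN));
    }
    let normalized = normalize(s);
    Ok(trigrams(&normalized)
        .iter()
        .map(|t| {
            let mut h = DefaultHasher::new();
            // Prefix each tri-gram with partition_id and salt before hashing.
            partition_id.hash(&mut h);
            salt.hash(&mut h);
            t.hash(&mut h);
            // Truncate to the first 32 bits, read as a big-endian number.
            let bytes = h.finish().to_be_bytes();
            u32::from_be_bytes([bytes[0], bytes[1], bytes[2], bytes[3]])
        })
        .collect())
}
```

Because normalisation happens before hashing, `"Hello, World"` and `"hello world"` produce the same set under the same partition_id and salt, while a different salt yields unrelated entries.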

Make an index for the string s, considering all tri-grams. The string will be latinised, lowercased and stripped of special characters before being broken into tri-grams. Each tri-gram will be prefixed with partition_id and salt before being hashed. Each entry in the HashSet will be truncated to 32 bits and encoded as a big-endian number. This function will also add random entries to the HashSet so that the size of the set does not reveal how many tri-grams were actually found.
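The padding idea can be illustrated with a small sketch. The function `pad_index`, the fixed `target_len` policy and the caller-supplied random source are all assumptions made for this example; the text above only says that some random entries are added, not how many or from which generator. A real implementation would want a cryptographically secure random source.

```rust
use std::collections::HashSet;

// Hypothetical padding step: keep inserting random 32-bit values until the
// set reaches `target_len`. Sets already at or above that size are returned
// unchanged. `next_random` is supplied by the caller.
fn pad_index(
    mut index: HashSet<u32>,
    target_len: usize,
    mut next_random: impl FnMut() -> u32,
) -> HashSet<u32> {
    while index.len() < target_len {
        // Duplicates are absorbed by the set, so the loop runs until
        // enough distinct values have been added.
        index.insert(next_random());
    }
    index
}
```

All real entries remain in the padded set, so lookups for genuine tri-grams still match; an observer only sees the padded size.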

Generate a version of the input string in which each character has been latinised using the same function as our tokenization routines.
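To give a feel for what latinising a string means, here is a toy sketch. The function `latinize_sketch` and its tiny mapping table are hypothetical and cover only a handful of characters; the crate's actual transliteration routine is not shown here and will handle far more input.

```rust
// Toy transliteration: map a few non-ASCII Latin characters to ASCII
// equivalents and pass everything else through unchanged.
fn latinize_sketch(s: &str) -> String {
    s.chars()
        .map(|c| match c {
            // e with acute, grave, circumflex, diaeresis
            '\u{00E9}' | '\u{00E8}' | '\u{00EA}' | '\u{00EB}' => "e".to_string(),
            // u with diaeresis
            '\u{00FC}' => "u".to_string(),
            // sharp s expands to two characters
            '\u{00DF}' => "ss".to_string(),
            // n with tilde
            '\u{00F1}' => "n".to_string(),
            other => other.to_string(),
        })
        .collect()
}
```

Applying the same transliteration to both the indexed text and the search query is what lets a query typed without accents match accented source text.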