Structs§
Functions§
- aggregate_
by_ language - build_
copy_ output - Step 2: Build final output from relative paths and content collected in step 1. Format:
- collect_
filtered_ texts - Step 1: Extract filtered relative file paths and their UTF-8 content.
Returns
(relative_path, content)for each file, sorted by path. - count_
non_ empty_ lines - count_
tokens_ in_ path - count_
tokens_ in_ path_ with_ progress - Like
count_tokens_in_path, but reports progress via the provided callback. The callback receives(processed_files, total_files). - count_
tokens_ in_ text - get_
encoder - language_
from_ path