Extracts text from the RAW TEXT layer files, builds a nine-level block hierarchy (identity → layers → clusters → items → sentences → tokens → syllables → chars → bytes), and writes the binary output files (microscope.bin, data.bin, meta.bin, merkle.bin, embeddings.bin).
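
As a rough illustration of the level ordering described above, the hierarchy can be modelled as a plain enum ordered from coarsest to finest. This is only a sketch: the names `Level`, `ALL`, `depth`, and `finer` are hypothetical and do not come from the crate, and the zero-based depth numbering is an assumption, since the description does not say whether depths start at 0 or 1.

```rust
/// Hypothetical model of the nine levels named in the description,
/// ordered from coarsest (identity) to finest (bytes).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Level {
    Identity,
    Layers,
    Clusters,
    Items,
    Sentences,
    Tokens,
    Syllables,
    Chars,
    Bytes,
}

impl Level {
    /// All levels in hierarchy order.
    pub const ALL: [Level; 9] = [
        Level::Identity,
        Level::Layers,
        Level::Clusters,
        Level::Items,
        Level::Sentences,
        Level::Tokens,
        Level::Syllables,
        Level::Chars,
        Level::Bytes,
    ];

    /// Zero-based depth of this level (assumed numbering).
    pub fn depth(self) -> usize {
        self as usize
    }

    /// The next finer level, or `None` at the bottom of the hierarchy.
    pub fn finer(self) -> Option<Level> {
        Level::ALL.get(self.depth() + 1).copied()
    }
}
```

Under these assumptions, `Level::Identity.depth()` is 0 and `Level::Bytes.finer()` is `None`, which makes walking the hierarchy from top to bottom a simple loop over `Level::ALL`.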