stylometry-analyzer 0.1.1

Minimal CLI tool that combines one or more `.txt` files, extracts user-authored text, and enforces a minimum size. Hash-embeds text chunks and queries a local vector DB to classify writing style. Not semantic/tone based search for now. Prints results to stdout and writes `results.json`.
Documentation
# Stylometry Analyzer

[![Rust](https://img.shields.io/badge/Rust-1.70%2B-blue.svg)](https://www.rust-lang.org)
[![License](https://img.shields.io/badge/License-MIT-green.svg)](LICENSE)
[![CI](https://github.com/thegreatbey/stylometry/actions/workflows/ci.yml/badge.svg)](https://github.com/thegreatbey/stylometry/actions/workflows/ci.yml)
[![Crates.io](https://img.shields.io/crates/v/stylometry-analyzer.svg)](https://crates.io/crates/stylometry-analyzer)
[![Docs.rs](https://img.shields.io/docsrs/stylometry-analyzer)](https://docs.rs/stylometry-analyzer)

Minimal CLI tool that:
- Combines one or more `.txt` files, extracts user-authored text, and enforces a minimum size.
- Hash-embeds text chunks and queries a local vector DB to classify writing style.
- Prints results to stdout and writes `results.json`.

## Prerequisites
- Rust toolchain (stable)
- **yvdb** running locally at `http://127.0.0.1:8080`

## Usage
```
cargo run -- --file path\to\file1.txt path\to\file2.txt
```
Spaces in paths are fine if quoted.

## Notes
- stylometry analyzer requires a minimum of 70kb text.  Ideally I'd like to up this to 750kb.
- multiple text files can be used and minimal delimiters are used in main.rs to slice out the text so user's voice is remaining alone...mitigate noise, basically.
- On first run, the tool seeds reference embeddings into yvdb; later runs skip seeding.
- Progress prints show the current step and pre-seed status.