koharu-models
Model wrappers and CLI tools for the Koharu app. Each module lazily downloads its weights from Hugging Face via koharu-core::download::hf_hub and runs on ONNX Runtime or candle.
Modules
comic_text_detector: ONNX model that finds speech bubbles/text blocks and returns bounding boxes plus a segmentation mask.manga_ocr: encoder/decoder OCR pipeline that reads cropped text regions.lama: LaMa inpainting with tiled blending to remove text using a mask.llm: quantized GGUF loader (Llama or Qwen2) using candle with chat-style prompting and generation controls.
CLI tools
Feature cuda enables the CUDA execution provider for ONNX Runtime and candle; without it the models fall back to CPU.
Licensed under Apache-2.0 (../LICENSE-APACHE).