Expand description
Gleaning-based entity extraction module Gleaning-based entity extraction with TRUE LLM inference
This module implements iterative gleaning refinement using actual LLM calls, not pattern matching. Based on Microsoft GraphRAG and LightRAG research.
Expected performance: 15-30 seconds per chunk per round. For a 1000-page book with 4 gleaning rounds, expect 2-4 hours of processing time.
Structsยง
- Extraction
Completion Status - Status of entity extraction completion
- Gleaning
Config - Configuration for gleaning-based entity extraction
- Gleaning
Entity Extractor - Entity extractor with iterative gleaning refinement using TRUE LLM calls
- Gleaning
Statistics - Statistics for gleaning extraction process