Skip to main content

Module dedup

Module dedup 

Source
Expand description

Deduplication of extracted nodes and edges.

After per-file extraction, duplicate nodes (same ID) and edges (same source + target + relation triple) are removed to produce a clean graph.

Functions§

dedup_file
Deduplicate nodes within a single file’s ExtractionResult.
dedup_results
Merge multiple ExtractionResults into one, deduplicating across all of them.