Expand description
Graph Weight Optimization (Simplified DW-GRPO)
This module implements a simplified version of Dynamic Weighted Group Relative Policy Optimization (DW-GRPO) for optimizing relationship weights in the knowledge graph.
Key features:
- Heuristic-based optimization (not full reinforcement learning)
- Gradient-free hill climbing for weight adjustment
- Multi-objective optimization (relevance, faithfulness, conciseness)
- Stagnation detection and dynamic weight adjustment
- Performance tracking across iterations
Structsยง
- Graph
Weight Optimizer - Graph weight optimizer using simplified DW-GRPO approach
- Objective
Weights - Weights for combining multiple objectives
- Optimization
Step - A single optimization iteration with metrics
- Optimizer
Config - Configuration for the optimizer
- Test
Query - Test query with expected answer for evaluation