Expand description
Utility calculation algorithms for episode ranking
Ported from MemRL with adaptations for Smelt’s needs.
Structs§
- Decay
Params - Parameters for exponential decay
- Propagation
Result - Result of a propagation run
- Utility
Ranker - Utility-based ranker for episodes
Functions§
- apply_
decay - Apply exponential decay to a utility value
- bellman_
propagate - Run Bellman propagation to spread utility through the memory
- temporal_
credit_ assignment - Run temporal credit assignment
- wilson_
score - Calculate Wilson score lower bound