mimirs-eval: EvolMem-inspired memory evaluation benchmark for MimirsWell
Implements systematic evaluation of memory quality across 7 cognitive dimensions from the EvolMem framework (Shen et al., 2026, arXiv:2601.03543):
Declarative Memory:
- Retrieval — accuracy of recalling relevant information
- Summarization — quality of memory consolidation/abstraction
- Isolation — prevention of cross-source memory interference
- Inference — reasoning from stored facts
- Reproduction — faithful reconstruction of past interactions
Non-Declarative Memory:
- Learning — acquisition of operational rules from experience
- Habituation — stability of automatic memory patterns across sessions
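The grouping of the seven dimensions can be sketched as a plain Rust enum. This is only an illustration: `Dimension`, `MemoryKind`, `ALL`, and `kind` are hypothetical names chosen for this sketch and are not taken from the crate's public API.

```rust
/// Hypothetical enum of the seven EvolMem dimensions (not the crate's own type).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Dimension {
    Retrieval,
    Summarization,
    Isolation,
    Inference,
    Reproduction,
    Learning,
    Habituation,
}

/// Hypothetical tag for the two memory families listed above.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum MemoryKind {
    Declarative,
    NonDeclarative,
}

impl Dimension {
    /// All seven dimensions, in the order they are listed above.
    pub const ALL: [Dimension; 7] = [
        Dimension::Retrieval,
        Dimension::Summarization,
        Dimension::Isolation,
        Dimension::Inference,
        Dimension::Reproduction,
        Dimension::Learning,
        Dimension::Habituation,
    ];

    /// Maps a dimension to its memory family.
    pub fn kind(self) -> MemoryKind {
        match self {
            Dimension::Learning | Dimension::Habituation => MemoryKind::NonDeclarative,
            _ => MemoryKind::Declarative,
        }
    }
}
```

Iterating over `Dimension::ALL` and filtering on `kind()` yields five declarative and two non-declarative dimensions, matching the list above.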
Structs§
- DimensionScore - Score for a single evaluation dimension (0.0 to 1.0).
- EvalConfig - Configuration for memory evaluation.
- EvalEngine - Main evaluation engine for memory quality assessment.
- EvalReport - Complete evaluation report across all 7 dimensions.
- EvalWeights - Weights for each evaluation dimension in the overall score.
Enums§
- EvalError - Errors that can occur during memory evaluation.