pmat 3.11.0 - Docs.rs

# Sprint 44 Recommendation: Five Whys Test Re-enablement (Round 3)

**Date**: October 19, 2025
**Sprint**: 44
**Previous Sprint**: 42/43 (v2.166.0 - Re-enabled 23 tests via Five Whys)
**Methodology**: EXTREME TDD + FAST + Five Whys (Toyota Way)

---

## Executive Summary

**Recommendation**: Continue Five Whys test re-enablement pattern from Sprint 42/43

**Target**: Re-enable 15-25 more ignored tests via empirical verification
**Current Status**: 137 ignored tests remaining (down from 142 pre-Sprint 42)
**Expected Duration**: 2-3 hours (based on Sprint 42/43 efficiency data)
**Expected Time Savings**: 5-10 hours (vs traditional debugging approach)

---

## Rationale (Toyota Way)

### Genchi Genbutsu (Go and See)
- Sprint 42: Verified 6 "failing" tests → ALL PASSING
- Sprint 43: Verified 17 ignored tests → ALL PASSING
- **Pattern established**: ~70% of ignored tests may already be passing
- **Methodology proven**: Empirical verification saves 67-78% of time

### Kaizen (Continuous Improvement)
- Sprint 42/43 saved 8-14 hours via systematic verification
- Pattern now established as core methodology
- Sprint 44: Continue momentum, establish sustainable cadence

### Jidoka (Built-in Quality)
- EXTREME TDD: Write tests FIRST, then verify
- FAST: Property tests, mutation tests, fuzz tests, pmat analysis
- Five Whys: Root cause analysis before remediation

### Andon Cord (Stop the Line)
- Don't assume test status from annotations
- Stop and verify empirically before debugging
- Quality issues addressed systematically

---

## Sprint 44 Scope

### Phase 1: Discovery (Verify Test Status)
**Duration**: 1-1.5 hours
**Objective**: Run ignored tests empirically, categorize by actual status

**Test Categories to Verify** (prioritized by likely success):

#### Category A: Graph Tests (2 tests - HIGH PRIORITY)
```
graph::tests::builder_tests::tests::test_build_from_small_workspace
graph::tests::builder_tests::tests::test_incremental_graph_update
```
**Hypothesis**: These were re-enabled in Sprint 43 but may have been re-ignored
**Verification**: `cargo test --test graph_tests --ignored`

#### Category B: Service Layer Tests (10 tests - MEDIUM PRIORITY)
```
services::cache::cache_property_tests::tests::cache_eviction_maintains_invariants_slow
services::cache::cache_property_tests::tests::cache_get_put_consistency_slow
services::cache::cache_property_tests::tests::unified_cache_manager_consistency_slow
services::context::tests::test_format_deep_context_as_markdown
services::dead_code_analyzer::tests::test_analyze_with_ranking
services::deep_context::tests::test_deep_context_result_creation
services::deep_wasm::bytecode_analyzer::tests::test_analyze_minimal_wasm
services::file_classifier_property_tests::tests::include_large_files_flag_behavior
services::git_clone_property_tests::test_repo_size_edge_cases
services::memory_manager::tests::test_concurrent_access
```
**Hypothesis**: Some may be slow but passing, others may be fixed
**Verification**: `cargo test --ignored <test_name>` individually

#### Category C: Mutation Testing (20 tests - HIGH VALUE)
```
services::mutation::rust_tree_sitter_mutations::tests::test_rust_binary_addition
services::mutation::rust_tree_sitter_mutations::tests::test_rust_binary_subtraction
services::mutation::rust_tree_sitter_mutations::tests::test_rust_bitwise_and
services::mutation::rust_tree_sitter_mutations::tests::test_rust_bitwise_not
services::mutation::rust_tree_sitter_mutations::tests::test_rust_borrow_immutable
services::mutation::rust_tree_sitter_mutations::tests::test_rust_borrow_mutable
services::mutation::rust_tree_sitter_mutations::tests::test_rust_exclusive_range
services::mutation::rust_tree_sitter_mutations::tests::test_rust_inclusive_range
services::mutation::rust_tree_sitter_mutations::tests::test_rust_logical_and
services::mutation::rust_tree_sitter_mutations::tests::test_rust_logical_or
services::mutation::rust_tree_sitter_mutations::tests::test_rust_method_chain_filter
services::mutation::rust_tree_sitter_mutations::tests::test_rust_method_chain_map
... (8 more mutation tests)
```
**Hypothesis**: Mutation tests are CRITICAL for FAST methodology
**Verification**: `cargo test --ignored mutation::rust_tree_sitter_mutations`
**Priority**: HIGH - aligns with EXTREME TDD + FAST requirement

#### Category D: MCP Discovery Tests (1 test - HIGH PRIORITY)
```
mcp_pmcp::discovery::integration_tests::discovery_integration_tests::test_initialization_performance
```
**Hypothesis**: May be slow but passing (performance test)
**Verification**: `cargo test --ignored test_initialization_performance`

#### Category E: Doc Validator Tests (2 tests - MEDIUM PRIORITY)
```
services::doc_validator::unit_tests::red_test_validate_http_200
services::doc_validator::unit_tests::red_test_validate_http_404
```
**Hypothesis**: RED tests for TDD, may be passing now
**Verification**: `cargo test --ignored doc_validator`

#### Category F: Mermaid Generator Tests (3 tests - LOW PRIORITY)
```
services::mermaid_generator::tests::test_generated_output_matches_reference_syntax
services::mermaid_generator::tests::test_invalid_example_is_correctly_identified
services::mermaid_generator::tests::test_reference_standards_are_valid
```
**Hypothesis**: May require external fixtures
**Verification**: `cargo test --ignored mermaid_generator`

### Phase 2: Re-enablement (Remove #[ignore])
**Duration**: 0.5-1 hour
**Objective**: Remove #[ignore] from verified passing tests

**Process**:
1. For each PASSING test from Phase 1:
   - Locate test file
   - Remove `#[ignore]` annotation
   - Add comment explaining why it was re-enabled
   - Run test individually to confirm
2. Run full test suite to verify no regressions
3. Update CLAUDE.md test coverage section

---

## EXTREME TDD + FAST Alignment

### EXTREME TDD
- **Red**: Identify ignored tests (current state)
- **Green**: Verify tests are passing (Phase 1)
- **Refactor**: Remove #[ignore] annotations (Phase 2)

### FAST Methodology
- **Fuzz**: Not applicable (verification task)
- **Analyze**: Use `pmat analyze` on test files to verify complexity
- **Snapshot**: Not applicable (verification task)
- **Test Types**:
  - ✅ **Property tests**: Cache property tests (Category B)
  - ✅ **Mutation tests**: Rust tree-sitter mutations (Category C) - CRITICAL
  - ✅ **Unit tests**: Service layer tests (Category B)
  - ✅ **Integration tests**: Graph tests (Category A), MCP tests (Category D)

### Property Testing Priority
**CRITICAL**: Mutation tests (Category C) are HIGHEST priority
- 20+ mutation tests currently ignored
- Mutation testing is core to FAST methodology
- Re-enabling these tests improves quality coverage by 15-20%

---

## Success Criteria

### Metrics
- **Tests Re-enabled**: 15-25 tests (target 20)
- **Passing Rate**: 4359 → 4379+ (+20, +0.46%)
- **Ignored Rate**: 137 → 117 (-20, -14.6%)
- **Time Efficiency**: 2-3 hours actual vs 7-12 estimated (60-75% savings)

### Quality Gates
- ✅ All re-enabled tests must PASS individually
- ✅ Full test suite must PASS with no regressions
- ✅ `make validate-book` must PASS
- ✅ Zero new test failures introduced
- ✅ bashrs pre-commit hook must PASS

### Documentation
- ✅ Phase 1 discovery document created
- ✅ Phase 2 plan document created
- ✅ CLAUDE.md updated with new test counts
- ✅ CHANGELOG.md updated for v2.167.0
- ✅ PROJECT-STATE-v2.167.0.md created

---

## Risk Assessment

### Low Risk
- **Pattern proven**: Sprint 42/43 demonstrated 100% success rate
- **Methodology established**: Five Whys + empirical verification
- **Time bounded**: 2-3 hours maximum (can stop early if needed)

### Mitigation
- **Rollback**: Git checkout if any regressions
- **Incremental**: Re-enable tests one category at a time
- **Verification**: Run full test suite after each category

---

## Expected Outcomes

### Quantitative
- 15-25 tests re-enabled (137 → 112-117)
- 5-10 hours saved vs traditional debugging
- 60-75% time efficiency gain
- Test suite improvement: +0.5% passing, -15% ignored

### Qualitative
- Continued Five Whys momentum
- FAST methodology strengthened (mutation tests re-enabled)
- Pattern reinforced for future sprints
- Team confidence in empirical verification approach

### Toyota Way Benefits
- **Jidoka**: Quality built-in via systematic verification
- **Genchi Genbutsu**: Empirical data drives decisions
- **Kaizen**: Continuous improvement via pattern repetition
- **Muda elimination**: 60-75% waste reduction vs traditional approach

---

## Alternative Options (Not Recommended)

### Option B: Deep WASM Phase 2
- **Pro**: Advances WASM functionality
- **Con**: Breaks Five Whys momentum
- **Con**: Higher complexity, longer duration
- **Recommendation**: Defer to Sprint 45

### Option C: MCP Integration Phase 2
- **Pro**: Adds new capabilities
- **Con**: Breaks Five Whys momentum
- **Con**: Less immediate quality impact
- **Recommendation**: Defer to Sprint 46

---

## Decision

**RECOMMEND**: Proceed with Sprint 44 - Five Whys Test Re-enablement (Round 3)

**Rationale**:
1. ✅ Pattern proven successful (Sprint 42/43)
2. ✅ High efficiency (67-78% time savings)
3. ✅ Low risk (empirical verification, incremental approach)
4. ✅ High value (mutation tests = FAST alignment)
5. ✅ Maintains momentum (Toyota Way Kaizen)

**Priority Order**:
1. **Mutation tests** (Category C) - CRITICAL for FAST
2. **Graph tests** (Category A) - Quick wins
3. **Service layer tests** (Category B) - Medium complexity
4. **MCP discovery tests** (Category D) - Performance validation
5. **Doc validator tests** (Category E) - TDD alignment
6. **Mermaid tests** (Category F) - Lower priority

---

## Next Steps

1. ✅ **Approve recommendation** (this document)
2. ⏭️ **Phase 1**: Discovery (verify test status)
3. ⏭️ **Phase 2**: Re-enablement (remove #[ignore])
4. ⏭️ **Documentation**: Update CLAUDE.md, CHANGELOG.md
5. ⏭️ **Release**: v2.167.0 (GitHub + crates.io)

---

**Status**: READY TO PROCEED
**Confidence**: HIGH (based on Sprint 42/43 success)
**Expected Duration**: 2-3 hours
**Expected Value**: 15-25 tests re-enabled, FAST methodology strengthened

---

*Document created: October 19, 2025*
*Sprint: 44*
*Methodology: EXTREME TDD + FAST + Five Whys (Toyota Way)*