Major improvements to scoring and data collection: **Scoring Changes:** - Failed SMART reads now return 0/100 health (was 50/100) - Critical health issues get much higher penalties: * Reallocated sectors: -50 pts, 5x multiplier (was -20, 2x) * Pending sectors: -60 pts, 10x multiplier (was -25, 5x) * Uncorrectable sectors: -70 pts, 15x multiplier (was -30, 5x) * NVMe media errors: -60 pts, 10x multiplier (was -25, 5x) - Revised weights: 80% health, 15% capacity, 5% resilience (was 60/30/10) - Added priority bonuses: * Failed SMART + small drive (<5TB): +30 points * Failed SMART alone: +20 points * Health issues + small drive: +15 points **Priority Order Now Enforced:** 1. Failed SMART drives (score 90-100) 2. Small drives beginning to fail (70-85) 3. Small healthy drives (40-60) 4. Large failing drives (60-75) **Enhanced SMART Collection:** - Added metadata.devices field parsing - Enhanced dm-device and /dev/mapper/ resolution - Added ceph-volume lvm list fallback - Retry logic with 3 command variations per device - Try with/without sudo, different device flags **Expected Impact:** - osd.28 with reallocated sectors jumps from #14 to top 3 - SMART collection failures should drop from 6 to 0-2 - All failing drives rank above healthy drives regardless of size 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <noreply@anthropic.com>
24 KiB
24 KiB