feat(health): detect no-Xid GPU freezes (open-module VA-space faults) #46

Merged
jessey merged 4 commits from feat/gpu-vaspace-spt into main 2026-05-29 14:10:59 +00:00
Owner
No description provided.
jessey added 3 commits 2026-05-29 14:09:57 +00:00
The kernel-log scanner only caught Xid codes, OOM, panic, MCE, AER, thermal,
and amdgpu resets — so a hard freeze that logs NO Xid slipped through entirely.
Add detection for the NVIDIA open-kernel-module VA-space mapping fault
(gpu_vaspace.c / dmaAllocMapping / NVKMS GEM-allocation failures), which can
storm for minutes and end in a freeze without the GPU ever "falling off the
bus". Also flag when the open kernel module (nvidia-*-open) is loaded — the
context behind these faults — and add an AI-knowledge entry so the assistant
distinguishes it from the Xid 79 hardware drop.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
Some titles never show up in a Steam/Lutris/Heroic scan — standalone mod
launchers like SPT (Single-Player Tarkov), itch.io downloads, hand-installed
executables. Add a user-authored custom-games list (core/customgames.py) shown
alongside the other sources in `rigdoctor games` and the GUI.

Each entry can carry a launch command and a log directory:
  - `rigdoctor games add "SPT" --command .../tarkov.sh` (logs/ auto-detected)
  - `rigdoctor games play "SPT"` launches it under the crash-capture wrapper
    (wrap.run gains an explicit game-name override, since there's no SteamAppId)
  - the diagnostic now feeds the game's own logs to the analysis: gamelogs
    .collect(game=...) tails the registered log dir (SPT's server/launcher logs)
    alongside the kernel log, freshness-scoped by mtime.

Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
chore(release): v0.42.0
tests / core (pull_request) Successful in 17s
tests / gui-smoke (pull_request) Successful in 29s
0f9cb4b684
Co-Authored-By: Claude Opus 4.8 (1M context) <noreply@anthropic.com>
jessey added 1 commit 2026-05-29 14:10:03 +00:00
Merge branch 'main' into feat/gpu-vaspace-spt
tests / core (pull_request) Successful in 12s
tests / gui-smoke (pull_request) Successful in 29s
b65f36bb2d
jessey scheduled this pull request to auto merge when all checks succeed 2026-05-29 14:10:07 +00:00
jessey scheduled this pull request to auto merge when all checks succeed 2026-05-29 14:10:56 +00:00
jessey merged commit ac4863b0d4 into main 2026-05-29 14:10:59 +00:00
jessey deleted branch feat/gpu-vaspace-spt 2026-05-29 14:10:59 +00:00
Sign in to join this conversation.