This workflow is used when the Windows host shows freeze/reboot/blue-screen symptoms during QuecPython operations.
- Read-only crash evidence collection.
- Immediate risk reduction policy.
- Gated recovery path before re-enabling flash/deploy workflows.
- Stop all high-risk actions:
- firmware flash,
- QPYcom
cp/lsloops, - force process kill (
taskkill /F), - long-running soak with deploy enabled.
- Keep only read-only probes:
- AT command query,
- REPL one-shot query,
- event-log collection.
- Do not run manual
taskkill /F QPYcom.exestress loops orQPYcom --followhang reproduction on unstable hosts.
python scripts/qpy_crash_triage.py --days 2 --json-out review/host_crash_triage.json --jsonExpected evidence:
Kernel-Power 41restart timeline.WER-SystemErrorReportingbugcheck records and minidump path.- application crash correlations (for example
ARPProtection.exe,QPYcom.exe). - security/USB filter signals (for example
BzProtect,sysdiag,hrdevmon).
- Default to smoke safe mode:
python scripts/device_smoke_test.py --risk-mode safe --auto-ports --json-report review/device-smoke-safe.json- Default soak safe mode:
python scripts/qpy_soak_runner.py --risk-mode safe --duration-seconds 600 --interval-seconds 60 --auto-ports --json- QPYcom actions require explicit acknowledgment:
python scripts/qpy_device_fs_cli.py --json --port COM6 --allow-qpycom-risk ls --path /usrdevice_smoke_test.pyforce kill requires dual guard (disabled by default):--kill-qpycom-mode force --allow-force-killQPY_FORCE_KILL_GUARD=I_UNDERSTAND_RISK
Escalate to OS/driver-level debugging when any condition is true:
- repeated bugcheck within 24h,
- bugcheck code remains unchanged across multiple incidents (for example
0x00000139), - host restarts without completing a safe-mode read-only probe.
Recommended escalation:
- analyze latest minidump with WinDbg,
- align security software policy/whitelist for serial and flashing tools,
- verify USB/serial driver stack stability before resuming flash/deploy.
Only resume standard/aggressive mode after all are satisfied:
- at least one complete safe smoke run passes,
- no new kernel bugcheck during observation window,
- chosen firmware/deploy operation has explicit operator confirmation.