IINTS-AF Theory Stress Lab¶
The Theory Stress Lab is a deterministic scientific red-team mode for the SDK. It is designed for Jetson and CI runs that continuously look for weak points in the physiological and safety theories encoded in IINTS-AF.
It is pre-clinical simulation QA. It is not medical validation and must not be used for patient care.
Run It¶
iints jetson theory-stress run --output-dir results/theory_stress_lab --profile jetson --seed 42
For a longer Jetson pass, repeat the configured suite with shifted seeds:
iints jetson theory-stress run \
--output-dir results/theory_stress_lab_overnight \
--profile jetson \
--seed 42 \
--repeats 100
For CI gating, fail when any invariant fails:
iints jetson theory-stress run --profile ci --output-dir results/theory_stress_ci --fail-on-weakness
The module can also run directly:
python -m iints.tools.theory_stress_lab --output-dir results/theory_stress_lab --profile ci
Outputs¶
The tool writes:
summary.md: human-readable report for research noteschecks.json: machine-readable check payloadweakness_rankings.csv: ranked weak points sorted by lowest score
Current v1 Checks¶
no_negative_states: glucose, FFA, ketones, beta mass, IOB, and COB must remain finite and non-negativehypo_blocks_insulin: Safety Supervisor must block insulin during hypoglycemiaiob_limits_bolus: high IOB must cap extra bolus requests using mass balancepump_failure_raises_ffa_ketones: insulin absence should raise FFA and ketone pressuresensor_lag_is_bounded: CGM lag should be visible but not teleportingexercise_does_not_create_impossible_crash: exercise + IOB should not create non-physiological glucose motionmeal_response_has_plausible_peak: meal absorption should peak in a plausible size/time windowillness_increases_insulin_need_without_exploding: illness should raise glucose pressure without numerical explosion
Interpretation¶
A failed check is not automatically a product failure. It is a lead for model improvement. The intended workflow is:
- run the lab on Jetson or CI
- inspect
weakness_rankings.csv - inspect the model or parameter causing the top weak point
- adjust the physiological theory or safety bound
- rerun the lab
This makes the Jetson a local scientific bug-hunter for the SDK.