Oura Ring Gen 4 sensor data — not clinical measurementsN=1 case study — not validated for clinical decisionsHEV diagnosed Mar 18; interpret findings cautiously in this Day 55 post-ruxolitinib window

Causal Inference: Ruxolitinib

Data period: 2026-01-08 to 2026-05-09 (122 days) | Total runtime: 1.0s
Generated 2026-05-11 07:21 · Post-HSCT Patient
Four complementary causal analysis methods explore whether Oura biometrics shifted after ruxolitinib (10 mg BID, started 2026-03-16) on Oura Ring biometrics. Data period: 2026-01-08 to 2026-05-09 (122 days).
Pre-intervention
Info
67days
Post-intervention
Normal
55days
Raw p<0.05
Info
0/0
FDR-significant
None
0/0
Lowest raw p
Not significant
p=1.000
N/A | q=1.0000
Methods used
Info
4
CI + PCMCI+ + TE + Mediation
Table of contents:
  1. Individual Metric Treatment Response
  2. CausalImpact - Bayesian Structural Time Series
  3. Statistical Power & Interpretation
  4. Placebo tests (falsification)
  5. Granger Causality Network (PCMCI+)
  6. Transfer Entropy
  7. Intervention Response Decomposition
  8. Clinical Interpretation

0. Individual Metric Treatment Response

Method: Non-parametric Mann-Whitney U test compares pre- vs post-ruxolitinib distributions for each core biometric. Effect sizes reported as Cohen's d with bootstrap 95% CI (2000 iterations). These direct statistical tests provide intuitive per-metric significance before the multivariate causal methods below.
HRV (RMSSD)
Significant
p<0.001
d=+1.54 (large)
Lowest HR
Significant
p<0.001
d=-1.57 (large)
Average HR
Significant
p<0.001
d=-1.42 (large)
Sleep Efficiency
Significant
p<0.001
d=+0.66 (medium)
Metric Pre-Rux Mean Post-Rux Mean Change Cohen's d Mann-Whitney 95% CI (diff)
HRV (RMSSD)10.00 ms22.87 ms+12.88 ms+1.54 (large)Sig p<0.001[+9.74, +16.31]
Lowest HR76.72 bpm66.52 bpm-10.20 bpm-1.57 (large)Sig p<0.001[-12.62, -7.76]
Average HR85.17 bpm74.92 bpm-10.25 bpm-1.42 (large)Sig p<0.001[-12.97, -7.69]
Sleep Efficiency78.62 %81.40 %+2.78 %+0.66 (medium)Sig p<0.001[+1.02, +4.34]
Note: These per-metric tests complement the Bayesian CausalImpact analysis below. For full changepoint detection and multi-patient comparison, see the Comparative Treatment Response report.

1. CausalImpact - Bayesian Structural Time Series Analysis

Method: Bayesian Structural Time Series (BSTS) models pre-intervention dynamics and generates a counterfactual prediction for the post-period. The difference between actual and counterfactual estimates the causal effect, with full posterior uncertainty. MCMC: 5,000 iterations, weekly seasonal component (nseasons=7). Benjamini-Hochberg FDR correction for multiple testing.

FAILED CausalImpact package not installed. Install with: pip install pycausalimpact

1a. Statistical Power & Interpretation

Purpose: All 11 metrics sorted by statistical significance, with Benjamini-Hochberg corrected q-values for multiple testing.

No CausalImpact results available for statistical power analysis.

1b. Placebo tests (intervention date falsification)

Method: CausalImpact is run with 3 random placebo dates in the pre-period on the 3 most significant metrics. Placebo dates should NOT show significant effects.

CausalImpact failed, skipping placebo

2. Granger Causality Network (PCMCI+)

Method: PCMCI+ (tigramite) tests for time-lagged causal relationships between biometric variables using partial correlation. Tau_max = 7 days.

Full period (0 days): No significant causal links found.

Pre-ruxolitinib (0 days): No significant causal links found.

3. Transfer Entropy

Method: Transfer entropy quantifies directional information flow between biometric streams. Comparison of TE matrices for pre- and full period reveals changes in information coupling after ruxolitinib start.

Full period (121 days)

SourceTargetTE (bits)Net TE
TotalSleepLowestHR0.7815+0.2621
DeepDurLowestHR0.7815+0.2030
SpO2LowestHR0.7815+0.1262
REMpctLowestHR0.7581+0.1604
SpO2RespRate0.7544+0.1160
REMdurRespRate0.7374+0.2222
TotalSleepRespRate0.7374+0.2181
DeepDurRespRate0.7374+0.1759

Pre-ruxolitinib (67 days)

SourceTargetTE (bits)Net TE
REMdurRespRate0.5564+0.1891
TotalSleepRespRate0.5564+0.3064
DeepDurRespRate0.5564+0.2439
RMSSDmaxRespRate0.5564+0.3689
LowestHRRespRate0.5564+0.2127
AvgHRRespRate0.5564+0.1071
SpO2RespRate0.5564+0.1132
REMpctRespRate0.5252+0.0835

Change in information flow

Largest increase: REMpct -> LowestHR (+0.4456 bits)

Largest decrease: REMdur -> REMdur (+0.0000 bits)

4. Intervention Response Decomposition

Method: Linear mediation analysis (Baron-Kenny) decomposes total ruxolitinib effect into four mediating pathways. Bootstrap (2000 iterations) for confidence intervals.
+8.4
Total effect (readiness score)
64.8 -> 73.3
Pre -> Post average
p<0.001
Raw p-value Significant
PathwayMediator (pre->post)a (T->M) b (M->Y)Indirect effect [95% CI] % mediatedp-value
Direct cardiac
Ruxolitinib -> HR change -> Readiness
92.57 -> 85.16-0.760-0.571+0.4339
[+0.2124, +0.7015]
47.5%0.0000 Sig
Autonomic
Ruxolitinib -> HRV change -> Readiness
10.00 -> 23.45+1.265+0.633+0.8005
[+0.5755, +1.0733]
87.6%0.0000 Sig
Sleep-mediated
Ruxolitinib -> Sleep efficiency -> Readiness
78.62 -> 81.40+0.629+0.225+0.1417
[+0.0144, +0.3332]
14.8%0.0190 Sig
Inflammatory
Ruxolitinib -> Temperature deviation -> Readiness
0.06 -> -0.03-0.341-0.201+0.0683
[-0.0236, +0.2371]
7.5%0.2030 NS

5. Clinical Interpretation

Executive summary

N/A
Strongest signal: p=1.000
0/0
Metrics trending in expected direction
55
Post-intervention days (Day 14 target)
  • N/A: strongest hypothesis-generating raw p-value signal (p=1.0000, q=1.0000). It does not survive FDR correction, so confirmation still depends on more post-treatment follow-up.
  • CausalImpact: 0 of 0 biometric streams show significant causal change (p < 0.05). After Benjamini-Hochberg FDR correction: 0 of 0 remain significant (q < 0.05).
  • Placebo validation: N/A - ?/? placebo tests reached significance. This keeps the result vulnerable to false positives rather than establishing a confirmed intervention effect.
  • PCMCI+: 0 significant time-lagged causal links identified in the biometric network
  • Mediation analysis: 3 of 4 mediating pathways show significant indirect effect

Limitations

  • Short post-period (55 days): All results are preliminary. Minimum 14-21 days of post-intervention data recommended for robust causal inference.
  • Confounders: Linear methods cannot capture non-linear interactions. Seasonal variation, activity level, and other medications are not controlled for.
  • Wearable data: Oura Ring is not a medical device. Measurements have inherent noise that can affect causal estimates.
  • Single patient: N=1 study without control group. Causality cannot be definitively established, but Bayesian posterior probability of effect provides a strength measure.
  • HEV diagnosis: HEV was diagnosed 2026-03-18 (2 days after ruxolitinib start). Hepatitis may confound biometric changes.

Recommendations

  1. Repeat analysis after 2-3 weeks of ruxolitinib treatment for robust causal inference
  2. Add HEV-related biomarkers (ALT, bilirubin) as time-varying covariates
  3. Consider synthetic control method when longer time series are available
  4. Combine with clinical endpoints (GVHD scoring, ferritin) for multimodal analysis