Skip to main content

California Freight Cleanup → Investigation 3-10

Does the top portfolio survive a full joint-uncertainty stress test?

D mean NB = $38.9B (P5 $12.4B, P95 $80.5B) • P(NB > 0) = 99.99%

Investigation 3-6 decomposed variance in portfolio D’s competitive position using the EPA-narrow VSL band and 4 inputs. Investigation 3-10 extends both dimensions: 6 inputs spanning the full the cascade's uncertainty space, and the canonical-wide VSL envelope. The quantity of interest is raw 30-year net benefit in dollars, not the dominance margin. Together the two investigations bracket the cascade-level sensitivity answer for the CEC freight solicitation Criteria 2 and 3.

The earlier sensitivity analysis (Investigation 3-6) established which inputs drive variance in how far ahead the leading portfolio wins. A follow-on question is equally important: which inputs drive variance in the size of the net benefit in dollars, and does the recommendation survive stress-testing all six major uncertainties at once?

Investigation 3-10 runs the full six-input joint stress test, using the widest defensible priors across the cascade. If the all-in portfolio’s net benefit goes negative for any material fraction of scenarios, the headline recommendation is fragile. If it holds, it is defensible under simultaneous extreme realizations of every major input.

The Saltelli design (SALib v1.5.2, scrambled, seed 20260503) generates N × (2D+2) = 114,688 wrapper calls per quantity of interest. Six inputs are varied jointly:

InputDistributionSource
βPM2.5Normal(μ=0.02439, σ=0.00447)Investigation 6-3 L3 multi-pollutant posterior
VSLTriangular($5M / $11.6M / $20M)rfaq.constants canonical wide envelope (Phase 7 single source)
surrogate_σ (µg/m³)Normal(3.0, 0.8), clipped at 0Tessum 2023 ECHO-AIR/ISRM RMSE
emissions_scaleLognormal(0, 0.20)EPA NEI 2023 TSD §3.6
cost_overrun1 + |HalfNormal(σ=0.20)|Investigation M-2 cost_overrun.sigma
effectiveness_scaleLognormal(0, 0.30)Phase 2 envelope effectiveness CV

The wrapper computes a 30-year NPV net benefit for portfolio D_all_in_4B and, as a secondary comparison, portfolio A_free_lunch. The NPV structure mirrors Investigation 6-4/Investigation 6-6: deaths_avoided × VSL × annuity_factor minus cost × overrun. The annuity factor at 3% discount rate over 30 years is ≈ 19.60.

Two quantities of interest are decomposed: D_all_in_4B (the Investigation 6-6 regret-minimizing winner) and A_free_lunch (the low-cost baseline). The primary robustness gate is P(NB(D) > 0) > 0.95. A secondary metric is P(NB(D) > NB(A)), which confirms D’s absolute and relative robustness simultaneously.

Distribution of D_all_in_4B 30-year net benefit across 114,688 Saltelli draws. The distribution is right-skewed with mean $38.9B and P5 $12.4B; the left tail barely touches zero (P(NB < 0) ≈ 0.01%).
Distribution of D_all_in_4B 30-year NPV net benefit across 114,688 Saltelli draws. Mean $38.9 billion (P5 $12.4B, P50 $34.4B, P95 $80.5B). The tail barely reaches zero: P(NB < 0) = 0.01%. The recommendation is robust under the full 6-D joint prior.
Bar chart of total-order Sobol indices for D_all_in_4B: effectiveness_scale 0.426, VSL 0.298, emissions_scale 0.196, β_PM2.5 0.164, cost_overrun 0.004, surrogate_σ 0.004. Effectiveness scale dominates by a clear margin.
Total-order Sobol indices (ST) for D_all_in_4B net benefit. Effectiveness scale accounts for 39% of ΣST; VSL for 27%. Cost overrun and surrogate σ are negligible (<1% each).

The all-in portfolio passes every robustness test: positive in 99.99% of scenarios, better than the free-lunch baseline in 99.93%

Under the full 6-D joint prior, D’s net benefit is positive in all but 99.99% of draws. D outranks the free-lunch baseline (A) in 99.94% of draws. The headline portfolio recommendation from Investigation 6-6 is not a fragile point estimate—it survives simultaneous extreme realizations of all six major cascade uncertainty sources.

How well the programs actually work in the field is the largest remaining uncertainty (39% of total variance)

The Phase 2 electrification effectiveness envelope (Lognormal CV = 0.30) is the largest single source of NB variance (ST = 0.427, 39.2% of ΣST). This is a policy-implementation variable—how efficiently program spending translates into PM2.5 emission reductions—not a scientific measurement. Better program-level effectiveness monitoring would reduce NB uncertainty more than any measurement campaign targeting βPM2.5 or VSL.

VSL is rank-2 (ST = 0.298) under the wide envelope; rank-3 under EPA-narrow (Investigation 3-6)

Under the canonical-wide VSL ($5M–$20M), VSL contributes 27.2% of total-order variance—a 79% increase in share relative to the EPA-narrow band in Investigation 3-6. This is expected and informative, not a contradiction: the two investigations bracket how much the driver ranking depends on VSL band choice. Each answer built on the last.

Construction cost uncertainty and model residual error are negligible (<1% each)

ST(cost_overrun) = 0.004; ST(surrogate_σ) = 0.004. Construction cost uncertainty and PM2.5 surrogate residual error do not materially affect D’s net benefit variance. Improving surrogate accuracy beyond 3.0 ± 0.8 µg/m³ would not move the portfolio recommendation.

A_free_lunch driver profile mirrors D_all_in_4B (same top-3 ranking)

Both portfolios share the same top-3 drivers (effectiveness_scale ST = 0.427, VSL ST = 0.298, emissions_scale ST = 0.197) with essentially identical index magnitudes. The ranking reflects the shared NB formula structure, not D-specific features.

ItemSHA-256 (12-char)
results.json1ac44c301ef3
analysis.md
scenario.md
Upstream: Investigation 6-3 (CRF posterior) investigations/21_crf-hierarchical-bayes/latest/results.json 3104ba850408
Upstream: Investigation 6-6 (regret baseline) investigations/44_crf-conditional-decision/latest/results.json 5ce9bcd8b87b
Upstream: Investigation M-1 (portfolio frontier) investigations/15_portfolio-frontier/latest/results.json 145dbfd826d0
Run timestamp 2026-05-04T07:48:10   Sobol N = 8192   114,688 wrapper calls per QoI   SALib v1.5.2   seed = 20260503

Note: analysis.md records stale-upstream warnings for Investigation M-1, 22, 23, and 44 (sha256 drift since last Investigation 3-10 run). The robustness finding (P(NB > 0) = 99.99%, top driver = effectiveness_scale) is not expected to change materially on re-run.