Skip to main content

California Freight Cleanup → Investigation 3-5

What accuracy ceiling does EPA's monitor-informed fused product actually set?

5-fold CV-RMSE 0.913 µg/m³ • R² +0.928 • AQS-in-fitting caveat • NOT the operational hero stat

EPA’s FAQSD fuses a chemistry model with the nationwide monitor network using spatial Bayesian downscaling. Against the same validation monitors it scores RMSE 0.913 µg/m³ — about three times better than our corrected surrogate. The catch: FAQSD was trained on the same monitors used for validation. The ~3× gap between it and our 2.76 result is approximately what having access to the full monitor network in fitting is worth.

What is the realistic accuracy ceiling for PM2.5 estimation at AQS sites when the predictor has access to the AQS network in fitting? Without L5, a reviewer might assume L4’s 0.91 µg/m³ is far from best achievable. L5’s 0.913 µg/m³ reveals two things: L4 operates within a factor of ~3 of a product that has effectively seen most of the AQS network; and the ~3× gap from L4 to L5 is approximately the value of AQS-at-test-site information — the honest cost of the cascade’s leakage discipline.

No model is fit. FAQSD daily PM2.5 files (2019–2022) are loaded, averaged to annual mean per census tract centroid (∼9k California tracts), and sampled at each AQS site by nearest-tract-centroid join with a 10 km cap (dropping sparse rural sites that fall outside FAQSD coverage). Observed annual means come from the Investigation 3-1 AQS daily panel. The 5-fold readout reuses Investigation 3-1’s site groupings for rung-comparability; the fold-to-fold RMSE spread is noise around a near-constant (no model to overfit).

FAQSD background. EPA’s FAQSD applies the Berrocal, Gelfand & Holland 2010 hierarchical Bayesian downscaling model (DOI 10.1214/09-AOAS305) to fuse CMAQ fields with AQS monitoring data. The spatial covariance in that framework propagates neighbor AQS information to any given tract centroid. A held-out AQS site is not directly in the FAQSD fitting, but its nearest-tract-centroid value is influenced by neighboring monitors via the spatial covariance. This is the leakage mechanism.

RMSE 0.913 µg/m³, R² +0.928 — with the caveat that the monitor data is in the fitting

Global in-sample RMSE 0.910 µg/m³, MFB +0.020, R² +0.928 across 256 site-years (64 sites × 4 years, 2019–2022). Per-fold RMSE 0.801.01 (SD 0.081) — essentially flat. Both regulatory gates pass by construction.

FoldTest sitesnRMSEMFB
016560.8000.010+0.914
115600.908+0.022+0.936
213520.9610.001+0.904
312480.884+0.048+0.938
410401.014+0.051+0.936

Consistent across years (0.83–1.04 RMSE), with 2020’s wildfire season as the hardest

YearRMSE µg/m³MFBn
20190.857+0.02864
20201.036+0.01764
20210.830+0.01564
20220.903+0.01864

2020 is the worst year (1.036), consistent with wildfire-season stress on the FAQSD fused field; 2021 is best (0.830). All MFBs are +0.015 to +0.028 — uniformly small positive bias guaranteed structurally by AQS-in-fitting.

Full ladder: 6.7× compression from the raw model to the fused ceiling — our production surrogate sits in between

RungMethodRMSE µg/m³AQS in fitting?
L1ISRM × NEI matrix lookup6.078No
L2InMAP-direct steady-state11.463No
L3van Donkelaar V5.NA.05.024.343No
L4MFGP-corrected (Investigation 3-4) — operational0.913Train sites only (blocked at test)
L5FAQSD Bayesian-fused — reference0.913Yes (leakage)

The 6.7× compression from L1 to L5 quantifies how much of the CMAQ/ISRM residual is recoverable when the predictor sees the AQS network directly. The remaining ~3× gap from L4 (2.76) to L5 (0.913) is approximately the value of AQS-at-test-site information — the honest cost of the cascade’s leakage discipline.

Item
run.py[internal artifact]
results.jsoninvestigations/43_l5-faqsd-reference/latest/results.json
Method labelfaqsd_bayesian_downscaling_fused
FAQSD inputsdata/raw/faqsd/{2019,2020,2021,2022}_pm25_daily_average.txt.gz (sha256s in results.json)
Berrocal et al. 2010DOI 10.1214/09-AOAS305 — spatial Bayesian downscaling framework
Upstream: Investigation 3-1 foldssha256 c63ae2d281ce — L1 headline RMSE 6.078
Upstream: Investigation 3-2 L2 RMSE11.463 µg/m³ (sha256 20cdce2d11d4)
Upstream: Investigation 3-3 L3 RMSE4.343 µg/m³ (sha256 a368ef9c6ed9)
Consumed by Investigation 3-4l5_faqsd.by_year.2019.rmse_ugm3 = 0.857 (sha256 278e28fe52db)
Last run2026-05-01 (results sha256 278e28fe52db)