ADM Prediction (Made Before Running Models)
Predicted winner: Domain frames the problem. Cross-population transfer is a calibration problem, not a learning problem. Published Asian-specific biomarker differences (HbA1c, CRP, BMI) are well-documented. The question is whether these adjustments are still accurate or outdated by westernization.
Expected: Domain knowledge provides the framework; data calibrates the numbers. Actual: Published adjustments confirmed for 4/7 biomarkers, but 3/7 reversed — outdated literature hurts. Fine-tuning on 30% local data outperforms pure transfer. Prediction confirmed.
Clinical reference ranges were developed primarily in Western populations. When applied to East Asian populations, systematic biases emerge — CRP levels differ by 85%, triglycerides are 15% higher in China, and HbA1c thresholds may misclassify diabetes risk. Published adjustment factors exist but date from early 2000s studies. Dietary patterns, urbanization rates, and metabolic syndrome prevalence have shifted dramatically in both countries since then.
NHANES 2017-2018
9,254 participants. Nationally representative US sample with full biomarker panels.
CHNS 2009
9,549 participants. Multi-province Chinese longitudinal study with comparable biomarkers.
Domain Model
Published literature adjustment factors for cross-population biomarker conversion.
Fine-Tuned Model
Model trained on NHANES, fine-tuned on a small CHNS subset to learn current population shifts.
The raw transfer baseline applies NHANES-trained norms directly to CHNS data with no adjustment. This measures the naive cross-population gap.
The adjusted transfer applies published adjustment factors from the medical literature — offsets for HbA1c, CRP, HDL, etc., derived from studies in the early 2000s. These capture known population differences but may be outdated.
The fine-tuned transfer trains on NHANES, then fine-tunes on a held-out CHNS subset. This lets the model learn current population-specific patterns rather than relying on 20-year-old literature values. The result: 57% error reduction vs raw transfer and 42% vs published adjustments.
Side-by-side comparison of mean biomarker values across the two populations. Some differences are well-documented (CRP, triglycerides); others are less studied. The magnitude of difference varies substantially across biomarkers.
Three strategies for applying NHANES norms to CHNS data: raw transfer (no adjustment), published literature adjustments, and fine-tuning on cross-population data. Fine-tuning outperforms because the world has changed since the adjustment factors were published.
Domain knowledge correctly identifies which biomarkers need adjustment. The model discovers updated magnitudes and interaction patterns the literature doesn't capture.
The population distributions overlap substantially but are shifted. A fixed threshold (e.g., HbA1c ≥ 6.5% for diabetes) misclassifies more in one population than the other. Visualizing the overlap makes clear why simple offsets are insufficient.
Two populations only: NHANES (USA) and CHNS (China). Transfer to other populations (South Asian, Sub-Saharan African, Latin American) would require separate validation.
Different survey years: NHANES 2017-2018 vs CHNS 2009. The 8-year gap introduces temporal confounding — some “population differences” may be secular trends.
Not all biomarkers matched: NHANES and CHNS measure overlapping but not identical biomarker panels. Comparison limited to 7 shared biomarkers.
Fine-tuning sample size: The fine-tuned model uses a held-out subset of CHNS. In practice, obtaining labeled data from a new population is the bottleneck — the method assumes some local data is available.
- Fine-tuning beats published adjustments by 42% — literature values from the early 2000s no longer reflect current population metabolic profiles. Urbanization, dietary shifts, and obesity trends have changed the numbers.
- Domain knowledge identifies WHAT to adjust; ML calibrates HOW MUCH — published literature correctly flags HbA1c, CRP, and BMI as requiring cross-population adjustment. But the magnitudes are wrong. The hybrid approach uses both.
- CRP shows the largest gap (85% difference) — inflammation markers are the most sensitive to population-specific dietary and lifestyle patterns, making fixed adjustment factors particularly unreliable.