Integrating Machine Learning into Epidemiologic Research and its Implications for Non‑Response Bias

Maleki, Zahra; Khatibi, Reza; Yazdansetad, Zahra; Vali, Mohebat

دوره 4، شماره 2 - ( 1-1404 ) جلد 4 شماره 2 صفحات 73-72 | برگشت به فهرست نسخه ها

Mendeley

Zotero

RefWorks

Maleki Z, Khatibi R, Yazdansetad Z, Vali M. Integrating Machine Learning into Epidemiologic Research and its Implications for Non‑Response Bias. JRHMS 2025; 4 (2) :72-73
URL: http://jrhms.thums.ac.ir/article-1-133-fa.html

Integrating Machine Learning into Epidemiologic Research and its Implications for Non‑Response Bias. عنوان نشریه. 1404; 4 (2) :72-73

URL: http://jrhms.thums.ac.ir/article-1-133-fa.html

Integrating Machine Learning into Epidemiologic Research and its Implications for Non‑Response Bias

چکیده: (151 مشاهده)

The integration of Machine Learning (ML) and Artificial Intelligence (AI) into epidemiological research offers unprecedented opportunities for data analysis, prediction, and enhanced inference. However, these opportunities come with methodological challenges—particularly when addressing non‑response bias, a longstanding threat to validity in observational studies (1).
Importance of ML in Epidemiologic Context
Machine learning enhances epidemiologic analyses by handling high‑dimensional data, capturing complex nonlinear relationships, and enabling predictive accuracy beyond that of traditional regression models. In causal inference, ML can complement parametric models by reducing model specification bias when functional forms are unknown. However, it introduces the risk of plug‑in bias when ML estimates are used directly in effect estimation formulas without careful methodological calibration (2).
Non‑Response Bias in Epidemiologic Research
Non‑response bias occurs when individuals selected for a study (e.g., longitudinal cohort or cross‑sectional survey) do not provide complete data, and their absence is associated with exposures or outcomes of interest. This leads to non‑random missingness, compromising generalizability and potentially skewing estimates. For instance, in patient‑reported outcomes research, socioeconomically deprived and non‑white respondents had significantly lower response rates, altering the representativeness of survey results (3).
ML for Predicting and Mitigating Non‑Response
Recent evidence shows the utility of ML to predict survey response propensity, thereby supporting targeted retention strategies. In the Millennium Cohort Study, for example, researchers developed supervised ML classifiers to predict response to follow‑up surveys, achieving improved predictive performance compared with standard models. These findings highlight that ML can identify patterns of non‑response, enabling epidemiologists to implement targeted outreach or adjust analytic weights to address differential participation (4).
Methodological Considerations and Best Practices
To ensure robust epidemiologic application of ML in the context of non‑response bias, the following practices are critical:
Explicit Bias Assessment: Employ fairness metrics and bias detection tools during model development to measure disparities across subgroups.
Weighted and Calibrated Models: Integrate survey weights or inverse probability weighting within ML training to adjust for known nonresponse mechanisms.
Transparent Reporting: Report model inputs, assumptions, and potential limitations according to established guidelines (e.g., STROBE with ML extensions).
Sensitivity Analyses: Perform sensitivity analyses to evaluate how predictions and effect estimates change under different assumptions about missing data mechanisms.
Conclusion
The application of ML in epidemiology can substantially improve predictive performance and operational efficiency. Nonetheless, the threat of non‑response bias remains profound when ML models unknowingly reflect underlying participation disparities. Thus, a commitment to bias mitigation, rigorous methodological scrutiny, and transparent reporting is imperative to uphold the validity and equity of epidemiologic research in the era of AI and ML.

متن کامل [PDF 745 kb] (67 دریافت)

موضوع مقاله: عمومى
دریافت: 1405/2/1 | پذیرش: 1405/2/1 | انتشار: 1405/3/31

بازنشر اطلاعات
	این مقاله تحت شرایط Creative Commons Attribution-NonCommercial 4.0 International License قابل بازنشر است.

کلیه حقوق این وب سایت متعلق به Journal of Research in Health and Medical Sciences می باشد.

طراحی و برنامه نویسی : یکتاوب افزار شرق

Designed & Developed by : Yektaweb

پایگاه های مرتبط

کلمات کلیدی

نظرسنجی