arXiv:2311.18807 Abstract | arXiv Analytics

arXiv:2311.18807 [cs.LG]Abstract References Reviews Resources

Pre-registration for Predictive Modeling

Jake M. Hofman, Angelos Chatzimparmpas, Amit Sharma, Duncan J. Watts, Jessica Hullman

Published 2023-11-30Version 1

Amid rising concerns of reproducibility and generalizability in predictive modeling, we explore the possibility and potential benefits of introducing pre-registration to the field. Despite notable advancements in predictive modeling, spanning core machine learning tasks to various scientific applications, challenges such as overlooked contextual factors, data-dependent decision-making, and unintentional re-use of test data have raised questions about the integrity of results. To address these issues, we propose adapting pre-registration practices from explanatory modeling to predictive modeling. We discuss current best practices in predictive modeling and their limitations, introduce a lightweight pre-registration template, and present a qualitative study with machine learning researchers to gain insight into the effectiveness of pre-registration in preventing biased estimates and promoting more reliable research outcomes. We conclude by exploring the scope of problems that pre-registration can address in predictive modeling and acknowledging its limitations within this context.

Categories: cs.LG, stat.ME

Keywords: predictive modeling, spanning core machine learning tasks, current best practices, lightweight pre-registration template, amid rising concerns

Related articles: Most relevant | Search more

arXiv:2402.01077 [cs.LG] (Published 2024-02-02, updated 2024-08-13)

Recent Advances in Predictive Modeling with Electronic Health Records

Jiaqi Wang et al.

arXiv:1811.06109 [cs.LG] (Published 2018-11-14)

Predictive Modeling with Delayed Information: a Case Study in E-commerce Transaction Fraud Control

Junxuan Li, Yung-wen Liu, Yuting Jia, Yifei Ren, Jay Nanduri

arXiv:2309.12036 [cs.LG] (Published 2023-09-21)

Uplift vs. predictive modeling: a theoretical analysis

Théo Verhelst, Robin Petit, Wouter Verbeke, Gianluca Bontempi

arXiv Analytics

arXiv:2311.18807 [cs.LG]Abstract References Reviews Resources

Pre-registration for Predictive Modeling

Links

Toolbox

arXiv:2311.18807 [cs.LG]AbstractReferencesReviewsResources

Pre-registration for Predictive Modeling

Links

Toolbox

arXiv:2311.18807 [cs.LG]Abstract References Reviews Resources