Xiao, Z. and Higgins, S. (2017) 'The power of noise and the art of prediction.', International journal of educational research., 87 . pp. 36-46.
Data analysis usually aims to identify a particular signal, such as an intervention effect. Conventional analyses often assume a specific data generation process, which implies a theoretical model that best fits the data. Machine learning techniques do not make such an assumption. In fact, they encourage multiple models to compete on the same data. Applying logistic regression and machine learning algorithms to real and simulated datasets with different features of noise and signal, we demonstrate that no single model dominates others under all circumstances. By showing when different models shine or struggle, we argue that it is important to conduct predictive analyses using cross-validation for better evidence that informs decision making.
|Full text:||(AM) Accepted Manuscript|
Download PDF (10143Kb)
|Publisher Web site:||https://doi.org/10.1016/j.ijer.2017.10.006|
|Publisher statement:||© 2018 This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/|
|Date accepted:||25 October 2017|
|Date deposited:||18 July 2018|
|Date of first online publication:||01 December 2017|
|Date first made open access:||01 June 2019|
Save or Share this output
|Look up in GoogleScholar|