Xiao, Z. and Higgins, S. (2017) 'The power of noise and the art of prediction.', International journal of educational research., 87 . pp. 36-46.
Data analysis usually aims to identify a particular signal, such as an intervention effect. Conventional analyses often assume a specific data generation process, which implies a theoretical model that best fits the data. Machine learning techniques do not make such an assumption. In fact, they encourage multiple models to compete on the same data. Applying logistic regression and machine learning algorithms to real and simulated datasets with different features of noise and signal, we demonstrate that no single model dominates others under all circumstances. By showing when different models shine or struggle, we argue that it is important to conduct predictive analyses using cross-validation for better evidence that informs decision making.
|Full text:||Publisher-imposed embargo until 01 June 2019. |
(AM) Accepted Manuscript
First Live Deposit - 18 July 2018
File format - PDF (10143Kb)
|Publisher Web site:||https://doi.org/10.1016/j.ijer.2017.10.006|
|Publisher statement:||© 2018 This manuscript version is made available under the CC-BY-NC-ND 4.0 license http://creativecommons.org/licenses/by-nc-nd/4.0/|
|Record Created:||18 Jul 2018 15:58|
|Last Modified:||18 Jul 2018 16:22|
|Social bookmarking:||Export: EndNote, Zotero | BibTex|
|Look up in GoogleScholar | Find in a UK Library|