Skip to main content

Research Repository

Advanced Search

Looking Back- Looking Forward; Statistics and the Data Science Tsunami

Ridgway, J.; Nicholson, J.; Ridgway, R.

Looking Back- Looking Forward; Statistics and the Data Science Tsunami Thumbnail


Authors

J. Nicholson



Abstract

The discipline of statistics arose from pressing needs to address a variety of social and scientific problems. The founders of the Royal Statistical Society in the UK, and the American Statistical Association were very diverse in their backgrounds and interests, but shared a common purpose – namely, to address difficult and interesting challenges. They also acted in similar ways, by working across disciplines, and inventing mathematics and models suited to new problems. Computer scientists have also addressed real-world problems, have pioneered interesting and exciting approaches to handling new sorts of data (e.g. from sensors and social media) and have developed new analytic tools (notably, tools based on machine learning); their work is having dramatic (and sometimes unexpected) impacts on society. Early encounters between statisticians and computer scientists often resembled ‘turf wars’ – with claims that statistics was fast becoming redundant, and that computer scientists’ ignorance of core statistical concepts such as sample bias would prove fatal to their entire enterprise. The problems that beset the start of the twentieth century have not gone away; modern societies face a wide range of existential threats such as global warming and nuclear war. As before, collaboration across disciplines, and the creation of new modelling tools are needed to address these problems. Here we begin by drawing lessons from the development of computer science in its earliest days, focussing on Babbage’s Analytical Engine. We then highlight key epistemological differences between traditional statistics and traditional computer science, such as the role of theory and the use of ‘black-box’ models. We argue the case for the development of the Epistemological Engine – a tool for analysing and improving the processes of knowledge creation and utilisation that will require the skills of both statisticians and data scientists. We conclude by identifying competences and dispositions relevant to students of statistics and data science, drawing on both contemporary developments and the earliest days of computing.

Citation

Ridgway, J., Nicholson, J., & Ridgway, R. (2020). Looking Back- Looking Forward; Statistics and the Data Science Tsunami. In Proceeding of the 62nd ISI World Statistics Congress 2019: Special Topic Session: Volume 3 (47-56)

Conference Name ISI World Statistics Congress
Conference Location Kuala Lumpur, Malaysia.
Start Date Aug 18, 2019
End Date Aug 23, 2019
Acceptance Date Mar 28, 2019
Online Publication Date Apr 30, 2019
Publication Date 2020-02
Deposit Date Oct 21, 2019
Publicly Available Date Sep 10, 2020
Volume 3
Pages 47-56
Book Title Proceeding of the 62nd ISI World Statistics Congress 2019: Special Topic Session: Volume 3
Public URL https://durham-repository.worktribe.com/output/1141697
Publisher URL https://2019.isiproceedings.org/

Files





You might also like



Downloadable Citations