We use cookies to ensure that we give you the best experience on our website. By continuing to browse this repository, you give consent for essential cookies to be used. You can read more about our Privacy and Cookie Policy.

Durham Research Online
You are in:

Identifying changes in the cybersecurity threat landscape using the LDA-Web Topic Modelling Data Search Engine.

Al Moubayed, N. and Wall, D. and McGough, A. S. (2017) 'Identifying changes in the cybersecurity threat landscape using the LDA-Web Topic Modelling Data Search Engine.', in Human aspects of information security, privacy and trust : 5th International Conference, HAS 2017, held as part of HCI International 2017, Vancouver, BC, Canada, July 9-14, 2017, proceedings. Cham: Springer, pp. 287-295. Lecture notes in computer science. (10292).


Successful Cybersecurity depends on the processing of vast quantities of data from a diverse range of sources such as police reports, blogs, intelligence reports, security bulletins, and news sources. This results in large volumes of unstructured text data that is difficult to manage or investigate manually. In this paper we introduce a tool that summarises, categorises and models such data sets along with a search engine to query the model produced from the data. The search engine can be used to find links, similarities and differences between different documents in a way beyond the current search approaches. The tool is based on the probabilistic topic modelling technique which goes further than the lexical analysis of documents to model the subtle relationships between words, documents, and abstract topics. It will assists researchers to query the underlying models latent in the documents and tap into the repository of documents allowing them o be ordered thematically.

Item Type:Book chapter
Full text:(AM) Accepted Manuscript
Download PDF
Publisher Web site:
Publisher statement:The final publication is available at Springer via
Date accepted:21 February 2017
Date deposited:21 March 2017
Date of first online publication:13 May 2017
Date first made open access:13 May 2018

Save or Share this output

Look up in GoogleScholar