Skip to main content

Research Repository

Advanced Search

Towards the Development of a Hybrid Parser for Natural Languages

Jaf, Sardar; Allan, Ramsay; Jones, Andrew V.; Ng, Nicholas

Towards the Development of a Hybrid Parser for Natural Languages Thumbnail


Authors

Sardar Jaf

Ramsay Allan

Andrew V. Jones

Nicholas Ng



Abstract

In order to understand natural languages, we have to be able to determine the relations between words, in other words we have to be able to 'parse' the input text. This is a difficult task, especially for Arabic, which has a number of properties that make it particularly difficult to handle. There are two approaches to parsing natural languages: grammar-driven and data-driven. Each of these approaches poses its own set of problems, which we discuss in this paper. The goal of our work is to produce a hybrid parser, which retains the advantages of the data-driven approach but is guided by grammar rules in order to produce more accurate output. This work consists of two stages: the first stage is to develop a baseline data-driven parser, which is guided by a machine learning algorithm for establishing dependency relations between words. The second stage is to integrate grammar rules into the baseline parser. In this paper, we describe the first stage of our work, which is now implemented, and a number of experiments that have been conducted on this parser. We also discuss the result of these experiments and highlight the different factors that are affecting parsing speed and the correctness of the parser results.

Citation

Jaf, S., Allan, R., Jones, A. V., & Ng, N. (2013). Towards the Development of a Hybrid Parser for Natural Languages. In 2013 Imperial College Computing Student Workshop (ICCSW’13) (49-56). https://doi.org/10.4230/oasics.iccsw.2013.49

Conference Name 2013 Imperial College Computing Student Workshop.
Conference Location London, United Kingdom
Start Date Sep 26, 2013
End Date Sep 27, 2013
Publication Date Sep 1, 2013
Deposit Date Feb 12, 2016
Publicly Available Date Feb 22, 2016
Pages 49-56
Series Title OASIcs - OpenAccess Series in Informatics
Series Number 35
Book Title 2013 Imperial College Computing Student Workshop (ICCSW’13).
DOI https://doi.org/10.4230/oasics.iccsw.2013.49
Keywords Hybrid Parsing, Arabic Parsing, Grammar-Driven Parser, Data-Driven Parser, Natural Language Processing.

Files




You might also like



Downloadable Citations