Answering complex questions : supervised approaches

Sadid-Al-Hasan, Sheikh; University of Lethbridge. Faculty of Arts and Science

Answering complex questions : supervised approaches

dc.contributor.author	Sadid-Al-Hasan, Sheikh
dc.contributor.author	University of Lethbridge. Faculty of Arts and Science
dc.contributor.supervisor	Chali, Yllias
dc.date.accessioned	2011-07-12T16:47:25Z
dc.date.available	2011-07-12T16:47:25Z
dc.date.issued	2009
dc.degree.level	Masters
dc.description	x, 108 leaves : ill. ; 29 cm	en_US
dc.description.abstract	The term “Google” has become a verb for most of us. Search engines, however, have certain limitations. For example ask it for the impact of the current global financial crisis in different parts of the world, and you can expect to sift through thousands of results for the answer. This motivates the research in complex question answering where the purpose is to create summaries of large volumes of information as answers to complex questions, rather than simply offering a listing of sources. Unlike simple questions, complex questions cannot be answered easily as they often require inferencing and synthesizing information from multiple documents. Hence, this task is accomplished by the query-focused multidocument summarization systems. In this thesis we apply different supervised learning techniques to confront the complex question answering problem. To run our experiments, we consider the DUC-2007 main task. A huge amount of labeled data is a prerequisite for supervised training. It is expensive and time consuming when humans perform the labeling task manually. Automatic labeling can be a good remedy to this problem. We employ five different automatic annotation techniques to build extracts from human abstracts using ROUGE, Basic Element (BE) overlap, syntactic similarity measure, semantic similarity measure and Extended String Subsequence Kernel (ESSK). The representative supervised methods we use are Support Vector Machines (SVM), Conditional Random Fields (CRF), Hidden Markov Models (HMM) and Maximum Entropy (MaxEnt). We annotate DUC-2006 data and use them to train our systems, whereas 25 topics of DUC-2007 data set are used as test data. The evaluation results reveal the impact of automatic labeling methods on the performance of the supervised approaches to complex question answering. We also experiment with two ensemble-based approaches that show promising results for this problem domain.	en_US
dc.identifier.uri	https://hdl.handle.net/10133/2478
dc.language.iso	en_US	en_US
dc.publisher	Lethbridge, Alta. : University of Lethbridge, Dept. of Mathematics and Computer Science, c2009	en_US
dc.publisher.department	Department of Mathematics and Computer Science	en_US
dc.publisher.faculty	Arts and Science	en_US
dc.relation.ispartofseries	Thesis (University of Lethbridge. Faculty of Arts and Science)	en_US
dc.subject	Natural language processing (Computer science)	en_US
dc.subject	Supervised learning (Machine learning)	en_US
dc.subject	Semantic computing	en_US
dc.subject	Computational linguistics	en_US
dc.subject	Information retrieval	en_US
dc.subject	Dissertations, Academic	en_US
dc.title	Answering complex questions : supervised approaches	en_US
dc.type	Thesis	en_US

Files

Original bundle

Now showing 1 - 1 of 1

Name:: SADIDALHASAN_SHEIKH_MSC_2009.pdf
Size:: 470.9 KB
Format:: Adobe Portable Document Format
Description:

Download

License bundle

Now showing 1 - 1 of 1

Name:: license.txt
Size:: 1.63 KB
Format:: Item-specific license agreed upon to submission
Description:

Download

Collections

Arts and Science, Faculty of
University of Lethbridge Theses

Library

Answering complex questions : supervised approaches

Files

Original bundle

License bundle

Collections

Students

Information for

Campus

Follow us on social media: