Toward abstractive multi-document summarization using submodular function-based framework, sentence compression and merging
dc.contributor.author | Tanvee, Moin Mahmud | |
dc.contributor.author | University of Lethbridge. Faculty of Arts and Science | |
dc.contributor.supervisor | Chali, Yllias | |
dc.date.accessioned | 2017-05-03T19:08:40Z | |
dc.date.available | 2017-05-03T19:08:40Z | |
dc.date.issued | 2016 | |
dc.degree.level | Masters | en_US |
dc.description.abstract | Automatic multi-document summarization is a process of generating a summary that contains the most important information from multiple documents. In this thesis, we design an automatic multi-document summarization system using different abstraction-based methods and submodularity. Our proposed model considers summarization as a budgeted submodular function maximization problem. The model integrates three important measures of a summary - namely importance, coverage, and non-redundancy, and we design a submodular function for each of them. In addition, we integrate sentence compression and sentence merging. When evaluated on the DUC 2004 data set, our generic summarizer has outperformed the state-of-the-art summarization systems in terms of ROUGE-1 recall and f1-measure. For query-focused summarization, we used the DUC 2007 data set where our system achieves statistically similar results to several well-established methods in terms of the ROUGE-2 measure. | en_US |
dc.embargo | No | en_US |
dc.identifier.uri | https://hdl.handle.net/10133/4841 | |
dc.language.iso | en_US | en_US |
dc.proquestyes | No | en_US |
dc.publisher | Lethbridge, Alta : University of Lethbridge, Dept. of Mathematics and Computer Science | en_US |
dc.publisher.department | Department of Mathematics and Computer Science | en_US |
dc.publisher.faculty | Arts and Science | en_US |
dc.relation.ispartofseries | Thesis (University of Lethbridge. Faculty of Arts and Science) | en_US |
dc.subject | automatic text summarization | en_US |
dc.subject | abstraction-based | en_US |
dc.subject | submodular function | en_US |
dc.subject | generic-focused summarization | en_US |
dc.subject | query-focused summarization | en_US |
dc.subject | greedy algorithm | en_US |
dc.subject | Natural language processing (Computer science) -- Research | en_US |
dc.subject | Querying (Computer science) | en_US |
dc.subject | Database searching | en_US |
dc.subject | Parsing (Computer grammar) | en_US |
dc.subject | Information retrieval | en_US |
dc.subject | Question-answering systems -- Research | en_US |
dc.subject | Computer science -- Mathematics | en_US |
dc.title | Toward abstractive multi-document summarization using submodular function-based framework, sentence compression and merging | en_US |
dc.type | Thesis | en_US |