Effective Summarization Method of Text Documents
- 18 October 2005
- conference paper
- Published by Institute of Electrical and Electronics Engineers (IEEE)
- p. 264-271
- https://doi.org/10.1109/wi.2005.57
Abstract
In this paper, we propose a text summarization method that creates a summary by computing a relevance score for each sentence and extracting sentences from the original document. The method takes into account the weight of each sentence in the document. Its essence is to first represent every sentence in the document as a characteristic vector of the words that appear in the document, and then to calculate a relevance score for each sentence. The relevance score of a sentence is determined by comparing it, using the cosine measure, with all the other sentences in the document and with the document title. Before the method is applied, a set of features is defined, and the weight of each word in a sentence is calculated taking those features into account. The weights of the features that influence the relevance of words are determined using genetic algorithms.
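The scoring step described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not give the exact scoring formula or the feature set, so the sketch assumes plain term counts as word weights (in place of the feature weights the paper tunes with genetic algorithms) and combines the average cosine similarity to the other sentences with the cosine similarity to the title through a hypothetical `title_weight` parameter. The function names `tokenize`, `cosine`, `relevance_scores`, and `summarize` are illustrative only.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two sparse word-weight vectors (Counters)."""
    dot = sum(weight * b[word] for word, weight in a.items() if word in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def relevance_scores(title, sentences, title_weight=0.5):
    """Score each sentence against the other sentences and the title.

    Assumption: word weights are raw term counts, and the two similarity
    terms are mixed linearly by title_weight. The paper's actual weights
    come from a feature set tuned by a genetic algorithm, not shown here.
    """
    vectors = [Counter(tokenize(s)) for s in sentences]
    title_vec = Counter(tokenize(title))
    scores = []
    for i, vec in enumerate(vectors):
        others = [cosine(vec, v) for j, v in enumerate(vectors) if j != i]
        avg_sim = sum(others) / len(others) if others else 0.0
        title_sim = cosine(vec, title_vec)
        scores.append((1 - title_weight) * avg_sim + title_weight * title_sim)
    return scores


def summarize(title, sentences, k=2):
    """Return the k highest-scoring sentences in their original order."""
    scores = relevance_scores(title, sentences)
    top = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(title, sentences, k=2)` on a title and a list of already-split sentences returns the two sentences most similar to the title and to the rest of the document. Sentence splitting is left to the caller, since the abstract does not say how the paper performs it.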