Paper Title
An Automatic Text Summarization For Malayalam Using Sentence Extraction
Abstract
Text summarization is the process of generating a short summary of a document that retains its most significant
information. In automatic text summarization, a text is given to the computer, which returns
a shorter, less redundant extract of the original text. The proposed method is a sentence-extraction-based, single-document
summarization approach that produces a generic summary for a Malayalam document. Sentences are ranked based on feature scores
and Google's PageRank formula. The top k ranked sentences are included in the summary, where k depends on the compression
ratio between the original text and the summary. Performance will be evaluated by comparing the summarization outputs with
manual summaries generated by human evaluators.
Keywords—Text summarization, Sentence Extraction, Stemming, TF-ISF score, Sentence similarity, PageRank formula,
Summary generation.
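
The pipeline outlined in the abstract (feature scoring, PageRank over sentence similarity, top-k selection) can be illustrated with a minimal sketch. It is not the authors' implementation, and several details are assumptions made only for illustration: the whitespace tokenizer (no Malayalam stemming is performed), the mean TF-ISF weight as the only feature score, cosine similarity over term counts as the edge weight, a damping factor of 0.85, an equal-weight sum of the two normalised scores, and k taken as the ceiling of the compression ratio times the sentence count. All function names are hypothetical.

```python
import math
from collections import Counter

PUNCT = ".,;:!?\"'()"


def tokenize(sentence):
    """Lowercase whitespace tokens with edge punctuation stripped.
    Assumption: a real Malayalam system would also stem each token."""
    tokens = (tok.strip(PUNCT) for tok in sentence.lower().split())
    return [tok for tok in tokens if tok]


def tf_isf_scores(token_lists):
    """Mean TF-ISF weight per sentence (assumed form of the feature score)."""
    n = len(token_lists)
    # Number of sentences in which each term occurs.
    sentence_freq = Counter(t for toks in token_lists for t in set(toks))
    scores = []
    for toks in token_lists:
        counts = Counter(toks)
        total = sum(c * math.log(n / sentence_freq[t]) for t, c in counts.items())
        scores.append(total / len(toks) if toks else 0.0)
    return scores


def cosine(a, b):
    """Cosine similarity between two token lists (assumed similarity measure)."""
    ca, cb = Counter(a), Counter(b)
    dot = sum(ca[t] * cb[t] for t in ca.keys() & cb.keys())
    norm_a = math.sqrt(sum(v * v for v in ca.values()))
    norm_b = math.sqrt(sum(v * v for v in cb.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def pagerank(weights, damping=0.85, iterations=50):
    """Weighted PageRank by power iteration; weights[j][i] is the edge j -> i."""
    n = len(weights)
    out_sum = [sum(row) for row in weights]
    ranks = [1.0 / n] * n
    for _ in range(iterations):
        # Rank held by sentences with no outgoing weight is spread uniformly.
        dangling = sum(ranks[j] for j in range(n) if out_sum[j] == 0)
        new_ranks = []
        for i in range(n):
            incoming = sum(
                ranks[j] * weights[j][i] / out_sum[j]
                for j in range(n)
                if out_sum[j] > 0
            )
            new_ranks.append((1 - damping) / n + damping * (incoming + dangling / n))
        ranks = new_ranks
    return ranks


def summarize(sentences, compression_ratio=0.3):
    """Return the top-k sentences in their original document order."""
    n = len(sentences)
    if n == 0:
        return []
    token_lists = [tokenize(s) for s in sentences]
    features = tf_isf_scores(token_lists)
    weights = [
        [cosine(token_lists[i], token_lists[j]) if i != j else 0.0 for j in range(n)]
        for i in range(n)
    ]
    ranks = pagerank(weights)
    # Assumption: equal-weight sum of the two max-normalised scores.
    max_feature = max(features) or 1.0
    max_rank = max(ranks)
    combined = [features[i] / max_feature + ranks[i] / max_rank for i in range(n)]
    k = min(n, max(1, math.ceil(compression_ratio * n)))
    top = sorted(range(n), key=lambda i: combined[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(top)]
```

Calling `summarize(sentences, compression_ratio=0.5)` on a list of four sentences returns two of them. Emitting the selected sentences in document order rather than rank order is a design choice in this sketch that keeps the extract readable.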