Article


Article Code : 13971115177384(DOI : 10.7508/jist.2019.01.003)

Article Title : Farsi Conceptual Text Summarizer: A New Model in Continuous Vector Space

Journal Number : 25 Winter 2019

Visited : 64

Files : 703 KB


List of Authors

  Full Name Email Grade Degree Corresponding Author
1 Mohammad Ebrahim Khademi ekhademi@chmail.ir Post Graduate Student Graduate Student
2 Mohammad Fakhredanesh fakhredanesh@mut.ac.ir Assistant Professor PhD
3 Seyed Mojtaba Hoseini mojtabahoseini@aut.ac.ir Assistant Professor PhD

Abstract

Traditional methods of summarization were very costly and time-consuming. This led to the emergence of automatic methods for text summarization. Extractive summarization is an automatic method for generating summary by identifying the most important sentences of a text. In this paper, two innovative approaches are presented for summarizing the Persian texts. In these methods, using a combination of deep learning and statistical methods, we cluster the concepts of the text and, based on the importance of the concepts in each sentence, we derive the sentences that have the most conceptual burden. In the first unsupervised method, without using any hand-crafted features, we achieved state-of-the-art results on the Pasokh single-document corpus as compared to the best supervised Persian methods. In order to have a better understanding of the results, we have evaluated the human summaries generated by the contributing authors of the Pasokh corpus as a measure of the success rate of the proposed methods. In terms of recall, these have achieved favorable results. In the second method, by giving the coefficient of title effect and its increase, the average ROUGE-2 values increased to 0.4% on the Pasokh single-document corpus compared to the first method and the average ROUGE-1 values increased to 3% on the Khabir news corpus.