Publisher: Regional Information Center for Science and Technology
Journal: Journal of Information Systems and Telecommunication (JIST)
ISSN: 2322-1437
Volume 7, Issue 26
Date: 2020-01-12
Title: SGF (Semantic Graphs Fusion): A Knowledge-based Representation of Textual Resources for Text Mining Applications
Pages: 120-133
DOI: 10.7508/jist.2019.02.004
Language: en
Authors: Morteza Jaderyan (Bu Ali Sina University), Hassan Khotanlou (Bu Ali Sina University)
Date: 2019-05-28

Abstract: The proper representation of textual documents has been the greatest challenge in text mining applications. In this paper, a knowledge-based representation model for text documents is introduced. The system works by integrating structured knowledge into its core components. Semantic, lexical, syntactic and structural features are identified by the pre-processing module. An enrichment module is introduced to identify contextually similar concepts and concept maps that improve the representation. The information content of the documents and the enriched content are fused (merged) into the graph structure of a semantic network to form a unified and comprehensive representation of the documents. The 20Newsgroup and Reuters-21578 datasets are used for evaluation. The evaluation results suggest that the proposed method achieves high accuracy, recall and precision. The results also indicate that the proposed method performs well in standard text mining applications even when only a small portion of the information content is available.
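The fusion step mentioned in the abstract can be pictured with a minimal sketch. It is not the authors' implementation: the function names `build_graph` and `fuse`, the example concepts, the edge weights, and the rule of summing the weights of shared edges are all assumptions made for illustration only.

```python
from collections import defaultdict


def build_graph(edges):
    """Build an undirected weighted graph from (concept_a, concept_b, weight) triples."""
    graph = defaultdict(dict)
    for a, b, w in edges:
        graph[a][b] = graph[a].get(b, 0.0) + w
        graph[b][a] = graph[b].get(a, 0.0) + w
    return graph


def fuse(doc_graph, enrichment_graph):
    """Merge two concept graphs; an edge present in both has its weights summed.

    The summing rule is a hypothetical choice, not taken from the paper.
    """
    fused = defaultdict(dict)
    for graph in (doc_graph, enrichment_graph):
        for a, neighbours in graph.items():
            for b, w in neighbours.items():
                fused[a][b] = fused[a].get(b, 0.0) + w
    return fused


# Hypothetical concepts extracted from a document by pre-processing.
doc_edges = [("graph", "document", 1.0), ("document", "text", 1.0)]
# Hypothetical contextually similar concepts supplied by an enrichment step.
enriched_edges = [("text", "corpus", 0.5), ("document", "text", 0.5)]

fused = fuse(build_graph(doc_edges), build_graph(enriched_edges))
for concept in sorted(fused):
    print(concept, dict(fused[concept]))
```

In this sketch the fused graph keeps every concept from both inputs, so enrichment can add nodes such as `corpus` that never appeared in the document itself, while edges found in both graphs are reinforced.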

http://jist.ir/fa/Article/Download/15287