﻿<?xml version="1.0" encoding="utf-8"?>
<ArticleSet>
  <ARTICLE>
    <Journal>
      <PublisherName>Regional Information Center for Science and Technology</PublisherName>
      <JournalTitle>Journal of Information Systems and Telecommunication (JIST)</JournalTitle>
      <ISSN>2322-1437</ISSN>
      <Volume>7</Volume>
      <Issue>26</Issue>
      <PubDate PubStatus="epublish">
        <Year>2020</Year>
        <Month>1</Month>
        <Day>12</Day>
      </PubDate>
    </Journal>
    <ArticleTitle>SGF (Semantic Graphs Fusion): A Knowledge-based Representation of Textual Resources for Text Mining Applications</ArticleTitle>
    <VernacularTitle>SGF (Semantic Graphs Fusion): A Knowledge-based Representation of Textual Resources for Text Mining Applications</VernacularTitle>
    <FirstPage>120</FirstPage>
    <LastPage>133</LastPage>
    <ELocationID EIdType="doi">10.7508/jist.2019.02.004</ELocationID>
    <Language>en</Language>
    <AuthorList>
      <Author>
        <FirstName>Morteza</FirstName>
        <LastName>Jaderyan</LastName>
        <Affiliation>Bu Ali Sina University</Affiliation>
      </Author>
      <Author>
        <FirstName>Hassan</FirstName>
        <LastName>Khotanlou</LastName>
        <Affiliation>Bu Ali Sina University</Affiliation>
      </Author>
    </AuthorList>
    <History PubStatus="received">
      <Year>2019</Year>
      <Month>5</Month>
      <Day>28</Day>
    </History>
    <Abstract>The proper representation of textual documents has been the greatest challenge in text mining applications. In this paper, a knowledge-based representation model for text documents is introduced. The system works by integrating structured knowledge into its core components. The pre-processing module identifies semantic, lexical, syntactic and structural features. An enrichment module identifies contextually similar concepts and concept maps to improve the representation. The information content of the documents and the enriched content are fused (merged) into the graph structure of a semantic network to form a unified and comprehensive representation of the documents. The 20 Newsgroups and Reuters-21578 datasets are used for evaluation. The evaluation results suggest that the proposed method achieves high accuracy, recall and precision. The results also indicate that the proposed method performs well in standard text mining applications even when only a small portion of the information content is available.</Abstract>
    <ObjectList>
      <Object Type="Keyword">
        <Param Name="Value">Semantic document representation</Param>
      </Object>
      <Object Type="Keyword">
        <Param Name="Value">Ontology</Param>
      </Object>
      <Object Type="Keyword">
        <Param Name="Value">Knowledge base (KB)</Param>
      </Object>
      <Object Type="Keyword">
        <Param Name="Value">Semantic network</Param>
      </Object>
      <Object Type="Keyword">
        <Param Name="Value">Information fusion</Param>
      </Object>
    </ObjectList>
    <ArchiveCopySource DocType="Pdf">http://jist.ir/fa/Article/Download/15287</ArchiveCopySource>
  </ARTICLE>
</ArticleSet>