Chuangkrud, Piyawat, Leelanupab, Teerapong, Damrongrat, Chaianun and Kanungsukkasem, Nont (2021) Keyword-Text Graph Representation for Short Text Classification. In: 13th International Conference on Information Technology and Electrical Engineering (ICITEE).
Short text classification is an essential task in Natural Language Processing, with applications such as spam filtering, question answering, artificial conversational agents, sentiment analysis, and review mining. Short texts are challenging to classify because of data sparseness: they do not provide sufficient contextual information. In this paper, we introduce Keyword-Text Graph Convolutional Networks (KwTGCN) for short text classification. We also propose a method to identify keywords by estimating word distribution over different categories. These category keywords are then used to build a keyword-text graph of the short text corpus. We employ a Graph Convolutional Network (GCN) on our keyword-text graph to generate a representation of the short text corpus based on document-keyword and document-word relations as well as word co-occurrence. The resulting document, word, and keyword representations are then used as input features for the next layer, which performs short text classification. The experimental results on multiple benchmark datasets show that our proposed model outperforms the state-of-the-art models for short text classification in multiple attempts.
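As a rough illustration of the keyword step described in the abstract, the sketch below estimates how each word is distributed over categories and keeps words concentrated in a single category, then links documents to those keywords. The abstract does not give the exact scoring rule, so the document-frequency estimate, the `min_count` and `threshold` parameters, and the function names `category_keywords` and `keyword_text_edges` are assumptions made for this example, not the authors' method. The GCN itself is not shown.

```python
from collections import Counter, defaultdict


def category_keywords(docs, labels, min_count=2, threshold=0.8):
    """Pick words whose document frequency is concentrated in one category.

    Assumption: P(category | word) is estimated from document frequencies,
    and a word is a keyword of a category when that estimate reaches
    `threshold`. Both parameters are hypothetical.
    """
    word_cat = defaultdict(Counter)  # word -> {category: document frequency}
    for text, label in zip(docs, labels):
        for word in set(text.lower().split()):
            word_cat[word][label] += 1

    keywords = defaultdict(set)
    for word, counts in word_cat.items():
        total = sum(counts.values())
        if total < min_count:
            continue  # too rare to estimate a distribution
        category, top = counts.most_common(1)[0]
        if top / total >= threshold:
            keywords[category].add(word)
    return dict(keywords)


def keyword_text_edges(docs, keywords):
    """Return (document index, keyword) pairs: the document-keyword edges."""
    all_keywords = set().union(*keywords.values())
    edges = []
    for i, text in enumerate(docs):
        for word in sorted(set(text.lower().split()) & all_keywords):
            edges.append((i, word))
    return edges


# Toy corpus, invented for this example only.
docs = ["win cash prize now", "free cash offer",
        "meeting at noon", "lunch meeting tomorrow"]
labels = ["spam", "spam", "ham", "ham"]

kw = category_keywords(docs, labels)
edges = keyword_text_edges(docs, kw)
```

In a full pipeline these document-keyword edges would be combined with document-word and word co-occurrence edges to form the graph passed to the GCN; how those edges are weighted is not stated in the abstract.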
Item Type:
Conference or Workshop Item (Speech)
Subjects:
Subjects > Computer Science > Artificial Intelligence
Subjects > Computer Science > Computation and Language (Computational Linguistics and Natural Language and Speech Processing)
Subjects > Computer Science > Machine Learning
Deposited by:
Nont Kanungsukkasem
Date Deposited:
2024-11-19 12:39:32
Last Modified:
2024-12-02 11:54:21