题名 |
Learning Discriminative Sentiment Chunk Vectors for Twitter Sentiment Analysis |
DOI |
10.6138/JIT.2017.18.7.20170410 |
作者 |
Leiming Yan;Wenying Zheng;Huajie (Harry) Zhang;Hao Tao;Ming He |
关键词 |
Bag-of-words ; Word embedding ; Sentiment analysis ; Deep learning |
期刊名称 |
網際網路技術學刊 |
卷期/出版年月 |
18卷7期(2017 / 12 / 01) |
页次 |
1605 - 1613 |
内容语文 |
英文 |
中文摘要 |
Due to the informal and freely constructed sentence structures, it is a difficult classification task to detect the sentiment polarity of tweets, especially for multi-class cases. Extracting features with more valuable information from tweets is crucial for sentiment analysis. In this paper, to address this problem, a hybrid feature space combining bag-of-words and word embedding, named as Discriminative Sentiment Chunk (DSC) vector, is proposed. Then a semi-supervised method is proposed based on Autoencoder technique to learn discriminative sentiment chunk vectors, which convert a high dimensional bag-of-words vector into a continuous vector space with lower dimension without losing the chunk order. Our experimental results show that using discriminative sentiment chunks gains better accuracies and F1 scores on different twitter datasets and outperforms some popular bag-of-words oriented methods and a few deep network approaches |
主题分类 |
基礎與應用科學 >
資訊科學 |