Sentiment and topic analysis on social media: A multi-task multi-label classification approach

Shu Huang, Wei Peng, Jingxuan Li, Dongwon Lee

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

24 Scopus citations

Abstract

Both sentiment analysis and topic classification are frequently used in customer care and marketing. They can help people understand brand perception and customer opinions from social media, such as online posts, tweets, forums, and blogs. As such, in recent years, many solutions have been proposed for both tasks. However, we believe that the following two problems have not been addressed adequately: (1) Conventional solutions usually treat the two tasks in isolation. When the two tasks are closely related (e.g., posts about "customer care" often have a "negative" tone), exploiting their correlation may yield better accuracy; (2) Each post is usually assigned only one sentiment label and one topic label. Since social media is, compared to traditional document corpora, noisier, more ambiguous, and sparser, single-label classification may not be able to capture the post classes accurately. To address these two problems, in this paper, we propose a multi-task multi-label (MTML) classification model that performs classification of both sentiments and topics concurrently. It incorporates the results of each task from prior steps to promote and reinforce the other iteratively. For each task, the model is trained with multiple labels so that they can help address class ambiguity. In the empirical validation, we compare the accuracy of the MTML model against four competing methods in two different settings. Results show that MTML produces much higher accuracy for both sentiment and topic classification.
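
The iterative, cross-task idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' MTML model: the abstract does not specify the learner, so a toy co-occurrence scorer is swapped in, and every name here (train, predict, augment, fit, classify, POSTS, the 0.5 threshold, the round limit) is a hypothetical choice made for illustration. The sketch only shows the two properties the abstract names: each task may emit several labels, and each task reuses the other task's labels from the previous step as extra features.

```python
from collections import defaultdict

def train(examples):
    """Toy scorer (an assumption, not the paper's learner).
    examples: list of (feature_set, label_set).
    Returns weights[label][feature] = fraction of examples containing
    `feature` that also carry `label`."""
    feat_count = defaultdict(int)
    joint = defaultdict(lambda: defaultdict(int))
    for feats, labels in examples:
        for f in feats:
            feat_count[f] += 1
            for label in labels:
                joint[label][f] += 1
    return {label: {f: c / feat_count[f] for f, c in fc.items()}
            for label, fc in joint.items()}

def predict(weights, feats, threshold=0.5):
    """Multi-label prediction: keep every label whose mean feature
    weight reaches the threshold (zero, one, or several labels)."""
    chosen = set()
    for label, w in weights.items():
        score = sum(w.get(f, 0.0) for f in feats) / max(len(feats), 1)
        if score >= threshold:
            chosen.add(label)
    return chosen

def augment(tokens, other_labels, prefix):
    """Add the other task's labels to the token features."""
    return set(tokens) | {f"{prefix}={label}" for label in other_labels}

def fit(posts):
    """posts: list of (tokens, sentiment_labels, topic_labels).
    Each task is trained with the other task's labels as features."""
    sent_model = train([(augment(t, tp, "topic"), s) for t, s, tp in posts])
    topic_model = train([(augment(t, s, "sent"), tp) for t, s, tp in posts])
    return sent_model, topic_model

def classify(tokens, sent_model, topic_model, rounds=5):
    """Alternate between tasks, feeding each the other's labels from
    the previous round, until the labels stop changing."""
    sent, topic = set(), set()
    for _ in range(rounds):
        new_sent = predict(sent_model, augment(tokens, topic, "topic"))
        new_topic = predict(topic_model, augment(tokens, sent, "sent"))
        if (new_sent, new_topic) == (sent, topic):
            break
        sent, topic = new_sent, new_topic
    return sent, topic

# Made-up training posts, for illustration only.
POSTS = [
    ({"refund", "late"}, {"negative"}, {"customer care"}),
    ({"agent", "rude"}, {"negative"}, {"customer care"}),
    ({"love", "camera"}, {"positive"}, {"product"}),
    ({"camera", "broken"}, {"negative"}, {"product", "customer care"}),
]

sent_model, topic_model = fit(POSTS)
print(classify({"refund", "rude"}, sent_model, topic_model))
# prints ({'negative'}, {'customer care'})
```

On this toy data, a post containing "camera" and "broken" receives two topic labels ("product" and "customer care") alongside "negative", which is the multi-label behavior the abstract argues for; whether a real model behaves this way depends on the learner and data.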

Original language: English (US)
Title of host publication: Proceedings of the 5th Annual ACM Web Science Conference, WebSci'13
Publisher: Association for Computing Machinery
Pages: 172-181
Number of pages: 10
ISBN (Print): 9781450318891
DOIs: https://doi.org/10.1145/2464464.2464512
State: Published - Jan 1 2013
Event: 3rd Annual ACM Web Science Conference, WebSci 2013 - Paris, France
Duration: May 2 2013 to May 4 2013

Publication series

Name: Proceedings of the 5th Annual ACM Web Science Conference, WebSci'13

Other

Other: 3rd Annual ACM Web Science Conference, WebSci 2013
Country: France
City: Paris
Period: 5/2/13 to 5/4/13

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Cite this

Huang, S., Peng, W., Li, J., & Lee, D. (2013). Sentiment and topic analysis on social media: A multi-task multi-label classification approach. In Proceedings of the 5th Annual ACM Web Science Conference, WebSci'13 (pp. 172-181). Association for Computing Machinery. https://doi.org/10.1145/2464464.2464512