Identifying the role of individual user messages in an online discussion and its use in thread retrieval

Sumit Bhatia, Prakhar Biyani, Prasenjit Mitra

Research output: Contribution to journalArticle

10 Citations (Scopus)

Abstract

Online discussion forums have become a popular medium for users to discuss with and seek information from other users having similar interests. A typical discussion thread consists of a sequence of posts posted by multiple users. Each post in a thread serves a different purpose providing different types of information and, thus, may not be equally useful for all applications. Identifying the purpose and nature of each post in a discussion thread is thus an interesting research problem as it can help in improving information extraction and intelligent assistance techniques. We study the problem of classifying a given post as per its purpose in the discussion thread and employ features based on the post's content, structure of the thread, behavior of the participating users, and sentiment analysis of the post's content. We evaluate our approach on two forum data sets belonging to different genres and achieve strong classification performance. We also analyze the relative importance of different features used for the post classification task. Next, as a use case, we describe how the post class information can help in thread retrieval by incorporating this information in a state-of-the-art thread retrieval model.

Original languageEnglish (US)
Pages (from-to)276-288
Number of pages13
JournalJournal of the Association for Information Science and Technology
Volume67
Issue number2
DOIs
StatePublished - Feb 1 2016

Fingerprint

genre
assistance
Thread
performance
Sentiment analysis
Relative importance
Information extraction

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Networks and Communications
  • Information Systems and Management
  • Library and Information Sciences

Cite this

@article{3798e9ca3d014497adf6b62a4e300757,
title = "Identifying the role of individual user messages in an online discussion and its use in thread retrieval",
abstract = "Online discussion forums have become a popular medium for users to discuss with and seek information from other users having similar interests. A typical discussion thread consists of a sequence of posts posted by multiple users. Each post in a thread serves a different purpose providing different types of information and, thus, may not be equally useful for all applications. Identifying the purpose and nature of each post in a discussion thread is thus an interesting research problem as it can help in improving information extraction and intelligent assistance techniques. We study the problem of classifying a given post as per its purpose in the discussion thread and employ features based on the post's content, structure of the thread, behavior of the participating users, and sentiment analysis of the post's content. We evaluate our approach on two forum data sets belonging to different genres and achieve strong classification performance. We also analyze the relative importance of different features used for the post classification task. Next, as a use case, we describe how the post class information can help in thread retrieval by incorporating this information in a state-of-the-art thread retrieval model.",
author = "Sumit Bhatia and Prakhar Biyani and Prasenjit Mitra",
year = "2016",
month = "2",
day = "1",
doi = "10.1002/asi.23373",
language = "English (US)",
volume = "67",
pages = "276--288",
journal = "Journal of the Association for Information Science and Technology",
issn = "2330-1635",
publisher = "John Wiley and Sons Ltd",
number = "2",

}

Identifying the role of individual user messages in an online discussion and its use in thread retrieval. / Bhatia, Sumit; Biyani, Prakhar; Mitra, Prasenjit.

In: Journal of the Association for Information Science and Technology, Vol. 67, No. 2, 01.02.2016, p. 276-288.

Research output: Contribution to journalArticle

TY - JOUR

T1 - Identifying the role of individual user messages in an online discussion and its use in thread retrieval

AU - Bhatia, Sumit

AU - Biyani, Prakhar

AU - Mitra, Prasenjit

PY - 2016/2/1

Y1 - 2016/2/1

N2 - Online discussion forums have become a popular medium for users to discuss with and seek information from other users having similar interests. A typical discussion thread consists of a sequence of posts posted by multiple users. Each post in a thread serves a different purpose providing different types of information and, thus, may not be equally useful for all applications. Identifying the purpose and nature of each post in a discussion thread is thus an interesting research problem as it can help in improving information extraction and intelligent assistance techniques. We study the problem of classifying a given post as per its purpose in the discussion thread and employ features based on the post's content, structure of the thread, behavior of the participating users, and sentiment analysis of the post's content. We evaluate our approach on two forum data sets belonging to different genres and achieve strong classification performance. We also analyze the relative importance of different features used for the post classification task. Next, as a use case, we describe how the post class information can help in thread retrieval by incorporating this information in a state-of-the-art thread retrieval model.

AB - Online discussion forums have become a popular medium for users to discuss with and seek information from other users having similar interests. A typical discussion thread consists of a sequence of posts posted by multiple users. Each post in a thread serves a different purpose providing different types of information and, thus, may not be equally useful for all applications. Identifying the purpose and nature of each post in a discussion thread is thus an interesting research problem as it can help in improving information extraction and intelligent assistance techniques. We study the problem of classifying a given post as per its purpose in the discussion thread and employ features based on the post's content, structure of the thread, behavior of the participating users, and sentiment analysis of the post's content. We evaluate our approach on two forum data sets belonging to different genres and achieve strong classification performance. We also analyze the relative importance of different features used for the post classification task. Next, as a use case, we describe how the post class information can help in thread retrieval by incorporating this information in a state-of-the-art thread retrieval model.

UR - http://www.scopus.com/inward/record.url?scp=84957009372&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84957009372&partnerID=8YFLogxK

U2 - 10.1002/asi.23373

DO - 10.1002/asi.23373

M3 - Article

AN - SCOPUS:84957009372

VL - 67

SP - 276

EP - 288

JO - Journal of the Association for Information Science and Technology

JF - Journal of the Association for Information Science and Technology

SN - 2330-1635

IS - 2

ER -