Text classification algorithm study based on rough set theory

Xun Lin, Zhishu Li, Yong Zhou, Yuan Xue

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Text Classification is an important research area in Chinese information processing, whose goal is on the base of analyzing the text content to give the allocation of one or more of the text to more appropriate classes to enhance the text retrieval, storage, applications such as processing efficiency.In this paper, text dataset is transformed to information system without attribute of decision making and the core content of attribute reduction has been applied to text classification. Experiment shows that the precision rate and recall rate are enhanced in this method; furthermore, it does not require any a priori information .In this paper, The first Determination of the text vector, The second generates Text set information systems, The third Attribute value discretization.

Original languageEnglish (US)
Title of host publicationProceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010
Pages117-120
Number of pages4
DOIs
StatePublished - Dec 1 2010
Event2010 International Forum on Information Technology and Applications, IFITA 2010 - Kunming, China
Duration: Jul 16 2010Jul 18 2010

Publication series

NameProceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010
Volume1

Other

Other2010 International Forum on Information Technology and Applications, IFITA 2010
CountryChina
CityKunming
Period7/16/107/18/10

Fingerprint

Rough set theory
Information systems
Decision making
Processing
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications

Cite this

Lin, X., Li, Z., Zhou, Y., & Xue, Y. (2010). Text classification algorithm study based on rough set theory. In Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010 (pp. 117-120). [5635167] (Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010; Vol. 1). https://doi.org/10.1109/IFITA.2010.203
Lin, Xun ; Li, Zhishu ; Zhou, Yong ; Xue, Yuan. / Text classification algorithm study based on rough set theory. Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010. 2010. pp. 117-120 (Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010).
@inproceedings{313d43b6574144a7baf79280ce41b6c9,
title = "Text classification algorithm study based on rough set theory",
abstract = "Text Classification is an important research area in Chinese information processing, whose goal is on the base of analyzing the text content to give the allocation of one or more of the text to more appropriate classes to enhance the text retrieval, storage, applications such as processing efficiency.In this paper, text dataset is transformed to information system without attribute of decision making and the core content of attribute reduction has been applied to text classification. Experiment shows that the precision rate and recall rate are enhanced in this method; furthermore, it does not require any a priori information .In this paper, The first Determination of the text vector, The second generates Text set information systems, The third Attribute value discretization.",
author = "Xun Lin and Zhishu Li and Yong Zhou and Yuan Xue",
year = "2010",
month = "12",
day = "1",
doi = "10.1109/IFITA.2010.203",
language = "English (US)",
isbn = "9780769541150",
series = "Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010",
pages = "117--120",
booktitle = "Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010",

}

Lin, X, Li, Z, Zhou, Y & Xue, Y 2010, Text classification algorithm study based on rough set theory. in Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010., 5635167, Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010, vol. 1, pp. 117-120, 2010 International Forum on Information Technology and Applications, IFITA 2010, Kunming, China, 7/16/10. https://doi.org/10.1109/IFITA.2010.203

Text classification algorithm study based on rough set theory. / Lin, Xun; Li, Zhishu; Zhou, Yong; Xue, Yuan.

Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010. 2010. p. 117-120 5635167 (Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010; Vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Text classification algorithm study based on rough set theory

AU - Lin, Xun

AU - Li, Zhishu

AU - Zhou, Yong

AU - Xue, Yuan

PY - 2010/12/1

Y1 - 2010/12/1

N2 - Text Classification is an important research area in Chinese information processing, whose goal is on the base of analyzing the text content to give the allocation of one or more of the text to more appropriate classes to enhance the text retrieval, storage, applications such as processing efficiency.In this paper, text dataset is transformed to information system without attribute of decision making and the core content of attribute reduction has been applied to text classification. Experiment shows that the precision rate and recall rate are enhanced in this method; furthermore, it does not require any a priori information .In this paper, The first Determination of the text vector, The second generates Text set information systems, The third Attribute value discretization.

AB - Text Classification is an important research area in Chinese information processing, whose goal is on the base of analyzing the text content to give the allocation of one or more of the text to more appropriate classes to enhance the text retrieval, storage, applications such as processing efficiency.In this paper, text dataset is transformed to information system without attribute of decision making and the core content of attribute reduction has been applied to text classification. Experiment shows that the precision rate and recall rate are enhanced in this method; furthermore, it does not require any a priori information .In this paper, The first Determination of the text vector, The second generates Text set information systems, The third Attribute value discretization.

UR - http://www.scopus.com/inward/record.url?scp=79952179449&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=79952179449&partnerID=8YFLogxK

U2 - 10.1109/IFITA.2010.203

DO - 10.1109/IFITA.2010.203

M3 - Conference contribution

AN - SCOPUS:79952179449

SN - 9780769541150

T3 - Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010

SP - 117

EP - 120

BT - Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010

ER -

Lin X, Li Z, Zhou Y, Xue Y. Text classification algorithm study based on rough set theory. In Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010. 2010. p. 117-120. 5635167. (Proceedings - 2010 International Forum on Information Technology and Applications, IFITA 2010). https://doi.org/10.1109/IFITA.2010.203