A non-parametric approach to pair-wise dynamic topic correlation detection

Song Yang, Zhang Lu, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We introduce dynamic correlated topic models (DCTM) for analyzing discrete data over time. This model is inspired by the hierarchical Gaussian process latent variable models (GP-LVM). DCTM is essentially a non-linear dimension reduction technique which is capable of (1) detecting topic evolution within a document corpus, (2) discovering topic correlations between document corpora, (3) monitoring topic and correlation trends dynamically. Unlike generative aspect models such like LDA, DCTM demonstrates a much faster converging rate with better model fitting to the data. We empirically assess our approach using 268,231 scientific documents, from the year 1988 to 2005. Posterior inferences suggest that DCTM is useful for capturing topic and correlation dynamics, as well as predicting their trends.

Original languageEnglish (US)
Title of host publicationProceedings - 8th IEEE International Conference on Data Mining, ICDM 2008
Pages1031-1036
Number of pages6
DOIs
StatePublished - Dec 1 2008
Event8th IEEE International Conference on Data Mining, ICDM 2008 - Pisa, Italy
Duration: Dec 15 2008Dec 19 2008

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
ISSN (Print)1550-4786

Other

Other8th IEEE International Conference on Data Mining, ICDM 2008
CountryItaly
CityPisa
Period12/15/0812/19/08

Fingerprint

Monitoring

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Yang, S., Lu, Z., & Giles, C. L. (2008). A non-parametric approach to pair-wise dynamic topic correlation detection. In Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008 (pp. 1031-1036). [4781220] (Proceedings - IEEE International Conference on Data Mining, ICDM). https://doi.org/10.1109/ICDM.2008.20
Yang, Song ; Lu, Zhang ; Giles, C. Lee. / A non-parametric approach to pair-wise dynamic topic correlation detection. Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008. 2008. pp. 1031-1036 (Proceedings - IEEE International Conference on Data Mining, ICDM).
@inproceedings{c5c10dd652284ce4baafabe04bf8298b,
title = "A non-parametric approach to pair-wise dynamic topic correlation detection",
abstract = "We introduce dynamic correlated topic models (DCTM) for analyzing discrete data over time. This model is inspired by the hierarchical Gaussian process latent variable models (GP-LVM). DCTM is essentially a non-linear dimension reduction technique which is capable of (1) detecting topic evolution within a document corpus, (2) discovering topic correlations between document corpora, (3) monitoring topic and correlation trends dynamically. Unlike generative aspect models such like LDA, DCTM demonstrates a much faster converging rate with better model fitting to the data. We empirically assess our approach using 268,231 scientific documents, from the year 1988 to 2005. Posterior inferences suggest that DCTM is useful for capturing topic and correlation dynamics, as well as predicting their trends.",
author = "Song Yang and Zhang Lu and Giles, {C. Lee}",
year = "2008",
month = "12",
day = "1",
doi = "10.1109/ICDM.2008.20",
language = "English (US)",
isbn = "9780769535029",
series = "Proceedings - IEEE International Conference on Data Mining, ICDM",
pages = "1031--1036",
booktitle = "Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008",

}

Yang, S, Lu, Z & Giles, CL 2008, A non-parametric approach to pair-wise dynamic topic correlation detection. in Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008., 4781220, Proceedings - IEEE International Conference on Data Mining, ICDM, pp. 1031-1036, 8th IEEE International Conference on Data Mining, ICDM 2008, Pisa, Italy, 12/15/08. https://doi.org/10.1109/ICDM.2008.20

A non-parametric approach to pair-wise dynamic topic correlation detection. / Yang, Song; Lu, Zhang; Giles, C. Lee.

Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008. 2008. p. 1031-1036 4781220 (Proceedings - IEEE International Conference on Data Mining, ICDM).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - A non-parametric approach to pair-wise dynamic topic correlation detection

AU - Yang, Song

AU - Lu, Zhang

AU - Giles, C. Lee

PY - 2008/12/1

Y1 - 2008/12/1

N2 - We introduce dynamic correlated topic models (DCTM) for analyzing discrete data over time. This model is inspired by the hierarchical Gaussian process latent variable models (GP-LVM). DCTM is essentially a non-linear dimension reduction technique which is capable of (1) detecting topic evolution within a document corpus, (2) discovering topic correlations between document corpora, (3) monitoring topic and correlation trends dynamically. Unlike generative aspect models such like LDA, DCTM demonstrates a much faster converging rate with better model fitting to the data. We empirically assess our approach using 268,231 scientific documents, from the year 1988 to 2005. Posterior inferences suggest that DCTM is useful for capturing topic and correlation dynamics, as well as predicting their trends.

AB - We introduce dynamic correlated topic models (DCTM) for analyzing discrete data over time. This model is inspired by the hierarchical Gaussian process latent variable models (GP-LVM). DCTM is essentially a non-linear dimension reduction technique which is capable of (1) detecting topic evolution within a document corpus, (2) discovering topic correlations between document corpora, (3) monitoring topic and correlation trends dynamically. Unlike generative aspect models such like LDA, DCTM demonstrates a much faster converging rate with better model fitting to the data. We empirically assess our approach using 268,231 scientific documents, from the year 1988 to 2005. Posterior inferences suggest that DCTM is useful for capturing topic and correlation dynamics, as well as predicting their trends.

UR - http://www.scopus.com/inward/record.url?scp=67049116452&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=67049116452&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2008.20

DO - 10.1109/ICDM.2008.20

M3 - Conference contribution

AN - SCOPUS:67049116452

SN - 9780769535029

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 1031

EP - 1036

BT - Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008

ER -

Yang S, Lu Z, Giles CL. A non-parametric approach to pair-wise dynamic topic correlation detection. In Proceedings - 8th IEEE International Conference on Data Mining, ICDM 2008. 2008. p. 1031-1036. 4781220. (Proceedings - IEEE International Conference on Data Mining, ICDM). https://doi.org/10.1109/ICDM.2008.20