Latent Gaussian models for topic modeling

Changwei Hu, Eunsu Ryu, David Carlson, Yingjian Wang, Lawrence Carin

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

1 Scopus citation

Abstract

A new approach is proposed for topic modeling, in which the latent matrix factorization employs Gaussian priors, rather than the Dirichlet-class priors widely used in such models. The use of a latent-Gaussian model permits simple and efficient approximate Bayesian posterior inference, via the Laplace approximation. On multiple datasets, the proposed approach is demonstrated to yield results as accurate as state-of-the-art approaches based on Dirichlet constructions, at a small fraction of the computation. The framework is general enough to jointly model text and binary data, here demonstrated to produce accurate and fast results for joint analysis of voting rolls and the associated legislative text. Further, it is demonstrated how the technique may be scaled up to massive data, with encouraging performance relative to alternative methods.
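
The Laplace approximation mentioned in the abstract replaces a posterior with a Gaussian centered at the posterior mode, with variance given by the inverse of the negative Hessian of the log posterior at that mode. The sketch below illustrates the idea on a deliberately tiny stand-in problem, not on the paper's topic model: counts are assumed to follow Poisson(exp(theta)) with a Gaussian prior on the scalar theta. The function name `laplace_poisson_lognormal`, the prior variance, and the example counts are all hypothetical choices made for illustration.

```python
import numpy as np

def laplace_poisson_lognormal(counts, prior_var=1.0, iters=50, tol=1e-10):
    """Laplace approximation for a toy model (an assumption, not the paper's):
    counts[i] ~ Poisson(exp(theta)), theta ~ N(0, prior_var).
    Returns the mean and variance of the Gaussian approximation."""
    counts = np.asarray(counts, dtype=float)
    n, total = counts.size, counts.sum()
    theta = 0.0
    for _ in range(iters):
        rate = np.exp(theta)
        # Gradient and Hessian of the log posterior with respect to theta.
        grad = total - n * rate - theta / prior_var
        hess = -n * rate - 1.0 / prior_var  # strictly negative: log posterior is concave
        step = grad / hess
        theta -= step                       # Newton update toward the mode
        if abs(step) < tol:
            break
    # Approximate posterior variance: inverse of the negative Hessian at the mode.
    hess = -n * np.exp(theta) - 1.0 / prior_var
    return theta, -1.0 / hess

mean, var = laplace_poisson_lognormal([3, 5, 4, 6, 2])
print(f"approximate posterior: N({mean:.3f}, {var:.4f})")
```

Because the Gaussian prior keeps the log posterior smooth and concave in this toy case, a few Newton steps locate the mode and the curvature there supplies the variance directly. This is one way to see why a latent-Gaussian construction can make approximate inference inexpensive.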

Original language: English (US)
Title of host publication: Journal of Machine Learning Research
Publisher: Microtome Publishing (info@mtome.com)
Pages: 393-401
Number of pages: 9
State: Published - Jan 1 2014
Externally published: Yes
