Fightin' words: Lexical feature selection and evaluation for identifying the content of political conflict

Burt L. Monroe, Michael P. Colaresi, Kevin M. Quinn

Research output: Contribution to journal › Article

148 Scopus citations


Entries in the burgeoning "text-as-data" movement are often accompanied by lists or visualizations of how word (or other lexical feature) usage differs across some pair or set of documents. These are intended either to establish some target semantic concept (like the content of partisan frames), to estimate word-specific measures that feed forward into another analysis (like locating parties in ideological space), or both. We discuss a variety of techniques for selecting words that capture partisan, or other, differences in political speech and for evaluating the relative importance of those words. We introduce and emphasize several new approaches based on Bayesian shrinkage and regularization. We illustrate the relative utility of these approaches with analyses of partisan, gender, and distributive speech in the U.S. Senate.

Original language: English (US)
Pages (from-to): 372-403
Number of pages: 32
Journal: Political Analysis
Issue number: 4 SPEC. ISS.
State: Published - Dec 1 2008


All Science Journal Classification (ASJC) codes

  • Sociology and Political Science
  • Political Science and International Relations
