Generating summary keywords for emails using topics

Mark Dredze, Hanna M. Wallach, Danny Puller, Fernando Pereira

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

Abstract

Email summary keywords, used to concisely represent the gist of an email, can help users manage and prioritize large numbers of messages. We develop an unsupervised learning framework for selecting summary keywords from emails using latent representations of the underlying topics in a user's mailbox. This approach selects words that describe each message in the context of existing topics rather than simply selecting keywords based on a single message in isolation. We present and compare four methods for selecting summary keywords based on two well-known models for inferring latent topics: latent semantic analysis and latent Dirichlet allocation. The quality of the summary keywords is assessed by generating summaries for emails from twelve users in the Enron corpus. The summary keywords are then used in place of entire messages in two proxy tasks: automated foldering and recipient prediction. We also evaluate the extent to which summary keywords enhance the information already available in a typical email user interface by repeating the same tasks using email subject lines.
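
As a rough illustration of selecting keywords in the context of a whole mailbox rather than from a single message, the sketch below ranks the words of one message by their weight in a low-rank latent semantic analysis reconstruction of a term-count matrix. It is a minimal sketch under stated assumptions, not the paper's method: the function name `lsa_keywords`, the toy vocabulary, the counts, and the `rank` and `top_n` parameters are all hypothetical, and the scoring rule is one simple choice among many.

```python
import numpy as np


def lsa_keywords(counts, vocab, doc_index, rank=2, top_n=3):
    """Return up to top_n words of one message, ranked by LSA-smoothed weight.

    Hypothetical helper for illustration; not the selection rule from the paper.
    counts is an (n_messages, n_terms) matrix of raw term counts.
    """
    # Truncated SVD of the term-count matrix gives the latent "topic" space.
    u, s, vt = np.linalg.svd(counts.astype(float), full_matrices=False)
    k = min(rank, len(s))
    # Rank-k reconstruction: each message's counts smoothed by mailbox-wide structure.
    approx = (u[:, :k] * s[:k]) @ vt[:k, :]
    scores = approx[doc_index].copy()
    # Restrict candidates to words that actually occur in the message.
    present = counts[doc_index] > 0
    scores[~present] = -np.inf
    n_keep = min(top_n, int(present.sum()))
    order = np.argsort(-scores)[:n_keep]
    return [vocab[i] for i in order]


# Toy mailbox (assumed data): four messages over a five-word vocabulary.
vocab = ["meeting", "schedule", "budget", "invoice", "lunch"]
counts = np.array([
    [2, 1, 0, 0, 0],
    [1, 2, 0, 0, 1],
    [0, 0, 2, 1, 0],
    [0, 0, 1, 2, 0],
])
keywords = lsa_keywords(counts, vocab, doc_index=1, rank=2, top_n=2)
print(keywords)
```

Because the scores come from the reconstruction rather than the raw counts, a word's rank depends on how it co-occurs with other words across the mailbox, which is the general idea the abstract describes.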

Original language: English (US)
Title of host publication: Proceedings of the 13th International Conference on Intelligent User Interfaces 2008, IUI'08
Pages: 199-206
Number of pages: 8
State: Published - 2008
Externally published: Yes
Event: 13th International Conference on Intelligent User Interfaces 2008, IUI'08 - Maspalomas, Gran Canaria, Spain
Duration: Jan 13 2008 - Jan 16 2008

Publication series

Name: International Conference on Intelligent User Interfaces, Proceedings IUI

Conference

Conference: 13th International Conference on Intelligent User Interfaces 2008, IUI'08
Country/Territory: Spain
City: Maspalomas, Gran Canaria
Period: 1/13/08 - 1/16/08

Keywords

  • Email
  • Foldering
  • Keyword generation
  • Recipient prediction
  • Topic modeling

ASJC Scopus subject areas

  • Software
  • Human-Computer Interaction
