Predicting twitter user demographics from names alone

Zach Wood-Doughty, Nicholas Andrews, Rebecca Marvin, Mark Dredze

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Social media analysis frequently requires tools that can automatically infer demographics to contextualize trends. These tools often require hundreds of user-authored messages for each user, which may be prohibitive to obtain when analyzing millions of users. We explore character-level neural models that learn a representation of a user's name and screen name to predict gender and ethnicity, allowing for demographic inference with minimal data. We release trained models which may enable new demographic analyses that would otherwise require enormous amounts of data collection.

Original languageEnglish (US)
Title of host publicationProceedings of the 2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HTL 2018
EditorsMalvina Nissim, Viviana Patti, Barbara Plank, Claudia Wagner
PublisherAssociation for Computational Linguistics (ACL)
Pages105-111
Number of pages7
ISBN (Electronic)9781948087179
StatePublished - 2018
Externally publishedYes
Event2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018 - New Orleans, United States
Duration: Jun 6 2018 → …

Publication series

NameProceedings of the 2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018

Conference

Conference2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018
Country/TerritoryUnited States
CityNew Orleans
Period6/6/18 → …

ASJC Scopus subject areas

  • Computer Science Applications
  • Artificial Intelligence
  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Predicting twitter user demographics from names alone'. Together they form a unique fingerprint.

Cite this