Johns Hopkins or Johnny-Hopkins: Classifying individuals versus organizations on Twitter

Zach Wood-Doughty, Praateek Mahajan, Mark Dredze

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

Twitter accounts include a range of different types of users. While many individuals use Twitter, organizations also have Twitter accounts. Identifying opinions and trends from Twitter requires the accurate differentiation of these two groups. Previous work (McCorriston et al., 2015) presented a method for determining if an account was an individual or organization based on account profile and a collection of tweets. We present a method that relies solely on the account profile, allowing for the classification of individuals versus organizations based on a single tweet. Our method obtains accuracies comparable to methods that rely on much more information by leveraging two improvements: a character-based convolutional neural network, and an automatically-derived corpus an order of magnitude larger than the previously available dataset. We make both the dataset and the resulting tool available.

Original languageEnglish (US)
Title of host publicationProceedings of the 2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics
Subtitle of host publicationHuman Language Technologies, NAACL-HTL 2018
EditorsMalvina Nissim, Viviana Patti, Barbara Plank, Claudia Wagner
PublisherAssociation for Computational Linguistics (ACL)
Pages56-61
Number of pages6
ISBN (Electronic)9781948087179
StatePublished - 2018
Externally publishedYes
Event2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018 - New Orleans, United States
Duration: Jun 6 2018 → …

Publication series

NameProceedings of the 2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018

Conference

Conference2nd Workshop on Computational Modeling of PFople's Opinions, PersonaLity, and Emotions in Social Media, PEOPLES 2018 at the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, NAACL-HTL 2018
Country/TerritoryUnited States
CityNew Orleans
Period6/6/18 → …

ASJC Scopus subject areas

  • Computer Science Applications
  • Artificial Intelligence
  • Language and Linguistics
  • Linguistics and Language

Fingerprint

Dive into the research topics of 'Johns Hopkins or Johnny-Hopkins: Classifying individuals versus organizations on Twitter'. Together they form a unique fingerprint.

Cite this