Feature engineering and machine learning for causality assessment in pharmacovigilance: Lessons learned from application to the FDA Adverse Event Reporting System

Kory Kreimeyer, Oanh Dang, Jonathan Spiker, Monica A. Muñoz, Gary Rosner, Robert Ball, Taxiarchis Botsis

Research output: Contribution to journalArticlepeer-review

Abstract

Background: Our objective was to support the automated classification of Food and Drug Administration (FDA) Adverse Event Reporting System (FAERS) reports for their usefulness in assessing the possibility of a causal relationship between a drug product and an adverse event. Method: We used a data set of 326 redacted FAERS reports that was previously annotated using a modified version of the World Health Organization–Uppsala Monitoring Centre criteria for drug causality assessment by a group of SEs at the FDA and supported a similar study on the classification of reports using supervised machine learning and text engineering methods. We explored many potential features, including the incorporation of natural language processing on report text and information from external data sources, for supervised learning and developed models for predicting the classification status of reports. We then evaluated the models on a larger data set of previously unseen reports. Results: The best-performing models achieved recall and F1 scores on both data sets above 0.80 for the identification of assessable reports (i.e. those containing enough information to make an informed causality assessment) and above 0.75 for the identification of reports meeting at least a Possible causality threshold. Conclusions: Causal inference from FAERS reports depends on many components with complex logical relationships that are yet to be made fully computable. Efforts focused on readily addressable tasks, such as quickly eliminating unassessable reports, fit naturally in SE's thought processes to provide real enhancements for FDA workflows.

Original languageEnglish (US)
Article number104517
JournalComputers in Biology and Medicine
Volume135
DOIs
StatePublished - Aug 2021

Keywords

  • Case classification
  • Causality assessment
  • Clinical natural language processing
  • Decision support
  • Pharmacovigilance

ASJC Scopus subject areas

  • Computer Science Applications
  • Health Informatics

Fingerprint

Dive into the research topics of 'Feature engineering and machine learning for causality assessment in pharmacovigilance: Lessons learned from application to the FDA Adverse Event Reporting System'. Together they form a unique fingerprint.

Cite this