Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data

Brian Hazlehurst; Carla A. Green; Nancy A. Perrin; John Brandes; David S. Carrell; Andrew Baer; Angela DeVeaugh-Geiss; Paul M. Coplan

doi:10.1002/pds.4810

Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data

Brian Hazlehurst, Carla A. Green, Nancy A. Perrin, John Brandes, David S. Carrell, Andrew Baer, Angela DeVeaugh-Geiss, Paul M. Coplan

Research output: Contribution to journal › Article › peer-review

4 Scopus citations

Abstract

Purpose: To enhance automated methods for accurately identifying opioid-related overdoses and classifying types of overdose using electronic health record (EHR) databases. Methods: We developed a natural language processing (NLP) software application to code clinical text documentation of overdose, including identification of intention for self-harm, substances involved, substance abuse, and error in medication usage. Using datasets balanced with cases of suspected overdose and records of individuals at elevated risk for overdose, we developed and validated the application using Kaiser Permanente Northwest data, then tested portability of the application using Kaiser Permanente Washington data. Datasets were chart-reviewed to provide a gold standard for comparison and evaluation of the automated method. Results: The method performed well in identifying overdose (sensitivity = 0.80, specificity = 0.93), intentional overdose (sensitivity = 0.81, specificity = 0.98), and involvement of opioids (excluding heroin, sensitivity = 0.72, specificity = 0.96) and heroin (sensitivity = 0.84, specificity = 1.0). The method performed poorly at identifying adverse drug reactions and overdose due to patient error and fairly at identifying substance abuse in opioid-related unintentional overdose (sensitivity = 0.67, specificity = 0.96). Evaluation using validation datasets yielded significant reductions, in specificity and negative predictive values only, for many classifications mentioned above. However, these measures remained above 0.80, thus, performance observed during development was largely maintained during validation. Similar results were obtained when evaluating portability, although there was a significant reduction in sensitivity for unintentional overdose that was attributed to missing text clinical notes in the database. Conclusions: Methods that process text clinical notes show promise for improving accuracy and fidelity at identifying and classifying overdoses according to type using EHR data.

Original language	English (US)
Pages (from-to)	1143-1151
Number of pages	9
Journal	Pharmacoepidemiology and Drug Safety
Volume	28
Issue number	8
DOIs	https://doi.org/10.1002/pds.4810
State	Published - Aug 1 2019
Externally published	Yes

Keywords

electronic health records
methods
natural language processing
opioid overdose
pharmacoepidemiology

ASJC Scopus subject areas

Epidemiology
Pharmacology (medical)

Access to Document

10.1002/pds.4810

Cite this

@article{7783bb5fd3e04799b423f4fefda235f7,

title = "Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data",

abstract = "Purpose: To enhance automated methods for accurately identifying opioid-related overdoses and classifying types of overdose using electronic health record (EHR) databases. Methods: We developed a natural language processing (NLP) software application to code clinical text documentation of overdose, including identification of intention for self-harm, substances involved, substance abuse, and error in medication usage. Using datasets balanced with cases of suspected overdose and records of individuals at elevated risk for overdose, we developed and validated the application using Kaiser Permanente Northwest data, then tested portability of the application using Kaiser Permanente Washington data. Datasets were chart-reviewed to provide a gold standard for comparison and evaluation of the automated method. Results: The method performed well in identifying overdose (sensitivity = 0.80, specificity = 0.93), intentional overdose (sensitivity = 0.81, specificity = 0.98), and involvement of opioids (excluding heroin, sensitivity = 0.72, specificity = 0.96) and heroin (sensitivity = 0.84, specificity = 1.0). The method performed poorly at identifying adverse drug reactions and overdose due to patient error and fairly at identifying substance abuse in opioid-related unintentional overdose (sensitivity = 0.67, specificity = 0.96). Evaluation using validation datasets yielded significant reductions, in specificity and negative predictive values only, for many classifications mentioned above. However, these measures remained above 0.80, thus, performance observed during development was largely maintained during validation. Similar results were obtained when evaluating portability, although there was a significant reduction in sensitivity for unintentional overdose that was attributed to missing text clinical notes in the database. Conclusions: Methods that process text clinical notes show promise for improving accuracy and fidelity at identifying and classifying overdoses according to type using EHR data.",

keywords = "electronic health records, methods, natural language processing, opioid overdose, pharmacoepidemiology",

author = "Brian Hazlehurst and Green, {Carla A.} and Perrin, {Nancy A.} and John Brandes and Carrell, {David S.} and Andrew Baer and Angela DeVeaugh-Geiss and Coplan, {Paul M.}",

note = "Publisher Copyright: {\textcopyright} 2019 The Authors Pharmacoepidemiology and Drug Safety Published by John Wiley & Sons Ltd",

year = "2019",

month = aug,

day = "1",

doi = "10.1002/pds.4810",

language = "English (US)",

volume = "28",

pages = "1143--1151",

journal = "Pharmacoepidemiology and Drug Safety",

issn = "1053-8569",

publisher = "John Wiley and Sons Ltd",

number = "8",

}

TY - JOUR

T1 - Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data

AU - Hazlehurst, Brian

AU - Green, Carla A.

AU - Perrin, Nancy A.

AU - Brandes, John

AU - Carrell, David S.

AU - Baer, Andrew

AU - DeVeaugh-Geiss, Angela

AU - Coplan, Paul M.

PY - 2019/8/1

Y1 - 2019/8/1

N2 - Purpose: To enhance automated methods for accurately identifying opioid-related overdoses and classifying types of overdose using electronic health record (EHR) databases. Methods: We developed a natural language processing (NLP) software application to code clinical text documentation of overdose, including identification of intention for self-harm, substances involved, substance abuse, and error in medication usage. Using datasets balanced with cases of suspected overdose and records of individuals at elevated risk for overdose, we developed and validated the application using Kaiser Permanente Northwest data, then tested portability of the application using Kaiser Permanente Washington data. Datasets were chart-reviewed to provide a gold standard for comparison and evaluation of the automated method. Results: The method performed well in identifying overdose (sensitivity = 0.80, specificity = 0.93), intentional overdose (sensitivity = 0.81, specificity = 0.98), and involvement of opioids (excluding heroin, sensitivity = 0.72, specificity = 0.96) and heroin (sensitivity = 0.84, specificity = 1.0). The method performed poorly at identifying adverse drug reactions and overdose due to patient error and fairly at identifying substance abuse in opioid-related unintentional overdose (sensitivity = 0.67, specificity = 0.96). Evaluation using validation datasets yielded significant reductions, in specificity and negative predictive values only, for many classifications mentioned above. However, these measures remained above 0.80, thus, performance observed during development was largely maintained during validation. Similar results were obtained when evaluating portability, although there was a significant reduction in sensitivity for unintentional overdose that was attributed to missing text clinical notes in the database. Conclusions: Methods that process text clinical notes show promise for improving accuracy and fidelity at identifying and classifying overdoses according to type using EHR data.

AB - Purpose: To enhance automated methods for accurately identifying opioid-related overdoses and classifying types of overdose using electronic health record (EHR) databases. Methods: We developed a natural language processing (NLP) software application to code clinical text documentation of overdose, including identification of intention for self-harm, substances involved, substance abuse, and error in medication usage. Using datasets balanced with cases of suspected overdose and records of individuals at elevated risk for overdose, we developed and validated the application using Kaiser Permanente Northwest data, then tested portability of the application using Kaiser Permanente Washington data. Datasets were chart-reviewed to provide a gold standard for comparison and evaluation of the automated method. Results: The method performed well in identifying overdose (sensitivity = 0.80, specificity = 0.93), intentional overdose (sensitivity = 0.81, specificity = 0.98), and involvement of opioids (excluding heroin, sensitivity = 0.72, specificity = 0.96) and heroin (sensitivity = 0.84, specificity = 1.0). The method performed poorly at identifying adverse drug reactions and overdose due to patient error and fairly at identifying substance abuse in opioid-related unintentional overdose (sensitivity = 0.67, specificity = 0.96). Evaluation using validation datasets yielded significant reductions, in specificity and negative predictive values only, for many classifications mentioned above. However, these measures remained above 0.80, thus, performance observed during development was largely maintained during validation. Similar results were obtained when evaluating portability, although there was a significant reduction in sensitivity for unintentional overdose that was attributed to missing text clinical notes in the database. Conclusions: Methods that process text clinical notes show promise for improving accuracy and fidelity at identifying and classifying overdoses according to type using EHR data.

KW - electronic health records

KW - methods

KW - natural language processing

KW - opioid overdose

KW - pharmacoepidemiology

UR - http://www.scopus.com/inward/record.url?scp=85067473767&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85067473767&partnerID=8YFLogxK

U2 - 10.1002/pds.4810

DO - 10.1002/pds.4810

M3 - Article

C2 - 31218780

AN - SCOPUS:85067473767

SN - 1053-8569

VL - 28

SP - 1143

EP - 1151

JO - Pharmacoepidemiology and Drug Safety

JF - Pharmacoepidemiology and Drug Safety

IS - 8

ER -

Using natural language processing of clinical text to enhance identification of opioid-related overdoses in electronic health records data

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this