Automated Detection of Off-Label Drug Use

Posted on February 5, 2016

Overview of methods and results.

Overview of methods and results.

Training and testing a classifier to recognize used-to-treat relationships.

Training and testing a classifier to recognize used-to-treat relationships.

Distribution of indication classes in predicted novel usages.

Distribution of indication classes in predicted novel usages.

Using prior knowledge to calculate drug-drug and indication-indication similarity.

Using prior knowledge to calculate drug-drug and indication-indication similarity.

Abstract

Off-label drug use, defined as use of a drug in a manner that deviates from its approved use defined by the drug’s FDA label, is problematic because such uses have not been evaluated for safety and efficacy. Studies estimate that 21% of prescriptions are off-label, and only 27% of those have evidence of safety and efficacy. We describe a data-mining approach for systematically identifying off-label usages using features derived from free text clinical notes and features extracted from two databases on known usage (Medi-Span and DrugBank). We trained a highly accurate predictive model that detects novel off-label uses among 1,602 unique drugs and 1,472 unique indications. We validated 403 predicted uses across independent data sources. Finally, we prioritize well-supported novel usages for further investigation on the basis of drug safety and cost.