1265 Welch Road
Stanford
CA 94305
Jean Coquet PhD,
Postdoctoral Scholar,
BMIR, Stanford University
ABSTRACT:
Clinical care guidelines recommend that diagnosed prostate cancer patients at high risk for metastatic spread receive a bone scan prior to treatment and that low risk patients not receive it. The objective was to develop an automated pipeline to interrogate heterogeneous data to evaluate the use of bone scans using a two different Natural Language Processing (NLP) approaches. Our cohort was divided into risk groups based on Electronic Health Records (EHR). Information on bone scan utilization was identified in both structured data and free text from clinical notes. Our pipeline annotated sentences with a combination of a rule-based method using the ConText algorithm (a generalization of NegEx) and a Convolutional Neural Network (CNN) method using word2vec to produce word embeddings. A total of 5,500 patients and 369,764 notes were included in the study. A total of 39% of patients were high-risk and 73% of these received a bone scan; of the 18% low risk patients, 10% received one. The accuracy of CNN model with one-layer architecture outperformed the rule-based model one (F-measure = 0.918 and 0.806 respectively). We demonstrate a combination of both models could maximize precision or recall, based on the study question.