Description
Objective: Up to 34% of individuals who undergo anterior cruciate ligament reconstruction (ACLR) require an unplanned subsequent knee surgery, including surgeries for failed ACLR, contralateral anterior cruciate ligament tear, meniscus or cartilage injury, and loss of motion. Electronic health record (EHR) data provides a rich resource for studying these procedures, but identification can be costly when chart review is used or inaccurate when claims-based codes are used. The purpose of this study was to determine the performance of 1) structured data (diagnoses and procedure codes) alone versus 2) structured data plus information extracted from unstructured operative reports using natural language processing (NLP) to identify cases of subsequent knee surgery after ACLR.
Results: The records of 378 individuals were reviewed, including 328 cases in the derivation set (169 cases with subsequent knee surgery CPT codes; 159 without) and 50 in the validation set, all with subsequent knee surgery CPT codes. Using the presence or absence of identified CPT codes alone, individuals who did or did not undergo subsequent knee surgery were identified with > 0.98 specificity, recall, precision, and F-1 score. When identifying the specific subsequent surgeries performed, only meniscus procedures were identified with > 0.9 performance metrics using CPT codes alone. The highest performance metrics for each subsequent surgery category were achieved using a combination algorithm that combined use of CPT codes, diagnoses codes, and/or information extracted from the operative report (F-1 score ranged from 0.839 for cartilage procedures to 1 for posterior cruciate ligament procedures and synovial procedures).