Machine learning model to predict early recurrence in patients with perihilar cholangiocarcinoma planned treatment with curative resection: a multicenter study

J Gastrointest Surg. 2024 Dec;28(12):2039-2047. doi: 10.1016/j.gassur.2024.09.027. Epub 2024 Oct 3.

Abstract

Background: Early recurrence is the leading cause of death for patients with perihilar cholangiocarcinoma (pCCA) after surgery. Identifying high-risk patients preoperatively is important. This study aimed to construct a preoperative prediction model for the early recurrence of patients with pCCA to facilitate planned treatment with curative resection.

Methods: This study ultimately enrolled 400 patients with pCCA after curative resection in 5 hospitals between 2013 and 2019. They were randomly divided into training (n = 300) and testing groups (n = 100) at a ratio of 3:1. Associated variables were identified via least absolute shrinkage and selection operator (LASSO) regression. Four machine learning models were constructed: support vector machine, random forest (RF), logistic regression, and K-nearest neighbors. The predictive ability of the models was evaluated via receiving operating characteristic (ROC) curves, precision-recall curve (PRC) curves, and decision curve analysis. Kaplan-Meier (K-M) survival curves were drawn for the high-/low-risk population.

Results: Five factors: carbohydrate antigen 19-9, tumor size, total bilirubin, hepatic artery invasion, and portal vein invasion, were selected by LASSO regression. In both the training and testing groups, the ROC curve (area under the curve: 0.983 vs 0.952) and the PRC (0.981 vs 0.939) showed that RF was the best. The cutoff value for distinguishing high- and low-risk patients was 0.51. K-M survival curves revealed that in both groups, there was a significant difference in RFS between high- and low-risk patients (P < .001).

Conclusion: This study used preoperative variables from a large, multicenter database to construct a machine learning model that could effectively predict the early recurrence of pCCA in patients to facilitate planned treatment with curative resection and help clinicians make better treatment decisions.

Keywords: Curative resection; Early recurrence; Machine learning; Perihilar cholangiocarcinoma; Preoperative variables.

Publication types

  • Multicenter Study

MeSH terms

  • Aged
  • Bile Duct Neoplasms* / mortality
  • Bile Duct Neoplasms* / pathology
  • Bile Duct Neoplasms* / surgery
  • Female
  • Humans
  • Kaplan-Meier Estimate
  • Klatskin Tumor* / mortality
  • Klatskin Tumor* / pathology
  • Klatskin Tumor* / surgery
  • Machine Learning*
  • Male
  • Middle Aged
  • Neoplasm Invasiveness
  • Neoplasm Recurrence, Local*
  • Portal Vein / surgery
  • Predictive Value of Tests
  • ROC Curve
  • Retrospective Studies
  • Risk Assessment / methods