Today's Clinical Lab - News, Editorial and Products for the Clinical Laboratory
Clincians wearing scrubs and stethoscope holds a blue ribbon with prostate written on it.
Machine learning model combines genetic data, PSA levels and statistical score to predict recurrence of prostate cancer, better than what clinical data alone gives.
iStock, hidesy

New ML Model Improves Prostate Cancer Recurrence Prediction

Machine learning model combines fusion gene expression, serum PSA level, and Gleason score to accurately predict cancer recurrence

Published:Feb 15, 2023
|3 min read
Register for free to listen to this article
Listen with Speechify

PHILADELPHIA, PA — Predicting the course of prostate cancer is challenging because only a fraction of prostate cancer patients experience recurrence after radical prostatectomy or radiation therapy. Yet, prostate cancer remains one of the fatal malignancies in men in the US. 

Investigators have developed a machine learning model that combines profiles of fusion genes known to be widespread in prostate cancer with the commonly used Gleason score and prostate-specific antigen (PSA) level. The machine learning model consistently improved the prediction of prostate cancer recurrence by the clinical tests alone or in combination. The results are reported in The American Journal of Pathology, published by Elsevier.

“Gleason score and PSA level have been used with varying success in predicting clinical outcomes in patients with prostate cancer,” explained Jian-Hua Luo, MD, PhD, Department of Pathology, University of Pittsburgh School of Medicine, the lead investigator. “However, they provide limited insight into the mechanism of the disease. Gene fusion events are known to be widespread in prostate cancer, but their potential in predicting the course of the disease was unknown.”

Data from a multi-institutional cohort that included 271 samples of radical prostatectomy from the University of Pennsylvania Medical Center (UPMC), 191 from the University of Wisconsin–Madison, and 112 from Stanford Medical Center were analyzed. All 14 of the fusion genes known to be present in prostate cancer were detected in the samples from the combined cohort. Gleason and serum PSA scores were also available.

The investigators first developed a training model using the UPMC data. Several machine learning algorithms were applied to the fusion gene profiling data to determine the best parameters of 14 fusion gene combinations for predicting prostate cancer recurrence. The best algorithms were then applied to the whole training cohort to build a model.

How the combination model improved accuracy

Prediction of cancer recurrence based on Gleason score alone had 77.9 percent accuracy, and PSA alone correctly predicted 73.5 percent of prostate cancer recurrence. When the Gleason score data were incorporated into the machine learning analysis with the fusion data, a total of 442 models of different combinations showed an accuracy above 80 percent for the combined models. When PSA alone was combined with fusion data, 265 models of different combinations showed prediction rates above 75 percent. The combination of fusion data, Gleason score, and PSA improved the prediction of prostate cancer; 317 models yielded prediction rates of 80 percent or better.

Next, 764 machine learning models trained using data from the UPMC cohort were applied to the Wisconsin–Stanford cohort, and then to the UPMC–Stanford–Wisconsin cohort. Again, the combination of fusion data, Gleason score, and PSA outperformed the prediction of cancer recurrence by PSA or Gleason score alone or combined. 

Cancer did not recur for five years after surgery in 81.9 percent of patients if the cancer was predicted as nonrecurrent, while only 17.2 percent of patients were recurrence-free if their cancer was predicted as recurrent by the same model. With the Gleason plus PSA model, 78.3 percent of patients had no cancer recurrence if the cancer was predicted as nonrecurrent by the model, and 26.2 percent of patients had no cancer recurrence for five years if the cancer was predicted as recurrent.

Luo notes that profiles of fusion genes have added value for clinical patient management because some gene fusions are important molecular processes in generating prostate cancer. Whereas, the others are known to make cancer susceptible to certain drugs. “One of the big surprises in the analysis is that the fusion genes called CCNH-C5orf30 turned out to be an indicator of favorable clinical outcomes. It is unusual for a genomic abnormality created by cancer cells to restrain cancer’s aggressiveness,” he said.

“The detection of fusion genes provides new mechanistic insight into prostate cancer progression enabling proactive measures to be taken,” said Luo. “The incorporation of fusion gene detection into the prostate cancer diagnostic scheme benefits patients with regard to diagnosis, prognosis, cancer progression surveillance, and treatment. Further, if these machine learning models are applied to clinical practice in the future, more lives may be saved.”

- This press release was originally published on the Elsevier website