Machine Learning Improves Risk Stratification in Myelodysplastic Neoplasms: An Analysis of the Spanish Group of Myelodysplastic Syndromes

Other authors

Institut Català de la Salut

[Mosquera Orgueira A, Perez Encinas MM] Complexo Hospitalario Universitario de Santiago de Compostela, Department of Hematology, Instituto de Investigacións Sanitarias de Santiago, Santiago de Compostela, Spain. [Diaz Varela NA] Hospital Central de Asturias, Oviedo, Spain. [Mora E] Hematology Department, Hospital Universitario y Politécnico La Fe, Valencia, Spain. [Díaz-Beyá M] Hospital Clinic, Dept. of Hematology, IDIBAPS, Barcelona, Spain. [Montoro MJ] Servei d’Hematologia, Vall d’Hebron Hospital Universitari, Barcelona, Spain. Vall d’Hebron Institute of Oncology (VHIO), Barcelona, Spain

Vall d'Hebron Barcelona Hospital Campus

Publication date

2023-11-08T09:45:33Z

2023-11-08T09:45:33Z

2023-10



Abstract

Machine learning; Risk stratification; Myelodysplastic neoplasms


Aprendizaje automático; Estratificación del riesgo; Neoplasias mielodisplásicas


Aprenentatge automàtic; Estratificació del risc; Neoplàsies mielodisplàstiques


Myelodysplastic neoplasms (MDS) are a heterogeneous group of hematological stem cell disorders characterized by dysplasia, cytopenias, and increased risk of acute leukemia. As prognosis differs widely between patients, and treatment options vary from observation to allogeneic stem cell transplantation, accurate and precise disease risk prognostication is critical for decision making. With this aim, we retrieved registry data from MDS patients from 90 Spanish institutions. A total of 7202 patients were included, which were divided into a training (80%) and a test (20%) set. A machine learning technique (random survival forests) was used to model overall survival (OS) and leukemia-free survival (LFS). The optimal model was based on 8 variables (age, gender, hemoglobin, leukocyte count, platelet count, neutrophil percentage, bone marrow blast, and cytogenetic risk group). This model achieved high accuracy in predicting OS (c-indexes; 0.759 and 0.776) and LFS (c-indexes; 0.812 and 0.845). Importantly, the model was superior to the revised International Prognostic Scoring System (IPSS-R) and the age-adjusted IPSS-R. This difference persisted in different age ranges and in all evaluated disease subgroups. Finally, we validated our results in an external cohort, confirming the superiority of the Artificial Intelligence Prognostic Scoring System for MDS (AIPSS-MDS) over the IPSS-R, and achieving a similar performance as the molecular IPSS. In conclusion, the AIPSS-MDS score is a new prognostic model based exclusively on traditional clinical, hematological, and cytogenetic variables. AIPSS-MDS has a high prognostic accuracy in predicting survival in MDS patients, outperforming other well-established risk-scoring systems.

Document Type

Article


Published version

Language

English

Publisher

Wolters Kluwer Health

Related items

HemaSphere;7(10)

https://doi.org/10.1097/HS9.0000000000000961

Recommended citation

This citation was generated automatically.

Rights

Attribution 4.0 International

http://creativecommons.org/licenses/by/4.0/

This item appears in the following Collection(s)