SVM recursive feature elimination analyses of structural brain MRI predicts near-term relapses in patients with clinically isolated syndromes suggestive of multiple sclerosis

Other authors

Institut Català de la Salut

[Wottschel V] Department of Radiology and Nuclear Medicine, Amsterdam University Medical Centers, Location VUmc, Amsterdam, The Netherlands. Queen Square MS Centre, University College London, London, United Kingdom. [Chard DT] Queen Square MS Centre, University College London, London, United Kingdom. National Institute of Health Research (NIHR), University College London Hospitals, Biomedical Research Centre, London, United Kingdom. [Enzinger C] Research Unit for Neuronal Repair and Plasticity, Department of Neurology, Medical University of Graz, Graz, Austria. [Filippi M] Neuroimaging Research Unit, Institute of Experimental Neurology, Division of Neuroscience, San Raffaele Scientific Institute, Vita-Salute San Raffaele University, Milan, Italy. [Frederiksen JL] Rigshospitalet-Glostrup and University of Copenhagen, Copenhagen, Denmark. [Gasperini C] San Camillo-Forlanini Hospital, Rome, Italy. [Rovira A, Tintoré M] Vall d'Hebron Hospital Universitari, Barcelona, Spain

Vall d'Hebron Barcelona Hospital Campus

Publication date

2021-04-12T08:08:27Z

2021-04-12T08:08:27Z

2019



Abstract

Esclerosi múltiple; Classificació d'aprenentatge automàtic; Selecció de funcions


Esclerosis múltiple; Clasificación de aprendizaje automático; Selección de características


Multiple sclerosis; Machine learning classification; Feature selection


Machine learning classification is an attractive approach to automatically differentiate patients from healthy subjects, and to predict future disease outcomes. A clinically isolated syndrome (CIS) is often the first presentation of multiple sclerosis (MS), but it is difficult at onset to predict who will have a second relapse and hence convert to clinically definite MS. In this study, we thus aimed to distinguish CIS converters from non-converters at onset of a CIS, using recursive feature elimination and weight averaging with support vector machines. We also sought to assess the influence of cohort size and cross-validation methods on the accuracy estimate of the classification. We retrospectively collected 400 patients with CIS from six European MAGNIMS MS centres. Patients underwent brain MRI at onset of a CIS according to local standard-of-care protocols. The diagnosis of clinically definite MS at one-year follow-up was the standard against which the accuracy of the model was tested. For each patient, we derived MRI-based features, such as grey matter probability, white matter lesion load, cortical thickness, and volume of specific cortical and white matter regions. Features with little contribution to the classification model were removed iteratively through an interleaved sample bootstrapping and feature averaging approach. Classification of CIS outcome at one-year follow-up was performed with 2-fold, 5-fold, 10-fold and leave-one-out cross-validation for each centre cohort independently and in all patients together. The estimated classification accuracy across centres ranged from 64.9% to 88.1% using 2-fold cross-validation and from 73% to 92.9% using leave-one-out cross-validation. The classification accuracy estimate was higher in single-centre, smaller data sets than in combinations of data sets, being the lowest when all patients were merged together. Regional MRI features such as WM lesions, grey matter probability in the thalamus and the precuneus or cortical thickness in the cuneus and inferior temporal gyrus predicted the occurrence of a second relapse in patients at onset of a CIS using support vector machines. The increased accuracy estimate of the classification achieved with smaller and single-centre samples may indicate a model bias (overfitting) when data points were limited, but also more homogeneous. We provide an overview of classifier performance from a range of cross-validation schemes to give insight into the variability across schemes. The proposed recursive feature elimination approach with weight averaging can be used both in single- and multi-centre data sets in order to bridge the gap between group-level comparisons and making predictions for individual patients.


This project received funding from the European Union's Horizon2020 Research and Innovation Program EuroPOND under grant agreement number 666992, and it was supported by the National Institute for Health Research University College London Hospitals Biomedical Research Centre. We thank all participating partners of the MAGNIMS study group for sharing their data with us.

Document Type

Article


Published version

Language

English

Publisher

Elsevier

Related items

NeuroImage: Clinical;24

https://doi.org/10.1016/j.nicl.2019.102011

info:eu-repo/grantAgreement/EC/H2020/EU.3.1.1.

Recommended citation

This citation was generated automatically.

Rights

Attribution-NonCommercial-NoDerivatives 4.0 International

http://creativecommons.org/licenses/by-nc-nd/4.0/

This item appears in the following Collection(s)