Data Imputation and Compression For Parkinson's Disease Clinical Questionnaires - Inserm - Institut national de la santé et de la recherche médicale Accéder directement au contenu
Pré-Publication, Document De Travail Année : 2020

Data Imputation and Compression For Parkinson's Disease Clinical Questionnaires

Maxime Peralta
  • Fonction : Auteur
  • PersonId : 1069533
Pierre Jannin
Claire Haegelen
  • Fonction : Auteur
  • PersonId : 908533

Résumé

Medical questionnaires are a valuable source of information but are often difficult to analyse due to both their size and the high possibility of having missing values. This is a problematic issue in biomedical data science as it may complicate how individual questionnaire data is represented for statistical or machine learning analysis. In this paper, we propose a deeply-learnt residual autoencoder to simultaneously perform non-linear data imputation and dimensionality reduction. We present an extensive analysis of the dynamics of the performances of this autoencoder regarding the compression rate and the proportion of missing values. This method is evaluated on motor and non-motor clinical questionnaires of the Parkinson's Progression Markers Initiative (PPMI) database and consistently outperforms linear coupled imputation and reduction approaches.
Fichier principal
Vignette du fichier
PatiNAE_preprint.pdf (384.37 Ko) Télécharger le fichier
Origine : Fichiers produits par l'(les) auteur(s)
Loading...

Dates et versions

hal-02570967 , version 1 (12-05-2020)
hal-02570967 , version 2 (18-05-2020)
hal-02570967 , version 3 (11-04-2021)

Licence

Paternité - Pas d'utilisation commerciale - Partage selon les Conditions Initiales

Identifiants

  • HAL Id : hal-02570967 , version 1

Citer

Maxime Peralta, Pierre Jannin, Claire Haegelen, John Baxter. Data Imputation and Compression For Parkinson's Disease Clinical Questionnaires. 2020. ⟨hal-02570967v1⟩
144 Consultations
391 Téléchargements

Partager

Gmail Facebook X LinkedIn More