
Topological data analysis reveals genotype–phenotype relationships in primary ciliary dyskinesia

Amelia Shoemark, Bruna Rubbo, Marie Legendre, Mahmoud Fassad, Eric Haarman, Sunayna Best, Irma C.M. Bon, Joost Brandsma, Pierre-Regis Burgel, Gunnar Carlsson, Siobhan Carr, Mary Carroll, Matt Edwards, Estelle Escudier, Isabelle Honoré, David Hunt, Gregory Jouvion, Michel Loebinger, Bernard Maitre, Deborah Morris-Rosendahl, Jean-Francois Papon, Camille Parsons, Mitali Patel, N. Simon Thomas, Guillaume Thouvenin, Woolf Walker, Robert Wilson, Claire Hogg, Hannah Mitchison, Jane Lucas
Abstract:

Background: Primary ciliary dyskinesia (PCD) is a heterogeneous inherited disorder caused by mutations in approximately 50 cilia-related genes. PCD genotype–phenotype relationships have mostly arisen from small case series because existing statistical approaches to investigating relationships have been unsuitable for rare diseases.

Methods: We applied a topological data analysis (TDA) approach to investigate genotype–phenotype relationships in PCD. Data from separate training and validation cohorts included 396 genetically defined individuals carrying pathogenic variants in PCD genes. To develop the TDA models, 12 clinical and diagnostic variables were included. TDA-driven hypotheses were subsequently tested using traditional statistics.

Results: Disease severity at diagnosis, measured by forced expiratory volume in 1 s (FEV1) z-score, was significantly worse in individuals with CCDC39 mutations (compared to other gene mutations) and better in those with DNAH11 mutations; the latter also reported less neonatal respiratory distress. Patients without neonatal respiratory distress had better preserved FEV1 at diagnosis. Individuals with DNAH5 mutations were phenotypically diverse. Cilia ultrastructure and beat pattern defects correlated closely with specific causative gene groups, confirming these tests can be used to support a genetic diagnosis.

Conclusions: This large-scale, multi-national study presents PCD as a syndrome with overlapping symptoms and variations in phenotype according to genotype. TDA modelling confirmed genotype–phenotype relationships reported by smaller studies (e.g. FEV1 worse with CCDC39 mutation) and identified new relationships, including FEV1 preservation with DNAH11 mutations and diversity of severity with DNAH5 mutations.
Document type: Journal articles
Contributor: Sandrine Couvet
Submitted on : Thursday, September 29, 2022 - 9:53:53 AM
Last modification on : Wednesday, November 2, 2022 - 11:49:48 AM


Restricted access
To satisfy the distribution rights of the publisher, the document is embargoed indefinitely.




Amelia Shoemark, Bruna Rubbo, Marie Legendre, Mahmoud Fassad, Eric Haarman, et al. Topological data analysis reveals genotype–phenotype relationships in primary ciliary dyskinesia. European Respiratory Journal, 2021, 58 (2), pp.2002359. ⟨10.1183/13993003.02359-2020⟩. ⟨inserm-03791242⟩


