Skip to Main content Skip to Navigation
Journal articles

Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest

Abstract : Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.
Document type :
Journal articles
Complete list of metadata

Cited literature [52 references]  Display  Hide  Download
Contributor : Sylvie Le Bihan Connect in order to contact the contributor
Submitted on : Thursday, June 13, 2019 - 1:26:49 PM
Last modification on : Wednesday, April 27, 2022 - 4:28:26 AM


Publisher files allowed on an open archive



Derek J Pappas, Antoine Lizee, Vanja Paunic, Karl R Beutner, Allan Motyer, et al.. Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest. Pharmacogenomics Journal, Nature Publishing Group, 2018, 18 (3), pp.367-376. ⟨10.1038/tpj.2017.7⟩. ⟨inserm-02155110⟩



Record views


Files downloads