Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest

Abstract : Four single nucleotide polymorphism (SNP)-based human leukocyte antigen (HLA) imputation methods (e-HLA, HIBAG, HLA*IMP:02 and MAGPrediction) were trained using 1000 Genomes SNP and HLA genotypes and assessed for their ability to accurately impute molecular HLA-A, -B, -C and -DRB1 genotypes in the Human Genome Diversity Project cell panel. Imputation concordance was high (>89%) across all methods for both HLA-A and HLA-C, but HLA-B and HLA-DRB1 proved generally difficult to impute. Overall, <27.8% of subjects were correctly imputed for all HLA loci by any method. Concordance across all loci was not enhanced via the application of confidence thresholds; reliance on confidence scores across methods only led to noticeable improvement (+3.2%) for HLA-DRB1. As the HLA complex is highly relevant to the study of human health and disease, a standardized assessment of SNP-based HLA imputation methods is crucial for advancing genomic research. Considerable room remains for the improvement of HLA-B and especially HLA-DRB1 imputation methods, and no imputation method is as accurate as molecular genotyping. The application of large, ancestrally diverse HLA and SNP reference data sets and multiple imputation methods has the potential to make SNP-based HLA imputation methods a tractable option for determining HLA genotypes.
Document type :
Journal articles
Complete list of metadatas

Cited literature [52 references]  Display  Hide  Download

https://www.hal.inserm.fr/inserm-02155110
Contributor : Ana Paula Dutra Azevedo <>
Submitted on : Thursday, June 13, 2019 - 1:26:49 PM
Last modification on : Wednesday, August 21, 2019 - 1:42:15 PM

File

nihms879450.pdf
Publisher files allowed on an open archive

Identifiers

Collections

Citation

Derek Pappas, Antoine Lizee, Vanja Paunic, Karl Beutner, Allan Motyer, et al.. Significant variation between SNP-based HLA imputations in diverse populations: the last mile is the hardest. Pharmacogenomics Journal, Nature Publishing Group, 2018, 18 (3), pp.367-376. ⟨10.1038/tpj.2017.7⟩. ⟨inserm-02155110⟩

Share

Metrics

Record views

44

Files downloads

86