Adjustment for Genotype Imputation Uncertainty Corrects for Inflated Type I Error in Family-Based Association Testing

Tyler R.C. Day, Joshua C. Bis, Nicola Chapman, Alejandro Q. Nato, Andrea R.V.R. Horimoto, Harkirat Sohi, Rafael Nafikov, Elizabeth E. Blue, Mohamad Saad, Ellen M. Wijsman*

*Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Genotype imputation is a widely-used data augmentation approach that is applied to samples of related and/or unrelated individuals. Association testing may then be carried out on the complete data with commonly-used methods. This approach has typically not accounted for the mix of observed and imputed data, although recent work has noted the potential for introduction of confounding in case-control studies. In the Alzheimer's Disease Sequencing Project family sample we found severe inflation of the test statistics in logistic regression analysis following genotype imputation, even after standard covariate adjustments. Here we dissect sources of this inflation, which is driven by three factors: frequency-dependent bias in imputation-induced allele frequencies, differential measurement error, and differential genotyping rates in cases versus controls that introduces confounding. To address the problem, we propose a statistic, imputation deviance ((Formula presented.)), which can be easily computed from the observed and imputed genotype probabilities. We show that (Formula presented.), as an additional fixed-effect covariate, controls the genome-wide inflation in analysis of this family-based sample, and we speculate that use of imputation deviance may also provide a practical approach to correct for genotype imputation effects in other settings, particularly when a data set is unbalanced and includes related individuals.

Original languageEnglish
Article numbere70021
JournalGenetic Epidemiology
Volume49
Issue number8
DOIs
Publication statusPublished - Dec 2025
Externally publishedYes

Keywords

  • GWAS
  • Pedigree
  • WGS
  • data augmentation
  • genomic control
  • missing data
  • mixed model

Fingerprint

Dive into the research topics of 'Adjustment for Genotype Imputation Uncertainty Corrects for Inflated Type I Error in Family-Based Association Testing'. Together they form a unique fingerprint.

Cite this