Pathogen exposure misclassification can bias association signals in GWAS of infectious diseases when using population-based common control subjects

Research output: Contribution to journal > Article > peer-review

Abstract

Genome-wide association studies (GWASs) have been performed to identify host genetic factors for a range of phenotypes, including for infectious diseases. The use of population-based common control subjects from biobanks and extensive consortia is a valuable resource to increase sample sizes in the identification of associated loci with minimal additional expense. Non-differential misclassification of the outcome has been reported when the control subjects are not well characterized, which often attenuates the true effect size. However, for infectious diseases, the comparison of affected subjects to population-based common control subjects regardless of pathogen exposure can also result in selection bias. Through simulated comparisons of pathogen-exposed cases and population-based common control subjects, we demonstrate that not accounting for pathogen exposure can result in biased effect estimates and spurious genome-wide significant signals. Further, the observed association can be distorted depending upon the strength of the association between a locus and pathogen exposure and the prevalence of pathogen exposure. We also used a real data example from the hepatitis C virus (HCV) genetic consortium comparing HCV spontaneous clearance to persistent infection with both well-characterized control subjects and population-based common control subjects from the UK Biobank. We find biased effect estimates for known HCV clearance-associated loci and potentially spurious HCV clearance associations. These findings suggest that the choice of control subjects is especially important for infectious diseases or outcomes that are conditional upon environmental exposures.
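
The mechanism described in the abstract can be illustrated with a small expected-value calculation. This is only a sketch under stated assumptions and is not the authors' simulation design: it assumes Hardy-Weinberg genotype frequencies, logistic models for exposure and for disease among the exposed, and population-based controls drawn from the whole population without screening. All function names and parameter values are hypothetical.

```python
import math


def expit(x):
    """Inverse logit."""
    return 1.0 / (1.0 + math.exp(-x))


def logit(p):
    """Log odds of a probability."""
    return math.log(p / (1.0 - p))


def allelic_odds(weights):
    """Odds of the effect allele given (unnormalized) weights for genotypes 0, 1, 2."""
    alt = sum(g * w for g, w in enumerate(weights))
    ref = sum((2 - g) * w for g, w in enumerate(weights))
    return alt / ref


def expected_odds_ratios(maf, base_exposure, log_or_exposure,
                         base_disease, log_or_disease):
    """Expected allelic odds ratios for cases vs. two kinds of controls.

    Assumptions (illustrative only):
      - genotypes follow Hardy-Weinberg proportions with allele frequency `maf`
      - P(exposed | g) is logistic; `base_exposure` is the probability at g = 0
      - P(disease | exposed, g) is logistic; `base_disease` is the probability at g = 0
      - unexposed individuals cannot become cases
      - population-based controls are an unscreened draw from the population
    Returns (OR vs. exposed controls, OR vs. population-based controls).
    """
    hwe = [(1 - maf) ** 2, 2 * maf * (1 - maf), maf ** 2]
    p_exp = [expit(logit(base_exposure) + log_or_exposure * g) for g in range(3)]
    p_dis = [expit(logit(base_disease) + log_or_disease * g) for g in range(3)]

    cases = [hwe[g] * p_exp[g] * p_dis[g] for g in range(3)]
    exposed_controls = [hwe[g] * p_exp[g] * (1 - p_dis[g]) for g in range(3)]
    population_controls = hwe  # exposure status ignored

    or_exposed = allelic_odds(cases) / allelic_odds(exposed_controls)
    or_population = allelic_odds(cases) / allelic_odds(population_controls)
    return or_exposed, or_population


if __name__ == "__main__":
    # A locus associated with exposure only (no effect on disease given exposure).
    for base_exposure in (0.05, 0.5, 0.9):
        or_exp, or_pop = expected_odds_ratios(
            maf=0.3, base_exposure=base_exposure,
            log_or_exposure=math.log(1.5),
            base_disease=0.25, log_or_disease=0.0)
        print(f"exposure={base_exposure:.2f}  "
              f"OR vs exposed controls={or_exp:.3f}  "
              f"OR vs population controls={or_pop:.3f}")
```

Under these assumptions, a locus that affects only exposure shows no association when cases are compared with exposed controls, but an odds ratio above 1 when compared with unscreened population controls. The distortion shrinks as exposure becomes more common, in line with the dependence on exposure prevalence noted in the abstract.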

Original language: English (US)
Pages (from-to): 336-348
Number of pages: 13
Journal: American Journal of Human Genetics
Volume: 110
Issue number: 2
State: Published - Feb 2 2023

Keywords

  • GWAS
  • common controls
  • genetic epidemiology
  • infectious disease
  • misclassification bias
  • population-based controls

ASJC Scopus subject areas

  • Genetics (clinical)
  • Genetics
