PARE: A framework for removal of confounding effects from any distance-based dimension reduction method

Andrew A. Chen, Kelly Clark, Blake E. Dewey, Anna DuVal, Nicole Pellegrini, Govind Nair, Youmna Jalkh, Samar Khalil, Jon Zurawski, Peter A. Calabresi, Daniel S. Reich, Rohit Bakshi, Haochang Shou, Russell T. Shinohara

Research output: Contribution to journalArticlepeer-review

Abstract

Dimension reduction tools preserving similarity and graph structure such as t-SNE and UMAP can capture complex biological patterns in high-dimensional data. However, these tools typically are not designed to separate effects of interest from unwanted effects due to confounders. We introduce the partial embedding (PARE) framework, which enables removal of confounders from any distance-based dimension reduction method. We then develop partial t-SNE and partial UMAP and apply these methods to genomic and neuroimaging data. For lower-dimensional visualization, our results show that the PARE framework can remove batch effects in single-cell sequencing data as well as separate clinical and technical variability in neuroimaging measures. We demonstrate that the PARE framework extends dimension reduction methods to highlight biological patterns of interest while effectively removing confounding effects.

Original languageEnglish (US)
Article numbere1012241
JournalPLoS computational biology
Volume20
Issue number7 July
DOIs
StatePublished - Jul 2024

ASJC Scopus subject areas

  • Ecology, Evolution, Behavior and Systematics
  • Modeling and Simulation
  • Ecology
  • Molecular Biology
  • Genetics
  • Cellular and Molecular Neuroscience
  • Computational Theory and Mathematics

Fingerprint

Dive into the research topics of 'PARE: A framework for removal of confounding effects from any distance-based dimension reduction method'. Together they form a unique fingerprint.

Cite this