TY - JOUR
T1 - Non-parametric, hypothesis-based analysis of microarrays for comparison of several phenotypes
AU - Kowalski, Jeanne
AU - Drake, Charles
AU - Schwartz, Ronald H.
AU - Powell, Jonathan
N1 - Funding Information:
This work was supported in part by National Institute of Allergy and Infectious Diseases grants K22AI51186-01 (J.K.) and K22AI 01773-02 (J.P.), a National Cancer Institute grant 5P30 CA06973-40 (J.K.) and a V Foundation Award (J.P.).
PY - 2004/2/12
Y1 - 2004/2/12
N2 - Motivation: We present a statistical framework for the analysis of high-dimensional microarray data, where the goal is to compare intensities among several groups based on as few as a single sample from each group. In this setting, it is of interest to compare gene expression among several phenotypes to define candidate genes that simultaneously characterize several criteria, simultaneously, among the comparison groups. We motivate the approach by a comparative microarray experiment in which clones of a cell were singly exposed to several distinct but related conditions. The experiment was conducted to elucidate genes involved in pathways leading to T cell clonal anergy. Results: By integrating inference principles within a bioinformatics setting, we introduce a two-stage approach to select candidate genes that characterize several criteria. The method is unified in its non-parametric approach to inference and description. For inference, we construct a testable hypothesis based on the criteria of interest in a high-dimensional space, while preserving the dependence among genes. Upon rejecting the null, we estimate the cardinality of a set of individual candidate genes (or gene pairs) that depict the events of interest. With this estimate, we then select individual genes (or gene pairs) based upon a two-dimensional ranking that examines relations within and between genes, among comparison groups, using singular value decomposition in combination with inner product concepts.
AB - Motivation: We present a statistical framework for the analysis of high-dimensional microarray data, where the goal is to compare intensities among several groups based on as few as a single sample from each group. In this setting, it is of interest to compare gene expression among several phenotypes to define candidate genes that simultaneously characterize several criteria, simultaneously, among the comparison groups. We motivate the approach by a comparative microarray experiment in which clones of a cell were singly exposed to several distinct but related conditions. The experiment was conducted to elucidate genes involved in pathways leading to T cell clonal anergy. Results: By integrating inference principles within a bioinformatics setting, we introduce a two-stage approach to select candidate genes that characterize several criteria. The method is unified in its non-parametric approach to inference and description. For inference, we construct a testable hypothesis based on the criteria of interest in a high-dimensional space, while preserving the dependence among genes. Upon rejecting the null, we estimate the cardinality of a set of individual candidate genes (or gene pairs) that depict the events of interest. With this estimate, we then select individual genes (or gene pairs) based upon a two-dimensional ranking that examines relations within and between genes, among comparison groups, using singular value decomposition in combination with inner product concepts.
UR - http://www.scopus.com/inward/record.url?scp=1342330540&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=1342330540&partnerID=8YFLogxK
U2 - 10.1093/bioinformatics/btg418
DO - 10.1093/bioinformatics/btg418
M3 - Article
C2 - 14960463
AN - SCOPUS:1342330540
SN - 1367-4803
VL - 20
SP - 364
EP - 373
JO - Bioinformatics
JF - Bioinformatics
IS - 3
ER -