TY - JOUR
T1 - An argument for mechanism-based statistical inference in cancer
AU - Geman, Donald
AU - Ochs, Michael
AU - Price, Nathan D.
AU - Tomasetti, Cristian
AU - Younes, Laurent
N1 - Funding Information:
The work of D. Geman and L. Younes was partially supported by the National Science Foundation under NSF DMS1228248. N. Price’s work was supported by a Camille Dreyfus Teacher-Scholar Award and NIH 2P50GM076547.
Publisher Copyright:
© 2014, Springer-Verlag Berlin Heidelberg.
PY - 2015/5/1
Y1 - 2015/5/1
N2 - Cancer is perhaps the prototypical systems disease, and as such has been the focus of extensive study in quantitative systems biology. However, translating these programs into personalized clinical care remains elusive and incomplete. In this perspective, we argue that realizing this agenda—in particular, predicting disease phenotypes, progression and treatment response for individuals—requires going well beyond standard computational and bioinformatics tools and algorithms. It entails designing global mathematical models over network-scale configurations of genomic states and molecular concentrations, and learning the model parameters from limited available samples of high-dimensional and integrative omics data. As such, any plausible design should accommodate: biological mechanism, necessary for both feasible learning and interpretable decision making; stochasticity, to deal with uncertainty and observed variation at many scales; and a capacity for statistical inference at the patient level. This program, which requires a close, sustained collaboration between mathematicians and biologists, is illustrated in several contexts, including learning biomarkers, metabolism, cell signaling, network inference and tumorigenesis.
AB - Cancer is perhaps the prototypical systems disease, and as such has been the focus of extensive study in quantitative systems biology. However, translating these programs into personalized clinical care remains elusive and incomplete. In this perspective, we argue that realizing this agenda—in particular, predicting disease phenotypes, progression and treatment response for individuals—requires going well beyond standard computational and bioinformatics tools and algorithms. It entails designing global mathematical models over network-scale configurations of genomic states and molecular concentrations, and learning the model parameters from limited available samples of high-dimensional and integrative omics data. As such, any plausible design should accommodate: biological mechanism, necessary for both feasible learning and interpretable decision making; stochasticity, to deal with uncertainty and observed variation at many scales; and a capacity for statistical inference at the patient level. This program, which requires a close, sustained collaboration between mathematicians and biologists, is illustrated in several contexts, including learning biomarkers, metabolism, cell signaling, network inference and tumorigenesis.
UR - http://www.scopus.com/inward/record.url?scp=84931833538&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84931833538&partnerID=8YFLogxK
U2 - 10.1007/s00439-014-1501-x
DO - 10.1007/s00439-014-1501-x
M3 - Review article
C2 - 25381197
AN - SCOPUS:84931833538
SN - 0340-6717
VL - 134
SP - 479
EP - 495
JO - Human genetics
JF - Human genetics
IS - 5
ER -