TY - GEN
T1 - Cloud-scale genomic signals processing classification analysis for gene expression microarray data
AU - Harvey, Benjamin
AU - Ji, Soo Yeon
N1 - Publisher Copyright:
© 2014 IEEE.
PY - 2014/11/2
Y1 - 2014/11/2
N2 - As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
AB - As microarray data available to scientists continues to increase in size and complexity, it has become overwhelmingly important to find multiple ways to bring inference though analysis of DNA/mRNA sequence data that is useful to scientists. Though there have been many attempts to elucidate the issue of bringing forth biological inference by means of wavelet preprocessing and classification, there has not been a research effort that focuses on a cloud-scale classification analysis of microarray data using Wavelet thresholding in a Cloud environment to identify significantly expressed features. This paper proposes a novel methodology that uses Wavelet based Denoising to initialize a threshold for determination of significantly expressed genes for classification. Additionally, this research was implemented and encompassed within cloud-based distributed processing environment. The utilization of Cloud computing and Wavelet thresholding was used for the classification 14 tumor classes from the Global Cancer Map (GCM). The results proved to be more accurate than using a predefined p-value for differential expression classification. This novel methodology analyzed Wavelet based threshold features of gene expression in a Cloud environment, furthermore classifying the expression of samples by analyzing gene patterns, which inform us of biological processes. Moreover, enabling researchers to face the present and forthcoming challenges that may arise in the analysis of data in functional genomics of large microarray datasets.
UR - http://www.scopus.com/inward/record.url?scp=84929501067&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84929501067&partnerID=8YFLogxK
U2 - 10.1109/EMBC.2014.6943968
DO - 10.1109/EMBC.2014.6943968
M3 - Conference contribution
C2 - 25570336
AN - SCOPUS:84929501067
T3 - 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2014
SP - 1843
EP - 1846
BT - 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2014
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, EMBC 2014
Y2 - 26 August 2014 through 30 August 2014
ER -