Abstract
Screening and early identification of primary immunodeficiency disease (PID) genes is a major challenge for physicians. Many resources have catalogued molecular alterations in known PID genes along with their associated clinical and immunological phenotypes. However, these resources do not assist in identifying candidate PID genes. We have recently developed a platform designated Resource of Asian PDIs, which hosts information pertaining to molecular alterations, protein-protein interaction networks, mouse studies and microarray gene expression profiling of all known PID genes. Using this resource as a discovery tool, we describe the development of an algorithm for prediction of candidate PID genes. Using a support vector machine learning approach, we have predicted 1442 candidate PID genes using 69 binary features of 148 known PID genes and 3162 non-PID genes as a training data set. The power of this approach is illustrated by the fact that six of the predicted genes have recently been experimentally confirmed to be PID genes. The remaining genes in this predicted data set represent attractive candidates for testing in patients where the etiology cannot be ascribed to any of the known PID genes.
Original language | English (US) |
---|---|
Pages (from-to) | 345-351 |
Number of pages | 7 |
Journal | DNA Research |
Volume | 16 |
Issue number | 6 |
DOIs | |
State | Published - Dec 2009 |
Keywords
- HPRD
- Human Proteinpedia
- NetPath
- RAPID
- SVM
ASJC Scopus subject areas
- Genetics
- Molecular Biology
- General Medicine