Computational gene prediction using multiple sources of evidence

Research output: Contribution to journal › Article › peer-review

89 Scopus citations


This article describes a computational method to construct gene models by using evidence generated from a diverse set of sources, including those typical of a genome annotation pipeline. The program, called Combiner, takes as input a genomic sequence and the locations of gene predictions from ab initio gene finders, protein sequence alignments, expressed sequence tag and cDNA alignments, splice site predictions, and other evidence. Three different algorithms for combining evidence in the Combiner were implemented and tested on 1783 confirmed genes in Arabidopsis thaliana. Our results show that combining gene prediction evidence consistently outperforms even the best individual gene finder and, in some cases, can produce dramatic improvements in sensitivity and specificity.
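As a rough illustration of what combining evidence can look like, the sketch below implements a simple per-base weighted vote over evidence intervals and then scores the consensus against a reference annotation. It is an assumption made for illustration only: the abstract does not describe the three algorithms implemented in the Combiner, and this is not claimed to be any of them. The function names, evidence source names, weights, and threshold are all hypothetical. "Specificity" is computed here as TP / (TP + FP), a convention often used in gene-finding evaluations; the abstract does not say which definition the authors used.

```python
from typing import Dict, List, Tuple

Interval = Tuple[int, int]  # half-open [start, end), 0-based coordinates


def weighted_vote(
    seq_len: int,
    evidence: Dict[str, List[Interval]],
    weights: Dict[str, float],
    threshold: float = 0.5,
) -> List[Interval]:
    """Mark a base as coding when the weighted share of evidence sources
    covering it reaches `threshold`, then merge coding bases into intervals."""
    total = sum(weights[name] for name in evidence)
    score = [0.0] * seq_len
    for name, intervals in evidence.items():
        # Count each source at most once per base, even if its intervals overlap.
        covered = set()
        for start, end in intervals:
            covered.update(range(max(start, 0), min(end, seq_len)))
        for pos in covered:
            score[pos] += weights[name]

    merged: List[Interval] = []
    run_start = None
    for pos in range(seq_len):
        coding = score[pos] / total >= threshold
        if coding and run_start is None:
            run_start = pos
        elif not coding and run_start is not None:
            merged.append((run_start, pos))
            run_start = None
    if run_start is not None:
        merged.append((run_start, seq_len))
    return merged


def base_level_accuracy(
    predicted: List[Interval], annotated: List[Interval]
) -> Tuple[float, float]:
    """Return (sensitivity, specificity) at the base level, with
    sensitivity = TP / (TP + FN) and specificity = TP / (TP + FP)."""
    pred = {p for s, e in predicted for p in range(s, e)}
    true = {p for s, e in annotated for p in range(s, e)}
    tp = len(pred & true)
    sensitivity = tp / len(true) if true else 0.0
    specificity = tp / len(pred) if pred else 0.0
    return sensitivity, specificity


# Hypothetical evidence: two ab initio gene finders and one EST alignment.
evidence = {
    "gene_finder_a": [(5, 15)],
    "gene_finder_b": [(8, 20)],
    "est_alignment": [(10, 18)],
}
# Hypothetical weights: the alignment is trusted twice as much as either finder.
weights = {"gene_finder_a": 1.0, "gene_finder_b": 1.0, "est_alignment": 2.0}

consensus = weighted_vote(30, evidence, weights, threshold=0.5)
sensitivity, specificity = base_level_accuracy(consensus, [(10, 20)])
print(consensus, sensitivity, specificity)
```

In this toy setup, a base supported by only one gene finder falls below the threshold, while a base supported by both finders, or by the alignment plus one finder, is kept. A real combiner would also need to respect exon boundaries, reading frame, and splice site signals, which this sketch ignores.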

Original language: English (US)
Pages (from-to): 142-148
Number of pages: 7
Journal: Genome Research
Issue number: 1
State: Published - Jan 2004
Externally published: Yes

ASJC Scopus subject areas

  • Genetics
  • Genetics (clinical)

