Agricultural Research Service United States Department of Agriculture

Research Project: IDENTIFICATION OF FUNCTIONAL SEQUENCE IN PLANT GENOMES THROUGH BIOINFORMATIC, GENOMIC, AND GENETIC APPROACHES

Location: Plant, Soil and Nutrition Research

Title: Evidence-based gene predictions in plant genomes

Authors
Liang, Chengzhi
Mao, Long
Ware, Doreen
Stein, Lincoln

Submitted to: Genome Research
Publication Type: Peer Reviewed Journal
Publication Acceptance Date: September 3, 2009
Publication Date: October 19, 2009
Citation: Liang, C., Mao, L., Ware, D., Stein, L. 2009. Evidence-based gene predictions in plant genomes. Genome Research. 19(10):1912-1923.

Interpretive Summary: The sequence of a genome is the starting point, or blueprint, that describes the “parts” of an organism, including its genes. In this work we present improvements for computationally generating protein-coding gene structures using information from expressed genes and proteins of the same species as well as of closely related organisms. The method builds upon existing open source software and allows researchers to combine evidence from different organisms to build gene structures in another. We report the performance of the software compared with the existing annotations in Arabidopsis and rice, as well as a preliminary analysis of a small region in maize.

Technical Abstract: Automated evidence-based gene building is a rapid and cost-effective way to provide reliable gene annotations on newly sequenced genomes. One of the limitations of evidence-based gene builders, however, is their requirement for gene expression evidence—known proteins, full-length cDNAs, or expressed sequence tags (ESTs)—in the species of interest. This limitation is of particular concern for plant genomes, where the rate of genome sequencing is greatly outpacing the rate of EST- and cDNA-sequencing projects. To overcome this limitation, we have developed an evidence-based gene build system (the Gramene pipeline) that can use gene expression evidence across related species. Using the previously annotated plant genomes, the dicot Arabidopsis thaliana and the monocot Oryza sativa, we show that the cross-species ESTs from within monocot or dicot class are a valuable source of evidence for gene predictions. We compare the Gramene pipeline to several widely used gene prediction programs in rice; this comparison shows the pipeline performs favorably at both the gene and exon levels with cross-species gene products only. We discuss the results of testing the pipeline on a 22-Mb region of the newly sequenced maize genome and discuss potential application of the pipeline to other genomes.
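
As an illustration of what an exon-level comparison between predicted and reference annotations can look like, the sketch below computes sensitivity and specificity from exact exon coordinate matches. This is a common convention in gene prediction benchmarking, not necessarily the scoring used in the paper; the function name `exon_level_accuracy`, the tuple layout, and the coordinates are hypothetical and chosen only for the example.

```python
from typing import Set, Tuple

# Hypothetical exon representation: (sequence name, start, end, strand)
Exon = Tuple[str, int, int, str]


def exon_level_accuracy(predicted: Set[Exon], reference: Set[Exon]) -> Tuple[float, float]:
    """Return (sensitivity, specificity), counting only exact coordinate matches.

    Sensitivity: fraction of reference exons that were predicted exactly.
    Specificity: fraction of predicted exons that match a reference exon
    (the gene prediction literature often uses this term for what is
    elsewhere called precision).
    """
    exact = predicted & reference
    sensitivity = len(exact) / len(reference) if reference else 0.0
    specificity = len(exact) / len(predicted) if predicted else 0.0
    return sensitivity, specificity


# Made-up coordinates for illustration; not data from the paper.
reference = {
    ("chr1", 100, 200, "+"),
    ("chr1", 300, 400, "+"),
    ("chr1", 500, 650, "+"),
}
predicted = {
    ("chr1", 100, 200, "+"),  # exact match
    ("chr1", 300, 410, "+"),  # overlaps a reference exon but the end differs
}

sn, sp = exon_level_accuracy(predicted, reference)
print(f"exon sensitivity={sn:.2f} specificity={sp:.2f}")
```

Requiring exact boundaries is a strict criterion: an exon that overlaps the reference but misses a splice site by a few bases counts as wrong on both measures. A gene-level comparison can be built the same way by treating each gene as the full set of its exons.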

   

 
Project Team
Ware, Doreen
Kochian, Leon
 
 
Related National Programs
  Plant Genetic Resources, Genomics and Genetic Improvement (301)
 
Related Projects
   CHARACTERIZING PLANT GENOMES FOR AGRICULTURE IMPROVEMENT
   USE OF GENETIC AND GENOMIC TOOLS IN SORGHUM TO ENHANCE OUR UNDERSTANDING OF DEVELOPMENTAL PROCESSES LIMITING YIELD AND/OR QUALITY IN GRASSES
 
 
Last Modified: 05/19/2013