Skip to main content
ARS Home » Research » Publications at this Location » Publication #147768

Title: DEVELOPMENT OF A SIMPLE WEB INTERFACE FOR MANAGING AND ANALYZING EST DATA

Author
item MATUKUMALLI, LAKSHMI - GEORGE MASON UNIVERSITY
item GREFENSTETTE, JOHN - GEORGE MASON UNIVERSITY
item Sonstegard, Tad
item Van Tassell, Curtis - Curt

Submitted to: BARC Poster Day
Publication Type: Abstract Only
Publication Acceptance Date: 4/1/2003
Publication Date: 4/1/2003
Citation: Matukumalli, L.K., Grefenstette, J.J., Sonstegard, T.S., Van Tassell, C.P. 2003. Development of a simple web interface for managing and analyzing est data [abstract]. BARC Poster Day.

Interpretive Summary:

Technical Abstract: Expressed sequence tags (EST) are partial sequences of expressed genes prepared by reverse transcribing mRNA and cloning the cDNA fragments into a plasmid. In performing large-scale EST projects for any species, many different types of information and data types are generated. This can include contact, publication and library information that is to be submitted to GenBank along with the EST sequence and the analysis data related to annotation. EST-PAGE provides a bioinformatics solution for EST data entry, database management, process control and data retrieval from a unified web interface that can be easily customized and adapted by groups working on diverse EST sequencing projects. Although several EST pipeline applications were developed, this software is not freely available. For these reasons, we developed a simple web interface for managing and analyzing data generated from EST sequencing projects, which is named EST-PAGE. PAGE is an acronym corresponding to the data management steps available in the interface that can be summarized as: P (Processing of chromatogram for base calling), A (Analysis of the sequence data with vector screening, filtering low complexity sequences and checking for E.coli contamination), G (GenBank submission of good sequences to dbEST), E (Exploration of EST data for redundancy, presence of novel sequences by clustering and annotation). EST-PAGE is written in Perl, and takes advantage of standard modules such as bioperl and CGI-Perl to reduce coding costs and to encourage standardization and easy modification of the system to suit other needs. Open source free database Postgres (MySql version also available) is used to store the data. EST- PAGE can be used by all groups within ARS-USDA. This software will be made freely available.