![]() |
|
| Archive Edition | |
|
Sponsored
by the U.S. Department of
Energy Human Genome Program
|
Santa Fe, New Mexico, November 13-17, 1994
|
Introduction to the Workshop
The electronic form of this document may be cited in the following style: Abstracts scanned from text submitted for November 1994 DOE Human Genome Program Contractor-Grantee Workshop. Inaccuracies have not been corrected. |
Software Support for Large Scale SequencingJoe Gatewood[1], Robert M. Pecherer[2], and Elaine Best[3] Current and projected DNA sequencing rates effectively prohibit direct human interaction on experimentally derived raw sequence data -- i.e., inspection, elimination of cloning artifacts, and editing in general. The investigator-as-data-processor bottleneck is further compounded during sequence analysis where DNA homology comparisons against public sequence databases result in redundant or extraneous information: Relevant homology is diluted by the irrelevant. Our goals in developing software to support large scale sequencing are:
The first three goals have been addressed. For primer directed sequencing, we have developed database representations and an exploring assembly algorithms. Our system architecture includes four components: Enduser interface, database, context management, and analytical tools. The enduser interface is implemented using Gain Momentum (Sybase) and programmed in GEL, a proprietary scripting language specialized for interactive I/O and task management. Database functionality is provided by the Relational DBMS Sybase using recursive DNA representations. (See "Recursive Relational Representation for DNA and Attribute-Value Lists: Techniques for Reducing Schema Modifications", Pecherer et al., DOE Human Genome Contractors Workshop IV, Santa Fe, NM, November 13-17, 1994.) Context management and analytical tools are written in C for performance and flexibility. Context management provides data management capability for the objects and collections obtained from the database and/or operated upon by analysis software. The analytical tools include homology comparison, feature selection, and primer selection algorithms. All analytical tools are designed as integral system components to avoid file parsing. We have implemented BLAST with a global alignment capability (BLASTga) to avoid segmented DNA homologies and have incorporated post screening of homology results to eliminate redundant extraneous information resulting from repetitive elements. Research funded by U.S. Department of Energy under Contract W-7405-ENG-36.
|
Send the url of this page to a friend
Last modified: Wednesday, October 29, 2003
Home * Contacts * Disclaimer
Base URL: www.ornl.gov/hgmis
Site sponsored by the U.S. Department of Energy
Office of Science, Office
of Biological and Environmental Research, Human
Genome Program