Introduction to the Workshop
URLs Provided by Attendees
- Ethical, Legal, and Social Issues
The electronic form of this document may be cited in the following style:
Human Genome Program, U.S. Department of Energy, DOE Human Genome Program Contractor-Grantee Workshop IV, 1994.
Abstracts scanned from text submitted for November 1994 DOE Human Genome Program Contractor-Grantee Workshop. Inaccuracies have not been corrected.
An Integrated Approach to Genomic Sequence Analysis
Terry Gaasterland, Natalia Alaltsev, and Ross Overbeek
Mathematics and Computer Science Division
Argonne National Laboratory
We are working on various projects with the objective of developing an integrated framework for genomic sequence analysis. One effort has involved supporting the analysis of Mycoplasma capricolum sequence data (produced by Pat Gillevet of NIH and Walter Gilbert of Harvard). This data was given to us in the form of 372 contigs representing 214kb of sequence. Our analysis produced a set of detailed results (prediction of over 160 specific coding sequences) and a general-purpose tool which we are now using to analyze microbial sequence data.
Other projects include (1) construction of a phylogenetic tree based on rRNA, containing over 2700 taxa, using maximum likelihood (working with Gary Olsen and Carl Woese of the Ribosomal Database Project); (2) development of an integrated database containing sequence data from over 600 microbial organisms, protein sequence and motif data, phylogenetic trees, and alignments; (3) design of an automatic system for analyzing microbial sequence data; and (4) production of a framework to support analysis of organism metabolism (working closely with E. Selkov, developer of the EMP database on enzymes and metabolic pathways).