Progress, and Applications
of the Human Genome Project
Sponsored by the U.S. Department of Energy Human Genome Program
Human Genome News Archive Edition
|Available in PDF
In this issue...
DOE '99 Oakland Highlights
In the News
Web, Other Resources, Publications
Report from 1999 DOE Genome Meeting
Oakland presentations emphasized that genome-sequencing projects are producing data at a rate exceeding current analytical and data-management capabilities. Additionally, some current computing problems are expected to scale up exponentially as the data increase.
Genome Annotation Consortium
Ed Uberbacher, Jay Snoddy, and Phil LoCasio (all at Oak Ridge National Laboratory) offered an update on progress at ORNL and the multi-institutional Genome Annotation Consortium (GAC), which was established to address massive computational and informational challenges.
The goals of this work are to develop a system for whole-genome annotation that (1) organizes various types of data around genome frameworks that can be cross-indexed, compared, and cross-navigated and (2) allows multiple analytical methods to be applied to the same data. Steps in the annotation process include the following:
Other significant computational challenges include integrating the major community maps, which often have inconsistences and discrepancies, and performing comprehensive sequence analyses for gene modeling, which requires the time-consuming application of several algorithms. Furthermore, completing some desired analyses for protein classification currently could require about 70 days on a 1024-node processor. Challenges are similar for such other comparative processes as genome-to-genome alignment for studying mouse and human synteny. As sequence numbers and lengths increase, challenges become even greater for making phylogenetic gene and species trees. Meeting these and other high-performance biological computing needs, the speakers emphasized, demands a centralized approach with advanced infrastructure and specialized facilities.
Uberbacher gave an overview of GAC progress in developing tools, servers, and special data views to serve the community. Achievements include establishment of data-acquisition and semiautomated sequence-assembly components and modules that are integrated to allow comprehensive genome-wide analysis. He noted that the computation-based GRAIL-EXP is finding about 10 times more human genes than investigators had identified previously, as indicated in the GenBank annotation. All human and microbial gene-analysis tools are available to researchers. At present, the Oak Ridge group is focusing on urgent annotation challenges from the massive sequencing ramp-up under way at the DOE Joint Genome Institute.
For More Information:
Multiple Genome Analysis: WIT
A poster by Natalia Maltsev (Argonne National Laboratory, ANL) and colleagues described ANL's WIT system. WIT was designed and implemented to support genetic sequence and comparative analysis of sequenced genomes and metabolic reconstructions from sequence data. It now contains data from 34 genomes (some incomplete).
The authors believe that parallel analysis of a large number of phylogenetically diverse genomes can add much to the understanding of higher-level functional subsystems and major physiological designs. They reported a new method for using conserved clusters of genes from numerous genomes to predict functional coupling between genes. Although early results are encouraging, investigators believe the precision of prediction and the amount of accessible functional coupling will increase dramatically as more genomes are added. They emphasized that this class of data may well become a significant resource for establishing the function of hypothetical proteins, better understanding the functions of paralogous genes, and reconstructing connections in higher-level functional subsystems.
The electronic form of the newsletter may be cited in the following
Last modified: Wednesday, October 29, 2003
Home * Contacts * Disclaimer
Document Use and Credits
Publications and webpages on this site were created by the U.S. Department of Energy Genome Program's Biological and Environmental Research Information System (BERIS). Permission to use these documents is not needed, but please credit the U.S. Department of Energy Genome Programs and provide the website http://genomics.energy.gov. All other materials were provided by third parties and not created by the U.S. Department of Energy. You must contact the person listed in the citation before using those documents.
Base URL: www.ornl.gov/hgmis
Site sponsored by the U.S. Department of Energy Office of Science, Office of Biological and Environmental Research, Human Genome Program