Beyond the Identification of Transcribed
Sequences:
Functional and Expression Analysis
11th Annual Workshop
November 9-12, 2001
Washington D.C.
Richard Mural
45 West Gude Drive
Rockville, MD 20850
telephone: 240-453-3665
email: richard.mural@celera.com
prestype: Platform
presenter: Richard Mural
Richard J. Mural, Celera Genomics. Celera Genomics, Rockville, MD. 20850. www.celera.com
We have generated whole genome shotgun sequence from 3 strains of the laboratory mouse (129X1/SvJ, A/J, DBA/2) to a total coverage of 5.5X coverage. Paired-end sequences from libraries with insert sizes of 2, 10 and 50 kbp (about 23 million reads) were assembled using the whole genome assembly protocol that we have previously described for the assembly of the human genome (Venter, et al. Science, vol. 291, 2001). Nearly 50% of this sequence assembles into scaffold >5 megabases in length. More recently we have completed a new assembly which incorporates data generated by the public mouse genome project from the C57/BL6 strain (this assembly has 7 to 8X coverage of the mouse genome).
We have also analyzed the assembled mouse genome with a modification of the
computational analysis pipeline and auto-annotation system, Otto, that we used
for the computational annotation of the human genome. This analysis pipeline
included mouse full-length cDNAs as well as human genes. The results of this
analysis and a preliminary comparison of the human and mouse gene complement
will be presented.