Predicted structural proteome of Sphagnum divinum and proteome-scale annotation...

Motivation: Sphagnum-dominated peatlands store a substantial amount of terrestrial carbon, yet the genus Sphagnum is undersampled and understudied. No experimental crystal structure from any Sphagnum species exists in the Protein Data Bank, and fewer than 200 Sphagnum-related genes have structural models available in the AlphaFold Protein Structure Database. Tools and resources are needed to help bridge these gaps and to enable the analysis of other structural proteomes now made possible by accurate structure prediction.

Results: We present the predicted structural proteome (25,134 primary transcripts) of S. divinum computed using AlphaFold; structural alignments of all high-confidence models against an annotated, non-redundant crystallographic database of over 90,000 structures; a structure-based classification of putative Enzyme Commission (EC) numbers across this proteome; and the computational method used to perform this proteome-scale, structure-based annotation.

Availability: All data and code are available in public repositories, detailed at The structural models of the S. divinum proteome have been deposited in the ModelArchive repository at

Supplementary information: Supplementary data are available at Bioinformatics online.