Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Structural characterization of genomes by large scale sequence-structure threading.

Artem Cherkasov1, Steven J M Jones

  • 1Genome Sciences Centre, British Columbia Cancer Agency, Vancouver, British Columbia, Canada. artc@interchange.ubc.ca

BMC Bioinformatics
|April 6, 2004
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Deep docking, part 2: an amplified DDU platform for ultra-large virtual screening.

Chemical science·2026
Same author

The Use of DeepQSAR Models for the Discovery of Peptides With Enhanced Antimicrobial and Antibiofilm Potential.

Molecular informatics·2026
Same author

Retraction of "The Use of DeepQSAR Models for The Discovery of Peptides with Enhanced Antimicrobial and Antibiofilm Potential".

Journal of chemical information and modeling·2026
Same author

Pharmacokinetics, pathology and efficacy of SARS-CoV-2 main protease inhibitor VPC285785 in a murine model of coronavirus infection.

Scientific reports·2026
Same author

CACHE Challenge #3: Targeting the Nsp3 Macrodomain of SARS-CoV-2.

Journal of chemical information and modeling·2026
Same author

Design of Highly Potent Antibiofilm, Antimicrobial Peptides Using Explainable Artificial Intelligence.

Journal of chemical information and modeling·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Sequence-structure threading successfully characterized protein folds in over 60% of eukaryotic, archaeal, and bacterial proteomes. This large-scale genomic analysis reveals universal patterns in protein architecture and distribution across life.

Area of Science:

  • Structural bioinformatics
  • Genomics
  • Computational biology

Background:

  • Utilized sequence-structure threading for comprehensive structural characterization.
  • Analyzed complete proteomes from 37 diverse organisms (archaea, bacteria, eukaryotes).
  • Total dataset comprised 167,888 genes from various species, including human and mouse.

Purpose of the Study:

  • To evaluate the performance and general applicability of sequence-structure threading for large-scale genomic studies.
  • To establish protein fold repertoires across different superkingdoms.
  • To investigate the distribution patterns of protein classes, architectures, and topologies.

Main Methods:

  • Applied sequence-structure threading to analyze complete proteomes.
  • Classified protein structures using the CATH 2.4 database.

Related Experiment Videos

  • Performed statistical analysis on genomic occurrences of protein superfamilies and topologies.
  • Main Results:

    • Assigned protein folds to over 60% of eukaryotic, 68% of archaeal, and 70% of bacterial proteomes.
    • Identified "alpha and beta" as the most abundant CATH class, "3-Layer (aba) Sandwich" as the most common architecture, and Rossmann fold as the most frequent topology.
    • Characterized repertoires of protein classes, architectures, and topologies for diverse organisms.

    Conclusions:

    • Genomic occurrences of protein superfamilies and topologies exhibit power-law distributions.
    • Established double logarithmic "frequency - genomic occurrence" dependences characteristic of scale-free systems.
    • Demonstrated the power-law nature of protein distribution across individual organisms and superkingdoms.