Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Organization01:24

Protein Organization

Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence.
Protein Organization01:24

Protein Organization

Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence.
Classification of Systems-I01:26

Classification of Systems-I

Linearity is a system property characterized by a direct input-output relationship, combining homogeneity and additivity.
Homogeneity dictates that if an input x(t) is multiplied by a constant c, the output y(t) is multiplied by the same constant. Mathematically, this is expressed as:
Protein Folding Quality Check in the RER01:29

Protein Folding Quality Check in the RER

ER is the primary site for the maturation and folding of soluble and transmembrane secretory proteins. The calnexin cycle is a specific chaperone system that folds and assesses the confirmation of N-glycosylated proteins before they can exit the ER lumen. The primary players of this quality check pipeline are the lectins, ER-resident chaperones, and a glucosyl transferase enzyme. In case the calnexin system in the lumen fails to salvage a misfolded protein, it is transported to the cytoplasm...
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Survival Tree01:19

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a survival tree begins...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

PyMolGen: Database-Driven Molecular Generation of Drug-Like Compounds.

Journal of chemical information and modeling·2026
Same author

Design and characterization of bioinstructive gold nanocomposite hydrogels for 3D bioprinting.

Journal of materials chemistry. B·2026
Same author

Late presentation of peripheral corneal infiltrates after collagen crosslinking in keratoconus.

BMJ case reports·2026
Same author

Proposed Biosynthesis of the Complex Ring-Fused Diterpene Rameswaralide. Mechanistic Insights Using Density Functional Theory.

The Journal of organic chemistry·2026
Same author

Rega: A Platform for the Prediction of the Regioselectivity of C-H Functionalization Reactions.

Journal of chemical information and modeling·2026
Same author

DyeDactic workflow to predict halochromism of biosynthetic colourants.

Communications chemistry·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 11, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Automatic structure classification of small proteins using random forest.

Pooja Jain1, Jonathan D Hirst

  • 1School of Chemistry, The University of Nottingham, University Park, Nottingham, NG7 2RD, UK.

BMC Bioinformatics
|July 3, 2010
PubMed
Summary
This summary is machine-generated.

Random forest machine learning accurately predicts protein structural classifications using structural descriptors. This method effectively classifies protein domains, even for complex structures, demonstrating its utility in structural biology.

Related Experiment Videos

Last Updated: Jun 11, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

Area of Science:

  • Computational biology
  • Structural bioinformatics
  • Machine learning applications

Background:

  • Supervised machine learning, specifically random forest, is employed for predicting Structural Classification of Proteins (SCOP) classifications.
  • The algorithm utilizes structural descriptors to match target structures with templates having similar numbers of secondary structure elements (SSEs).
  • Initial validation focused on domains with three SSEs, with subsequent assessments on larger domains (four to six SSEs).

Purpose of the Study:

  • To evaluate the efficacy of random forest in predicting SCOP structural classifications.
  • To demonstrate the algorithm's applicability to protein domains of varying complexity (3-6 SSEs).
  • To assess the potential of random forest for addressing SCOP classification challenges and predicting classifications for unassigned PDB structures.

Main Methods:

  • Application of the random forest algorithm to predict SCOP classifications.
  • Training the model on SCOP version 1.69 and testing on an independent dataset from SCOP version 1.73.
  • Analysis of predictive accuracy and Matthew's correlation coefficient (MCC) across different SCOP levels and domain sizes.

Main Results:

  • Achieved up to 94% predictive accuracy on an independent test set.
  • Matthew's correlation coefficient (MCC) for classification ranged from 0.61 to 0.83 across SCOP Class, Fold, Super-family, and Family levels.
  • Predictive accuracy (MCC) decreased as the number of constituent SSEs increased.

Conclusions:

  • Random forest is a useful tool for classifying protein domains within the SCOP hierarchy.
  • The algorithm can potentially address issues like new structural level introductions and singleton level mergers in SCOP.
  • The study successfully mimicked a real-world scenario by predicting classifications for PDB structures awaiting SCOP assignment.