Jove
Visualize
Contact Us

Related Concept Videos

Protein Organization01:24

Protein Organization

6.4K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
6.4K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.8K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.8K
Protein Folding01:22

Protein Folding

117.8K
Overview
117.8K
Protein and Protein Structure02:15

Protein and Protein Structure

79.4K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.4K
Tail-anchoring of Proteins in the ER Membrane01:45

Tail-anchoring of Proteins in the ER Membrane

3.1K
Tail-anchored, or TA, proteins are estimated to make up to 3-5% of membrane proteins found in the eukaryotic cell. Such proteins have a single transmembrane domain located approximately 30 amino acid residues upstream from the C-terminal end. As a result, the signal recognition particle (SRP) cannot guide a TA protein to the ER membrane for cotranslational insertion. Hence, they are integrated into the ER membrane post-translationally using their C-terminal end as the anchor. TA proteins...
3.1K
Protein Families02:47

Protein Families

15.3K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
15.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Reference-free discovery with barcoded single-cell sequencing.

Nature biotechnology·2026
Same author

FunctionaL Assigning Sequence Homing (FLASH) maps phenotype to sequence with deep and machine learning.

bioRxiv : the preprint server for biology·2026
Same author

Fast and accurate multiple-protein-sequence alignment at scale with FAMSA2.

Nature biotechnology·2026
Same author

A Reference-Free Algorithm Discovers Regulation in the Plant Transcriptome.

Plant direct·2026
Same author

An nf-core framework for the systematic comparison of alternative modeling tools: the multiple sequence alignment case study.

NAR genomics and bioinformatics·2025
Same author

Ultrafast and accurate sequence alignment and clustering of viral genomes.

Nature methods·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Video

Updated: Jun 21, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.6K

Efficient protein structure archiving using ProteStAr.

Sebastian Deorowicz1, Adam Gudyś1

  • 1Department of Algorithmics and Software, Silesian University of Technology, Akademicka 16, Gliwice, PL-44100, Poland.

Bioinformatics (Oxford, England)
|July 10, 2024
PubMed
Summary
This summary is machine-generated.

ProteStAr efficiently compresses large protein structure files (CIF/PDB) using a novel prediction method. This tool significantly reduces data size for large-scale analyses, outperforming existing methods.

More Related Videos

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K
Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web
09:51

Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web

Published on: July 16, 2017

15.4K

Related Experiment Videos

Last Updated: Jun 21, 2025

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.6K
Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules
10:58

Protein WISDOM: A Workbench for In silico De novo Design of BioMolecules

Published on: July 25, 2013

17.0K
Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web
09:51

Investigating Protein Sequence-structure-dynamics Relationships with Bio3D-web

Published on: July 16, 2017

15.4K

Area of Science:

  • Structural biology
  • Bioinformatics
  • Computational chemistry

Background:

  • DeepMind's AlphaFold 2 generates millions of protein structures in CIF/PDB formats.
  • Large datasets (tens of terabytes) hinder large-scale structural analyses.
  • Existing compression methods are insufficient for predicted protein structure data.

Purpose of the Study:

  • Introduce ProteStAr, a specialized compressor for protein structure files (CIF/PDB) and associated PAE files.
  • Develop a novel compression approach for efficient encoding of atomic coordinates.
  • Enable faster and more accessible large-scale analysis of predicted protein structures.

Main Methods:

  • Developed a novel algorithm for predicting atom coordinates based on previously analyzed atoms.
  • Implemented both lossless and lossy compression modes with controlled error.
  • Optimized the algorithm for multicore CPUs, achieving speeds of approximately 1 GB/s.
  • Provided Python and C++ APIs for enhanced usability.

Main Results:

  • ProteStAr achieves superior compression ratios compared to BinaryCIF, Foldcomp, and PDC at comparable reconstruction accuracy.
  • The method efficiently encodes atomic coordinates, the largest data component in structure files.
  • High-speed compression and decompression (approx. 1 GB/s) leverage multicore architectures.
  • Lossy mode allows controlled error for further data reduction.

Conclusions:

  • ProteStAr offers a significant improvement in compressing large protein structure datasets.
  • The tool facilitates large-scale structural bioinformatics analyses by reducing data storage and transfer requirements.
  • Its speed, efficiency, and accessibility (via APIs) make it valuable for researchers.