Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Protein Organization01:24

Protein Organization

6.5K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
6.5K
Protein and Protein Structures02:15

Protein and Protein Structures

10.5K
10.5K
Protein and Protein Structure02:15

Protein and Protein Structure

79.6K
Proteins are one of the most abundant organic molecules in living systems and have the most diverse range of functions of all macromolecules. Proteins may be structural, regulatory, contractile, or protective. They may serve in transport, storage, or membranes; or they may be toxins or enzymes. Their structures, like their functions, vary greatly. They are all, however, amino acid polymers arranged in a linear sequence.
A protein's shape is critical to its function. For example, an enzyme...
79.6K
Protein-Protein Interfaces02:04

Protein-Protein Interfaces

3.8K
3.8K
Protein Complex Assembly02:41

Protein Complex Assembly

2.1K
2.1K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

DANCE: Deep Learning-Assisted Analysis of ProteiN Sequences Using Chaos Enhanced Kaleidoscopic Images.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same author

<i>Special Issue, Part II</i> 20th International Symposium on Bioinformatics Research and Applications (ISBRA 2024).

Journal of computational biology : a journal of computational molecular cell biology·2025
Same author

<i>Special Issue, Part I</i> 20th International Symposium on Bioinformatics Research and Applications (ISBRA 2024).

Journal of computational biology : a journal of computational molecular cell biology·2025
Same author

<i>Special Issue, Part 2</i> 19th International Symposium on Bioinformatics Research and Applications (ISBRA 2023).

Journal of computational biology : a journal of computational molecular cell biology·2024
Same author

<i>Special Issue, Part I</i> 19th International Symposium on Bioinformatics Research and Applications (ISBRA 2023).

Journal of computational biology : a journal of computational molecular cell biology·2024
Same author

From PDB files to protein features: a comparative analysis of PDB bind and STCRDAB datasets.

Medical & biological engineering & computing·2024
Same journal

Tissue MicroRNAs in Arrhythmogenic Cardiomyopathy: A Systematic Review of Studies in Human Myocardium and Animal Models with Implications for Post-Mortem Molecular Diagnostics.

Genes·2026
Same journal

Genetic Variants and Dental Caries Susceptibility: An Umbrella Review and Multilevel Meta-Analysis.

Genes·2026
Same journal

Generative AI and Language Models in Human Genetics and Health: From Variant Interpretation to Clinical Decision Support.

Genes·2026
Same journal

Familial White-Sutton Syndrome Caused by a Pathogenic POGZ p.Arg508* Variant: Intrafamilial Variability from Childhood to Adulthood.

Genes·2026
Same journal

Genetic Influence on LDL-Cholesterol Levels: Role of Polygenic Risk Scores and Lp(a) Beyond Monogenic Hypercholesterolemia.

Genes·2026
Same journal

THBS1 as a Key Regulator of Myoblasts: Validation of Its Inhibitory Roles in Skeletal Muscle Development.

Genes·2026
See all related articles

Related Experiment Video

Updated: Jul 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

587

When Protein Structure Embedding Meets Large Language Models.

Sarwan Ali1, Prakash Chourasia1, Murray Patterson1

  • 1Department of Computer Science, Georgia State University, Atlanta, GA 30303, USA.

Genes
|January 23, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces novel protein embeddings using 3D structure and sequence data for improved protein classification. The method enhances accuracy in predicting protein functions, benefiting drug discovery and disease diagnosis.

Keywords:
LLMPDB filesclassificationprotein structurerepresentation learning

More Related Videos

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K
Analyzing Protein Architectures and Protein-Ligand Complexes by Integrative Structural Mass Spectrometry
07:33

Analyzing Protein Architectures and Protein-Ligand Complexes by Integrative Structural Mass Spectrometry

Published on: October 15, 2018

14.3K

Related Experiment Videos

Last Updated: Jul 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

587
A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

68.7K
Analyzing Protein Architectures and Protein-Ligand Complexes by Integrative Structural Mass Spectrometry
07:33

Analyzing Protein Architectures and Protein-Ligand Complexes by Integrative Structural Mass Spectrometry

Published on: October 15, 2018

14.3K

Area of Science:

  • Bioinformatics and Structural Biology
  • Computational Biology
  • Machine Learning in Life Sciences

Background:

  • Protein structure analysis is crucial for drug discovery, disease diagnosis, and evolutionary studies.
  • Current protein classification methods often rely on sequence-based embeddings, neglecting vital 3D structural information.
  • Existing approaches lack a unified strategy combining structural and sequence features for efficient protein analysis.

Purpose of the Study:

  • To develop a novel method for creating numerical protein embeddings that integrate 3D structural information with sequence data.
  • To enhance the performance of protein classification and function prediction by synergistically combining diverse feature sets.
  • To address the limitations of sequence-only embeddings and Euclidean space assumptions in representing complex protein data.

Main Methods:

  • Leveraging 3D protein structure information through contact maps to design Euclidean space embeddings.
  • Integrating structure-based embeddings with features from large language models (LLMs) and traditional feature engineering.
  • Utilizing benchmark datasets such as PDB Bind and STCRDAB for experimental validation.

Main Results:

  • The proposed method demonstrates superior performance in supervised protein analysis and function prediction compared to existing approaches.
  • Combined embeddings effectively capture both structural and sequential characteristics of proteins.
  • Experimental results validate the efficacy of the novel embedding strategy on diverse protein datasets.

Conclusions:

  • The novel embedding approach significantly advances protein classification and function prediction by incorporating 3D structural data.
  • This method offers a more comprehensive representation of proteins, improving accuracy in bioinformatics applications.
  • The findings pave the way for more sophisticated machine learning models in structural biology and drug discovery.