Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

14.8K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
14.8K
Protein Networks02:26

Protein Networks

4.6K
An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
4.6K
Conservation of Protein Domains02:26

Conservation of Protein Domains

4.3K
4.3K
Protein Organization01:24

Protein Organization

9.8K
Proteins are polymers of amino acid residues. They are versatile and responsible for different cellular functions, including DNA replication, molecular transport, catalysis, and structural support. Proteins have a hierarchical structure comprising at least three levels of organization: primary, secondary, and tertiary structure. Some large proteins have a quaternary structure where individual protein subunits are linked together.
The primary structure of a protein is its amino acid sequence....
9.8K
Protein Organization01:13

Protein Organization

159.6K
Overview
159.6K
RNA-seq03:21

RNA-seq

12.2K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
12.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

scKSFD: federated distillation model with knowledge sharing for cell type classification of clinical transcriptome data.

BMC bioinformatics·2026
Same author

Machine learning-based prediction of ischemic cardio-cerebrovascular events after endovascular or microsurgical treatment of unruptured intracranial aneurysms and risk stratification by the early post-treatment triglyceride-glucose index.

Frontiers in neurology·2026
Same author

Development of a Subsequence Correlation Coefficient Feature Vector Method for High-Resolution HIV-1 Subtype Classification - China, 2004-2022.

China CDC weekly·2026
Same author

Prediction of HIV-1 sensitivity to broadly neutralizing antibodies using statistical distribution sampling (SDS) technology.

BMC bioinformatics·2026
Same author

Dysregulation of xenobiotic metabolism and mitochondrial dysfunction exacerbate acetaminophen-induced hepatotoxicity in human antigen R-deficient male mice.

Toxicology·2026
Same author

Endovascular therapy for symptomatic non-acute middle cerebral artery occlusion: impact on Lenticulostriate arteries reperfusion and clinical safety.

BMC neurology·2026
Same journal

The male-biased sex ratio in humans and its role in the transition from promiscuity to pair bonding.

Journal of theoretical biology·2026
Same journal

Quantifying the counter-intuitive effects of vaccination by coupling the transmission dynamics of COVID-19 and the evolution of human behaviors.

Journal of theoretical biology·2026
Same journal

An integrative model of FGF2-induced signaling and muscle cell proliferation.

Journal of theoretical biology·2026
Same journal

A hybrid reaction-diffusion and mechanical stimulus model for mandibular bone remodeling under chewing and vibratory loading.

Journal of theoretical biology·2026
Same journal

Integrated tick management strategies in fragmented peridomestic environments.

Journal of theoretical biology·2026
Same journal

Joint likelihood-free inference of the number of selected single nucleotide polymorphisms and their selection coefficients in an evolving population.

Journal of theoretical biology·2026
See all related articles

Related Experiment Video

Updated: Mar 1, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

4.0K

A novel alignment-free vector method to cluster protein sequences.

Lily He1, Yongkun Li1, Rong Lucy He2

  • 1Department of Mathematical Sciences, Tsinghua University, Beijing 100084, PR China.

Journal of Theoretical Biology
|June 8, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a novel 24-dimensional feature vector for protein classification, utilizing amino acid biochemical properties. This fast and accurate method aids in protein sequence analysis and phylogenetic inference, especially for viral genomes.

Keywords:
Alignment-freeBiochemical propertiesPhylogenyVector

More Related Videos

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.8K

Related Experiment Videos

Last Updated: Mar 1, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

4.0K
Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group
07:49

Creating and Applying a Reference to Facilitate the Discussion and Classification of Proteins in a Diverse Group

Published on: August 16, 2017

7.5K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.8K

Area of Science:

  • Biochemistry
  • Bioinformatics
  • Computational Biology

Background:

  • Protein sequence classification is vital in biology, with rapidly growing sequence databases.
  • Traditional methods like multiple sequence alignment are computationally intensive and unsuitable for genomes with rearrangements, such as viral genomes.

Purpose of the Study:

  • To develop a fast and accurate method for protein classification and phylogenetic inference.
  • To address limitations of traditional sequence alignment methods for large datasets and complex genomes.

Main Methods:

  • A novel 24-dimensional feature vector was developed based on three key amino acid biochemical properties: hydropathy index, polar requirement, and side chain chemical composition.
  • The method incorporates amino acid numbers and their positions within protein sequences.
  • The approach was tested on beta-globin, mammals, and three virus datasets.

Main Results:

  • The proposed method demonstrated high speed and accuracy in classifying protein sequences.
  • The tool proved effective for inferring the phylogeny of organisms across diverse datasets.
  • The feature vector approach successfully captured essential information from protein sequences.

Conclusions:

  • The new feature vector method offers an efficient and accurate alternative for protein classification and phylogenetic analysis.
  • This approach is particularly beneficial for analyzing large-scale genomic data and identifying evolutionary relationships.
  • The study highlights the utility of biochemical properties in computational biology for sequence analysis.