Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

From DNA to Protein03:06

From DNA to Protein

18.5K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
18.5K
Sanger Sequencing01:57

Sanger Sequencing

754.7K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
754.7K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.0K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.0K
Signal Sequences and Sorting Receptors01:41

Signal Sequences and Sorting Receptors

5.4K
Signal sequences are short amino acid sequences that guide newly synthesized proteins to their proper location within the cell. Classical signal sequences are fifteen to sixty amino acids long and present at the N-terminus of a polypeptide chain. Each signal sequence has a conserved segment of basic residues towards their N terminus, a hydrophobic core, and a C-terminus rich in polar residues. The C-terminus also contains a signal cleavage site and features a -3 -1 sequence motif. The -3-1...
5.4K
¹³C NMR: Distortionless Enhancement by Polarization Transfer (DEPT)01:20

¹³C NMR: Distortionless Enhancement by Polarization Transfer (DEPT)

1.1K
When proton-coupled carbon-13 spectra are simplified by a broadband proton decoupling technique, structural information about the coupled protons is lost. Distortionless enhancement by polarization transfer (DEPT) is a technique that provides information on the number of hydrogens attached to each carbon in a molecule. While the DEPT experiment utilizes complex pulse sequences, the pulse delay and flip angle are specifically manipulated. The resulting signals have different phases depending on...
1.1K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Nanoscale Hydroxyapatites Reprogram Phyllosphere Microbiomes and Disease Resistance-Related Phytohormone Biosynthesis in <i>Magnaporthe Oryzae</i>-Infected Rice.

Journal of agricultural and food chemistry·2026
Same author

Arginine metabolism and the NF-ĸB pathway jointly regulate the airway inflammation in asthma mediated by ILC2s.

International immunopharmacology·2026
Same author

Revealing shared gene signatures and HIF-1α pathway in allergic rhinitis and asthma: a bioinformatics and experimental study.

The Journal of asthma : official journal of the Association for the Care of Asthma·2026
Same author

A dual-asymmetric hemostatic Janus hydrogel patch inhibits postoperative recurrence and metastasis of triple-negative breast cancer by suppressing COX-2/PGE<sub>2</sub> axis.

Biomaterials·2026
Same author

Unlocking the acquired ultraviolet-resistance and enhanced antioxidant potential of pterostilbene-loaded pharmaceutical-dots.

International journal of pharmaceutics: X·2026
Same author

The Circadian Rhythm of Asthma Immune Metabolism.

Journal of immunology research·2026
Same journal

Modeling the impact of budget limitation on the screening and treatment pathway of HPV-induced precancerous cervical lesions.

Mathematical biosciences and engineering : MBE·2026
Same journal

Modeling the effects of trait-mediated dispersal on coexistence of two species: Competition and non-consumptive predator-prey.

Mathematical biosciences and engineering : MBE·2026
Same journal

A close look at the viral reduction rate in target cell limited models.

Mathematical biosciences and engineering : MBE·2026
Same journal

A stochastic agent-based model for simulating tumor-immune dynamics and evaluating therapeutic strategies.

Mathematical biosciences and engineering : MBE·2026
Same journal

Addressing domain shift via imbalance-aware domain adaptation in embryo development assessment.

Mathematical biosciences and engineering : MBE·2026
Same journal

Effect of drug resistance on an HIV epidemic in heterogeneous populations.

Mathematical biosciences and engineering : MBE·2026
See all related articles

Related Experiment Video

Updated: Jul 17, 2025

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.0K

A normalized differential sequence feature encoding method based on amino acid sequences.

Xiaoman Zhao1,2, Xue Wang1, Zhou Jin1

  • 1Institute of Intelligent Machinery, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei 230031, China.

Mathematical Biosciences and Engineering : MBE
|September 7, 2023
PubMed
Summary
This summary is machine-generated.

A new coding method, normalized difference sequence characteristics (NDSF), improves human protein interaction prediction. Combining NDSF with principal component analysis (PCA) and XGBoost achieved 99.2% accuracy, outperforming other methods.

Keywords:
amino acid sequencesdimensionality reduction methodsintegrated learningprotein interactionssequence feature extraction

More Related Videos

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.4K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.9K

Related Experiment Videos

Last Updated: Jul 17, 2025

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation
07:15

Determining the Likelihood of Variant Pathogenicity Using Amino Acid-level Signal-to-Noise Analysis of Genetic Variation

Published on: January 16, 2019

11.0K
An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.4K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

1.9K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Protein interactions are fundamental to cellular processes like apoptosis and immune response.
  • Accurate prediction of protein interactions is crucial for understanding cellular functions and disease mechanisms.

Purpose of the Study:

  • To develop and evaluate an optimized coding method for human protein interaction prediction.
  • To compare the performance of ensemble learning methods using different dimensionality reduction techniques.

Main Methods:

  • A novel coding method, normalized difference sequence characteristics (NDSF), was developed using amino acid sequence positional and correlational features.
  • Principal Component Analysis (PCA) and Local Linear Embedding (LLE) were employed for dimensionality reduction of the NDSF-coded protein sequences.
  • Four ensemble learning algorithms (AdaBoost, Extra trees, LightGBM, XGBoost) were assessed for classification performance.

Main Results:

  • The NDSF coding method demonstrated higher accuracy and reduced loss and coding time compared to the MOS method.
  • Linear dimensionality reduction (PCA) yielded significantly better classification accuracy than nonlinear reduction (LLE).
  • The XGBoost classifier, utilizing NDSF features reduced by PCA, achieved a peak accuracy of 99.2%.

Conclusions:

  • NDSF combined with PCA and XGBoost presents a highly effective strategy for classifying human protein interactions.
  • This approach offers improved accuracy and efficiency for protein interaction prediction in bioinformatics.
  • The findings highlight the importance of feature coding and dimensionality reduction in machine learning for biological data analysis.