Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

From DNA to Protein03:06

From DNA to Protein

22.5K
The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
22.5K
Single-Strand DNA Binding Proteins01:03

Single-Strand DNA Binding Proteins

16.7K
For successful DNA replication, the unwinding of double-stranded DNA must be accompanied by stabilization and protection of the separated single strands of the DNA. This crucial task is performed by single-strand DNA-binding (SSB) proteins. They bind to the DNA in a sequence-independent manner, which means that the nitrogenous bases of the DNA need not be present in a specific order for binding of SSB proteins to it. The binding of SSB proteins straightens single-stranded DNA (ssDNA) and makes...
16.7K
Conserved Binding Sites01:49

Conserved Binding Sites

5.2K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
5.2K
DNA Base Pairing02:27

DNA Base Pairing

33.6K
Erwin Chargaff’s rules on DNA equivalence paved the way for the discovery of base pairing in DNA. Chargaff’s rules state that in a double-stranded DNA molecule,
33.6K
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

11.9K
Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
11.9K
Cooperative Binding of Transcription Regulators02:13

Cooperative Binding of Transcription Regulators

7.4K
Transcriptional regulators bind to specific cis-regulatory sequences in the DNA to regulate gene transcription. These cis-regulatory sequences are very short, usually less than ten nucleotide pairs in length. The short length means that there is a high probability of the exact same sequence randomly occurring throughout the genome.  Since regulators can also bind to groups of similar sequences, this further increases the chances of random binding. Transcriptional regulators form...
7.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

ESMDisPred: A Structure-Aware CNN-Transformer Architecture for Intrinsically Disordered Protein Prediction.

bioRxiv : the preprint server for biology·2026
Same author

EnsembleRegNet: Interpretable deep learning for transcriptional network inference from single-cell RNA-seq.

Computational biology and chemistry·2025
Same author

PPILS: Protein-protein interaction prediction with language of biological coding.

Computers in biology and medicine·2025
Same author

SumoPred-PLM: human SUMOylation and SUMO2/3 sites Prediction using Pre-trained Protein Language Model.

NAR genomics and bioinformatics·2024
Same author

DRBpred: A sequence-based machine learning method to effectively predict DNA- and RNA-binding residues.

Computers in biology and medicine·2024
Same author

TAFPred: Torsion Angle Fluctuations Prediction from Protein Sequences.

Biology·2023
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 7, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.8K

StackDPPred: a stacking based prediction of DNA-binding protein from sequence.

Avdesh Mishra1, Pujan Pokhrel1, Md Tamjidul Hoque1

  • 1Department of Computer Science, University of New Orleans, New Orleans, LA, USA.

Bioinformatics (Oxford, England)
|July 23, 2018
PubMed
Summary
This summary is machine-generated.

Predicting DNA-binding proteins from sequence is crucial for genome annotation. A new computational method, StackDPPred, uses evolutionary profiles and contact energy for accurate DNA-binding protein identification, outperforming existing approaches.

More Related Videos

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.6K
Methyl-binding DNA capture Sequencing for Patient Tissues
08:40

Methyl-binding DNA capture Sequencing for Patient Tissues

Published on: October 31, 2016

9.1K

Related Experiment Videos

Last Updated: Feb 7, 2026

A Protocol for Computer-Based Protein Structure and Function Prediction
16:41

A Protocol for Computer-Based Protein Structure and Function Prediction

Published on: November 3, 2011

69.8K
Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions
06:50

Author Spotlight: A Computational Approach to Decipher Amino Acid Preferences in Multispecific Protein-Protein Interactions

Published on: January 26, 2024

2.6K
Methyl-binding DNA capture Sequencing for Patient Tissues
08:40

Methyl-binding DNA capture Sequencing for Patient Tissues

Published on: October 31, 2016

9.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Identifying DNA-binding proteins solely from sequence information presents a significant challenge in genome annotation.
  • DNA-binding proteins are vital for fundamental biological processes including DNA replication, repair, transcription, and splicing.
  • Current experimental methods for identifying DNA-binding proteins are costly and time-consuming, necessitating efficient computational approaches.

Purpose of the Study:

  • To develop an effective computational method for predicting DNA-binding proteins using only sequence information.
  • To improve the accuracy of DNA-binding protein prediction beyond existing methods that rely solely on Position-Specific Scoring Matrix (PSSM) profiles.
  • To provide a tool that can accelerate genome annotation and guide experimental validation.

Main Methods:

  • Proposed StackDPPred, a stacking-based machine learning method.
  • Utilized features extracted from PSSM profiles and residue-specific contact energy.
  • Employed jackknife validation on a benchmark dataset of 1063 proteins (518 DNA-binding, 545 non DNA-binding).

Main Results:

  • StackDPPred achieved high performance metrics: 89.96% accuracy (ACC), 0.799 Matthews Correlation Coefficient (MCC), and 94.50% Area Under the Curve (AUC).
  • The method demonstrated superior performance compared to several state-of-the-art approaches.
  • Consistent outperformance was observed on two independent test datasets, validating its robustness.

Conclusions:

  • StackDPPred offers an effective computational solution for predicting DNA-binding proteins directly from their amino acid sequences.
  • The integration of PSSM and contact energy features enhances prediction accuracy.
  • The developed method serves as a valuable tool for rapid annotation and experimental guidance in genomics research.