StackDPPred: a stacking based prediction of DNA-binding protein from sequence.

Avdesh Mishra¹, Pujan Pokhrel¹, Md Tamjidul Hoque¹

¹Department of Computer Science, University of New Orleans, New Orleans, LA, USA.

Bioinformatics (Oxford, England)

July 23, 2018

Summary

This summary is machine-generated.

Predicting DNA-binding proteins from sequence is crucial for genome annotation. A new computational method, StackDPPred, uses evolutionary profiles and contact energy for accurate DNA-binding protein identification, outperforming existing approaches.

Area of Science:

Genomics
Bioinformatics
Computational Biology

Background:

Identifying DNA-binding proteins solely from sequence information presents a significant challenge in genome annotation.
DNA-binding proteins are vital for fundamental biological processes including DNA replication, repair, transcription, and splicing.
Current experimental methods for identifying DNA-binding proteins are costly and time-consuming, necessitating efficient computational approaches.

Purpose of the Study:

To develop an effective computational method for predicting DNA-binding proteins using only sequence information.
To improve the accuracy of DNA-binding protein prediction beyond existing methods that rely solely on Position-Specific Scoring Matrix (PSSM) profiles.
To provide a tool that can accelerate genome annotation and guide experimental validation.

Main Methods:

Proposed StackDPPred, a stacking-based machine learning method.
Utilized features extracted from PSSM profiles and residue-specific contact energy.
Employed jackknife validation on a benchmark dataset of 1063 proteins (518 DNA-binding, 545 non-DNA-binding).
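
The two ideas named above, stacking and jackknife (leave-one-out) validation, can be illustrated with a minimal Python sketch. It is not the StackDPPred implementation: the base learners (`nearest_centroid`, `one_nearest_neighbor`), the meta-learner, and the six toy feature vectors are hypothetical stand-ins chosen so the example runs with the standard library alone. Real inputs would be features derived from PSSM profiles and contact energy.

```python
from math import dist

def nearest_centroid(train_X, train_y, x):
    """Toy learner (assumption): label of the closest class centroid."""
    centroids = {}
    for label in set(train_y):
        rows = [r for r, t in zip(train_X, train_y) if t == label]
        centroids[label] = [sum(col) / len(rows) for col in zip(*rows)]
    return min(centroids, key=lambda label: dist(centroids[label], x))

def one_nearest_neighbor(train_X, train_y, x):
    """Toy learner (assumption): label of the single closest training sample."""
    nearest = min(range(len(train_X)), key=lambda i: dist(train_X[i], x))
    return train_y[nearest]

def leave_out(items, i):
    """Return a copy of the list without element i."""
    return items[:i] + items[i + 1:]

def stacked_predict(train_X, train_y, x, base_learners, meta_learner):
    """Two-level stacking: base predictions become features for the meta-learner."""
    # Level 0: out-of-fold base predictions, so the meta-learner is never
    # trained on a base prediction for a sample the base learner had seen.
    meta_X = [
        [float(base(leave_out(train_X, i), leave_out(train_y, i), train_X[i]))
         for base in base_learners]
        for i in range(len(train_X))
    ]
    # Level 1: base learners built on all training data describe the query.
    query = [float(base(train_X, train_y, x)) for base in base_learners]
    return meta_learner(meta_X, train_y, query)

def jackknife(X, y, predict):
    """Leave-one-out test: predict each sample from a model built on the rest."""
    return [predict(leave_out(X, i), leave_out(y, i), X[i]) for i in range(len(X))]

# Hypothetical 2-D feature vectors; 1 = DNA-binding, 0 = non-DNA-binding.
X = [[0.9, 0.8], [1.0, 1.1], [0.8, 1.0], [0.1, 0.2], [0.0, 0.1], [0.2, 0.0]]
y = [1, 1, 1, 0, 0, 0]

bases = [nearest_centroid, one_nearest_neighbor]
predictions = jackknife(
    X, y, lambda tx, ty, q: stacked_predict(tx, ty, q, bases, nearest_centroid)
)
accuracy = sum(p == t for p, t in zip(predictions, y)) / len(y)
print(predictions, accuracy)
```

In a jackknife test every protein is held out once, so a benchmark of 1063 proteins would mean 1063 train-and-predict rounds. The toy loop above is only practical because the stand-in learners are trivial.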

Main Results:

StackDPPred achieved an accuracy (ACC) of 89.96%, a Matthews correlation coefficient (MCC) of 0.799, and an area under the curve (AUC) of 94.50%.
The method demonstrated superior performance compared to several state-of-the-art approaches.
Consistent outperformance was observed on two independent test datasets, validating its robustness.
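
For readers unfamiliar with the three reported metrics, the following Python sketch shows how ACC, MCC, and AUC are conventionally computed for a binary classifier. The confusion-matrix counts and scores in the usage lines are hypothetical and are not taken from the paper. The AUC function assumes the usual ROC interpretation (the probability that a randomly chosen positive is scored above a randomly chosen negative), which the summary does not state explicitly.

```python
from math import sqrt

def accuracy(tp, tn, fp, fn):
    """Fraction of all predictions that are correct."""
    return (tp + tn) / (tp + tn + fp + fn)

def mcc(tp, tn, fp, fn):
    """Matthews correlation coefficient; ranges from -1 to +1."""
    denom = sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention, return 0 when any marginal count is zero.
    return (tp * tn - fp * fn) / denom if denom else 0.0

def auc(scores, labels):
    """Pairwise AUC: share of positive/negative pairs ranked correctly (ties count half)."""
    pos = [s for s, label in zip(scores, labels) if label == 1]
    neg = [s for s, label in zip(scores, labels) if label == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

# Hypothetical counts: 8 true positives, 7 true negatives,
# 3 false positives, 2 false negatives.
acc_value = accuracy(tp=8, tn=7, fp=3, fn=2)
mcc_value = mcc(tp=8, tn=7, fp=3, fn=2)

# Hypothetical classifier scores with their true labels.
auc_value = auc(scores=[0.9, 0.8, 0.4, 0.3], labels=[1, 0, 1, 0])
print(acc_value, round(mcc_value, 3), auc_value)
```

Unlike accuracy, MCC uses all four cells of the confusion matrix. This makes it a useful complement when the two classes are not perfectly balanced, as with the 518 DNA-binding and 545 non-DNA-binding proteins in the benchmark.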

Conclusions:

StackDPPred offers an effective computational solution for predicting DNA-binding proteins directly from their amino acid sequences.
The integration of PSSM and contact energy features enhances prediction accuracy.
The developed method serves as a valuable tool for rapid annotation and experimental guidance in genomics research.