Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

DNA Microarrays02:34

DNA Microarrays

17.2K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
17.2K
Conserved Binding Sites01:49

Conserved Binding Sites

4.2K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.2K
Next-generation Sequencing03:00

Next-generation Sequencing

87.4K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
87.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Rethinking bioinformatics expertise in the era of artificial intelligence.

NPJ digital medicine·2026
Same author

Enhancing protein structure prediction: evaluating the role of amino acid physicochemical features in homology search.

Briefings in bioinformatics·2026
Same author

Protrec2: tissue-specific network-based missing protein recovery method.

Briefings in bioinformatics·2025
Same author

Establishing the Asia & Pacific Bioinformatics Joint Congress: a historic milestone in regional bioinformatics collaboration.

Briefings in bioinformatics·2025
Same author

Comprehensive benchmarking of methods for mutation calling in circulating tumor DNA.

Nature communications·2025
Same author

Efficient trace reconstruction in DNA storage systems using bidirectional beam search.

iScience·2025
Same journal

STED: flexible cross-modal topic modeling infers cell-type-specific regulatory landscapes from bulk epigenomics.

Briefings in bioinformatics·2026
Same journal

A knowledge-guided deep learning framework for quantitative nucleic acid testing.

Briefings in bioinformatics·2026
Same journal

Optimal transport for label transfer in single-cell multi-omics integration.

Briefings in bioinformatics·2026
Same journal

Continuous multi-omics pathway enrichment analysis resolves hidden functional heterogeneity.

Briefings in bioinformatics·2026
Same journal

Evaluating completeness, coherence, and consistency of genome-scale function annotations.

Briefings in bioinformatics·2026
Same journal

Transformers for single-cell RNA sequencing: a survey.

Briefings in bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 5, 2025

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
08:04

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

Published on: October 8, 2019

8.6K

Benchmarking recent computational tools for DNA-binding protein identification.

Xizi Luo1, Amadeus Song Yi Chi1, Andre Huikai Lin1

  • 1School of Computing, National University of Singapore, Singapore 119077, Singapore.

Briefings in Bioinformatics
|December 10, 2024
PubMed
Summary
This summary is machine-generated.

Identifying DNA-binding proteins (DBPs) is key for understanding gene regulation. This study benchmarks computational tools, reveals data leakage issues, and presents a consensus method for improved DBP identification.

Keywords:
BLASTCD-HITDNA-binding proteindeep learningmachine learningmotif

More Related Videos

Real-time Analysis of Transcription Factor Binding, Transcription, Translation, and Turnover to Display Global Events During Cellular Activation
12:54

Real-time Analysis of Transcription Factor Binding, Transcription, Translation, and Turnover to Display Global Events During Cellular Activation

Published on: March 7, 2018

13.5K
Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions
08:31

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions

Published on: December 1, 2020

4.9K

Related Experiment Videos

Last Updated: Jun 5, 2025

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling
08:04

DNA Sequence Recognition by DNA Primase Using High-Throughput Primase Profiling

Published on: October 8, 2019

8.6K
Real-time Analysis of Transcription Factor Binding, Transcription, Translation, and Turnover to Display Global Events During Cellular Activation
12:54

Real-time Analysis of Transcription Factor Binding, Transcription, Translation, and Turnover to Display Global Events During Cellular Activation

Published on: March 7, 2018

13.5K
Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions
08:31

Biosensor-based High Throughput Biopanning and Bioinformatics Analysis Strategy for the Global Validation of Drug-protein Interactions

Published on: December 1, 2020

4.9K

Area of Science:

  • Genomics
  • Bioinformatics
  • Molecular Biology

Background:

  • DNA-binding proteins (DBPs) are essential for fundamental cellular processes including gene regulation, DNA replication, and transcriptional control.
  • Accurate identification of DBPs is critical for comprehensive genome annotation and understanding biological mechanisms.

Purpose of the Study:

  • To conduct an unbiased benchmark of 11 state-of-the-art and traditional computational tools for DNA-binding protein identification.
  • To address and highlight the data leakage issue prevalent in conventional evaluation datasets.
  • To introduce novel evaluation datasets for more robust and reliable benchmarking of DBP identification tools.

Main Methods:

  • Performed an unbiased benchmarking of 11 advanced and traditional computational tools (e.g., ScanProsite, BLAST, HMMER) for DBP identification.
  • Introduced new, curated evaluation datasets to mitigate data leakage and ensure realistic performance assessment.
  • Developed a comprehensive evaluation pipeline to analyze model limitations, feature extraction, and training methodologies.

Main Results:

  • Identified significant data leakage in conventional datasets, leading to overestimated performance metrics.
  • Highlighted limitations in existing computational models, feature extraction techniques, and training strategies for DBP identification.
  • Demonstrated that a consensus method combining predictions from top computational tools and BLAST significantly improves DBP identification accuracy.

Conclusions:

  • The study provides a critical evaluation of current DBP identification tools and methodologies.
  • New datasets and a consensus approach offer a more reliable framework for DBP identification and future tool development.
  • User-friendly software implementing the consensus method is provided to facilitate DBP identification in genomic studies.