Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Integrating genomic data to predict transcription factor binding.

Dustin T Holloway1, Mark Kon, Charles DeLisi

  • 1Molecular Biology Cell Biology and Biochemistry, Boston University, Boston, MA 02215, USA. dth128@bu.edu

Genome Informatics. International Conference on Genome Informatics
|December 20, 2005
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Complex analyticity of the nonlinear Poisson-Boltzmann equation for the interface problem with random domains.

Numerische mathematik·2026
Same author

Analytic regularity of strong solutions for the complexified stochastic nonlinear Poisson-Boltzmann Equation.

Computers & mathematics with applications (Oxford, England : 1987)·2026
Same author

Uncertainty quantification of receptor ligand binding sites prediction.

ArXiv·2024
Same author

An Agrigenomics Trifecta: Greenhouse Gas Drawdown, Food Security, and New Drugs.

Cold Spring Harbor perspectives in biology·2023
Same author

The Role of Synthetic Biology in Atmospheric Greenhouse Gas Reduction: Prospects and Challenges.

Biodesign research·2023
Same author

Inferring microbial co-occurrence networks from amplicon data: a systematic evaluation.

mSystems·2023
Same journal

Linear regression models predicting strength of transcriptional activity of promoters.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Sign: large-scale gene network estimation environment for high performance computing.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Docking-calculation-based method for predicting protein-RNA interactions.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Mechanism of cell cycle disruption by multiple p53 pulses.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

Database for crude drugs and Kampo medicine.

Genome informatics. International Conference on Genome Informatics·2012
Same journal

A dynamic programming algorithm to predict synthesis processes of tree-structured compounds with graph grammar.

Genome informatics. International Conference on Genome Informatics·2011
See all related articles

This study improves transcription factor binding site (TFBS) prediction accuracy in yeast by integrating genomic data. Support vector machines combined with multiple data types achieve nearly 80% accuracy, reducing false positives from traditional methods.

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Transcription factor binding sites (TFBS) are crucial for gene regulation.
  • Position specific scoring matrices (PSSMs) are commonly used for TFBS prediction but yield many false positives.

Purpose of the Study:

  • To reduce false positive TFBS predictions.
  • To enhance TFBS prediction accuracy by integrating diverse genomic data.
  • To compare Bayesian allocation and support vector machine (SVM) classification methods.

Main Methods:

  • Integration of genomic data including binding site degeneracy, conservation, phylogenetic profiling, TF binding site clustering, gene expression, GO functional annotation, and k-mer counts.
  • Application of Bayesian allocation and support vector machine (SVM) classification.

Related Experiment Videos

  • Validation against ChIP-chip data, Transfac, and Saccharomyces Genome Database.
  • Main Results:

    • SVM classification integrating all genomic data achieved the highest accuracy.
    • SVM identified key contributing data types for accurate TFBS classification.
    • Support vector machine achieved an average classification accuracy of nearly 80% with high sensitivity.

    Conclusions:

    • Integrating multiple genomic datasets significantly improves TFBS prediction accuracy.
    • Support vector machines are effective tools for TFBS prediction, outperforming traditional methods.
    • This approach offers a more reliable method for identifying true transcription factor binding sites.