Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Bayesian basecalling for DNA sequence analysis using hidden Markov models.

Kuo-Ching Liang, Xiaodong Wang, Dimitris Anastassiou

    IEEE/ACM Transactions on Computational Biology and Bioinformatics
    |August 2, 2007
    PubMed
    Summary
    This summary is machine-generated.

    Related Concept Videos

    You might also read

    Related Articles

    Articles linked to this work by shared authors, journal, and citation graph.

    Sort by
    Same author

    Adipose Stromal Cell-Derived Cancer-Associated Fibroblasts Promote Pancreatic Adenocarcinoma Progression Through SFRP4 Signaling.

    Cancers·2026
    Same author

    The Role of Recruited Adipose Stromal Cells and Their Fibroblastic Differentiation in Cancer.

    Frontiers in bioscience (Landmark edition)·2025
    Same author

    Crowd-sourced benchmarking of single-sample tumor subclonal reconstruction.

    Nature biotechnology·2024
    Same author

    CASCC: a co-expression-assisted single-cell RNA-seq data clustering method.

    Bioinformatics (Oxford, England)·2024
    Same author

    The fibro-adipogenic progenitor APOD+DCN+LUM+ cell population in aggressive carcinomas.

    Cancer metastasis reviews·2024
    Same author

    Adipose Stromal Cell-Derived Cancer-Associated Fibroblasts Suppress FGFR Inhibitor Efficacy.

    Cancer research·2024
    Same journal

    circ2DGNN: circRNA-Disease Association Prediction via Transformer-Based Graph Neural Network.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    Same journal

    Hierarchical Hypergraph Learning in Association- Weighted Heterogeneous Network for miRNA- Disease Association Identification.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    Same journal

    Discriminative Domain Adaption Network for Simultaneously Removing Batch Effects and Annotating Cell Types in Single-Cell RNA-Seq.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    Same journal

    MLW-BFECF: A Multi-Weighted Dynamic Cascade Forest Based on Bilinear Feature Extraction for Predicting the Stage of Kidney Renal Clear Cell Carcinoma on Multi-Modal Gene Data.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    Same journal

    An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    Same journal

    Generative Biomedical Event Extraction With Constrained Decoding Strategy.

    IEEE/ACM transactions on computational biology and bioinformatics·2024
    See all related articles

    This study introduces a Bayesian approach using Markov chain Monte Carlo (MCMC) for DNA sequence basecalling. The MCMC method improves accuracy and reduces training data needs compared to traditional algorithms.

    Area of Science:

    • Genomics and Bioinformatics
    • Computational Biology
    • Statistical Modeling in Genetics

    Background:

    • Electropherograms of DNA sequences are commonly modeled using Hidden Markov Models (HMMs).
    • Basecalling, the process of determining DNA sequences from electropherograms, typically utilizes the Viterbi algorithm.
    • Accurate HMM parameter estimation is crucial and often requires a dedicated training step.

    Purpose of the Study:

    • To propose and evaluate a novel Bayesian approach for DNA sequence basecalling.
    • To leverage Markov Chain Monte Carlo (MCMC) methods for enhanced HMM parameter estimation.
    • To integrate prior biological knowledge and utilize both training and basecalling data for improved accuracy.

    Main Methods:

    • Development of a Bayesian framework for DNA sequence basecalling.

    Related Experiment Videos

  • Application of Markov Chain Monte Carlo (MCMC) sampling for parameter estimation.
  • Utilizing the Hidden Markov Model (HMM) framework for sequence data analysis.
  • Main Results:

    • The proposed Bayesian MCMC basecaller demonstrates superior performance compared to the state-of-the-art Viterbi algorithm.
    • The MCMC approach significantly reduces total errors in DNA sequence basecalling.
    • This method requires substantially less training data than other statistical basecalling techniques.

    Conclusions:

    • Bayesian inference with MCMC offers a more accurate and data-efficient approach to DNA sequence basecalling.
    • The method effectively incorporates prior biological information, enhancing HMM parameter estimation.
    • This approach represents a significant advancement for genomic sequence analysis, as validated on the Legionella pneumophila genome.