Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Gene Duplication and Divergence02:37

Gene Duplication and Divergence

The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are characterized.
DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
RNA-seq03:21

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while microarray-based...
Biostatistics: Overview01:20

Biostatistics: Overview

Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Mind the Gap: Human and AI Uncertainties in Cardiac MRI Segmentation.

Cardiovascular engineering and technology·2026
Same author

Gut pathogen Clostridium symbiosum rewires macrophage succinylation to drive enteric neuron loss in inflammatory bowel disease.

Cell host & microbe·2026
Same author

Multi-Channel Fusion Deep Wavelet Spectrum Network for Epileptic Signal Classification.

IEEE journal of biomedical and health informatics·2025
Same author

Nature-Inspired Meta-Heuristic Algorithms for Detecting Protein Complexes in Protein-Protein Interaction Networks: A Survey.

IEEE transactions on computational biology and bioinformatics·2025
Same author

Expert Consensus on Subxiphoid and Subcostal Arch Thoracoscopic Resection for the Treatment of Thymoma.

Thoracic cancer·2025
Same author

ASPM mediates nuclear entrapment of FOXM1 via liquid-liquid phase separation to promote progression of hepatocarcinoma.

Genome biology·2025
Same journal

circ2DGNN: circRNA-Disease Association Prediction via Transformer-Based Graph Neural Network.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Hierarchical Hypergraph Learning in Association- Weighted Heterogeneous Network for miRNA- Disease Association Identification.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Discriminative Domain Adaption Network for Simultaneously Removing Batch Effects and Annotating Cell Types in Single-Cell RNA-Seq.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

MLW-BFECF: A Multi-Weighted Dynamic Cascade Forest Based on Bilinear Feature Extraction for Predicting the Stage of Kidney Renal Clear Cell Carcinoma on Multi-Modal Gene Data.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

An End-to-End Knowledge Graph Fused Graph Neural Network for Accurate Protein-Protein Interactions Prediction.

IEEE/ACM transactions on computational biology and bioinformatics·2024
Same journal

Generative Biomedical Event Extraction With Constrained Decoding Strategy.

IEEE/ACM transactions on computational biology and bioinformatics·2024
See all related articles

Related Experiment Video

Updated: May 21, 2026

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Robust Bayesian clustering for replicated gene expression data.

Jianyong Sun1, Jonathan M Garibaldi, Kim Kenobi

  • 1Centre for Plant Integrative Biology (CPIB), School of Bioscience, The University of Nottingham, Sutton Bonington. j.sun@cpib.ac.uk

IEEE/ACM Transactions on Computational Biology and Bioinformatics
|May 30, 2012
PubMed
Summary
This summary is machine-generated.

This study introduces a robust Bayesian mixture model for clustering biological data with replicated measurements. The model accurately clusters data points and identifies outliers, improving outlier detection rates.

More Related Videos

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Related Experiment Videos

Last Updated: May 21, 2026

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes
05:07

Rup (RNA-seq Usability Assessment Pipeline) - Quality Control for Bulk RNA-seq Experiments in Eukaryotes

Published on: November 7, 2025

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress
05:22

Analyzing Multifactorial RNA-Seq Experiments with DiCoExpress

Published on: July 29, 2022

Area of Science:

  • Bioinformatics
  • Statistical modeling
  • Data analysis

Background:

  • Biological data often includes replicated measurements, which are correlated and require careful statistical handling.
  • Existing clustering methods may not adequately account for the correlation in replicated measurements, potentially affecting accuracy and outlier detection.

Purpose of the Study:

  • To develop a robust Bayesian mixture model for clustering datasets with replicated measurements.
  • To accurately cluster data points while considering measurement correlations and identifying potential outliers.
  • To improve outlier detection rates by incorporating replicated measurement information.

Main Methods:

  • A robust Bayesian mixture model was proposed for clustering data with replicated measurements.
  • A tree-structured variational Bayes (VB) algorithm was developed for efficient model fitting.
  • The model's performance was evaluated against the infinite Gaussian mixture model.

Main Results:

  • The proposed model demonstrated favorable comparisons with the infinite Gaussian mixture model, offering computational simplicity.
  • Including replicated measurements significantly improved outlier detection rates across various measurement uncertainty conditions.
  • The model successfully applied to clustering biological transcriptomics mRNA expression data sets.

Conclusions:

  • The robust Bayesian mixture model effectively handles correlated replicated measurements in biological data.
  • The model enhances both data clustering accuracy and outlier identification capabilities.
  • This approach offers a valuable tool for analyzing complex biological datasets, particularly transcriptomics data.