Robust clustering using exponential power mixtures.

Jian Zhang¹, Faming Liang

¹Department of Mathematics, University of York, Heslington, York, UK. jz538@york.ac.uk

February 19, 2010

Summary

This summary is machine-generated.

This study introduces a robust clustering method for gene expression data based on exponential power mixtures. It outperforms traditional methods such as Gaussian mixture and k-means when the data show complex correlations or non-Gaussian patterns.

Area of Science:

Bioinformatics
Statistical genomics
Computational biology

Background:

Clustering gene expression data is crucial for discovering biological insights.
Conventional methods struggle with inherent gene correlations and non-normal distributions.
Existing methods such as Gaussian mixture (GM), k-means (KM), and partitioning around medoids (PAM) lack robustness to such dependence and non-normality.

Purpose of the Study:

To develop a more robust clustering method for gene expression data.
To address challenges posed by general dependence and non-normality in biological data.
To improve information extraction from complex gene expression datasets.

Main Methods:

Utilized the exponential power mixture model for enhanced robustness.
Developed an expectation-conditional maximization algorithm for parameter estimation.
Employed the Bayesian information criterion to determine the optimal number of mixture components.
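
The ingredients listed above can be illustrated with a minimal Python sketch. It is not the authors' implementation: it assumes a univariate exponential power density with location `mu`, scale `sigma`, and shape `beta`, which may differ from the parameterization used in the paper, and it omits the expectation-conditional maximization fitting step entirely. The function names (`ep_logpdf`, `mixture_loglik`, `bic`) and the parameter count of 4K - 1 are illustrative assumptions tied to this univariate setup.

```python
import math
import numpy as np


def ep_logpdf(x, mu, sigma, beta):
    """Log-density of a univariate exponential power distribution.

    Assumed form: f(x) = beta / (2 * sigma * Gamma(1/beta))
                         * exp(-(|x - mu| / sigma) ** beta).
    beta = 2 gives a Gaussian shape, beta = 1 a Laplace shape.
    """
    x = np.asarray(x, dtype=float)
    log_norm = math.log(beta) - math.log(2.0 * sigma) - math.lgamma(1.0 / beta)
    return log_norm - (np.abs(x - mu) / sigma) ** beta


def mixture_loglik(x, weights, mus, sigmas, betas):
    """Log-likelihood of a K-component exponential power mixture."""
    comp = np.stack([
        math.log(w) + ep_logpdf(x, m, s, b)
        for w, m, s, b in zip(weights, mus, sigmas, betas)
    ])  # shape (K, n): weighted log-density of each component at each point
    # Log-sum-exp over components for numerical stability.
    top = comp.max(axis=0)
    return float(np.sum(top + np.log(np.exp(comp - top).sum(axis=0))))


def bic(loglik, n_components, n_obs):
    """BIC = -2 log L + p log n; lower is better.

    Assumes p = 4K - 1 free parameters in this univariate sketch:
    K locations, K scales, K shapes, and K - 1 free mixing weights.
    """
    n_params = 4 * n_components - 1
    return -2.0 * loglik + n_params * math.log(n_obs)


# Illustration with hand-picked (not fitted) parameters.
x = np.array([-2.1, -1.9, -2.0, 2.0, 2.2, 1.8])
ll = mixture_loglik(x, [0.5, 0.5], [-2.0, 2.0], [0.5, 0.5], [1.5, 1.5])
score = bic(ll, n_components=2, n_obs=x.size)
```

In practice the BIC is evaluated at the fitted maximum likelihood estimates for each candidate number of components, and the number of components with the lowest BIC is selected; the hand-picked parameters here only show how the pieces fit together.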

Main Results:

The proposed exponential power mixture model demonstrates increased robustness against data dependence and non-normality.
Maximum likelihood estimators (MLEs) are proven consistent under sparse dependence.
Numerical results show superior performance compared to GM, KM, and PAM in correlated or non-Gaussian data scenarios.

Conclusions:

The exponential power mixture model offers a more reliable approach for clustering gene expression data with complex structures.
This method enhances the accuracy of information extraction from biological datasets.
The developed algorithm provides a powerful tool for genomic data analysis.