Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Extraction: Partition and Distribution Coefficients01:14

Extraction: Partition and Distribution Coefficients

The distribution law or Nernst's distribution law is the law that governs the distribution of a solute between two immiscible solvents. This law, also known as the partition law, states that if a solute is added to the mixture of two immiscible solvents at a constant temperature, the solute is distributed between the two solvents in such a way that the ratio of solute concentrations in the solvents remains constant at equilibrium.
For extracting a solute from an aqueous phase into an organic...
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).
Binomial Probability Distribution01:15

Binomial Probability Distribution

A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...
¹H NMR: Complex Splitting01:13

¹H NMR: Complex Splitting

A proton M that is coupled to a proton X results in doublet signals for M. However, NMR-active nuclei can be simultaneously coupled to more than one nonequivalent nucleus. When M is coupled to a second proton A, such as in styrene oxide, each peak in the doublet is split into another doublet.
Splitting diagrams or splitting tree diagrams are routinely used to depict such complex couplings. While drawing splitting diagrams, the splitting with the larger coupling constant is usually applied first.
Probability Histograms01:17

Probability Histograms

A probability histogram is a visual representation of a probability distribution. Similar a typical histogram, the probability histogram consists of contiguous (adjoining) boxes. It has both a horizontal axis and a vertical axis. The horizontal axis is labeled with what the data represents. The vertical axis is labeled with probability. Each rectangular bar in the histogram is 1 unit wide, which suggests that the area under each bar equals the probability, P(x), where x is 1, 2, 3, and so on.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Fast Value Tracking for Deep Reinforcement Learning.

... International Conference on Learning Representationsยท2026
Same author

Causal-StoNet: Causal Inference for High-Dimensional Complex Data.

... International Conference on Learning Representationsยท2026
Same author

Conformal Prediction in Clinical Artificial Intelligence: Enhancing Model Reliability and Interpretability.

Chestยท2026
Same author

Magnitude Pruning of Large Pretrained Transformer Models with a Mixture Gaussian Prior.

Journal of data science : JDSยท2025
Same author

Extended fiducial inference for individual treatment effects via deep neural networks.

Statistics and computingยท2025
Same author

A New Paradigm for Generative Adversarial Networks based on Randomized Decision Rules.

Statistica Sinicaยท2025
Same journal

Fast penalized generalized estimating equationsย for large longitudinal functional datasets.

Biometricsยท2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometricsยท2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometricsยท2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometricsยท2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometricsยท2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometricsยท2026
See all related articles

Related Experiment Video

Updated: Jun 16, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Robust clustering using exponential power mixtures.

Jian Zhang1, Faming Liang

  • 1Department of Mathematics, University of York, Heslington, York, UK. jz538@york.ac.uk

Biometrics
|February 19, 2010
PubMed
Summary
This summary is machine-generated.

This study introduces a robust clustering method for gene expression data, outperforming traditional models like Gaussian mixture and k-means when data shows complex correlations or non-Gaussian patterns.

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Related Experiment Videos

Last Updated: Jun 16, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Area of Science:

  • Bioinformatics
  • Statistical genomics
  • Computational biology

Background:

  • Clustering gene expression data is crucial for discovering biological insights.
  • Conventional methods struggle with inherent gene correlations and non-normal distributions.
  • Existing models like Gaussian mixture (GM), k-means (KM), and partitioning around medoids (PAM) lack robustness.

Purpose of the Study:

  • To develop a more robust clustering method for gene expression data.
  • To address challenges posed by general dependence and non-normality in biological data.
  • To improve information extraction from complex gene expression datasets.

Main Methods:

  • Utilized the exponential power mixture model for enhanced robustness.
  • Developed an expectation-conditional maximization algorithm for parameter estimation.
  • Employed the Bayesian information criterion to determine the optimal number of mixture components.

Main Results:

  • The proposed exponential power mixture model demonstrates increased robustness against data dependence and non-normality.
  • Maximum likelihood estimators (MLEs) are proven consistent under sparse dependence.
  • Numerical results show superior performance compared to GM, KM, and PAM in correlated or non-Gaussian data scenarios.

Conclusions:

  • The exponential power mixture model offers a more reliable approach for clustering gene expression data with complex structures.
  • This method enhances the accuracy of information extraction from biological datasets.
  • The developed algorithm provides a powerful tool for genomic data analysis.