Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Probability Distributions01:32

Probability Distributions

12.3K
 The probability of a random variable x  is the likelihood of its occurrence. A probability distribution represents the probabilities of a random variable using a formula, graph, or table. There are two types of probability distribution– discrete probability distribution and continuous probability distribution.
A discrete probability distribution is a probability distribution of discrete random variables. It can be categorized into binomial probability distribution and Poisson...
12.3K
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

5.2K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
5.2K
Cluster Sampling Method01:20

Cluster Sampling Method

15.0K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
15.0K
Probability Histograms01:17

Probability Histograms

13.3K
A probability histogram is a visual representation of a probability distribution. Similar a typical histogram, the probability histogram consists of contiguous (adjoining) boxes. It has both a horizontal axis and a vertical axis. The horizontal axis is labeled with what the data represents. The vertical axis is labeled with probability. Each rectangular bar in the histogram is 1 unit wide, which suggests that the area under each bar equals the probability, P(x), where x is 1, 2, 3, and so on.
13.3K
Binomial Probability Distribution01:15

Binomial Probability Distribution

16.0K
A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...
16.0K
Relative Frequency Distribution00:55

Relative Frequency Distribution

13.9K
A relative frequency distribution is the proportion or fraction of times a value occurs in a data set. To find the relative frequencies, one can divide each frequency by the total number of data points in the sample. It is very similar to a regular frequency distribution, except that instead of reporting how many data values fall in a class, a relative frequency distribution reports the fraction of data values that fall in a class. These fractions or proportions are called relative frequencies...
13.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Shorebird loss increases soil CO<sub>2</sub> emissions in coastal wetlands under restoration.

Fundamental research·2026
Same author

Evidence for an Electronically Driven Charge Density Wave in a 1D Metallic MOF.

ACS central science·2026
Same author

Redox-Active Doubly Boron-Doped Indenofluorenes: Isomer-Directed Access to Open- and Closed-Shell Electronic States Induces Strong Near-Infrared Activity.

Journal of the American Chemical Society·2026
Same author

Mammalian-like steroidogenesis in plants gives rise to endocrine-mimetic cardenolides.

Science advances·2026
Same author

Sterol trafficking in yeast studied by one- and two-photon live-cell imaging of an intrinsically fluorescent ergosterol analog.

Methods and applications in fluorescence·2026
Same author

The risk of psychosis associated with cannabis use by people with attention-deficit hyperactivity disorder: A systematic review.

Australasian psychiatry : bulletin of Royal Australian and New Zealand College of Psychiatrists·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Feb 22, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K

Clustering distributions with the marginalized nested Dirichlet process.

Daiane Aparecida Zuanetti1, Peter Müller2, Yitan Zhu3

  • 1Department of Statistics, UFSCar, São Carlos, Brazil.

Biometrics
|September 30, 2017
PubMed
Summary
This summary is machine-generated.

We developed a new clustering method using a marginal nested Dirichlet process for analyzing gene-gene interactions. This approach offers exact inference for clustering genes based on interaction patterns, outperforming existing methods.

Keywords:
Clustering distributionsGene interactionsNested Dirichlet processTCGAZodiac

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

3.0K
Conditions Affecting Social Space in Drosophila melanogaster
08:04

Conditions Affecting Social Space in Drosophila melanogaster

Published on: November 5, 2015

12.9K

Related Experiment Videos

Last Updated: Feb 22, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

3.0K
Conditions Affecting Social Space in Drosophila melanogaster
08:04

Conditions Affecting Social Space in Drosophila melanogaster

Published on: November 5, 2015

12.9K

Area of Science:

  • Computational biology
  • Statistical modeling
  • Bioinformatics

Background:

  • Clustering gene expression data is crucial for understanding biological pathways.
  • Existing methods for clustering distributions or gene-gene interaction patterns have limitations.
  • Accurate inference of gene relationships is essential for disease research.

Purpose of the Study:

  • To introduce a novel marginal nested Dirichlet process for clustering distributions and histograms.
  • To apply this model for clustering genes based on gene-gene interaction patterns.
  • To enable simulation-exact inference for improved clustering accuracy.

Main Methods:

  • Developed a marginal version of the nested Dirichlet process.
  • Applied the model to cluster genes using gene-gene interaction coefficients from the Zodiac database (TCGA data).
  • Adjusted for covariates including copy number variation, methylation, and protein activation in an auto-logistic model.

Main Results:

  • The proposed model allows for simulation-exact inference, unlike truncated approximations.
  • Demonstrated favorable performance compared to k-means, truncated NDP, and hierarchical clustering.
  • Successfully clustered genes related to DNA mismatch repair (DMR) based on interaction distributions.

Conclusions:

  • The marginal nested Dirichlet process provides an effective and accurate method for clustering distributions, particularly for gene-gene interaction data.
  • This approach offers a significant improvement over existing clustering techniques for biological data analysis.
  • The model's ability to perform exact inference enhances its reliability in complex biological studies.