Search research articles

Related Concept Videos

Sampling Plans

Sampling Plans

Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA: Equal Sample Sizes

One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...

One-Way ANOVA: Unequal Sample Sizes

One-Way ANOVA: Unequal Sample Sizes

One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:

Friedman Two-way Analysis of Variance by Ranks

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...

Variability: Analysis

Variability: Analysis

Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Practical implementation of AI in a non-academic, non-commercial Pathology laboratory: Real world experience and lessons learned.

Histopathology·2025

Same author

Comparing Reverse Complementary Genomic Words Based on Their Distance Distributions and Frequencies.

Interdisciplinary sciences, computational life sciences·2017

Same author

Exploiting Multiple Descriptor Sets in QSAR Studies.

Journal of chemical information and modeling·2016

Same author

A natural robustification of the ordinary instrumental variables estimator.

Biometrics·2013

Same author

Population density and feeding duration of cabbage looper larvae on tomato plants alter the levels of plant volatile emissions.

Pest management science·2011

Same author

Herbivore-induced plant volatiles allow detection of Trichoplusia ni (Lepidoptera: Noctuidae) infestation on greenhouse tomato plants.

Pest management science·2010

Same journal

Biomedical Concept Recognition with Error-aware Negative-enhanced Ranking Framework.

Bioinformatics (Oxford, England)·2026

Same journal

TEDLH: Domain HMMs for sensitive detection of remote homologues.

Bioinformatics (Oxford, England)·2026

Same journal

PLNFGL: Joint Estimation of Multi-Condition Gene Networks from Single-cell RNA-seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

MCFST: Spatial domain identification method based on multi-view graph convolutional network and graph fusion network.

Bioinformatics (Oxford, England)·2026

Same journal

SpaBiT: Enhancing Spatial Transcriptomics Resolution via Bidirectional Attention Transformers.

Bioinformatics (Oxford, England)·2026

Same journal

EDEL: Enhancing Dense Retrievers for Curation of Biomedical Knowledge Bases.

Bioinformatics (Oxford, England)·2026

See all related articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Search research articles

Related Experiment Video

Updated: Dec 24, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Pooled variable scaling for cluster analysis.

Jakob Raymaekers¹, Ruben H Zamar²

¹Department of Mathematics, KU Leuven, Leuven 3001, Belgium.

Bioinformatics (Oxford, England)

|April 14, 2020

Summary

This summary is machine-generated.

A new pooled variance-based scaling method improves cluster analysis by maintaining variable importance. This safe and efficient approach is crucial for bioinformatics and medical research, especially for high-dimensional genomic data.

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

Related Experiment Videos

Last Updated: Dec 24, 2025

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

Area of Science:

Bioinformatics
Medical Sciences Research
Cluster Analysis

Background:

Clustering methods often lack scale invariance due to Euclidean distances.
Scale-invariant methods can lose invariance with regularization or variable selection, making results sensitive to measurement units.

Purpose of the Study:

To develop a safe and efficient scaling procedure for cluster analysis.
To address the sensitivity of clustering results to measurement units in bioinformatics and medical research.

Main Methods:

Proposed a novel scaling approach based on pooled variance prior to cluster analysis.
Evaluated the method through extensive simulations and real-data examples, including a high-dimensional genomic dataset.

Main Results:

The proposed scaling method avoids dampening informative variables, unlike standard deviation or range scaling.
Demonstrated the safety and general utility of the new scaling approach.
Successfully applied the method to cluster gene expression data from breast cancer cell tissues.

Conclusions:

The pooled variance-based scaling method offers a robust solution for scale-invariant cluster analysis.
This approach is particularly beneficial for high-dimensional data in bioinformatics and medical research.
An R implementation is available for practical application.