Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Modified Boxplots

Modified Boxplots

A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Reducing Line Loss

Reducing Line Loss

In a three-phase circuit, line loss is an indicator of energy dissipated as heat due to the resistance of transmission lines. To address this, incorporating transformers into the system—a step-up transformer at the source and a step-down transformer at the load—is a strategic solution. Two three-phase transformers are introduced to improve this.
With a step-up transformer at the source, the voltage is increased, thereby reducing the current in the transmission lines since power loss...

Skewness

Skewness

The measures of central tendency calculated from a data set may not reveal much about its intrinsic distribution. If a plot is made of the data set’s values, the mean and the median may not only differ, but also the plot may have more values on one side of the central tendencies. Such a data set is said to be skewed towards that side.
The longer the tail of the plot on one side, the more skewed it is. The skewness of a data set’s values suggests that the measures of central tendency...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Species-rich and genomically diverse: comparative genomics reveals how fusions, fissions, and sex chromosomes have shaped beetle evolution.

bioRxiv : the preprint server for biology·2026

Same author

scDIG: An R Shiny Application for Interactive Density-Based Gating of Single-Cell Proteomic and Transcriptomic Data.

bioRxiv : the preprint server for biology·2026

Same author

WayFindR: investigating feedback in biological pathways.

NAR genomics and bioinformatics·2026

Same author

Clustering Digestive Tract Tumors Using Transcriptomic and Mutation Data.

Cancers·2026

Same author

OpenScientist: evaluating an open agentic AI co-scientist to accelerate biomedical discovery.

medRxiv : the preprint server for health sciences·2026

Same author

Towards an AI biomedical scientist: Accelerating discoveries in neurodegenerative disease.

The journal of prevention of Alzheimer's disease·2025

Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026

Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026

Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026

Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026

Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026

Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 10, 2025

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Published on: May 10, 2024

SillyPutty: Improved clustering by optimizing the silhouette width.

Polina Bombina¹, Dwayne Tally², Zachary B Abrams³

¹Department of Biostatistics, Data Science, and Epidemiology, Georgia Cancer Center at Augusta University, Augusta, GA, USA.

Biorxiv : the Preprint Server for Biology

|November 21, 2023

Summary

This summary is machine-generated.

We developed SillyPutty, a novel unsupervised clustering method for biomedical science. It performs comparably to existing methods and excels when combined with hierarchical clustering for improved accuracy and speed.

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

Related Experiment Videos

Last Updated: Jul 10, 2025

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Published on: May 10, 2024

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

Area of Science:

Biomedical science
Computational biology
Data mining

Background:

Unsupervised clustering is crucial for analyzing complex biomedical datasets.
Existing clustering algorithms have limitations in accuracy and speed for certain applications.

Approach:

Developed SillyPutty, a new unsupervised clustering algorithm.
Generated synthetic datasets using the Umpire R package for rigorous testing.
Compared SillyPutty against established algorithms using metrics like Silhouette Width and Adjusted Rand Index.

Key Points:

SillyPutty demonstrates comparable accuracy to state-of-the-art clustering methods as a standalone tool.
The combination of hierarchical clustering and SillyPutty yields superior performance in both accuracy and computational efficiency.
Performance was evaluated using multiple established metrics for robust assessment.

Conclusions:

SillyPutty is a validated and effective method for unsupervised clustering in biomedical research.
Hierarchical clustering followed by SillyPutty offers an optimal approach for speed and accuracy.
This combined method provides a powerful new tool for biomedical data analysis.