Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

11.9K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.9K
Modified Boxplots00:57

Modified Boxplots

9.8K
A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...
9.8K
Improving Translational Accuracy02:07

Improving Translational Accuracy

11.2K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
11.2K
Reducing Line Loss01:18

Reducing Line Loss

155
In a three-phase circuit, line loss is an indicator of energy dissipated as heat due to the resistance of transmission lines. To address this, incorporating transformers into the system—a step-up transformer at the source and a step-down transformer at the load—is a strategic solution. Two three-phase transformers are introduced to improve this.
With a step-up transformer at the source, the voltage is increased, thereby reducing the current in the transmission lines since power loss...
155
Skewness01:06

Skewness

11.3K
The measures of central tendency calculated from a data set may not reveal much about its intrinsic distribution. If a plot is made of the data set’s values, the mean and the median may not only differ, but also the plot may have more values on one side of the central tendencies. Such a data set is said to be skewed towards that side.
The longer the tail of the plot on one side, the more skewed it is. The skewness of a data set’s values suggests that the measures of central tendency...
11.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Species-rich and genomically diverse: comparative genomics reveals how fusions, fissions, and sex chromosomes have shaped beetle evolution.

bioRxiv : the preprint server for biology·2026
Same author

scDIG: An R Shiny Application for Interactive Density-Based Gating of Single-Cell Proteomic and Transcriptomic Data.

bioRxiv : the preprint server for biology·2026
Same author

WayFindR: investigating feedback in biological pathways.

NAR genomics and bioinformatics·2026
Same author

Clustering Digestive Tract Tumors Using Transcriptomic and Mutation Data.

Cancers·2026
Same author

OpenScientist: evaluating an open agentic AI co-scientist to accelerate biomedical discovery.

medRxiv : the preprint server for health sciences·2026
Same author

Towards an AI biomedical scientist: Accelerating discoveries in neurodegenerative disease.

The journal of prevention of Alzheimer's disease·2025
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jul 10, 2025

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency
06:41

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Published on: May 10, 2024

1.7K

SillyPutty: Improved clustering by optimizing the silhouette width.

Polina Bombina1, Dwayne Tally2, Zachary B Abrams3

  • 1Department of Biostatistics, Data Science, and Epidemiology, Georgia Cancer Center at Augusta University, Augusta, GA, USA.

Biorxiv : the Preprint Server for Biology
|November 21, 2023
PubMed
Summary
This summary is machine-generated.

We developed SillyPutty, a novel unsupervised clustering method for biomedical science. It performs comparably to existing methods and excels when combined with hierarchical clustering for improved accuracy and speed.

More Related Videos

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.0K
High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE
13:28

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

50.4K

Related Experiment Videos

Last Updated: Jul 10, 2025

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency
06:41

Author Spotlight: Optimizing Cryo-EM Analysis with CryoSieve for Enhanced Particle Selection Efficiency

Published on: May 10, 2024

1.7K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.0K
High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE
13:28

High-resolution Single Particle Analysis from Electron Cryo-microscopy Images Using SPHIRE

Published on: May 16, 2017

50.4K

Area of Science:

  • Biomedical science
  • Computational biology
  • Data mining

Background:

  • Unsupervised clustering is crucial for analyzing complex biomedical datasets.
  • Existing clustering algorithms have limitations in accuracy and speed for certain applications.

Approach:

  • Developed SillyPutty, a new unsupervised clustering algorithm.
  • Generated synthetic datasets using the Umpire R package for rigorous testing.
  • Compared SillyPutty against established algorithms using metrics like Silhouette Width and Adjusted Rand Index.

Key Points:

  • SillyPutty demonstrates comparable accuracy to state-of-the-art clustering methods as a standalone tool.
  • The combination of hierarchical clustering and SillyPutty yields superior performance in both accuracy and computational efficiency.
  • Performance was evaluated using multiple established metrics for robust assessment.

Conclusions:

  • SillyPutty is a validated and effective method for unsupervised clustering in biomedical research.
  • Hierarchical clustering followed by SillyPutty offers an optimal approach for speed and accuracy.
  • This combined method provides a powerful new tool for biomedical data analysis.