Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

11.0K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
11.0K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

4.0K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
4.0K
Crystallographic Point Groups01:29

Crystallographic Point Groups

126
Crystallographic point groups represent the various symmetry operations that can occur within crystals. They are unique in that at least one point will always remain unchanged during these actions. For instance, consider the triclinic system. This system, devoid of any axis or plane of symmetry, aligns with the C1 and Ci point groups.where Cᵢ is characterized solely by a center of inversion.Contrastingly, the monoclinic system introduces an element of symmetry. This system with one plane...
126
Sampling Plans01:23

Sampling Plans

1.5K
Sampling is a crucial step in analytical chemistry, allowing researchers to collect representative data from a large population. Common sampling methods include random, judgmental, systematic, stratified, and cluster sampling.
Random sampling is a method where each member of the population has an equal chance of being selected for the sample. It involves selecting individuals randomly, often using random number generators or lottery-type methods. For example, when analyzing the properties of a...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Occupational therapy in pain management: An exploration and description of current UK practice.

The British journal of occupational therapy·2025
Same author

How prepared are newly qualified allied health professionals for practice in the UK? A systematic review.

BMJ open·2024
Same author

Automating Dynamic Consent Decisions for the Processing of Social Media Data in Health Research.

Journal of empirical research on human research ethics : JERHRE·2019
Same author

Artificial intelligence in pharmaceutical research and development.

Future medicinal chemistry·2018
Same author

Enzyme function and its evolution.

Current opinion in structural biology·2017
Same author

DichroMatch at the protein circular dichroism data bank (DM@PCDDB): A web-based tool for identifying protein nearest neighbors using circular dichroism spectroscopy.

Protein science : a publication of the Protein Society·2017
Same journal

2DKD: a toolkit for content-based local image search.

Source code for biology and medicine·2020
Same journal

Computing and graphing probability values of pearson distributions: a SAS/IML macro.

Source code for biology and medicine·2020
Same journal

iPBAvizu: a PyMOL plugin for an efficient 3D protein structure superimposition approach.

Source code for biology and medicine·2019
Same journal

Social support for collaboration and group awareness in life science research teams.

Source code for biology and medicine·2019
Same journal

MZPAQ: a FASTQ data compression tool.

Source code for biology and medicine·2019
Same journal

IPCAPS: an R package for iterative pruning to capture population structure.

Source code for biology and medicine·2019
See all related articles

Related Experiment Video

Updated: May 3, 2026

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

9.2K

PFClust: an optimised implementation of a parameter-free clustering algorithm.

Khadija Musayeva, Tristan Henderson, John Bo Mitchell

  • 1EaStCHEM School of Chemistry and Biomedical Sciences Research Complex, University of St Andrews, North Haugh, St Andrews, Scotland KY16 9ST, UK. lazaros.mavridis.lm@gmail.com.

Source Code for Biology and Medicine
|February 5, 2014
PubMed
Summary
This summary is machine-generated.

PFClust, a partitioning-based clustering algorithm, now offers an optimized implementation for faster processing of large datasets. This advancement allows for efficient discovery of clusters with arbitrary shapes, sizes, and densities.

More Related Videos

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

2.7K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.3K

Related Experiment Videos

Last Updated: May 3, 2026

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

9.2K
JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

2.7K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

6.3K

Area of Science:

  • Data Science
  • Computational Biology
  • Bioinformatics

Background:

  • Determining the optimal number of clusters is a significant challenge in cluster analysis.
  • Existing algorithms often struggle to automatically identify the inherent data structure.

Purpose of the Study:

  • To present a new, optimized implementation of the PFClust algorithm.
  • To enhance the speed and efficiency of the PFClust algorithm for processing large datasets.

Main Methods:

  • Development of an optimized implementation for the PFClust algorithm.
  • Testing the algorithm on diverse datasets to evaluate its performance.

Main Results:

  • The enhanced PFClust algorithm demonstrates improved speed for processing large datasets.
  • PFClust successfully identifies clusters of varying shapes, sizes, and densities.
  • Previous applications include clustering macromolecular structures and druglike compounds.

Conclusions:

  • The new implementation of PFClust offers a considerable speed improvement over the original version.
  • PFClust provides an effective solution for automatic determination of optimal cluster numbers.