Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

15.7K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
15.7K
Survival Tree01:19

Survival Tree

500
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
500
Aggregates Classification01:29

Aggregates Classification

1.2K
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
1.2K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

41.5K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
41.5K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

5.0K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
5.0K
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

48.7K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
48.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Immune-Inflammatory markers and heart failure incidence and mortality: a population-based longitudinal study.

Frontiers in cardiovascular medicine·2026
Same author

Hollow MnO<sub>2</sub> nanosystem for glioblastoma sono-immunotherapy based on blood-brain barrier crossing and oxygen supply strategy.

Journal of colloid and interface science·2026
Same author

Contrast-enhanced ultrasound enhancement-pattern classification for procedure selection and perioperative outcomes in cesarean scar pregnancy.

Frontiers in surgery·2026
Same author

Microemulsion flooding based on surfactant system: Recent advances and current challenges in enhanced oil recovery.

Advances in colloid and interface science·2026
Same author

Accelerated Reaction Exploration across Scales: A Hybrid Operando and Modeling Study of Oxidation Kinetics in Monolayer Tungsten Disulfide.

Journal of the American Chemical Society·2026
Same author

Discovery of novel IKKβ allosteric inhibitors as dual-antioxidant for cerebral ischemia-reperfusion injury therapy by upregulating HO-1 and SLC7A11.

Bioorganic chemistry·2026
Same journal

RETRACTION: Real-Time Modulation of Physical Training Intensity Based on Wavelet Recursive Fuzzy Neural Networks.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Multidimensional Heterogeneous Network Link Adaptation Based on Mobile Environment.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Framework to Segment and Evaluate Multiple Sclerosis Lesion in MRI Slices Using VGG-UNet.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Facial Emotion Recognition Using a Novel Fusion of Convolutional Neural Network and Local Binary Pattern in Crime Investigation.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Automatic Intelligent System Using Medical of Things for Multiple Sclerosis Detection.

Computational intelligence and neuroscience·2026
Same journal

RETRACTION: Intangible Cultural Heritage Reproduction and Revitalization: Value Feedback, Practice, and Exploration Based on the IPA Model.

Computational intelligence and neuroscience·2026
See all related articles

Related Experiment Video

Updated: Apr 15, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.5K

A fast clustering algorithm for data with a few labeled instances.

Jinfeng Yang1, Yong Xiao1, Jiabing Wang2

  • 1Electric Power Research Institute of Guangdong Power Grid Corporation, Guangzhou 510080, China.

Computational Intelligence and Neuroscience
|April 11, 2015
PubMed
Summary
This summary is machine-generated.

This study introduces a fast clustering algorithm and efficient metric learning for improved data analysis. The methods enhance clustering quality by maximizing the split-to-diameter ratio using labeled data.

More Related Videos

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

12.1K
JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

3.7K

Related Experiment Videos

Last Updated: Apr 15, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.5K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

12.1K
JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics
07:28

JUMPn: A Streamlined Application for Protein Co-Expression Clustering and Network Analysis in Proteomics

Published on: October 19, 2021

3.7K

Area of Science:

  • Data Science
  • Machine Learning
  • Clustering Algorithms

Background:

  • Clustering quality is often assessed using metrics like cluster diameter and split.
  • Existing metric learning algorithms can be computationally intensive, often relying on semidefinite programming.

Purpose of the Study:

  • To develop a fast clustering algorithm that finds optimal solutions under specific conditions.
  • To propose computationally efficient metric learning algorithms that maximize the split-to-diameter ratio (RSD).
  • To demonstrate the effectiveness of learned metrics and supervision in improving clustering quality.

Main Methods:

  • A novel clustering algorithm is presented, guaranteeing optimality for certain RSD values.
  • Metric learning is formulated as a linear programming problem, offering computational advantages over semidefinite programming.
  • Empirical evaluations are conducted to validate the proposed methods.

Main Results:

  • The proposed clustering algorithm achieves optimal solutions when the RSD of the optimal solution exceeds one.
  • The linear programming-based metric learning approach is computationally efficient.
  • Empirical results show significant improvements in clustering quality due to learned metrics and supervision.

Conclusions:

  • The developed clustering and metric learning techniques offer efficient and effective solutions for data analysis.
  • The learned metric and use of supervision demonstrably enhance the quality of clustering outcomes.
  • This work provides a computationally efficient alternative for metric learning in clustering tasks.