Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Choosing Between z and t Distribution01:25

Choosing Between z and t Distribution

2.9K
The z and the Student t distribution estimate the population mean using the sample mean and standard deviation. However, to decide which distribution to use for a calculation, one needs to determine the sample size, the nature of the distribution, and whether the population standard deviation is known. If the population standard deviation is known and the population is normally distributed, or if the sample size is greater than 30, the z distribution is preferred. The Student t distribution is...
2.9K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

713
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
713
Survival Tree01:19

Survival Tree

498
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
498
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.5K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.5K
Introduction to Nonparametric Statistics01:28

Introduction to Nonparametric Statistics

1.5K
Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...
1.5K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

435
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
435

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The p38α MAPK Deletion in Oligodendroglia does not Attenuate Myelination Defects in a Mouse Model of Periventricular Leukomalacia.

Neuroscience·2018
Same author

Mechanisms of enhanced antiglioma efficacy of polysorbate 80-modified paclitaxel-loaded PLGA nanoparticles by focused ultrasound.

Journal of cellular and molecular medicine·2018
Same author

Cloning, expression and characterisation of a cysteine protease from Trichinella spiralis.

Folia parasitologica·2018
Same author

Establishment of a Human iPSC- and Nanofiber-Based Microphysiological Blood-Brain Barrier System.

ACS applied materials & interfaces·2018
Same author

Improved design and analysis of CRISPR knockout screens.

Bioinformatics (Oxford, England)·2018
Same author

Boron doping induced thermal conductivity enhancement of water-based 3C-Si(B)C nanofluids.

Nanotechnology·2018
Same journal

Deblurring structural edges in variable thickness topology optimization via density-gradient-informed projection.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2026
Same journal

Reinforcement learning-based control co-design of digital twin-enabled full-vehicle active suspension systems.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2026
Same journal

A novel multi-thickness topology optimization method for balancing structural performance and manufacturability.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2026
Same journal

Adjoint-based PDE-constrained optimization of viscoelastic floating membrane for maximum wave power absorption.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2026
Same journal

Differentiable modelling and optimization of multi-planar slicing for multi-axis additive manufacturing.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2026
Same journal

Geometrically nonlinear high-fidelity aerostructural optimization for highly flexible wings.

Structural and multidisciplinary optimization : journal of the International Society for Structural and Multidisciplinary Optimization·2025
See all related articles

Related Experiment Video

Updated: Apr 23, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K

Optimal SVM parameter selection for non-separable and unbalanced datasets.

Peng Jiang1, Samy Missoum1, Zhao Chen2

  • 1Aerospace and Mechanical Engineering Department, University of Arizona, Tucson, Arizona.

Structural and Multidisciplinary Optimization : Journal of the International Society for Structural and Multidisciplinary Optimization
|September 27, 2014
PubMed
Summary
This summary is machine-generated.

This study evaluates Area Under the ROC Curve (AUC), accuracy, and balanced accuracy for Support Vector Machine (SVM) classification on challenging real-world datasets. Findings guide optimal metric selection for non-separable, unbalanced data in clinical applications.

Keywords:
Cross validationNon-separable and unbalanced datasetsSupport vector machinesValidation metrics

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.0K

Related Experiment Videos

Last Updated: Apr 23, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.0K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.0K

Area of Science:

  • Machine Learning
  • Biomedical Data Analysis
  • Computational Biology

Background:

  • Support Vector Machine (SVM) classifiers are crucial for data analysis, but their optimal parameter selection is challenging with non-separable and unbalanced datasets, common in experimental and clinical settings.
  • Validation metrics like Area Under the ROC Curve (AUC), accuracy, and balanced accuracy are used to assess classifier performance, yet their effectiveness varies with data characteristics.

Purpose of the Study:

  • To investigate and compare the performance of AUC, accuracy, and balanced accuracy as validation metrics for Support Vector Machine (SVM) parameter selection.
  • To evaluate these metrics in the context of non-separable and unbalanced datasets, simulating real-world experimental or clinical data challenges.

Main Methods:

  • Utilized computational data to create fully separable datasets, then projected them into lower dimensional subspaces to generate representative non-separable datasets.
  • Introduced a 'weighted likelihood' metric, leveraging knowledge of the separable dataset, to quantitatively compare the performance of AUC, accuracy, and balanced accuracy.
  • Applied the methods to a hip fracture prediction classification model using data from a parameterized finite element model of a femur.

Main Results:

  • The study analyzed the performance of AUC, accuracy, and balanced accuracy across varying levels of data separability, unbalance ratios, and training set sizes.
  • Demonstrated how the choice of validation metric impacts the selection of optimal Support Vector Machine (SVM) parameters, particularly for difficult datasets.
  • The 'weighted likelihood' provided a reference for evaluating metric performance in simulated real-world scenarios.

Conclusions:

  • The effectiveness of validation metrics (AUC, accuracy, balanced accuracy) is highly dependent on dataset characteristics, especially separability and class balance.
  • Careful selection of validation metrics is critical for accurate Support Vector Machine (SVM) model optimization in clinical and experimental applications.
  • This research offers insights into robust metric selection for Support Vector Machine (SVM) classifiers when dealing with imperfect, real-world data.