Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

47.3K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
47.3K
Extraction: Partition and Distribution Coefficients01:14

Extraction: Partition and Distribution Coefficients

5.4K
The distribution law or Nernst's distribution law is the law that governs the distribution of a solute between two immiscible solvents. This law, also known as the partition law, states that if a solute is added to the mixture of two immiscible solvents at a constant temperature, the solute is distributed between the two solvents in such a way that the ratio of solute concentrations in the solvents remains constant at equilibrium.
For extracting a solute from an aqueous phase into an...
5.4K
Survival Tree01:19

Survival Tree

462
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
462
Aggregates Classification01:29

Aggregates Classification

1.1K
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
1.1K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

40.3K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
40.3K
Cluster Sampling Method01:20

Cluster Sampling Method

15.4K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
15.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Multidimensional feature tuning in category-selective areas of human visual cortex.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same author

Fixation duration on natural scenes is explained by memory encoding not processing demand.

Nature neuroscience·2026
Same author

Determinants of visual ambiguity resolution.

Communications psychology·2026
Same author

Dynamic Representation of Multidimensional Object Properties in the Human Brain.

The Journal of neuroscience : the official journal of the Society for Neuroscience·2026
Same author

Drawings of THINGS: A large-scale drawing dataset of 1854 object concepts.

Behavior research methods·2026
Same author

CNeuroMod-THINGS, a densely-sampled fMRI dataset for visual neuroscience.

Scientific data·2026
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Mar 15, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K

An Efficient Data Partitioning to Improve Classification Performance While Keeping Parameters Interpretable.

Kristjan Korjus1, Martin N Hebart2, Raul Vicente1

  • 1Computational Neuroscience Lab, Institute of Computer Science, University of Tartu, Tartu, Estonia.

Plos One
|August 27, 2016
PubMed
Summary
This summary is machine-generated.

This study introduces "Cross-validation and cross-testing," a novel machine learning method that re-uses test data to improve classifier performance and parameter selection, especially with limited datasets. The approach enhances discovery probability while maintaining statistical validity and parameter interpretability.

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.2K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K

Related Experiment Videos

Last Updated: Mar 15, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

8.1K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.2K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.4K

Area of Science:

  • Machine Learning
  • Computational Neuroscience
  • Statistical Modeling

Background:

  • Supervised machine learning commonly uses separate data splits for training, validation, and testing.
  • Cross-validation is standard for parameter tuning but can be inefficient with limited data.
  • Existing methods face a trade-off between statistical power and model optimization when data is scarce.

Purpose of the Study:

  • To introduce a novel machine learning approach called "Cross-validation and cross-testing" (CVCT).
  • To improve the trade-off between generalization performance estimation and model fitting with limited data.
  • To validate the CVCT approach using simulated and real-world electrophysiological data.

Main Methods:

  • Implementation of the proposed "Cross-validation and cross-testing" (CVCT) method.
  • Validation using simulated datasets.
  • Application to human and rodent electrophysiological recordings.

Main Results:

  • The CVCT approach demonstrated a higher probability of discovering significant results compared to standard cross-validation and testing.
  • The method maintained the nominal alpha level, ensuring statistical validity.
  • CVCT preserves parameter interpretability, unlike nested cross-validation.

Conclusions:

  • "Cross-validation and cross-testing" offers an improved strategy for machine learning with limited data.
  • The approach enhances statistical power without compromising parameter interpretability.
  • CVCT is particularly beneficial when model parameter interpretability is crucial.