Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Randomized Experiments01:13

Randomized Experiments

8.2K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.2K
Random Variables01:09

Random Variables

15.1K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
15.1K
Random Sampling Method01:09

Random Sampling Method

12.8K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
12.8K
Group Design02:01

Group Design

9.8K
The most basic experimental design involves two groups: the experimental group and the control group. The two groups are designed to be the same except for one difference— experimental manipulation. The experimental group gets the experimental manipulation—that is, the treatment or variable being tested—and the control group does not. Since experimental manipulation is the only difference between the experimental and control groups, we can be sure that any differences between...
9.8K
Wald-Wolfowitz Runs Test I01:17

Wald-Wolfowitz Runs Test I

756
The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...
756
Random Error01:04

Random Error

3.6K
Random or indeterminate errors originate from various uncontrollable variables, such as variations in environmental conditions, instrument imperfections, or the inherent variability of the phenomena being measured. Usually, these errors cannot be predicted, estimated, or characterized because their direction and magnitude often vary in magnitude and direction even during consecutive measurements. As a result, they are difficult to eliminate. However, the aggregate effect of these errors can be...
3.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Exploring memory effects: Sparse identification in vector-borne diseases.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Dynamics of infectious disease spread between transportation hubs and surrounding communities.

Infectious Disease Modelling·2026
Same author

An agent-based platform for simulating the impact of chlamydia vaccine in the US population.

Nature communications·2026
Same author

Lipid metabolism dysregulation in rheumatoid arthritis: from multi-omics discovery to adjunctive therapy with conjugated linoleic acid.

Arthritis research & therapy·2026
Same author

Transformer-Based Multiomics Study Identifies Important Role of Glycine, Serine, and Threonine Metabolism Pathway in Rheumatoid Arthritis Complicated by Anemia.

Computational and structural biotechnology journal·2026
Same author

High-Throughput Single-Cell-Resolved Spatial Proteomics Enabled by an Ordered Colloidal Crystal Column.

Angewandte Chemie (International ed. in English)·2026
Same journal

CardiaTics: An explainable AI integrated heart disease diagnosis model with feature engineering and stacked ensemble approach.

Journal of big data·2026
Same journal

Comprehensive representation of health-related phenotypes in one million dogs using topic modelling of electronic health records.

Journal of big data·2026
Same journal

UniqueNOSD: a novel framework for NoSQL over SQL databases.

Journal of big data·2025
Same journal

<i>F</i>u<i>n</i>Da: scalable serverless data analytics and in situ query processing.

Journal of big data·2025
Same journal

Integrating Big Data, Artificial Intelligence, and motion analysis for emerging precision medicine applications in Parkinson's Disease.

Journal of big data·2024
Same journal

Interpolation-split: a data-centric deep learning approach with big interpolated data to boost airway segmentation performance.

Journal of big data·2024
See all related articles

Related Experiment Video

Updated: Oct 13, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K

Domain randomization for neural network classification.

Svetozar Zarko Valtchev1, Jianhong Wu1

  • 1Laboratory of Industrial and Applied Mathematics, York University, 4700 Keele St, M3J 1P3 Toronto, ON Canada.

Journal of Big Data
|November 11, 2021
PubMed
Summary
This summary is machine-generated.

Synthetic data, generated using domain randomization (DR), can train neural network classifiers like convolutional neural networks (CNNs) to rival state-of-the-art accuracy. Varying subjects is key, improving generalization to new domains.

Keywords:
Domain randomizationNeural network classifiersSynthetic image generation

More Related Videos

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.7K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.1K

Related Experiment Videos

Last Updated: Oct 13, 2025

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks
11:18

Closed-loop Neuro-robotic Experiments to Test Computational Properties of Neuronal Networks

Published on: March 2, 2015

10.5K
Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques
08:05

Measuring Statistical Learning Across Modalities and Domains in School-Aged Children Via an Online Platform and Neuroimaging Techniques

Published on: June 30, 2020

7.7K
Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations
12:27

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

7.1K

Area of Science:

  • Computer Science
  • Artificial Intelligence
  • Machine Learning

Background:

  • Training neural networks, especially convolutional neural networks (CNNs), demands extensive labeled image datasets, often tens of thousands per category.
  • Acquiring and labeling these datasets is costly, time-consuming, and labor-intensive.
  • CNNs frequently struggle with generalization to out-of-domain test sets.

Purpose of the Study:

  • To investigate the efficacy of synthetic data generated via domain randomization (DR) for training neural network classifiers.
  • To determine the impact of various DR parameters on classifier accuracy and generalization.
  • To compare the performance of models trained on synthetic data against those trained on real-world data.

Main Methods:

  • Generated synthetic image datasets using domain randomization (DR) techniques.
  • Trained convolutional neural network (CNN) classifiers on the synthetic datasets.
  • Evaluated classifier performance on a baseline cats vs. dogs classification task and out-of-domain test sets.
  • Analyzed the significance of different DR parameters, including subject variety, lighting, and textures.

Main Results:

  • A well-generated synthetic dataset using DR achieved high accuracy (up to 88%) on a cats vs. dogs classification task, rivaling models trained on real datasets.
  • A wide variety of subjects was identified as the most crucial DR parameter for model accuracy.
  • Secondary parameters like lighting and textures had less impact on model performance.
  • Models trained on domain-randomized images demonstrated superior transfer learning capabilities to new domains compared to models trained on real photos.
  • Model performance remained stable with an increasing number of categories.

Conclusions:

  • Synthetic data generated through domain randomization offers a cost-effective and efficient alternative to large, manually labeled real-world datasets for training CNNs.
  • Domain randomization is a viable technique to improve the generalization ability of neural network classifiers.
  • Prioritizing subject variety in synthetic data generation is critical for maximizing classifier performance and out-of-domain transfer.