Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Prediction Intervals01:03

Prediction Intervals

2.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.5K
Variation01:19

Variation

6.3K
An important characteristic of any set of data is the variation in the data. In some data sets, the data values are concentrated closely near the mean; in other data sets, the data values are more widely spread out from the mean. The most common measure of variation, or spread, is the standard deviation, which is the square root of variance.
When independent and dependent variables are plotted on a scatter plot, the slope of a line is a value that describes the rate of change between the two...
6.3K
Uncertainty in Measurement: Accuracy and Precision03:37

Uncertainty in Measurement: Accuracy and Precision

93.8K
Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value. 
93.8K
Standard Error of the Mean01:13

Standard Error of the Mean

9.8K
The sampling variability of a statistic is defined as how much the statistic varies from one sample to another. The sampling variability of a statistic is typically measured by measuring its standard error.The standard error of the mean is an example of a standard error. It is a unique standard deviation known as the standard deviation of the sampling distribution of the mean. The standard error of the mean is a statistic that calculates how correctly a sample distribution represents a...
9.8K
Testing a Claim about Standard Deviation01:19

Testing a Claim about Standard Deviation

2.1K
A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...
2.1K
Estimating Population Mean with Known Standard Deviation01:16

Estimating Population Mean with Known Standard Deviation

7.3K
To construct a confidence interval for a single unknown population mean μ, where the population standard deviation is known, we need sample mean as an estimate for μ and we need the margin of error. Here, the margin of error (EBM) is called the error bound for a population mean (abbreviated EBM). The sample mean is the point estimate of the unknown population mean μ.
The confidence interval estimate will have the form as follows:
(point estimate - error bound, point estimate +...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Enriching Bacteria-Specific RNA From Host Samples Before NGS With Transcript-Capture.

Bio-protocol·2026
Same author

Restraint care in the pediatric intensive care unit: a qualitative investigation of nurses' perspectives and experiences.

Frontiers in pediatrics·2026
Same author

Modulation of Ferroptosis During Early <i>Mycobacterium tuberculosis</i> Infection Contributes to Beijing Lineage Strain SA161 Virulence.

bioRxiv : the preprint server for biology·2026
Same author

Design of sensor cluster configurations and impact localization in composite plates using deterministic maximum likelihood.

Ultrasonics·2026
Same author

A Bifunctional Ferroelectric Catalyst Enabling Simultaneous Photoelectrochemical Water Oxidation and Two-Electron Oxygen Reduction.

Small (Weinheim an der Bergstrasse, Germany)·2026
Same author

Precision nutrition through diet-gut microbiome interactions: Emerging insights driven by artificial intelligence, microbiome health metrics, and mechanistic modeling.

Gut microbes reports·2026
Same journal

Invaders taking over-Mollusc faunal change in volcanic barrier lakes of the Albertine Rift biodiversity hotspot.

PloS one·2026
Same journal

AI-driven molecular diversification and ligand-based optimization of macitentan derivatives targeting VEGFR1 and endothelin signaling pathways.

PloS one·2026
Same journal

Performance patterns and records in the world aquatics masters championships: Where do the most frequently represented nations among the top-ten masters swimmers come from?

PloS one·2026
Same journal

Modeling diurnal Temperature-Rainfall relationships under multicollinearity using PLS-SEM: A case study of Ghana.

PloS one·2026
Same journal

Organizational culture, social capital, and emergency capacity in primary healthcare institutions: A cross-sectional structural equation modeling study comparing ordinary and older communities.

PloS one·2026
Same journal

Impact of kidney function on the metabolome in the general population.

PloS one·2026
See all related articles

Related Experiment Video

Updated: Apr 21, 2026

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

5.8K

Measuring the effect of inter-study variability on estimating prediction error.

Shuyi Ma1, Jaeyun Sung2, Andrew T Magis3

  • 1Institute for Systems Biology, Seattle, Washington, United States of America; Department of Chemical and Biomolecular Engineering, University of Illinois, Urbana, Illinois, United States of America.

Plos One
|October 21, 2014
PubMed
Summary
This summary is machine-generated.

Molecular signatures often fail in clinical translation due to study-effects. Comparing randomized cross-validation (RCV) and inter-study validation (ISV) reveals how data variability impacts predictive performance, guiding the achievement of reliable biomarker discovery.

More Related Videos

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding
06:33

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Published on: October 11, 2018

6.5K
Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

3.1K

Related Experiment Videos

Last Updated: Apr 21, 2026

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

5.8K
Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding
06:33

Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding

Published on: October 11, 2018

6.5K
Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities
10:26

Problem-Solving Before Instruction PS-I: A Protocol for Assessment and Intervention in Students with Different Abilities

Published on: September 11, 2021

3.1K

Area of Science:

  • Biomedical informatics
  • Genomics
  • Translational medicine

Background:

  • Biomarker discovery frequently yields molecular signatures with promising predictive performance that fail to translate to clinical settings.
  • This failure is often attributed to inconsistent classification performance across studies, stemming from technical batch-effects and biological variations.
  • These sources of variability persist even with advanced data collection technologies.

Purpose of the Study:

  • To quantify the impact of combined "study-effects" on disease signature predictive performance.
  • To compare the performance estimates derived from randomized cross-validation (RCV) and inter-study validation (ISV).
  • To establish a method for estimating when sufficient data diversity is achieved for reliable biomarker translation.

Main Methods:

  • Quantified "study-effects" by comparing RCV (random sample subsets) and ISV (entire studies excluded for testing).
  • Assessed how RCV's assumption of identically distributed data differs from ISV's approach.
  • Measured the RCV-ISV performance difference as a function of the number of studies to quantify study-effect influence.

Main Results:

  • Analyzed gene expression data from 1,470 microarray and 769 RNA-seq samples across multiple lung phenotypes and studies.
  • Observed a greater RCV-ISV performance discrepancy in phenotypes with fewer contributing studies.
  • Demonstrated that ISV performance converges towards RCV performance as data from more studies are integrated.

Conclusions:

  • Examining the rate at which ISV performance approaches RCV performance with increasing study numbers provides an estimate of sufficient data diversity.
  • This approach helps determine when a molecular signature is likely to translate to new clinical settings without significant accuracy loss.
  • Highlights the importance of inter-study validation for assessing the robustness and generalizability of biomarkers.