Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

598
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
598
Prediction Intervals01:03

Prediction Intervals

3.6K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.6K
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

5.8K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
5.8K
Test for Homogeneity01:23

Test for Homogeneity

2.6K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.6K
Introduction to Nonparametric Statistics01:28

Introduction to Nonparametric Statistics

1.7K
Nonparametric statistics offer a powerful alternative to traditional parametric methods, useful when assumptions about the population distribution cannot be made. Unlike parametric tests, which require data to follow a specific distribution with well-defined parameters (such as the mean and standard deviation), nonparametric tests do not require such constraints. This makes them particularly valuable when dealing with small sample sizes, skewed data, or ordinal and categorical variables.
One of...
1.7K
Estimating Population Mean with Unknown Standard Deviation01:22

Estimating Population Mean with Unknown Standard Deviation

9.1K
In practice, we rarely know the population standard deviation. In the past, when the sample size was large, this did not present a problem to statisticians. They used the sample standard deviation s as an estimate for σ and proceeded as before to calculate a confidence interval with close enough results. However, statisticians ran into problems when the sample size was small. A small sample size caused inaccuracies in the confidence interval.
William S. Gosset (1876–1937) of the...
9.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Dual-energy CT pulmonary angiography using dual-flow mixture bolus tracking: potential for contrast medium and radiation exposure reduction.

Insights into imaging·2026
Same author

Discovery of Selective Inhibitors for β-Actin Polymerization as Potential Anticancer Agents <i>In Vitro</i> and <i>In Vivo</i>.

Journal of medicinal chemistry·2026
Same author

Extreme-Thickness Meta-Membrane for Controlling Terahertz Vectorial Beams.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

The inter-generational transmission of parental adverse childhood experiences on children's depression and anxiety symptoms: The mediating effect of parenting factors.

Child abuse & neglect·2026
Same author

Psychosocial contributions of discrimination to traumatic stress adaptation among transgender and nonbinary adults: A longitudinal examination.

Journal of traumatic stress·2026
Same author

Risk factors and external validation of nomogram model for patients with postoperative hemoglobin < 80 g/L after lumbar interbody fusion surgery.

BMC musculoskeletal disorders·2026
Same journal

Probabilistic Joint and Individual Variation Explained (ProJIVE) for Data Integration.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

fastkqr: A Fast Algorithm for Kernel Quantile Regression.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Empirical Bayes Covariance Decomposition, and a Solution to the Multiple Tuning Problem in Sparse PCA.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Joint Registration and Conformal Prediction for Partially Observed Functional Data.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Efficient Decision Trees for Tensor Regressions.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
Same journal

Fast Bayesian Functional Principal Components Analysis.

Journal of computational and graphical statistics : a joint publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America·2026
See all related articles

Related Experiment Video

Updated: Apr 10, 2026

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.5K

Distributed Nonparametric Regression with Heterogeneity Through Prediction-Based Aggregation.

Zhao Chen1, Runze Li2, Yikai Xu1

  • 1School of Data Science, Fudan University, Shanghai, China.

Journal of Computational and Graphical Statistics : a Joint Publication of American Statistical Association, Institute of Mathematical Statistics, Interface Foundation of North America
|April 9, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel data-driven weighted aggregation method for distributed statistical modeling, enhancing privacy and communication efficiency in large-scale datasets. The procedure optimizes model performance in heterogeneous environments, offering significant advantages for data analysis.

Keywords:
Adaptive weighted combinationAdditive modelB-splineDistributed learningPrediction accuracy

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K

Related Experiment Videos

Last Updated: Apr 10, 2026

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups
14:14

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

6.5K
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.8K

Area of Science:

  • Distributed statistical modeling
  • Machine learning
  • Data privacy

Background:

  • Large-scale datasets present challenges in statistical modeling, particularly regarding data privacy and communication efficiency.
  • Heterogeneous distributed environments require adaptable modeling approaches.

Purpose of the Study:

  • To propose a data-driven weighted aggregation procedure for distributed statistical modeling.
  • To enhance communication efficiency and adaptability in heterogeneous environments.
  • To ensure data privacy in large-scale data analysis.

Main Methods:

  • A data-driven weighted aggregation procedure utilizing the squared prediction error matrix.
  • Analysis of asymptotical optimal weights and corresponding risk.
  • Investigation of the minimax property for nonparametric function estimates.
  • Monte Carlo simulations for finite sample performance evaluation.

Main Results:

  • The proposed estimates achieve asymptotical optimal weights concerning quadratic loss and risk.
  • Limits of data-driven weights are derived.
  • Demonstrated effectiveness through simulations and a real-world heart rate prediction dataset.

Conclusions:

  • The developed weighted aggregation procedure is effective for distributed statistical modeling.
  • The method ensures communication efficiency and adaptability in heterogeneous settings.
  • It provides a robust approach for analyzing large-scale, privacy-sensitive data.