Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Interpretation of Confidence Intervals01:19

Interpretation of Confidence Intervals

8.9K
A confidence interval is a better estimate of the population than a point estimate, as it uses a range of values from a sample instead of a single value.
Confidence intervals have confidence coefficients that are crucial for their interpretation. The most common confidence coefficients are 0.90, 0.95, and 0.99, which can be written as percentages–90%, 95%, and 99%, respectively.
Suppose a person calculates a confidence interval with a confidence coefficient of 0.95. In that case, they can...
8.9K
Confidence Intervals01:21

Confidence Intervals

9.7K
An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a  sample proportion. However, unlike the point estimate which is a single value, the confidence interval  contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...
9.7K
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.9K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.9K
Uncertainty: Confidence Intervals00:54

Uncertainty: Confidence Intervals

9.3K
The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...
9.3K
Confidence Interval for Estimating Population Mean01:25

Confidence Interval for Estimating Population Mean

8.6K
A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...
8.6K
Confidence Coefficient01:24

Confidence Coefficient

10.0K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
10.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Turbulent Aggregation and Deposition Mechanism of Respirable Dust Pollutants under Wet Dedusting using a Two-Fluid Model with the Population Balance Method.

International journal of environmental research and public health·2019
Same author

The Involvement of Descending Pain Inhibitory System in Electroacupuncture-Induced Analgesia.

Frontiers in integrative neuroscience·2019
Same author

Direct modification of polyketone resin for anion exchange membrane of alkaline fuel cells.

Journal of colloid and interface science·2019
Same author

Palladium-Catalyzed Site-Selective C(sp<sup>3</sup>)-H Arylation of Phenylacetaldehydes.

Organic letters·2019
Same author

Electrochemical Oxidation of 5-Hydroxymethylfurfural on Nickel Nitride/Carbon Nanosheets: Reaction Pathway Determined by In Situ Sum Frequency Generation Vibrational Spectroscopy.

Angewandte Chemie (International ed. in English)·2019
Same author

Chiral Phosphoric-Acid-Catalyzed Cascade Prins Cyclization.

Organic letters·2019
Same journal

A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values.

Journal of multivariate analysis·2026
Same journal

Hierarchical structure-guided high-dimensional multi-view clustering.

Journal of multivariate analysis·2026
Same journal

Quadratic inference with dense functional responses.

Journal of multivariate analysis·2025
Same journal

Graph-constrained Analysis for Multivariate Functional Data.

Journal of multivariate analysis·2025
Same journal

From multivariate to functional data analysis: fundamentals, recent developments, and emerging areas.

Journal of multivariate analysis·2024
Same journal

Modeling the Cholesky factors of covariance matrices of multivariate longitudinal data.

Journal of multivariate analysis·2024
See all related articles

Related Experiment Video

Updated: Dec 10, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K

Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution.

Lu Tang1, Ling Zhou2, Peter X-K Song3

  • 1Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, USA.

Journal of Multivariate Analysis
|September 1, 2020
PubMed
Summary
This summary is machine-generated.

We introduce a scalable distributed method for analyzing large datasets in generalized linear models. Our approach combines results from smaller data subsets, ensuring accurate simultaneous inference comparable to centralized analysis.

Keywords:
Bias correctionConfidence distributionInferenceLassoMeta-analysisParallel computing

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.8K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.4K

Related Experiment Videos

Last Updated: Dec 10, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.8K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.4K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • Analyzing large-scale datasets (N >> p) in generalized linear models presents computational challenges for single machines.
  • Existing distributed 'divide-and-combine' strategies face issues with uneven data partitions and require robust regularization.
  • Lack of clear theoretical guidance for combining regularized estimates hinders simultaneous inference in distributed settings.

Purpose of the Study:

  • To develop a statistically sound and scalable distributed method for simultaneous inference in generalized linear models.
  • To address the challenges of combining regularized estimates from partitioned datasets.
  • To provide a practical approach for analyzing big data where centralized computation is infeasible.

Main Methods:

  • Proposed a novel 'divide-and-combine' strategy for distributed data analysis.
  • Employed bias-corrected lasso-type estimators to handle potential numerical instability from data partitioning.
  • Utilized confidence distributions to effectively combine results from distributed sub-analyses for simultaneous inference.

Main Results:

  • The developed distributed method achieves estimation efficiency equivalent to centralized maximum likelihood estimation.
  • The combined estimator demonstrates robust performance even with unevenly sized data partitions.
  • Simulated and real-world data analyses confirm that the proposed method yields inference nearly identical to centralized benchmarks.

Conclusions:

  • The proposed distributed method offers a scalable and theoretically justified solution for simultaneous inference in large-scale generalized linear models.
  • This approach effectively overcomes the limitations of traditional centralized analysis for big data.
  • The method provides a reliable alternative for researchers and practitioners dealing with massive datasets stored in distributed systems.