Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Interpretation of Confidence Intervals

Interpretation of Confidence Intervals

A confidence interval is a better estimate of the population than a point estimate, as it uses a range of values from a sample instead of a single value.
Confidence intervals have confidence coefficients that are crucial for their interpretation. The most common confidence coefficients are 0.90, 0.95, and 0.99, which can be written as percentages–90%, 95%, and 99%, respectively.
Suppose a person calculates a confidence interval with a confidence coefficient of 0.95. In that case, they can...

Confidence Intervals

Confidence Intervals

An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a sample proportion. However, unlike the point estimate which is a single value, the confidence interval contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...

Distributions to Estimate Population Parameter

Distributions to Estimate Population Parameter

The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...

Uncertainty: Confidence Intervals

Uncertainty: Confidence Intervals

The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...

Confidence Interval for Estimating Population Mean

Confidence Interval for Estimating Population Mean

A point estimate of the population mean is obtained from a single sample. Such a point estimate does not represent a population well because it needs to account for variability in the population. Single point estimate can also be biased despite the sample being selected randomly. Thus, a point estimate is often unreliable. A confidence interval is needed to reduce this unreliability.
A confidence interval for the mean is a range of values that provides an estimate of the population mean. As the...

Confidence Coefficient

Confidence Coefficient

The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Turbulent Aggregation and Deposition Mechanism of Respirable Dust Pollutants under Wet Dedusting using a Two-Fluid Model with the Population Balance Method.

International journal of environmental research and public health·2019

Same author

The Involvement of Descending Pain Inhibitory System in Electroacupuncture-Induced Analgesia.

Frontiers in integrative neuroscience·2019

Same author

Direct modification of polyketone resin for anion exchange membrane of alkaline fuel cells.

Journal of colloid and interface science·2019

Same author

Palladium-Catalyzed Site-Selective C(sp<sup>3</sup>)-H Arylation of Phenylacetaldehydes.

Organic letters·2019

Same author

Electrochemical Oxidation of 5-Hydroxymethylfurfural on Nickel Nitride/Carbon Nanosheets: Reaction Pathway Determined by In Situ Sum Frequency Generation Vibrational Spectroscopy.

Angewandte Chemie (International ed. in English)·2019

Same author

Chiral Phosphoric-Acid-Catalyzed Cascade Prins Cyclization.

Organic letters·2019

Same journal

A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values.

Journal of multivariate analysis·2026

Same journal

Hierarchical structure-guided high-dimensional multi-view clustering.

Journal of multivariate analysis·2026

Same journal

Quadratic inference with dense functional responses.

Journal of multivariate analysis·2025

Same journal

Graph-constrained Analysis for Multivariate Functional Data.

Journal of multivariate analysis·2025

Same journal

From multivariate to functional data analysis: fundamentals, recent developments, and emerging areas.

Journal of multivariate analysis·2024

Same journal

Modeling the Cholesky factors of covariance matrices of multivariate longitudinal data.

Journal of multivariate analysis·2024

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 10, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Distributed Simultaneous Inference in Generalized Linear Models via Confidence Distribution.

Lu Tang¹, Ling Zhou², Peter X-K Song³

¹Department of Biostatistics, University of Pittsburgh, Pittsburgh, PA 15261, USA.

Journal of Multivariate Analysis

|September 1, 2020

Summary

This summary is machine-generated.

We introduce a scalable distributed method for analyzing large datasets in generalized linear models. Our approach combines results from smaller data subsets, ensuring accurate simultaneous inference comparable to centralized analysis.

Keywords:

Bias correction Confidence distribution Inference Lasso Meta-analysis Parallel computing

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Related Experiment Videos

Last Updated: Dec 10, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Area of Science:

Statistics
Machine Learning
Data Science

Background:

Analyzing large-scale datasets (N >> p) in generalized linear models presents computational challenges for single machines.
Existing distributed 'divide-and-combine' strategies face issues with uneven data partitions and require robust regularization.
Lack of clear theoretical guidance for combining regularized estimates hinders simultaneous inference in distributed settings.

Purpose of the Study:

To develop a statistically sound and scalable distributed method for simultaneous inference in generalized linear models.
To address the challenges of combining regularized estimates from partitioned datasets.
To provide a practical approach for analyzing big data where centralized computation is infeasible.

Main Methods:

Proposed a novel 'divide-and-combine' strategy for distributed data analysis.
Employed bias-corrected lasso-type estimators to handle potential numerical instability from data partitioning.
Utilized confidence distributions to effectively combine results from distributed sub-analyses for simultaneous inference.

Main Results:

The developed distributed method achieves estimation efficiency equivalent to centralized maximum likelihood estimation.
The combined estimator demonstrates robust performance even with unevenly sized data partitions.
Simulated and real-world data analyses confirm that the proposed method yields inference nearly identical to centralized benchmarks.

Conclusions:

The proposed distributed method offers a scalable and theoretically justified solution for simultaneous inference in large-scale generalized linear models.
This approach effectively overcomes the limitations of traditional centralized analysis for big data.
The method provides a reliable alternative for researchers and practitioners dealing with massive datasets stored in distributed systems.