Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...

Aggregates Classification

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

[The impact of ultrasound-based diaphragmatic targeted functional exercise bundle strategy on clinical outcomes of patients with acute exacerbation of chronic obstructive pulmonary disease combined with type II respiratory failure receiving mechanical ventilation].

Zhonghua wei zhong bing ji jiu yi xue·2026

Same author

Medicare Insurance Type and Broad Genomic Profiling in Metastatic Cancer.

JAMA network open·2026

Same author

Doubly Robust Estimators of the Restricted Mean Time in Favor Estimands in Individual- and Cluster-Randomized Trials.

Statistics in medicine·2026

Same author

JOINT IDENTIFICATION OF SPATIALLY VARIABLE GENES VIA A NETWORK-ASSISTED BAYESIAN REGULARIZATION APPROACH.

The annals of applied statistics·2026

Same author

Subgroup Analysis of Differential Networks with Latent Variables.

Statistics and computing·2026

Same author

Reactive Hydrogen-Mediated Peroxydisulfate Activation for Boosting Carbon Nitride Electrochemiluminescence.

Analytical chemistry·2026

Same journal

Classification Under Local Differential Privacy with Model Reversal and Model Averaging.

Journal of machine learning research : JMLR·2026

Same journal

Sparse Semiparametric Discriminant Analysis for High-dimensional Zero-inflated Data.

Journal of machine learning research : JMLR·2026

Same journal

Unsupervised Tree Boosting for Learning Probability Distributions.

Journal of machine learning research : JMLR·2026

Same journal

A Two-Stage Penalized Least Squares Method for Constructing Large Systems of Structural Equations.

Journal of machine learning research : JMLR·2026

Same journal

Bayesian Multinomial Logistic Normal Models through Marginally Latent Matrix-T Processes.

Journal of machine learning research : JMLR·2026

Same journal

Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices.

Journal of machine learning research : JMLR·2026

See all related articles

Search research articles

Related Experiment Videos

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis.

Yuanxing Chen¹, Qingzhao Zhang², Shuangge Ma³

¹Department of Statistics and Data Science, School of Economics, Xiamen University, Xiamen, 361005, China.

Journal of Machine Learning Research : JMLR

|June 17, 2026

Summary

This summary is machine-generated.

This study introduces a novel penalization approach for distributed learning, enabling automatic client clustering and improved model performance. The method enhances understanding of client interconnections while reducing parameters for better data integration.

Keywords:

clustering structure data heterogeneity high dimensionality penalization sparsity

Related Experiment Videos

Area of Science:

Machine Learning
Statistical Modeling
Data Science

Background:

Distributed data integration is common across fields like finance and omics.
Privacy concerns often prevent raw data integration, necessitating distributed learning with summary statistics.
Existing methods often assume homogeneous models across clients, limiting applicability to heterogeneous data.

Purpose of the Study:

To develop a novel penalization approach for distributed learning that accommodates client clustering.
To enable automatic clustering of clients into groups with shared models.
To improve understanding of interconnections among distributed data sources and reduce model complexity.

Main Methods:

A novel penalization approach incorporating group penalization for variable selection and fusion penalization for client clustering.
Development of an effective Alternating Direction Method of Multipliers (ADMM) algorithm.
Theoretical establishment of estimation, selection, and clustering consistency properties under mild conditions.

Main Results:

The proposed method effectively clusters clients, assigning similar models within clusters and distinct models across clusters.
Group and fusion penalization facilitate regularized estimation, variable selection, and automatic client clustering.
The ADMM algorithm provides an efficient solution for the developed penalized objective function.

Conclusions:

The novel penalization approach offers a superior method for distributed learning with clustered clients, addressing data heterogeneity.
The technique enhances interpretability by revealing client interconnections and reduces computational burden through parameter reduction.
Simulation studies and real-world data analysis validate the practical utility and effectiveness of the proposed distributed learning framework.