Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Cluster Sampling Method01:20

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
Multi-input and Multi-variable systems01:22

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence of...
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models01:06

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
Aggregates Classification01:29

Aggregates Classification

Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

[The impact of ultrasound-based diaphragmatic targeted functional exercise bundle strategy on clinical outcomes of patients with acute exacerbation of chronic obstructive pulmonary disease combined with type II respiratory failure receiving mechanical ventilation].

Zhonghua wei zhong bing ji jiu yi xue·2026
Same author

Medicare Insurance Type and Broad Genomic Profiling in Metastatic Cancer.

JAMA network open·2026
Same author

Doubly Robust Estimators of the Restricted Mean Time in Favor Estimands in Individual- and Cluster-Randomized Trials.

Statistics in medicine·2026
Same author

JOINT IDENTIFICATION OF SPATIALLY VARIABLE GENES VIA A NETWORK-ASSISTED BAYESIAN REGULARIZATION APPROACH.

The annals of applied statistics·2026
Same author

Subgroup Analysis of Differential Networks with Latent Variables.

Statistics and computing·2026
Same author

Reactive Hydrogen-Mediated Peroxydisulfate Activation for Boosting Carbon Nitride Electrochemiluminescence.

Analytical chemistry·2026
Same journal

Classification Under Local Differential Privacy with Model Reversal and Model Averaging.

Journal of machine learning research : JMLR·2026
Same journal

Sparse Semiparametric Discriminant Analysis for High-dimensional Zero-inflated Data.

Journal of machine learning research : JMLR·2026
Same journal

Unsupervised Tree Boosting for Learning Probability Distributions.

Journal of machine learning research : JMLR·2026
Same journal

A Two-Stage Penalized Least Squares Method for Constructing Large Systems of Structural Equations.

Journal of machine learning research : JMLR·2026
Same journal

Bayesian Multinomial Logistic Normal Models through Marginally Latent Matrix-T Processes.

Journal of machine learning research : JMLR·2026
Same journal

Multi-source Learning via Completion of Block-wise Overlapping Noisy Matrices.

Journal of machine learning research : JMLR·2026
See all related articles

Related Experiment Videos

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis.

Yuanxing Chen1, Qingzhao Zhang2, Shuangge Ma3

  • 1Department of Statistics and Data Science, School of Economics, Xiamen University, Xiamen, 361005, China.

Journal of Machine Learning Research : JMLR
|June 17, 2026
PubMed
Summary
This summary is machine-generated.

This study introduces a novel penalization approach for distributed learning, enabling automatic client clustering and improved model performance. The method enhances understanding of client interconnections while reducing parameters for better data integration.

Keywords:
clustering structuredata heterogeneityhigh dimensionalitypenalizationsparsity

Related Experiment Videos

Area of Science:

  • Machine Learning
  • Statistical Modeling
  • Data Science

Background:

  • Distributed data integration is common across fields like finance and omics.
  • Privacy concerns often prevent raw data integration, necessitating distributed learning with summary statistics.
  • Existing methods often assume homogeneous models across clients, limiting applicability to heterogeneous data.

Purpose of the Study:

  • To develop a novel penalization approach for distributed learning that accommodates client clustering.
  • To enable automatic clustering of clients into groups with shared models.
  • To improve understanding of interconnections among distributed data sources and reduce model complexity.

Main Methods:

  • A novel penalization approach incorporating group penalization for variable selection and fusion penalization for client clustering.
  • Development of an effective Alternating Direction Method of Multipliers (ADMM) algorithm.
  • Theoretical establishment of estimation, selection, and clustering consistency properties under mild conditions.

Main Results:

  • The proposed method effectively clusters clients, assigning similar models within clusters and distinct models across clusters.
  • Group and fusion penalization facilitate regularized estimation, variable selection, and automatic client clustering.
  • The ADMM algorithm provides an efficient solution for the developed penalized objective function.

Conclusions:

  • The novel penalization approach offers a superior method for distributed learning with clustered clients, addressing data heterogeneity.
  • The technique enhances interpretability by revealing client interconnections and reduces computational burden through parameter reduction.
  • Simulation studies and real-world data analysis validate the practical utility and effectiveness of the proposed distributed learning framework.