Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

160
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
160
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Regression Analysis01:11

Regression Analysis

5.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.6K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.3K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.3K
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

1.2K
A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
1.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Targeted Delivery of Indole-3-Pyruvic Acid Suppresses Macrophage Ferroptosis to Enhance CD8<sup>+</sup> T Cell-Mediated Immunotherapy Response in Bladder Cancer.

Advanced science (Weinheim, Baden-Wurttemberg, Germany)·2026
Same author

An H<sub>2</sub>O<sub>2</sub> and MPO programmable responsive MRI probe for early detection of drug-induced acute kidney injury via spatiotemporal monitoring of renal oxidative stress and inflammation.

Redox biology·2026
Same author

Active Hydrogen Reservoir Enabled by p-d Orbital Hybridization in PdSb Metallene for Electrocatalytic Alkynol Semi‑Hydrogenation at Large Current Densities.

Advanced materials (Deerfield Beach, Fla.)·2026
Same author

New insight into light utilization efficiency: an evaluation of semitransparent solar cells for building-integrated photovoltaic windows.

Scientific reports·2026
Same author

Imaging characteristics and discrimination model development for early gastric cancer using multi-spectral CT.

Surgical endoscopy·2026
Same author

Digestive Tract Structure and Seasonal Dynamics of Gut Microbiota in <i>Hypomesus nipponensis</i> from Bosten Lake.

Animals : an open access journal from MDPI·2026
Same journal

Fast penalized generalized estimating equations for large longitudinal functional datasets.

Biometrics·2026
Same journal

Causally-interpretable random-effects meta-analysis.

Biometrics·2026
Same journal

Statistical inference for mean function of partially observed functional time series.

Biometrics·2026
Same journal

Subgroup identification via Interaction Tree and Mixed Model for Repeated Measures with application to Alzheimer's disease.

Biometrics·2026
Same journal

Finite mixtures of linear quantile regressions with concomitant variables: a solution to endogeneity in longitudinal data modeling.

Biometrics·2026
Same journal

Discussion on "INTACT: a method for integration of longitudinal physical activity data from multiple sources" by Jingru Zhang, Erjia Cui, Hongzhe Li, and Haochang Shou.

Biometrics·2026
See all related articles

Related Experiment Video

Updated: Jun 12, 2025

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K

Leveraging independence in high-dimensional mixed linear regression.

Ning Wang1, Kai Deng2, Qing Mai2

  • 1Department of Statistics, Beijing Normal University, Zhuhai, 519000, China.

Biometrics
|September 24, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a novel penalized expectation-maximization (EM) algorithm for high-dimensional mixed linear regression, improving regression coefficient estimation and variable selection. The method efficiently handles numerous predictors, outperforming existing approaches in complex datasets.

Keywords:
EM algorithmfinite mixture modelgroup lassolatent variable model

More Related Videos

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

6.9K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K

Related Experiment Videos

Last Updated: Jun 12, 2025

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K
Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits
08:27

Applying an eMASS Customization Program as a Research Tool to Evaluate Consumer Benefits

Published on: September 27, 2019

6.9K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

16.8K

Area of Science:

  • Statistics
  • Machine Learning
  • Bioinformatics

Background:

  • High-dimensional data presents challenges for mixed linear regression.
  • Existing methods often oversimplify predictor variability and lack synergistic selection.
  • The expectation-maximization (EM) algorithm is a common approach for likelihood maximization.

Purpose of the Study:

  • To develop an efficient method for estimating regression coefficients and selecting predictors in high-dimensional mixed linear regression.
  • To address limitations of existing EM-based procedures by accounting for predictor variability.
  • To achieve synergistic variable selection across mixture components.

Main Methods:

  • Leveraging the independence between predictors and latent indicator variables for efficient computation.
  • Incorporating fast group-penalized EM estimation.
  • Establishing non-asymptotic convergence rates for the proposed estimator.

Main Results:

  • The proposed method facilitates efficient computation and synergistic variable selection.
  • Non-asymptotic convergence rates to true regression parameters are established.
  • Demonstrated effectiveness through extensive simulations and real-world data analysis.

Conclusions:

  • The novel penalized EM algorithm offers a robust solution for high-dimensional mixed linear regression.
  • The method enhances both parameter estimation and variable selection accuracy.
  • Applicable to biological data, such as predicting anticancer drug sensitivity.