Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Trial and Error and Algorithm

Trial and Error and Algorithm

A problem-solving strategy is a plan of action used to find a solution. Different strategies have distinct action plans. Trial and error involves trying different solutions until one works. For instance, to fix a broken printer, you might check ink levels, ensure the paper tray isn't jammed, and verify the printer's connection to your laptop. This method can be time-consuming but is commonly used. Thomas Edison, for example, used trial and error to find a suitable filament for the light...

Optimal Foraging

Optimal Foraging

How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.

Dimensional Analysis

Dimensional Analysis

Dimensional analysis, also known as the factor label method, is a versatile approach for mathematical operations. The main principle behind this approach is: the units of quantities must be subjected to the same mathematical operations as their associated numbers. This method can be applied to computations ranging from simple unit conversions to more complex and multi-step calculations involving several different quantities and their units.
Conversion Factors and Dimensional Analysis
The unit...

How Data are Classified: Numerical Data

How Data are Classified: Numerical Data

Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...

Optimization Problems

Optimization Problems

Optimization problems often involve identifying maximum or minimum values under specific constraints. A well-known example is determining the longest horizontal pipe that can be moved around a right-angled corner, where a 3-meter-wide hallway meets a 2-meter-wide hallway. This scenario, common in architectural design and industrial transport, can be understood conceptually through geometric and trigonometric reasoning.To visualize the problem, consider the pipe as a straight line that touches...

How Data are Classified: Categorical Data

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Penicillin Allergy Labels, Broad-Spectrum Antibiotic Use, and Chronic Obstructive Pulmonary Disease Exacerbations: A Population-Based Cohort Study from China.

Infection and drug resistance·2026

Same author

Interaction model of client health behavior-based nursing intervention improves outcomes in patients with pressure injury: A quasi-experimental study.

Scientific reports·2026

Same author

Applications of machine learning in the diagnosis of non-alcoholic fatty liver disease: a systematic review and meta-analysis.

BMC gastroenterology·2026

Same author

Development and validation of a multimodal clinical-radiomics-deep learning nomogram based on automated chest CT segmentation for classifying COPD severity: a multicenter study.

Frontiers in medicine·2026

Same author

Long-term effects of thinning intensity on individual growth and stand basal area recovery in a mixed broadleaf-Korean pine forest.

Frontiers in plant science·2026

Same author

A novel subunit vaccine based on glycoprotein H and glycoprotein L fusion against pseudorabies virus infection.

Vaccine·2026

Same journal

Simplifying debiased inference via automatic differentiation and probabilistic programming.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

Same journal

Principal stratification with U-statistics under principal ignorability.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

Same journal

Causal K-Means Clustering.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

Same journal

Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

Same journal

Correction to: Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

Same journal

Harmonized Estimation of Subgroup-Specific Treatment Effects in Randomized Trials: The Use of External Control Data.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jan 24, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

An imputation-regularized optimization algorithm for high dimensional missing data problems and beyond.

Faming Liang¹, Bochao Jia², Jingnan Xue³

¹Purdue University, West Lafayette, USA.

Journal of the Royal Statistical Society. Series B, Statistical Methodology

|May 28, 2019

Summary

This summary is machine-generated.

This study introduces a novel general algorithm to address high-dimensional missing data problems. The method iteratively imputes missing values and uses regularized optimization for accurate parameter estimation, enhancing statistical analysis.

Keywords:

Expectation-maximization algorithm Gaussian graphical model Gibbs sampler Imputation consistency Random-coefficient model Variable selection

More Related Videos

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Published on: July 23, 2020

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Related Experiment Videos

Last Updated: Jan 24, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Published on: July 23, 2020

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

Area of Science:

Statistics
Machine Learning
Data Science

Background:

Missing data are common in high-dimensional analyses, posing challenges for standard algorithms like expectation-maximization.
Existing solutions are often problem-specific, lacking a generalizable approach for diverse high-dimensional missing data scenarios.

Purpose of the Study:

To propose a novel, general algorithm for effectively handling missing data in high-dimensional statistical and machine learning problems.
To provide a robust framework that overcomes the limitations of existing specialized methods.

Main Methods:

A two-step iterative process involving conditional imputation of missing data and a regularized optimization step.
Utilizing Kullback-Leibler divergence on pseudo-complete data and sparsity constraints for consistent parameter estimation in high dimensions.

Main Results:

The proposed algorithm demonstrates consistency in parameter estimation under general conditions, even with high-dimensional data.
Successful application illustrated across high-dimensional Gaussian graphical models, variable selection, and random-coefficient models.

Conclusions:

The developed algorithm offers a versatile and effective solution for high-dimensional missing data problems.
This work bridges the gap by providing a general method applicable to various complex statistical modeling tasks.