Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Trial and Error and Algorithm01:12

Trial and Error and Algorithm

403
A problem-solving strategy is a plan of action used to find a solution. Different strategies have distinct action plans. Trial and error involves trying different solutions until one works. For instance, to fix a broken printer, you might check ink levels, ensure the paper tray isn't jammed, and verify the printer's connection to your laptop. This method can be time-consuming but is commonly used. Thomas Edison, for example, used trial and error to find a suitable filament for the light...
403
Optimal Foraging00:48

Optimal Foraging

13.7K
How animals obtain and eat their food is called foraging behavior. Foraging can include searching for plants and hunting for prey and depends on the species and environment.
13.7K
Dimensional Analysis03:40

Dimensional Analysis

60.6K
Dimensional analysis, also known as the factor label method, is a versatile approach for mathematical operations. The main principle behind this approach is: the units of quantities must be subjected to the same mathematical operations as their associated numbers. This method can be applied to computations ranging from simple unit conversions to more complex and multi-step calculations involving several different quantities and their units.
Conversion Factors and Dimensional Analysis
The unit...
60.6K
How Data are Classified: Numerical Data00:59

How Data are Classified: Numerical Data

37.1K
Data that are countable or measurable in specific units are called numerical or quantitative data. Quantitative data are always numbers. Quantitative data are the result of counting or measuring the attributes of a population. Amount of money, pulse rate, weight, number of people living in a town, and number of students who opt for statistics are examples of quantitative data.
Quantitative data may be either discrete or continuous. All quantitative data that take on only specific numerical...
37.1K
Optimization Problems01:26

Optimization Problems

58
Optimization problems often involve identifying maximum or minimum values under specific constraints. A well-known example is determining the longest horizontal pipe that can be moved around a right-angled corner, where a 3-meter-wide hallway meets a 2-meter-wide hallway. This scenario, common in architectural design and industrial transport, can be understood conceptually through geometric and trigonometric reasoning.To visualize the problem, consider the pipe as a straight line that touches...
58
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

43.1K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
43.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Penicillin Allergy Labels, Broad-Spectrum Antibiotic Use, and Chronic Obstructive Pulmonary Disease Exacerbations: A Population-Based Cohort Study from China.

Infection and drug resistance·2026
Same author

Interaction model of client health behavior-based nursing intervention improves outcomes in patients with pressure injury: A quasi-experimental study.

Scientific reports·2026
Same author

Applications of machine learning in the diagnosis of non-alcoholic fatty liver disease: a systematic review and meta-analysis.

BMC gastroenterology·2026
Same author

Development and validation of a multimodal clinical-radiomics-deep learning nomogram based on automated chest CT segmentation for classifying COPD severity: a multicenter study.

Frontiers in medicine·2026
Same author

Long-term effects of thinning intensity on individual growth and stand basal area recovery in a mixed broadleaf-Korean pine forest.

Frontiers in plant science·2026
Same author

A novel subunit vaccine based on glycoprotein H and glycoprotein L fusion against pseudorabies virus infection.

Vaccine·2026
Same journal

Simplifying debiased inference via automatic differentiation and probabilistic programming.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Principal stratification with U-statistics under principal ignorability.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Causal K-Means Clustering.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Correction to: Inference of dependency knowledge graph for Electronic Health Records.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
Same journal

Harmonized Estimation of Subgroup-Specific Treatment Effects in Randomized Trials: The Use of External Control Data.

Journal of the Royal Statistical Society. Series B, Statistical methodology·2026
See all related articles

Related Experiment Video

Updated: Jan 24, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K

An imputation-regularized optimization algorithm for high dimensional missing data problems and beyond.

Faming Liang1, Bochao Jia2, Jingnan Xue3

  • 1Purdue University, West Lafayette, USA.

Journal of the Royal Statistical Society. Series B, Statistical Methodology
|May 28, 2019
PubMed
Summary
This summary is machine-generated.

This study introduces a novel general algorithm to address high-dimensional missing data problems. The method iteratively imputes missing values and uses regularized optimization for accurate parameter estimation, enhancing statistical analysis.

Keywords:
Expectation-maximization algorithmGaussian graphical modelGibbs samplerImputation consistencyRandom-coefficient modelVariable selection

More Related Videos

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm
06:53

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Published on: July 23, 2020

6.1K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.9K

Related Experiment Videos

Last Updated: Jan 24, 2026

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.7K
Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm
06:53

Management of Respiratory Motion Artefacts in 18F-fluorodeoxyglucose Positron Emission Tomography using an Amplitude-Based Optimal Respiratory Gating Algorithm

Published on: July 23, 2020

6.1K
ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data
05:12

ExCYT: A Graphical User Interface for Streamlining Analysis of High-Dimensional Cytometry Data

Published on: January 16, 2019

11.9K

Area of Science:

  • Statistics
  • Machine Learning
  • Data Science

Background:

  • Missing data are common in high-dimensional analyses, posing challenges for standard algorithms like expectation-maximization.
  • Existing solutions are often problem-specific, lacking a generalizable approach for diverse high-dimensional missing data scenarios.

Purpose of the Study:

  • To propose a novel, general algorithm for effectively handling missing data in high-dimensional statistical and machine learning problems.
  • To provide a robust framework that overcomes the limitations of existing specialized methods.

Main Methods:

  • A two-step iterative process involving conditional imputation of missing data and a regularized optimization step.
  • Utilizing Kullback-Leibler divergence on pseudo-complete data and sparsity constraints for consistent parameter estimation in high dimensions.

Main Results:

  • The proposed algorithm demonstrates consistency in parameter estimation under general conditions, even with high-dimensional data.
  • Successful application illustrated across high-dimensional Gaussian graphical models, variable selection, and random-coefficient models.

Conclusions:

  • The developed algorithm offers a versatile and effective solution for high-dimensional missing data problems.
  • This work bridges the gap by providing a general method applicable to various complex statistical modeling tasks.