Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Causality in Epidemiology01:21

Causality in Epidemiology

377
Causality or causation is a fundamental concept in epidemiology, vital for understanding the relationships between various factors and health outcomes. Despite its importance, there's no single, universally accepted definition of causality within the discipline. Drawing from a systematic review, causality in epidemiology encompasses several definitions, including production, necessary and sufficient, sufficient-component, counterfactual, and probabilistic models. Each has its strengths and...
377
Survival Tree01:19

Survival Tree

78
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
78
Censoring Survival Data01:09

Censoring Survival Data

75
Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...
75
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

119
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
119
Statistical Methods for Analyzing Epidemiological Data01:25

Statistical Methods for Analyzing Epidemiological Data

346
Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:
346
Clearance Models: Noncompartmental Models01:17

Clearance Models: Noncompartmental Models

57
Clearance is a pharmacokinetic parameter traditionally defined by compartment models, signifying the rate at which a drug is expelled from the body. However, a noncompartmental model offers an alternative method for assessing clearance, primarily employing empirical data obtained after administering a single drug dose.
The noncompartmental approach capitalizes on extensive sampling data, correlating the volume of distribution to systemic exposure and the administered dosage. This method enables...
57

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Doubly regularized generalized linear models for spatial observations with high-dimensional covariates.

Journal of the Royal Statistical Society. Series C, Applied statistics·2026
Same author

Intraindividual cognitive variability predicts amyloid beta, tau PET, and dementia conversion in Down syndrome: a potential marker of cognitive resilience.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026
Same author

Building an Interoperable Rare Disease Multi-omic Resource: The GREGoR Data Model and Dataset.

bioRxiv : the preprint server for biology·2026
Same author

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks.

Sankhya. Series A. (2008)·2026
Same author

Digital Atlases to Unlock the Potential of Brain Biorepository Tissues for Interdisciplinary Research.

bioRxiv : the preprint server for biology·2026
Same author

Artificial Intelligence-Enhanced Electrocardiography and Health Records to Predict Cardiac Arrest.

JACC. Advances·2026
Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026
Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026
Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026
Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026
Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026
Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026
See all related articles

Related Experiment Video

Updated: Jun 20, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Directed Graphical Models and Causal Discovery for Zero-Inflated Data.

Shiqing Yu1, Mathias Drton2, Ali Shojaie3

  • 1Department of Statistics, University of Washington, Seattle, Washington, 98195, U.S.A.

Proceedings of Machine Learning Research
|July 19, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a new statistical model for analyzing single-cell gene expression data, addressing challenges posed by zero-inflated patterns. The developed directed graphical models accurately identify gene regulatory networks from complex biological data.

Keywords:
Bayesian networkcausal discoverydirected acyclic graphidentifiability

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K

Related Experiment Videos

Last Updated: Jun 20, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.1K
Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills
06:52

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

6.3K

Area of Science:

  • Computational Biology
  • Genomics
  • Statistical Genetics

Background:

  • Single-cell gene expression measurements offer high-resolution insights into cellular regulatory mechanisms.
  • Existing statistical methods struggle with zero-inflated data common in single-cell transcriptomics.
  • Directed graphical models are suitable for inferring gene regulatory relationships but require adaptation for zero-inflated data.

Purpose of the Study:

  • To develop a novel directed graphical model capable of handling zero-inflated single-cell gene expression data.
  • To enable accurate identification of gene regulatory networks from complex single-cell data.
  • To address the identifiability challenges in directed acyclic graph (DAG) recovery for such data.

Main Methods:

  • Proposed directed graphical models utilizing Hurdle conditional distributions.
  • Parametrization based on polynomials of parent variables and their zero/nonzero indicators.
  • Development of graph recovery methods and validation through simulated experiments and real single-cell data analysis.

Main Results:

  • Demonstrated that the proposed zero-inflated models allow for the identification of the exact directed acyclic graph under a weak assumption.
  • Successfully applied the model to real single-cell gene expression data from T helper cells.
  • Simulated experiments confirmed the identifiability and accuracy of the graph estimation methods.

Conclusions:

  • The developed directed graphical models effectively address zero-inflation in single-cell gene expression data.
  • The proposed methods enable robust identification of gene regulatory networks.
  • This approach advances the analysis of complex single-cell transcriptomic data for biological discovery.