Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Causality in Epidemiology

Causality in Epidemiology

Causality or causation is a fundamental concept in epidemiology, vital for understanding the relationships between various factors and health outcomes. Despite its importance, there's no single, universally accepted definition of causality within the discipline. Drawing from a systematic review, causality in epidemiology encompasses several definitions, including production, necessary and sufficient, sufficient-component, counterfactual, and probabilistic models. Each has its strengths and...

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Censoring Survival Data

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Statistical Methods for Analyzing Epidemiological Data

Statistical Methods for Analyzing Epidemiological Data

Epidemiological data primarily involves information on specific populations' occurrence, distribution, and determinants of health and diseases. This data is crucial for understanding disease patterns and impacts, aiding public health decision-making and disease prevention strategies. The analysis of epidemiological data employs various statistical methods to interpret health-related data effectively. Here are some commonly used methods:

Clearance Models: Noncompartmental Models

Clearance Models: Noncompartmental Models

Clearance is a pharmacokinetic parameter traditionally defined by compartment models, signifying the rate at which a drug is expelled from the body. However, a noncompartmental model offers an alternative method for assessing clearance, primarily employing empirical data obtained after administering a single drug dose.
The noncompartmental approach capitalizes on extensive sampling data, correlating the volume of distribution to systemic exposure and the administered dosage. This method enables...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Doubly regularized generalized linear models for spatial observations with high-dimensional covariates.

Journal of the Royal Statistical Society. Series C, Applied statistics·2026

Same author

Intraindividual cognitive variability predicts amyloid beta, tau PET, and dementia conversion in Down syndrome: a potential marker of cognitive resilience.

Alzheimer's & dementia : the journal of the Alzheimer's Association·2026

Same author

Building an Interoperable Rare Disease Multi-omic Resource: The GREGoR Data Model and Dataset.

bioRxiv : the preprint server for biology·2026

Same author

Covariate-Adjusted Inference for Differential Analysis of High-Dimensional Networks.

Sankhya. Series A. (2008)·2026

Same author

Digital Atlases to Unlock the Potential of Brain Biorepository Tissues for Interdisciplinary Research.

bioRxiv : the preprint server for biology·2026

Same author

Artificial Intelligence-Enhanced Electrocardiography and Health Records to Predict Cardiac Arrest.

JACC. Advances·2026

Same journal

Towards the Efficient Inference by Incorporating Automated Computational Phenotypes under Covariate Shift.

Proceedings of machine learning research·2026

Same journal

Endo-SemiS: Towards Robust Semi-Supervised Image Segmentation for Endoscopic Video.

Proceedings of machine learning research·2026

Same journal

Perspective: Machine Learning for Health Should Consider Social Drivers of Health.

Proceedings of machine learning research·2026

Same journal

Classifying Phonotrauma Severity from Vocal Fold Images with Soft Ordinal Regression.

Proceedings of machine learning research·2026

Same journal

Does Domain-Specific Retrieval Augmented Generation Help LLMs Answer Consumer Health Questions?

Proceedings of machine learning research·2026

Same journal

Quantitative Convergence Analysis of Projected Stochastic Gradient Descent for Non-Convex Losses via the Goldstein Subdifferential.

Proceedings of machine learning research·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 20, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Directed Graphical Models and Causal Discovery for Zero-Inflated Data.

Shiqing Yu¹, Mathias Drton², Ali Shojaie³

¹Department of Statistics, University of Washington, Seattle, Washington, 98195, U.S.A.

Proceedings of Machine Learning Research

|July 19, 2024

Summary

This summary is machine-generated.

This study introduces a new statistical model for analyzing single-cell gene expression data, addressing challenges posed by zero-inflated patterns. The developed directed graphical models accurately identify gene regulatory networks from complex biological data.

Keywords:

Bayesian network causal discovery directed acyclic graph identifiability

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

Related Experiment Videos

Last Updated: Jun 20, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Using Cholesky Decomposition to Explore Individual Differences in Longitudinal Relations between Reading Skills

Published on: September 17, 2019

Area of Science:

Computational Biology
Genomics
Statistical Genetics

Background:

Single-cell gene expression measurements offer high-resolution insights into cellular regulatory mechanisms.
Existing statistical methods struggle with zero-inflated data common in single-cell transcriptomics.
Directed graphical models are suitable for inferring gene regulatory relationships but require adaptation for zero-inflated data.

Purpose of the Study:

To develop a novel directed graphical model capable of handling zero-inflated single-cell gene expression data.
To enable accurate identification of gene regulatory networks from complex single-cell data.
To address the identifiability challenges in directed acyclic graph (DAG) recovery for such data.

Main Methods:

Proposed directed graphical models utilizing Hurdle conditional distributions.
Parametrization based on polynomials of parent variables and their zero/nonzero indicators.
Development of graph recovery methods and validation through simulated experiments and real single-cell data analysis.

Main Results:

Demonstrated that the proposed zero-inflated models allow for the identification of the exact directed acyclic graph under a weak assumption.
Successfully applied the model to real single-cell gene expression data from T helper cells.
Simulated experiments confirmed the identifiability and accuracy of the graph estimation methods.

Conclusions:

The developed directed graphical models effectively address zero-inflation in single-cell gene expression data.
The proposed methods enable robust identification of gene regulatory networks.
This approach advances the analysis of complex single-cell transcriptomic data for biological discovery.