Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Outliers and Influential Points01:08

Outliers and Influential Points

4.1K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
4.1K
Testing a Claim about Population Proportion01:24

Testing a Claim about Population Proportion

3.4K
A complete procedure for testing a claim about a population proportion is provided here.
There are two methods of testing a claim about a population proportion: (1) Using the sample proportion from the data where a binomial distribution is approximated to the normal distribution and (2) Using the binomial probabilities calculated from the data.
The first method uses normal distribution as an approximation to the binomial distribution. The requirements are as follows: sample size is large...
3.4K
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

4.1K
The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
4.1K
What Are Outliers?01:12

What Are Outliers?

3.9K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
3.9K
One-Way ANOVA: Equal Sample Sizes01:15

One-Way ANOVA: Equal Sample Sizes

3.3K
One-Way ANOVA can be performed on three or more samples with equal or unequal sample sizes. When one-way ANOVA is performed on two datasets with samples of equal sizes, it can be easily observed that the computed F statistic is highly sensitive to the sample mean.
Different sample means can result in different values for the variance estimate: variance between samples. This is because the variance between samples is calculated as the product of the sample size and the variance between the...
3.3K
Choosing Between z and t Distribution01:25

Choosing Between z and t Distribution

2.8K
The z and the Student t distribution estimate the population mean using the sample mean and standard deviation. However, to decide which distribution to use for a calculation, one needs to determine the sample size, the nature of the distribution, and whether the population standard deviation is known. If the population standard deviation is known and the population is normally distributed, or if the sample size is greater than 30, the z distribution is preferred. The Student t distribution is...
2.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Evaluating vectors for the design of a spillover-disrupting Lassa virus transmissible vaccine.

PLoS computational biology·2026
Same author

From single-sequences to evolutionary trajectories: protein language models capture the evolutionary potential of SARS-CoV-2.

Nature communications·2026
Same author

Integration of bioinformatic tools for the detection of SARS-CoV-2 co-infection cases.

Microbial genomics·2026
Same author

The potential of H5N1 viruses to adapt to bovine cells varies throughout evolution.

Nature communications·2025
Same author

Avian-origin influenza A viruses tolerate elevated pyrexic temperatures in mammals.

Science (New York, N.Y.)·2025
Same author

ViralBottleneck: an R package for estimating viral transmission bottlenecks from deep sequencing data using multiple methods.

Virus evolution·2025
Same journal

Optimal Weighted Tests for Replication Studies and the 'Two-Trials Rule' With Multiple Hypotheses.

Statistics in medicine·2026
Same journal

Identifiable Copula-Double-Cox Models: A Fully Parametric Framework for Dependent Right-Censored Survival Data.

Statistics in medicine·2026
Same journal

Moving From Individualized Risk-Based Prevention to Benefit-Based Prevention: Estimating Individualized Life-Years Gained From Prevention Services as a Basis for Eligibility.

Statistics in medicine·2026
Same journal

A Mixture of Distributed Lag Non-Linear Models to Account for Spatially Heterogeneous Exposure-Lag-Response Associations.

Statistics in medicine·2026
Same journal

Practical Considerations for Gaussian Process Modeling for Causal Inference in Quasi-Experimental Studies With Panel Data.

Statistics in medicine·2026
Same journal

Covariate Adjustment for Wilcoxon Two Sample Statistic and Test.

Statistics in medicine·2026
See all related articles

Related Experiment Video

Updated: Jul 27, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K

Identifying potential significant factors impacting zero-inflated proportion data.

Mélina Ribaud1, Edith Gabriel1, Joseph Hughes2

  • 1INRAE, BioSP, Avignon, France.

Statistics in Medicine
|June 8, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a novel permutation-based method to identify key factors influencing zero-inflated proportion data (ZIPD). The approach effectively explains correlations and predicts response variable ranks in epidemiological data.

Keywords:
COVID-19Spearman's correlationequine influenzaperformance indicatorpermutation testranking

More Related Videos

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K

Related Experiment Videos

Last Updated: Jul 27, 2025

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index
06:55

Inverse Probability of Treatment Weighting Propensity Score using the Military Health System Data Repository and National Death Index

Published on: January 8, 2020

14.5K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.2K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.1K

Area of Science:

  • Biostatistics
  • Epidemiology
  • Data Science

Background:

  • Classical supervised methods struggle with dependent, continuous, bounded, and zero-inflated proportion data (ZIPD).
  • Identifying significant factors for ZIPD is crucial in fields like epidemiology.

Purpose of the Study:

  • To propose a novel within-block permutation-based methodology for identifying factors impacting ZIPD.
  • To develop a performance indicator for quantifying explained correlation by significant factors.
  • To enable prediction of response variable ranks based on observed factors.

Main Methods:

  • A within-block permutation-based approach is utilized.
  • The methodology identifies discrete or continuous factors significantly correlated with ZIPD.
  • A performance indicator is proposed to measure the explained correlation percentage.

Main Results:

  • The proposed methodology effectively identifies significant factors for ZIPD.
  • A performance indicator quantifies the explanatory power of identified factors.
  • The approach successfully predicts response variable ranks in simulated and real-world epidemiological data.

Conclusions:

  • The developed methodology offers a robust solution for analyzing ZIPD.
  • This approach enhances understanding of transmission probabilities (e.g., Influenza) and mortality dynamics (e.g., COVID-19).
  • The method provides valuable tools for epidemiological research and data analysis.