Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Binomial Probability Distribution01:15

Binomial Probability Distribution

A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...
Parametric Survival Analysis: Weibull and Exponential Methods01:14

Parametric Survival Analysis: Weibull and Exponential Methods

Parametric survival analysis models survival data by assuming a specific probability distribution for the time until an event occurs. The Weibull and exponential distributions are two of the most commonly used methods in this context, due to their versatility and relatively straightforward application.
Weibull Distribution
The Weibull distribution is a flexible model used in parametric survival analysis. It can handle both increasing and decreasing hazard rates, depending on its shape parameter...
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures from...
Distributions to Estimate Population Parameter01:26

Distributions to Estimate Population Parameter

The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...
Poisson Probability Distribution01:09

Poisson Probability Distribution

A Poisson probability distribution is a discrete probability distribution. It gives the probability of a number of events occurring in a fixed interval of time or space if these events happen at a known average rate and independently of the time since the last event. For example, a book editor might be interested in the number of words spelled incorrectly in a particular book. It might be that, on average, there are five words spelled incorrectly in 100 pages. The interval is 100 pages.
The...
Multiple Regression01:25

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

The BILAG-2004 index is associated with development of new damage in SLE.

Rheumatology (Oxford, England)·2022
Same author

Comparison of Responsiveness of British Isles Lupus Assessment Group 2004 Index, Systemic Lupus Erythematosus Disease Activity Index 2000, and British Isles Lupus Assessment Group 2004 Systems Tally.

Arthritis care & research·2021
Same author

Understanding between-cluster variation in prevalence and limits for how much variation is plausible.

Statistical methods in medical research·2020
Same author

Letter to the Editor: Is the R coefficient of interest in cluster randomized trials with a binary outcome?

Statistical methods in medical research·2020
Same author

Methodological standards for the development and evaluation of clinical prediction rules: a review of the literature.

Diagnostic and prognostic research·2019
Same author

Correlated multistate models for multiple processes: an application to renal disease progression in systemic <i>lupus erythematosus</i>.

Journal of the Royal Statistical Society. Series C, Applied statistics·2019
Same journal

A Bayesian functional concurrent zero-inflated Dirichlet-multinomial regression model with application to infant microbiome.

Biostatistics (Oxford, England)·2026
Same journal

Towards optimal environmental policies: policy learning under arbitrary bipartite network interference.

Biostatistics (Oxford, England)·2026
Same journal

Multilevel functional quantile principal component analysis.

Biostatistics (Oxford, England)·2026
Same journal

Adaptive transfer learning for time-to-event modeling with applications in disease risk assessment.

Biostatistics (Oxford, England)·2026
Same journal

High-dimensional test for one-sided hypotheses.

Biostatistics (Oxford, England)·2026
Same journal

NBSR: a Negative Binomial Softmax Regression model for microRNA-seq data analysis.

Biostatistics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: May 16, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Dirichlet negative multinomial regression for overdispersed correlated count data.

Daniel M Farewell1, Vernon T Farewell

  • 1Cochrane Institute of Primary Care and Public Health, Cardiff University, Neuadd Meirionnydd, Heath Park, Cardiff CF14 4YS, UK. farewelld@cf.ac.uk

Biostatistics (Oxford, England)
|December 11, 2012
PubMed
Summary
This summary is machine-generated.

This study introduces a new random effects model for the Dirichlet negative multinomial distribution. This model offers improved accuracy over generalized estimating equations, especially in clustered data with high correlation.

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Related Experiment Videos

Last Updated: May 16, 2026

Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Area of Science:

  • Statistics
  • Biostatistics
  • Statistical Modeling

Background:

  • The Dirichlet negative multinomial distribution is a flexible model for overdispersed count data.
  • Existing regression models may not adequately capture complex correlation structures in clustered data.
  • Generalized estimating equations (GEE) are commonly used but can be sensitive to model misspecification.

Purpose of the Study:

  • To develop a generic random effects formulation for the Dirichlet negative multinomial distribution.
  • To introduce a convenient regression parameterization for this distribution.
  • To compare the performance of the proposed model against GEE in terms of estimation accuracy.

Main Methods:

  • Development of a random effects formulation for the Dirichlet negative multinomial distribution.
  • Introduction of a regression parameterization simplifying model estimation.
  • Conducting a simulation study to evaluate model performance under various correlation structures.
  • Application of the model to a clinical trial recruitment dataset.

Main Results:

  • The proposed Dirichlet negative multinomial regression model demonstrated smaller median absolute error compared to GEE.
  • Performance improvement was particularly significant in scenarios with high correlation between observations within clusters.
  • The model effectively estimated explanatory variable effects and sources of variation.

Conclusions:

  • The developed random effects Dirichlet negative multinomial model provides a robust alternative for analyzing clustered count data.
  • It offers enhanced accuracy and better handling of correlation compared to traditional methods like GEE.
  • The model is applicable to real-world data, such as in clinical trial recruitment studies.