Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Censoring Survival Data01:09

Censoring Survival Data

Survival analysis is a statistical method used to analyze time-to-event data, often employed in fields such as medicine, engineering, and social sciences. One of the key challenges in survival analysis is dealing with incomplete data, a phenomenon known as "censoring." Censoring occurs when the event of interest (such as death, relapse, or system failure) has not occurred for some individuals by the end of the study period or is otherwise unobservable, and it might have many different reasons...
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures from...
Methods of Medium Optimization01:28

Methods of Medium Optimization

Optimizing growth media enhances microbial proliferation and maximizes product yield. Statistical experimental design methodologies provide structured and reproducible approaches, offering progressively higher levels of robustness and efficiency.The One-Factor-at-a-Time (OFAT) MethodThe One-Factor-at-a-Time (OFAT) method involves adjusting a single variable while keeping all others constant. However, it cannot detect interactions between variables, often leading to suboptimal outcomes when...
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
Strategies for Assessing and Addressing Confounding01:25

Strategies for Assessing and Addressing Confounding

Confounding is a critical issue in epidemiological studies, often leading to misleading conclusions about associations between exposures and outcomes. It occurs when the relationship between the exposure and the outcome is mixed with the effects of other factors that influence the outcome. Given that, addressing confounding is of high importance for drawing accurate inferences in research.
Confounding can be addressed at both the design phase of a study and through analytical methods after data...
Multiple Allele Traits01:49

Multiple Allele Traits

The Concept of Multiple Allelism

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Regression with race-modifiers: towards equity and interpretability.

medRxiv : the preprint server for health sciences·2024
Same author

Racial residential segregation shapes the relationship between early childhood lead exposure and fourth-grade standardized test scores.

Proceedings of the National Academy of Sciences of the United States of America·2022
Same author

Subset selection for linear mixed models.

Biometrics·2022
Same author

Semiparametric count data regression for self-reported mental health.

Biometrics·2021
Same author

Bayesian variable selection for understanding mixtures in environmental exposures.

Statistics in medicine·2021
Same author

Integer-valued functional data analysis for measles forecasting.

Biometrics·2019
Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026
Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026
Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026
Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026
Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026
Same journal

SMART-MC: Characterizing the Dynamics of Multiple Sclerosis Therapy Transitions Using a Covariate-Based Markov Model.

Journal of the American Statistical Association·2026
See all related articles

Related Experiment Videos

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Daniel R Kowal1

  • 1Department of Statistics and Data Science, Cornell University, Ithaca, NY.

Journal of the American Statistical Association
|June 12, 2026
PubMed
Summary
This summary is machine-generated.

Abundance-based constraints (ABCs) offer a new way to analyze regression models with categorical interactions. This method ensures main effects remain unchanged and improves statistical power, allowing for better discovery of group-specific effects.

Keywords:
Discrete dataInteractionsPenalized EstimationRegression analysis

Related Experiment Videos

Area of Science:

  • Statistics
  • Regression Analysis
  • Econometrics

Background:

  • Categorical covariates are common in regression analysis.
  • Standard linear models with categorical interactions can alter main effect estimates and introduce bias.
  • Existing methods struggle with interpretable and equitable analysis of group-specific effects.

Purpose of the Study:

  • Introduce abundance-based constraints (ABCs) as an alternative parameterization for linear models.
  • Demonstrate that ABCs leave main effect estimates unchanged when categorical interactions are added.
  • Showcase ABCs' ability to enhance statistical power and maintain interpretability.

Main Methods:

  • Developed and applied a novel parameterization and estimation scheme using abundance-based constraints (ABCs).
  • Utilized simulated data to verify the invariance properties of ABCs for estimation and inference.
  • Employed ABCs to analyze demographic heterogeneities in STEM educational outcomes.

Main Results:

  • ABCs ensure that adding categorical interactions does not alter main effect estimates.
  • ABCs enhance the statistical power of main effects under reasonable conditions.
  • The R package lmabc facilitates the application of this method.

Conclusions:

  • ABCs provide an interpretable and equitable approach to regression analysis with categorical interactions.
  • Analysts can confidently include categorical interactions to uncover group-specific effects without compromising main effects.
  • This method has practical applications in fields like education research, as demonstrated by the North Carolina STEM outcomes study.