Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Statistical Hypothesis Testing

Statistical Hypothesis Testing

Hypothesis testing is a critical statistical procedure facilitating informed, evidence-based decisions. It begins with a hypothesis, which is a tentative explanation, or a prediction about a population parameter. This hypothesis can be either a null hypothesis (H0), indicating no effect or difference, or an alternative hypothesis (Ha), suggesting an effect or difference.
Statistical significance measures the probability that an observed result occurred by chance. If this probability, known as...

Hypothesis Test for Test of Independence

Hypothesis Test for Test of Independence

The test of independence is a chi-square-based test used to determine whether two variables or factors are independent or dependent. This hypothesis test is used to examine the independence of the variables. One can construct two qualitative survey questions or experiments based on the variables in a contingency table. The goal is to see if the two variables are unrelated (independent) or related (dependent). The null and alternative hypotheses for this test are:
H0: The two variables (factors)...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Types of Hypothesis Testing

Types of Hypothesis Testing

There are three types of hypothesis tests: right-tailed, left-tailed, and two-tailed.
When the null and alternative hypotheses are stated, it is observed that the null hypothesis is a neutral statement against which the alternative hypothesis is tested. The alternative hypothesis is a claim that instead has a certain direction. If the null hypothesis claims that p = 0.5, the alternative hypothesis would be an opposing statement to this and can be put either p > 0.5, p < 0.5, or p...

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Errors In Hypothesis Tests

Errors In Hypothesis Tests

When performing a hypothesis test, there are four possible outcomes depending on the actual truth (or falseness) of the null hypothesis and the decision to reject or not.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

STELLAR: A flexible ensemble learning framework integrating rare variants to enhance polygenic risk prediction.

medRxiv : the preprint server for health sciences·2026

Same author

Statistics and AI - A Fireside Conversation.

Harvard data science review·2026

Same author

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026

Same author

TESTING FOR THE CAUSAL MEDIATION EFFECTS OF MULTIPLE MEDIATORS USING THE KERNEL MACHINE DIFFERENCE METHOD IN GENOME-WIDE EPIGENETIC STUDIES.

The annals of applied statistics·2026

Same author

Associations between women's childhood maltreatment and thyroid function before and during pregnancy.

Scientific reports·2026

Same author

Sodium valproate induces pancreatic injury by disruption of one-carbon metabolism.

British journal of pharmacology·2026

Same journal

Towards a Unified Theory for Semiparametric Data Fusion with Individual-Level Data.

Annals of statistics·2026

Same journal

One-Step Estimation of Differentiable Hilbert-Valued Parameters.

Annals of statistics·2026

Same journal

GENERALIZATION ERROR BOUNDS OF DYNAMIC TREATMENT REGIMES IN PENALIZED REGRESSION-BASED LEARNING.

Annals of statistics·2026

Same journal

EFFICIENT AND MULTIPLY ROBUST RISK ESTIMATION UNDER GENERAL FORMS OF DATASET SHIFT.

Annals of statistics·2026

Same journal

TESTING HIGH-DIMENSIONAL REGRESSION COEFFICIENTS IN LINEAR MODELS.

Annals of statistics·2026

Same journal

COUNTERFACTUAL INFERENCE IN SEQUENTIAL EXPERIMENTS.

Annals of statistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

HYPOTHESIS TESTING FOR HIGH-DIMENSIONAL SPARSE BINARY REGRESSION.

Rajarshi Mukherjee¹, Natesh S Pillai², Xihong Lin³

¹Department of Statistics, Stanford University, Sequoia Hall, 390 Serra Mall, Stanford, California 94305-4065, USA.

Annals of Statistics

|August 7, 2015

Summary

This summary is machine-generated.

This study explores hypothesis testing for rare genetic variants in high-dimensional binary regression. We identified a new detection boundary phenomenon influenced by data sparsity and signal strength, crucial for association studies.

Keywords:

Higher Criticism Minimax hypothesis testing binary regression detection boundary sparsity

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

Related Experiment Videos

Last Updated: Apr 5, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

The Innovation Arena: A Method for Comparing Innovative Problem-Solving Across Groups

Published on: May 13, 2022

Area of Science:

Statistics
Genetics
High-dimensional data analysis

Background:

Hypothesis testing is crucial for identifying genetic variants associated with diseases.
High-dimensional and sparse binary regression models present unique challenges in statistical inference.
Existing methods may not fully capture the complexities of rare variant detection in sparse genetic data.

Purpose of the Study:

To investigate the detection boundary for minimax hypothesis testing in high-dimensional, sparse binary regression.
To understand the impact of design matrix sparsity on hypothesis testing power.
To develop and evaluate optimal tests for detecting rare variant effects.

Main Methods:

Derivation of the detection boundary as a function of design matrix sparsity index and signal strength.
Analysis of asymptotic power for different sparsity levels.
Development of an extended Higher Criticism Test for sparse regimes.
Comparison with the generalized likelihood ratio test in dense regimes.

Main Results:

A novel phenomenon in detection boundary behavior specific to sparse binary regression was observed.
The detection boundary is critically dependent on the design matrix sparsity index and signal strength.
For high sparsity, tests become asymptotically powerless regardless of signal strength.
The extended Higher Criticism Test is shown to be rate optimal and sharp in the sparse regime.

Conclusions:

The study provides a theoretical framework for understanding hypothesis testing in sparse binary regression.
Optimal testing strategies depend heavily on the sparsity characteristics of the genetic data.
The proposed extended Higher Criticism Test offers an effective solution for rare variant detection in sparse settings.