Optimal False Discovery Rate Control for Dependent Data.

Jichun Xie¹, T Tony Cai, John Maris

¹Department of Statistics, The Fox School of Business and Management, Temple University, jichun@temple.edu.

Statistics and Its Interface

February 5, 2013

Summary

This summary is machine-generated.

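This study introduces an optimal procedure for controlling the false discovery rate (FDR) when test statistics are dependent. In simulations, the proposed method controls the FDR while achieving a lower false non-discovery rate (FNDR) than p-value-based procedures, and in a neuroblastoma genome-wide association study it identifies additional potentially associated genetic variants.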

Area of Science:

Statistics
Bioinformatics
Genetics

Background:

Controlling the false discovery rate (FDR) is crucial in high-dimensional data analysis, especially when test statistics exhibit dependencies.
Existing methods often struggle to maintain optimal performance under dependent test statistics.
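
For reference, the FDR and its counterpart, the false non-discovery rate (FNDR), are commonly defined as below. The notation is an assumption, since this summary gives no formulas: m is the number of hypotheses tested, R the number rejected, V the number of true nulls among the rejections, and W the number of non-nulls among the m - R hypotheses that are not rejected. The paper itself may work with closely related marginal variants of these quantities.

```latex
\mathrm{FDR} = \mathbb{E}\!\left[\frac{V}{\max(R, 1)}\right],
\qquad
\mathrm{FNDR} = \mathbb{E}\!\left[\frac{W}{\max(m - R, 1)}\right]
```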

Purpose of the Study:

To develop an optimal procedure for false discovery rate control with dependent test statistics.
To propose a data-driven method that approximates the optimal procedure for multivariate normal data.
To evaluate the performance of the proposed method against existing FDR controlling procedures.

Main Methods:

Development of an optimal joint oracle procedure to minimize the false non-discovery rate (FNDR) under an FDR constraint.
Proposal of a data-driven marginal plug-in procedure for approximating the joint procedure.
Asymptotic optimality analysis for multivariate normal data with short-range dependent covariance structures.
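
The idea of a marginal plug-in rule can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the authors' implementation: it assumes a two-group normal mixture whose null proportion `pi0` and alternative mean `mu1` are known (a real plug-in procedure would estimate them from the data), it scores each test by its marginal local false discovery rate, and it uses the running-average step-up rule as the thresholding step. The helper names `local_fdr` and `lfdr_step_up` are hypothetical.

```python
import math


def normal_pdf(x, mean=0.0, sd=1.0):
    """Density of N(mean, sd^2) evaluated at x."""
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))


def local_fdr(x, pi0, mu1, sd1=1.0):
    """Marginal posterior probability that x was drawn from the null N(0, 1).

    Assumed model: x ~ pi0 * N(0, 1) + (1 - pi0) * N(mu1, sd1^2).
    """
    f0 = pi0 * normal_pdf(x)
    f1 = (1.0 - pi0) * normal_pdf(x, mu1, sd1)
    return f0 / (f0 + f1)


def lfdr_step_up(lfdr, alpha):
    """Reject the k hypotheses with the smallest local-fdr values, where k is
    the largest count whose running mean of sorted values is <= alpha.

    Returns the rejected indices in ascending order.
    """
    order = sorted(range(len(lfdr)), key=lambda i: lfdr[i])
    running_sum = 0.0
    k = 0
    for rank, i in enumerate(order, start=1):
        running_sum += lfdr[i]
        if running_sum / rank <= alpha:
            k = rank
    return sorted(order[:k])


# Illustrative z-scores (made up for this sketch, not data from the study).
z_scores = [0.1, -0.4, 3.2, 4.1, 0.8, 2.9]
scores = [local_fdr(z, pi0=0.8, mu1=3.0) for z in z_scores]
rejected = lfdr_step_up(scores, alpha=0.1)
```

Because each score depends only on its own observation, the rule ignores the dependence between tests when ranking them. The paper's asymptotic analysis concerns when a marginal rule of this kind remains optimal despite that.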

Main Results:

The marginal plug-in procedure is shown to be asymptotically optimal for multivariate normal data with short-range dependent covariance structures.
Numerical simulations demonstrate effective FDR control and a reduced FNDR compared to p-value-based methods.
Application to a neuroblastoma genome-wide association study (GWAS) identified additional potentially associated genetic variants.
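
A simulation comparison of this kind needs a p-value-based baseline and a way to score each procedure against the known truth. The sketch below is an assumption-laden illustration: the summary does not say which p-value methods were compared, so the Benjamini-Hochberg step-up procedure stands in as a familiar example, and the function names are hypothetical. For one simulated data set it returns the false discovery proportion and the false non-discovery proportion; averaging these over many replications estimates the FDR and FNDR.

```python
def benjamini_hochberg(pvalues, alpha):
    """Benjamini-Hochberg step-up procedure.

    Rejects the k smallest p-values, where k is the largest rank with
    p_(k) <= alpha * k / m. Returns rejected indices in ascending order.
    """
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    k = 0
    for rank, i in enumerate(order, start=1):
        if pvalues[i] <= alpha * rank / m:
            k = rank
    return sorted(order[:k])


def error_proportions(rejected, is_null):
    """Return (FDP, FNDP) for one data set with known ground truth.

    FDP  = true nulls rejected / number rejected (0 if nothing is rejected)
    FNDP = non-nulls not rejected / number not rejected (0 if all rejected)
    """
    rejected = set(rejected)
    m = len(is_null)
    false_discoveries = sum(1 for i in rejected if is_null[i])
    missed = sum(1 for i in range(m) if i not in rejected and not is_null[i])
    n_rejected = len(rejected)
    n_accepted = m - n_rejected
    fdp = false_discoveries / n_rejected if n_rejected else 0.0
    fndp = missed / n_accepted if n_accepted else 0.0
    return fdp, fndp


# Toy inputs, made up for illustration only.
pvals = [0.001, 0.8, 0.012, 0.04, 0.6]
truth_is_null = [False, True, True, False, True]
bh_rejected = benjamini_hochberg(pvals, alpha=0.05)
fdp, fndp = error_proportions(bh_rejected, truth_is_null)
```

The same `error_proportions` helper can score any rejection set, so a local-fdr-based rule and a p-value-based rule can be compared on identical simulated data.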

Conclusions:

The proposed marginal procedure offers an effective approach for FDR control with dependent data.
This method enhances the discovery of relevant genetic variants in complex association studies.
The procedure provides a valuable tool for statistical inference in genomics and other fields with dependent data.