Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This number is...

Detection of Gross Error: The Q Test

Detection of Gross Error: The Q Test

When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...

Outliers and Influential Points

Outliers and Influential Points

An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the vertical...

What Are Outliers?

What Are Outliers?

Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...

Genome-wide Association Studies-GWAS

Genome-wide Association Studies-GWAS

Genome-wide association studies or GWAS are used to identify whether common SNPs are associated with certain diseases. Suppose specific SNPs are more frequently observed in individuals with a particular disease than those without the disease. In that case, those SNPs are said to be associated with the disease. Chi-square analysis is performed to check the probability of the allele likely to be associated with the disease.
GWAS does not require the identification of the target gene involved in...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance, comparing...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Glycemic response trajectories on metformin monotherapy in real-world diabetes care.

medRxiv : the preprint server for health sciences·2026

Same author

Robust ranking of renewable energy alternatives handling uncertainty using novel hesitant bi-fuzzy MEREC-MOORA and Dombi aggregation approach.

Scientific reports·2026

Same author

The Impact of Social Vulnerability on Exercise Outcomes: A Longitudinal Study of Physical Function in Older People With HIV.

Journal of the International Association of Providers of AIDS Care·2026

Same author

Special issue: cell and gene causal inference in the design and analysis of gene therapy clinical trials.

Journal of biopharmaceutical statistics·2026

Same author

Mapping the last mile: Micro-stratification for sustained visceral leishmaniasis elimination in Bangladesh.

PLoS neglected tropical diseases·2026

Same author

The effects of high-intensity interval training versus continuous moderate-intensity exercise on body composition among older adults with HIV.

The journals of gerontology. Series A, Biological sciences and medical sciences·2026

Same journal

Correction.

Journal of biopharmaceutical statistics·2026

Same journal

Leveraging external controls in clinical trials: estimands, estimation, assumptions.

Journal of biopharmaceutical statistics·2026

Same journal

Special issue of nonclinical statistics in regulatory applications guest editors' notes.

Journal of biopharmaceutical statistics·2026

Same journal

Comparison of flexible parametric modeling and nonparametric methods to estimate restricted mean survival time: A simulation study.

Journal of biopharmaceutical statistics·2026

Same journal

Simulated treatment comparisons with jackknife pseudo values for estimating population-adjusted marginal treatment effects.

Journal of biopharmaceutical statistics·2026

Same journal

Sample sizes for randomized controlled trials utilizing Bayesian response adaptive randomization for continuous outcomes.

Journal of biopharmaceutical statistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 14, 2026

Competitive Genomic Screens of Barcoded Yeast Libraries

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

Discrete nonparametric algorithms for outlier detection with genomic data.

Debashis Ghosh¹

¹Department of Statistics, Penn State University, University Park, Pennsylvania, USA. ghoshd@psu.edu

Journal of Biopharmaceutical Statistics

|March 24, 2010

Summary

This summary is machine-generated.

This study focuses on selecting appropriate test statistics for differential gene expression analysis in high-throughput studies. It proposes using q-value estimation for discrete p-values, enhancing the analysis of outlying expression values.

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Related Experiment Videos

Last Updated: Jun 14, 2026

Competitive Genomic Screens of Barcoded Yeast Libraries

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

Area of Science:

Genomics
Bioinformatics
Statistical Genetics

Background:

Differential expression analysis is crucial for high-throughput genetic studies, particularly with gene expression microarrays.
The choice of test statistic in multiple comparisons for differential expression analysis is often overlooked.
Discrete p-values present a challenge in standard multiple comparison procedures.

Purpose of the Study:

To investigate the impact of test statistic choice on differential expression analysis.
To adapt multiple comparison procedures for assessing outlying gene expression values.
To explore theoretical properties of sequential testing with discrete p-values.

Main Methods:

Recasting multiple-comparison procedures to assess outlying expression values.
Theoretical exploration of sequential testing procedures for discrete p-values.
Application of q-value estimation procedures to differential expression analysis.

Main Results:

The study highlights the importance of test statistic selection in identifying significant differential expression.
Proposed methods address the complexities arising from discrete p-values in gene expression data.
Demonstrated utility of q-value estimation in a prostate cancer gene expression profiling experiment.

Conclusions:

The choice of test statistic significantly impacts differential gene expression analysis outcomes.
Q-value estimation provides a robust approach for handling discrete p-values in this context.
The methodology offers improved analytical tools for high-throughput genetic studies.