Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

3.4K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
3.4K
Outliers and Influential Points01:08

Outliers and Influential Points

5.7K
An outlier is an observation of data that does not fit the rest of the data. It is sometimes called an extreme value. When you graph an outlier, it will appear not to fit the pattern of the graph. Some outliers are due to mistakes (for example, writing down 50 instead of 500), while others may indicate that something unusual is happening. Outliers are present far from the least squares line in the vertical direction. They have large "errors," where the "error" or residual is the...
5.7K
What Are Outliers?01:12

What Are Outliers?

4.8K
Outliers are observed data points that are far from the least squares line. They have unusual values and need to be examined carefully. Though an outlier may result from erroneous data, at other times, it may hold valuable information about the population under study and should be included in the data. Hence, it is crucial to examine what causes a data point to be an outlier.
The z score is used to find outliers or unusual values. It should be noted that any values beyond -2 and +2 are...
4.8K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.8K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.8K
DNA Microarrays02:34

DNA Microarrays

20.4K
Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
20.4K
Modified Boxplots00:57

Modified Boxplots

10.8K
A standard box and whisker plot informs us about the spread of the data in a given sample. One can identify the minimum value, maximum value, first quartile value, second quartile or median value, and third quartile.
However, the box plot does not tell the reader about outliers - values that lie far from the center of the data. We can modify the standard box and whisker plot to identify the outliers and visualize the actual spread of the data in a sample.
Initially, we calculate the adjusted...
10.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A Pan-pangenome illuminates complex structural variation and selection in humans, chimpanzees, and bonobos.

bioRxiv : the preprint server for biology·2026
Same author

Haplotype-resolved DNA methylation at the <i>APOE</i> locus identifies allele-specific epigenetic signatures relevant to Alzheimer's disease risk.

NPJ dementia·2026
Same author

Lossless pangenome indexing using tag arrays.

Algorithms for molecular biology : AMB·2026
Same author

Population-scale Y chromosome assemblies reveal recurrent remodeling within constrained architectures.

bioRxiv : the preprint server for biology·2026
Same author

Accessing medically relevant complex regions with a pangenome graph of 20 near-complete Japanese haplotypes.

Nature communications·2026
Same author

A complete human pancreatic cancer genome.

bioRxiv : the preprint server for biology·2026
Same journal

Bayesian Methods for Subgroup Efficacy and Safety: Application to Japanese Patients in JAVELIN Renal 101.

JCO clinical cancer informatics·2026
Same journal

Effect of a Multidimensional Digital Health Intervention on Quality of Life in Breast Cancer Survivors: A Randomized Controlled Trial.

JCO clinical cancer informatics·2026
Same journal

Can Small Open-Source Language Models With Retrieval-Augmented Generation Match GPT-4 Performance in Breast Cancer Clinical Decision Support?

JCO clinical cancer informatics·2026
Same journal

Machine Learning Algorithm for the Detection of Tumor Microsatellite Instability Based on Multiomics Biomarkers.

JCO clinical cancer informatics·2026
Same journal

Foundation Model-Driven Regions of Interest Classification and Renaming in Cancer Radiotherapy: A Customizable, Retraining-Free Workflow Across Institutions.

JCO clinical cancer informatics·2026
Same journal

Announcing a New Article Type in <i>JCO Clinical Cancer Informatics</i>: The Resource Report.

JCO clinical cancer informatics·2026
See all related articles

Related Experiment Video

Updated: Dec 27, 2025

Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

18.7K

Bayesian Framework for Detecting Gene Expression Outliers in Individual Samples.

John Vivian1, Jordan M Eizenga1, Holly C Beale2

  • 1Computational Genomics Laboratory, University of California, Santa Cruz, Santa Cruz, CA.

JCO Clinical Cancer Informatics
|February 26, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a new Bayesian framework for detecting gene expression outliers in single patient samples. The method accurately quantifies gene overexpression without needing a matched comparison set, improving cancer diagnostics.

More Related Videos

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Related Experiment Videos

Last Updated: Dec 27, 2025

Competitive Genomic Screens of Barcoded Yeast Libraries
11:59

Competitive Genomic Screens of Barcoded Yeast Libraries

Published on: August 11, 2011

18.7K
A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types
12:39

A Novel Bayesian Change-point Algorithm for Genome-wide Analysis of Diverse ChIPseq Data Types

Published on: December 10, 2012

11.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

1.2K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Targeting upregulated genes with antineoplastics requires accurate quantification of gene expression.
  • Quantifying upregulation in single patient samples is challenging due to high intersample variance and limited availability of matched unaffected tissue samples.
  • Existing differential expression tools often fail to accommodate comparisons to single patient samples.

Purpose of the Study:

  • To develop a Bayesian statistical framework for robust gene expression outlier detection in individual patient samples.
  • To enable accurate quantification of gene over- and underexpression without a manually selected comparison set.
  • To address limitations in current methods for analyzing single-sample gene expression data in clinical settings.

Main Methods:

  • A Bayesian statistical framework was developed for outlier detection in gene expression data.
  • The method generates a consensus background distribution for each gene using all available data, eliminating the need for manual comparison set selection.
  • This approach quantifies gene over- and underexpression relative to the consensus distribution.

Main Results:

  • The method robustly quantifies gene overexpression, even with suboptimal or mismatched comparison samples.
  • It successfully identifies appropriate comparison sets from mixed-lineage samples.
  • The framework rediscovers known gene-cancer expression patterns, demonstrating its validity on simulated and real-world data.

Conclusions:

  • This exploratory method is effective for identifying expression outliers in comparative RNA sequencing (RNA-seq) analysis of individual samples.
  • The approach is being considered by Treehouse, a pediatric precision medicine group, for processing its RNA-seq data to identify therapeutic leads.
  • The framework offers a novel solution for single-sample gene expression analysis in precision medicine.