Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Statistical Analysis: Overview01:11

Statistical Analysis: Overview

14.0K
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
14.0K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.5K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.5K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.8K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.9K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.9K
Variability: Analysis01:11

Variability: Analysis

394
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
394
Systematic Error: Methodological and Sampling Errors01:15

Systematic Error: Methodological and Sampling Errors

8.4K
In the case of systematic errors, the sources can be identified, and the errors can be subsequently minimized by addressing these sources. According to the source, systematic errors can be divided into sampling, instrumental, methodological, and personal errors.
Sampling errors originate from improper sampling methods or the wrong sample population. These errors can be minimized by refining the sampling strategy. Defective instruments or faulty calibrations are the sources of instrumental...
8.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Prognostic data extraction harnessing a privacy-preserving large language model: a clinician-AI collaborative retrospective evaluation in head and neck oncology.

NPJ precision oncology·2026
Same author

Effective Performance of the 2022 American College of Rheumatology/EULAR Classification Criteria for Antineutrophil Cytoplasmic Antibody-Associated Vasculitis in Pediatric Patients: An ARChiVe Study.

Arthritis & rheumatology (Hoboken, N.J.)·2026
Same author

The Alberta Quality Assessment Tool: Risk of Bias (AQAT:RoB) for the Evaluation of Medical Large Language Model Question-Answer Studies: Development and Pilot Validation.

Journal of medical Internet research·2026
Same author

The Alberta Risk of Bias Assessment Tool (AQAT:RoB) for the Evaluation of Medical Large Language Model Question-Answer Studies: Development and Pilot Validation.

Journal of medical Internet research·2026
Same author

The effect of Fogg model-based virtual education on health literacy and stages of behavior change in cervical cancer screening: study protocol for a randomized controlled trial.

Trials·2026
Same author

Federated Pseudo-Labeling: A Data-Centric, Privacy-Preserving Framework for Medical Image Segmentation.

IEEE journal of biomedical and health informatics·2025
Same journal

CNV-ECOD: A copy number variation detection method based on ECOD algorithm using next-generation sequencing data.

Journal of bioinformatics and computational biology·2026
Same journal

ReinVar: A model-free paradigm-based reinforcement learning approach to detect copy number variation.

Journal of bioinformatics and computational biology·2026
Same journal

When pipelines run but coordinates fail: A simple spatial specificity check for false locality in post-GWAS analysis.

Journal of bioinformatics and computational biology·2026
Same journal

Comparative benchmarking of template-based, evolutionary-diffusion, and generative language models for IsPETase structure prediction.

Journal of bioinformatics and computational biology·2026
Same journal

Trap spaces as labelled ideals of SCC posets: A structural-functional theory of reachability in asynchronous boolean networks.

Journal of bioinformatics and computational biology·2026
Same journal

Erratum - DDINet: Drug-drug interaction prediction network based on multi-molecular fingerprint features and multi-head attention centered weighted autoencoder.

Journal of bioinformatics and computational biology·2026
See all related articles

Related Experiment Video

Updated: Jan 1, 2026

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.2K

Measuring consistency among gene set analysis methods: A systematic study.

Farhad Maleki1, Katie L Ovens1, Daniel J Hogan1

  • 1Department of Computer Science, University of Saskatchewan, 110 Science Place, Saskatoon SK S7N 5C9, Canada.

Journal of Bioinformatics and Computational Biology
|December 21, 2019
PubMed
Summary
This summary is machine-generated.

The choice of gene set analysis method significantly impacts results from gene expression data. GAGE, PAGE, and ORA showed higher reproducibility in identifying significant gene sets.

Keywords:
Gene set analysisenrichment analysisgene expressionpathway analysis

More Related Videos

Gel-seq: A Method for Simultaneous Sequencing Library Preparation of DNA and RNA Using Hydrogel Matrices
09:19

Gel-seq: A Method for Simultaneous Sequencing Library Preparation of DNA and RNA Using Hydrogel Matrices

Published on: March 26, 2018

9.5K
A Semiautomated ChIP-Seq Procedure for Large-scale Epigenetic Studies
08:04

A Semiautomated ChIP-Seq Procedure for Large-scale Epigenetic Studies

Published on: August 13, 2020

3.9K

Related Experiment Videos

Last Updated: Jan 1, 2026

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research
09:35

A Protocol for Using Gene Set Enrichment Analysis to Identify the Appropriate Animal Model for Translational Research

Published on: August 16, 2017

18.2K
Gel-seq: A Method for Simultaneous Sequencing Library Preparation of DNA and RNA Using Hydrogel Matrices
09:19

Gel-seq: A Method for Simultaneous Sequencing Library Preparation of DNA and RNA Using Hydrogel Matrices

Published on: March 26, 2018

9.5K
A Semiautomated ChIP-Seq Procedure for Large-scale Epigenetic Studies
08:04

A Semiautomated ChIP-Seq Procedure for Large-scale Epigenetic Studies

Published on: August 13, 2020

3.9K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Gene set analysis (GSA) is crucial for interpreting gene expression data.
  • Numerous GSA methods exist, raising concerns about method-dependent results.

Purpose of the Study:

  • To systematically evaluate the impact of different GSA methods on results.
  • To assess the reproducibility and biological relevance of GSA findings across various datasets and methods.

Main Methods:

  • Analysis of 13 popular GSA methods using 6 diverse gene expression datasets (DNA microarray and RNA-Seq).
  • Comparison of the number of reported gene sets, bias towards larger sets, and agreement among top significant gene sets (20 and 100).
  • Biological validation using a juvenile idiopathic arthritis (JIA) dataset.

Main Results:

  • Significant variation in the number of gene sets identified (up to 2 orders of magnitude) and bias towards large gene sets by some methods.
  • Substantial disagreement in top significant gene sets across methods and even across datasets of the same condition.
  • GAGE, PAGE, and ORA demonstrated higher reproducibility for top gene sets.
  • GAGE, GSEA, ORA, and PAGE showed varying degrees of biological relevance for JIA.

Conclusions:

  • The choice of gene set analysis method profoundly influences biological insights derived from gene expression data.
  • Method selection is critical for reliable and reproducible GSA results.
  • GAGE, PAGE, and ORA are recommended for higher reproducibility, with GAGE showing promise in biological relevance.