Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Systematic Error: Methodological and Sampling Errors01:15

Systematic Error: Methodological and Sampling Errors

9.9K
In the case of systematic errors, the sources can be identified, and the errors can be subsequently minimized by addressing these sources. According to the source, systematic errors can be divided into sampling, instrumental, methodological, and personal errors.
Sampling errors originate from improper sampling methods or the wrong sample population. These errors can be minimized by refining the sampling strategy. Defective instruments or faulty calibrations are the sources of instrumental...
9.9K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

6.9K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
6.9K
Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

10.8K
Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
10.8K
Sign Test for Matched Pairs01:17

Sign Test for Matched Pairs

386
The sign test for matched pairs offers a robust method for comparing two paired samples, often for the effects of an intervention in one of them. This method is very useful in situations where the underlying distribution of the data is unknown. The test compares two related samples—often pre- and post-treatment measurements on the same subjects—to determine if there are significant differences in their median values.
To conduct the sign test, we first calculate the differences in...
386
Analysis Methods of Pharmacokinetic Data: Model and Model-Independent Approaches01:14

Analysis Methods of Pharmacokinetic Data: Model and Model-Independent Approaches

499
Drug disposition in the body is a complex process and can be studied using two major approaches: the model and the model-independent approaches.
The model approach uses mathematical models to describe changes in drug concentration over time. Pharmacokinetic models help characterize drug behavior in patients, predict drug concentration in the body fluids, calculate optimum dosage regimens, and evaluate the risk of toxicity. However, ensuring that the model fits the experimental data accurately...
499
Sampling Methods: Sample Types01:18

Sampling Methods: Sample Types

3.1K
Sampling materials are classified into three main types: solid, liquid, and gas.
Solid samples include a variety of substances, such as sediments from water bodies, soil, metals, and biological tissues. Two standard methods for extracting sediments from water bodies are grab sampling and piston coring. Grab sampling involves using a device to collect a discrete sediment sample from the bottom of a water body with minimal disturbance. Grab samples do not always represent the entire area due to...
3.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Real-world and Genomic data-based Asthma Insights through Network Analysis (REGAIN study): protocol for a novel retrospective and prospective longitudinal asthma cohort study.

BMJ open·2026
Same author

Reply.

Journal of the American College of Radiology : JACR·2025
Same author

Professional fulfillment in interventional radiology.

CVIR endovascular·2025
Same author

Diversity in Interventional Radiology Residency Programs.

Journal of the American College of Radiology : JACR·2025
Same author

Method for determining primary radiation shielding thickness of industrial X-ray-generating devices.

Applied radiation and isotopes : including data, instrumentation and methods for use in agriculture, industry and medicine·2025
Same author

MARCH2, a Novel Oncogene-regulated SNAIL E3 Ligase, Suppresses Triple-negative Breast Cancer Metastases.

Cancer research communications·2024
Same journal

NanoporeDB: A Structural Resource Of Multimeric Protein Nanopores For Single-Molecule Sensing.

GigaScience·2026
Same journal

From the Brain Cell Atlas to Precision Neurology: A review of the application of AI-driven multi-omics in brain science.

GigaScience·2026
Same journal

Comparison of Deep Learning Approaches for Extreme Low-SNR Image Restoration.

GigaScience·2026
Same journal

ScopeViewer: A Browser-Based Solution for Visualizing Large Biological Images.

GigaScience·2026
Same journal

ChatMDV: Reducing Technical Barriers in Bioinformatics Analysis using Large Language Models.

GigaScience·2026
Same journal

ClusterGraph: a new tool for visualisation and compression of multidimensional data.

GigaScience·2026
See all related articles

Related Experiment Video

Updated: Jan 22, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K

A probabilistic multi-omics data matching method for detecting sample errors in integrative analysis.

Eunjee Lee1,2,3, Seungyeul Yoo1,2, Wenhui Wang1,2

  • 1Department of Genetics and Genomic Sciences, Icahn School of Medicine at Mount Sinai, One Gustave L. Levy Place, New York, NY 10029, USA.

Gigascience
|July 11, 2019
PubMed
Summary
This summary is machine-generated.

Data errors in large omics datasets are common and can lead to incorrect conclusions. We developed proMODMatcher, a robust method to identify and correct sample labeling errors, ensuring data integrity for reliable scientific discovery.

Keywords:
data curationdata erroromics data integration

More Related Videos

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.1K
The MPLEx Protocol for Multi-omic Analyses of Soil Samples
10:12

The MPLEx Protocol for Multi-omic Analyses of Soil Samples

Published on: May 30, 2018

11.7K

Related Experiment Videos

Last Updated: Jan 22, 2026

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization
08:27

Large-Scale Multi-Omics Genome-Wide Association Studies Mo-GWAS: Guidelines for Sample Preparation and Normalization

Published on: July 27, 2021

4.8K
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

2.1K
The MPLEx Protocol for Multi-omic Analyses of Soil Samples
10:12

The MPLEx Protocol for Multi-omic Analyses of Soil Samples

Published on: May 30, 2018

11.7K

Area of Science:

  • Bioinformatics
  • Genomics
  • Data Science

Background:

  • Large-scale omics data generation is prone to inevitable errors like sample swapping and mislabeling.
  • Data errors can obscure true biological signals and lead to erroneous scientific conclusions.
  • Accurate data annotation is critical for merging different omics data types in integrative analyses.

Purpose of the Study:

  • To develop a robust method for identifying and correcting data annotation and sample errors in large omics databases.
  • To ensure the integrity of multi-omics datasets before integrative analysis.

Main Methods:

  • Developed proMODMatcher, a probabilistic multi-omics data matching procedure.
  • Applied the procedure to simulated datasets to assess statistical power.
  • Utilized proMODMatcher on The Cancer Genome Atlas and International Cancer Genome Consortium multi-omics datasets.

Main Results:

  • proMODMatcher demonstrated robust statistical power, even with small cis-associations or large sample sizes.
  • Identified sample errors in multiple cancer datasets from The Cancer Genome Atlas and International Cancer Genome Consortium.
  • The procedure successfully identified the source of sample-labeling errors.

Conclusions:

  • Sample-labeling errors are prevalent in large multi-omics datasets.
  • Correction of these errors is essential prior to conducting integrative analyses.
  • Ensuring data accuracy is crucial for reliable omics research.