Turning Vice into Virtue: Using Batch-Effects to Detect Errors in Large Genomic Data Sets.

Fabrizio Mafessoni¹, Rashmi B Prasad², Leif Groop²,³

¹Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.

Genome Biology and Evolution

September 12, 2018

Summary

This summary is machine-generated.

Combining sequencing data from multiple sources can introduce systematic errors. This study presents a method that detects these errors by analyzing pairs of variants on different chromosomes that co-occur within individuals, improving data quality in large genomic datasets.

Area of Science:

Genomics
Bioinformatics
Computational Biology

Background:

Large-scale genomic studies often require combining data from diverse sequencing centers and platforms.
Heterogeneity in data generation introduces systematic errors, complicating variant detection and analysis.
Existing methods struggle to comprehensively identify and correct these batch effects.

Purpose of the Study:

To develop and validate a novel computational method for detecting systematic errors in combined genomic datasets.
To quantify the prevalence of such errors, particularly in coding versus non-coding regions.
To demonstrate the utility of batch effects for identifying data quality issues.

Main Methods:

Devised a method analyzing pairs of variants on different chromosomes that co-occur within individuals.
Studied the abundance of these variant pairs across different genomes to identify systematic errors (batch effects).
Applied the method to the 1000 Genomes dataset and compared findings with data from different sequencing technologies.
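
The pair-based idea in the methods above can be illustrated with a minimal sketch. It is not the authors' implementation: the correlation statistic, the threshold of 0.8, the function names, and the toy data are all assumptions chosen for illustration. The reasoning is that variants on different chromosomes are inherited independently, so a strong association between two of them across individuals is more plausibly a batch effect than a biological signal.

```python
from itertools import combinations
from math import sqrt


def carrier_correlation(a, b):
    """Pearson (phi) correlation between two 0/1 carrier vectors.

    a[i] and b[i] are 1 if individual i carries the variant, else 0.
    Returns 0.0 when either variant is carried by nobody or by everybody.
    """
    n = len(a)
    n_a = sum(a)
    n_b = sum(b)
    n_ab = sum(x * y for x, y in zip(a, b))  # individuals carrying both
    denom = sqrt(n_a * (n - n_a) * n_b * (n - n_b))
    if denom == 0:
        return 0.0
    return (n * n_ab - n_a * n_b) / denom


def flag_interchromosomal_pairs(variants, threshold=0.8):
    """Return (name1, name2, r) for pairs on different chromosomes with r >= threshold.

    variants maps a variant name to (chromosome, carrier_vector).
    The threshold is an illustrative assumption, not a value from the study.
    """
    flagged = []
    for (name1, (chrom1, g1)), (name2, (chrom2, g2)) in combinations(variants.items(), 2):
        if chrom1 == chrom2:
            continue  # same-chromosome pairs may be associated through physical linkage
        r = carrier_correlation(g1, g2)
        if r >= threshold:
            flagged.append((name1, name2, r))
    return flagged


# Hypothetical toy data: individuals 0-3 come from one batch, 4-7 from another.
toy_variants = {
    "varA": ("chr1", [1, 1, 1, 1, 0, 0, 0, 0]),  # seen only in the first batch
    "varB": ("chr7", [1, 1, 1, 1, 0, 0, 0, 0]),  # seen only in the first batch
    "varC": ("chr2", [1, 0, 1, 0, 1, 0, 1, 0]),  # spread evenly across batches
}

for name1, name2, r in flag_interchromosomal_pairs(toy_variants):
    print(name1, name2, round(r, 3))
```

In this toy case only the pair that tracks the batch boundary is flagged. A real analysis would also need to account for population structure and multiple testing, which this sketch ignores.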

Main Results:

Identified systematic errors enriched in coding regions of the 1000 Genomes dataset, affecting ~1% of high-frequency variants.
Errors outside coding regions were far rarer (<0.001%).
Variants predicted to be errors were less frequent in data generated with different sequencing technologies, which supports the predictions; such errors were also observed in other large datasets.

Conclusions:

The developed method effectively detects systematic errors arising from combining diverse genomic data.
Batch effects, often viewed as a nuisance, can be leveraged to improve genomic data quality.
Findings highlight the importance of accounting for data generation variability in large-scale genomic analyses.