Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Copying Errors02:46

Genome Copying Errors

5.2K
DNA replication is a well-evolved process that copies millions of base pairs with high fidelity during each cell division. Occasionally a wrong base or a long stretch of wrong bases may get added to the daughter strands. If the errors are left unchecked, cells might accumulate several mutations that might endanger their  survival. Therefore, the copying errors are checked and repaired at three levels.
5.2K
Detection of Gross Error: The Q Test01:00

Detection of Gross Error: The Q Test

7.0K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
7.0K
Design Example: Setting a Curve Using Design Data01:09

Design Example: Setting a Curve Using Design Data

239
Designing and plotting a curve using field data requires precise calculations and execution. A horizontal curve with a radius of 200 meters and an intersection angle of 20 degrees is established using the method of perpendicular offsets from the long chord. The long chord, which spans between the curve's endpoints, is calculated to be 69.46 meters in length. To maintain accuracy in plotting, intervals of 3 meters are selected along the chord.The engineer determines the offset distances for each...
239
Types of Errors: Detection and Minimization01:12

Types of Errors: Detection and Minimization

11.4K
Error is the deviation of the obtained result from the true, expected value or the estimated central value. Errors are expressed in absolute or relative terms.
Absolute error in a measurement is the numerical difference from the true or central value. Relative error is the ratio between absolute error and the true or central value, expressed as a percentage.
Errors can be classified by source, magnitude, and sign. There are three types of errors: systematic, random, and gross.
Systematic or...
11.4K
Genomics02:02

Genomics

40.7K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
40.7K
Effects of EDTA on End-Point Detection Methods01:18

Effects of EDTA on End-Point Detection Methods

667
Different methods, such as visual observance of metal-ion indicators, spectroscopic techniques, and potentiometric methods, can determine the endpoint of an EDTA titration.
In the visual method, metal-ion indicators (metallochromic dyes), which have distinct colors in their free and complex forms, are added to the mixture to signal the titration's end point. They form stable complexes with metal ions, but these complexes are weaker than the corresponding metal–EDTA complexes. As a...
667

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Investigating ancient human DNA preservation on cave walls and in rock art.

Nature communications·2026
Same author

ChASM: a statistically rigorous method for the detection of chromosomal aneuploidies in ancient DNA studies.

Bioinformatics (Oxford, England)·2026
Same author

Genomic history of early dogs in Europe.

Nature·2026
Same author

A high-coverage Neandertal genome from the Altai Mountains reveals population structure among Neandertals.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

RAB3GAP2 is a regulator of skeletal muscle endothelial cell proliferation and associated with capillary-to-fiber ratio.

Cell reports·2026
Same author

Natural Selection of a Virus-Protective FUT2 Variant Following the Transition to Agriculture.

Molecular biology and evolution·2025
Same journal

Genome-wide analysis of an endangered axolotl endemic to Mexico reveals genomic variation associated with body condition, environment and infection by a pathogenic fungus.

Genome biology and evolution·2026
Same journal

Conservation of IAMT preference for indole acetic acid methylation across 250 million years of seed plant divergence, with only one recent evolutionary switch in Ocimum.

Genome biology and evolution·2026
Same journal

Regulatory logic and transposable element dynamics in Caenorhabditis genomes.

Genome biology and evolution·2026
Same journal

Interchromosomal translocations and large deletions drive the evolution of the outlier chromosome in the smallest photosynthetic eukaryote.

Genome biology and evolution·2026
Same journal

Chromosome-scale genome assemblies of duckweeds provide insights into genomic plasticity, aquatic adaptation and morphological reduction.

Genome biology and evolution·2026
Same journal

Towards a functional genetics of adaptation: insights from microbial experimental evolution.

Genome biology and evolution·2026
See all related articles

Related Experiment Video

Updated: Feb 5, 2026

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

7.5K

Turning Vice into Virtue: Using Batch-Effects to Detect Errors in Large Genomic Data Sets.

Fabrizio Mafessoni1, Rashmi B Prasad2, Leif Groop2,3

  • 1Department of Evolutionary Genetics, Max Planck Institute for Evolutionary Anthropology, Leipzig, Germany.

Genome Biology and Evolution
|September 12, 2018
PubMed
Summary
This summary is machine-generated.

Combining sequencing data from multiple sources can introduce systematic errors. This study presents a novel method to detect these errors by analyzing variant pairs, improving data quality in large genomic datasets.

More Related Videos

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K
Batch Immunostaining for Large-Scale Protein Detection in the Whole Monkey Brain
08:07

Batch Immunostaining for Large-Scale Protein Detection in the Whole Monkey Brain

Published on: July 27, 2009

13.2K

Related Experiment Videos

Last Updated: Feb 5, 2026

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens
09:14

Informatic Analysis of Sequence Data from Batch Yeast 2-Hybrid Screens

Published on: June 28, 2018

7.5K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.7K
Batch Immunostaining for Large-Scale Protein Detection in the Whole Monkey Brain
08:07

Batch Immunostaining for Large-Scale Protein Detection in the Whole Monkey Brain

Published on: July 27, 2009

13.2K

Area of Science:

  • Genomics
  • Bioinformatics
  • Computational Biology

Background:

  • Large-scale genomic studies often require combining data from diverse sequencing centers and platforms.
  • Heterogeneity in data generation introduces systematic errors, complicating variant detection and analysis.
  • Existing methods struggle to comprehensively identify and correct these batch effects.

Purpose of the Study:

  • To develop and validate a novel computational method for detecting systematic errors in combined genomic datasets.
  • To quantify the prevalence of such errors, particularly in coding versus non-coding regions.
  • To demonstrate the utility of batch effects for identifying data quality issues.

Main Methods:

  • Devised a method analyzing pairs of variants on different chromosomes that co-occur within individuals.
  • Studied the abundance of these variant pairs across different genomes to identify systematic errors (batch effects).
  • Applied the method to the 1000 Genomes dataset and compared findings with data from different sequencing technologies.

Main Results:

  • Identified systematic errors enriched in coding regions of the 1000 Genomes dataset, affecting ~1% of high-frequency variants.
  • Errors outside coding regions were significantly rarer (<0.001%).
  • Predicted errors were less frequent in data from different sequencing technologies, supporting their validity, and were observed in other large datasets.

Conclusions:

  • The developed method effectively detects systematic errors arising from combining diverse genomic data.
  • Batch effects, often viewed as a nuisance, can be leveraged to improve genomic data quality.
  • Findings highlight the importance of accounting for data generation variability in large-scale genomic analyses.