Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

21.2K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
21.2K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

18.9K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
18.9K
Sanger Sequencing01:57

Sanger Sequencing

776.3K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
776.3K
Point and Frameshift Mutations01:30

Point and Frameshift Mutations

1.4K
Point mutations are genetic alterations involving the change of a single nucleotide base pair in DNA. Depending on how the alteration affects protein synthesis, they can lead to various consequences.Point mutations fall into the following types:Silent mutations occur when a nucleotide change does not alter the amino acid sequence due to the redundancy of the genetic code. For instance, changing ACC to ACA still encodes threonine, leaving the protein function unaffected. This occurs because...
1.4K
Conservative Site-specific Recombination and Phase Variation02:53

Conservative Site-specific Recombination and Phase Variation

7.0K
Because the DNA segments are cut and reorganized in a direction-specific manner, site-specific recombination has emerged as an efficient genetic engineering technique. Flippase and Cyclization recombinases or Flp and Cre, respectively, are two members of the tyrosine recombinase family derived from bacteriophages, that are used to mediate site-specific DNA insertions, deletions, and targeted expression of proteins in mammalian cell lines.
The recognition sites for Cre recombinase called LoxP...
7.0K
Bias01:22

Bias

7.8K
Bias refers to any tendency that prevents a question from being considered unprejudiced. In research, bias occurs when one outcome or answer is selected or encouraged over others in sampling or testing. Bias can occur during any research phase, including study design, data collection, analysis, and publication.
In statistics, a sampling bias is created when a sample is collected from a population, and some members of the population are not as likely to be chosen as others (remember, each member...
7.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Convergence and divergence of DNA methylation and gene expression patterns in neopolyploid Arabidopsis kamchatica.

Nature communications·2026
Same author

An inverse correlation between structural linguistic and human genetic diversity.

Proceedings of the National Academy of Sciences of the United States of America·2026
Same author

Purifying selection on deleterious variants affected by the combination of subgenomes and gene expression in bread wheat.

Cell reports·2026
Same author

A likelihood ratio test for detecting shifts in homeolog expression ratios in allopolyploids.

The New phytologist·2025
Same author

De novo annotation reveals transcriptomic complexity across the hexaploid wheat pan-genome.

Nature communications·2025
Same author

Patterns of genetic admixture reveal similar rates of borrowing across diverse scenarios of language contact.

Science advances·2025
Same journal

Mild oxidative stress and dietary epigenetic modulators direct DNA methylation remodeling toward stress-resilience pathways.

BMC genomics·2026
Same journal

Integrative ATAC-Seq and RNA-Seq analysis identifies key genes for intramuscular fat content in Laiwu pigs.

BMC genomics·2026
Same journal

A comprehensive long RNA landscape of multi-regional porcine lung-derived small extracellular vesicles.

BMC genomics·2026
Same journal

pGWAS-Portal: a comprehensive online platform for integrative post-genome-wide association study analysis.

BMC genomics·2026
Same journal

Physiological and transcriptomic analyses of Rosa persica in response to drought stress and functional validation of the transcription factor RpERF113-like.

BMC genomics·2026
Same journal

Integrated analysis of chromatin accessibility and transcriptome profiles in granulosa cells of sheep with different FecB genotypes.

BMC genomics·2026
See all related articles

Related Experiment Video

Updated: Mar 5, 2026

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

34.9K

Positional bias in variant calls against draft reference assemblies.

Roman V Briskine1,2, Kentaro K Shimizu3,4

  • 1Department of Evolutionary Biology and Environmental Studies, University of Zurich, Winterthurerstrasse 190, Zurich, CH-8057, Switzerland. roman.briskine@ieu.uzh.ch.

BMC Genomics
|March 30, 2017
PubMed
Summary
This summary is machine-generated.

Variant calling in draft genomes shows a positional bias, particularly in repetitive regions, leading to false positives. Filtering variants in repetitive elements can improve data quality for non-model species resequencing.

Keywords:
Draft reference genomePolymorphismsPositional biasRepetitive elementsReseqencingSNPsVariants

More Related Videos

Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.6K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.8K

Related Experiment Videos

Last Updated: Mar 5, 2026

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease
09:34

Targeted Next-generation Sequencing and Bioinformatics Pipeline to Evaluate Genetic Determinants of Constitutional Disease

Published on: April 4, 2018

34.9K
Rare Event Detection Using Error-corrected DNA and RNA Sequencing
10:36

Rare Event Detection Using Error-corrected DNA and RNA Sequencing

Published on: August 3, 2018

12.6K
Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER
14:06

Detection of Rare Genomic Variants from Pooled Sequencing Using SPLINTER

Published on: June 23, 2012

15.8K

Area of Science:

  • Genomics
  • Bioinformatics
  • Population Genetics

Background:

  • Whole genome resequencing (WGR) enables population genetic and genome-wide association studies in non-model species using draft reference genomes.
  • Draft assemblies, though lower in quality, are crucial for extending these analyses beyond model organisms.
  • Understanding biases in variant calling pipelines is essential for accurate genomic data interpretation.

Purpose of the Study:

  • To identify and characterize a positional bias in variant calling against draft reference assemblies.
  • To investigate the impact of assembly algorithms and sequence characteristics on variant calling accuracy.

Main Methods:

  • Assessed variant frequency at different positions within contigs and scaffolds.
  • Analyzed publicly available draft assemblies and simulated short-read data.
  • Investigated the effect of k-mer length, read length, and repetitive sequences on variant calling bias.

Main Results:

  • A significant positional bias was detected in variant calling, with excess variants near contig/scaffold ends.
  • This bias is linked to k-mer and read lengths and is exacerbated by repetitive sequences.
  • Simulations confirmed false positives arise from reads of spatially distant repeats, particularly in contig assemblies.

Conclusions:

  • Draft genome assemblies from common assemblers exhibit a positional bias affecting variant calling in non-model species.
  • The bias stems from the inherent handling of repetitive sequences by assembly algorithms.
  • Filtering variants in repetitive regions is recommended to enhance data quality when high-quality assemblies are not feasible.