Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

14.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
14.0K
Improving Translational Accuracy02:07

Improving Translational Accuracy

3.5K
3.5K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.8K
Sanger Sequencing01:57

Sanger Sequencing

772.6K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
772.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Longitudinal Analysis of Peripheral MicroRNA Expression and Depressive Symptom Severity Change in a Community Cohort.

Epigenomes·2026
Same author

Clinical and genetic characterization of constitutional MLH1 promoter hypermethylation: Implications for Lynch syndrome diagnosis.

Genetics in medicine : official journal of the American College of Medical Genetics·2026
Same author

Unmasking Compound Heterozygosity in GYG1 Myopathy: Diagnostic Insights From RNA-Seq and Long-Read Genomics.

Clinical genetics·2026
Same author

The relationship between social adversity, micro-RNA expression and post-traumatic stress in a prospective, community-based cohort.

Nature. Mental health·2026
Same author

Identification of a Novel Missense Homozygous Variant in LINS1 in Two Distinct Iranian Families With Consanguineous Marriage.

Molecular genetics & genomic medicine·2026
Same author

DNA Methylation as a Mediator of Cardiovascular Disease Risk in Relation to PTSD Severity: Identification of Potential Epigenetic Biomarkers.

medRxiv : the preprint server for health sciences·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
Same journal

Benchmarking DNA barcode decoding strategies under high error rates.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jan 4, 2026

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

3.8K

Recommendations for performance optimizations when using GATK3.8 and GATK4.

Jacob R Heldenbrand1, Saurabh Baheti2, Matthew A Bockol3

  • 1National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, 1205 W. Clark St., Urbana, IL, USA.

BMC Bioinformatics
|November 10, 2019
PubMed
Summary
This summary is machine-generated.

The Genome Analysis Toolkit (GATK) 3.8 and GATK4 offer performance improvements for genomic variant calling. GATK3.8 is faster for single samples, while GATK4 is more cost-effective for large-scale analyses.

Keywords:
Best practicesCluster computingComputational performanceGATKGenomic variant callingParallelization

More Related Videos

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
11:02

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing

Published on: October 18, 2013

19.8K
ATAC-Seq Optimization for Cancer Epigenetics Research
07:13

ATAC-Seq Optimization for Cancer Epigenetics Research

Published on: June 30, 2022

5.1K

Related Experiment Videos

Last Updated: Jan 4, 2026

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance
04:58

Author Spotlight: Investigating the Role of Repetitive DNA Misregulation in Cancer Initiation and Immunotherapy Resistance

Published on: December 13, 2024

3.8K
Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing
11:02

Detecting Somatic Genetic Alterations in Tumor Specimens by Exon Capture and Massively Parallel Sequencing

Published on: October 18, 2013

19.8K
ATAC-Seq Optimization for Cancer Epigenetics Research
07:13

ATAC-Seq Optimization for Cancer Epigenetics Research

Published on: June 30, 2022

5.1K

Area of Science:

  • Genomics
  • Bioinformatics

Background:

  • Genome Analysis Toolkit (GATK) is standard for genomic variant calling.
  • GATK has undergone significant evolution with GATK3.8 and GATK4 releases.
  • GATK4 represents a major rewrite with Spark implementation.

Purpose of the Study:

  • Analyze performance improvements in GATK3.8 and GATK4.
  • Provide guidance for optimal deployment in productive environments.
  • Help the community stay updated with GATK performance changes.

Main Methods:

  • Re-evaluated threading, garbage collection, I/O, and data-level parallelization.
  • Compared trade-offs between GATK3.8 and GATK4.
  • Optimized parameter values for variant calling procedures.

Main Results:

  • Reduced execution time by 29.3% for GATK3.8 and 16.9% for GATK4.
  • Achieved whole human genome analysis in hours with data splitting.
  • GATK4 demonstrated greater cost-effectiveness, enabling single-threaded analysis for multiple samples.

Conclusions:

  • For time-sensitive single-sample analysis, GATK3.8 with data splitting is recommended.
  • For cost-effective routine or population analyses, GATK4 processing multiple samples on one node is advised.
  • GATK4 offers significant cost savings per sample for large-scale studies.