Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.7K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.0K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.0K
Estimating Population Standard Deviation01:26

Estimating Population Standard Deviation

3.0K
When the population standard deviation is unknown and the sample size is large, the sample standard deviation s is commonly used as a point estimate of σ. However, it can sometimes under or overestimate the population standard deviation. To overcome this drawback, confidence intervals are determined to estimate population parameters and eliminate any calculation bias accurately. However, this only applies to random samples from normally distributed populations. Knowing the sample mean and...
3.0K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

327
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
327
Speciation Rates01:07

Speciation Rates

20.9K
Overview
20.9K
Comparing Copy Number Variations and SNPs02:26

Comparing Copy Number Variations and SNPs

17.0K
Sequencing of the human genome has opened up several best-kept secrets of the genome. Scientists have identified thousands of genome variations that exist within a population. These variations can be a single nucleotide or a larger chromosomal variation.
Copy number variations or CNVs are the structural variations that cover more than 1kb of DNA sequence. The single nucleotide polymorphism (SNP), on the other hand, is a single nucleotide change or a point mutation that is found in more than 1%...
17.0K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Movi 2: Fast and Space-Efficient Queries on Pangenomes.

Bioinformatics (Oxford, England)·2026
Same author

Rapid phylogenomic analysis for viral surveillance and metagenomic profiling with Omni2Tree.

bioRxiv : the preprint server for biology·2026
Same author

The PhenX Toolkit: Long COVID Collection of Standard Protocols.

Current protocols·2026
Same author

Movi Color: fast and accurate taxonomic classification with the move structure.

ACM-BCB ... ... : the ... ACM Conference on Bioinformatics, Computational Biology and Biomedicine. ACM Conference on Bioinformatics, Computational Biology and Biomedicine·2026
Same author

Minimizing reference bias with an imputed personalized reference.

Genome research·2026
Same author

RLBWT-Based LCP Computation in Compressed Space for Terabase-Scale Pangenome Analysis.

bioRxiv : the preprint server for biology·2026
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: May 24, 2025

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

9.4K

EvANI benchmarking workflow for evolutionary distance estimation.

Sina Majidian1, Stephen Hwang2, Mohsen Zakeri1

  • 1Department of Computer Science, Johns Hopkins University, Baltimore, USA.

Biorxiv : the Preprint Server for Biology
|March 3, 2025
PubMed
Summary
This summary is machine-generated.

We developed EvANI, a framework to evaluate genome comparison methods. Our findings show k-mer based approaches offer efficient and accurate estimations of average nucleotide identity (ANI) for evolutionary studies.

Keywords:
Average nucleotide identityBLASTEvolutionGenomeK-merSketching

More Related Videos

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.0K
Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization
12:37

Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

Published on: April 14, 2016

38.3K

Related Experiment Videos

Last Updated: May 24, 2025

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

9.4K
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.0K
Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization
12:37

Efficient Nucleic Acid Extraction and 16S rRNA Gene Sequencing for Bacterial Community Characterization

Published on: April 14, 2016

38.3K

Area of Science:

  • Genomics
  • Bioinformatics
  • Evolutionary Biology

Background:

  • Long-read sequencing advances enable high-quality genome assemblies, facilitating evolutionary relationship studies.
  • Average nucleotide identity (ANI) is crucial for species delineation and database searching but computationally intensive.
  • Sketch-based methods accelerate ANI estimation using heuristics, necessitating rigorous evaluation.

Purpose of the Study:

  • To introduce EvANI, an evaluation framework for assessing genome comparison algorithms.
  • To analyze the impact of assumptions and heuristics in sketch-based ANI estimation.
  • To identify optimal methods for accurate and efficient genome-wide comparisons.

Main Methods:

  • Development of simulated and real benchmark datasets.
  • Application of a rank-correlation-based metric for evaluating distance estimates.
  • Comparison of various ANI estimation algorithms, including ANIb and k-mer based approaches.

Main Results:

  • ANIb provides the most accurate tree distance but is computationally inefficient.
  • K-mer based methods demonstrate high efficiency and consistent accuracy in ANI estimation.
  • Optimal k-mer lengths vary across clades, suggesting a need for multiple k values (e.g., k=10, k=19 for Chlamydiales).
  • Maximal exact match approaches offer a balance between efficiency and accuracy.

Conclusions:

  • EvANI provides a robust method for evaluating genome comparison tools.
  • K-mer based methods are highly effective for large-scale genomic comparisons.
  • A single fixed k-mer length may not be universally optimal; adaptable approaches are beneficial.