Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.7K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.7K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.8K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.8K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

3.3K
3.3K
Speciation Rates01:07

Speciation Rates

22.4K
Overview
22.4K
Phylogeny01:23

Phylogeny

56.4K
Phylogeny is concerned with the evolutionary diversification of organisms or groups of organisms. A group of organisms with a name is called a taxon (singular). Taxa (plural) can span different levels of the evolutionary hierarchy. For instance, the group containing all birds is a taxon (comprising the class Aves), and the group of all species of daisies (the genus Bellis) is a taxon. Phylogenies can likewise include just one genus (i.e., depict species relationships) or span an entire kingdom.
56.4K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.5K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Min-frame transformation enables more sensitive viral genome alignment.

bioRxiv : the preprint server for biology·2026
Same author

TIPP-SD: A new method for species detection in microbiomes.

PLoS computational biology·2026
Same author

Phylogenetic Placement Using SCAMPP and Batch-SCAMPP.

Methods in molecular biology (Clifton, N.J.)·2026
Same author

Reproductive and Developmental Toxicity of Human Umbilical Cord Blood Mononuclear Cells.

Biomedicines·2026
Same author

The Earth BioGenome Project Phase II: illuminating the eukaryotic tree of life.

Frontiers in science·2026
Same author

BSCAMPP: Batch-Scaled Phylogenetic Placement on Large Trees.

IEEE transactions on computational biology and bioinformatics·2025
Same journal

Diversification dynamics in the global radiation of gobies.

Systematic biology·2026
Same journal

Correction to: nQMaker: Estimating Time Nonreversible Amino Acid Substitution Models.

Systematic biology·2026
Same journal

Phylogenomic challenges in polyploid-rich lineages: Insights from paralog processing and reticulation methods using the complex genus Packera (Asteraceae: Senecioneae).

Systematic biology·2026
Same journal

An evolving view of phylogenetic biogeography.

Systematic biology·2026
Same journal

Modeling Site-and-Branch-Heterogeneity with GFmix.

Systematic biology·2026
Same journal

Coalescent-based branch length estimation improves dating of species trees.

Systematic biology·2026
See all related articles

Related Experiment Video

Updated: Dec 14, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.3K

Phylogeny Estimation Given Sequence Length Heterogeneity.

Vladimir Smirnov1, Tandy Warnow1

  • 1Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL, 61801, USA.

Systematic Biology
|July 22, 2020
PubMed
Summary
This summary is machine-generated.

Accurate phylogeny estimation with large, mixed-length sequence datasets is challenging. Aligning all sequences first using UPP and RAxML, rather than phylogenetic placement, yields superior tree accuracy, especially with high evolutionary rates.

More Related Videos

A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.9K
Amplification of Near Full-length HIV-1 Proviruses for Next-Generation Sequencing
10:18

Amplification of Near Full-length HIV-1 Proviruses for Next-Generation Sequencing

Published on: October 16, 2018

12.5K

Related Experiment Videos

Last Updated: Dec 14, 2025

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.3K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.9K
Amplification of Near Full-length HIV-1 Proviruses for Next-Generation Sequencing
10:18

Amplification of Near Full-length HIV-1 Proviruses for Next-Generation Sequencing

Published on: October 16, 2018

12.5K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Evolutionary Biology

Background:

  • Phylogeny estimation is crucial for biological studies but faces challenges with large datasets.
  • Increasingly large datasets, including fragmentary sequences, require robust phylogenetic methods.
  • Sequence length heterogeneity complicates accurate tree reconstruction.

Purpose of the Study:

  • To compare two primary approaches for phylogeny estimation using large datasets with mixed sequence lengths.
  • To evaluate the performance of different computational pipelines under varying evolutionary rates and sequence characteristics.
  • To identify optimal methods for constructing phylogenetic trees from complex, real-world biological data.

Main Methods:

  • Two main strategies were compared: (1) aligning the entire dataset followed by maximum likelihood tree computation, and (2) building a tree from full-length sequences and then using phylogenetic placement for fragmentary sequences.
  • Simulated datasets (1000 sequences, varying evolutionary rates) and two biological datasets were analyzed.
  • Specific tools evaluated included UPP for alignment, RAxML and FastTree for tree construction, and phylogenetic placement methods.

Main Results:

  • Significant performance differences were observed, particularly with high evolutionary rates and substantial sequence length heterogeneity.
  • The approach of aligning all sequences (using UPP) and then computing a tree (using RAxML) demonstrated superior accuracy.
  • Phylogenetic placement methods and FastTree showed lower accuracy when dealing with fragmentary sequences and high evolutionary rates.

Conclusions:

  • Aligning all sequences before tree computation, specifically with UPP and RAxML, is recommended for accurate phylogeny estimation from large, heterogeneous datasets.
  • Current phylogenetic placement methods may not be optimal for datasets with significant sequence length variation and rapid evolution.
  • Further development of phylogenetic methods is needed to effectively handle large-scale, complex sequence data.