Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

6.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
6.8K
Modern Molecular Taxonomy01:29

Modern Molecular Taxonomy

520
Advancements in molecular biology have revolutionized the identification and characterization of bacteria, with multiple methods leveraging DNA sequencing for enhanced precision. As sequencing technologies improve and costs decline, these approaches are increasingly used in clinical, environmental, and evolutionary studies.Multilocus Sequence Typing (MLST) examines several housekeeping genes, essential chromosomal genes encoding cellular functions, to distinguish strains. Approximately...
520
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
Phylogenetic Trees03:21

Phylogenetic Trees

49.1K
Phylogenetic trees come in many forms. It matters in which sequence the organisms are arranged from the bottom to the top of the tree, but the branches can rotate at their nodes without altering the information. The lines connecting individual nodes can be straight, angled, or even curved.
49.1K
Genome Annotation and Assembly03:36

Genome Annotation and Assembly

20.4K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
20.4K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.9K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Fast and accurate multiple-protein-sequence alignment at scale with FAMSA2.

Nature biotechnology·2026
Same author

Novel Insights into the Clinical Features, Genetic Spectrum and Clonal Evolution of Patients Carrying NLRP3 Mosaicism.

Journal of clinical immunology·2025
Same author

The multiomics blueprint of the individual with the most extreme lifespan.

Cell reports. Medicine·2025
Same author

An nf-core framework for the systematic comparison of alternative modeling tools: the multiple sequence alignment case study.

NAR genomics and bioinformatics·2025
Same author

Empowering bioinformatics communities with Nextflow and nf-core.

Genome biology·2025
Same author

A resource of RNA-binding protein motifs across eukaryotes reveals evolutionary dynamics and gene-regulatory function.

Nature biotechnology·2025
Same journal

Prime editing for precise genome engineering and modulation of fungal metabolism.

Nature biotechnology·2026
Same journal

Retargeted serine integrases for one-step, precise integration of large DNA sequences in human cells.

Nature biotechnology·2026
Same journal

A retargeted recombinase for precise insertion of large DNA.

Nature biotechnology·2026
Same journal

Experiment-guided AlphaFold3 resolves measurement-consistent protein ensembles.

Nature biotechnology·2026
Same journal

Spatially resolved profiling of extracellular vesicles in tissues with Spatial-EV-seq.

Nature biotechnology·2026
Same journal

Mapping the spatial landscape of extracellular vesicles in tissues with Spatial-EV-seq.

Nature biotechnology·2026
See all related articles

Related Experiment Video

Updated: Jan 2, 2026

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.4K

Large multiple sequence alignments with a root-to-leaf regressive method.

Edgar Garriga1, Paolo Di Tommaso1, Cedrik Magis1

  • 1Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Barcelona, Spain.

Nature Biotechnology
|December 4, 2019
PubMed
Summary
This summary is machine-generated.

A new regressive algorithm improves multiple sequence alignment (MSA) accuracy for large datasets. This method scales to millions of sequences, outperforming traditional progressive MSA techniques.

More Related Videos

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

10.9K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

36.0K

Related Experiment Videos

Last Updated: Jan 2, 2026

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.4K
Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

10.9K
A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

36.0K

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Multiple sequence alignments (MSAs) are crucial for structural and evolutionary predictions.
  • Existing progressive MSA algorithms struggle with accuracy and scalability for large datasets.
  • The complexity of aligning millions of sequences necessitates novel computational approaches.

Purpose of the Study:

  • To introduce a novel regressive algorithm for multiple sequence alignment.
  • To enhance the accuracy and scalability of MSA for extremely large genomic datasets.
  • To enable the analysis of datasets comprising millions of sequences on standard hardware.

Main Methods:

  • Developed a regressive algorithm that aligns the most dissimilar sequences first.
  • Employed an efficient divide-and-conquer strategy to manage large-scale alignments.
  • Integrated third-party alignment tools to run in linear time, irrespective of inherent complexity.

Main Results:

  • The regressive algorithm successfully aligns up to 1.4 million sequences on a standard workstation.
  • Demonstrated substantial accuracy improvements for datasets exceeding 10,000 sequences.
  • The method offers linear time complexity, overcoming limitations of previous algorithms.

Conclusions:

  • The regressive algorithm provides a scalable and accurate solution for large-scale MSA.
  • This approach is suitable for analyzing massive genomic datasets, such as the Earth BioGenome Project.
  • Enables new possibilities for evolutionary and structural studies using unprecedented data volumes.