Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K
Phylogenetic Trees03:21

Phylogenetic Trees

45.5K
Phylogenetic trees come in many forms. It matters in which sequence the organisms are arranged from the bottom to the top of the tree, but the branches can rotate at their nodes without altering the information. The lines connecting individual nodes can be straight, angled, or even curved.
45.5K
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

10.9K
Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to...
10.9K
Gene Evolution - Fast or Slow?02:05

Gene Evolution - Fast or Slow?

7.2K
The genomes of eukaryotes are punctuated by long stretches of sequence which do not code for proteins or RNAs. Although some of these regions do contain crucial regulatory sequences, the vast majority of this DNA serves no known function. Typically, these regions of the genome are the ones in which the fastest change, in evolutionary terms, is observed, because there is typically little to no selection pressure acting on these regions to preserve their sequences.
In contrast, regions which code...
7.2K
Phylogeny01:23

Phylogeny

44.4K
Phylogeny is concerned with the evolutionary diversification of organisms or groups of organisms. A group of organisms with a name is called a taxon (singular). Taxa (plural) can span different levels of the evolutionary hierarchy. For instance, the group containing all birds is a taxon (comprising the class Aves), and the group of all species of daisies (the genus Bellis) is a taxon. Phylogenies can likewise include just one genus (i.e., depict species relationships) or span an entire kingdom.
44.4K
Conserved Binding Sites01:49

Conserved Binding Sites

4.2K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
4.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Next release of the European Marine Omics Biodiversity Observation Network (EMO BON) shotgun metagenomic data from water and sediment samples (Release 2).

Biodiversity data journal·2026
Same author

First release of the European marine omics biodiversity observation network (EMO BON) shotgun metagenomics data from water and sediment samples.

Biodiversity data journal·2025
Same author

Multiple evolutionary transitions of reproductive strategies in a phylum of aquatic colonial invertebrates.

Proceedings. Biological sciences·2023
Same author

metaGOflow: a workflow for the analysis of marine Genomic Observatories shotgun metagenomics data.

GigaScience·2023
Same author

Integrated phylogenomic analyses unveil reticulate evolution in Parthenocissus (Vitaceae), highlighting speciation dynamics in the Himalayan-Hengduan Mountains.

The New phytologist·2022
Same author

Comparative mitogenomic analyses and gene rearrangements reject the alleged polyphyly of a bivalve genus.

PeerJ·2022
Same journal

Evaluation of cold resistance in pear (<i>Pyrus</i> L.) germplasms: integrating physiological and biochemical responses with anatomical traits under low temperature stress.

PeerJ·2026
Same journal

Evaluation of retinal and choroidal microvasculature parameters by OCTA in patients with premature ovarian insufficiency: a prospective case control study.

PeerJ·2026
Same journal

Development and prognostic evaluation of a combined SII-LNR score in resectable gastric and gastroesophageal junction adenocarcinoma treated with perioperative FLOT: a retrospective single-center study.

PeerJ·2026
Same journal

Validity and reliability evaluation of the Chinese version of the attention-deficit/hyperactivity disorder stigma questionnaire.

PeerJ·2026
Same journal

Relationship between mental disorders and non-traumatic cerebral hemorrhage: cross-sectional analysis and mendelian randomization.

PeerJ·2026
Same journal

Association between intestinal functional disorders and anal fistula: evidence from a retrospective case-control study.

PeerJ·2026
See all related articles

Related Experiment Video

Updated: Jul 19, 2025

A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.4K

Data-specific substitution models improve protein-based phylogenetics.

João M Brazão1, Peter G Foster2, Cymon J Cox1

  • 1Centro de Ciências do Mar, Universidade do Algarve, Faro, Algarve, Portugal.

Peerj
|August 14, 2023
PubMed
Summary
This summary is machine-generated.

Estimating accurate, data-specific amino-acid substitution models for proteins is crucial for phylogenetic analysis. This study found that data-specific models, particularly those from IQ-TREE and P4 (maximum likelihood), offer superior accuracy and efficiency over empirical models.

Keywords:
Amino-acid substitution modelsData-specific modelsModel estimationPhylogeneticsProtein evolution

More Related Videos

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

15.9K

Related Experiment Videos

Last Updated: Jul 19, 2025

A Practical Guide to Phylogenetics for Nonexperts
12:00

A Practical Guide to Phylogenetics for Nonexperts

Published on: February 5, 2014

35.4K
Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

7.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

15.9K

Area of Science:

  • Computational Biology
  • Phylogenetics
  • Bioinformatics

Background:

  • Estimating accurate amino-acid substitution models is vital for phylogenetic analysis but computationally challenging.
  • Existing empirical models may not adequately represent specific protein datasets, potentially leading to inaccurate evolutionary inferences.

Purpose of the Study:

  • To evaluate the computational efficiency and accuracy of five methods for estimating data-specific amino-acid substitution models.
  • To compare the performance of data-specific models against commonly used empirical models (cpREV, WAG).
  • To demonstrate the benefits of using data-specific models in phylogenetic analyses of published datasets.

Main Methods:

  • Simulated protein alignments were generated using known models and trees.
  • Five methods (Codeml, FastMG, IQ-TREE, P4-ML, P4-Bayesian) were used to estimate data-specific models.
  • Maximum likelihood scores and tree topologies were compared between data-specific and empirical models against the simulation model.

Main Results:

  • Data-specific models significantly outperformed empirical models, showing better fit and inferring more accurate trees.
  • IQ-TREE and P4 (maximum likelihood) methods produced trees statistically indistinguishable from the simulation model.
  • Re-analysis of published datasets with IQ-TREE-estimated models revealed improved data fit, longer inferred trees, and altered topologies in over half of the cases.

Conclusions:

  • Data-specific amino-acid substitution models provide a more accurate representation of protein evolution than empirical models.
  • IQ-TREE and P4 (maximum likelihood) are recommended for their accuracy and efficiency in estimating these models.
  • Computational burden and software availability are no longer significant limitations for generating superior data-specific models for phylogenetic analyses.