Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Gene Families

Gene Families

Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...

Protein Families

Protein Families

Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism. Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members. If these new proteins contain similar amino acids in key...

Protein Families

Protein Families

Multi-species Conserved Sequences

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...

Conserved Binding Sites

Conserved Binding Sites

Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...

Conserved Binding Sites

Conserved Binding Sites

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Low-field strength MRI (0.55T) for stereotactic and functional neurosurgery using deep learning-based reconstruction algorithm: Preliminary experiences.

AJNR. American journal of neuroradiology·2026

Same author

PRISM: A unified platform for phage isolation and characterization from single-droplet microenvironments.

Science advances·2026

Same author

Ground truth data set of Gas Chromatography Mass Spectrometry (GCMS) analysed synthesised methylamphetamine.

Data in brief·2026

Same author

Clinician Perceptions of Barriers and Strategies to Improve Pediatric Hypertension Detection.

JAMA network open·2026

Same author

Stabilized Full-Length Measles Fusion Protein Elicits Potent Immunity and Protection In Vivo.

bioRxiv : the preprint server for biology·2025

Same author

Situational motionless camouflage of a loliginid squid.

Scientific reports·2025

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 29, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Primary orthologs from local sequence context.

Kun Gao¹, Jonathan Miller²

¹School of Science, Southwest University of Science and Technology, 59 Qinglong Road, Mianyang, Sichuan Province, 621010, People's Republic of China. kgao@mail.ustc.edu.cn.

BMC Bioinformatics

|February 8, 2020

Summary

This summary is machine-generated.

Identifying primary orthologs, crucial for understanding gene evolutionary history, can now be done efficiently using short-range genomic sequence context. This new method, based on non-nested maximal matches, is faster and more accurate than traditional alignment techniques.

Keywords:

Genomic context K-mer Primary/positional orthology Reciprocal best hit Whole-genome alignment

More Related Videos

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Related Experiment Videos

Last Updated: Dec 29, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

Area of Science:

Genomics
Evolutionary Biology
Bioinformatics

Background:

Inferring gene evolutionary history is vital in biology.
Conserved non-coding sequences in mammalian genomes require methods beyond protein-coding analysis.
Distinguishing primary orthologs from other homologs necessitates genomic context, often computationally intensive.

Purpose of the Study:

To develop a computationally efficient method for identifying primary orthologs using short-range genomic sequence context.
To overcome limitations of similarity-based and traditional alignment methods in ortholog identification.

Main Methods:

Utilizing genome intersection to extract "non-nested maximal matches" from mammalian genomes.
Employing short-range sequence context, as minimal as a single maximal match, for ortholog inference.
Developing a parameter-free, intersection-based approach without repeat-masking or alignment.

Main Results:

Non-nested maximal matches accurately identify primary (positional) orthologs with high precision and recall across genomes.
The method is over 30 times faster than commonly used whole-genome alignment techniques.
Novel putative orthologs, such as approximately 1000 gene pairs in human-chimpanzee, were identified using reciprocal best hits (mmRBHs).

Conclusions:

An intersection-based method effectively infers sequence evolutionary history using short-range genomic context.
This approach is computationally efficient, parameter-free, and suitable for genome-wide ortholog identification.
The method can identify orthologs in repeat-masked regions and may be applicable to unassembled genomes.