Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Gene Families01:57

Gene Families

9.7K
Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
9.7K
Protein Families02:47

Protein Families

16.5K
Protein families are groups of homologous proteins; that is, they have similarities in amino acid sequences and three-dimensional structures. Protein families usually occur because of gene duplication, where an additional copy of a gene is inserted into the genome of an organism.   Mutations that change the amino acids but still allow the protein to be properly synthesized, will lead to new protein family members.   If these new proteins contain similar amino acids in key...
16.5K
Protein Families02:47

Protein Families

4.1K
4.1K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K
Conserved Binding Sites01:49

Conserved Binding Sites

5.0K
Many proteins’ biological role depends on their interactions with their ligands, small molecules that bind to specific locations on the protein known as ligand-binding sites. Ligand-binding sites are often conserved among homologous proteins as these sites are critical for protein function.
Binding sites are often located in large pockets, and if their location on a protein’s surface is unknown, it can be predicted using various approaches. The energetic method computationally...
5.0K
Conserved Binding Sites01:49

Conserved Binding Sites

1.9K
1.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Low-field strength MRI (0.55T) for stereotactic and functional neurosurgery using deep learning-based reconstruction algorithm: Preliminary experiences.

AJNR. American journal of neuroradiology·2026
Same author

PRISM: A unified platform for phage isolation and characterization from single-droplet microenvironments.

Science advances·2026
Same author

Ground truth data set of Gas Chromatography Mass Spectrometry (GCMS) analysed synthesised methylamphetamine.

Data in brief·2026
Same author

Clinician Perceptions of Barriers and Strategies to Improve Pediatric Hypertension Detection.

JAMA network open·2026
Same author

Stabilized Full-Length Measles Fusion Protein Elicits Potent Immunity and Protection In Vivo.

bioRxiv : the preprint server for biology·2025
Same author

Situational motionless camouflage of a loliginid squid.

Scientific reports·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Dec 29, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K

Primary orthologs from local sequence context.

Kun Gao1, Jonathan Miller2

  • 1School of Science, Southwest University of Science and Technology, 59 Qinglong Road, Mianyang, Sichuan Province, 621010, People's Republic of China. kgao@mail.ustc.edu.cn.

BMC Bioinformatics
|February 8, 2020
PubMed
Summary
This summary is machine-generated.

Identifying primary orthologs, crucial for understanding gene evolutionary history, can now be done efficiently using short-range genomic sequence context. This new method, based on non-nested maximal matches, is faster and more accurate than traditional alignment techniques.

Keywords:
Genomic contextK-merPrimary/positional orthologyReciprocal best hitWhole-genome alignment

More Related Videos

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

10.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.4K

Related Experiment Videos

Last Updated: Dec 29, 2025

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

3.9K
A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq
07:09

A Bioinformatics Pipeline for Investigating Molecular Evolution and Gene Expression using RNA-seq

Published on: May 28, 2021

10.3K
Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin
08:57

Using Phylogenetic Analysis to Investigate Eukaryotic Gene Origin

Published on: August 14, 2018

16.4K

Area of Science:

  • Genomics
  • Evolutionary Biology
  • Bioinformatics

Background:

  • Inferring gene evolutionary history is vital in biology.
  • Conserved non-coding sequences in mammalian genomes require methods beyond protein-coding analysis.
  • Distinguishing primary orthologs from other homologs necessitates genomic context, often computationally intensive.

Purpose of the Study:

  • To develop a computationally efficient method for identifying primary orthologs using short-range genomic sequence context.
  • To overcome limitations of similarity-based and traditional alignment methods in ortholog identification.

Main Methods:

  • Utilizing genome intersection to extract "non-nested maximal matches" from mammalian genomes.
  • Employing short-range sequence context, as minimal as a single maximal match, for ortholog inference.
  • Developing a parameter-free, intersection-based approach without repeat-masking or alignment.

Main Results:

  • Non-nested maximal matches accurately identify primary (positional) orthologs with high precision and recall across genomes.
  • The method is over 30 times faster than commonly used whole-genome alignment techniques.
  • Novel putative orthologs, such as approximately 1000 gene pairs in human-chimpanzee, were identified using reciprocal best hits (mmRBHs).

Conclusions:

  • An intersection-based method effectively infers sequence evolutionary history using short-range genomic context.
  • This approach is computationally efficient, parameter-free, and suitable for genome-wide ortholog identification.
  • The method can identify orthologs in repeat-masked regions and may be applicable to unassembled genomes.