Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Gene Families01:57

Gene Families

Gene families consist of groups of genes proposed to have originated from a common ancestor. Typically these arise through events in which a gene or genes are mistakenly duplicated during cell division. Unlike their parent genes (which are subject to selection pressure to maintain function), these gene copies do not need to preserve their sequences and may evolve at a relatively faster rate.
Occasionally these regions can be adapted to take on new roles within the organism, becoming novel genes...
From DNA to Protein03:06

From DNA to Protein

The flow of genetic information in cells from DNA to mRNA to protein is described by the central dogma, which states that genes specify the sequence of mRNAs, which in turn specify the sequence of amino acids making up all proteins. The decoding of one molecule to another is performed by specific proteins and RNAs. Because the information stored in DNA is so central to cellular function, it makes intuitive sense that the cell would make mRNA copies of this information for protein synthesis...
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved DNA...
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
Cis-regulatory Sequences02:02

Cis-regulatory Sequences

Cis-regulatory sequences are short fragments of non-coding DNA that are present on the same chromosomes as the genes that they regulate. These fragments serve as binding sites for transcriptional regulators, proteins that are responsible for controlling gene transcription and differential gene expression across cell types in eukaryotes. Cis-regulatory sequences can be close to the gene of interest or thousands of bases away in the DNA sequence; however, those sequences that are further away are...
Gene Duplication and Divergence02:37

Gene Duplication and Divergence

The seminal work of Ohno in 1970 popularized the idea of gene duplication and divergence. DNA sequence comparison studies reveal that a large portion of the genes in bacteria, archaebacteria, and eukaryotes was  generated by gene duplication and divergence, indicating its critical role in evolution.
The duplicated copies of the gene are called Paralogs. Paralogs with similar sequences and functions form a gene family. Across several species, a large number of gene families are characterized.

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Pre-transition nutrition dose and mortality using a CRP-Free operational metabolic transition framework: A MIMIC-IV transportability analysis.

Clinical nutrition ESPEN·2026
Same author

Long-term comparative efficacy and safety of DOACs versus Warfarin in Patients with AF and CKD: a Nationwide Observational Study.

Heart rhythm·2026
Same author

Noradrenaline-trajectory phenotypes in septic shock: derivation and external validation in two independent cohorts.

Intensive care medicine experimental·2026
Same author

RetiMap: Automated Retinal Vascular Measures Link Microvascular Structure to Metabolic Health and Predict Cardiovascular Risk.

JACC. Basic to translational science·2026
Same author

The promise of adaptive health in the United Arab Emirates and beyond.

Nature genetics·2026
Same author

Population-scale characterization of the oral microbiome and associations with metabolic health.

Nature communications·2026
Same journal

Complete sequencing of medaka genomes reveals the architecture of centromeric satellites, giant mobile elements, and sex chromosomes.

Genome research·2026
Same journal

Convergence and conflict among telomere specialized transposons across 60 million years of Drosophilid evolution.

Genome research·2026
Same journal

A unified analysis of cell type- and trajectory-associated pathways in single-cell data using Phoenix.

Genome research·2026
Same journal

Resf1 is required for proper placental development and configuration of trophoblast cell-specific heterochromatin.

Genome research·2026
Same journal

Telomere-driven replicative crisis is driven by large-scale changes in genomic architecture.

Genome research·2026
Same journal

Spatially informed reference-free cell-type deconvolution for spatial transcriptomics with SpatialCD.

Genome research·2026
See all related articles

Related Experiment Video

Updated: Jun 8, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

Overlapping codes within protein-coding sequences.

Shalev Itzkovitz1, Eran Hodis, Eran Segal

  • 1Department of Computer Science and Applied Mathematics, Weizmann Institute of Science, Rehovot 76100, Israel

Genome Research
|September 16, 2010
PubMed
Summary
This summary is machine-generated.

Genomes use protein-coding regions to store multiple signals beyond amino acid sequences. This study reveals extensive overlapping codes across species, with bacteria encoding more information than eukaryotes.

More Related Videos

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data
08:23

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing
09:49

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing

Published on: July 5, 2019

Related Experiment Videos

Last Updated: Jun 8, 2026

An Integrated Approach for Microprotein Identification and Sequence Analysis
09:37

An Integrated Approach for Microprotein Identification and Sequence Analysis

Published on: July 12, 2022

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data
08:23

De novo Identification of Actively Translated Open Reading Frames with Ribosome Profiling Data

Published on: February 18, 2022

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing
09:49

Oncogenic Gene Fusion Detection Using Anchored Multiplex Polymerase Chain Reaction Followed by Next Generation Sequencing

Published on: July 5, 2019

Area of Science:

  • Genomics
  • Bioinformatics
  • Molecular Biology

Background:

  • Genomes contain various signals organized linearly.
  • The genetic code's redundancy allows for overlapping signals within protein-coding regions.
  • The extent and purpose of this overlapping information are largely unknown.

Purpose of the Study:

  • To systematically investigate the presence and nature of overlapping codes within protein-coding regions.
  • To compare the encoding capacity across diverse species, from viruses to plants.

Main Methods:

  • Comparative genomics analysis of protein-coding regions from over 700 species.
  • Statistical comparison against randomized sequences preserving amino acid sequence and codon bias.
  • Identification of specific signal types, including RNA secondary structure and regulatory elements.

Main Results:

  • Protein-coding regions across all surveyed phyla encode significant additional information.
  • Bacteria exhibit higher information density in coding regions compared to eukaryotes.
  • Identified signals include RNA secondary structure, transcription/translation signals, and microRNA target sites.

Conclusions:

  • Genomes extensively exploit protein-coding regions for overlapping information storage.
  • This strategy varies across taxa, with distinct signals in bacteria and eukaryotes.
  • Suggests a deeper evolutionary role for protein-coding sequences beyond protein synthesis.