Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

RNA-seq03:21

RNA-seq

11.7K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
11.7K
Next-generation Sequencing03:00

Next-generation Sequencing

97.6K
The first human genome sequencing project cost $2.7 billion and was declared complete in 2003, after 15 years of international cooperation and collaboration between several research teams and funding agencies. Today, with the advent of next-generation sequencing technologies, the cost and time of sequencing a human genome have dropped over 100 fold.
Next-Generation Sequencing Methods
Although all next-generation methods use different technologies, they all share a set of standard features....
97.6K
Maxam-Gilbert Sequencing01:05

Maxam-Gilbert Sequencing

12.6K
In the same year as the discovery of the Sanger sequencing method, another group of scientists, Allan Maxam and Walter Gilbert, demonstrated their chemical-cleavage method for DNA sequencing. The Maxam-Gilbert method relies on using different chemicals that can cleave the DNA sequence at specific sites, the separation of resulting DNA fragments of variable size using electrophoresis, and deciphering the DNA sequence from the resulting gel bands.
Challenges of the Maxam-Gilbert Method
The...
12.6K
RACE - Rapid Amplification of cDNA Ends02:35

RACE - Rapid Amplification of cDNA Ends

7.1K
Rapid Amplification of cDNA Ends, or RACE, is one of the most effective methods to obtain a full-length cDNA from an mRNA sequence between a known internal region to the unknown sequence at the 5’ or 3’ end. The unknown region is cloned in the cDNA by a gene-specific primer that binds the known end, and a hybrid primer that attaches a predefined anchor sequence to the unknown end of the cDNA. The sequence in between is amplified by PCR with an anchor primer and a gene-specific...
7.1K
Sanger Sequencing01:57

Sanger Sequencing

772.9K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
772.9K
Multi-species Conserved Sequences02:51

Multi-species Conserved Sequences

4.6K
Next-generation sequencing technologies have created large genomic databases of a variety of animals and plants. Ever since the human genome project was completed, scientists studied the genome of primates, mammals, and other phylogenetically distant living beings. Such large-scale  studies have provided new insights into the evolutionary relationship between organisms.
Although the genome of each species varies greatly from each other, a few sequences are highly conserved. Such conserved...
4.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Efficient and Tidy Manipulation of Annotated Matrix Data with plyxp.

bioRxiv : the preprint server for biology·2026
Same author

The GA4GH Categorical Variation Representation Specification: A Unified Computational Framework for Reasoning over Genomic Variant Categories.

bioRxiv : the preprint server for biology·2026
Same author

Searching the Druggable Genome using Large Language Models.

bioRxiv : the preprint server for biology·2026
Same author

The IGVF catalog-from genetic variation to function.

Nucleic acids research·2025
Same author

Identifying Locally Recurrent Versus Second Primary Breast Cancer: Genomic Versus Clinical Criteria from Carolina Breast Cancer Study Phase III.

Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology·2025
Same author

JASPAR 2026: expansion of transcription factor binding profiles and integration of deep learning models.

Nucleic acids research·2025
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 10, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.1K

Taming the reference genome jungle: the refget sequence collection standard.

Donald R Campbell1, Timothee Cezard2, Sveinung Gundersen3

  • 1Department of Genome Sciences, School of Medicine, University of Virginia, 22908, Charlottesville VA.

Biorxiv : the Preprint Server for Biology
|November 24, 2025
PubMed
Summary
This summary is machine-generated.

The new Sequence Collections (seqcol) standard provides a framework for unambiguously identifying and comparing reference genomes and other sequence data. This addresses inconsistencies in genomics, improving data reuse and reproducibility in research.

More Related Videos

High-throughput Identification of Gene Regulatory Sequences Using Next-generation Sequencing of Circular Chromosome Conformation Capture 4C-seq
09:06

High-throughput Identification of Gene Regulatory Sequences Using Next-generation Sequencing of Circular Chromosome Conformation Capture 4C-seq

Published on: October 5, 2018

10.7K
CIRCLE-Seq for Interrogation of Off-Target Gene Editing
08:23

CIRCLE-Seq for Interrogation of Off-Target Gene Editing

Published on: November 1, 2024

1.3K

Related Experiment Videos

Last Updated: Jan 10, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.1K
High-throughput Identification of Gene Regulatory Sequences Using Next-generation Sequencing of Circular Chromosome Conformation Capture 4C-seq
09:06

High-throughput Identification of Gene Regulatory Sequences Using Next-generation Sequencing of Circular Chromosome Conformation Capture 4C-seq

Published on: October 5, 2018

10.7K
CIRCLE-Seq for Interrogation of Off-Target Gene Editing
08:23

CIRCLE-Seq for Interrogation of Off-Target Gene Editing

Published on: November 1, 2024

1.3K

Area of Science:

  • Genomics
  • Bioinformatics
  • Data Standards

Background:

  • Reference genomes are crucial for genomic analysis but suffer from ambiguity and incompatibility.
  • Inconsistent naming, undocumented differences, and lack of formal comparison mechanisms hinder genomic research.
  • Existing standards do not adequately address the need for unambiguous representation and comparison of sequence collections.

Purpose of the Study:

  • To introduce the GA4GH refget Sequence Collections (seqcol) standard for unambiguous representation, retrieval, and comparison of sequence collections.
  • To enable precise identification and compatibility assessment of reference genomes and transcriptomes.
  • To provide a scalable and reproducible solution to the reference genome compatibility crisis.

Main Methods:

  • Developed a structured data schema for sequence collections.
  • Implemented a canonical encoding algorithm for content-based, globally unique identifiers.
  • Created a retrieval API and a comparison protocol for sequence collections.
  • Applied the seqcol standard to 60 human and 36 mouse reference genomes using digest-based comparisons.

Main Results:

  • The seqcol standard enables precise identification of sequence collections across decentralized systems.
  • Analysis revealed substantial incompatibility and duplicate references among major genome providers.
  • Identified consistent subsets of sequences and coordinate systems across different reference genomes.
  • Demonstrated compatibility assessments beyond exact identity, including order-relaxed matches.

Conclusions:

  • The refget seqcol standard offers a robust solution to reference genome incompatibility, enhancing transparency and reproducibility.
  • Improved interoperability across genomic tools and datasets is achieved through the seqcol standard.
  • The provided Python package, web API, and comparison interface support widespread adoption and application of the seqcol standard.