Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

16.5K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
16.5K
Evolutionary Relationships through Genome Comparisons02:54

Evolutionary Relationships through Genome Comparisons

5.8K
Genome comparison is one of the excellent ways to interpret the evolutionary relationships between organisms. The basic principle of genome comparison is that if two species share a common feature, it is likely encoded by the DNA sequence conserved between both species. The advent of genome sequencing technologies in the late 20th century enabled scientists to understand the concept of conservation of domains between species and helped them to deduce evolutionary relationships across diverse...
5.8K
Genomics02:02

Genomics

35.1K
Genomics is the science of genomes: it is the study of all the genetic material of an organism. In humans, the genome consists of information carried in 23 pairs of chromosomes in the nucleus, as well as mitochondrial DNA. In genomics, both coding and non-coding DNA is sequenced and analyzed. Genomics allows a better understanding of all living things, their evolution, and their diversity. It has a myriad of uses: for example, to build phylogenetic trees, to improve productivity and...
35.1K
Oligosaccharide Assembly01:24

Oligosaccharide Assembly

2.6K
Protein glycosylation starts in the ER lumen and continues in the Golgi apparatus. Glycosyltransferases catalyze the addition of sugar molecules or glycosylation of proteins. Usually, these enzymes add sugars to the hydroxyl groups of selected serine or threonine residues to form O-linked glycans or the amino groups of asparagine residues to form N-linked glycans. Different positions on the same polypeptide chain can contain differently linked glycans.
Multiple sugar molecules that may or may...
2.6K
Genome Size and the Evolution of New Genes03:21

Genome Size and the Evolution of New Genes

2.5K
2.5K
Sanger Sequencing01:57

Sanger Sequencing

800.2K
DNA sequencing is a fundamental technique that is routinely used in the biological sciences. This method can be applied to a range of questions at different scales - from the sequencing of a cloned DNA fragment or the study of a mutation in a gene up to whole-genome sequencing. However, despite the widespread use of sequencing today, it was not until 1977 that Fredrick Sanger and his collaborators developed the chain-termination method to decode DNA sequences. It relies on the separation of a...
800.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Ankh-Score Produces Better Sequence Alignments Than AlphaFold3.

Proteins·2026
Same author

Protein embeddings and local alignments.

Computational and structural biotechnology journal·2026
Same author

Component puzzle protein-protein interaction prediction.

Briefings in bioinformatics·2025
Same author

Explainability of Protein Deep Learning Models.

International journal of molecular sciences·2025
Same author

Scoring alignments by embedding vector similarity.

Briefings in bioinformatics·2024
Same author

Seq-InSite: sequence supersedes structure for protein interaction site prediction.

Bioinformatics (Oxford, England)·2024
Same journal

SNPio: a Python interface for population genomic data processing.

BMC bioinformatics·2026
Same journal

SpaHNR: a spatial domain identification method via sparse attention-based hierarchical node representation and multi-view contrastive learning.

BMC bioinformatics·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Apr 23, 2026

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

7.3K

SAGE: String-overlap Assembly of GEnomes.

Lucian Ilie1, Bahlul Haider, Michael Molnar

  • 1Department of Computer Science, University of Western Ontario, N6A 5B7 London, Ontario, Canada. ilie@csd.uwo.ca.

BMC Bioinformatics
|September 17, 2014
PubMed
Summary
This summary is machine-generated.

SAGE is a new de novo genome assembly program that utilizes string-overlap graphs for improved accuracy. It offers advancements in read error correction, graph construction, and scaffolding, outperforming existing assemblers.

More Related Videos

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

9.8K
Automated Robotic Liquid Handling Assembly of Modular DNA Devices
11:22

Automated Robotic Liquid Handling Assembly of Modular DNA Devices

Published on: December 1, 2017

14.2K

Related Experiment Videos

Last Updated: Apr 23, 2026

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

7.3K
Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine
10:40

Comprehensive Workflow for the Genome-wide Identification and Expression Meta-analysis of the ATL E3 Ubiquitin Ligase Gene Family in Grapevine

Published on: December 22, 2017

9.8K
Automated Robotic Liquid Handling Assembly of Modular DNA Devices
11:22

Automated Robotic Liquid Handling Assembly of Modular DNA Devices

Published on: December 1, 2017

14.2K

Area of Science:

  • Bioinformatics
  • Genomics
  • Computational Biology

Background:

  • De novo genome assembly from next-generation sequencing data is a critical bioinformatics challenge.
  • Existing assembly methods require further improvements for biological applications.

Purpose of the Study:

  • To introduce SAGE, a novel program for de novo genome assembly.
  • To present an alternative approach to de Bruijn graph-based assemblers.

Main Methods:

  • SAGE employs a string-overlap graph approach, diverging from common de Bruijn graph methods.
  • Innovations include efficient transitive reduction, generalized edge multiplicity for read copy counts, and min-cost flow for edge merging.
  • Incorporates mate pair information for enhanced assembly.

Main Results:

  • SAGE demonstrated favorable assembly performance on short and medium-sized genomes compared to leading assemblers.
  • The program integrates multiple novel techniques across the assembly pipeline.

Conclusions:

  • SAGE introduces significant innovations in read error correction, graph construction, and scaffolding.
  • These advancements aim to push the state-of-the-art in genomic research.