Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Genome Annotation and Assembly

Genome Annotation and Assembly

The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.

RNA-seq

RNA-seq

RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases.
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...

Peptide Identification Using Tandem Mass Spectrometry

Peptide Identification Using Tandem Mass Spectrometry

Tandem mass spectrometry, also known as MS/MS or MS2, is an analytical technique that employs two mass analyzers. Essentially it is a series of mass spectrometers that helps isolate a particular biomolecule and then helps study its chemical properties.
This technique helps gather information regarding the protein from which the peptide was obtained and to study the peptides’ amino acid sequence. Identifying peptides from a complex mixture is an important component of the growing field of...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Alternol-Induced Oxidative Modification of SQSTM1/p62 Is Associated with Nrf2 Signaling and Autophagy-Related Responses in Prostate Cancer Cells.

Antioxidants (Basel, Switzerland)·2026

Same author

Parameter Efficient Deep Learning Models for Multi-Target Binding Affinity and hERG Cardiotoxicity Prediction.

IEEE transactions on computational biology and bioinformatics·2026

Same author

NbBayesLM: bayesian prediction of nanobody thermostability using protein language model.

Frontiers in bioinformatics·2026

Same author

<i>Special Issue:</i> 13th International Conference on Computational Advances in Bio and Medical Sciences.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same author

A sensitive and specific non-invasive urine biomarker panel for prostate cancer detection.

EBioMedicine·2025

Same author

Bidirectional subsethood of shared marker profiles enables accurate virus classification.

Microbiome·2025

Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026

Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026

Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026

Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026

Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026

Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Apr 18, 2026

Novel Sequence Discovery by Subtractive Genomics

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

SFA-SPA: a suffix array based short peptide assembler for metagenomic data.

Youngik Yang¹, Cuncong Zhong¹, Shibu Yooseph¹

¹Informatics Department, J. Craig Venter Institute, La Jolla, CA 92037, USA.

Bioinformatics (Oxford, England)

|February 1, 2015

Summary

This summary is machine-generated.

This study enhances protein sequence reconstruction from metagenomic data. The improved algorithm efficiently assembles proteins from large datasets, aiding microbial community analysis.

More Related Videos

Metagenomic Analysis of Silage

Metagenomic Analysis of Silage

Published on: January 13, 2017

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

Related Experiment Videos

Last Updated: Apr 18, 2026

Novel Sequence Discovery by Subtractive Genomics

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

Metagenomic Analysis of Silage

Metagenomic Analysis of Silage

Published on: January 13, 2017

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

Area of Science:

Metagenomics
Computational Biology
Bioinformatics

Background:

Metagenomic datasets allow the study of microbial community metabolism and functions.
Accurate reconstruction of protein sequences is crucial for functional analysis.
Previous algorithms faced limitations with large-scale metagenomic data.

Purpose of the Study:

To present computational improvements for protein sequence reconstruction from metagenomic data.
To enable accurate protein assembly from large metagenomic datasets.
To enhance the study of microbial community metabolism and functional roles.

Main Methods:

Developed an improved short peptide assembly algorithm.
Implemented a suffix array data structure for fast querying.
Redesigned assembly steps for multi-threaded execution.

Main Results:

Achieved practical reconstruction of proteins from large metagenomic datasets (hundreds of millions of reads).
Maintained accuracy in protein sequence reconstruction.
Significantly improved computational efficiency of the assembly process.

Conclusions:

The enhanced algorithm makes protein reconstruction from large metagenomic datasets feasible.
This advancement facilitates deeper insights into microbial community functions and metabolism.
The software is available under the GPLv3 license.