Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Genome Annotation and Assembly03:36

Genome Annotation and Assembly

22.3K
The genome refers to all of the genetic material in an organism. It can range from a few million base pairs in microbial cells to several billion base pairs in many eukaryotic organisms. Genome assembly refers to the process of taking the DNA sequencing data and putting it all back together in a correct order to create a close representation of the original genome. This is followed by the identification of functional elements on the newly assembled genome, a process called genome annotation.
22.3K
RNA-seq03:21

RNA-seq

12.7K
RNA sequencing, or RNA-Seq, is a high-throughput sequencing technology used to study the transcriptome of a cell. Transcriptomics helps to interpret the functional elements of a genome and identify the molecular constituents of an organism. Additionally, it also helps in understanding the development of an organism and the occurrence of diseases. 
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
12.7K
Peptide Identification Using Tandem Mass Spectrometry01:33

Peptide Identification Using Tandem Mass Spectrometry

8.8K
Tandem mass spectrometry, also known as MS/MS or MS2, is an analytical technique that employs two mass analyzers. Essentially it is a series of mass spectrometers that helps isolate a particular biomolecule and then helps study its chemical properties.
This technique helps gather information regarding the protein from which the peptide was obtained and to study the peptides’ amino acid sequence. Identifying peptides from a complex mixture is an important component of the growing field of...
8.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Alternol-Induced Oxidative Modification of SQSTM1/p62 Is Associated with Nrf2 Signaling and Autophagy-Related Responses in Prostate Cancer Cells.

Antioxidants (Basel, Switzerland)·2026
Same author

Parameter Efficient Deep Learning Models for Multi-Target Binding Affinity and hERG Cardiotoxicity Prediction.

IEEE transactions on computational biology and bioinformatics·2026
Same author

NbBayesLM: bayesian prediction of nanobody thermostability using protein language model.

Frontiers in bioinformatics·2026
Same author

<i>Special Issue:</i> 13th International Conference on Computational Advances in Bio and Medical Sciences.

Journal of computational biology : a journal of computational molecular cell biology·2026
Same author

A sensitive and specific non-invasive urine biomarker panel for prostate cancer detection.

EBioMedicine·2025
Same author

Bidirectional subsethood of shared marker profiles enables accurate virus classification.

Microbiome·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Apr 18, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.3K

SFA-SPA: a suffix array based short peptide assembler for metagenomic data.

Youngik Yang1, Cuncong Zhong1, Shibu Yooseph1

  • 1Informatics Department, J. Craig Venter Institute, La Jolla, CA 92037, USA.

Bioinformatics (Oxford, England)
|February 1, 2015
PubMed
Summary
This summary is machine-generated.

This study enhances protein sequence reconstruction from metagenomic data. The improved algorithm efficiently assembles proteins from large datasets, aiding microbial community analysis.

More Related Videos

Metagenomic Analysis of Silage
08:43

Metagenomic Analysis of Silage

Published on: January 13, 2017

19.3K
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

6.1K

Related Experiment Videos

Last Updated: Apr 18, 2026

Novel Sequence Discovery by Subtractive Genomics
09:40

Novel Sequence Discovery by Subtractive Genomics

Published on: January 25, 2019

9.3K
Metagenomic Analysis of Silage
08:43

Metagenomic Analysis of Silage

Published on: January 13, 2017

19.3K
Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies
12:08

Hybrid De Novo Genome Assembly for the Generation of Complete Genomes of Urinary Bacteria using Short- and Long-read Sequencing Technologies

Published on: August 20, 2021

6.1K

Area of Science:

  • Metagenomics
  • Computational Biology
  • Bioinformatics

Background:

  • Metagenomic datasets allow the study of microbial community metabolism and functions.
  • Accurate reconstruction of protein sequences is crucial for functional analysis.
  • Previous algorithms faced limitations with large-scale metagenomic data.

Purpose of the Study:

  • To present computational improvements for protein sequence reconstruction from metagenomic data.
  • To enable accurate protein assembly from large metagenomic datasets.
  • To enhance the study of microbial community metabolism and functional roles.

Main Methods:

  • Developed an improved short peptide assembly algorithm.
  • Implemented a suffix array data structure for fast querying.
  • Redesigned assembly steps for multi-threaded execution.

Main Results:

  • Achieved practical reconstruction of proteins from large metagenomic datasets (hundreds of millions of reads).
  • Maintained accuracy in protein sequence reconstruction.
  • Significantly improved computational efficiency of the assembly process.

Conclusions:

  • The enhanced algorithm makes protein reconstruction from large metagenomic datasets feasible.
  • This advancement facilitates deeper insights into microbial community functions and metabolism.
  • The software is available under the GPLv3 license.