Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Proteomics01:33

Proteomics

A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term proteomics...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Expanding the human proteome with microproteins and peptideins.

Nature·2026
Same author

An AI-Ready Phosphorylation Meta-Analysis for <i>Saccharomyces cerevisiae</i>.

Journal of proteome research·2026
Same author

A Landscape Analysis of Human SUMOylation.

Molecular & cellular proteomics : MCP·2026
Same author

Automated Metadata Extraction from mzML Files with RunAssessor.

Journal of proteome research·2026
Same author

A Labrador PeptideAtlas and DIA spectral assay library - resources for proteomics research in dogs.

Scientific data·2026
Same author

An expanded reference catalog of translated open reading frames for biomedical research.

Nucleic acids research·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: May 16, 2026

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis
07:51

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Published on: September 26, 2018

Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework.

Steven Lewis1, Attila Csordas, Sarah Killcoyne

  • 1Institute for Systems Biology, Seattle, WA, USA. steven.lewis@systemsbiology.org

BMC Bioinformatics
|December 11, 2012
PubMed
Summary
This summary is machine-generated.

This study introduces a scalable proteomics search engine using Hadoop MapReduce for faster mass spectrometry data analysis. The software efficiently matches spectra against large databases, improving computational performance.

More Related Videos

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools
07:01

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

Related Experiment Videos

Last Updated: May 16, 2026

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis
07:51

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Published on: September 26, 2018

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools
07:01

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins
05:08

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

Area of Science:

  • Proteomics
  • Computational Biology
  • Bioinformatics

Background:

  • Shotgun mass spectrometry proteomics involves computationally intensive spectral matching against large sequence databases.
  • High data generation rates from mass spectrometers necessitate efficient search solutions.
  • Increasing scope of proteomic searches demands improved computational strategies.

Purpose of the Study:

  • To develop a distributed computing solution for accelerating shotgun mass spectrometry based proteomics.
  • To enhance the efficiency of matching spectra against large sequence and post-translational modification databases.

Main Methods:

  • Implementation of a sequence database search engine on the Hadoop MapReduce framework.
  • Utilizing the K-score algorithm for spectral matching.
  • Design and discussion of the architecture for distributed processing.

Main Results:

  • The developed search engine demonstrates efficient performance on the Hadoop MapReduce framework.
  • The K-score algorithm implementation yields comparable results to the original version.
  • Scalability of the system is validated, showing performance improvements with increased resources.

Conclusions:

  • The software is highly scalable for large peptide databases, numerous modifications, and extensive spectra.
  • Performance scales linearly with the number of processors, enabling expanded throughput.
  • The solution effectively addresses the computational challenges in large-scale proteomics data analysis.