Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Proteomics

Proteomics

A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term proteomics...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Expanding the human proteome with microproteins and peptideins.

Nature·2026

Same author

An AI-Ready Phosphorylation Meta-Analysis for <i>Saccharomyces cerevisiae</i>.

Journal of proteome research·2026

Same author

A Landscape Analysis of Human SUMOylation.

Molecular & cellular proteomics : MCP·2026

Same author

Automated Metadata Extraction from mzML Files with RunAssessor.

Journal of proteome research·2026

Same author

A Labrador PeptideAtlas and DIA spectral assay library - resources for proteomics research in dogs.

Scientific data·2026

Same author

An expanded reference catalog of translated open reading frames for biomedical research.

Nucleic acids research·2026

Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026

Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026

Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026

Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026

Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026

Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: May 16, 2026

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Published on: September 26, 2018

Hydra: a scalable proteomic search engine which utilizes the Hadoop distributed computing framework.

Steven Lewis¹, Attila Csordas, Sarah Killcoyne

¹Institute for Systems Biology, Seattle, WA, USA. steven.lewis@systemsbiology.org

BMC Bioinformatics

|December 11, 2012

Summary

This summary is machine-generated.

This study introduces a scalable proteomics search engine using Hadoop MapReduce for faster mass spectrometry data analysis. The software efficiently matches spectra against large databases, improving computational performance.

More Related Videos

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

Related Experiment Videos

Last Updated: May 16, 2026

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Hydra, a Computer-Based Platform for Aiding Clinicians in Cardiovascular Analysis and Diagnosis

Published on: September 26, 2018

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Application of I TASSER, trRosetta, UCSF Chimera, HADDOCK server, and HEX loria for De Novo and In Silico Design of Proteins

Published on: July 8, 2025

Area of Science:

Proteomics
Computational Biology
Bioinformatics

Background:

Shotgun mass spectrometry proteomics involves computationally intensive spectral matching against large sequence databases.
High data generation rates from mass spectrometers necessitate efficient search solutions.
Increasing scope of proteomic searches demands improved computational strategies.

Purpose of the Study:

To develop a distributed computing solution for accelerating shotgun mass spectrometry based proteomics.
To enhance the efficiency of matching spectra against large sequence and post-translational modification databases.

Main Methods:

Implementation of a sequence database search engine on the Hadoop MapReduce framework.
Utilizing the K-score algorithm for spectral matching.
Design and discussion of the architecture for distributed processing.

Main Results:

The developed search engine demonstrates efficient performance on the Hadoop MapReduce framework.
The K-score algorithm implementation yields comparable results to the original version.
Scalability of the system is validated, showing performance improvements with increased resources.

Conclusions:

The software is highly scalable for large peptide databases, numerous modifications, and extensive spectra.
Performance scales linearly with the number of processors, enabling expanded throughput.
The solution effectively addresses the computational challenges in large-scale proteomics data analysis.