Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Tagging and Fusion Proteins01:24

Tagging and Fusion Proteins

Proteins are involved in several cellular processes and biochemical reactions. Analyzing a specific protein of interest requires it to be isolated from the other proteins in the cell. This is achieved by overexpressing the specific gene in a suitable host to produce large quantities of the target protein. A tag or label is recombined with the gene to produce a fusion protein containing the target protein and the tag. The tags on these fusion proteins can then be used for easy detection and...
Proteomics01:33

Proteomics

A proteome is the entire set of proteins that a cell type produces. We can study proteomes using the knowledge of genomes because genes code for mRNAs, and the mRNAs encode proteins. Although mRNA analysis is a step in the right direction, not all mRNAs are translated into proteins.
Proteomics is the study of proteomes' function. It involves the large-scale systematic study of the proteome to denote the protein complement expressed by a genome. Scientist Mark Wilkins coined the term proteomics...
Protein Networks02:26

Protein Networks

An organism can have thousands of different proteins, and these proteins must cooperate to ensure the health of an organism. Proteins bind to other proteins and form complexes to carry out their functions. Many proteins interact with multiple other proteins creating a complex network of protein interactions.
These interactions can be represented through maps depicting protein-protein interaction networks, represented as nodes and edges. Nodes are circles that are representative of a protein,...
Conservation of Protein Domains Over Different Proteins02:26

Conservation of Protein Domains Over Different Proteins

Protein domains are small structurally independent units that are part of a single amino acid chain.  Although these domains are often structurally independent, they may rely on synergistic effects to perform their functions as part of a larger protein. Protein domains may be conserved within the same organism, as well as across different organisms.
A limited set of protein domains often duplicate and recombine during evolution. These domains can be organized in different combinations to form...
Protein-protein Interfaces02:04

Protein-protein Interfaces

Many proteins form complexes to carry out their functions, making protein-protein interactions (PPIs) essential for an organism's survival. Most PPIs are stabilized by numerous weak noncovalent chemical forces. The physical shape of the interfaces determines the way two proteins interact. Many globular proteins have closely-matching shapes on their surfaces, which form a large number of weak bonds. Additionally, many PPIs occur between two helices or between a surface cleft and a polypeptide...
Immunoprecipitation01:20

Immunoprecipitation

Immunoprecipitation, or IP, is a widely used technique that employs protein-antibody interactions to isolate proteins or protein complexes in their native state for studying protein-protein interactions, quaternary structures, or supramolecular complexes. Various modifications of the technique, including chromatin IP, cross-linking IP, and fluorescence IP, are commonly used.
Chromatin Immunoprecipitation
Chromatin immunoprecipitation, also known as ChIP, is used to study protein-DNA or...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Orchestrator multi-agent clinical decision support system for secondary headache diagnosis in primary care.

Journal of the American Medical Informatics Association : JAMIA·2026
Same author

The Effect of Enforcing Fairness on Reshaping Explanations in Machine Learning Models.

AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science·2026
Same author

A multimodal generative model for structured and unstructured electronic health records.

npj health systems·2026
Same author

Predicting the timing of first sustained cognitive worsening in Alzheimer's disease using real-world clinical data and machine learning.

medRxiv : the preprint server for health sciences·2026
Same author

i2b2-ML: module to facilitate machine learning in the informatics for integrating biology and the bedside platform.

JAMIA open·2026
Same author

Long COVID Persistence and Surveillance Gaps Across 58 US Hospitals.

JAMA network open·2026
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 20, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

Knowledge-based variable selection for learning rules from proteomic data.

Jonathan L Lustgarten1, Shyam Visweswaran, Robert P Bowser

  • 1Department of Biomedical Informatics, University of Pittsburgh, 200 Meyran Ave, Parkvale M-183, Pittsburgh, PA, USA. JLL47@pitt.edu

BMC Bioinformatics
|September 19, 2009
PubMed
Summary
This summary is machine-generated.

This study introduces a novel method using a proteomic knowledge base to improve disease biomarker identification from mass spectrometry data. The approach significantly enhances rule-learning algorithm performance by reducing data dimensionality.

More Related Videos

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools
07:01

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Related Experiment Videos

Last Updated: Jun 20, 2026

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports
07:35

A Knowledge Graph Approach to Elucidate the Role of Organellar Pathways in Disease via Biomedical Reports

Published on: October 13, 2023

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools
07:01

Navigating the Mass Spectrometry-Based Proteomic Data Using Free Computational Tools

Published on: August 19, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Biomedical data analysis
  • Proteomics
  • Computational biology

Background:

  • Biomedical data analysis can be improved by incorporating biological knowledge.
  • High-dimensional proteomic mass spectral data presents challenges for identifying disease biomarkers.
  • Existing methods may not fully leverage biological information for biomarker discovery.

Purpose of the Study:

  • To present a novel method for enhancing rule-learning algorithm performance in identifying disease biomarkers.
  • To utilize a proteomic knowledge base for pre-processing high-dimensional proteomic mass spectral data.
  • To improve the accuracy and efficiency of biomarker discovery.

Main Methods:

  • Development and application of a novel method integrating a proteomic knowledge base with a rule-learning algorithm.
  • Utilizing the Empirical Proteomics Ontology Knowledge Base (EPO-KB) for selecting relevant m/zs (mass-to-charge ratios) in proteomic datasets.
  • Employing EPO-KB as a pre-processing step to reduce data dimensionality before analysis.

Main Results:

  • The knowledge-based pre-processing method, using EPO-KB, significantly reduced data dimensionality by 95%.
  • This approach resulted in a statistically significant increase in performance compared to no variable selection and random variable selection.
  • The method demonstrated enhanced identification of putative disease biomarkers from proteomic mass spectral data.

Conclusions:

  • Knowledge-based variable selection, even with a limited resource like EPO-KB, boosts the performance of rule-learning algorithms.
  • The proposed method offers a viable strategy for disease classification using high-dimensional proteomic data.
  • Integrating biological knowledge is crucial for advancing biomarker discovery in proteomics.