Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Data Validation01:15

Data Validation

194
Method validation is a crucial process in analytical chemistry designed to confirm that a given method consistently produces reliable and high-quality results. This process is essential when a method is applied to different sample matrices or when procedural modifications are made, ensuring that the results meet acceptable standards across various applications.
Key parameters for method validation include:
194
Testing a Claim about Standard Deviation01:19

Testing a Claim about Standard Deviation

2.5K
A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...
2.5K
Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test01:09

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

1.7K
In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...
1.7K
Test for Homogeneity01:23

Test for Homogeneity

2.0K
The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...
2.0K
Significance Testing: Overview01:04

Significance Testing: Overview

3.4K
Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...
3.4K
Bonferroni Test01:10

Bonferroni Test

2.8K
The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...
2.8K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Developing Drugs for Tissue-Agnostic Indications: A Paradigm Shift in Leveraging Cancer Biology for Precision Medicine.

Clinical pharmacology and therapeutics·2020
Same author

Evidence for BCR/ABL1-positive T-cell acute lymphoblastic leukemia arising in an early lymphoid progenitor cell.

Pediatric blood & cancer·2019
Same author

The Effect of Molecular Weight on Passage of Proteins Through the Blood-Aqueous Barrier.

Investigative ophthalmology & visual science·2019
Same author

Analysis of serum Hsp90 as a potential biomarker of β cell autoimmunity in type 1 diabetes.

PloS one·2019
Same author

Statistical protein quantification and significance analysis in label-free LC-MS experiments with complex designs.

BMC bioinformatics·2012
Same author

Statistical design and analysis of label-free LC-MS proteomic experiments: a case study of coronary artery disease.

Methods in molecular biology (Clifton, N.J.)·2011
Same journal

Elastic functional Cox regression model with shape predictors.

Journal of applied statistics·2026
Same journal

An improved two-stage binary relevance method for multilabel classification.

Journal of applied statistics·2026
Same journal

Classification of multivariate functional data with an application to ADHD fMRI data.

Journal of applied statistics·2026
Same journal

Assessing the performance of longitudinal T-lymphocytes as biomarkers of immune recovery in HIV-infected children with or without TB co-infection.

Journal of applied statistics·2026
Same journal

Sparse long-only Markowitz portfolio optimization.

Journal of applied statistics·2026
Same journal

Homogeneity of multinomial populations when data are classified into a large number of groups.

Journal of applied statistics·2026
See all related articles

Related Experiment Video

Updated: Jul 28, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K

A statistical testing procedure for validating class labels.

Melissa C Key1,2, Susanne Ragg3, Benzion Boukai4

  • 1Infoscitex, Inc., Dayton, OH, USA.

Journal of Applied Statistics
|June 1, 2023
PubMed
Summary
This summary is machine-generated.

This study introduces a new method to validate protein identities in proteomics, improving accuracy even with mislabeled data. The procedure effectively identifies and corrects errors in protein classification for reliable results.

Keywords:
Non-parametricclassificationhypothesis testingmachine learningproteomics

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.2K

Related Experiment Videos

Last Updated: Jul 28, 2025

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.0K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.0K
Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates
08:56

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

2.2K

Area of Science:

  • Proteomics
  • Bioinformatics
  • Statistical Biology

Background:

  • Label-free shotgun proteomics workflows face challenges in accurately validating protein identities.
  • Existing methods may struggle with identifying mislabeled instances within complex biological datasets.

Purpose of the Study:

  • To develop a robust testing procedure for validating protein (class) labels in proteomics.
  • To identify outlier instances (peptides) misclassified within their assigned protein groups.

Main Methods:

  • A non-parametric statistical approach is proposed based on the assumption that intra-class distances are smaller than inter-class distances.
  • The method controls the overall type I error probability across instances within a class.
  • Theoretical error bounds for type II errors are also investigated.

Main Results:

  • The procedure effectively reduces the proportion of mislabeled instances, even with up to 25% initial mislabeling.
  • High specificity is maintained, ensuring accurate classification of correctly labeled instances.
  • Demonstrated applicability on a real-world proteomics dataset from children with sickle cell disease.

Conclusions:

  • The developed testing procedure offers a viable solution for validating protein identities in label-free proteomics.
  • The method enhances data quality and reliability by identifying and correcting misclassified peptides.
  • This approach has significant implications for accurate biomarker discovery and clinical applications in proteomics.