Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Data Validation

Data Validation

Method validation is a crucial process in analytical chemistry designed to confirm that a given method consistently produces reliable and high-quality results. This process is essential when a method is applied to different sample matrices or when procedural modifications are made, ensuring that the results meet acceptable standards across various applications.
Key parameters for method validation include:

Testing a Claim about Standard Deviation

Testing a Claim about Standard Deviation

A complete procedure to test a claim about population standard deviation or population variance is explained here.
The hypothesis testing for the claim of population standard deviation (or variance) requires the data and samples to be random and unbiased. The population distribution also must be normal. There is no specific requirement on the sample size as the estimation is based on the chi-square distribution.
As a first step, the hypothesis (null and alternative) concerning the claim about...

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

Statistical Methods to Analyze Parametric Data: Student t-Test and Goodness-of-Fit Test

In parametric statistics, two fundamental tests stand out for their utility and wide application: the Student's t-test and goodness-of-fit tests. These tests provide researchers with a robust method for drawing insights from data, testing hypotheses, and making informed decisions based on their findings.
The Student's t-test is a statistical test that examines if there is a statistically significant difference between the means of two groups. This test is instrumental when dealing with...

Test for Homogeneity

Test for Homogeneity

The goodness–of–fit test can be used to decide whether a population fits a given distribution, but it will not suffice to decide whether two populations follow the same unknown distribution. A different test, called the test for homogeneity, can be used to conclude whether two populations have the same distribution. To calculate the test statistic for a test for homogeneity, follow the same procedure as with the test of independence. The hypotheses for the test for homogeneity can...

Significance Testing: Overview

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...

Bonferroni Test

Bonferroni Test

The Bonferroni test is a statistical test named after Carlo Emilio Bonferroni, an Italian mathematician best known for Bonferroni inequalities. This statistical test is a type of multiple comparison test to determine which means are different than the rest. Bonferroni test can minimize the Type 1 error by reducing the significance level alpha, which otherwise increases with sample pairs.
The means of different samples are first paired in all possible combinations.
The null hypothesis of the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Developing Drugs for Tissue-Agnostic Indications: A Paradigm Shift in Leveraging Cancer Biology for Precision Medicine.

Clinical pharmacology and therapeutics·2020

Same author

Evidence for BCR/ABL1-positive T-cell acute lymphoblastic leukemia arising in an early lymphoid progenitor cell.

Pediatric blood & cancer·2019

Same author

The Effect of Molecular Weight on Passage of Proteins Through the Blood-Aqueous Barrier.

Investigative ophthalmology & visual science·2019

Same author

Analysis of serum Hsp90 as a potential biomarker of β cell autoimmunity in type 1 diabetes.

PloS one·2019

Same author

Statistical protein quantification and significance analysis in label-free LC-MS experiments with complex designs.

BMC bioinformatics·2012

Same author

Statistical design and analysis of label-free LC-MS proteomic experiments: a case study of coronary artery disease.

Methods in molecular biology (Clifton, N.J.)·2011

Same journal

Elastic functional Cox regression model with shape predictors.

Journal of applied statistics·2026

Same journal

An improved two-stage binary relevance method for multilabel classification.

Journal of applied statistics·2026

Same journal

Classification of multivariate functional data with an application to ADHD fMRI data.

Journal of applied statistics·2026

Same journal

Assessing the performance of longitudinal T-lymphocytes as biomarkers of immune recovery in HIV-infected children with or without TB co-infection.

Journal of applied statistics·2026

Same journal

Sparse long-only Markowitz portfolio optimization.

Journal of applied statistics·2026

Same journal

Homogeneity of multinomial populations when data are classified into a large number of groups.

Journal of applied statistics·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 28, 2025

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

A statistical testing procedure for validating class labels.

Melissa C Key^1,2, Susanne Ragg³, Benzion Boukai⁴

¹Infoscitex, Inc., Dayton, OH, USA.

Journal of Applied Statistics

|June 1, 2023

Summary

This summary is machine-generated.

This study introduces a new method to validate protein identities in proteomics, improving accuracy even with mislabeled data. The procedure effectively identifies and corrects errors in protein classification for reliable results.

Keywords:

Non-parametric classification hypothesis testing machine learning proteomics

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

Related Experiment Videos

Last Updated: Jul 28, 2025

Cross-Modal Multivariate Pattern Analysis

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Automatic Image Processing to Determine the Community Size Structure of Riverine Macroinvertebrates

Published on: January 13, 2023

Area of Science:

Proteomics
Bioinformatics
Statistical Biology

Background:

Label-free shotgun proteomics workflows face challenges in accurately validating protein identities.
Existing methods may struggle with identifying mislabeled instances within complex biological datasets.

Purpose of the Study:

To develop a robust testing procedure for validating protein (class) labels in proteomics.
To identify outlier instances (peptides) misclassified within their assigned protein groups.

Main Methods:

A non-parametric statistical approach is proposed based on the assumption that intra-class distances are smaller than inter-class distances.
The method controls the overall type I error probability across instances within a class.
Theoretical error bounds for type II errors are also investigated.

Main Results:

The procedure effectively reduces the proportion of mislabeled instances, even with up to 25% initial mislabeling.
High specificity is maintained, ensuring accurate classification of correctly labeled instances.
Demonstrated applicability on a real-world proteomics dataset from children with sickle cell disease.

Conclusions:

The developed testing procedure offers a viable solution for validating protein identities in label-free proteomics.
The method enhances data quality and reliability by identifying and correcting misclassified peptides.
This approach has significant implications for accurate biomarker discovery and clinical applications in proteomics.