Quantile index predictors using R package hyper.gam.

Tingting Zhan¹, Misung Yi², Inna Chervoneva¹

¹Division of Biostatistics & Bioinformatics, Department of Pharmacology, Physiology & Cancer Biology, Sidney Kimmel Medical College, Thomas Jefferson University, Philadelphia, PA 19107, United States.

Bioinformatics (Oxford, England)

July 30, 2025

Summary

This summary is machine-generated.

This study introduces hyper.gam, an R package for discovering functional protein biomarkers from single-cell expression data. It enables the use of entire protein expression distributions for robust biomarker identification.

Area of Science:

Biomedical research
Computational biology
Biostatistics

Background:

Single-cell protein expression analysis is crucial in biomedical research, particularly for phenotyping tumor microenvironment cells.
Functional protein biomarkers require quantitative analysis of expression levels, but methods for utilizing full expression distributions are limited.

Purpose of the Study:

To develop a supervised learning framework for deriving biomarkers from single-cell expression data quantiles.
To provide a user-friendly R package (hyper.gam) for analyzing heterogeneous protein expression levels.

Main Methods:

The hyper.gam R package converts single-cell data into sample quantile functions.
Scalar-on-function regression models are fitted to these quantile functions to estimate an integrand surface.
The estimated surface is then used to compute quantile index predictors for new datasets.
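
The first and last of these steps can be illustrated with a minimal sketch. It is written in plain Python rather than R and does not use or reproduce the hyper.gam API. It assumes that a quantile index is obtained by numerically integrating an integrand surface evaluated along a sample's quantile function; that reading, the function names, the toy surface, and the example values are all hypothetical and chosen only for illustration. In practice the surface would come from the fitted scalar-on-function regression model.

```python
from typing import Callable, List, Sequence


def sample_quantile(values: Sequence[float], p: float) -> float:
    """Linearly interpolated sample quantile at probability p in [0, 1]."""
    if not values:
        raise ValueError("values must be non-empty")
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must lie in [0, 1]")
    ordered = sorted(values)
    position = p * (len(ordered) - 1)  # fractional index into the sorted data
    lower = int(position)
    upper = min(lower + 1, len(ordered) - 1)
    weight = position - lower
    return ordered[lower] * (1.0 - weight) + ordered[upper] * weight


def quantile_function(values: Sequence[float], grid: Sequence[float]) -> List[float]:
    """Evaluate the sample quantile function on a grid of probabilities."""
    return [sample_quantile(values, p) for p in grid]


def quantile_index(
    values: Sequence[float],
    surface: Callable[[float, float], float],
    grid: Sequence[float],
) -> float:
    """Trapezoidal approximation of the integral of surface(p, Q(p)) over the grid.

    `surface` stands in for an already estimated integrand surface (assumption).
    """
    q = quantile_function(values, grid)
    heights = [surface(p, qp) for p, qp in zip(grid, q)]
    total = 0.0
    for i in range(1, len(grid)):
        total += 0.5 * (heights[i - 1] + heights[i]) * (grid[i] - grid[i - 1])
    return total


# Hypothetical probability grid and toy surface, for illustration only.
grid = [i / 100 for i in range(1, 100)]


def toy_surface(p: float, q: float) -> float:
    return p * q  # weights upper quantiles more heavily


# Made-up single-cell expression values for one sample.
cells = [0.2, 1.5, 0.7, 3.1, 0.9, 2.4]
qi = quantile_index(cells, toy_surface, grid)
```

Each sample, however many cells it contains, is reduced to one quantile function on a shared grid and then to a single scalar, which is what allows samples with different cell counts to be compared in a regression model.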

Main Results:

The hyper.gam package offers a supervised learning framework for biomarker discovery using single-cell quantile functions.
It provides tools for estimating integrand surfaces and defining quantile index predictors.
The package includes user-friendly interfaces and visualization tools for exploring results.

Conclusions:

The hyper.gam package facilitates the development of novel functional protein biomarkers by leveraging complete single-cell expression distributions.
The package addresses the need for methods that account for expression heterogeneity in tissues.
It is applicable to various single-cell data types beyond protein expression.