Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.1K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.1K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

729
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
729
Truncation in Survival Analysis01:09

Truncation in Survival Analysis

322
Truncation in survival analysis refers to the exclusion of individuals or events from the dataset based on specific criteria related to the time of the event. This exclusion can happen in two primary forms: left truncation and right truncation.
Left truncation occurs when individuals who experienced the event of interest before a certain time are not included in the study. This is often due to a "delayed entry" into the study where only those who survive until a certain entry point are...
322
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

153
Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
153
Parameters Affecting Nonlinear Elimination: Zero-Order Input, First-Order Absorption and Two-Compartment Model01:13

Parameters Affecting Nonlinear Elimination: Zero-Order Input, First-Order Absorption and Two-Compartment Model

127
Drugs administered through various routes can lead to nonlinear elimination, resulting in complex pharmacokinetic behaviors crucial to understanding efficacious drug dosing.
When a drug is administered through a constant intravenous infusion and eliminated via nonlinear pharmacokinetics, it follows zero-order input. For example, oral drugs undergo first-order absorption upon administration and are eliminated through nonlinear pharmacokinetics.
In the case of subcutaneously administered drugs,...
127
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

308
Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures...
308

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

A pilot study of magnetic resonance fingerprinting and radiomics analysis in autosomal dominant polycystic kidney disease.

Kidney international·2026
Same author

Creation of a Merged Harmonized Large Database of Subject-Level Data from Acute Secondary Prevention Studies in Minor Non-Cardioembolic Stroke or TIA.

International journal of cerebrovascular disease and stroke·2026
Same author

POLQ-driven repair scars shape the immunogenic landscape of homologous recombination-deficient pancreatic cancer.

bioRxiv : the preprint server for biology·2026
Same author

Door-in-door-out times and outcomes in patients with acute ischaemic stroke transferred for endovascular therapy in the USA: a retrospective cohort study.

The Lancet. Neurology·2026
Same author

Letting Biology Validate the EPR Imaging of Tumor pO<sub>2</sub>.

Advances in experimental medicine and biology·2026
Same author

Impact of fine particulate matter air pollution (PM<sub>2·5</sub>) on Nigerian children's performance on tests of cognitive and neurobehavioral development at age seven years.

Environment international·2025
Same journal

Asymptotic online FWER control for dependent test statistics.

Statistical methods in medical research·2026
Same journal

Regression analysis of misclassified current status data with potentially unknown test accuracy.

Statistical methods in medical research·2026
Same journal

Bayesian multivariate linear mixed-effects models with varied association structures.

Statistical methods in medical research·2026
Same journal

Inference about the ratio of age-standardized rates between two overlapping populations.

Statistical methods in medical research·2026
Same journal

A robust neural network with random effects for subject-specific prediction of clustered count data.

Statistical methods in medical research·2026
Same journal

A comparison of methods for designing hybrid type 2 cluster-randomized trials with continuous effectiveness and implementation endpoints.

Statistical methods in medical research·2026
See all related articles

Related Experiment Video

Updated: Sep 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K

Two-stage subsampling variable selection for sparse high-dimensional generalized linear models.

Marinela Capanu1, Mihai Giurcanu2, Colin B Begg1

  • 1Memorial Sloan Kettering Cancer Center, NYC, NY, USA.

Statistical Methods in Medical Research
|July 2, 2025
PubMed
Summary
This summary is machine-generated.

This study introduces a novel two-stage subsampling method for variable selection in high-dimensional generalized linear models. The approach effectively identifies true predictors, improving model accuracy and reducing false positives in omics data analysis.

Keywords:
Subsamplinghigh-dimensional regressionpartial least squares regressionsmoothly clipped absolute deviancestability selectionvariable selection

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

915
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Related Experiment Videos

Last Updated: Sep 17, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

915
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.6K

Area of Science:

  • High-dimensional data analysis
  • Statistical modeling
  • Bioinformatics

Background:

  • Model selection in high-dimensional data, particularly omics data, remains a significant challenge.
  • Existing methods offer room for improvement in accuracy and efficiency.

Purpose of the Study:

  • To propose a novel two-stage subsampling approach for variable selection in high-dimensional generalized linear regression models.
  • To enhance the accuracy and reliability of variable selection in complex datasets.

Main Methods:

  • A two-stage subsampling strategy combining smoothly clipped absolute deviance (SCAD) penalty regularization and partial least squares (PLS) regression.
  • Stage 1: Variable screening using SCAD and PLS on repeated subsamples.
  • Stage 2: Refined variable selection using Akaike information criterion (AIC) on reduced predictor sets from Stage 1.

Main Results:

  • The proposed method demonstrates superior performance compared to existing approaches in simulation studies.
  • Achieved high probability of selecting the true model with a low number of false positives.
  • Proven n1/2-consistency for the first-stage estimator.

Conclusions:

  • The two-stage subsampling approach offers a robust and effective solution for variable selection in high-dimensional settings.
  • The method is applicable to various regression models including logistic, Poisson, and linear regression.
  • Successfully illustrated on gene expression cancer datasets, highlighting its practical utility.