Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Variability: Analysis01:11

Variability: Analysis

576
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
576
Genetic Screens02:46

Genetic Screens

5.8K
Genetic screens are tools used to identify genes and mutations responsible for phenotypes of interest. Genetic screens help identify individuals or a group of people at risk of developing  genetic diseases and help them with early intervention, targeted therapy, and reproductive options.
Forward genetic screens
Forward or “classical” genetic screens involve creating random mutations in an organism’s DNA using radiation, mutagens, or insertion of additional bases, which...
5.8K
Factors Affecting Activity Coefficient01:17

Factors Affecting Activity Coefficient

1.7K
The extended Debye-Hückel equation indicates that the activity coefficient of an ion in an aqueous solution at 25°C depends on three partially interdependent properties: the ionic strength of the solution, the charge of the ion, and the ion size. 
The activity coefficient value for an ion is close to one when the solution has almost zero ionic strength, i.e., when the solution shows close to ideal behavior. As the ionic strength of the solution increases from 0 to 0.1 mol/L, a...
1.7K
Biostatistics: Overview01:20

Biostatistics: Overview

945
Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...
945
Multiple Regression01:25

Multiple Regression

4.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
4.2K
Factorial Design02:01

Factorial Design

14.7K
Factorial Analysis is an experimental design that applies Analysis of Variance (ANOVA) statistical procedures to examine a change in a dependent variable due to more than one independent variable, also known as factors. Changes in worker productivity can be reasoned, for example, to be influenced by salary and other conditions, such as skill level. One way to test this hypothesis is by categorizing salary into three levels (low, moderate, and high) and skills sets into two levels (entry level...
14.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Integrative learning of individualized treatment rules from multiple studies with partially overlapping treatments.

Biometrics·2026
Same author

SEMIPARAMETRIC ANALYSIS OF INTERVAL-CENSORED DATA SUBJECT TO INACCURATE DIAGNOSES WITH A TERMINAL EVENT.

The annals of applied statistics·2026
Same author

DYNAMIC CLASSIFICATION OF LATENT DISEASE PROGRESSION WITH AUXILIARY SURROGATE LABELS.

The annals of applied statistics·2026
Same author

Asymptotic Inference for Multi-Stage Stationary Treatment Policy with Variable Selection.

Journal of machine learning research : JMLR·2026
Same author

Data fusion methods for the heterogeneity of treatment effect and confounding function.

Bernoulli : official journal of the Bernoulli Society for Mathematical Statistics and Probability·2026
Same author

Leveraging precision medicine analytics to optimize inflammation reduction and enhance physical function in older adults.

The journals of gerontology. Series A, Biological sciences and medical sciences·2026
Same journal

Modeling Disease-specific Survival in Observational Studies with Missing Cause of Death: Leveraging Information from Clinical Trial Data.

Computational statistics & data analysis·2026
Same journal

A simultaneous confidence-bounded true discovery proportion perspective on localizing differences in smooth terms in regression models.

Computational statistics & data analysis·2025
Same journal

MIXANDMIX: numerical techniques for the computation of empirical spectral distributions of population mixtures.

Computational statistics & data analysis·2024
Same journal

Locally sparse quantile estimation for a partially functional interaction model.

Computational statistics & data analysis·2024
Same journal

Flexible Regularized Estimation in High-Dimensional Mixed Membership Models.

Computational statistics & data analysis·2024
Same journal

GPU Accelerated Estimation of a Shared Random Effect Joint Model for Dynamic Prediction.

Computational statistics & data analysis·2024
See all related articles

Related Experiment Video

Updated: Feb 28, 2026

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.2K

Principal Components Adjusted Variable Screening.

Zhongkai Liu1, Rui Song1, Donglin Zeng2

  • 1Department of Statistics, North Carolina State University, Raleigh, NC, USA.

Computational Statistics & Data Analysis
|June 13, 2017
PubMed
Summary
This summary is machine-generated.

This study introduces a principal components adjusted variable screening method to improve high-dimensional variable selection. The new approach addresses limitations of marginal screening by accounting for omitted predictors, showing superior performance in simulations and real data analyses.

Keywords:
Generalized linear modelsPrincipal componentsSure screeningVariable selection

More Related Videos

Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.3K
Universal Screening for Prevention of Reading, Writing, and Math Disabilities in Spanish
14:43

Universal Screening for Prevention of Reading, Writing, and Math Disabilities in Spanish

Published on: July 18, 2020

8.6K

Related Experiment Videos

Last Updated: Feb 28, 2026

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

8.2K
Computerized Adaptive Testing System of Functional Assessment of Stroke
05:21

Computerized Adaptive Testing System of Functional Assessment of Stroke

Published on: January 7, 2019

6.3K
Universal Screening for Prevention of Reading, Writing, and Math Disabilities in Spanish
14:43

Universal Screening for Prevention of Reading, Writing, and Math Disabilities in Spanish

Published on: July 18, 2020

8.6K

Area of Science:

  • Statistics
  • Bioinformatics
  • Genomics

Background:

  • Marginal screening is a popular high-dimensional variable selection technique.
  • Marginal screening suffers from model misspecification due to ignoring joint predictor effects.

Purpose of the Study:

  • To propose a novel principal components adjusted variable screening method.
  • To enhance variable selection accuracy in generalized linear models for high-dimensional data.

Main Methods:

  • Utilizing top principal components as surrogate covariates.
  • Adjusting for omitted predictor variability in generalized linear models.
  • Evaluating performance via simulations and real-world datasets (gene expression and SNP data).

Main Results:

  • The proposed method demonstrates superior numerical performance compared to existing techniques.
  • Effective variable selection was achieved in high-dimensional genomic datasets.
  • The method accounts for variability from omitted predictors.

Conclusions:

  • Principal components adjusted variable screening offers an effective improvement over traditional marginal screening.
  • This method enhances the reliability of variable selection in complex, high-dimensional biological data.
  • The approach is validated on large-scale genechip and SNP datasets.