Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...

Significance Testing: Overview

Significance Testing: Overview

Significance testing is a set of statistical methods used to test whether a claim about a parameter is valid. In analytical chemistry, significance testing is used primarily to determine whether the difference between two values comes from determinate or random errors. The effect of a particular change in the measurement protocol, analyst, or sample itself can cause a deviation from the expected result. In the case of a suspected deviation/outlier, we need to be able to confirm mathematically...

Wald-Wolfowitz Runs Test I

Wald-Wolfowitz Runs Test I

The Wald-Wolfowitz test, also known as the runs test, is a nonparametric statistical test used to assess the randomness of a sequence of two different types of elements (e.g., positive/negative values, successes/failures). It examines whether the order of the elements in a sequence is random or if there is a pattern or trend present. This nonparametric test applies to any ordered data despite the population and sample data distribution, even if a higher sample size is available.
The test works...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Variability: Analysis

Variability: Analysis

Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

TyG Index and Frailty as Composite Biomarkers of Cardiometabolic Risk and Mortality Across CKM Stages 0-3.

Metabolites·2026

Same author

Near-Infrared Fluorescent Probes Targeting LAG-3 for Guiding Immunomodulation and Efficacy Monitoring of Stereotactic Body Radiotherapy in Liver Cancer.

Journal of hepatocellular carcinoma·2026

Same author

Optimal gene panel selection for targeted spatial transcriptomics experiments.

Nucleic acids research·2026

Same author

Decoding Choroid Plexus Pathology in Alzheimer's Disease: A Longitudinal Radiomics Approach for Prodromal Identification and Risk Stratification.

CNS neuroscience & therapeutics·2026

Same author

A chromosome-level genome assembly of Lycoris radiata unveils evolutionary origin of Amaryllidaceae alkaloids and elucidates the complete pathway of galanthamine biosynthesis.

Plant communications·2026

Same author

Creep Characteristics and Damage Constitutive Model of White Sandstone Under Short-Term Freeze-Thaw Cycles.

Materials (Basel, Switzerland)·2026

Same journal

Instrumental Variable Estimation of Marginal Structural Mean Models for Time-Varying Treatment.

Journal of the American Statistical Association·2026

Same journal

Semiparametric Joint Modeling for Survival Analysis with Longitudinal Covariates.

Journal of the American Statistical Association·2026

Same journal

Dimension Reduction for Large-Scale Federated Data: Statistical Rate and Asymptotic Inference.

Journal of the American Statistical Association·2026

Same journal

Facilitating Heterogeneous Effect Estimation via Statistically Efficient Categorical Modifiers.

Journal of the American Statistical Association·2026

Same journal

Nonparametric Density Estimation of a Long-Term Trend from Repeated Semicontinuous Data.

Journal of the American Statistical Association·2026

Same journal

Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Clinicogenomic Data.

Journal of the American Statistical Association·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 26, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

A Model-free Variable Screening Method Based on Leverage Score.

Wenxuan Zhong¹, Yiwen Liu², Peng Zeng³

¹Department of Statistics, University of Georgia, Athens, GA, 30602.

Journal of the American Statistical Association

|June 22, 2023

Summary

This summary is machine-generated.

This study introduces a novel weighted leverage variable screening method for analyzing massive scientific datasets. The method efficiently identifies true predictors in complex models, demonstrating success in gene identification from spatial transcriptome data.

Keywords:

Bayesian information criteria General index model Leverage score Singular value decomposition Variable screening

More Related Videos

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Pooled CRISPR-Based Genetic Screens in Mammalian Cells

Pooled CRISPR-Based Genetic Screens in Mammalian Cells

Published on: September 4, 2019

Related Experiment Videos

Last Updated: Jul 26, 2025

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Pooled CRISPR-Based Genetic Screens in Mammalian Cells

Pooled CRISPR-Based Genetic Screens in Mammalian Cells

Published on: September 4, 2019

Area of Science:

Data science and computational biology.
Statistical learning and machine learning applications in scientific research.

Background:

Massive datasets in science necessitate efficient data analysis methods.
Conventional statistical learning techniques face computational challenges with large sample sizes and numerous predictors.
Leverage score sampling has shown promise for linear regression but not for variable selection.

Purpose of the Study:

To propose a novel weighted leverage variable screening method for effective variable selection in large-scale datasets.
To extend the application of leverage score sampling beyond linear regression to general index models.
To address the computational challenges in extracting meaningful information from massive scientific data.

Main Methods:

Development of a weighted leverage variable screening method utilizing both left and right singular vectors of the design matrix.
Theoretical analysis to demonstrate the consistency of selected predictors.
Empirical validation through extensive simulation studies and application to real-world biological data.

Main Results:

The proposed method consistently includes true predictors for both linear and general index models.
Weighted leverage screening is shown to be computationally efficient and effective.
Successful identification of carcinoma-related genes using spatial transcriptome data.

Conclusions:

The weighted leverage variable screening method offers a computationally efficient and effective approach for variable selection in massive datasets.
This method advances the application of leverage score sampling for complex statistical modeling.
The approach has practical implications for biological data analysis, such as identifying disease-related genes.