Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Sieve Analysis and Grading Curves

Sieve Analysis and Grading Curves

Sieve analysis is a method used to determine the particle size distribution of aggregate materials. This process involves the following steps:

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Multiple Regression

Multiple Regression

Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

End Point Prediction: Gran Plot

End Point Prediction: Gran Plot

A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...

Residuals and Least-Squares Property

Residuals and Least-Squares Property

The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Design and analysis of individually randomized group-treatment trials with time to event outcomes.

Lifetime data analysis·2025

Same author

Randomized Phase II Cancer Clinical Trials to Validate Predictive Biomarkers.

Biomedicines·2024

Same author

A Dunnett-Type Test and Its Sample Size Calculation for Comparing <i>K</i> ROC Curves with a Control.

Diagnostics (Basel, Switzerland)·2024

Same author

Sample size calculation for comparing two ROC curves.

Pharmaceutical statistics·2024

Same author

Confidence intervals for odds ratio from multistage randomized phase II trials.

Statistics in medicine·2024

Same author

Estimation of the odds ratio from multi-stage randomized trials.

Pharmaceutical statistics·2024

Same journal

Correction: Rao et al. Ensemble Deep-Learning-Based Prognostic and Prediction for Recurrence of Sporadic Odontogenic Keratocysts on Hematoxylin and Eosin Stained Pathological Images of Incisional Biopsies. <i>J. Pers. Med.</i> 2022, <i>12</i>, 1220.

Journal of personalized medicine·2026

Same journal

Three-Dimensional Bronchovascular Modelling in Sublobar Pulmonary Resection: A Tool for Personalised Thoracic Surgery.

Journal of personalized medicine·2026

Same journal

Serum Albumin, Globulin and Albumin-Globulin Ratios as Biomarkers of Clinical Outcomes in COVID-19 Pneumonia.

Journal of personalized medicine·2026

Same journal

New Advances and Perspectives in Ophthalmology: Progress and Modern Challenges Toward Personalized Eye Care.

Journal of personalized medicine·2026

Same journal

Bridging Ancestry-Stratified Bias in Pharmacogenomics AI: Toward Metabolomics-Inclusive Multi-Omics Precision Medicine.

Journal of personalized medicine·2026

Same journal

Hormone-Driven Growth Signaling as a Therapeutic Target in Acute Myeloid Leukemia: Implications for Drug-Resistant Disease.

Journal of personalized medicine·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 18, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Repeated Sieving for Prediction Model Building with High-Dimensional Data.

Lu Liu¹, Sin-Ho Jung¹

¹Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27708, USA.

Journal of Personalized Medicine

|July 27, 2024

Summary

This summary is machine-generated.

A new repeated sieving method improves patient outcome prediction by selecting fewer, more significant variables than LASSO and Elastic Net. This machine learning approach enhances prediction accuracy and reduces future data collection costs.

Keywords:

Cox regression ROC curve logistic regression machine learning variable selection

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Surrogate Model Development for Digital Experiments in Welding

Surrogate Model Development for Digital Experiments in Welding

Published on: March 28, 2025

Related Experiment Videos

Last Updated: Jun 18, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Surrogate Model Development for Digital Experiments in Welding

Surrogate Model Development for Digital Experiments in Welding

Published on: March 28, 2025

Area of Science:

Biostatistics
Machine Learning
Personalized Medicine

Background:

Accurate patient outcome prediction is crucial for personalized medicine.
High-dimensional data (genomics, EHRs) require effective variable selection for prediction models.
Existing methods like LASSO and Elastic Net can over-select features, impacting model accuracy and cost.

Purpose of the Study:

To introduce and evaluate a novel machine learning method, repeated sieving, for variable selection in high-dimensional data.
To compare the performance of repeated sieving against established methods like LASSO and Elastic Net.
To assess the impact of variable selection on prediction accuracy and future data collection costs.

Main Methods:

Proposed a repeated sieving machine learning method, extending regression with stepwise variable selection.
Compared repeated sieving with LASSO (L1-norm penalty) and Elastic Net (L1/L2-norm penalties).
Evaluated methods using extensive numerical studies and real-world data examples.

Main Results:

Repeated sieving selected significantly fewer features compared to LASSO and Elastic Net.
The proposed method demonstrated higher prediction accuracy than existing machine learning approaches.
Numerical studies and real data confirmed the superior performance of repeated sieving.

Conclusions:

The repeated sieving method offers superior performance in both variable selection and prediction accuracy for high-dimensional data.
This approach effectively addresses the over-selection issue common in other machine learning methods.
Repeated sieving reduces the cost associated with future data collection for prediction models.