Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Sieve Analysis and Grading Curves01:19

Sieve Analysis and Grading Curves

327
Sieve analysis is a method used to determine the particle size distribution of aggregate materials. This process involves the following steps:
327
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation01:24

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

448
This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...
448
End Point Prediction: Gran Plot01:07

End Point Prediction: Gran Plot

305
A Gran plot is used to predict the equivalence volume or endpoint of a potentiometric or acid-base titration without reaching the endpoint. Typically, titration data is collected as a function of the titrant's volume up to a point less than the equivalence volume and then transformed into a linear format. The straight line is extended to the x-axis, indicating the necessary titrant volume to achieve the equivalence point.
For potentiometric titration, the Gran plot is created by plotting...
305
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.3K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Design and analysis of individually randomized group-treatment trials with time to event outcomes.

Lifetime data analysis·2025
Same author

Randomized Phase II Cancer Clinical Trials to Validate Predictive Biomarkers.

Biomedicines·2024
Same author

A Dunnett-Type Test and Its Sample Size Calculation for Comparing <i>K</i> ROC Curves with a Control.

Diagnostics (Basel, Switzerland)·2024
Same author

Sample size calculation for comparing two ROC curves.

Pharmaceutical statistics·2024
Same author

Confidence intervals for odds ratio from multistage randomized phase II trials.

Statistics in medicine·2024
Same author

Estimation of the odds ratio from multi-stage randomized trials.

Pharmaceutical statistics·2024
Same journal

Correction: Rao et al. Ensemble Deep-Learning-Based Prognostic and Prediction for Recurrence of Sporadic Odontogenic Keratocysts on Hematoxylin and Eosin Stained Pathological Images of Incisional Biopsies. <i>J. Pers. Med.</i> 2022, <i>12</i>, 1220.

Journal of personalized medicine·2026
Same journal

Three-Dimensional Bronchovascular Modelling in Sublobar Pulmonary Resection: A Tool for Personalised Thoracic Surgery.

Journal of personalized medicine·2026
Same journal

Serum Albumin, Globulin and Albumin-Globulin Ratios as Biomarkers of Clinical Outcomes in COVID-19 Pneumonia.

Journal of personalized medicine·2026
Same journal

New Advances and Perspectives in Ophthalmology: Progress and Modern Challenges Toward Personalized Eye Care.

Journal of personalized medicine·2026
Same journal

Bridging Ancestry-Stratified Bias in Pharmacogenomics AI: Toward Metabolomics-Inclusive Multi-Omics Precision Medicine.

Journal of personalized medicine·2026
Same journal

Hormone-Driven Growth Signaling as a Therapeutic Target in Acute Myeloid Leukemia: Implications for Drug-Resistant Disease.

Journal of personalized medicine·2026
See all related articles

Related Experiment Video

Updated: Jun 18, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

Repeated Sieving for Prediction Model Building with High-Dimensional Data.

Lu Liu1, Sin-Ho Jung1

  • 1Department of Biostatistics and Bioinformatics, Duke University, Durham, NC 27708, USA.

Journal of Personalized Medicine
|July 27, 2024
PubMed
Summary
This summary is machine-generated.

A new repeated sieving method improves patient outcome prediction by selecting fewer, more significant variables than LASSO and Elastic Net. This machine learning approach enhances prediction accuracy and reduces future data collection costs.

Keywords:
Cox regressionROC curvelogistic regressionmachine learningvariable selection

More Related Videos

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

683
Surrogate Model Development for Digital Experiments in Welding
09:17

Surrogate Model Development for Digital Experiments in Welding

Published on: March 28, 2025

736

Related Experiment Videos

Last Updated: Jun 18, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

683
Surrogate Model Development for Digital Experiments in Welding
09:17

Surrogate Model Development for Digital Experiments in Welding

Published on: March 28, 2025

736

Area of Science:

  • Biostatistics
  • Machine Learning
  • Personalized Medicine

Background:

  • Accurate patient outcome prediction is crucial for personalized medicine.
  • High-dimensional data (genomics, EHRs) require effective variable selection for prediction models.
  • Existing methods like LASSO and Elastic Net can over-select features, impacting model accuracy and cost.

Purpose of the Study:

  • To introduce and evaluate a novel machine learning method, repeated sieving, for variable selection in high-dimensional data.
  • To compare the performance of repeated sieving against established methods like LASSO and Elastic Net.
  • To assess the impact of variable selection on prediction accuracy and future data collection costs.

Main Methods:

  • Proposed a repeated sieving machine learning method, extending regression with stepwise variable selection.
  • Compared repeated sieving with LASSO (L1-norm penalty) and Elastic Net (L1/L2-norm penalties).
  • Evaluated methods using extensive numerical studies and real-world data examples.

Main Results:

  • Repeated sieving selected significantly fewer features compared to LASSO and Elastic Net.
  • The proposed method demonstrated higher prediction accuracy than existing machine learning approaches.
  • Numerical studies and real data confirmed the superior performance of repeated sieving.

Conclusions:

  • The repeated sieving method offers superior performance in both variable selection and prediction accuracy for high-dimensional data.
  • This approach effectively addresses the over-selection issue common in other machine learning methods.
  • Repeated sieving reduces the cost associated with future data collection for prediction models.