Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K
Weighted Mean00:57

Weighted Mean

5.6K
While taking the arithmetic, geometric, or harmonic mean of a sample data set, equal importance is assigned to all the data points. However, all the values may not always be equally important in some data sets. An intrinsic bias might make it more important to give more weightage to specific values over others.
For example, consider the number of goals scored in the matches of a tournament. While computing the average number of goals scored in the tournament, it may be more important to...
5.6K
Regression Analysis01:11

Regression Analysis

6.2K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.2K
Multiple Regression01:25

Multiple Regression

3.2K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.2K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

2.3K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
2.3K
Survival Tree01:19

Survival Tree

172
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
172

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Association of Fetal Gene Regulatory Gene Deletions With Poor Cognition in Schizophrenia and Community-Based Samples.

The American journal of psychiatry·2026
Same author

Exploring Group Differences in Attenuated Symptoms and Functioning Between Clinical High-Risk Individuals With and Without Posttraumatic Stress Disorder.

Early intervention in psychiatry·2026
Same author

Cannabis and tobacco co-use predicts psychosis in clinical high risk cohorts.

Nature. Mental health·2026
Same author

Tobacco Use is Related to Parietal-Hippocampal Connectivity in People at Clinical High Risk for Psychosis.

medRxiv : the preprint server for health sciences·2026
Same author

Electroencephalography Microstate Instability and Clinical Outcomes in Individuals at Clinical High Risk of Psychosis.

JAMA psychiatry·2026
Same author

Premorbid adjustment problems, negative symptoms, and cognitive impairment in a large international sample at clinical high risk for psychosis: Findings from the Accelerating Medicines Partnership-Schizophrenia.

Schizophrenia bulletin·2026
Same journal

Therapeutic potential of crude protein extracts from two Egyptian freshwater snails Lanistes carinatus and Bellamya unicolor.

Scientific reports·2026
Same journal

Microbial contamination of donor corneas and post-keratoplasty endophthalmitis: a comparison between Japanese and U.S. eye banks using cold storage.

Scientific reports·2026
Same journal

Prevalence and contributing factors of virological non-suppression among adult patients on first-line antiretroviral therapy in tertiary hospitals in Ethiopia.

Scientific reports·2026
Same journal

An in vitro comparison of color stability between alkasite and different restorative materials in various staining solutions.

Scientific reports·2026
Same journal

Toward accessible mRNA LNP formulation: systematic evaluation of mixing strategies and key parameters.

Scientific reports·2026
Same journal

A network analysis of personality traits, mentalizing, and psychological health in Chinese college students.

Scientific reports·2026
See all related articles

Related Experiment Video

Updated: Sep 28, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K

A greedy regression algorithm with coarse weights offers novel advantages.

Clark D Jeffries1, John R Ford2, Jeffrey L Tilson3

  • 1Renaissance Computing Institute, University of North Carolina, Chapel Hill, NC, USA. clark_jeffries@med.unc.edu.

Scientific Reports
|April 1, 2022
PubMed
Summary
This summary is machine-generated.

We introduce a novel Coarse Approximation Linear Function (CALF) for building simple yet powerful predictive models. CALF uses +1 or -1 weights to efficiently select key predictors, improving model stability and handling collinearity.

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.3K

Related Experiment Videos

Last Updated: Sep 28, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.7K
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.0K
Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.3K

Area of Science:

  • Statistics
  • Machine Learning
  • Bioinformatics

Background:

  • Regularized regression is a standard method for identifying predictors of outcomes.
  • Existing methods can be sensitive to data changes and struggle with collinear variables.

Purpose of the Study:

  • To present a novel Coarse Approximation Linear Function (CALF) for predictor selection and predictive modeling.
  • To develop a method that creates simple, powerful models with robust predictor weighting.

Main Methods:

  • CALF is a linear regression strategy applied to normalized data.
  • It utilizes non-zero weights of +1 or -1.
  • Optimization targets qualitative metrics like p-value, AUC, or Pearson correlation.

Main Results:

  • CALF effectively selects important predictors and builds simple, powerful models.
  • The method demonstrates stability, with weights largely unaffected by small data changes.
  • CALF robustly handles collinear or nearly collinear predictors by selecting at most one.

Conclusions:

  • CALF offers a frugal approach to predictor selection and predictive modeling.
  • Its use of fixed weights and handling of collinearity enhance model interpretability and stability.
  • CALF provides a valuable alternative for risk prediction modeling, especially when qualitative metrics are prioritized.