Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Regression Analysis01:11

Regression Analysis

5.6K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.6K
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

40
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
40
Multiple Regression01:25

Multiple Regression

2.9K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
2.9K
Prediction Intervals01:03

Prediction Intervals

2.2K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
2.2K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.3K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Longitudinal study of genome-wide DNA methylation in individuals with and without post-acute symptoms following SARS-CoV-2 infection.

Epigenomics·2026
Same author

Statistical methods to disentangle genetic effects influencing infertility and early fetal viability with a genome-wide application.

PLoS genetics·2025
Same author

Maternal and Fetal SERPINA3 Polymorphisms and Risk of Preeclampsia: A Dyad and Triad Based Case-Control Study.

Current issues in molecular biology·2025
Same author

Methylation differences between assisted reproductive technology-conceived and naturally conceived children near <i>BRCA1</i> and <i>NBR2</i>.

Epigenetics·2025
Same author

Extensive cross-species transmission of pathogens and antibiotic resistance genes in mammals neglected by public health surveillance.

Cell·2025
Same author

Maternal Immune-Mediated Conditions and ADHD Risk in Offspring.

Research square·2025
Same journal

OpenIMC: an open-source platform for analyzing single-cell and spatial proteomics by imaging mass cytometry.

BMC bioinformatics·2026
Same journal

NAP: an open source pipeline for cross-domain microbiome profiling using Nanopore sequencing-derived amplicon data.

BMC bioinformatics·2026
Same journal

SurvGME: an R package for survival analysis with graphical and measurement error models.

BMC bioinformatics·2026
Same journal

SimMapNet: a Bayesian framework for gene regulatory network inference using gene ontology similarities as external hint.

BMC bioinformatics·2026
Same journal

Dual channel drug-drug interactions extraction based on cross attention.

BMC bioinformatics·2026
Same journal

FeSseqdb: a curated sequence-level database and interpretable machine learning framework for identifying iron-sulfur proteins.

BMC bioinformatics·2026
See all related articles

Related Experiment Video

Updated: Jun 4, 2025

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K

MinLinMo: a minimalist approach to variable selection and linear model prediction.

Jon Bohlin1,2, Siri E Håberg3,4, Per Magnus3

  • 1Department of Method Development and Analytics, Section for modeling and bioinformatics, Norwegian Institute of Public Health, Oslo, Norway. Jon.Bohlin@fhi.no.

BMC Bioinformatics
|December 19, 2024
PubMed
Summary
This summary is machine-generated.

The MinLinMo software creates small, efficient linear prediction models from complex data. These parsimonious models achieve comparable performance to larger ones, simplifying causal inference and reducing computational demands.

Keywords:
Machine learningParsimonious linear modelsVariable selection

More Related Videos

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.5K
O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.5K

Related Experiment Videos

Last Updated: Jun 4, 2025

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients
07:34

Probing the Limits of Egg Recognition Using Egg Rejection Experiments Along Phenotypic Gradients

Published on: August 22, 2018

8.2K
A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
12:18

A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

Published on: January 11, 2020

7.5K
O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression
06:50

O-cresol Concentration Online Measurement Based On Near Infrared Spectroscopy Via Partial Least Square Regression

Published on: November 8, 2019

6.5K

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Statistical Genetics

Background:

  • High-dimensional data analysis often yields complex prediction models.
  • Causal inference in large models presents practical challenges.
  • Need for efficient and interpretable prediction models in biological research.

Purpose of the Study:

  • Introduce MinLinMo, a software package for generating small linear prediction models.
  • Prioritize model parsimony, speed, and minimal memory usage.
  • Facilitate causal inference with high-dimensional biological datasets.

Main Methods:

  • Developed the stand-alone software package MinLinMo.
  • Focused on selecting predictors correlated with the outcome.
  • Emphasized parsimony, minimal memory footprint, and computational speed.

Main Results:

  • MinLinMo successfully generated parsimonious prediction models for epigenetic datasets.
  • Models predicted chronological age (15 predictors), gestational age (14 predictors), and birth weight (10 predictors).
  • MinLinMo models performed comparably to established models using hundreds of predictors.

Conclusions:

  • MinLinMo offers an effective approach to building small, interpretable prediction models.
  • Parsimonious models derived from MinLinMo facilitate causal inference in high-dimensional data.
  • The software provides a computationally efficient alternative for biological data analysis.