Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Reliability and Validity01:29

Reliability and Validity

14.0K
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
14.0K
Variability: Analysis01:11

Variability: Analysis

520
Measures of variability are statistical metrics that reveal the dispersion pattern within a dataset. They are pivotal in biostatistics, providing insights into the heterogeneity within health and biological data. Variability signifies the degree to which data points diverge from one another, helping researchers understand the potential range of values and associated uncertainty within the data.
The range is a simple measure of variability, indicating the difference between the highest and...
520
Random Variables01:09

Random Variables

17.9K
A random variable is a single numerical value that indicates the outcome of a procedure. The concept of random variables is fundamental to the probability theory and was introduced by a Russian mathematician, Pafnuty Chebyshev, in the mid-nineteenth century.
Uppercase letters such as X or Y denote a random variable. Lowercase letters like x or y denote the value of a random variable. If X is a random variable, then X is written in words, and x is given as a number.
For example, let X = the...
17.9K
Pharmacokinetic Models: Comparison and Selection Criterion01:26

Pharmacokinetic Models: Comparison and Selection Criterion

363
Physiological and compartmental models are valuable tools used in studying biological systems. These models rely on differential equations to maintain mass balance within the system, ensuring an accurate representation of the dynamic processes at play.
Physiological models take a detailed approach by considering specific molecular processes. They can predict drug distribution, metabolism, and elimination changes, providing a comprehensive understanding of how drugs interact with the body.
363
Graphs of Equations in Two Variables01:30

Graphs of Equations in Two Variables

253
An equation with two variables, typically written in the form y = f(x) or Ax + By = C, describes a relationship between quantities represented by x and y. Each solution to such an equation is an ordered pair (x, y) that satisfies the equation when substituted. These pairs can be represented graphically to understand the variables' relationship visually.A common technique for constructing the graph of a two-variable equation is to create a value table. Begin by choosing several values for the...
253
Variables Affecting Phosphorescence and Fluorescence01:26

Variables Affecting Phosphorescence and Fluorescence

1.5K
Fluorescence and phosphorescence are essential phenomena in fields like analytical chemistry, biological imaging, and materials science, where they detect molecular properties and visualize cellular structures. Understanding the variables that influence these luminescent behaviors is crucial for maximizing accuracy and efficiency in their applications. These variables can broadly be grouped into chemical structure, solvent properties, and external conditions, each playing a distinct role in...
1.5K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Gut microbiome composition and functional potential associate with incident type 2 diabetes in 4,685 adults from a Swedish prospective cohort.

Cell reports. Medicine·2026
Same author

Multi-omics analysis of associations between host demographics and saliva metabolome, sugar profiles, and microbiome profiles.

Scientific reports·2026
Same author

Analysis of microbiome high-dimensional experimental design data using generalized linear models and ANOVA simultaneous component analysis.

Frontiers in microbiomes·2026
Same author

A data-driven SSM/PCA analysis approach for differential diagnosis of parkinsonism using <sup>11</sup>C-PE2I PET.

NeuroImage. Clinical·2026
Same author

ACMTF-R: Supervised multi-omics data integration uncovering shared and distinct outcome-associated variation.

PloS one·2026
Same author

<i>xcms</i> in Peak Form: Now Anchoring a Complete Metabolomics Data Preprocessing and Analysis Software Ecosystem.

Analytical chemistry·2025
Same journal

conMItion: an R package adjusting confounding factors for associations in multi-omics.

Bioinformatics (Oxford, England)·2026
Same journal

SpaMFG: a Spatial Multi-omics Integration Method based on Feature Grouping.

Bioinformatics (Oxford, England)·2026
Same journal

CSCN: Inference of Cell-Specific Causal Networks Using Single-Cell RNA-Seq Data.

Bioinformatics (Oxford, England)·2026
Same journal

Sparse CCA-Based Mediation Analysis with High-Dimensional Exposures and Mediators.

Bioinformatics (Oxford, England)·2026
Same journal

Enhancing Cross-Context Generalization in Drug Perturbation Prediction with a Multimodal Conditional Diffusion Framework.

Bioinformatics (Oxford, England)·2026
Same journal

Primer Design through Submodular Function Estimation.

Bioinformatics (Oxford, England)·2026
See all related articles

Related Experiment Video

Updated: Feb 5, 2026

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.5K

Variable selection and validation in multivariate modelling.

Lin Shi1,2, Johan A Westerhuis3,4, Johan Rosén5

  • 1Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala SE-750 07, Sweden.

Bioinformatics (Oxford, England)
|August 31, 2018
PubMed
Summary
This summary is machine-generated.

The new MUVR algorithm enhances multivariate analysis by accurately selecting all relevant variables, improving model prediction, and reducing overfitting. This approach ensures more reliable results in complex data analysis.

More Related Videos

Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.6K

Related Experiment Videos

Last Updated: Feb 5, 2026

Cross-Modal Multivariate Pattern Analysis
13:51

Cross-Modal Multivariate Pattern Analysis

Published on: November 9, 2011

20.5K
Basics of Multivariate Analysis in Neuroimaging Data
06:35

Basics of Multivariate Analysis in Neuroimaging Data

Published on: July 24, 2010

17.4K
An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.6K

Area of Science:

  • Multivariate data analysis
  • Bioinformatics
  • Statistical modeling

Background:

  • Robust multivariate models require rigorous variable selection and validation to prevent overfitting and ensure generalizability.
  • Existing algorithms often struggle to identify all relevant variables, leading to selection bias and increased false positives.
  • There is a critical need for advanced algorithms that can identify both minimal-optimal and all-relevant variables alongside proper cross-validation.

Purpose of the Study:

  • To develop and validate the Multivariate Utility of Variables Recursive (MUVR) algorithm for improved multivariate analysis.
  • To enhance predictive performance, minimize overfitting, and reduce false positives in model construction.
  • To simultaneously identify minimal-optimal and all-relevant variable sets for diverse analytical tasks.

Main Methods:

  • The MUVR algorithm employs recursive variable elimination within a repeated double cross-validation (rdCV) framework.
  • It supports partial least squares (PLS) and random forest (RF) modeling techniques.
  • The method is designed for regression, classification, and multilevel analyses, integrating variable selection and validation.

Main Results:

  • MUVR successfully constructed parsimonious models with minimal overfitting across three omics datasets.
  • The algorithm demonstrated improved model performance compared to existing state-of-the-art rdCV methods.
  • MUVR outperformed other variable selection algorithms like Boruta and VSURF by offering simultaneous selection and validation.

Conclusions:

  • The MUVR algorithm provides a robust solution for variable selection in multivariate analysis, addressing the all-relevant problem.
  • It offers improved predictive accuracy and reduced overfitting, making it a valuable tool for omics data and other complex datasets.
  • The open-source availability of MUVR as an R package facilitates its widespread adoption and application in scientific research.