Variable selection and validation in multivariate modelling.

Lin Shi¹,², Johan A Westerhuis³,⁴, Johan Rosén⁵

¹Department of Molecular Sciences, Swedish University of Agricultural Sciences, Uppsala SE-750 07, Sweden.

Bioinformatics (Oxford, England)

August 31, 2018

Summary

This summary is machine-generated.

The MUVR algorithm combines variable selection with repeated double cross-validation. It identifies both minimal-optimal and all-relevant variable sets while improving model prediction and reducing overfitting in multivariate analysis.

Area of Science:

Multivariate data analysis
Bioinformatics
Statistical modeling

Background:

Robust multivariate models require rigorous variable selection and validation to prevent overfitting and ensure generalizability.
Existing algorithms often struggle to identify all relevant variables, leading to selection bias and increased false positives.
There is a critical need for advanced algorithms that can identify both minimal-optimal and all-relevant variables alongside proper cross-validation.

Purpose of the Study:

To develop and validate the Multivariate Utility of Variables Recursive (MUVR) algorithm for improved multivariate analysis.
To enhance predictive performance, minimize overfitting, and reduce false positives in model construction.
To simultaneously identify minimal-optimal and all-relevant variable sets for diverse analytical tasks.

Main Methods:

The MUVR algorithm employs recursive variable elimination within a repeated double cross-validation (rdCV) framework.
It supports partial least squares (PLS) and random forest (RF) modeling techniques.
The method is designed for regression, classification, and multilevel analyses, integrating variable selection and validation.
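
The idea of recursive variable elimination inside repeated double cross-validation can be sketched roughly as follows. This is a minimal illustration in Python using only the standard library, not the MUVR R package or its API. All function names (`fit`, `select_inner`, `rdcv`) and parameters (`keep_frac`, `n_rep`, and so on) are hypothetical, and a simple correlation-weighted sum-score regression stands in for the PLS and RF models named above. The point it tries to show is that variables are ranked and eliminated using only the inner (calibration) data, while the outer test segment is used only to estimate prediction error.

```python
import random
from statistics import mean

def pearson(a, b):
    """Pearson correlation of two equal-length lists (0.0 if either is constant)."""
    ma, mb = mean(a), mean(b)
    cov = sum((x - ma) * (t - mb) for x, t in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((t - mb) ** 2 for t in b)
    return cov / (va * vb) ** 0.5 if va > 0 and vb > 0 else 0.0

def fit(X, y, rows, cols):
    """Stand-in model (NOT PLS or RF): sign-weighted sum of the selected
    variables, then a one-variable least-squares fit of y on that score.
    Returns (predict_function, {column: importance = |correlation with y|})."""
    ycol = [y[i] for i in rows]
    corr = {j: pearson([X[i][j] for i in rows], ycol) for j in cols}
    score = lambda r: sum((1 if corr[j] >= 0 else -1) * r[j] for j in cols)
    s = [score(X[i]) for i in rows]
    ms, my = mean(s), mean(ycol)
    var = sum((v - ms) ** 2 for v in s)
    slope = sum((v - ms) * (t - my) for v, t in zip(s, ycol)) / var if var > 0 else 0.0
    return (lambda r: my + slope * (score(r) - ms)), {j: abs(c) for j, c in corr.items()}

def mse(predict, X, y, rows):
    """Mean squared prediction error on the given rows."""
    return mean([(predict(X[i]) - y[i]) ** 2 for i in rows])

def folds(rows, k, rng):
    """Shuffle the row indices and split them into k segments."""
    rows = rows[:]
    rng.shuffle(rows)
    return [rows[i::k] for i in range(k)]

def select_inner(X, y, calib, k, keep_frac, rng):
    """Inner loop: recursively drop the lowest-ranked variables and return the
    variable set with the lowest inner cross-validated error."""
    cols = list(range(len(X[0])))
    inner = folds(calib, k, rng)
    best_err, best_cols = float("inf"), cols
    while True:
        errs, rank = [], {j: 0.0 for j in cols}
        for val in inner:
            train = [i for i in calib if i not in val]
            predict, imp = fit(X, y, train, cols)
            errs.append(mse(predict, X, y, val))
            for j in cols:
                rank[j] += imp[j]
        if mean(errs) <= best_err:  # ties go to the smaller variable set
            best_err, best_cols = mean(errs), cols
        if len(cols) == 1:
            return best_cols
        n_keep = max(1, int(len(cols) * keep_frac))
        cols = sorted(cols, key=lambda j: -rank[j])[:n_keep]

def rdcv(X, y, n_rep=3, k_outer=4, k_inner=3, keep_frac=0.75, seed=0):
    """Outer loop: each test segment is held out of selection entirely and is
    used only to score the model built on the remaining (calibration) rows."""
    rng = random.Random(seed)
    rows = list(range(len(X)))
    errors, counts = [], {}
    for rep in range(n_rep):
        for test in folds(rows, k_outer, rng):
            calib = [i for i in rows if i not in test]
            cols = select_inner(X, y, calib, k_inner, keep_frac, rng)
            predict, _ = fit(X, y, calib, cols)
            errors.append(mse(predict, X, y, test))
            for j in cols:
                counts[j] = counts.get(j, 0) + 1
    return mean(errors), counts

# Synthetic data (an assumption for illustration): only columns 0 and 1 carry signal.
data_rng = random.Random(1)
n, p = 40, 8
X = [[data_rng.gauss(0, 1) for _ in range(p)] for _ in range(n)]
y = [row[0] + row[1] + data_rng.gauss(0, 0.1) for row in X]
err, counts = rdcv(X, y)
print("outer-loop MSE:", err)
print("selection counts per variable:", counts)
```

In this sketch, `counts` records how often each variable survives selection across the outer segments and repetitions, which is one simple way to summarise selection stability. How MUVR itself aggregates ranks and derives its minimal-optimal and all-relevant sets is not specified here and should be taken from the package documentation.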

Main Results:

MUVR successfully constructed parsimonious models with minimal overfitting across three omics datasets.
The algorithm demonstrated improved model performance compared to existing state-of-the-art rdCV methods.
MUVR outperformed other variable selection algorithms like Boruta and VSURF by offering simultaneous selection and validation.

Conclusions:

The MUVR algorithm provides a robust solution for variable selection in multivariate analysis, addressing the all-relevant problem.
It offers improved predictive accuracy and reduced overfitting, making it a valuable tool for omics data and other complex datasets.
The open-source availability of MUVR as an R package facilitates its widespread adoption and application in scientific research.