Constructing bi-plots for random forest: Tutorial.

Lionel Blanchet¹, Raffaele Vitale², Robert van Vorstenbosch¹

¹Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.

Analytica Chimica Acta

September 15, 2020

Summary

This summary is machine-generated.

This study introduces a novel pseudo-sample approach for Random Forest models that makes variable importance easier to visualize. The method produces bi-plots that help interpret complex data relationships across diverse fields.

Keywords:

Bi-plots
Principal coordinates analysis
Proximity matrix
Pseudo samples
Random forest interpretation

Area of Science:

Machine Learning
Data Science
Computational Biology

Background:

Technological advancements have led to a data explosion, creating opportunities in machine learning.
Ensemble techniques, like Random Forest (RF), are crucial for building high-performance predictive models.
Current RF variable importance methods lack sample-specific insights.

Purpose of the Study:

To present a novel pseudo-sample principle for Random Forest models.
To enable sample-group-specific variable importance visualization.
To demonstrate the versatility of the approach across different data types.

Main Methods:

Development of a pseudo-sample principle for Random Forest.
Construction of bi-plots (spin plots) associated with RF models.
Application to simulated and real-world datasets (political science, food chemistry, human microbiome).
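
As a rough illustration of the ingredients named above, the sketch below builds pseudo-samples for one variable and computes a Random Forest style proximity from leaf assignments. It assumes a common formulation of pseudo-samples (one variable swept across its observed range while the others are held at a baseline, here the column mean) and the usual RF proximity (the fraction of trees in which two samples land in the same leaf). The summary does not spell out the authors' exact construction, so both choices are assumptions. The function names and the leaf-index values are hypothetical; in practice the leaf indices would come from a fitted forest, for example via scikit-learn's `RandomForestClassifier.apply`.

```python
from statistics import mean


def make_pseudo_samples(X, var_index, n_points=5):
    """Sweep one variable over its observed range; hold the others at their column mean.

    Assumptions: mean baseline, evenly spaced grid, n_points >= 2.
    The paper may use a different baseline or grid.
    """
    n_vars = len(X[0])
    baseline = [mean(row[j] for row in X) for j in range(n_vars)]
    column = [row[var_index] for row in X]
    lo, hi = min(column), max(column)
    step = (hi - lo) / (n_points - 1)
    pseudo = []
    for k in range(n_points):
        row = list(baseline)
        row[var_index] = lo + k * step
        pseudo.append(row)
    return pseudo


def proximity(leaves_a, leaves_b):
    """Fraction of trees in which each pair (a, b) falls in the same leaf.

    leaves_a, leaves_b: one list per sample, holding that sample's leaf index in each tree.
    """
    n_trees = len(leaves_a[0])
    return [
        [sum(la == lb for la, lb in zip(a, b)) / n_trees for b in leaves_b]
        for a in leaves_a
    ]


# Toy data: 4 samples, 2 variables (illustrative values only).
X = [[0.0, 1.0], [1.0, 3.0], [2.0, 5.0], [3.0, 7.0]]
pseudo = make_pseudo_samples(X, var_index=0, n_points=4)

# Hypothetical leaf indices from a 3-tree forest (rows: samples, columns: trees).
train_leaves = [[1, 2, 1], [1, 2, 3], [4, 5, 3], [4, 5, 6]]
pseudo_leaves = [[1, 2, 1], [4, 5, 6]]

# Proximity of each pseudo-sample to each training sample.
P = proximity(pseudo_leaves, train_leaves)
```

Each row of `P` describes how close one pseudo-sample sits to the training samples in the forest's own similarity measure, which is the kind of information that would let a variable's trajectory be drawn alongside the samples.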

Main Results:

The pseudo-sample principle successfully generates bi-plots for RF models.
These bi-plots provide versatile visualization of multivariate models.
The approach reveals variable importance and relationships specific to sample groups.
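
To make the visualization step more concrete, the sketch below shows how two-dimensional sample coordinates could be obtained from a proximity matrix by principal coordinates analysis, which is listed among the keywords. It is a minimal sketch under stated assumptions, not the authors' implementation: the squared dissimilarity is taken as 1 minus the proximity, the eigenpairs are found with a simple power iteration, and the toy proximity matrix and all function names are hypothetical. Projecting pseudo-samples into the same space is not shown.

```python
import math


def double_center(sq_dist):
    """Gower centering of a symmetric squared-distance matrix: B = -1/2 * J * D2 * J."""
    n = len(sq_dist)
    row_mean = [sum(row) / n for row in sq_dist]  # equals column means by symmetry
    grand_mean = sum(row_mean) / n
    return [
        [-0.5 * (sq_dist[i][j] - row_mean[i] - row_mean[j] + grand_mean)
         for j in range(n)]
        for i in range(n)
    ]


def top_eigenpairs(B, k=2, iters=500):
    """Leading eigenpairs of a symmetric PSD matrix via power iteration with deflation."""
    n = len(B)
    B = [row[:] for row in B]  # work on a copy
    pairs = []
    for c in range(k):
        v = [1.0 + 0.1 * (i + c) for i in range(n)]  # deterministic start vector
        norm = math.sqrt(sum(x * x for x in v))
        v = [x / norm for x in v]
        for _ in range(iters):
            w = [sum(B[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm == 0.0:
                break
            v = [x / norm for x in w]
        Bv = [sum(B[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(v[i] * Bv[i] for i in range(n))  # Rayleigh quotient
        pairs.append((lam, v))
        for i in range(n):  # deflate so the next pass finds the next eigenpair
            for j in range(n):
                B[i][j] -= lam * v[i] * v[j]
    return pairs


# Hypothetical proximity matrix: samples 0-1 and samples 2-3 form two groups.
prox = [
    [1.0, 0.8, 0.0, 0.0],
    [0.8, 1.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.8],
    [0.0, 0.0, 0.8, 1.0],
]
sq_dist = [[1.0 - p for p in row] for row in prox]  # assumed dissimilarity
pairs = top_eigenpairs(double_center(sq_dist), k=2)

# Principal coordinates: eigenvector entries scaled by sqrt(eigenvalue).
coords = [
    [v[i] * math.sqrt(max(lam, 0.0)) for lam, v in pairs]
    for i in range(len(prox))
]
```

In this toy case the first axis separates the two groups, which is the behavior one would expect from a score plot built on forest proximities. A library eigensolver would normally replace the power iteration; it is used here only to keep the sketch dependency-free.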

Conclusions:

The pseudo-sample bi-plot approach enhances Random Forest interpretability.
This method offers valuable insights into variable importance across diverse datasets.
It represents a significant advancement in visualizing complex machine learning model outputs.