Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

295
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
295
Residual Plots01:07

Residual Plots

5.8K
A residual plot is a statistical representation of data used to analyze correlation and regression results. It helps verify the requirements for drawing specific conclusions about correlation and regression. To obtain the residual plot, first, the residual for each data value is calculated, which is simply the vertical distance between the observed and the predicted value obtained from the regression equation.
When the residual values are plotted against the variable x, it is called a residual...
5.8K
Multiple Regression01:25

Multiple Regression

3.6K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.6K
Contingency Table01:29

Contingency Table

3.6K
A contingency table provides a way of portraying data that can facilitate calculating probabilities. It is a method of displaying a frequency distribution as a table with rows and columns to show how two variables may be dependent (contingent) upon each other; The table helps determine conditional probabilities quite quickly and can help systematically organize, analyze and quantify data. The table displays sample values concerning two variables that may be dependent or contingent on one...
3.6K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

8.7K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
8.7K
Randomized Experiments01:13

Randomized Experiments

8.7K
The randomization process involves assigning study participants randomly to experimental or control groups based on their probability of being equally assigned. Randomization is meant to eliminate selection bias and balance known and unknown confounding factors so that the control group is similar to the treatment group as much as possible. A computer program and a random number generator can be used to assign participants to groups in a way that minimizes bias.
Simple randomization
Simple...
8.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

CD271 Identifies a Subpopulation with Enhanced Neural-like Potential Within Wharton Jelly Derived Mesenchymal Stem/Stromal Cells.

International journal of molecular sciences·2026
Same author

A Diverse Fiber Mixture, Reflective of a Nutritionally Balanced Diet, Is Well Tolerated by Healthy Adults.

Current developments in nutrition·2026
Same author

Phenethylamines in pre-workout supplements alter arterial pressure, heart rate, and body temperature in rats.

European journal of pharmacology·2026
Same author

Investigating short-term lung inflammation using exhaled breath VOCs from exposure to candles: a randomized controlled crossover study among young mild asthmatics.

Journal of breath research·2026
Same author

Artificial intelligence in dementia care: challenges, controversies, and policy implications.

Frontiers in dementia·2026
Same author

Paternal Malnutrition has Organ-Specific Intergenerational Effects on Mitochondrial Function and Oxidative Stress Induced DNA Damage in Male Mouse Offspring.

Molecular nutrition & food research·2026
Same journal

Smartphone-assisted fluorescence and colorimetric dual-mode sensor for visual quantitative detection of nitrite and nitrate in real samples.

Analytica chimica acta·2026
Same journal

Folding integrated all-paper photoelectrochemical immunoassay using annealed ZnO for point-of-care detection of ferritin.

Analytica chimica acta·2026
Same journal

Dual-mode electrochemical-SERS detection of chloramphenicol based on dual-signal enhancement.

Analytica chimica acta·2026
Same journal

Multi-screening of beta-lactam antibiotics in milk based on Fe<sub>3</sub>O<sub>4</sub>@phage/bacteria system and aggregation induced emission luminogen.

Analytica chimica acta·2026
Same journal

A porous phosphate-rich β-cyclodextrin polymer for efficient and broad-spectrum enrichment of antibiotics.

Analytica chimica acta·2026
Same journal

Corrigendum to "LUMIN: A novel algorithm for automated mixture quantification using 1D <sup>1</sup>H NMR spectra" [Analytica Chimica Acta 1411 (2026) 345639].

Analytica chimica acta·2026
See all related articles

Related Experiment Video

Updated: Dec 9, 2025

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

400

Constructing bi-plots for random forest: Tutorial.

Lionel Blanchet1, Raffaele Vitale2, Robert van Vorstenbosch1

  • 1Department of Pharmacology and Toxicology, School of Nutrition, Toxicology and Translational Research in Metabolism (NUTRIM), Maastricht University Medical Center+, Maastricht, the Netherlands.

Analytica Chimica Acta
|September 15, 2020
PubMed
Summary
This summary is machine-generated.

This study introduces a novel pseudo-sample approach for Random Forest models, enhancing variable importance visualization. The method creates bi-plots for better understanding complex data relationships across diverse fields.

Keywords:
Bi-plotsPrincipal coordinates analysisProximity matrixPseudo samplesRandom forest interpretation

More Related Videos

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.6K

Related Experiment Videos

Last Updated: Dec 9, 2025

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model
07:13

Comparison of Predictive Performance of Three Lymph Node Staging Systems in Colorectal Signet Ring Cell Carcinoma Based on Machine Learning Model

Published on: April 18, 2025

400
Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K
Establishing a Competing Risk Regression Nomogram Model for Survival Data
04:57

Establishing a Competing Risk Regression Nomogram Model for Survival Data

Published on: October 23, 2020

10.6K

Area of Science:

  • Machine Learning
  • Data Science
  • Computational Biology

Background:

  • Technological advancements have led to a data explosion, creating opportunities in machine learning.
  • Ensemble techniques, like Random Forest (RF), are crucial for building high-performance predictive models.
  • Current RF variable importance methods lack sample-specific insights.

Purpose of the Study:

  • To present a novel pseudo-sample principle for Random Forest models.
  • To enable sample-group-specific variable importance visualization.
  • To demonstrate the versatility of the approach across different data types.

Main Methods:

  • Development of a pseudo-sample principle for Random Forest.
  • Construction of bi-plots (spin plots) associated with RF models.
  • Application to simulated and real-world datasets (political science, food chemistry, human microbiome).

Main Results:

  • The pseudo-sample principle successfully generates bi-plots for RF models.
  • These bi-plots provide versatile visualization of multivariate models.
  • The approach reveals variable importance and relationships specific to sample groups.

Conclusions:

  • The pseudo-sample bi-plot approach enhances Random Forest interpretability.
  • This method offers valuable insights into variable importance across diverse datasets.
  • It represents a significant advancement in visualizing complex machine learning model outputs.