Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

80
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
80
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Regression Analysis01:11

Regression Analysis

5.7K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
5.7K
Regression Toward the Mean01:52

Regression Toward the Mean

6.3K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.3K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.4K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.4K
Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.6K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.6K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Construction of a nomogram combining CT and serum markers for predicting macrolide resistance gene mutation status in pediatric Mycoplasma pneumoniae pneumonia.

Frontiers in pediatrics·2026
Same author

Data Fusion for Partial Identification of Causal Effects.

Advances in neural information processing systems·2026
Same author

Carrier Transport Control for Enhanced Performance in Dual-Color Quantum Well Infrared Photodetectors.

Nanomaterials (Basel, Switzerland)·2026
Same author

A Double Machine Learning Approach for Combining Experimental and Observational Studies.

Observational studies·2026
Same author

Electrically resolving 1 nanometer semiconductor quantum wells and superlattices in photodiodes by conductive scanning probe microscopy.

Nanoscale·2026
Same author

Data-hugging shields proprietary AI models from research that could disprove them.

NPJ artificial intelligence·2026
Same journal

Interpretable Failure Detection with Human-Level Concepts.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

ChatCLIDS: Simulating Persuasive AI Dialogues to Promote Closed-Loop Insulin Adoption in Type 1 Diabetes Care.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

Beyond Accuracy: On the Effects of Fine-tuning Towards Vision-Language Model's Prediction Rationality.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

<i>OrgaCast</i>: A Trustworthy Spatiotemporal Diffusion Model for Fluorescence Organoid Forecasting.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
Same journal

iDT-diet: Toward Personalized Health Forecasting-An Intelligent Digital Twin Model for Diet-Influenced Biomarker Trajectories (Student Abstract).

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026
See all related articles

Related Experiment Video

Updated: Jun 28, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K

Optimal Sparse Regression Trees.

Rui Zhang1, Rui Xin1, Margo Seltzer2

  • 1Duke University.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence
|April 23, 2024
PubMed
Summary
This summary is machine-generated.

This study introduces a new method for creating provably optimal sparse regression trees, a type of artificial intelligence model. The approach significantly speeds up the optimization process for complex datasets.

More Related Videos

Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon
09:44

Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon

Published on: October 16, 2018

10.2K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Related Experiment Videos

Last Updated: Jun 28, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.3K
Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon
09:44

Use of Principal Components for Scaling Up Topographic Models to Map Soil Redistribution and Soil Organic Carbon

Published on: October 16, 2018

10.2K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Area of Science:

  • Artificial Intelligence
  • Machine Learning
  • Data Science

Background:

  • Regression trees are foundational AI models with broad applicability, especially in critical applications.
  • The optimization of regression trees is computationally challenging, limiting research in provably optimal solutions.

Purpose of the Study:

  • To develop a method for constructing provably-optimal sparse regression trees.
  • To address the computational hardness associated with full optimization of regression trees.

Main Methods:

  • A dynamic-programming-with-bounds approach is proposed for building optimal sparse regression trees.
  • A novel lower bound, derived from a 1D k-Means clustering solution, is utilized.

Main Results:

  • The proposed method efficiently finds provably optimal sparse regression trees.
  • Optimal trees are frequently identified within seconds, even for large and complex datasets with correlated features.

Conclusions:

  • The dynamic-programming-with-bounds approach offers a computationally feasible solution for achieving provably optimal sparse regression trees.
  • This advancement facilitates the application of highly optimized regression trees in demanding scenarios.