Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

339
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
339
Bootstrapping01:24

Bootstrapping

758
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
758
Random Sampling Method01:09

Random Sampling Method

14.0K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
14.0K
Uncertainty: Confidence Intervals00:54

Uncertainty: Confidence Intervals

9.9K
The confidence interval is the range of values around the mean that contains the true mean. It is expressed as a probability percentage. The interpretation of a 95% confidence interval, for instance, is that the statistician is 95% confident that the true mean falls within the interval. The upper and lower limits of this range are known as confidence limits. The confidence limits for the true mean are estimated from the sample's mean, the standard deviation, and the statistical factor...
9.9K
Confidence Intervals01:21

Confidence Intervals

9.9K
An unbiased point estimate is often insufficient to predict a population estimate, such as population mean or population proportion. In this scenario, a confidence interval is used. A confidence interval is an estimate similar to a  sample proportion. However, unlike the point estimate which is a single value, the confidence interval  contains a range of values. These values have lower and upper limits, known as confidence limits, and can be designated as L1 and L2, respectively.
A...
9.9K
Confidence Coefficient01:24

Confidence Coefficient

10.1K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
10.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Phenotyping obesity through a two-dimensional tree structure reveals cardiometabolic heterogeneity.

Cell reports. Medicine·2025
Same author

Causation between the gut microbiota and inguinal hernia: a two-sample double-sided Mendelian randomization study.

Scientific reports·2024
Same author

Preparation of Dihydromyricetin-Loaded Self-Emulsifying Drug Delivery System and Its Anti-Alcoholism Effect.

Pharmaceutics·2023
Same author

ZITS++: Image Inpainting by Improving the Incremental Transformer on Structural Priors.

IEEE transactions on pattern analysis and machine intelligence·2023
Same author

Association of Social Support with Rehabilitation Outcome among Older Adults with Hip Fracture Surgery: A Prospective Cohort Study at Post-Acute Care Facility in Asia.

Journal of the American Medical Directors Association·2023
Same author

Artificially Intelligent Olfaction for Fast and Noninvasive Diagnosis of Bladder Cancer from Urine.

ACS sensors·2022
Same journal

Intervention Feasible Region and Driver Risk Capacity Aware Human-Machine Collaborative Safe Trajectory Planning.

IEEE transactions on neural networks and learning systems·2026
Same journal

A Unified Differential Denoising Learning Framework With a Pre-Trained Model and Fuzzy Graph Networks for Drug-Drug Interaction Prediction.

IEEE transactions on neural networks and learning systems·2026
Same journal

Self-Supervised Continuous Dynamic Graph Representation Learning via Hawkes Processes.

IEEE transactions on neural networks and learning systems·2026
Same journal

cPU: Consistent Risk Estimator for Positive-Unlabeled Learning.

IEEE transactions on neural networks and learning systems·2026
Same journal

Tuning-Free Latent Diffusion Models for Ultrahigh-Resolution Image Editing.

IEEE transactions on neural networks and learning systems·2026
Same journal

Hidden Data Recovery and Forecasting via Next-Generation Reservoir Computing With Multiscale Delay Selection.

IEEE transactions on neural networks and learning systems·2026
See all related articles

Related Experiment Video

Updated: Dec 29, 2025

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
04:35

Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

Published on: July 3, 2020

3.6K

Entropy and Confidence-Based Undersampling Boosting Random Forests for Imbalanced Problems.

Zhe Wang, Chenjie Cao, Yujin Zhu

    IEEE Transactions on Neural Networks and Learning Systems
    |January 30, 2020
    PubMed
    Summary
    This summary is machine-generated.

    We introduce ECUBoost, a novel framework using entropy and confidence for imbalanced data. This method improves generalization by intelligently undersampling majority class data without losing valuable information.

    More Related Videos

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
    12:18

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

    Published on: January 11, 2020

    7.8K

    Related Experiment Videos

    Last Updated: Dec 29, 2025

    Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach
    04:35

    Development of an Individual-Tree Basal Area Increment Model using a Linear Mixed-Effects Approach

    Published on: July 3, 2020

    3.6K
    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment
    12:18

    A Machine Learning Approach to Design an Efficient Selective Screening of Mild Cognitive Impairment

    Published on: January 11, 2020

    7.8K

    Area of Science:

    • Machine Learning
    • Data Science
    • Artificial Intelligence

    Background:

    • Imbalanced datasets pose significant challenges in machine learning, often leading to biased models and poor generalization.
    • Traditional undersampling methods risk discarding informative majority class samples, hindering model performance.

    Purpose of the Study:

    • To propose a novel framework, entropy and confidence-based undersampling boosting (ECUBoost), to effectively address imbalanced learning problems.
    • To enhance the generalization performance of ensemble models by integrating an advanced undersampling technique.

    Main Methods:

    • ECUBoost employs both entropy and confidence metrics to guide the undersampling process, ensuring the preservation of valid and structurally representative majority samples.
    • The framework is designed to be compatible with non-iterative algorithms, such as decision trees, by leveraging confidence-based undersampling.
    • Random forests are utilized as the base classifiers within the ECUBoost framework.

    Main Results:

    • Experimental evaluations on both artificial and KEEL datasets demonstrate the effectiveness of the proposed ECUBoost method.
    • The approach successfully mitigates the negative impact of data imbalance without compromising the integrity of the majority class data.

    Conclusions:

    • ECUBoost offers a robust and effective solution for handling imbalanced datasets in machine learning.
    • The method's ability to preserve informative samples and its compatibility with various algorithms make it a valuable contribution to the field of imbalanced learning.