Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Survival Tree

Survival Tree

Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
Building a Survival Tree
Constructing a...

Mathematical Modeling: Problem Solving

Mathematical Modeling: Problem Solving

Mathematical modeling transforms real-world scenarios into mathematical expressions, allowing for structured problem-solving and analysis. This process involves defining the situation, assigning variables to measurable quantities, selecting an appropriate model, and solving the resulting equation. Such models are invaluable in finance, providing precise methods to evaluate investments, loans, and repayment structures.A widely used example is the calculation of fixed monthly payments on a loan,...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...

Bias

Bias

Bias refers to any tendency that prevents a question from being considered unprejudiced. In research, bias occurs when one outcome or answer is selected or encouraged over others in sampling or testing. Bias can occur during any research phase, including study design, data collection, analysis, and publication.
In statistics, a sampling bias is created when a sample is collected from a population, and some members of the population are not as likely to be chosen as others (remember, each member...

Regression Analysis

Regression Analysis

Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Safety and hemodynamic efficacy of the LVIS stent in the endovascular treatment of intracranial wide-necked aneurysms: a single-center retrospective study.

Chinese neurosurgical journal·2026

Same author

Sensor-Driven Short-Term Forecasting on the Metropolitan LA Traffic Dataset: A Comparative Study for Multi-Step Prediction.

Sensors (Basel, Switzerland)·2026

Same author

<i>Schistosoma japonicum</i> Worms Alter the miRNA Expression Profile of Hepatic Stellate Cells with Potential Implications for Liver Fibrosis and Hepatocellular Carcinoma.

Tropical medicine and infectious disease·2026

Same author

Comparison of survival outcomes for people with HR+/HER2- metastatic breast cancer who received palbociclib, ribociclib, or abemaciclib with an aromatase inhibitor: a plain language summary.

Future oncology (London, England)·2026

Same author

Treatment outcomes with palbociclib plus an aromatase inhibitor in patients with metastatic breast cancer who also have cardiovascular diseases: a plain language summary.

Future oncology (London, England)·2026

Same author

Training and transfer effect of evoked brain responses by brain-computer interaction.

IEEE transactions on bio-medical engineering·2026

Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026

Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026

Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026

Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026

Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026

Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 29, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Risk-Sensitive Machine Learning for Financial Decision Modeling Under Imbalanced Data: Evidence from Bank

Bowen Dong¹, Xinyu Zhang², Yang Liu³

¹School of Electrical Automation and Information Engineering, Tianjin University, Tianjin 300072, China.

Entropy (Basel, Switzerland)

|March 28, 2026

Summary

This summary is machine-generated.

This study improves bank telemarketing predictions by combining oversampling and cost-sensitive learning. Ensemble models like CatBoost significantly boost identification of customers likely to subscribe, even with imbalanced data.

Keywords:

class imbalance difficulty in minority-class identification under imbalance financial decision modeling imbalance modeling interpretability machine learning risk-sensitive learning

Related Experiment Videos

Last Updated: Mar 29, 2026

An R-Based Landscape Validation of a Competing Risk Model

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

Area of Science:

Machine Learning
Data Science
Financial Analytics

Background:

Bank telemarketing campaigns face low subscription rates due to customer differences and imbalanced datasets.
Predictive modeling for telemarketing outcomes is challenging because of severe class imbalance.

Purpose of the Study:

To enhance the prediction of bank telemarketing outcomes using a data-driven approach.
To integrate synthetic minority oversampling and cost-sensitive learning for improved predictive accuracy.

Main Methods:

Utilized the Portuguese Bank Marketing dataset (41,188 instances, 11.3% positive response).
Evaluated eight machine learning models (Logistic Regression, Decision Tree, Random Forest, Ensemble methods) using cross-validation.
Applied synthetic minority oversampling and cost-sensitive learning techniques.

Main Results:

Ensemble models (CatBoost, XGBoost, LightGBM) outperformed traditional baselines.
Achieved significant gains in minority-class recall and overall discrimination.
The best model reached an F1-score of 0.540, positive class recall of 0.812, and ROC-AUC of 0.908.

Conclusions:

Combining resampling strategies with cost-sensitive optimization offers a robust method for imbalanced telemarketing data.
SHAP analysis identified key predictors: campaign duration, previous contact outcomes, and macroeconomic indicators.
This approach supports reproducible, data-driven financial decision-making by addressing minority-class identification challenges.