Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

469
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
469
Mathematical Modeling: Problem Solving01:29

Mathematical Modeling: Problem Solving

519
Mathematical modeling transforms real-world scenarios into mathematical expressions, allowing for structured problem-solving and analysis. This process involves defining the situation, assigning variables to measurable quantities, selecting an appropriate model, and solving the resulting equation. Such models are invaluable in finance, providing precise methods to evaluate investments, loans, and repayment structures.A widely used example is the calculation of fixed monthly payments on a loan,...
519
Prediction Intervals01:03

Prediction Intervals

3.5K
The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y. 
3.5K
Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data01:16

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

581
Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance,...
581
Bias01:22

Bias

8.0K
Bias refers to any tendency that prevents a question from being considered unprejudiced. In research, bias occurs when one outcome or answer is selected or encouraged over others in sampling or testing. Bias can occur during any research phase, including study design, data collection, analysis, and publication.
In statistics, a sampling bias is created when a sample is collected from a population, and some members of the population are not as likely to be chosen as others (remember, each member...
8.0K
Regression Analysis01:11

Regression Analysis

8.9K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
8.9K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Safety and hemodynamic efficacy of the LVIS stent in the endovascular treatment of intracranial wide-necked aneurysms: a single-center retrospective study.

Chinese neurosurgical journal·2026
Same author

Sensor-Driven Short-Term Forecasting on the Metropolitan LA Traffic Dataset: A Comparative Study for Multi-Step Prediction.

Sensors (Basel, Switzerland)·2026
Same author

<i>Schistosoma japonicum</i> Worms Alter the miRNA Expression Profile of Hepatic Stellate Cells with Potential Implications for Liver Fibrosis and Hepatocellular Carcinoma.

Tropical medicine and infectious disease·2026
Same author

Comparison of survival outcomes for people with HR+/HER2- metastatic breast cancer who received palbociclib, ribociclib, or abemaciclib with an aromatase inhibitor: a plain language summary.

Future oncology (London, England)·2026
Same author

Treatment outcomes with palbociclib plus an aromatase inhibitor in patients with metastatic breast cancer who also have cardiovascular diseases: a plain language summary.

Future oncology (London, England)·2026
Same author

Training and transfer effect of evoked brain responses by brain-computer interaction.

IEEE transactions on bio-medical engineering·2026
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Mar 29, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Risk-Sensitive Machine Learning for Financial Decision Modeling Under Imbalanced Data: Evidence from Bank

Bowen Dong1, Xinyu Zhang2, Yang Liu3

  • 1School of Electrical Automation and Information Engineering, Tianjin University, Tianjin 300072, China.

Entropy (Basel, Switzerland)
|March 28, 2026
PubMed
Summary
This summary is machine-generated.

This study improves bank telemarketing predictions by combining oversampling and cost-sensitive learning. Ensemble models like CatBoost significantly boost identification of customers likely to subscribe, even with imbalanced data.

Keywords:
class imbalancedifficulty in minority-class identification under imbalancefinancial decision modelingimbalance modelinginterpretabilitymachine learningrisk-sensitive learning

Related Experiment Videos

Last Updated: Mar 29, 2026

An R-Based Landscape Validation of a Competing Risk Model
05:37

An R-Based Landscape Validation of a Competing Risk Model

Published on: September 16, 2022

2.7K

Area of Science:

  • Machine Learning
  • Data Science
  • Financial Analytics

Background:

  • Bank telemarketing campaigns face low subscription rates due to customer differences and imbalanced datasets.
  • Predictive modeling for telemarketing outcomes is challenging because of severe class imbalance.

Purpose of the Study:

  • To enhance the prediction of bank telemarketing outcomes using a data-driven approach.
  • To integrate synthetic minority oversampling and cost-sensitive learning for improved predictive accuracy.

Main Methods:

  • Utilized the Portuguese Bank Marketing dataset (41,188 instances, 11.3% positive response).
  • Evaluated eight machine learning models (Logistic Regression, Decision Tree, Random Forest, Ensemble methods) using cross-validation.
  • Applied synthetic minority oversampling and cost-sensitive learning techniques.

Main Results:

  • Ensemble models (CatBoost, XGBoost, LightGBM) outperformed traditional baselines.
  • Achieved significant gains in minority-class recall and overall discrimination.
  • The best model reached an F1-score of 0.540, positive class recall of 0.812, and ROC-AUC of 0.908.

Conclusions:

  • Combining resampling strategies with cost-sensitive optimization offers a robust method for imbalanced telemarketing data.
  • SHAP analysis identified key predictors: campaign duration, previous contact outcomes, and macroeconomic indicators.
  • This approach supports reproducible, data-driven financial decision-making by addressing minority-class identification challenges.