Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Confidence Coefficient

Confidence Coefficient

The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...

Accuracy and Precision

Accuracy and Precision

Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value. Highly accurate...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Reliability and Validity

Reliability and Validity

Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.

Stereotype Content Model

Stereotype Content Model

The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...

Sensitivity, Specificity, and Predicted Value

Sensitivity, Specificity, and Predicted Value

In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Additive Manufacturing to Mimic the Nonlinear Mechanical Behavior of Cardiac Soft Tissue.

Polymers·2025

Same author

Posterior optic neuropathy as a rare manifestation in the progression of Medulloblastoma.

Acta neurologica Belgica·2025

Same author

Reproductive dynamics of the sea cucumber Holothuria arguinensis: Biological insights to support fisheries management.

The Science of the total environment·2025

Same author

Estimating the air quality standard exceedance areas and the spatial representativeness of urban air quality stations applying microscale modelling.

The Science of the total environment·2025

Same author

Dual-Loaded Chitosan-Based Nanoparticles: A Novel approach for treating polymicrobial osteomyelitis.

International journal of pharmaceutics·2025

Same author

Comparative Analysis of Data Augmentation Approaches for Blood Pressure Prediction.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025

Same journal

Deep multi-modal features based spatio-temporal video regression for non-invasive hemoglobin estimation.

Medical & biological engineering & computing·2026

Same journal

Reduced mechanical strength correlates with decreased elastin content in aortic intima-media tissue: association with dissection in human ascending aortas.

Medical & biological engineering & computing·2026

Same journal

How plaque morphology and stenosis severity govern stent-artery interaction and deployment outcomes: a computational study.

Medical & biological engineering & computing·2026

Same journal

Investigating a relation between amyloid beta plaque burden and accumulated neurotoxicity caused by amyloid beta oligomers.

Medical & biological engineering & computing·2026

Same journal

A robot-assisted eye positioning method with high precision and repeatability for ocular particle therapy: mechanical and geometric assessment.

Medical & biological engineering & computing·2026

Same journal

Enhanced puncture event detection for teleoperated needle insertion robotic system.

Medical & biological engineering & computing·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 24, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Machine learning models' assessment: trust and performance.

S Sousa¹, S Paredes^2,3, T Rocha^1,4

¹CISUC, Center for Informatics and Systems of University of Coimbra, University of Coimbra, Pólo II, 3030-290, Coimbra, Portugal.

Medical & Biological Engineering & Computing

|June 7, 2024

Summary

This summary is machine-generated.

Developing a novel evaluation approach, this study assesses machine learning model trust and performance simultaneously. A rule-based approach demonstrated high trust and superior performance for cardiovascular risk assessment, aiding clinical decision-making.

Keywords:

Clinical decision support systems Explainable AI Interpretability Trust

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

Design and Analysis for Fall Detection System Simplification

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

Related Experiment Videos

Last Updated: Jun 24, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

Design and Analysis for Fall Detection System Simplification

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

Area of Science:

Medical Informatics
Machine Learning in Healthcare
Clinical Decision Support

Background:

Machine learning (ML) models are often black boxes, hindering trust and adoption in healthcare.
Lack of trust is a significant barrier to the widespread application of ML in clinical settings.
A simultaneous evaluation of trust and performance is needed for reliable healthcare ML tools.

Purpose of the Study:

To develop and validate an evaluation framework assessing both trust and performance of ML models.
To compare the trust and performance of various ML models against a clinical standard for cardiovascular risk stratification.
To identify ML models suitable for clinical application based on trust and performance metrics.

Main Methods:

Trust assessment incorporated model robustness, confidence intervals (95% CI), and interpretability via feature ranking comparison with clinical evidence.
Performance was evaluated using the geometric mean.
Five models (GRACE score, logistic regression, Naïve Bayes, decision trees, rule-based approach) were compared using a Portuguese cardiovascular risk dataset (N=1544).

Main Results:

Simultaneous assessment of trust and performance was successfully implemented.
The rule-based approach exhibited a high level of operational trust.
The rule-based approach outperformed the GRACE score in performance and enhanced physician acceptance.

Conclusions:

The developed evaluation approach effectively assesses ML model trust and performance concurrently.
The rule-based approach shows significant potential for clinical application in cardiovascular risk assessment.
Improved trust and performance of ML models can enhance physician acceptance and aid clinical decision-making.