Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Confidence Coefficient01:24

Confidence Coefficient

7.6K
The confidence coefficient is also known as the confidence level or degree of confidence. It is the percent expression for the probability, 1-α, that the confidence interval contains the true population parameter assuming that the confidence interval is obtained after sufficient unbiased sampling; for example, if the CL = 90%, then in 90 out of 100 samples the interval estimate will enclose the true population parameter. Here α is the area under the curve, distributed equally under...
7.6K
Accuracy and Precision01:52

Accuracy and Precision

8.8K
Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value.  Highly accurate...
8.8K
Improving Translational Accuracy02:07

Improving Translational Accuracy

10.1K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
10.1K
Reliability and Validity01:29

Reliability and Validity

12.7K
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
12.7K
Stereotype Content Model02:16

Stereotype Content Model

14.7K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
14.7K
Sensitivity, Specificity, and Predicted Value01:13

Sensitivity, Specificity, and Predicted Value

284
In healthcare diagnostics, laboratory tests play a crucial role in identifying and diagnosing a wide range of medical conditions. However, interpreting test results is not always straightforward. An abnormal test result does not always confirm the presence of a disease, just as a normal result does not guarantee its absence. To assess the reliability of these diagnostic tools, healthcare practitioners rely on two key statistical indicators: sensitivity and specificity.
Sensitivity is the...
284

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Additive Manufacturing to Mimic the Nonlinear Mechanical Behavior of Cardiac Soft Tissue.

Polymers·2025
Same author

Posterior optic neuropathy as a rare manifestation in the progression of Medulloblastoma.

Acta neurologica Belgica·2025
Same author

Reproductive dynamics of the sea cucumber Holothuria arguinensis: Biological insights to support fisheries management.

The Science of the total environment·2025
Same author

Estimating the air quality standard exceedance areas and the spatial representativeness of urban air quality stations applying microscale modelling.

The Science of the total environment·2025
Same author

Dual-Loaded Chitosan-Based Nanoparticles: A Novel approach for treating polymicrobial osteomyelitis.

International journal of pharmaceutics·2025
Same author

Comparative Analysis of Data Augmentation Approaches for Blood Pressure Prediction.

Annual International Conference of the IEEE Engineering in Medicine and Biology Society. IEEE Engineering in Medicine and Biology Society. Annual International Conference·2025
Same journal

Deep multi-modal features based spatio-temporal video regression for non-invasive hemoglobin estimation.

Medical & biological engineering & computing·2026
Same journal

Reduced mechanical strength correlates with decreased elastin content in aortic intima-media tissue: association with dissection in human ascending aortas.

Medical & biological engineering & computing·2026
Same journal

How plaque morphology and stenosis severity govern stent-artery interaction and deployment outcomes: a computational study.

Medical & biological engineering & computing·2026
Same journal

Investigating a relation between amyloid beta plaque burden and accumulated neurotoxicity caused by amyloid beta oligomers.

Medical & biological engineering & computing·2026
Same journal

A robot-assisted eye positioning method with high precision and repeatability for ocular particle therapy: mechanical and geometric assessment.

Medical & biological engineering & computing·2026
Same journal

Enhanced puncture event detection for teleoperated needle insertion robotic system.

Medical & biological engineering & computing·2026
See all related articles

Related Experiment Video

Updated: Jun 24, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

544

Machine learning models' assessment: trust and performance.

S Sousa1, S Paredes2,3, T Rocha1,4

  • 1CISUC, Center for Informatics and Systems of University of Coimbra, University of Coimbra, Pólo II, 3030-290, Coimbra, Portugal.

Medical & Biological Engineering & Computing
|June 7, 2024
PubMed
Summary
This summary is machine-generated.

Developing a novel evaluation approach, this study assesses machine learning model trust and performance simultaneously. A rule-based approach demonstrated high trust and superior performance for cardiovascular risk assessment, aiding clinical decision-making.

Keywords:
Clinical decision support systemsExplainable AIInterpretabilityTrust

More Related Videos

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.8K
Design and Analysis for Fall Detection System Simplification
08:05

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

10.7K

Related Experiment Videos

Last Updated: Jun 24, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

544
Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

6.8K
Design and Analysis for Fall Detection System Simplification
08:05

Design and Analysis for Fall Detection System Simplification

Published on: April 6, 2020

10.7K

Area of Science:

  • Medical Informatics
  • Machine Learning in Healthcare
  • Clinical Decision Support

Background:

  • Machine learning (ML) models are often black boxes, hindering trust and adoption in healthcare.
  • Lack of trust is a significant barrier to the widespread application of ML in clinical settings.
  • A simultaneous evaluation of trust and performance is needed for reliable healthcare ML tools.

Purpose of the Study:

  • To develop and validate an evaluation framework assessing both trust and performance of ML models.
  • To compare the trust and performance of various ML models against a clinical standard for cardiovascular risk stratification.
  • To identify ML models suitable for clinical application based on trust and performance metrics.

Main Methods:

  • Trust assessment incorporated model robustness, confidence intervals (95% CI), and interpretability via feature ranking comparison with clinical evidence.
  • Performance was evaluated using the geometric mean.
  • Five models (GRACE score, logistic regression, Naïve Bayes, decision trees, rule-based approach) were compared using a Portuguese cardiovascular risk dataset (N=1544).

Main Results:

  • Simultaneous assessment of trust and performance was successfully implemented.
  • The rule-based approach exhibited a high level of operational trust.
  • The rule-based approach outperformed the GRACE score in performance and enhanced physician acceptance.

Conclusions:

  • The developed evaluation approach effectively assesses ML model trust and performance concurrently.
  • The rule-based approach shows significant potential for clinical application in cardiovascular risk assessment.
  • Improved trust and performance of ML models can enhance physician acceptance and aid clinical decision-making.