Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Accuracy, limits, and approximation

Accuracy, limits, and approximation

Accuracy, limits, and approximations are common in many fields, especially in engineering calculations. These concepts are imperative for ensuring that a given value is as close as possible to its true value.
Accuracy is defined as the closeness of the measured value to the true or actual value. In engineering mechanics, repeated measurements are taken during theoretical or experimental analyses to ensure that the result is precise and accurate.
The accuracy of any solution is based on the...

Hindsight Biases

Hindsight Biases

Hindsight bias leads you to believe that the event you just experienced was predictable, even though it really wasn’t. In other words, you knew all along that things would turn out the way they did. Can you relate this to the phrase "Hindsight is 20/20" now?

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Language and Cognition

Language and Cognition

Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.

Accuracy and Errors in Hypothesis Testing

Accuracy and Errors in Hypothesis Testing

Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5% chance...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Mendelian Randomization and Single-Cell RNA Sequencing Reveal CKAP4 and PFDN5 as Tumor Cell-Specific Causal Genes for Glioblastoma.

International journal of general medicine·2026

Same author

Youth Empowered Self-Care: a direct-to-participant intervention to promote mental well-being.

BMC public health·2026

Same author

A Strong Adhesive with High Switching Ratio Achieved by Phototriggered Azobenzene-Terminated Hyperbranched Polymer.

ACS applied materials & interfaces·2026

Same authorSame journal

Interpretable Failure Detection with Human-Level Concepts.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

Same author

Functional and structural insights into a cold-active GH64 laminaripentaose-producing β-1,3-glucanase from Candidatus saccharibacteria.

Enzyme and microbial technology·2026

Same author

Controlled release of chlorine dioxide from α-cyclodextrin complexes embedded in poly(lactic acid)/poly(butylene adipate-<i>co</i>-butylene terephthalate) films for sustainable food packaging.

Food chemistry: X·2026

Same journal

ChatCLIDS: Simulating Persuasive AI Dialogues to Promote Closed-Loop Insulin Adoption in Type 1 Diabetes Care.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

Same journal

<i>OrgaCast</i>: A Trustworthy Spatiotemporal Diffusion Model for Fluorescence Organoid Forecasting.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

Same journal

Apo2Mol: 3D Molecule Generation via Dynamic Pocket-Aware Diffusion Models.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

Same journal

iDT-diet: Toward Personalized Health Forecasting-An Intelligent Digital Twin Model for Diet-Influenced Biomarker Trajectories (Student Abstract).

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

Same journal

<math><mi>Δ</mi> <mi>t</mi></math> -Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 2, 2026

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Published on: January 23, 2017

Beyond Accuracy: On the Effects of Fine-tuning Towards Vision-Language Model's Prediction Rationality.

Qitong Wang¹, Tang Li¹, Kien X Nguyen¹

¹DeepREAL Lab, Department of Computer & Information Sciences, University of Delaware.

Proceedings of the ... AAAI Conference on Artificial Intelligence. AAAI Conference on Artificial Intelligence

|June 1, 2026

Summary

This summary is machine-generated.

Fine-tuning Vision-Language Models (VLMs) can improve accuracy but may rely on invalid evidence. New metrics reveal that while fine-tuned VLMs are more accurate with valid evidence, their trustworthiness requires careful evaluation.

Related Experiment Videos

Last Updated: Jun 2, 2026

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Measuring Attention and Visual Processing Speed by Model-based Analysis of Temporal-order Judgments

Published on: January 23, 2017

Area of Science:

Computer Vision
Artificial Intelligence
Machine Learning

Background:

Vision-Language Models (VLMs) like CLIP are widely used.
Fine-tuning VLMs is common in safety-critical domains.
Prediction rationality (correctness and valid evidence) is vital in these domains.

Purpose of the Study:

Investigate the impact of fine-tuning on VLM prediction rationality.
Introduce novel metrics: Prediction Trustworthiness and Inference Reliability.

Main Methods:

Conducted extensive experiments across various settings.
Evaluated fine-tuned VLMs using the proposed metrics.
Assessed model performance under distributional shifts.

Main Results:

Fine-tuning improved prediction accuracy but sometimes relied on invalid evidence.
Fine-tuned VLMs showed higher accuracy when using valid evidence.
Findings remained consistent across different settings and shifts.

Conclusions:

Standard fine-tuning may decrease VLM trustworthiness by increasing reliance on invalid evidence.
Valid evidence identification is key for reliable predictions from fine-tuned VLMs.
Research offers new insights into VLM fine-tuning for critical applications.