Related Concept Videos
Improving Translational Accuracy
Detection of Gross Error: The Q Test
Language and Cognition
Accuracy and Errors in Hypothesis Testing
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...
Accuracy and Precision
Fundamental Attribution Error
You might also read
Related Articles
Articles linked to this work by shared authors, journal, and citation graph.
DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter.
Quantifying the vulnerabilities of the online public square to adversarial manipulation tactics.
Identifying and characterizing superspreaders of low-credibility content on Twitter.
Measuring the Burden of Infodemics: Summary of the Methods and Results of the Fifth WHO Infodemic Management Conference.
One Year of COVID-19 Vaccine Misinformation on Twitter: Longitudinal Study.
Sociodemographics and Transdiagnostic Mental Health Symptoms in SOCIAL (Studies of Online Cohorts for Internalizing Symptoms and Language) I and II: Cross-sectional Survey and Botometer Analysis.
In This Issue.
Correction for Otsuki et al., Extracellular sulfatases support cartilage homeostasis by regulating BMP and FGF signaling pathways.
Hive mind: Microbial communities and the making of memory.
Targets for disease modification in schizophrenia: New findings add to evidence for the involvement of the immune complement system.
Correction for Wang et al., The role of reduced aerosol masking from air pollutant emission reductions in recent global warming acceleration (2013-2023).
Correction for Mishra, Ecology is not yet ready for AI-and why that matters.
Related Experiment Video
Updated: Jun 5, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
Published on: December 6, 2024
Fact-checking information from large language models can decrease headline discernment.
Matthew R DeVerna1, Harry Yaojun Yan1,2, Kai-Cheng Yang1,3
1Observatory on Social Media, Indiana University, Bloomington, IN 47408.
Large language models (LLMs) show promise in fact-checking online information, but AI-generated fact checks do not improve users' ability to discern accuracy or share true news, and can even be harmful.
More Related Videos
06:33Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding
Published on: October 11, 2018
09:09Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
Published on: September 27, 2024
Area of Science:
- Information Science
- Human-Computer Interaction
- Artificial Intelligence
Background:
- Fact-checking is crucial for combating online misinformation.
- Scaling fact-checking is challenging due to information volume.
- AI language models show potential for automated fact-checking.
Purpose of the Study:
- Investigate the impact of AI-generated fact-checks on belief and sharing intent for political news.
- Compare AI fact-checks with human-generated fact-checks.
- Identify potential harms and benefits of AI fact-checking.
Main Methods:
- Preregistered randomized controlled experiment.
- Assessed participants' belief and sharing intent for political headlines.
- Utilized fact-checking information from a popular large language model (LLM).
Main Results:
- LLM fact-checks did not improve headline accuracy discernment or sharing of accurate news.
- Human-generated fact-checks enhanced discernment.
- AI fact-checks decreased belief in true headlines mislabeled as false and increased belief in uncertain false headlines.
- AI fact-checking increased sharing intent for correctly labeled true headlines.
Conclusions:
- AI fact-checking information does not significantly improve users' ability to discern accuracy.
- AI fact-checking can introduce specific harms, such as misinformed beliefs.
- Human fact-checks are more effective in enhancing discernment.
- Policies are needed to mitigate unintended consequences of AI fact-checking applications.