相关概念视频
Improving Translational Accuracy
9.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
9.0K
Detection of Gross Error: The Q Test
5.6K
When one or more data points appear far from the rest of the data, there is a need to determine whether they are outliers and whether they should be eliminated from the data set to ensure an accurate representation of the measured value. In many cases, outliers arise from gross errors (or human errors) and do not accurately reflect the underlying phenomenon. In some cases, however, these apparent outliers reflect true phenomenological differences. In these cases, we can use statistical methods...
5.6K
Language and Cognition
323
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
323
Accuracy and Errors in Hypothesis Testing
175
Hypothesis testing is a fundamental statistical tool that begins with the assumption that the null hypothesis H0 is true. During this process, two types of errors can occur: Type I and Type II. A Type I error refers to the incorrect rejection of a true null hypothesis, while a Type II error involves the failure to reject a false null hypothesis.
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...
In hypothesis testing, the probability of making a Type I error, denoted as α, is commonly set at 0.05. This significance level indicates a 5%...
175
Accuracy and Precision
8.7K
Scientists typically make repeated measurements of a quantity to ensure the quality of their findings and to evaluate both the precision and the accuracy of their results. Measurements are said to be precise if they yield very similar results when repeated in the same manner. A measurement is considered accurate if it yields a result that is very close to the true or the accepted value. Precise values agree with each other; accurate values agree with a true value. Highly accurate...
8.7K
Fundamental Attribution Error
12.8K
According to some social psychologists, people tend to overemphasize internal factors as explanations—or attributions—for the behavior of other people. They tend to assume that the behavior of another person is a trait of that person, and to underestimate the power of the situation on the behavior of others. They tend to fail to recognize when the behavior of another is due to situational variables, and thus to the person’s state. This erroneous assumption is...
12.8K
您也可能阅读
相关文章
通过共同作者、期刊和引用图与本文相关的文章。
排序
Same author
DomainDemo: a dataset of domain-sharing activities among different demographic groups on Twitter.
Scientific data·2025
Same author
Quantifying the vulnerabilities of the online public square to adversarial manipulation tactics.
PNAS nexus·2024
Same author
Identifying and characterizing superspreaders of low-credibility content on Twitter.
PloS one·2024
Same author
Measuring the Burden of Infodemics: Summary of the Methods and Results of the Fifth WHO Infodemic Management Conference.
JMIR infodemiology·2023
Same author
One Year of COVID-19 Vaccine Misinformation on Twitter: Longitudinal Study.
Journal of medical Internet research·2023
Same author
Sociodemographics and Transdiagnostic Mental Health Symptoms in SOCIAL (Studies of Online Cohorts for Internalizing Symptoms and Language) I and II: Cross-sectional Survey and Botometer Analysis.
JMIR formative research·2022
Same journal
In This Issue.
Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal
Correction for Otsuki et al., Extracellular sulfatases support cartilage homeostasis by regulating BMP and FGF signaling pathways.
Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal
Hive mind: Microbial communities and the making of memory.
Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal
Targets for disease modification in schizophrenia: New findings add to evidence for the involvement of the immune complement system.
Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal
Correction for Wang et al., The role of reduced aerosol masking from air pollutant emission reductions in recent global warming acceleration (2013-2023).
Proceedings of the National Academy of Sciences of the United States of America·2026
Same journal
Correction for Mishra, Ecology is not yet ready for AI-and why that matters.
Proceedings of the National Academy of Sciences of the United States of America·2026
从大型语言模型中检查事实信息可以降低标题辨别能力.
Matthew R DeVerna1, Harry Yaojun Yan1,2, Kai-Cheng Yang1,3
1Observatory on Social Media, Indiana University, Bloomington, IN 47408.
概括
大型语言模型 (LLM) 在事实检查在线信息方面表现有前途,但人工智能生成的事实检查不能提高用户识别准确性或分享真实新闻的能力,甚至可能是有害的.
更多相关视频
06:33Decomposing the Variance in Reading Comprehension to Reveal the Unique and Common Effects of Language and Decoding
Published on: October 11, 2018
6.7K
09:09Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
Published on: September 27, 2024
402
科学领域:
- 信息科学 信息科学 信息科学
- 人与计算机的交互
- 人工智能的人工智能
背景情况:
- 检查事实对于打击在线虚假信息至关重要.
- 由于信息量,扩大事实核查具有挑战性.
- 人工智能语言模型显示了自动事实检查的潜力.
研究的目的:
- 调查人工智能产生的事实检查对政治新闻的信念和分享意图的影响.
- 将AI事实检查与人类生成的事实检查进行比较.
- 确定人工智能事实核查的潜在危害和益处.
主要方法:
- 预先注册的随机对照实验.
- 评估了参与者对政治头条新闻的信念和分享意图.
- 利用来自一个流行的大型语言模型 (LLM) 的事实检查信息.
主要成果:
- 法律学的事实检查并没有改善标题准确性的辨别或准确新闻的共享.
- 人类产生的事实检查增强了辨别能力.
- 人工智能事实检查减少了对被错误标记为虚假的真实头条新闻的信任,增加了对不确定的虚假头条新闻的信任.
- 人工智能事实检查增加了对正确标记的真实标题的分享意图.
结论:
- 人工智能事实检查信息并没有显著提高用户识别准确性的能力.
- 人工智能事实检查可以引入特定的伤害,例如错误的信息信念.
- 人类的事实检查在提高辨别能力方面更有效.
- 需要制定政策来缓解人工智能事实检查应用程序的意外后果.
