Scientific Writing in the Era of Large Language Models: A Computational Analysis of AI Versus Human-Created Content
View abstract on PubMed
Summary
This summary is machine-generated.Large language models (LLMs) generate scientific text with distinct linguistic features, but these are not always identifiable by human experts. AI detection tools are crucial for maintaining scientific integrity.
Area Of Science
- Artificial Intelligence
- Scientific Communication
- Computational Linguistics
Background
- Large language models (LLMs) can accelerate scientific literature synthesis but risk spreading misinformation.
- Distinguishing AI-generated from human-authored scientific text is a growing challenge.
Purpose Of The Study
- To characterize linguistic features differentiating AI-generated and human-authored scientific text.
- To evaluate the performance of AI detection tools in this task.
Main Methods
- Computational synthesis of 34 cerebrovascular essays (12 AI-generated, 22 human-authored).
- Expert review by <i>Stroke</i> editorial board members.
- Comparison of expert performance with GPTZero AI detection tool.
- Linguistic feature extraction: syntax, semantics, readability, grade level, and perplexity.
Main Results
- AI-generated text exhibited lower word count, complexity, perplexity, and readability scores.
- Human experts misclassified 31.8% of human-written essays.
- GPTZero accurately classified 100% of AI-generated and 95.5% of human-written essays, but relied on limited text segments.
Conclusions
- LLMs produce scientific content with measurable linguistic differences from human writing.
- These differences are not consistently identifiable by human experts.
- Technology-assisted AI detection tools are essential for safeguarding scientific communication integrity.
Related Concept Videos
An important concept in studying metabolism and energy is that of chemical equilibrium. Most chemical reactions are reversible. They can proceed in both directions, releasing energy into their environment in one direction, and absorbing it from the environment in the other direction. The same is true for the chemical reactions involved in cell metabolism, such as the breaking down and building up of proteins into and from individual amino acids, respectively. Reactants within a closed system...
Cognitive psychology is the field of psychology dedicated to examining how people think. It attempts to explain how and why we think the way we do by studying the interactions among human thinking, emotion, creativity, language, and problem-solving, as well as other cognitive processes. Cognitive psychology studies how information is processed and manipulated in remembering, thinking, and knowing.
This field emerged in the mid-20th century, following a period dominated by behaviorism, which...
When we take repeated measurements on the same or replicated samples, we will observe inconsistencies in the magnitude. These inconsistencies are called errors. To categorize and characterize these results and their errors, the researcher can use statistical analysis to determine the quality of the measurements and/or suitability of the methods.
One of the most commonly used statistical quantifiers is the mean, which is the ratio between the sum of the numerical values of all results and the...
The scientific method is a detailed, empirical problem-solving process used by biologists and other scientists. This iterative approach involves formulating a question based on observation, developing a testable potential explanation for the observation (called a hypothesis), making and testing predictions based on the hypothesis, and using the findings to create new hypotheses and predictions.
Generally, predictions are tested using carefully-designed experiments. Based on the outcome of these...
The chi-square test is a statistical hypothesis test. It is used to check whether there is a significant difference between an expected value and an observed value. In the context of genetics, it enables us to either accept or reject a hypothesis, based on how much the observed values deviate from the expected values.
The chi-square test was developed by Pearson in 1990.
The first step of performing a Chi-square analysis is to establish a null hypothesis, which assumes that there is no real...
Overview
Genetic engineering is the process of modifying an organism’s DNA to introduce new, desirable traits. Many organisms, from bacteria to plants and animals, have been genetically modified for academic, medical, agricultural, and industrial purposes. While genetic engineering has definite benefits, ethical concerns surround modifying humans and our food supply.
Scientists can Deliberately Modify an Organism’s Genome
Genetic engineering is possible because the genetic...

