Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Machines: Problem Solving II

Machines: Problem Solving II

Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.

Language and Cognition

Language and Cognition

Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.

Machines: Problem Solving I

Machines: Problem Solving I

A toggle clamp is a mechanical device commonly used for holding and clamping objects in various applications, such as woodworking, metalworking, and assembly operations. Consider a toggle clamp subjected to a force of 200 N at the handle. The vertical clamping force can be calculated, provided the dimensions of the toggle clamp are known.
The toggle clamp system is a machine structure consisting of movable, pin-connected multi-force members that form a stabilized system to transmit forces. The...

Stereotype Content Model

Stereotype Content Model

The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...

Language Development

Language Development

Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Adversarial control of synchronization in complex oscillator networks.

Chaos (Woodbury, N.Y.)·2025

Same author

Large-scale moral machine experiment on large language models.

PloS one·2025

Same author

Natural Images Allow Universal Adversarial Attacks on Medical Image Classification Using Deep Neural Networks with Transfer Learning.

Journal of imaging·2022

Same author

Universal adversarial attacks on deep neural networks for medical image classification.

BMC medical imaging·2021

Same author

Vulnerability of deep neural networks for detecting COVID-19 cases from chest X-ray images to universal adversarial attacks.

PloS one·2020

Same author

Revisiting the hypothesis of an energetic barrier to genome complexity between eukaryotes and prokaryotes.

Royal Society open science·2020

Same journal

Desert lizards modulate nutritional responses to match seasonal biological needs.

Royal Society open science·2026

Same journal

Multi-generational fidelity, ecological and social determinants of roosting in a cooperatively breeding bird (<i>Argya squamiceps</i>).

Royal Society open science·2025

Same journal

Multifaceted polarization and information reliability in climate change discussions on social media platforms.

Royal Society open science·2025

Same journal

Comparing the kinematics related to inflicted head injury between violent shaking of a 6-week-old and a 1-year-old infant surrogate.

Royal Society open science·2025

Same journal

Partner choice increases observed reciprocity-based cooperation but decreases unobserved stake-based cooperation.

Royal Society open science·2025

Same journal

Importation models for travel-related SARS-CoV-2 cases reported in Newfoundland and Labrador during the COVID-19 pandemic.

Royal Society open science·2025

See all related articles

Search research articles

Related Experiment Video

Updated: Jul 4, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

The moral machine experiment on large language models.

Kazuhiro Takemoto¹

¹Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan.

Royal Society Open Science

|February 8, 2024

Summary

This summary is machine-generated.

Large language models (LLMs) show moral judgment alignment with humans in autonomous driving scenarios, but some models exhibit distinct deviations and more uncompromising decisions compared to human preferences.

Keywords:

ChatGPT autonomous driving large language models moral machine

More Related Videos

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Published on: January 29, 2020

One Dimensional Turing-Like Handshake Test for Motor Intelligence

One Dimensional Turing-Like Handshake Test for Motor Intelligence

Published on: December 15, 2010

Related Experiment Videos

Last Updated: Jul 4, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Published on: January 29, 2020

One Dimensional Turing-Like Handshake Test for Motor Intelligence

One Dimensional Turing-Like Handshake Test for Motor Intelligence

Published on: December 15, 2010

Area of Science:

Artificial Intelligence Ethics
Human-Computer Interaction
Autonomous Systems Morality

Background:

Large language models (LLMs) are increasingly integrated into critical sectors, necessitating an understanding of their ethical decision-making.
Autonomous driving systems require robust ethical frameworks to navigate complex moral dilemmas.

Purpose of the Study:

To investigate the moral judgment tendencies of prominent LLMs using the Moral Machine framework.
To compare LLM ethical decision-making with established human preferences in simulated accident scenarios.
To identify potential discrepancies and similarities between LLM and human moral reasoning for autonomous driving applications.

Main Methods:

Utilized the Moral Machine framework to present ethical dilemmas to various LLMs.
Collected and analyzed decision-making data from GPT-3.5, GPT-4, PaLM 2, and Llama 2.
Compared LLM responses against a large dataset of human preferences.

Main Results:

LLMs and humans generally align on prioritizing human lives over animals and saving more individuals.
PaLM 2 and Llama 2 demonstrated notable deviations from human moral preferences.
Significant quantitative differences were observed, with LLMs potentially making more absolute judgments than humans.

Conclusions:

LLMs exhibit both alignment and divergence from human moral judgments in autonomous driving contexts.
Specific LLMs like PaLM 2 and Llama 2 require further ethical refinement for safety-critical applications.
Understanding these ethical frameworks is crucial for the responsible development and deployment of AI in autonomous vehicles.