Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Improving Translational Accuracy02:07

Improving Translational Accuracy

10.4K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
10.4K
Machines: Problem Solving II01:30

Machines: Problem Solving II

310
Machines are complex structures consisting of movable, pin-connected multi-force members that work together to transmit forces. Consider a lifting tong carrying a 100 kg load. It comprises movable sections DAF and CBG linked together with member AB.
310
Language and Cognition01:27

Language and Cognition

346
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
346
Machines: Problem Solving I01:22

Machines: Problem Solving I

324
A toggle clamp is a mechanical device commonly used for holding and clamping objects in various applications, such as woodworking, metalworking, and assembly operations. Consider a toggle clamp subjected to a force of 200 N at the handle. The vertical clamping force can be calculated, provided the dimensions of the toggle clamp are known.
The toggle clamp system is a machine structure consisting of movable, pin-connected multi-force members that form a stabilized system to transmit forces. The...
324
Stereotype Content Model02:16

Stereotype Content Model

14.7K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
14.7K
Language Development01:22

Language Development

366
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
366

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Adversarial control of synchronization in complex oscillator networks.

Chaos (Woodbury, N.Y.)·2025
Same author

Large-scale moral machine experiment on large language models.

PloS one·2025
Same author

Natural Images Allow Universal Adversarial Attacks on Medical Image Classification Using Deep Neural Networks with Transfer Learning.

Journal of imaging·2022
Same author

Universal adversarial attacks on deep neural networks for medical image classification.

BMC medical imaging·2021
Same author

Vulnerability of deep neural networks for detecting COVID-19 cases from chest X-ray images to universal adversarial attacks.

PloS one·2020
Same author

Revisiting the hypothesis of an energetic barrier to genome complexity between eukaryotes and prokaryotes.

Royal Society open science·2020
Same journal

Desert lizards modulate nutritional responses to match seasonal biological needs.

Royal Society open science·2026
Same journal

Multi-generational fidelity, ecological and social determinants of roosting in a cooperatively breeding bird (<i>Argya squamiceps</i>).

Royal Society open science·2025
Same journal

Multifaceted polarization and information reliability in climate change discussions on social media platforms.

Royal Society open science·2025
Same journal

Comparing the kinematics related to inflicted head injury between violent shaking of a 6-week-old and a 1-year-old infant surrogate.

Royal Society open science·2025
Same journal

Partner choice increases observed reciprocity-based cooperation but decreases unobserved stake-based cooperation.

Royal Society open science·2025
Same journal

Importation models for travel-related SARS-CoV-2 cases reported in Newfoundland and Labrador during the COVID-19 pandemic.

Royal Society open science·2025
See all related articles

Related Experiment Video

Updated: Jul 4, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

561

The moral machine experiment on large language models.

Kazuhiro Takemoto1

  • 1Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, Iizuka, Fukuoka 820-8502, Japan.

Royal Society Open Science
|February 8, 2024
PubMed
Summary
This summary is machine-generated.

Large language models (LLMs) show moral judgment alignment with humans in autonomous driving scenarios, but some models exhibit distinct deviations and more uncompromising decisions compared to human preferences.

Keywords:
ChatGPTautonomous drivinglarge language modelsmoral machine

More Related Videos

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning
05:33

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Published on: January 29, 2020

6.0K
One Dimensional Turing-Like Handshake Test for Motor Intelligence
14:05

One Dimensional Turing-Like Handshake Test for Motor Intelligence

Published on: December 15, 2010

26.8K

Related Experiment Videos

Last Updated: Jul 4, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

561
Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning
05:33

Experimental Paradigm for Measuring the Effect of Induced Emotion on Grammar Learning

Published on: January 29, 2020

6.0K
One Dimensional Turing-Like Handshake Test for Motor Intelligence
14:05

One Dimensional Turing-Like Handshake Test for Motor Intelligence

Published on: December 15, 2010

26.8K

Area of Science:

  • Artificial Intelligence Ethics
  • Human-Computer Interaction
  • Autonomous Systems Morality

Background:

  • Large language models (LLMs) are increasingly integrated into critical sectors, necessitating an understanding of their ethical decision-making.
  • Autonomous driving systems require robust ethical frameworks to navigate complex moral dilemmas.

Purpose of the Study:

  • To investigate the moral judgment tendencies of prominent LLMs using the Moral Machine framework.
  • To compare LLM ethical decision-making with established human preferences in simulated accident scenarios.
  • To identify potential discrepancies and similarities between LLM and human moral reasoning for autonomous driving applications.

Main Methods:

  • Utilized the Moral Machine framework to present ethical dilemmas to various LLMs.
  • Collected and analyzed decision-making data from GPT-3.5, GPT-4, PaLM 2, and Llama 2.
  • Compared LLM responses against a large dataset of human preferences.

Main Results:

  • LLMs and humans generally align on prioritizing human lives over animals and saving more individuals.
  • PaLM 2 and Llama 2 demonstrated notable deviations from human moral preferences.
  • Significant quantitative differences were observed, with LLMs potentially making more absolute judgments than humans.

Conclusions:

  • LLMs exhibit both alignment and divergence from human moral judgments in autonomous driving contexts.
  • Specific LLMs like PaLM 2 and Llama 2 require further ethical refinement for safety-critical applications.
  • Understanding these ethical frameworks is crucial for the responsible development and deployment of AI in autonomous vehicles.