Theory of Attribution II: Kelley's Covariation Theory
Fundamental Attribution Error
You might also read
Articles linked to this work by shared authors, journal, and citation graph.
Updated: Jun 7, 2026

Eye Tracking During A Complex Aviation Task For Insights Into Information Processing
Published on: April 4, 2025
Mark Boyer1, Adrian Robinson1, Torin K Clark1
1Department of Aerospace Engineering, University of Colorado-Boulder, Boulder, CO, USA.
This study investigates how different ways of explaining AI decisions influence human performance, trust, and workload when working with an autonomous agent in a simulated space rover mission. The researchers developed a new framework to compare these explanation methods and found that combining global and contrastive information leads to better outcomes for human-AI teams.
Area of Science:
Background:
Prior research has shown that artificial intelligence systems often lack transparency for human operators. This gap motivated investigations into how automated agents communicate their decision-making processes to users. No prior work had resolved how specific communication styles impact high-stakes environments like space missions. That uncertainty drove the need for realistic simulations involving dual-task demands. It was already known that trust calibration remains a persistent challenge in human-autonomy teaming. Designers frequently struggle to balance system performance with operator cognitive load. This study addresses the lack of standardized metrics for evaluating diverse explanation strategies. Understanding these dynamics is vital for future deep-space exploration missions.
Purpose Of The Study:
The aim of this study is to evaluate how different explanation types in an Explainable AI system affect human-autonomy teaming performance. Researchers sought to determine the impact of these explanations on workload, trust, and situation awareness. The study addresses the challenge of maintaining effective collaboration in dynamic, high-taskload environments like space exploration. Motivation for this work stems from the need to calibrate human trust in autonomous agents. No prior research had fully explored how various communication styles influence operator efficiency in these specific settings. The investigators also intended to introduce a new, holistic evaluation method for comparing XAI systems. This framework is designed to help designers navigate the complex tradeoffs inherent in human-centered system development. Ultimately, the study provides evidence-based guidance for creating more effective AI teammates for future missions.
Main Methods:
Review approach involved a controlled experiment with thirty-one participants completing eighteen distinct trials. The team employed a dual-task simulation requiring manual rover navigation alongside autonomous agent supervision. Researchers manipulated the presentation of global, contrastive, and deductive information provided by the AI. Performance incentives were integrated to ensure high levels of engagement throughout the testing process. The investigators gathered data on workload, trust, and situation awareness to assess the impact of these variables. A holistic evaluation framework was developed to synthesize these diverse metrics into a single comparative model. This approach allowed for the systematic analysis of how different explanation formats influence human behavior. The study design focused on capturing the complexities of high-taskload environments typical of future space missions.
Main Results:
Key findings from the literature reveal that explanation type significantly influences manual performance with a p-value of 0.0003. Autonomy performance also showed significant variance across conditions at p<0.0001. Team performance metrics followed this trend with a p-value of 0.0001. Workload levels were significantly impacted by the communication style at p<0.0001. Trust ratings similarly varied based on the explanation provided at p<0.0001. Participant preference for specific explanation types reached significance at p=0.001. Conversely, situation awareness did not show a statistically significant difference across conditions, yielding a p-value of 0.41. Participants performed better when receiving their preferred explanation type, which was confirmed at p=0.049.
Conclusions:
The researchers propose that the style of information provided by an agent significantly alters operator efficiency and subjective experience. Synthesis and implications suggest that designers should prioritize specific combinations of feedback to optimize team output. The authors claim that a unified evaluation framework allows for a clearer understanding of design tradeoffs. Their findings indicate that contrastive and global formats are superior for user preference and task success. The study demonstrates that situation awareness does not necessarily change based on the provided explanation format. These results highlight the importance of human-centered testing in complex operational settings. The authors conclude that tailoring communication strategies is necessary for effective collaboration between humans and machines. Future design efforts should focus on these multi-metric approaches to improve overall system reliability.
The researchers propose that combining global and contrastive explanations improves team performance and user preference. In contrast, deductive explanations were less effective, as participants demonstrated higher success rates when receiving their preferred global or contrastive feedback types.
The study utilized a dual-task simulator where participants performed manual rover driving while simultaneously supervising an autonomous exploration agent. This environment was designed to mimic the high-taskload conditions of spaceflight, requiring rapid decision-making from the human operator.
A standardized, multi-metric evaluation framework is necessary to compare XAI systems across diverse outcomes. This approach allows designers to quantify tradeoffs between workload, trust, and performance, which are otherwise difficult to measure in isolation.
The researchers used performance data, workload assessments, trust surveys, and situation awareness measurements. These metrics provided a holistic view of how different explanation types affected the human-AI team, allowing for a comprehensive comparison of the various communication strategies tested.
The authors observed significant effects on manual performance (p=0.0003), autonomy performance (p<0.0001), and team performance (p<0.0001). Conversely, situation awareness (p=0.41) showed no significant change, indicating that explanation type does not universally impact all cognitive outcomes.
The authors claim that designers must consider the specific explanation method for XAI in space exploration tasks. They propose that human-centered evaluation is vital for ensuring that autonomous teammates support, rather than hinder, human decision-making in demanding environments.