Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Healthcare Acceptability and Delayed Care Among Older People Living with HIV in the All of Us Program.

AIDS and behavior·2026

Same author

Culturally and linguistically adapting a transdiagnostic LGBTQ-affirming cognitive behavioral skills intervention for Vietnamese gay and bisexual men at risk for HIV: pre-adaptation qualitative interviews.

AIDS care·2025

Same author

Examining Barriers and Facilitators to Physical Activity Among a Diverse Cohort of MSM Living with HIV.

AIDS and behavior·2025

Same author

A Dialectical Behavior Therapy (DBT)-Informed mHealth Intervention to Enhance Coping Skills and Mental Health among Men Who Have Sex with Men Living with HIV in China: A Mixed-Methods Feasibility Pilot Study.

AIDS and behavior·2025

Same author

Evaluating Generative AI in Mental Health: Systematic Review of Capabilities and Limitations.

JMIR mental health·2025

Same author

Identifying Subgroups of Intersectional Stigma, Discrimination, and the Association with Mental Health Outcomes Among HIV-Positive Men Who Have Sex with Men: A Latent Class Analysis.

AIDS and behavior·2025

Same journal

Supporting Radiology Resident Education and Clinical Decision-Making With Large Language Models: Comparative Study of Reasoning Models DeepSeek-R1 and ChatGPT-o1.

JMIR AI·2026

Same journal

Patient Perceptions on the Use of Artificial Intelligence in Creating Clinical Research Documents: Survey Study.

JMIR AI·2026

Same journal

Application of Language Models for the Analysis of Adverse Drug Events in Pharmaceutical Research and Development: Scoping Review.

JMIR AI·2026

Same journal

Correction: Deep Learning for Age Estimation and Sex Prediction Using Mandibular-Cropped Cephalometric Images: Comparative Model Development and Validation Study.

JMIR AI·2026

Same journal

AI-Assisted Systematic Literature Review of the Economic Burden of Pneumococcal Disease: Development and Validation Study.

JMIR AI·2026

Same journal

Knowledge-Augmented Large Language Model for Multimodal Electronic Health Record-Based Risk Prediction: Development and Validation Study.

JMIR AI·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Mar 27, 2026

Virtual Agent for Real-Time Motivational Interviewing by Integrating Adaptive Nonverbal Behavior and Language Models

Virtual Agent for Real-Time Motivational Interviewing by Integrating Adaptive Nonverbal Behavior and Language Models

Published on: December 23, 2025

Large Language Model-Powered Diagnostic Co-Pilot ("CapyEngine") for Mental Disorders: Development, Evaluation, and

Liying Wang^1,2, Yunzhang Jiang³

¹Institute on Digital Health and Innovation, College of Nursing , Florida State University, 222 S Copeland St, Tallahassee, FL, 32306, United States, 1 (850) 644-3296.

|March 24, 2026

Summary

This summary is machine-generated.

This study developed CapyEngine, an AI diagnostic tool for mental health. While ChatGPT-4o showed higher accuracy, CapyEngine demonstrated more consistent diagnostic rankings, suggesting potential for clinical augmentation.

Keywords:

ChatGPT-4 LLM accuracy rate diagnosis large language model mental disorders

More Related Videos

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Related Experiment Videos

Last Updated: Mar 27, 2026

Virtual Agent for Real-Time Motivational Interviewing by Integrating Adaptive Nonverbal Behavior and Language Models

Virtual Agent for Real-Time Motivational Interviewing by Integrating Adaptive Nonverbal Behavior and Language Models

Published on: December 23, 2025

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Implementation of a Real-Time Psychosis Risk Detection and Alerting System Based on Electronic Health Records using CogStack

Published on: May 15, 2020

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Area of Science:

Artificial Intelligence in Healthcare
Mental Health Diagnostics
Clinical Decision Support Systems

Background:

Limited evidence exists on large language models' (LLMs) diagnostic capabilities in mental health.
The study addresses the need for AI tools to assist in mental disorder diagnosis.

Purpose of the Study:

To develop and evaluate CapyEngine, an LLM-powered tool for mental disorder diagnosis.
To compare CapyEngine's diagnostic accuracy against ChatGPT-4o and clinicians.

Main Methods:

CapyEngine was developed using LLMs, embedding models, and vector searches, with a database from DSM-5-TR.
Usability testing was conducted with mental health professionals.
Diagnostic accuracy was compared using standardized case scenarios against ChatGPT-4o and clinicians.

Main Results:

ChatGPT-4o outperformed CapyEngine and clinicians in broader diagnostic rankings (top 10 and top 5).
Clinicians showed higher accuracy than CapyEngine for the top 5 benchmark.
CapyEngine demonstrated the most consistent diagnostic ranking behavior across stringent benchmarks.

Conclusions:

ChatGPT-4o achieved higher accuracy at less stringent benchmarks.
CapyEngine's domain-specific design resulted in consistent rankings, showing promise for augmenting mental health diagnostics.
Further research is needed to evaluate AI integration into clinical workflows.