Trials / Completed
CompletedNCT06911645
Diagnostic Reasoning With Customized GPT-4 Model
Evaluating the Performance of LLMs and Clinicians in Complex Diagnostic Cases: A Randomized Controlled Trial
- Status
- Completed
- Phase
- N/A
- Study type
- Interventional
- Enrollment
- 70 (actual)
- Sponsor
- Stanford University · Academic / Other
- Sex
- All
- Age
- —
- Healthy volunteers
- Accepted
Summary
This study will assess the impact of immediate access to a customized version of GPT-4, a large language model, on performance in case-based diagnostic reasoning tasks. Specifically, it will compare this approach to a two-step process where participants first use traditional diagnostic decision support tools to support their diagnostic reasoning before gaining access to the customized GPT-4 model.
Detailed description
Artificial intelligence (AI) technologies, particularly advanced large language models like OpenAI's ChatGPT, have the potential to enhance medical decision-making. While ChatGPT-4 was not specifically designed for medical applications, it has demonstrated promise in various healthcare contexts, including medical note-writing, addressing patient inquiries, and facilitating medical consultations. However, its impact on clinicians' diagnostic reasoning remains largely unknown. Clinical reasoning is a complex process that involves pattern recognition, knowledge application, and probabilistic reasoning. Integrating AI tools like ChatGPT-4 into physician workflows could help reduce clinician workload and decrease the likelihood of missed diagnoses. However, ChatGPT-4 was neither developed nor validated for diagnostic reasoning, and it may produce misleading information, including plausible but incorrect conclusions that could misguide clinicians. If not used appropriately, it may fail to improve-and could even hinder-clinical decision-making. Therefore, it is essential to study how clinicians use large language models to support clinical reasoning before integrating them into routine patient care. This study will examine how immediate access to a customized version of ChatGPT-4 impacts performance on case-based diagnostic reasoning tasks, compared to a stepwise approach. In the stepwise approach, participants will first use traditional diagnostic decision support tools to support their case reasoning before interacting with a customized ChatGPT-4 model, at which point they will have the opportunity to revise their initial answers. Participants will be randomized into different study arms and will respond to diagnostic cases by providing three differential diagnoses, along with supporting and opposing findings for each. They will also identify their top diagnosis and propose next diagnostic steps. Independent reviewers, blinded to treatment assignment, will evaluate their responses.
Conditions
Interventions
| Type | Name | Description |
|---|---|---|
| OTHER | Immediate access to customized version of GPT-4 | Group is given immediate access to a customized version of GPT-4 to support their diagnostic reasoning for each case. |
| OTHER | Access to customized version of GPT-4 following use of conventional resources | Group is first encouraged to reason through diagnostic cases with the support of conventional resources. After they submit a case's answers they are then given access to a customized version of GPT-4 and have the opportunity to change their initial answers. |
Timeline
- Start date
- 2024-12-16
- Primary completion
- 2025-01-24
- Completion
- 2025-01-24
- First posted
- 2025-04-04
- Last updated
- 2025-04-04
Locations
1 site across 1 country: United States
Source: ClinicalTrials.gov record NCT06911645. Inclusion in this directory is not an endorsement.