Trials / Completed
CompletedNCT07179861
Comparing Artificial Intelligence and Physicians: A Vignette-Based Study in Pediatric Clinical Decision-Making
A Prospective, Cross-Sectional, Vignette-Based Observational Study Comparing Clinical Decision-Making Performance of Pediatriciansand AI Models
- Status
- Completed
- Phase
- —
- Study type
- Observational
- Enrollment
- 30 (actual)
- Sponsor
- Haseki Training and Research Hospital · Academic / Other
- Sex
- All
- Age
- 28 Years – 40 Years
- Healthy volunteers
- Accepted
Summary
This study evaluates how well anonymized artificial-intelligence (AI) tools perform on standardized pediatric case vignettes and whether showing AI suggestions can improve clinicians' answers. About 30 board-certified/eligible pediatric specialists at a single hospital complete a one-time session. Participants are randomized to two groups. Group A (n≈15): physicians answer each vignette once. Group B (n≈15): physicians answer and rate confidence (1-10), then review anonymized suggestions from five different AI tools (tool names not shown) and may keep or change their answer; changes and confidence are recorded. Primary focus: measure AI performance (diagnostic accuracy, medication-dosing accuracy, interpretation accuracy) overall and by difficulty tier, and record AI response time. Secondary focus: quantify how AI suggestions affect human performance (change in accuracy, direction of change, confidence shift, and time). No patients or biospecimens are involved; risks are minimal (time and possible discomfort with performance review). Findings may inform safe, evidence-based ways to use AI alongside clinicians in pediatrics.
Conditions
- Artificial Intelligence (AI) in Diagnosis
- Decision Support Systems, Clinical
- Clinical Decision-making
- Pediatrics
Interventions
| Type | Name | Description |
|---|---|---|
| OTHER | AI Suggestions (Anonymized 5-tool panel) | What: Display of AI-generated suggestions for each vignette, aggregated from five large language model tools (names not shown to participants). When/Who: Shown only in Group 2, after the physician's initial answer and confidence score. Purpose: Measure AI performance (primary) and quantify the effect of AI suggestions on physicians' answers (secondary). Applies to: Group 2. |
| OTHER | Confidence Rating Task (1-10 Likert) | What: Self-rated confidence for the initial answer on a 1-10 scale. When/Who: Group 2 before viewing AI suggestions. Purpose: Quantify confidence changes pre- vs post-AI and relate confidence to correctness. Applies to: Group 2. |
Timeline
- Start date
- 2025-08-27
- Primary completion
- 2025-09-10
- Completion
- 2025-09-11
- First posted
- 2025-09-18
- Last updated
- 2025-09-23
Locations
1 site across 1 country: Turkey (Türkiye)
Source: ClinicalTrials.gov record NCT07179861. Inclusion in this directory is not an endorsement.