MedPath

Human Algorithm Interactions for Acute Respiratory Failure Diagnosis

Not Applicable
Completed
Conditions
Acute Respiratory Failure
Registration Number
NCT06098950
Lead Sponsor
University of Michigan
Brief Summary

Artificial intelligence (AI) shows promising in identifying abnormalities in clinical images. However, systematically biased AI models, where a model makes inaccurate predictions for entire subpopulations, can lead to errors and potential harms. When shown incorrect predictions from an AI model, clinician diagnostic accuracy can be harmed. This study aims to study the effectiveness of providing clinicians with image-based AI model explanations when provided AI model predictions to help clinicians better understand the logic of an AI model's prediction. It will evaluate whether providing clinicians with AI model explanations can improve diagnostic accuracy and help clinicians catch when models are making incorrect decisions. As a test case, the study will focus on the diagnosis of acute respiratory failure because determining the underlying causes of acute respiratory failure is critically important for guiding treatment decisions but can be clinically challenging.

To determine if providing AI explanations can improve clinician diagnostic accuracy and alleviate the potential impact of showing clinicians a systematically biased AI model, a randomized clinical vignette survey study will be conducted. During the survey, study participants will be shown clinical vignettes of patients hospitalized with acute respiratory failure, including the patient's presenting symptoms, physical exam, laboratory results, and chest X-ray. Study participants will then be asked to assess the likelihood that heart failure, pneumonia and/or Chronic Obstructive Pulmonary Disease (COPD) is the underlying diagnosis. During specific vignettes in the survey, participants will also be shown standard or systematically biased AI models that provide an estimate the likelihood that heart failure, pneumonia and/or COPD is the underlying diagnosis. Clinicians will be randomized see AI predictions alone or AI predictions with explanations when shown AI models. This survey design will allow for testing the hypothesis that systematically biased models would harm clinician diagnostic accuracy, but commonly used image-based explanations would help clinicians partially recover their performance.

Detailed Description

Not available

Recruitment & Eligibility

Status
COMPLETED
Sex
All
Target Recruitment
457
Inclusion Criteria
  • Physicians, nurse practitioners, and physician assistants that care for patients with acute respiratory failure as part of their clinical practice
Exclusion Criteria
  • Physicians, nurse practitioners, and physician assistants that only provide patient care in outpatient settings

Study & Design

Study Type
INTERVENTIONAL
Study Design
PARALLEL
Primary Outcome Measures
NameTimeMethod
Participant diagnostic accuracy across clinical vignette settingsDay 0

Diagnostic accuracy is defined as the number of correct diagnostic assessments over the total number of diagnostic assessments. After reviewing each individual patient clinical vignette within the survey, participants will be asked to make three separate diagnostic assessments for each clinical vignette, one for heart failure, pneumonia, and COPD. If the participant's assessment agrees with the reference label for each vignette, the diagnostic assessment is considered correct. Diagnostic assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant diagnostic accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).

Secondary Outcome Measures
NameTimeMethod
Diagnosis specific diagnostic accuracy across clinical vignette settingsDay 0

Diagnostic accuracy specific to heart failure, pneumonia, and COPD across vignette settings

Treatment Selection Accuracy across clinical vignette settingsDay 0

Treatment selection accuracy is defined as whether the participant choose the correct treatment for the patient in the clinical vignette, and could choose any combination of steroids, antibiotics, Intravenous (IV) diuretics, or none of these treatments for the patient. Treatment selection assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant treatment selection accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).

Trial Locations

Locations (1)

University of Michigan

🇺🇸

Ann Arbor, Michigan, United States

University of Michigan
🇺🇸Ann Arbor, Michigan, United States

MedPath

Empowering clinical research with data-driven insights and AI-powered tools.

© 2025 MedPath, Inc. All rights reserved.