Human Algorithm Interactions for Acute Respiratory Failure Diagnosis

Not Applicable

Completed

Conditions: Acute Respiratory Failure

Interventions: Other: Artificial Intelligence model predictions without explanation
Other: Artificial intelligence model predictions with explanation
Other: AI model biased against heart failure
Other: AI model biased against COPD
Other: AI model biased against pneumonia

Registration Number: NCT06098950

Lead Sponsor: University of Michigan

Brief Summary: Artificial intelligence (AI) shows promising in identifying abnormalities in clinical images. However, systematically biased AI models, where a model makes inaccurate predictions for entire subpopulations, can lead to errors and potential harms. When shown incorrect predictions from an AI model, clinician diagnostic accuracy can be harmed. This study aims to study the effectiveness of providing clinicians with image-based AI model explanations when provided AI model predictions to help clinicians better understand the logic of an AI model's prediction. It will evaluate whether providing clinicians with AI model explanations can improve diagnostic accuracy and help clinicians catch when models are making incorrect decisions. As a test case, the study will focus on the diagnosis of acute respiratory failure because determining the underlying causes of acute respiratory failure is critically important for guiding treatment decisions but can be clinically challenging.

To determine if providing AI explanations can improve clinician diagnostic accuracy and alleviate the potential impact of showing clinicians a systematically biased AI model, a randomized clinical vignette survey study will be conducted. During the survey, study participants will be shown clinical vignettes of patients hospitalized with acute respiratory failure, including the patient's presenting symptoms, physical exam, laboratory results, and chest X-ray. Study participants will then be asked to assess the likelihood that heart failure, pneumonia and/or Chronic Obstructive Pulmonary Disease (COPD) is the underlying diagnosis. During specific vignettes in the survey, participants will also be shown standard or systematically biased AI models that provide an estimate the likelihood that heart failure, pneumonia and/or COPD is the underlying diagnosis. Clinicians will be randomized see AI predictions alone or AI predictions with explanations when shown AI models. This survey design will allow for testing the hypothesis that systematically biased models would harm clinician diagnostic accuracy, but commonly used image-based explanations would help clinicians partially recover their performance.

Detailed Description: Not available

Recruitment & Eligibility

Status: COMPLETED

Sex: All

Target Recruitment: 457

Inclusion Criteria

Physicians, nurse practitioners, and physician assistants that care for patients with acute respiratory failure as part of their clinical practice

Exclusion Criteria

Physicians, nurse practitioners, and physician assistants that only provide patient care in outpatient settings

Study & Design

Study Type: INTERVENTIONAL

Study Design: PARALLEL

Arm && Interventions

Group	Intervention	Description
AI model biased for heart failure, no AI explanation	AI model biased against heart failure	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for heart failure, no AI explanation	Artificial Intelligence model predictions without explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for COPD, no AI explanation	Artificial Intelligence model predictions without explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for pneumonia, no AI explanation	Artificial Intelligence model predictions without explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for COPD, no AI explanation	AI model biased against COPD	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for heart failure, Image-based AI explanation presented	AI model biased against heart failure	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.
AI model biased for pneumonia, no AI explanation	AI model biased against pneumonia	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will not be shown an AI explanation when shown AI model predictions.
AI model biased for heart failure, Image-based AI explanation presented	Artificial intelligence model predictions with explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against heart failure, always predicting that heart failure is present with high likelihood in patients with a body mass index (BMI) at or above 30. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.
AI model biased for pneumonia, Image-based AI explanation presented	Artificial intelligence model predictions with explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.
AI model biased for pneumonia, Image-based AI explanation presented	AI model biased against pneumonia	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against pneumonia, always predicting that pneumonia is present with high likelihood in patients 80 years or older. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.
AI model biased for COPD, Image-based AI explanation presented	Artificial intelligence model predictions with explanation	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.
AI model biased for COPD, Image-based AI explanation presented	AI model biased against COPD	Participants in this arm will be shown standard AI model predictions during 3 patient clinical vignettes within the survey and systematically biased AI model predictions during 3 clinical vignettes. When shown systematically biased AI model predictions, the model will be biased against COPD, always predicting that COPD is present with high likelihood when a pre-processing filter was applied to the patient's X-ray. Standard predictions will be shown for the other 2 diagnoses. Participants in this arm will also be shown AI explanation when shown AI model predictions.

Primary Outcome Measures

Name	Time	Method
Participant diagnostic accuracy across clinical vignette settings	Day 0	Diagnostic accuracy is defined as the number of correct diagnostic assessments over the total number of diagnostic assessments. After reviewing each individual patient clinical vignette within the survey, participants will be asked to make three separate diagnostic assessments for each clinical vignette, one for heart failure, pneumonia, and COPD. If the participant's assessment agrees with the reference label for each vignette, the diagnostic assessment is considered correct. Diagnostic assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant diagnostic accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).

Secondary Outcome Measures

Name	Time	Method
Diagnosis specific diagnostic accuracy across clinical vignette settings	Day 0	Diagnostic accuracy specific to heart failure, pneumonia, and COPD across vignette settings
Treatment Selection Accuracy across clinical vignette settings	Day 0	Treatment selection accuracy is defined as whether the participant choose the correct treatment for the patient in the clinical vignette, and could choose any combination of steroids, antibiotics, Intravenous (IV) diuretics, or none of these treatments for the patient. Treatment selection assessments will be performed while participants are completing the survey (day 0), immediately after the participant reviews the clinical vignette. Participant treatment selection accuracy will be compared across vignette settings (no AI model, standard AI model, standard AI model with explanation, biased AI model, biased AI model with explanation).

Trial Locations

Locations (1): University of Michigan
🇺🇸
Ann Arbor, Michigan, United States

Related Trials

Clinical Workflow Optimization Using Artificial Intelligence for Dermatological Conditions

Recruiting

AI Labs Group S.L

Posted 2/16/2024

Updated 3/19/2024

Reliability of Artificial Intelligence (AI)-Augmented Point-of-care Cardiac Ultrasound in the Hands of Internists

Recruiting

Sheba Medical Center

Posted 7/13/2022

Updated 1/3/2024

Comparing Artificial Intelligence for Assisted Diagnosis of Diabetic Retinopathy

Not Yet Recruiting

Zhejiang University

Posted 5/21/2024

Updated 7/9/2024

Artificial Intelligence-assisted System in Colonoscopy

Recruiting

Renmin Hospital of Wuhan University

Posted 5/9/2024

Updated 4/13/2025

Assessment of Diagnostic Adequacy of AI-assisted Point-of-Care Echocardiography Among Anesthesiology Trainees

Active, Not Recruiting

Loma Linda University

Posted 8/12/2024

Updated 5/16/2025

Computer Aided Diagnosis of Colorectal Polyps

Completed

King's College Hospital NHS Trust

Posted 8/12/2020

Updated 1/31/2022

Automatic Phenotyping of Patients on 2D Photography

Not Yet Recruiting

Imagine Institute

Posted 1/23/2024

Use of Artificial Intelligence for Clinical Assessment of Assisted Reproductive Techniques and IVF Outcomes

RecruitingNot Applicable

Weill Medical College of Cornell University

Posted 2/5/2020

Updated 1/31/2025

AI Algorithms in Prediction of ACS Based on Leukocyte Properties

Recruiting

RobotDreams GmbH

Posted 4/25/2024

Enhancing Diagnostic Accuracy in Fracture Identification on Musculoskeletal Radiographs Using Deep Learning

Completed

Carebot s.r.o.

Posted 10/16/2024

Updated 10/17/2024

Human Algorithm Interactions for Acute Respiratory Failure Diagnosis

Recruitment & Eligibility

Study & Design

Trial Locations

Related Trials

Clinical Workflow Optimization Using Artificial Intelligence for Dermatological Conditions

Reliability of Artificial Intelligence (AI)-Augmented Point-of-care Cardiac Ultrasound in the Hands of Internists

Comparing Artificial Intelligence for Assisted Diagnosis of Diabetic Retinopathy

Artificial Intelligence-assisted System in Colonoscopy

Assessment of Diagnostic Adequacy of AI-assisted Point-of-Care Echocardiography Among Anesthesiology Trainees

Computer Aided Diagnosis of Colorectal Polyps

Automatic Phenotyping of Patients on 2D Photography

Use of Artificial Intelligence for Clinical Assessment of Assisted Reproductive Techniques and IVF Outcomes

AI Algorithms in Prediction of ACS Based on Leukocyte Properties

Enhancing Diagnostic Accuracy in Fracture Identification on Musculoskeletal Radiographs Using Deep Learning

Clinical Trial Alerts

Clinical Trial Alerts