Prospective Evaluation of a Point-of-Care Artificial Intelligence Model in Critical Care Outcomes
Overview
- Phase
- Phase 1
- Status
- Not yet recruiting
- Sponsor
- MetroWest Artificial Intelligence Research Workgroup
- Enrollment
- 1,000
- Locations
- 1
- Primary Endpoint
- Composite of Medical Errors
Overview
Brief Summary
This is a prospective, unmasked, randomized, multicenter clinical trial evaluating the impact of point-of-care large language model (LLM)-based decision support on diagnostic accuracy and clinical outcomes in adult medical intensive care unit (MICU) patients.
Consecutive adult ICU admissions at participating community hospitals (initially MetroWest Medical Center and St. Vincent Hospital) will be screened for eligibility. Eligible patients will be randomized 1:1 to standard care or an AI-assisted group. In both arms, initial evaluation and management will follow usual practice. For patients randomized to AI assistance, de-identified admission data (history and physical, labs, imaging reports, and other relevant documentation) will be formatted and submitted to a state-of-the-art LLM (ChatGPT-5) at the time of admission. The AI-generated differential diagnosis and therapeutic recommendations will be provided to the admitting team for consideration. For the standard care arm, LLM output will be generated but not shared with clinicians.
After discharge, a masked chart review will determine the "ground truth" primary diagnosis and extract outcomes including: Primary Outcome - a composite of medical errors (from time of ICU admission through day 7 of ICU stay, or ICU discharge, whichever comes first); Secondary Outcomes - 90-day mortality, ICU and hospital length of stay, and ventilator-free days.
Detailed Description
The rapid development of large language models (LLMs) such as ChatGPT has created new opportunities and risks for their use in medicine. Although early studies suggest high diagnostic accuracy in complex clinical scenarios and ICU admissions, the impact of LLMs on real-world clinical outcomes and the optimal mode of physician-AI interaction remain uncertain. Published work from our group showed that ChatGPT-4 achieved diagnostic accuracy comparable to board-certified intensivists for ICU admissions in a retrospective study. However, prospective, randomized data on clinical outcomes are lacking.
This trial will evaluate a pragmatic paradigm for integrating LLMs at the time of ICU admission (point-of-care AI). All eligible adult MICU admissions at participating sites will be prospectively randomized to: (1) standard care, or (2) AI-assisted care in which an LLM receives standardized, de-identified admission data and returns a proposed primary diagnosis, ranked differential diagnosis (up to five conditions), suggested additional information, and prioritized therapeutic interventions. Admitting clinicians in the AI-assisted arm will be asked to review and optionally incorporate the AI recommendations and will complete a brief questionnaire regarding perceived utility and any changes in diagnosis or management.
A masked clinical adjudication panel will perform longitudinal chart review to define the "ground truth" primary diagnosis and assess error rates and outcomes. The primary endpoint is a composite of medical errors. The specific time frame will be from the time of ICU admission through day 7 of ICU stay, or ICU discharge, whichever comes first. Secondary endpoints will include 90-day mortality, ICU and hospital length of stay, and ventilator-free days. Other exploratory secondary endpoints will be considered. The trial is designed to enroll approximately 1000 patients across multiple MICUs, with interim analysis at 12 months to assess feasibility, integrity, and futility. The study is minimal risk, uses de-identified data for AI queries, and does not alter standard diagnostic testing or therapeutic options.
Study Design
- Study Type
- Interventional
- Allocation
- Randomized
- Intervention Model
- Parallel
- Primary Purpose
- Treatment
- Masking
- Double (Participant, Outcomes Assessor)
Eligibility Criteria
- Ages
- 18 Years to — (Adult, Older Adult)
- Sex
- All
- Accepts Healthy Volunteers
- No
Inclusion Criteria
- •Adult patients (≥ 18 years) admitted to the medical intensive care unit (MICU) at participating hospitals.
- •Direct admissions from the emergency department or transfers from medical wards to the MICU.
- •Critically ill patients meeting local ICU admission criteria.
Exclusion Criteria
- •Transfers to the MICU from outside hospitals, operating room, or post-anesthesia care unit.
- •Age \< 18 years.
- •Incomplete or missing essential clinical information at admission (e.g., key labs or documentation not yet available).
- •Primary surgical or cardiac (e.g., STEMI) patients.
- •Pregnant or postpartum women.
Outcomes
Primary Outcomes
Composite of Medical Errors
Time Frame: From the time of ICU admission through day 7 of ICU stay or ICU discharge, whichever comes first.
Proportion of patients with at least one clinically important diagnostic or therapeutic error identified by masked chart review (e.g., missed or delayed critical diagnosis, major guideline-discordant therapy with potential for harm).
Secondary Outcomes
- 90-day All-Cause Mortality(90 days from ICU admission.)
- ICU Length of Stay(From ICU admission to ICU discharge (up to 90 days).)
- Ventilator-Free Days(Up to 28 days after ICU admission.)
- Hospital Length of Stay(From hospital admission to hospital discharge (up to 90 days).)