Enhancing Readability of Lay Abstracts and Summaries for Medical Knowledge Using Generative Artificial Intelligence (BRIDGE AI 3)

Phase 2 | Completed | NCT07568444

University of Southern California | 120 enrolled

Overview

This trial tests whether AI can help make medical information clear and readable. Many patients struggle to find medical information from verified sources that is easy to read and understand. The study tests whether an AI tool can help health providers write clear text for patients faster than they do now. Health providers are split at random into two groups: one uses the AI tool and one does not. The trial measures how clear the text is, how correct it is, and how much time is saved. The aim is to see whether AI can close the gap between complex research and what patients can grasp.

This study evaluates whether a generative artificial intelligence (AI) tool can improve the readability and accessibility of lay summaries derived from scientific medical abstracts. Many patients encounter difficulty understanding medical literature due to technical language and complexity, which can limit informed decision-making and engagement with healthcare information. The BRIDGE-AI (Provider Perspective) initiative aims to address this gap by enabling healthcare professionals and researchers to generate patient-friendly summaries of scientific content using AI-assisted tools. The intervention leverages a generative AI framework (pub2people) designed to translate complex medical terminology into language that is understandable to a general audience.

In this randomized controlled study, participants with experience in scientific publishing will be assigned to either an AI-assisted group or a control group using conventional methods. Participants will be asked to transform scientific abstracts into layperson-friendly summaries. The study compares AI-assisted and manually generated outputs in terms of readability, accuracy, and efficiency.

The primary objective is to determine whether AI-assisted generation improves the readability of lay summaries compared to standard approaches. Secondary objectives include evaluating the accuracy of AI-generated summaries relative to source material and assessing potential time savings associated with AI use. This study contributes to ongoing efforts to improve health communication by evaluating scalable tools that may enhance the translation of complex medical information into patient-accessible formats.
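The random assignment described above can be illustrated with a minimal sketch. The record does not state the randomization method or the allocation ratio, so simple 1:1 randomization, the seed value, and the participant IDs below are all assumptions made for illustration only.

```python
import random


def randomize(participant_ids, seed):
    """Shuffle participants and split them into two equal-sized arms.

    Assumes simple 1:1 randomization; the study's actual procedure
    (for example blocked or stratified) is not given in the record.
    """
    rng = random.Random(seed)          # seeded so the allocation is reproducible
    shuffled = list(participant_ids)   # copy so the caller's list is untouched
    rng.shuffle(shuffled)
    half = len(shuffled) // 2
    return {"ai_assisted": shuffled[:half], "control": shuffled[half:]}


# Hypothetical IDs for the 120 enrolled participants.
ids = [f"P{n:03d}" for n in range(1, 121)]
arms = randomize(ids, seed=2024)
print(len(arms["ai_assisted"]), len(arms["control"]))  # 60 60
```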

Study Type

INTERVENTIONAL

Allocation

RANDOMIZED

Purpose

Outcomes

Primary Outcomes

Readability Change

Flesch Reading Ease Score. Description: Measures text readability based on sentence length and syllables per word. Scale: 0 to 100. Interpretation: Higher scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
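For reference, the Flesch Reading Ease score can be computed from three counts using the standard published formula. The sketch below takes the counts as inputs rather than deriving them from text, because syllable counting varies between tools and the record does not say which tool the study uses. The example counts are hypothetical.

```python
def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease from pre-computed counts (higher = easier to read)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    words_per_sentence = words / sentences
    syllables_per_word = syllables / words
    return 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word


# Hypothetical counts for one lay summary: 100 words, 5 sentences, 150 syllables.
score = flesch_reading_ease(words=100, sentences=5, syllables=150)
print(round(score, 1))  # 59.6
```

The raw formula can fall slightly outside 0 to 100 for extreme text, so tools that report a strict 0 to 100 scale typically clamp the result.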

Readability Change

Flesch-Kincaid Grade Level. Description: Estimates U.S. school grade level required to understand the text. Scale: Typically ranges from approximately 0 to 18+. Interpretation: Lower scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
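The Flesch-Kincaid Grade Level uses the same inputs as the Flesch Reading Ease score but maps them to a U.S. grade level. A minimal sketch of the standard formula follows; the counts are hypothetical and the study's actual scoring tool is not named in the record.

```python
def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level from pre-computed counts (lower = easier)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# Hypothetical counts: 100 words, 5 sentences, 150 syllables.
grade = flesch_kincaid_grade(words=100, sentences=5, syllables=150)
print(round(grade, 2))  # 9.91
```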

Readability Change

Gunning Fog Index. Description: Estimates years of formal education needed to understand the text on first reading. Scale: Typically 0 to 20+. Interpretation: Lower scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
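The Gunning Fog Index combines average sentence length with the share of "complex" words, conventionally those with three or more syllables. The sketch below applies the standard formula to pre-computed counts. How complex words are identified (for example, whether proper nouns are excluded) depends on the tool and is not specified in the record. The counts are hypothetical.

```python
def gunning_fog(words: int, sentences: int, complex_words: int) -> float:
    """Gunning Fog Index from pre-computed counts (lower = easier)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    average_sentence_length = words / sentences
    percent_complex = 100 * complex_words / words
    return 0.4 * (average_sentence_length + percent_complex)


# Hypothetical counts: 100 words, 5 sentences, 10 words with 3+ syllables.
fog = gunning_fog(words=100, sentences=5, complex_words=10)
print(round(fog, 1))  # 12.0
```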

Readability Change

SMOG Index (Simple Measure of Gobbledygook). Description: Estimates years of education required to comprehend the text. Scale: Typically 0 to 20+. Interpretation: Lower scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
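The SMOG Index depends only on the number of polysyllabic words (three or more syllables) and the number of sentences. The standard formula is normalized to a 30-sentence sample. A minimal sketch with hypothetical counts:

```python
import math


def smog_index(polysyllables: int, sentences: int) -> float:
    """SMOG Index from pre-computed counts (lower = easier)."""
    if sentences <= 0:
        raise ValueError("sentences must be positive")
    if polysyllables < 0:
        raise ValueError("polysyllables must not be negative")
    # Scale the polysyllable count to a 30-sentence sample before the square root.
    return 1.0430 * math.sqrt(polysyllables * (30 / sentences)) + 3.1291


# Hypothetical counts: 25 polysyllabic words across 30 sentences.
smog = smog_index(polysyllables=25, sentences=30)
print(round(smog, 2))  # 8.34
```

Lay summaries are often shorter than 30 sentences, in which case the score is an extrapolation and should be read with that in mind.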

Readability Change

Coleman-Liau Index. Description: Readability formula based on characters per word and sentence length. Scale: Typically 0 to 18+ (grade level equivalent). Interpretation: Lower scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
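Because the Coleman-Liau Index needs no syllable counts, it can be sketched directly from raw text. The tokenization below (regex word matching, sentence ends at `.`, `!`, or `?`) is a simplifying assumption and may differ from the tool used in the study. The sample sentence is invented for illustration.

```python
import re


def coleman_liau_index(text: str) -> float:
    """Coleman-Liau Index from raw text using a simple tokenizer (lower = easier)."""
    words = re.findall(r"[A-Za-z0-9']+", text)
    sentences = re.findall(r"[.!?]+", text)   # each run of end punctuation = one sentence
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    letters = sum(ch.isalpha() for ch in text)
    letters_per_100_words = letters / len(words) * 100
    sentences_per_100_words = len(sentences) / len(words) * 100
    return 0.0588 * letters_per_100_words - 0.296 * sentences_per_100_words - 15.8


# Invented sample: 46 letters, 9 words, 2 sentences.
sample = "The drug lowered blood pressure. Side effects were mild."
print(round(coleman_liau_index(sample), 1))  # 7.7
```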

Readability Change

Automated Readability Index (ARI). Description: Estimates grade level required for comprehension using character and word counts. Scale: Typically 0 to 14+. Interpretation: Lower scores indicate easier readability (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
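The ARI is also computable from raw text, using characters per word and words per sentence. As a hedged sketch, the function below counts letters and digits as characters and uses a simple regex tokenizer. Both choices are assumptions, and other implementations may count slightly differently. The sample sentence is invented.

```python
import re


def automated_readability_index(text: str) -> float:
    """ARI from raw text using a simple tokenizer (lower = easier)."""
    words = re.findall(r"[A-Za-z0-9']+", text)
    sentences = re.findall(r"[.!?]+", text)   # each run of end punctuation = one sentence
    if not words or not sentences:
        raise ValueError("text must contain at least one word and one sentence")
    characters = sum(ch.isalnum() for ch in text)  # letters and digits only
    return (4.71 * (characters / len(words))
            + 0.5 * (len(words) / len(sentences))
            - 21.43)


# Invented sample: 46 characters, 9 words, 2 sentences.
sample_text = "The drug lowered blood pressure. Side effects were mild."
print(round(automated_readability_index(sample_text), 1))  # 4.9
```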

Secondary Outcomes

Time Saving

Evaluates the time saved by using generative AI (GAI) compared with traditional methods for writing layperson abstracts and summaries. Time will be recorded in hours, minutes, and seconds and reported in minutes. We will collect and compare the total time spent drafting the complete layperson abstract and summaries, as well as the time spent on each individual section: background, methods, results, conclusion, and short summaries. The comparison is between summaries created by humans alone and those created with GAI assistance.

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
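Converting recorded times to minutes and comparing the two groups might look like the sketch below. The record does not specify the statistical comparison, so a plain difference in mean drafting time is an assumption, and all timing values are hypothetical.

```python
from statistics import mean


def to_minutes(hours: int, minutes: int, seconds: int) -> float:
    """Convert a recorded hours/minutes/seconds duration to minutes."""
    return hours * 60 + minutes + seconds / 60


def mean_time_saved(manual_minutes, assisted_minutes) -> float:
    """Mean manual drafting time minus mean AI-assisted drafting time, in minutes."""
    return mean(manual_minutes) - mean(assisted_minutes)


# Hypothetical total drafting times for two participants per group.
manual = [to_minutes(0, 42, 30), to_minutes(1, 5, 0)]     # 42.5 and 65.0 minutes
assisted = [to_minutes(0, 20, 0), to_minutes(0, 27, 30)]  # 20.0 and 27.5 minutes
print(mean_time_saved(manual, assisted))  # 30.0
```

The same two functions apply unchanged to per-section times (background, methods, results, conclusion, short summaries).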

Correctness and meaning retention

Accuracy Score of Layperson Abstract Sections. Description: Degree to which each section (Background, Methods, Results, Conclusion, Short Summary) reflects key information from the original scientific abstract. Scale: 5-point Likert scale (1 = very inaccurate, 5 = highly accurate). Assessment Method: Two independent reviewers score each section. Interpretation: Higher scores indicate better accuracy (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.
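With two reviewers scoring each section on a 1 to 5 scale, the two ratings have to be combined or reconciled somehow. The record does not say how, so the sketch below simply averages them per section as one possible approach. The reviewer scores are hypothetical.

```python
SECTIONS = ("Background", "Methods", "Results", "Conclusion", "Short Summary")


def combine_reviewer_scores(reviewer_a: dict, reviewer_b: dict) -> dict:
    """Average two reviewers' 1-5 Likert scores per section (assumed rule)."""
    combined = {}
    for section in SECTIONS:
        a, b = reviewer_a[section], reviewer_b[section]
        if not (1 <= a <= 5 and 1 <= b <= 5):
            raise ValueError(f"scores for {section} must be between 1 and 5")
        combined[section] = (a + b) / 2
    return combined


# Hypothetical accuracy ratings for one layperson abstract.
reviewer_a = {"Background": 5, "Methods": 4, "Results": 4, "Conclusion": 5, "Short Summary": 3}
reviewer_b = {"Background": 4, "Methods": 4, "Results": 3, "Conclusion": 5, "Short Summary": 4}
section_scores = combine_reviewer_scores(reviewer_a, reviewer_b)
print(section_scores["Background"], section_scores["Methods"])  # 4.5 4.0
```

The same structure would work for the completeness and clarity ratings, which use the same sections and scale.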

Correctness and meaning retention

Completeness Score of Layperson Abstract Sections. Description: Extent to which essential information from the original abstract is included in each section. Scale: 5-point Likert scale (1 = very incomplete, 5 = fully complete). Assessment Method: Two independent reviewers evaluate each section. Interpretation: Higher scores indicate greater completeness (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.

Correctness and meaning retention

Clarity Score for Layperson Readability. Description: Evaluates simplicity, avoidance of jargon, and coherence for lay audiences. Scale: 5-point Likert scale (1 = very unclear, 5 = very clear and understandable). Assessment Method: Two independent reviewers evaluate each section. Interpretation: Higher scores indicate better clarity (better outcome).

Time frame: The assessment will be conducted immediately after the study closes, which will occur 4 weeks after enrollment.