XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models

Omkar Thawakar, Abdelrahman Shaker, Sahal Shaji Mullappilly, Hisham Cholakkal, Rao Muhammad Anwer, Salman Khan, Jorma Laaksonen, Fahad Shahbaz Khan

Tutkimustuotos: Artikkeli kirjassa/konferenssijulkaisussaConference article in proceedingsScientificvertaisarvioitu

2 Sitaatiot (Scopus)

Abstrakti

The latest breakthroughs in large language models (LLMs) and vision-language models (VLMs) have showcased promising capabilities toward performing a wide range of tasks. Such models are typically trained on massive datasets comprising billions of image-text pairs with diverse tasks. However, their performance on task-specific domains, such as radiology, is still under-explored. While few works have recently explored LLMs-based conversational medical models, they mainly focus on text-based analysis. In this paper, we introduce XrayGPT, a conversational medical vision-language (VLMs) model that can analyze and answer open-ended questions about chest radiographs. Specifically, we align both medical visual encoder with a fine-tuned LLM to possess visual conversation abilities, grounded in an understanding of radiographs and medical knowledge. For improved alignment of chest radiograph data, we generate 217k interactive and high-quality summaries from free-text radiology reports. Extensive experiments are conducted to validate the merits of XrayGPT. To conduct an expert evaluation, certified medical doctors evaluated the output of our XrayGPT on a test subset and the results reveal that more than 70% of the responses are scientifically accurate, with an average score of 4/5. Our code and models are available at: https://github.com/mbzuai-oryx/XrayGPT.

AlkuperäiskieliEnglanti
OtsikkoBioNLP 2024 - 23rd Meeting of the ACL Special Interest Group on Biomedical Natural Language Processing, Proceedings of the Workshop and Shared Tasks
ToimittajatDina Demner-Fushman, Sophia Ananiadou, Makoto Miwa, Kirk Roberts, Junichi Tsujii
KustantajaAssociation for Computational Linguistics
Sivut440-448
Sivumäärä9
ISBN (elektroninen)979-8-89176-130-8
TilaJulkaistu - 2024
OKM-julkaisutyyppiA4 Artikkeli konferenssijulkaisussa
TapahtumaBiomedical Natural Language Processing Workshop - Bangkok, Thaimaa
Kesto: 16 elok. 202416 elok. 2024
Konferenssinumero: 23

Conference

ConferenceBiomedical Natural Language Processing Workshop
LyhennettäBioNLP
Maa/AlueThaimaa
KaupunkiBangkok
Ajanjakso16/08/202416/08/2024

Sormenjälki

Sukella tutkimusaiheisiin 'XrayGPT: Chest Radiographs Summarization using Large Medical Vision-Language Models'. Ne muodostavat yhdessä ainutlaatuisen sormenjäljen.

Siteeraa tätä