OphGLM: Training an Ophthalmology Large Language-and-Vision Assistant based on Instructions and Dialogue

06/21/2023
by Weihao Gao, et al.

Large multimodal language models (LMMs) have achieved significant success in general domains. However, because medical images and text differ substantially from general web content, the performance of LMMs in medical scenarios is limited. In ophthalmology, clinical diagnosis relies on multiple modalities of medical images, yet multimodal ophthalmic large language models have not been explored to date. In this paper, we study and construct an ophthalmic large multimodal model. First, we use fundus images as an entry point to build a disease assessment and diagnosis pipeline that performs common ophthalmic disease diagnosis and lesion segmentation. Then, we establish a new ophthalmic multimodal instruction-following and dialogue fine-tuning dataset based on disease-related knowledge data and publicly available real-world medical dialogue. We introduce visual ability into the large language model to complete the ophthalmic large language-and-vision assistant (OphGLM). Our experimental results demonstrate that the OphGLM model performs exceptionally well, and it has the potential to revolutionize clinical applications in ophthalmology. The dataset, code, and models will be made publicly available at https://github.com/ML-AILab/OphGLM.
