Paper Reading AI Learner

ChatCAD+: Towards a Universal and Reliable Interactive CAD using LLMs

2023-05-25 12:03:31
Zihao Zhao, Sheng Wang, Jinchen Gu, Yitao Zhu, Lanzhuju Mei, Zixu Zhuang, Zhiming Cui, Qian Wang, Dinggang Shen

Abstract

The potential of integrating Computer-Assisted Diagnosis (CAD) with Large Language Models (LLMs) in clinical applications, particularly in digital family doctor and clinic assistant roles, shows promise. However, existing works have limitations in terms of reliability, effectiveness, and their narrow applicability to specific image domains, which restricts their overall processing capabilities. Moreover, the mismatch in writing style between LLMs and radiologists undermines their practical utility. To address these challenges, we present ChatCAD+, an interactive CAD system that is universal, reliable, and capable of handling medical images from diverse domains. ChatCAD+ utilizes current information obtained from reputable medical websites to offer precise medical advice. Additionally, it incorporates a template retrieval system that emulates real-world diagnostic reporting, thereby improving its seamless integration into existing clinical workflows. The source code is available at \href{this https URL}{GitHub}. The online demo will be available soon.

Abstract (translated)

在临床应用程序中,特别是在数字家庭医生和 Clinic 助手角色中,将计算机辅助诊断(CAD)与大型语言模型(LLM)相结合的潜力表明有 promise。然而,现有的作品在可靠性、有效性和特定图像领域的狭隘适用性方面存在限制,这限制了它们的整体处理能力。此外,LLM 和医学影像学的写作风格不匹配,削弱了它们的实际实用性。为了解决这些问题,我们提出了 ChatCAD+,这是一个交互式CAD系统,具有普遍、可靠和能够处理来自不同领域 medical 图像的能力。ChatCAD+ 利用知名医学网站获取的最新信息提供准确的医疗建议。此外,它还集成了一个模板检索系统,模拟真实的诊断报告,从而改进了将其无缝融入现有的临床工作流程。源代码可在 \href{this https URL}{GitHub} 获取。在线演示将很快发布。

URL

https://arxiv.org/abs/2305.15964

PDF

https://arxiv.org/pdf/2305.15964.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot