Paper Reading AI Learner

UWAFA-GAN: Ultra-Wide-Angle Fluorescein Angiography Transformation via Multi-scale Generation and Registration Enhancement

2024-05-01 14:27:43
Ruiquan Ge, Zhaojie Fang, Pengxue Wei, Zhanghao Chen, Hongyang Jiang, Ahmed Elazab, Wangting Li, Xiang Wan, Shaochong Zhang, Changmiao Wang

Abstract

Fundus photography, in combination with the ultra-wide-angle fundus (UWF) techniques, becomes an indispensable diagnostic tool in clinical settings by offering a more comprehensive view of the retina. Nonetheless, UWF fluorescein angiography (UWF-FA) necessitates the administration of a fluorescent dye via injection into the patient's hand or elbow unlike UWF scanning laser ophthalmoscopy (UWF-SLO). To mitigate potential adverse effects associated with injections, researchers have proposed the development of cross-modality medical image generation algorithms capable of converting UWF-SLO images into their UWF-FA counterparts. Current image generation techniques applied to fundus photography encounter difficulties in producing high-resolution retinal images, particularly in capturing minute vascular lesions. To address these issues, we introduce a novel conditional generative adversarial network (UWAFA-GAN) to synthesize UWF-FA from UWF-SLO. This approach employs multi-scale generators and an attention transmit module to efficiently extract both global structures and local lesions. Additionally, to counteract the image blurriness issue that arises from training with misaligned data, a registration module is integrated within this framework. Our method performs non-trivially on inception scores and details generation. Clinical user studies further indicate that the UWF-FA images generated by UWAFA-GAN are clinically comparable to authentic images in terms of diagnostic reliability. Empirical evaluations on our proprietary UWF image datasets elucidate that UWAFA-GAN outperforms extant methodologies. The code is accessible at this https URL.

Abstract (translated)

fundus摄影与超广角 fundus(UWF)技术相结合,在临床实践中成为一项不可或缺的诊断工具,因为它能提供对视网膜更全面的观察。然而,UWF 荧光血管造影(UWF-FA)需要通过注射荧光染料到患者手中或肘部来实施,而 UWF 扫描激光视网膜检查(UWF-SLO)不需要这样做。为了减轻注射可能带来的不良反应,研究人员提出了开发能够将 UWF-SLO 图像转换为 UWF-FA 图像的跨模态医疗图像生成算法。目前应用于 fundus 摄影的图像生成技术在生成高分辨率视网膜图像方面遇到困难,特别是在捕捉细微血管病变方面。为了应对这些问题,我们引入了一种名为 UWAFA-GAN 的条件生成对抗网络(GAN)用于从 UWF-SLO 合成 UWF-FA。这种方法采用多尺度生成器和关注传输模块来有效地提取全局结构和局部病变。此外,为了对抗训练数据不对齐导致的图像模糊问题,我们在该框架中引入了注册模块。我们的方法在 inception 分数和详细信息生成方面非同寻常。通过对我们专有 UWF 图像数据集的临床用户研究,证实了 UWAFA-GAN 生成的 UWF-FA 图像在诊断可靠性方面与真实图像相当。我们专有 UWF 图像数据集的实证评估证实了 UWAFA-GAN 优于现有方法。代码可在此链接访问:https://www.example.com/

URL

https://arxiv.org/abs/2405.00542

PDF

https://arxiv.org/pdf/2405.00542.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot