Paper Reading AI Learner

Segmentation-Aware Generative Reinforcement Network for Tissue Layer Segmentation in 3-D Ultrasound Images for Chronic Low-back Pain Assessment

2025-01-29 14:58:48
Zixue Zeng, Xiaoyan Zhao, Matthew Cartier, Tong Yu, Jing Wang, Xin Meng, Zhiyu Sheng, Maryam Satarpour, John M Cormack, Allison Bean, Ryan Nussbaum, Maya Maurer, Emily Landis-Walkenhorst, Dinesh Kumbhare, Kang Kim, Ajay Wasan, Jiantao Pu

Abstract

We introduce a novel segmentation-aware joint training framework called generative reinforcement network (GRN) that integrates segmentation loss feedback to optimize both image generation and segmentation performance in a single stage. An image enhancement technique called segmentation-guided enhancement (SGE) is also developed, where the generator produces images tailored specifically for the segmentation model. Two variants of GRN were also developed, including GRN for sample-efficient learning (GRN-SEL) and GRN for semi-supervised learning (GRN-SSL). GRN's performance was evaluated using a dataset of 69 fully annotated 3D ultrasound scans from 29 subjects. The annotations included six anatomical structures: dermis, superficial fat, superficial fascial membrane (SFM), deep fat, deep fascial membrane (DFM), and muscle. Our results show that GRN-SEL with SGE reduces labeling efforts by up to 70% while achieving a 1.98% improvement in the Dice Similarity Coefficient (DSC) compared to models trained on fully labeled datasets. GRN-SEL alone reduces labeling efforts by 60%, GRN-SSL with SGE decreases labeling requirements by 70%, and GRN-SSL alone by 60%, all while maintaining performance comparable to fully supervised models. These findings suggest the effectiveness of the GRN framework in optimizing segmentation performance with significantly less labeled data, offering a scalable and efficient solution for ultrasound image analysis and reducing the burdens associated with data annotation.

Abstract (translated)

我们介绍了一种称为生成强化网络(GRN)的新型分段感知联合训练框架,该框架将分割损失反馈集成到单个阶段中,以同时优化图像生成和分割性能。还开发了一种名为分段引导增强(SGE)的图像增强技术,在此技术中,生成器会针对特定的分割模型产生专门定制的图像。此外,还开发了GRN的两种变体:用于样本高效学习的GRN (GRN-SEL) 和用于半监督学习的GRN (GRN-SSL)。 GRN 的性能使用来自29名受试者的69份完整注释3D超声扫描数据集进行了评估,这些注释包括六个解剖结构:真皮、浅层脂肪、浅层筋膜膜(SFM)、深层脂肪、深层筋膜膜(DFM)和肌肉。我们的研究结果表明,在与在完全标注的数据集上训练的模型相比,带有SGE的GRN-SEL能够将标记工作量减少高达70%,同时使Dice相似性系数(DSC)提高了1.98%。GRN-SEL本身可以减少60%的标签努力,而带有SGE的GRN-SSL减少了70%的标注需求,单独的GRN-SSL则减少了60%的需求,所有这些情况均保持了与完全监督模型相当的表现水平。 这些发现表明,GRN框架在使用显著较少的标记数据优化分割性能方面非常有效,并为超声图像分析提供了可扩展且高效的解决方案,同时减轻了数据标注相关的负担。

URL

https://arxiv.org/abs/2501.17690

PDF

https://arxiv.org/pdf/2501.17690.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot