Paper Reading AI Learner

Training custom modality-specific U-Net models with weak localizations for improved Tuberculosis segmentation and localization

2021-02-21 14:03:49
Sivaramakrishnan Rajaraman, Les Folio, Jane Dimperio, Philip Alderson, Sameer Antani

Abstract

UNet segmentation models have demonstrated superior performance compared to conventional handcrafted features. Modality specific DL models are better at transferring domain knowledge to a relevant target task than those that are pretrained on stock photography images. Using them helps improve model adaptation, generalization, and class-specific region of interest localization. In this study, we train custom chest X ray modality specific UNet models for semantic segmentation of Tuberculosis (TB) consistent findings. Automated segmentation of such manifestations could help radiologists reduce errors following initial interpretation and before finalizing the report. This could improve radiologist accuracy by supplementing decision making while improving patient care and productivity. Our approach uses a comprehensive strategy that first uses publicly available chest X ray datasets with weak TB annotations, typically provided as bounding boxes, to train a set of UNet models. Next, we improve the results of the best performing model using an augmented training strategy on data with weak localizations from the outputs of a selection of DL classifiers that are trained to produce a binary decision ROI mask for suspected TB manifestations. The augmentation aims to improve performance with test data derived from the same training distribution and other cross institutional collections. We observe that compared to non augmented training our augmented training strategy helped the custom modality specific UNet models achieve superior performance with test data that is both similar to the training distribution as well as for cross institutional test sets.

Abstract (translated)

URL

https://arxiv.org/abs/2102.10607

PDF

https://arxiv.org/pdf/2102.10607.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot