Paper Reading AI Learner

Geometry-Aware Latent Representation Learning for Modeling Disease Progression of Barrett's Esophagus

2023-03-17 14:08:16
Vivien van Veldhuizen

Abstract

Barrett's Esophagus (BE) is the only precursor known to Esophageal Adenocarcinoma (EAC), a type of esophageal cancer with poor prognosis upon diagnosis. Therefore, diagnosing BE is crucial in preventing and treating esophageal cancer. While supervised machine learning supports BE diagnosis, high interobserver variability in histopathological training data limits these methods. Unsupervised representation learning via Variational Autoencoders (VAEs) shows promise, as they map input data to a lower-dimensional manifold with only useful features, characterizing BE progression for improved downstream tasks and insights. However, the VAE's Euclidean latent space distorts point relationships, hindering disease progression modeling. Geometric VAEs provide additional geometric structure to the latent space, with RHVAE assuming a Riemannian manifold and $\mathcal{S}$-VAE a hyperspherical manifold. Our study shows that $\mathcal{S}$-VAE outperforms vanilla VAE with better reconstruction losses, representation classification accuracies, and higher-quality generated images and interpolations in lower-dimensional settings. By disentangling rotation information from the latent space, we improve results further using a group-based architecture. Additionally, we take initial steps towards $\mathcal{S}$-AE, a novel autoencoder model generating qualitative images without a variational framework, but retaining benefits of autoencoders such as stability and reconstruction quality.

Abstract (translated)

Barrett's esophagus (BE) 是已知柠檬汁癌(EAC)的已知前驱体,而这种癌症在诊断时具有较差的预后。因此,诊断BE在预防和治愈柠檬汁癌方面至关重要。虽然监督机器学习支持BE诊断,但病理学训练数据的高互现性限制了这些方法。无监督的变分自编码器(VAE)的表现令人鼓舞,因为它们将输入数据映射到一个只有有用特征的低维多态,以 characterized BE 的进展,并提高后续任务和洞察力。然而,VAE 的欧几里得 latent space 扭曲了点关系,妨碍了疾病进展建模。几何VAE为 latent space 提供了额外的几何结构,其中 RHVAE 假设一个黎曼多态,而 $\mathcal{S}$-VAE 是一个高斯多态。我们的研究表明,$\mathcal{S}$-VAE 在低维环境中比传统VAE有更好的重构损失、表示分类精度和更高质量的生成图像和插值。通过从 latent space 中分离旋转信息,我们使用群体架构进一步改进结果。此外,我们开始朝着 $\mathcal{S}$-AE 迈进,这是一种没有变分框架的新型自编码器模型,生成高质量的定性图像,但保留了自编码器的稳定性和重构质量。

URL

https://arxiv.org/abs/2303.12711

PDF

https://arxiv.org/pdf/2303.12711.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot