Paper Reading AI Learner

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors

2024-05-02 02:38:32
Shenghe Zheng, Hongzhi Wang, Xianglong Liu

Abstract

Graph Neural Networks (GNNs) demonstrate excellent performance on graphs, with their core idea about aggregating neighborhood information and learning from labels. However, the prevailing challenges in most graph datasets are twofold of Insufficient High-Quality Labels and Lack of Neighborhoods, resulting in weak GNNs. Existing data augmentation methods designed to address these two issues often tackle only one. They may either require extensive training of generators, rely on overly simplistic strategies, or demand substantial prior knowledge, leading to suboptimal generalization abilities. To simultaneously address both of these two challenges, we propose an elegant method called IntraMix. IntraMix innovatively employs Mixup among low-quality labeled data of the same class, generating high-quality labeled data at minimal cost. Additionally, it establishes neighborhoods for the generated data by connecting them with data from the same class with high confidence, thereby enriching the neighborhoods of graphs. IntraMix efficiently tackles both challenges faced by graphs and challenges the prior notion of the limited effectiveness of Mixup in node classification. IntraMix serves as a universal framework that can be readily applied to all GNNs. Extensive experiments demonstrate the effectiveness of IntraMix across various GNNs and datasets.

Abstract (translated)

图神经网络(GNNs)在图中表现出色,其核心思想是聚合邻近信息并从标签中学习。然而,大多数图数据集的普遍挑战是缺乏高质量标签和缺乏邻近关系,导致低效的GNNs。为解决这两个问题,现有数据增强方法通常只解决一个问题。它们可能需要对生成器进行广泛的训练,依赖于过于简单的策略,或者需要相当多的先验知识,导致泛化能力下降。为了同时解决这两个问题,我们提出了一个优雅的方法,称为IntraMix。IntraMix通过在同一类低质量标记数据中使用Mixup创新地解决了这个问题,生成高质量标记数据且成本较低。此外,它通过连接生成的数据和同一类数据建立邻域,从而丰富图的邻域。IntraMix有效地解决了图面临的问题,并挑战了在节点分类中Mixup效果有限的概念。IntraMix成为了一个通用的框架,可以轻松应用于所有GNN。大量实验证明IntraMix在各种GNN和数据集上的有效性。

URL

https://arxiv.org/abs/2405.00957

PDF

https://arxiv.org/pdf/2405.00957.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot