Paper Reading AI Learner

RoNID: New Intent Discovery with Generated-Reliable Labels and Cluster-friendly Representations

2024-04-13 11:58:28
Shun Zhang, Chaoran Yan, Jian Yang, Changyu Ren, Jiaqi Bai, Tongliang Li, Zhoujun Li

Abstract

New Intent Discovery (NID) strives to identify known and reasonably deduce novel intent groups in the open-world scenario. But current methods face issues with inaccurate pseudo-labels and poor representation learning, creating a negative feedback loop that degrades overall model performance, including accuracy and the adjusted rand index. To address the aforementioned challenges, we propose a Robust New Intent Discovery (RoNID) framework optimized by an EM-style method, which focuses on constructing reliable pseudo-labels and obtaining cluster-friendly discriminative representations. RoNID comprises two main modules: reliable pseudo-label generation module and cluster-friendly representation learning module. Specifically, the pseudo-label generation module assigns reliable synthetic labels by solving an optimal transport problem in the E-step, which effectively provides high-quality supervised signals for the input of the cluster-friendly representation learning module. To learn cluster-friendly representation with strong intra-cluster compactness and large inter-cluster separation, the representation learning module combines intra-cluster and inter-cluster contrastive learning in the M-step to feed more discriminative features into the generation module. RoNID can be performed iteratively to ultimately yield a robust model with reliable pseudo-labels and cluster-friendly representations. Experimental results on multiple benchmarks demonstrate our method brings substantial improvements over previous state-of-the-art methods by a large margin of +1~+4 points.

Abstract (translated)

新意图发现(NID)旨在在开放世界场景中识别已知的和合理的推断出新的意图组。然而,目前的 methods 面临准确伪标签和差的学习表示的挑战,导致负反馈循环,降低整体模型性能,包括准确性和调整的均方差。为解决上述挑战,我们提出了一个通过EM风格方法优化的鲁棒新意图发现(RoNID)框架。该框架关注于构建可靠的伪标签和获得聚类友好的表示。RoNID 由两个主要模块组成:可靠伪标签生成模块和聚类友好表示学习模块。具体来说,伪标签生成模块通过求解优化传输问题在 E 步骤,从而有效地为聚类友好表示学习模块提供高质量的超监督信号。为了通过强烈的聚类内紧凑性和大的跨聚类分离学习聚类友好表示,表示学习模块在 M 步骤结合了聚类内和跨聚类对比学习,将更多有区别的特征提供给生成模块。RoNID 可以递归执行,最终得到一个具有可靠伪标签和聚类友好表示的稳健模型。在多个基准测试上进行的实验结果表明,我们的方法在很大程度上超过了以前的最先进方法。

URL

https://arxiv.org/abs/2404.08977

PDF

https://arxiv.org/pdf/2404.08977.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot