Paper Reading AI Learner

Extrapolative Controlled Sequence Generation via Iterative Refinement

2023-03-08 13:21:27
Vishakh Padmakumar, Richard Yuanzhe Pang, He He, Ankur P. Parikh

Abstract

We study the problem of extrapolative controlled generation, i.e., generating sequences with attribute values beyond the range seen in training. This task is of significant importance in automated design, especially drug discovery, where the goal is to design novel proteins that are \textit{better} (e.g., more stable) than existing sequences. Thus, by definition, the target sequences and their attribute values are out of the training distribution, posing challenges to existing methods that aim to directly generate the target sequence. Instead, in this work, we propose Iterative Controlled Extrapolation (ICE) which iteratively makes local edits to a sequence to enable extrapolation. We train the model on synthetically generated sequence pairs that demonstrate small improvement in the attribute value. Results on one natural language task (sentiment analysis) and two protein engineering tasks (ACE2 stability and AAV fitness) show that ICE considerably outperforms state-of-the-art approaches despite its simplicity. Our code and models are available at: this https URL.

Abstract (translated)

我们研究的是扩展控制生成问题,也就是在训练范围内生成属性值超出范围序列的问题。这在自动化设计特别是在药物发现中非常重要,因为的目标是设计比现有序列更好的新蛋白质(例如,更稳定的),因此,根据定义,目标序列和其属性值超出了训练分布的范围,给试图直接生成目标序列的方法带来了挑战。相反,在本文中,我们提出了迭代控制的扩展生成(ICE),该方法迭代地对序列进行局部编辑,以进行扩展。我们训练了合成生成的序列对,这些序列证明了属性值微小的改进。在一个自然语言任务(情感分析)和两个蛋白质工程任务(ACE2稳定性和AAV fitness)中的结果表明,ICE显著优于现有方法,尽管其简单性。我们的代码和模型可在以下httpsURL获取:

URL

https://arxiv.org/abs/2303.04562

PDF

https://arxiv.org/pdf/2303.04562.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot