A Hierarchical Reinforced Sequence Operation Method for Unsupervised Text Style Transfer

Abstract
Abstract (translated)
URL
PDF

Abstract

Unsupervised text style transfer aims to alter text styles while preserving the content, without aligned data for supervision. Existing seq2seq methods face three challenges: 1) the transfer is weakly interpretable, 2) generated outputs struggle in content preservation, and 3) the trade-off between content and style is intractable. To address these challenges, we propose a hierarchical reinforced sequence operation method, named Point-Then-Operate (PTO), which consists of a high-level agent that proposes operation positions and a low-level agent that alters the sentence. We provide comprehensive training objectives to control the fluency, style, and content of the outputs and a mask-based inference algorithm that allows for multi-step revision based on the single-step trained agents. Experimental results on two text style transfer datasets show that our method significantly outperforms recent methods and effectively addresses the aforementioned challenges.

Abstract (translated)

无监督的文本样式转换旨在在保留内容的同时更改文本样式，而无需对数据进行监督。现有的seq2seq方法面临三个挑战：1）传递的解释能力较弱；2）生成的输出在内容保存方面存在困难；3）内容和风格之间的权衡难以解决。为了解决这些挑战，我们提出了一种分层增强的序列操作方法，即点然后操作（PTO），它由一个提出操作位置的高级代理和一个改变句子的低级代理组成。我们提供了全面的培训目标，以控制输出的流畅性、风格和内容，以及一个基于掩码的推理算法，允许基于单步训练的代理进行多步修订。对两个文本式传输数据集的实验结果表明，我们的方法明显优于最近的方法，并有效地解决了上述挑战。

URL

https://arxiv.org/abs/1906.01833

PDF

https://arxiv.org/pdf/1906.01833.pdf