Abstract
Prompt optimization aims to systematically refine prompts to enhance a language model's performance on specific tasks. Fairness detection in Terms of Service (ToS) clauses is a challenging legal NLP task that demands carefully crafted prompts to ensure reliable results. However, existing prompt optimization methods are often computationally expensive due to inefficient search strategies and costly prompt candidate scoring. In this paper, we propose a framework that combines Monte Carlo Tree Search (MCTS) with a proxy prompt evaluator to more effectively explore the prompt space while reducing evaluation costs. Experiments demonstrate that our approach achieves higher classification accuracy and efficiency than baseline methods under a constrained computation budget.
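To make the idea of pairing MCTS with a cheap proxy evaluator concrete, the following is a minimal sketch and not the paper's implementation. The abstract does not specify the action space, the evaluator, or the budget. The edit list `EDITS`, the toy `proxy_score` function, and the parameters `iterations`, `max_depth`, and `c` are therefore all hypothetical stand-ins chosen only for illustration.

```python
import math
import random

# Hypothetical prompt edits; the paper's real action space is not given in the abstract.
EDITS = [
    " Answer 'fair' or 'unfair'.",
    " Consider consumer protection law.",
    " Explain your reasoning briefly.",
]


def proxy_score(prompt: str) -> float:
    """Toy stand-in for a cheap proxy evaluator (assumption): the fraction of
    known edits present in the prompt. A real proxy would estimate task accuracy."""
    return sum(edit in prompt for edit in EDITS) / len(EDITS)


class Node:
    def __init__(self, prompt, parent=None):
        self.prompt = prompt
        self.parent = parent
        self.children = []
        self.visits = 0
        self.value = 0.0  # sum of rewards backed up through this node
        self.untried = [e for e in EDITS if e not in prompt]

    def ucb_child(self, c=1.4):
        # UCB1: mean reward plus an exploration bonus for rarely visited children.
        return max(
            self.children,
            key=lambda ch: ch.value / ch.visits
            + c * math.sqrt(math.log(self.visits) / ch.visits),
        )


def mcts(root_prompt, iterations=50, max_depth=3, seed=0):
    rng = random.Random(seed)
    root = Node(root_prompt)
    best_prompt, best_score = root_prompt, proxy_score(root_prompt)
    for _ in range(iterations):
        node, depth = root, 0
        # Selection: descend through fully expanded nodes using UCB1.
        while not node.untried and node.children and depth < max_depth:
            node = node.ucb_child()
            depth += 1
        # Expansion: apply one untried edit to create a new candidate prompt.
        if node.untried and depth < max_depth:
            edit = node.untried.pop(rng.randrange(len(node.untried)))
            child = Node(node.prompt + edit, parent=node)
            node.children.append(child)
            node = child
        # Evaluation: score with the cheap proxy instead of a full model run.
        reward = proxy_score(node.prompt)
        if reward > best_score:
            best_prompt, best_score = node.prompt, reward
        # Backpropagation: update statistics along the path to the root.
        while node is not None:
            node.visits += 1
            node.value += reward
            node = node.parent
    return best_prompt, best_score


root_prompt = "Classify the following Terms of Service clause."
best_prompt, best_score = mcts(root_prompt)
```

In this sketch the fixed `iterations` count plays the role of the constrained computation budget: each iteration costs one proxy evaluation, so the search spends its budget on the branches that the proxy rates most promising.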
URL
https://arxiv.org/abs/2510.08524