Abstract
Spreadsheets are widely recognized as the most popular end-user programming tools, blending the power of formula-based computation with an intuitive table-based interface. Today, spreadsheets are used by billions of users to manipulate tables, most of whom are neither database experts nor professional programmers. Despite the success of spreadsheets, authoring complex formulas remains challenging, as non-technical users need to look up and understand non-trivial formula syntax. To address this pain point, we leverage the observation that there is often an abundance of similar-looking spreadsheets in the same organization, which not only have similar data but also share similar computation logic encoded as formulas. We develop an Auto-Formula system that can accurately predict the formula a user wants to author in a target spreadsheet cell by learning and adapting formulas that already exist in similar spreadsheets, using contrastive-learning techniques inspired by "similar-face recognition" from computer vision. Extensive evaluations on over 2K test formulas extracted from real enterprise spreadsheets show the effectiveness of Auto-Formula over alternatives. Our benchmark data is made available to facilitate future research.
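To make the idea of "finding a similar spreadsheet and adapting its formula" concrete, the following is a minimal sketch and not the Auto-Formula implementation. It assumes each sheet is already represented by an embedding vector (in the paper these would come from a contrastively trained model; here they are hand-written stand-ins), and the helper names `cosine`, `most_similar`, and `shift_rows` are hypothetical. The adaptation step only shifts relative A1-style row references, which is a deliberate simplification.

```python
import math
import re


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def most_similar(target_vec, candidates):
    """Return the name of the candidate sheet whose (assumed) embedding
    is closest to the target sheet's embedding."""
    return max(candidates, key=lambda name: cosine(target_vec, candidates[name]))


# Matches relative references such as B2 or AA10. Absolute references like
# $B$2 are left untouched. Caveat: function names ending in digits
# (e.g. LOG10) would also match, so this is illustrative only.
_REF = re.compile(r"\b([A-Z]{1,3})([0-9]+)\b")


def shift_rows(formula, row_offset):
    """Adapt a formula by moving every relative row reference by row_offset."""
    def repl(match):
        return f"{match.group(1)}{int(match.group(2)) + row_offset}"
    return _REF.sub(repl, formula)


# Hypothetical data: embeddings and one known formula per reference sheet.
reference_sheets = {
    "sales_2022": {"vec": [0.9, 0.1], "formula": "=SUM(B2:B10)"},
    "hr_roster": {"vec": [0.0, 1.0], "formula": "=COUNTA(A2:A50)"},
}

target_vec = [1.0, 0.0]
best = most_similar(target_vec, {k: v["vec"] for k, v in reference_sheets.items()})
# Assume the target table starts 3 rows lower than the reference table.
suggestion = shift_rows(reference_sheets[best]["formula"], 3)
print(best, suggestion)
```

In this toy setting the nearest reference sheet supplies the formula, and the row shift stands in for the more general step of mapping the reference formula's cell ranges onto the target sheet's layout.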
URL
https://arxiv.org/abs/2404.12608