Abstract
We introduce tulip agent, an architecture for autonomous LLM-based agents with Create, Read, Update, and Delete access to a tool library containing a potentially large number of tools. In contrast to state-of-the-art implementations, tulip agent does not encode the descriptions of all available tools in the system prompt, which counts against the model's context window, or embed the entire prompt for retrieving suitable tools. Instead, the tulip agent can recursively search for suitable tools in its extensible tool library, implemented exemplarily as a vector store. The tulip agent architecture significantly reduces inference costs, allows using even large tool libraries, and enables the agent to adapt and extend its set of tools. We evaluate the architecture with several ablation studies in a mathematics context and demonstrate its generalizability with an application to robotics. A reference implementation and the benchmark are available at this http URL.
Abstract (translated)
我们提出了tulip代理,一种基于自主LLM代理的建筑,具有对包含可能大量工具的工具库的创建、读取、更新和删除访问。与最先进的实现相比,tulip代理没有在系统提示中编码所有可用工具的描述,这抵消了模型的上下文窗口,或者将整个提示嵌入其中以检索合适的工具。相反,tulip代理可以在其可扩展的工具库中递归地搜索合适的工具,实现例程作为一个向量存储器。tulip代理架构显著减少了推理成本,允许使用大型工具库,并使代理能够适应和扩展其工具集合。我们在数学上下文中进行多个抽象研究,并通过将应用程序到机器人领域来评估该架构。参考实现和基准可以在该http URL上找到。
URL
https://arxiv.org/abs/2407.21778