Paper Reading AI Learner

R&D-Agent-Quant: A Multi-Agent Framework for Data-Centric Factors and Model Joint Optimization

2025-05-21 06:20:56
Yuante Li, Xu Yang, Xiao Yang, Minrui Xu, Xisen Wang, Weiqing Liu, Jiang Bian

Abstract

Financial markets pose fundamental challenges for asset return prediction due to their high dimensionality, non-stationarity, and persistent volatility. Despite advances in large language models and multi-agent systems, current quantitative research pipelines suffer from limited automation, weak interpretability, and fragmented coordination across key components such as factor mining and model innovation. In this paper, we propose R&D-Agent for Quantitative Finance, in short RD-Agent(Q), the first data-centric multi-agent framework designed to automate the full-stack research and development of quantitative strategies via coordinated factor-model co-optimization. RD-Agent(Q) decomposes the quant process into two iterative stages: a Research stage that dynamically sets goal-aligned prompts, formulates hypotheses based on domain priors, and maps them to concrete tasks, and a Development stage that employs a code-generation agent, Co-STEER, to implement task-specific code, which is then executed in real-market backtests. The two stages are connected through a feedback stage that thoroughly evaluates experimental outcomes and informs subsequent iterations, with a multi-armed bandit scheduler for adaptive direction selection. Empirically, RD-Agent(Q) achieves up to 2X higher annualized returns than classical factor libraries using 70% fewer factors, and outperforms state-of-the-art deep time-series models on real markets. Its joint factor-model optimization delivers a strong balance between predictive accuracy and strategy robustness. Our code is available at: this https URL.

Abstract (translated)

金融市场在资产回报预测方面提出了根本性的挑战,这些挑战源于市场的高维度、非平稳性和持续的波动性。尽管大型语言模型和多代理系统有所进步,但目前的数量化研究流程仍然存在自动化程度有限、解释能力弱以及关键组成部分(如因子挖掘和模型创新)之间的协调碎片化等问题。在这篇论文中,我们提出了“定量金融研发代理”(R&D-Agent for Quantitative Finance),简称RD-Agent(Q),这是首个以数据为中心的多代理框架,旨在通过协同优化因子-模型来自动完成数量化策略的全流程研究与开发。 RD-Agent(Q)将量化过程分解为两个迭代阶段:**研究阶段(Research stage)**,该阶段动态地设置目标对齐提示、基于领域先验构建假设并将其映射到具体任务;以及 **开发阶段(Development stage)**,这一阶段利用代码生成代理Co-STEER来实现特定的任务代码,并在实际市场回测中执行这些代码。两个阶段通过一个反馈阶段连接起来,在这个阶段里对实验结果进行全面评估,并为后续迭代提供信息,同时使用多臂赌博机调度器进行适应性方向选择。 从经验上讲,RD-Agent(Q)实现了比经典因子库高出2倍的年化回报率,且只用了70%的因素数量。此外,它在实际市场上超过了现有的最先进的深度时间序列模型性能。其联合优化因子-模型的方法能够提供预测准确性和策略稳健性之间的良好平衡。 我们的代码可以在以下链接找到:[此URL](this https URL)。

URL

https://arxiv.org/abs/2505.15155

PDF

https://arxiv.org/pdf/2505.15155.pdf


Tags
3D Action Action_Localization Action_Recognition Activity Adversarial Agent Attention Autonomous Bert Boundary_Detection Caption Chat Classification CNN Compressive_Sensing Contour Contrastive_Learning Deep_Learning Denoising Detection Dialog Diffusion Drone Dynamic_Memory_Network Edge_Detection Embedding Embodied Emotion Enhancement Face Face_Detection Face_Recognition Facial_Landmark Few-Shot Gait_Recognition GAN Gaze_Estimation Gesture Gradient_Descent Handwriting Human_Parsing Image_Caption Image_Classification Image_Compression Image_Enhancement Image_Generation Image_Matting Image_Retrieval Inference Inpainting Intelligent_Chip Knowledge Knowledge_Graph Language_Model LLM Matching Medical Memory_Networks Multi_Modal Multi_Task NAS NMT Object_Detection Object_Tracking OCR Ontology Optical_Character Optical_Flow Optimization Person_Re-identification Point_Cloud Portrait_Generation Pose Pose_Estimation Prediction QA Quantitative Quantitative_Finance Quantization Re-identification Recognition Recommendation Reconstruction Regularization Reinforcement_Learning Relation Relation_Extraction Represenation Represenation_Learning Restoration Review RNN Robot Salient Scene_Classification Scene_Generation Scene_Parsing Scene_Text Segmentation Self-Supervised Semantic_Instance_Segmentation Semantic_Segmentation Semi_Global Semi_Supervised Sence_graph Sentiment Sentiment_Classification Sketch SLAM Sparse Speech Speech_Recognition Style_Transfer Summarization Super_Resolution Surveillance Survey Text_Classification Text_Generation Time_Series Tracking Transfer_Learning Transformer Unsupervised Video_Caption Video_Classification Video_Indexing Video_Prediction Video_Retrieval Visual_Relation VQA Weakly_Supervised Zero-Shot