Reasoning As Gradient

Published:

  • Reasoning as Gradient: Scaling MLE Agents Beyond Tree Search
    Yifei Zhang, Xu Yang, Xiao Yang, Bowen Xian, Qizheng Li, Shikai Fang, Jingyuan Li, Jian Wang, Minrui Xu, Yuge Zhang, Weiqing Liu, Jiang Bian
    Findings of the Association for Computational Linguistics (ACL 2026 Findings)
    Paper | Code GitHub Stars