• Skip to primary navigation
  • Skip to content
  • Skip to footer
McGill NLP McGill NLP
  • People
  • Publications
  • Teaching
  • Reading Group
  • Join Us
    Xing Han Lu

    Xing Han Lu

    Working on retrieval and conversational QA

    • Website
    • Twitter
    • GitHub
    • Scholar

    AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories

    Xing Han Lù, Amirhossein Kazemnejad, Nicholas Meade, Arkil Patel, Dongchan Shin, Alejandra Zambrano, Karolina Stańczak, Peter Shaw, Christopher Pal, Siva Reddy


    Paper

    Abstract

    None

    Direct Link

    Tags:

    Categories: Publications

    Updated: April 11, 2025

    Twitter Facebook LinkedIn
    Previous Next
    • GitHub
    • Twitter
    • Updating the website
    © 2025 McGill NLP. Powered by Jekyll & Minimal Mistakes.