ACM CAIS 2026 Workshop

RLEval: Methods and Reinforcement Learning Environments for Evaluating AI Agents

We invite submissions of papers on AI agent evaluation: methods, RL environment design, benchmarks, and real-world case studies.

Workshop Focus

Trillions are being invested in LLM-powered AI Agents, but many open questions remain regarding how to effectively evaluate them.

Held at the ACM Conference on AI and Agentic Systems, this workshop provides the first-ever research venue for such questions:

Agenda

Invited Speakers

Accepted Papers

Organizers

Additional Reviewers

Shikhar Gupta, Arjun Chakraborty, Nikhil Reddy Pallepati, Sivakumar Selvaraj, Meher Gitika Karumuri, Thomas Brink, Joseph Axisa, Kalyani Limaye

Conference Venue

This workshop is co-located with the ACM Conference on AI and Agentic Systems (ACM CAIS 2026), held from May 26–29, 2026 in San Jose, California.

Address: DoubleTree by Hilton San Jose, 2050 Gateway Place, San Jose, CA 95110

Rooms: San Juan for the general workshop presentations, Carmel/Monterey for afternoon poster presentations

Call for Papers

We invite submissions of 4-page short papers on new agent evaluation methods, RL environment design, agentic benchmarks, and real-world case studies. Work-in-progress submissions are encouraged!

✨ Topics of Interest

  • Agent evaluation methods, in particular interventional and causal/counterfactual techniques.
  • RL environments: design principles, software frameworks, synthetic data, tool design.
  • Automated graders: LLM-as-a-judge, verifiers, rubrics, reward hacking, human feedback.
  • Benchmarks: new benchmarks, analyses of existing benchmarks.
  • Enterprise agent case studies: production evaluation and deployment lessons.
  • Considerations in the above topics for Agents with particular capabilities: code execution, computer use, multimodal I/O, NL2SQL, skills, memory, web search.

📝 Submission Details

  • Paper length: 4 pages main text; additional pages allowed for references and appendix
  • Formatting: ACM acmart/sigconf template
  • Review process: Single-blind (no anonymization required)
  • Visibility: Reviews and paper decisions will not be made public
  • Workshop format: interactive poster session + selected Contributed Talks + Best Paper/Poster Award
  • Policy: Under-review papers elsewhere are allowed; already-published papers are not
  • At least one author of each accepted paper must register and attend

Key Dates

Submission deadline (AoE) May 20, 2026
Accept/Reject notification May 22, 2026
Camera-ready deadline May 25, 2026
Workshop day at ACM CAIS 2026 May 26, 2026

FAQ

Will accepted papers be archival?

Accepted papers are not archival.

Can previously published work be submitted?

Submissions under review elsewhere are allowed, but already published papers are not.

Who should attend?

Researchers and practitioners working on agentic evaluation, RL environments, benchmarks, and enterprise deployments.