Nubank: From LLMs to Financial Inclusion: Efficient LLM Training and Scaling AI Agents for 131 Million Lives
Abstract
SOP-Bench serves as a research enabler for systematically investigating agent architectures, model capabilities, and deployment considerations across diverse procedural tasks. In this talk, we discuss the challenges of real-world SOPs and a framework for evaluating them.