Show HN: Cua-Bench – a benchmark for AI agents in GUI environments

(github.com)

23 points | by someguy101010  2 days ago

2 comments