Show HN: A new benchmark for testing LLMs for deterministic outputs

(interfaze.ai)

35 points | by khurdula  4 hours ago

13 comments