UT Austin researchers present PUTNAMBENCH: a comprehensive AI benchmark that evaluates neural theorem provers on Putnam competition math problems
Automating mathematical reasoning has been a long-standing goal of artificial intelligence, and formal frameworks such as Lean 4, Isabelle, and ...