Allen Institute for AI Researchers proposes SUPER: a benchmark to assess LLMs' ability to set up and execute research experiments
artificial intelligence (ai) and machine learning (ML) have been transformative in numerous fields, but a significant challenge remains in reproducibility ...