Compositional GSM: A New AI Benchmark for Assessing the Reasoning Capabilities of Large Language Models in Multi-Step Problems
Natural language processing (NLP) has seen rapid advances, and large language models (LLMs) are used to address various challenging problems. ...