CAT-BENCH: assessing the understanding of temporal dependencies in procedural texts by linguistic models
Understanding how LLMs understand natural language plans, such as instructions and recipes, is crucial for their reliable use in decision-making ...