bAbI: A Test of Commonsense Ability
The bAbI benchmark is a challenging set of tasks designed to evaluate how well AI systems reason with commonsense knowledge. It covers a wide range of cases that require reasoning about everyday concepts. By evaluating how well AI models can solve these problems, researchers hope t