A Benchmark for Robust Multi-Hop Spatial Reasoning in Texts
Variants: StepGame
This dataset is used in 1 benchmark:
Task | Model | Paper | Date |
---|---|---|---|
Question Answering | TP-MANN | StepGame: A New Benchmark for … | 2022-04-18 |
Recent papers with results on this dataset: