Real-World Website Navigation with Multi-Turn
WebLINX is a large-scale benchmark of 100K interactions across 2300 expert demonstrations of conversational web navigation. It covers a broad range of patterns on over 150 real-world websites and can be used to train and evaluate agents in diverse scenarios.
Variants: WebLINX
This dataset is used in 1 benchmark:
Recent papers with results on this dataset: