Olga Russakovsky, Jia Deng, · Hao, Su, Jonathan Krause, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya Khosla, Michael Bernstein, Alexander C Berg, · Li, Fei-Fei, O Russakovsky, J Deng, H Su, J Krause, S Satheesh, S Ma, Z Huang, A Karpathy, A Khosla, M Bernstein, A C Berg, L Fei-Fei, Stanford University StanfordCAUSA, University of Michigan Ann ArborMIUSA, Stanford University StanfordCAUSA, Stanford University StanfordCAUSA, Stanford University StanfordCAUSA, Stanford University StanfordCAUSA, Stanford University StanfordCAUSA, Stanford University StanfordCAUSA, Massachusetts Institute of Technology CambridgeMAUSA, Stanford University StanfordCAUSA, UNC Chapel Hill Chapel HillNCUSA, Stanford University StanfordCAUSA (2014)
This paper presents the ImageNet Large Scale Visual Recognition Challenge (ILSVRC), a benchmark for object classification and detection involving hundreds of categories and millions of images. It discusses the dataset's creation, the challenges encountered in collecting and annotating large-scale data, and innovations in object recognition that emerged from this challenge. The authors analyze progress made over the years, how state-of-the-art algorithms now compare to human accuracy, and propose future directions for research in large-scale image recognition. Key goals include detailing the dataset's annotation process, highlighting key achievements, and assessing the current state of categorical object recognition.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: