Kaiming He Facebook AI Research (FAIR), Georgia Gkioxari Facebook AI Research (FAIR), Piotr Dollár Facebook AI Research (FAIR), Ross Girshick Facebook AI Research (FAIR), Kaiming He Facebook AI Research (FAIR), Georgia Gkioxari Facebook AI Research (FAIR), Piotr Dollár Facebook AI Research (FAIR), Ross Girshick Facebook AI Research (FAIR) (2017)
The paper introduces Mask R-CNN, a framework for object instance segmentation that enhances Faster R-CNN by adding a parallel mask prediction branch, allowing for high-quality segmentation of objects in images while retaining speed and flexibility. The authors achieve state-of-the-art results across various benchmarks, particularly on the COCO dataset, demonstrating the framework's effectiveness in instance segmentation, object detection, and human pose estimation. The paper emphasizes the importance of precise pixel alignments in segmentation tasks and introduces the RoIAlign layer to improve accuracy by addressing issues from previous RoIPool methods. Detailed comparisons with existing models show significant improvements in performance, showcasing Mask R-CNN's simplicity, high speed, and generalizability to other tasks. The code for the model will be made publicly available to facilitate further research.
This paper employs the following methods:
The following datasets were used in this research:
The authors identified the following limitations: