This model is an instance segmentation network for 80 classes of objects. It is a Cascade mask R-CNN with ResNet101 backbone and deformable convolutions, FPN, RPN, detection and segmentation heads.
|MS COCO val2017 box AP (max short side 800, max long side 1344)||45.8%|
|MS COCO val2017 mask AP (max short side 800, max long side 1344)||39.7%|
|MS COCO val2017 box AP (max height 800, max width 1344)||43.55%|
|MS COCO val2017 mask AP (max height 800, max width 1344)||38.14%|
|Max objects to detect||100|
Average Precision (AP) is defined and measured according to standard MS COCO evaluation procedure.
image, shape: [1x3x800x1344] - An input image in the format [1xCxHxW]. The expected channel order is BGR.
labels, shape:  - Contiguous integer class ID for every detected object.
boxes, shape: [100, 5] - Bounding boxes around every detected objects in (top_left_x, top_left_y, bottom_right_x, bottom_right_y) format and its confidence score in range [0, 1].
masks, shape: [100, 28, 28] - Segmentation heatmaps for every output bounding box.
[*] Other names and brands may be claimed as the property of others.