0% found this document useful (0 votes)

28 views51 pages

Instance Segmentation

Uploaded by

Babil King

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

28 views51 pages

Instance Segmentation

Uploaded by

Babil King

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 51

Instance Segmentation

Riley Simmons-Edler, Berthy Feng

Instance Segmentation Task

● Label each foreground pixel with object

and instance
● Object detection + semantic
segmentation

Slide Credit: Kaiming He

In This Lecture...

● Microsoft COCO dataset

● Mask R-CNN (fully supervised)
● MaskX R-CNN (partially supervised)
Microsoft COCO:
Common Objects in Context
Tsung-Yi Lin, Michael Maire, Serge Belongie, et al.
“Microsoft COCO: Common Objects in Context.” arXiv,
2015.
Previous Datasets
● ImageNet: many object
categories
● PASCAL VOC: object
detection in natural images,
small number of classes
● SUN: labeling scene types and
commonly occurring objects,
but not many instances per
category
Image Credit: Tsung-Yi Lin et al.
Goal: Push research in scene understanding

1. Detecting non-iconic views

2. Contextual reasoning between objects
3. Precise 2D localization of objects
MS COCO Dataset
❖ 91 object
classes
❖ 328,000
images
❖ 2.5 million
labeled
instances

Image Credit: Tsung-Yi Lin et al.

Image Collection & Annotation
Object Categories

Image Credit: Tsung-Yi Lin et al.

Non-Iconic Image Collection

Image Credit: Tsung-Yi Lin et al.

Annotation

Image Credit: Tsung-Yi Lin et al.

Dataset Evaluation
Statistics

Image Credit: Tsung-Yi Lin et al.

Statistics

Image Credit: Tsung-Yi Lin et al.

COCO Detection Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Keypoint Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Stuff Challenge

Image Credit: Tsung-Yi Lin et al.

COCO Places Challenges

Image Credit: Tsung-Yi Lin et al.

Mask R-CNN
Kaiming He, Georgia Gkioxari, Piotr Dollar, and Ross
Girshick. “Mask R-CNN.” ICCV, 2017.
Faster R-CNN
Fast R-CNN

Image Credit: Shaoqing Ren et al. Image Credit: Tomasz Grel

Insight: Region Proposal and Detection Use
Same Features

Image Credit: Shaoqing Ren et al.

Faster R-CNN = RPN + Fast R-CNN
RPN = Fully Convolutional Network
Extending to Instance
Segmentation
Visual Perception Problems

Slide Credit: Kaiming He

Instance Segmentation Methods

Slide Credit: Kaiming He

Insight: Mask Prediction in Parallel

Slide Credit: Kaiming He

RoIPool

Image Credit: Tomasz Grel

RoIPool

Slide Credit: Kaiming He

RoIAlign

Slide Credit: Kaiming He

Mask R-CNN
Mask R-CNN Results
Examples

● Mask AP =
35.7

Image Credit: Kaiming He et al.

Comparisons

Image Credit: Kaiming He et al.

Comparisons

Image Credit: Kaiming He et al.

Application: Human Pose Estimation

Image Credit: Kaiming He et al.

Mask R-CNN Recap

● Add parallel mask prediction head to Faster-RCNN

● RoIAlign allows for precise localization
● Mask R-CNN improves on AP of previous state-of-the-art, can be
applied in human pose estimation
Learning to Segment Every Thing
Ronghang Hu, Piotr Dollar, Kaiming He, Trevor Darrell, and
Ross Girshick. “Learning to Segment Every Thing.” arXiv,
2017.
Partially Supervised Model
Motivation for a Partially Supervised Model

A = set of object B = set of object

categories with categories with only
complete mask bounding boxes (no
annotations segmentation
annotations)

How can we know C = A U B?

Image Credit: Ronghang Hu et al.

Transfer Learning

Image Credit: Ronghang Hu et al.

Weight Transfer Function

Image Credit: Ronghang Hu et al.

Training
● Train bounding box head using standard box detection losses on all
classes in A U B
● Train mask head, weight transfer function using mask loss on classes in A

Image Credit: Ronghang Hu et al.

Stage-Wise Training
1. Detection training ● Train detection once and then
2. Segmentation training fine-tune weight transfer function
● Inferior performance

Image Credit: Ronghang Hu et al.

End-to-End Joint Training

● Jointly train detection head and mask head end-to-end

● Want detection weights to stay constant between A and B

Image Credit: Ronghang Hu et al.

End-to-End Training Better

Image Credit: Ronghang Hu et al.

Mask Prediction
Baseline: Class-agonistic FCN mask prediction

Extension: FCN+MLP mask heads

Image Credit: Ronghang Hu et al.

Results
Examples

Image Credit: Ronghang Hu et al.

Comparisons

Image Credit: Ronghang Hu et al.

Segmenting Everything

Image Credit: Ronghang Hu et al.

Mask R-CNN
No ratings yet
Mask R-CNN
4 pages
1803 01534-PANet
No ratings yet
1803 01534-PANet
11 pages
Review: Deepmask (Instance Segmentation) : An Instance Segment Proposal Method Driven by Convolutional Neural Networks
No ratings yet
Review: Deepmask (Instance Segmentation) : An Instance Segment Proposal Method Driven by Convolutional Neural Networks
6 pages
Mask R-CNN: Object Instance Segmentation
No ratings yet
Mask R-CNN: Object Instance Segmentation
12 pages
He Mask R-CNN Iccv 2017 Paper
No ratings yet
He Mask R-CNN Iccv 2017 Paper
9 pages
He Mask R-CNN ICCV 2017 Paper PDF
No ratings yet
He Mask R-CNN ICCV 2017 Paper PDF
9 pages
AI-Powered Object Segmentation
No ratings yet
AI-Powered Object Segmentation
12 pages
He 2017
No ratings yet
He 2017
9 pages
Object Detection and Segmentation - Part 2
No ratings yet
Object Detection and Segmentation - Part 2
36 pages
CondInst: Dynamic Convolutions for Segmentation
No ratings yet
CondInst: Dynamic Convolutions for Segmentation
18 pages
Lecture-22-CAP6412 Spring2018 Mask-RCNN New
No ratings yet
Lecture-22-CAP6412 Spring2018 Mask-RCNN New
36 pages
L10 Lecture Detection - Segmentation v2.5
No ratings yet
L10 Lecture Detection - Segmentation v2.5
35 pages
Term Paper - DL
No ratings yet
Term Paper - DL
22 pages
Journal Pre-Proofs: Neurocomputing
No ratings yet
Journal Pre-Proofs: Neurocomputing
37 pages
02 Semantic Segmentation 2024
No ratings yet
02 Semantic Segmentation 2024
53 pages
Dlcv2017d3l1segmentation 170623173102
No ratings yet
Dlcv2017d3l1segmentation 170623173102
36 pages
Mazen Hany Abd El Salam Hassan
No ratings yet
Mazen Hany Abd El Salam Hassan
8 pages
Advanced Topics in CNN and RNN
No ratings yet
Advanced Topics in CNN and RNN
72 pages
Vision
No ratings yet
Vision
24 pages
YOLACT
No ratings yet
YOLACT
10 pages
SOLOv2 - Dynamic and Fast Instance Segmentation
No ratings yet
SOLOv2 - Dynamic and Fast Instance Segmentation
17 pages
Od Segment 221219 043435
No ratings yet
Od Segment 221219 043435
40 pages
Lecture 22 MaskRCNN
No ratings yet
Lecture 22 MaskRCNN
36 pages
CNNs for Object Detection
No ratings yet
CNNs for Object Detection
34 pages
Convnets 4
No ratings yet
Convnets 4
22 pages
Deep Learning for Image Segmentation
No ratings yet
Deep Learning for Image Segmentation
6 pages
Lecture 5 - CNNs For Detection and Segmentation
No ratings yet
Lecture 5 - CNNs For Detection and Segmentation
62 pages
Mask3D: Mask Transformer For 3D Semantic Instance Segmentation
No ratings yet
Mask3D: Mask Transformer For 3D Semantic Instance Segmentation
12 pages
10 21541-Apjess 1542885-4187651
No ratings yet
10 21541-Apjess 1542885-4187651
5 pages
Lecture 4
No ratings yet
Lecture 4
46 pages
Harley MSC Thesis Menos Especializadpo
No ratings yet
Harley MSC Thesis Menos Especializadpo
71 pages
The Ultimate Guide To Object Detection
No ratings yet
The Ultimate Guide To Object Detection
16 pages
Deep Learning For Computer Vision
No ratings yet
Deep Learning For Computer Vision
181 pages
Facemask Detection Using MMdetection Toolbox
No ratings yet
Facemask Detection Using MMdetection Toolbox
6 pages
Deep Learning for Image Segmentation
No ratings yet
Deep Learning for Image Segmentation
92 pages
8-Image Detection and Segmentation
No ratings yet
8-Image Detection and Segmentation
73 pages
Part 2
No ratings yet
Part 2
225 pages
Lecture2.2 UnimodalRepresentations Part1 PDF
No ratings yet
Lecture2.2 UnimodalRepresentations Part1 PDF
92 pages
3 SipMask: Spatial Information Preservation For Fast Image and Video Instance Segmentation
No ratings yet
3 SipMask: Spatial Information Preservation For Fast Image and Video Instance Segmentation
17 pages
Object Detection & Segmentation Guide
No ratings yet
Object Detection & Segmentation Guide
38 pages
Generalized R-CNN for Researchers
No ratings yet
Generalized R-CNN for Researchers
127 pages
Semantic Segmentation for CS Students
No ratings yet
Semantic Segmentation for CS Students
151 pages
Recent Progress in Semantic Image Segmentation: Xiaolong Liu Zhidong Deng Yuhan Yang
No ratings yet
Recent Progress in Semantic Image Segmentation: Xiaolong Liu Zhidong Deng Yuhan Yang
18 pages
Bolya YOLACT Real-Time Instance Segmentation ICCV 2019 Paper
No ratings yet
Bolya YOLACT Real-Time Instance Segmentation ICCV 2019 Paper
10 pages
Object Detection and Identification
67% (3)
Object Detection and Identification
20 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Computer VIsion Applications
No ratings yet
Computer VIsion Applications
30 pages
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
No ratings yet
Fully Convolutional Networks For Semantic Segmentation: Jonathan Long Evan Shelhamer Trevor Darrell UC Berkeley
10 pages
Face Mask Detection Using Faster R
No ratings yet
Face Mask Detection Using Faster R
13 pages
MV cs4243 2024 Amir 6 p2
No ratings yet
MV cs4243 2024 Amir 6 p2
95 pages
Rec03 - Deep Architectures
No ratings yet
Rec03 - Deep Architectures
65 pages
Da Unit-Iv
No ratings yet
Da Unit-Iv
23 pages
Beginner's Guide to R-CNN Basics
No ratings yet
Beginner's Guide to R-CNN Basics
6 pages
AML - Lecture - 11 - 19nov24
No ratings yet
AML - Lecture - 11 - 19nov24
103 pages
Yolo Family
No ratings yet
Yolo Family
40 pages
Deep Semantic Segmentation New Model of Natural and Medical Images
No ratings yet
Deep Semantic Segmentation New Model of Natural and Medical Images
4 pages
Du 2018 J. Phys. Conf. Ser. 1004 012029
No ratings yet
Du 2018 J. Phys. Conf. Ser. 1004 012029
9 pages
Segmentation Transformer: OCR for Semantic Segmentation
No ratings yet
Segmentation Transformer: OCR for Semantic Segmentation
21 pages
Liang Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks ICCV 2021 Paper
No ratings yet
Liang Instance Segmentation in 3D Scenes Using Semantic Superpoint Tree Networks ICCV 2021 Paper
10 pages
Photographize 092016
100% (1)
Photographize 092016
90 pages
Photographize 072016
No ratings yet
Photographize 072016
90 pages
5000 Years of Geometry Mathematics in History and Culture
No ratings yet
5000 Years of Geometry Mathematics in History and Culture
638 pages
Photographize 112016
No ratings yet
Photographize 112016
88 pages
Photographize 012016
No ratings yet
Photographize 012016
90 pages
Photographize 052016
No ratings yet
Photographize 052016
92 pages
Lec 02 Cam Models
No ratings yet
Lec 02 Cam Models
44 pages
08 ParametricCurves Web
No ratings yet
08 ParametricCurves Web
10 pages
BBC F Efa 2017
No ratings yet
BBC F Efa 2017
100 pages
13 Projection
No ratings yet
13 Projection
10 pages
Engg - Graphics
No ratings yet
Engg - Graphics
140 pages
Challenges and Opportunities in Geometric Modelling of Complex Bio-Inspired 3D Objects Designed For Additive Manufacturing
No ratings yet
Challenges and Opportunities in Geometric Modelling of Complex Bio-Inspired 3D Objects Designed For Additive Manufacturing
42 pages
RV ProjectiveGeometry
No ratings yet
RV ProjectiveGeometry
36 pages
BBC Focus - Health Breakthroughs - Volume 8 2018
No ratings yet
BBC Focus - Health Breakthroughs - Volume 8 2018
100 pages
Search for Alien Life & VR Revolution
No ratings yet
Search for Alien Life & VR Revolution
116 pages
9 Projection Geometry
No ratings yet
9 Projection Geometry
124 pages
Distortion in Perspective Projection
No ratings yet
Distortion in Perspective Projection
5 pages
BBC Focus - February 2018
No ratings yet
BBC Focus - February 2018
100 pages
BBC Focus - December 2018
No ratings yet
BBC Focus - December 2018
108 pages
2016-09-01 BBC Focus
No ratings yet
2016-09-01 BBC Focus
116 pages
Understanding Color Models
No ratings yet
Understanding Color Models
11 pages
2016-07-01 BBC Focus
No ratings yet
2016-07-01 BBC Focus
116 pages
Architects Datafile (ADF) - July 2022
No ratings yet
Architects Datafile (ADF) - July 2022
84 pages
2016-12-01 BBC Focus
No ratings yet
2016-12-01 BBC Focus
132 pages
2016-11-01 BBC Focus
No ratings yet
2016-11-01 BBC Focus
124 pages
الحلولية ووحدة الوجود
No ratings yet
الحلولية ووحدة الوجود
354 pages
Business Combination and Consolidation
93% (14)
Business Combination and Consolidation
21 pages
Economic Survey 2024
No ratings yet
Economic Survey 2024
21 pages
Oman Pearsonvue EXAM
100% (4)
Oman Pearsonvue EXAM
7 pages
Workstation Pro 12 User Guide
No ratings yet
Workstation Pro 12 User Guide
300 pages
The Role of Self-Contrual in Comsumers' eWOM in Social Networking Sites
No ratings yet
The Role of Self-Contrual in Comsumers' eWOM in Social Networking Sites
9 pages
Psychic Reading Price List
No ratings yet
Psychic Reading Price List
1 page
BACS3713 MIS Tutorial 4 - Answer - New
No ratings yet
BACS3713 MIS Tutorial 4 - Answer - New
17 pages
Activity 4 - Drifted Supercontinent
No ratings yet
Activity 4 - Drifted Supercontinent
1 page
Caraway The Genus Carum - 1st Edition All Format Download
100% (11)
Caraway The Genus Carum - 1st Edition All Format Download
16 pages
BL English Nov 2024
No ratings yet
BL English Nov 2024
64 pages
Utopia Analysis
No ratings yet
Utopia Analysis
30 pages
Math Concepts for Students
No ratings yet
Math Concepts for Students
10 pages
Planning Designing and Analysis of Bus Stand With Special Features
No ratings yet
Planning Designing and Analysis of Bus Stand With Special Features
21 pages
Krishnas Tak Am
100% (1)
Krishnas Tak Am
2 pages
60 KW Proposal Gurdeep Singh
No ratings yet
60 KW Proposal Gurdeep Singh
9 pages
International Road Freight Transport Navigating Global Logistics
No ratings yet
International Road Freight Transport Navigating Global Logistics
10 pages
Fortigate Sd-Wan Configuration
No ratings yet
Fortigate Sd-Wan Configuration
5 pages
Trade - 2018 - Class-9-10 Computer & ICT-1 Web (WI)
No ratings yet
Trade - 2018 - Class-9-10 Computer & ICT-1 Web (WI)
358 pages
Srilankan Bo
No ratings yet
Srilankan Bo
4 pages
Plapoly JOS CAMP Schedule
No ratings yet
Plapoly JOS CAMP Schedule
1 page
Ccbf812c886b47a PDF
No ratings yet
Ccbf812c886b47a PDF
30 pages
LifeFiber ADSS 8-Span 80m
No ratings yet
LifeFiber ADSS 8-Span 80m
8 pages
SPC - Hypermotard 950 SP - en - MY19
No ratings yet
SPC - Hypermotard 950 SP - en - MY19
164 pages
OPT B1plus U03 Grammar Standard
No ratings yet
OPT B1plus U03 Grammar Standard
1 page
DISC 230 - Introduction To Business Process Modelling - S - 19-20 Revised
No ratings yet
DISC 230 - Introduction To Business Process Modelling - S - 19-20 Revised
6 pages
Lecture Notes
No ratings yet
Lecture Notes
3 pages
Italian Education Document Verification Services: To Get An Appointment
No ratings yet
Italian Education Document Verification Services: To Get An Appointment
2 pages
Print Question Paper
No ratings yet
Print Question Paper
1 page
Experiment No. 2 Title: Hol-2287-01-Hbd - Vmware Cloud On Aws
No ratings yet
Experiment No. 2 Title: Hol-2287-01-Hbd - Vmware Cloud On Aws
15 pages

Instance Segmentation

Uploaded by

Instance Segmentation

Uploaded by

Instance Segmentation

Riley Simmons-Edler, Berthy Feng

● Label each foreground pixel with object

Slide Credit: Kaiming He

● Microsoft COCO dataset

1. Detecting non-iconic views

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Tsung-Yi Lin et al.

Image Credit: Shaoqing Ren et al. Image Credit: Tomasz Grel

Image Credit: Shaoqing Ren et al.

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Image Credit: Tomasz Grel

Slide Credit: Kaiming He

Slide Credit: Kaiming He

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

Image Credit: Kaiming He et al.

● Add parallel mask prediction head to Faster-RCNN

A = set of object B = set of object

How can we know C = A U B?

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

● Jointly train detection head and mask head end-to-end

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Extension: FCN+MLP mask heads

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

Image Credit: Ronghang Hu et al.

You might also like