0% found this document useful (0 votes)

21 views42 pages

Visual Object Tracking

The document discusses visual object tracking, focusing on the objective of locating objects over time in video sequences. It outlines the formal definition, approaches (probabilistic and discriminative tracking), and challenges such as appearance variations and temporal drift. Additionally, it highlights the integration of CNNs for improved feature extraction in tracking tasks.

Uploaded by

Đặng Minh Hoàng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views42 pages

Visual Object Tracking

Uploaded by

Đặng Minh Hoàng

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 42

Visual Object Tracking

Instructor: Seunghoon Hong

Visual object tracking
Objective: locating the object(s) over time in a video

Initial frame

Target Tracking over

Visual Tracking
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.

In learning perspective:
● Classiﬁcation problem with a single object class (= target vs distractors)
● Labeled data is given at only the initial frame
● Optionally requires online learning to adapt the variations in a video
● Online learning is driven by a self-supervision (training data = tracking results)
Visual object tracking
Objective: locating the object(s) over time in a video
Formal deﬁnition: given an object state at the initial frame z0=(x0,y0,w0,h0),
identify z1:T={z1,z2,…,zT} over a video of length T.

Two sub-categories:
● Single target tracking
○ Tracking only one object in an video
○ Single-class classiﬁcation (target vs. distractors)
● Multi target tracking
○ Tracking multiple objects in a video
○ Multi-class classiﬁcation (target 1 vs. target 2 vs. target 3 vs. … vs. distractors)
Approaches in single object tracking
● Probabilistic tracking
○ Formulate the localization task as a sequential probabilistic inference problem
○ Given a probability of the initial target location, propagate it over the remaining frames
Approaches in single object tracking
● Probabilistic tracking
○ Formulate the localization task as a sequential probabilistic inference problem
○ Given a probability of the initial target location, propagate it over the remaining frames

● Discriminative tracking
○ Classify the object from the distractors at every frame
○ Can be considered as sequential binary object detection (class = target, background)
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Likelihood Prior
Posterior
the measurement of The belief of object state
the probability of
how likely the without observation
object state given
observation
an observation
coincide with the
given state
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Prior
1 The belief of object state
without observation

2 3 Where is the target

likely to exist?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Likelihood
the measurement of
how likely the
observation
coincide with the
given state
Which region of
image look similar
to the target?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule
z: object location (state)
x: frame (observation)

Target template
Posterior
the probability of
object state given
an observation

Where is the object

in this frame?
Probabilistic tracking
● Tracking as a Bayesian network

Bayes Rule

Sequential Bayesian ﬁltering

z1:T: object locations in frame 1 to T

x1:T: frames 1 to T
Probabilistic tracking
● Hidden Markov Model

● Markovian assumption
Probabilistic tracking
● Sequential Bayesian ﬁltering

Integration over all object locations!

Likelihood Prior

Likelihood Transition Posterior upto

model the previous frame
Probabilistic tracking
● Approximation by Monte Carlo sampling

where
Probabilistic tracking
● Particle ﬁltering (Sequential Markov-Chain Monte-Carlo)
○ Approximate the prior distribution using Markov-Chain Monte Carlo (MCMC) sampling
Probabilistic tracking pipeline
Frame t-1 Frame t

2. Move samples by
1. Extract samples transition model 3. Re-evaluate likelihood
proportional to using appearance model
previous posterior
Probabilistic tracking pipeline
Frame t

Tracking procedure (simpliﬁed):

1. Sample target states near the previous
target location
2. Evaluate the likelihood based on
appearance model

Example target
appearance model

3. Select the most probable sample as the

target at the current frame

4. Update the target appearance model

using the current tracking results
Attendance check
https://forms.gle/rGpXxLKZ4jbcArid8
Discriminative tracking pipeline
Quick overview: learning tracking-by-detection
● Objective: a ridge regression
Model parameters

Training Training data

labels
Quick overview: learning tracking-by-detection
● Objective: a ridge regression

How do we solve it?

Quick overview: learning tracking-by-detection
● Objective: a ridge regression

We should update this classiﬁer for every frames

(i.e. every time we perform tracking and
get positive/negative samples)

Can we make it faster?

Correlation ﬁltering
● We can make it extremely fast for certain positive/negative sets!
Negative samples
(translated samples)

+30 +15 -15 -30

Base sample
(tracking results)
Correlation ﬁltering
● Representing positive/negative images using circulant matrices

Consider base sample x as n-dimensional array

Circulant matrix

Positive sample

Negative samples
Correlation ﬁltering
● Any circulant matrices can be made diagonal by the Discrete Fourier Transform
(DFT)
DFT matrix
(constant,
independent to x)
DFT of base sample
Correlation ﬁltering
● Putting all together

Circulant matrix

Matrix inner-product

Plug into ridge

regression
Kernelized Correlation ﬁltering
● Easy to extend to kernelized version

ridge regression

ridge regression with

kernel

We can do fast
computation if kernel
matrix K is circulant matrix

Fortunately, it has been

shown that most useful
kernels are circulant[1]

[1] Henriques et al., High-Speed Tracking with Kernelized Correlation Filters, In TPAMI, 2015
Challenges
● Modeling severe appearance variations in a video

ﬁgure credit: Li et al., A survey of appearance models in visual object tracking

Modeling appearance for tracking
● Classic: hand-designed features
○ Color histogram
○ Intensity
○ Object Templates
○ Key-points (SIFT)
○ …
● Issue
○ All prone to overﬁtting
○ Cannot generalize to various appearances
Integrating CNN for appearance modeling
● Beneﬁts
○ Features from a pre-trained CNN can be robust against various appearance changes
○ Especially useful in tracking since we have only one target ground-truth in the initial frame
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
CNN-based tracking
● CNNTrack: direct application of CNN feature for tracking
Discussions
● Limitations?
Better representation learning with videos
● MDNet: learn representation for tracking with a large amount of videos
Challenges in visual object tracking
● Temporal drift (i.e. error propagation through time)
○ Drift in posterior estimation: the error in posterior propagates through time
○ Drift in appearance model: if update the appearance model in temporal failure, the error will
propagate

● But why is it so prune to temporal drift?

Summary: Visual tracking
● Object localization in a video
● Probabilistic vs. discriminative tracking
● Modeling target appearance is important
○ Essential to evaluate the affinity of samples in both tracking frameworks
○ Should be able to handle a wide range of appearance variations
○ Should be able to generalize well from a single ground-truth at initial frame
● CNN for visual tracking
○ Applying a pre-trained CNN for feature extraction
○ Training CNN with many heterogeneous videos for tracking

Wa0002.
No ratings yet
Wa0002.
21 pages
Single Object Tracking A Survey of Methods Dataset
No ratings yet
Single Object Tracking A Survey of Methods Dataset
15 pages
Computer Vision Paper
No ratings yet
Computer Vision Paper
3 pages
Lecture 9.2 Motion & Video Analysis in Computer Vision 2025
No ratings yet
Lecture 9.2 Motion & Video Analysis in Computer Vision 2025
49 pages
Object Tracking Methods-A Review
No ratings yet
Object Tracking Methods-A Review
7 pages
Object Tracking Using Radial Basis Function Networks
No ratings yet
Object Tracking Using Radial Basis Function Networks
11 pages
Tempest 160314194757
No ratings yet
Tempest 160314194757
28 pages
Object Tracking Using Radial Basis Function Networks
No ratings yet
Object Tracking Using Radial Basis Function Networks
9 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
45 pages
1602 00763
No ratings yet
1602 00763
5 pages
A Review of Visual Moving Target Tracking
No ratings yet
A Review of Visual Moving Target Tracking
30 pages
25 Object Tracking
No ratings yet
25 Object Tracking
29 pages
Object Tracking
No ratings yet
Object Tracking
20 pages
2022 - TrackFormer Multi-Object Tracking With Transformers
No ratings yet
2022 - TrackFormer Multi-Object Tracking With Transformers
11 pages
Pedestrian Detection and Tracking
No ratings yet
Pedestrian Detection and Tracking
13 pages
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
No ratings yet
Fast CNN-Based Object Tracking Using Localization Layers and Deep Features Interpolation
6 pages
Object Tracking Based On Appearance and Depth Information
No ratings yet
Object Tracking Based On Appearance and Depth Information
19 pages
2022 Visual Object Tracking A Survey
No ratings yet
2022 Visual Object Tracking A Survey
42 pages
Yilmaz 2006
No ratings yet
Yilmaz 2006
45 pages
Unit 5
No ratings yet
Unit 5
18 pages
Porikli2012 BookChapter Objectdetectionandtracking
No ratings yet
Porikli2012 BookChapter Objectdetectionandtracking
40 pages
CNNTracking TNN10 Human
No ratings yet
CNNTracking TNN10 Human
14 pages
Combined Major Project
No ratings yet
Combined Major Project
8 pages
Real-Time Multi-Class Tracking
No ratings yet
Real-Time Multi-Class Tracking
10 pages
Tag Draft Especializado
No ratings yet
Tag Draft Especializado
14 pages
5 Major Computervision Technique
No ratings yet
5 Major Computervision Technique
10 pages
CV Unit 5
No ratings yet
CV Unit 5
11 pages
Trackformer
No ratings yet
Trackformer
16 pages
Zhang 2020
No ratings yet
Zhang 2020
5 pages
A Detection-Based Multiple Object Tracking Method: Mei Han Amit Sethi Yihong Gong
No ratings yet
A Detection-Based Multiple Object Tracking Method: Mei Han Amit Sethi Yihong Gong
4 pages
Learning Rich Feature Representation and Aggregation For Accurate Visual Tracking
No ratings yet
Learning Rich Feature Representation and Aggregation For Accurate Visual Tracking
19 pages
A Survey of Appearance Models in Visual Object Tracking: Image Processing and Computer Vision
No ratings yet
A Survey of Appearance Models in Visual Object Tracking: Image Processing and Computer Vision
42 pages
Real Time Object Detection and Tracking Using Deep Learning and Opencv
No ratings yet
Real Time Object Detection and Tracking Using Deep Learning and Opencv
4 pages
CNN For Object Tracking
No ratings yet
CNN For Object Tracking
44 pages
Object Detection for Engineering Students
No ratings yet
Object Detection for Engineering Students
16 pages
Object Detection and Tracking in Video Sequences
No ratings yet
Object Detection and Tracking in Video Sequences
6 pages
Object PDF
No ratings yet
Object PDF
6 pages
Smart Cards
No ratings yet
Smart Cards
39 pages
Object Detection and Tracking in Video Sequences
No ratings yet
Object Detection and Tracking in Video Sequences
6 pages
Yilmaz
No ratings yet
Yilmaz
45 pages
A Real Time Face Tracking System Based On Multiple Information Fusion
No ratings yet
A Real Time Face Tracking System Based On Multiple Information Fusion
19 pages
Moving Object Tracking and Detection in Videos Using MATLAB: A Review
No ratings yet
Moving Object Tracking and Detection in Videos Using MATLAB: A Review
9 pages
Object Tracking
100% (1)
Object Tracking
22 pages
Video Object Tracking Guide
No ratings yet
Video Object Tracking Guide
30 pages
Self-Supervised Deep Correlation Tracking
No ratings yet
Self-Supervised Deep Correlation Tracking
10 pages
Detect To Track and Track To Detect
No ratings yet
Detect To Track and Track To Detect
10 pages
Ijaerv10n9spl 339
No ratings yet
Ijaerv10n9spl 339
9 pages
Cviii 2024 Ws
No ratings yet
Cviii 2024 Ws
98 pages
Real-Time People Tracking in A Camera Network: Wasit Limprasert, Andrew Wallace, and Greg Michaelson
No ratings yet
Real-Time People Tracking in A Camera Network: Wasit Limprasert, Andrew Wallace, and Greg Michaelson
9 pages
Guide Prof. P.J Engineer Co-Guide Prof. M.C Patel: Prepared by Parthiv Bharti P09 EC 916
No ratings yet
Guide Prof. P.J Engineer Co-Guide Prof. M.C Patel: Prepared by Parthiv Bharti P09 EC 916
31 pages
Ilchae Jung Real-Time MDNet ECCV 2018 Paper
No ratings yet
Ilchae Jung Real-Time MDNet ECCV 2018 Paper
16 pages
Tracking by Instance Detection - A Meta-Learning Approach
No ratings yet
Tracking by Instance Detection - A Meta-Learning Approach
10 pages
Kalman Filters CV PT2
No ratings yet
Kalman Filters CV PT2
32 pages
12 CS1AC16 Detection and Tracking
No ratings yet
12 CS1AC16 Detection and Tracking
4 pages
Naeem 2013
No ratings yet
Naeem 2013
7 pages
Computer Vision Based Moving Object Detection and Tracking: Suresh Kumar, Prof. Yatin Kumar Agarwal
No ratings yet
Computer Vision Based Moving Object Detection and Tracking: Suresh Kumar, Prof. Yatin Kumar Agarwal
6 pages
Moving Object Analysis Techniques in Videos - A Review: Ritika, Gianetan Singh Sekhon
No ratings yet
Moving Object Analysis Techniques in Videos - A Review: Ritika, Gianetan Singh Sekhon
6 pages
Moving Object Recognization, Tracking and Destruction
No ratings yet
Moving Object Recognization, Tracking and Destruction
45 pages
Nutrition and Dental Health Nutrition and Dental Health 2nd Edition by Ann Ehrlich ISBN 0827357168 9780827357167
100% (17)
Nutrition and Dental Health Nutrition and Dental Health 2nd Edition by Ann Ehrlich ISBN 0827357168 9780827357167
83 pages
Book Title: Integrated Devices and Circuits For Artificial Intelligence
No ratings yet
Book Title: Integrated Devices and Circuits For Artificial Intelligence
1 page
Planificación de Inglés-2do A-Matutina-Semana 1-2
No ratings yet
Planificación de Inglés-2do A-Matutina-Semana 1-2
6 pages
Toledo School of Translators and Its Importance in The History of Translation in The West (#1538569) - 4168837
No ratings yet
Toledo School of Translators and Its Importance in The History of Translation in The West (#1538569) - 4168837
7 pages
Child-Rearing in Islamic Middle East
No ratings yet
Child-Rearing in Islamic Middle East
2 pages
Psychology Vocbulry in Use
No ratings yet
Psychology Vocbulry in Use
11 pages
The Effect of BPM of Music On Short-Term Memorization and Recall
No ratings yet
The Effect of BPM of Music On Short-Term Memorization and Recall
2 pages
Criminology A Sociological Understanding 6th Edition Steve E Barkan Ebook and TestBank Bundle Full Version
No ratings yet
Criminology A Sociological Understanding 6th Edition Steve E Barkan Ebook and TestBank Bundle Full Version
337 pages
Fahad Fresh Final-CV-23
No ratings yet
Fahad Fresh Final-CV-23
4 pages
Research Methods - STA630 Power Point Slides Lecture 15
No ratings yet
Research Methods - STA630 Power Point Slides Lecture 15
19 pages
Association Rule Generation For Student Performance Analysis Using Apriori Algorithm
No ratings yet
Association Rule Generation For Student Performance Analysis Using Apriori Algorithm
5 pages
Elementary Literacy Development Plan
No ratings yet
Elementary Literacy Development Plan
16 pages
Grade 8 Math Exam 2023-2024
No ratings yet
Grade 8 Math Exam 2023-2024
3 pages
Standard 1 Assessment Task
No ratings yet
Standard 1 Assessment Task
3 pages
Chapter 2
No ratings yet
Chapter 2
16 pages
GENDER6 Dana
No ratings yet
GENDER6 Dana
16 pages
Current Surgical Therapy Electronic 14th Edition Download Instantly
No ratings yet
Current Surgical Therapy Electronic 14th Edition Download Instantly
312 pages
Formative Assessment Tasks for Grades 4-6
No ratings yet
Formative Assessment Tasks for Grades 4-6
8 pages
Thesis Writing Help for Students
100% (2)
Thesis Writing Help for Students
8 pages
Being A Mentor in Heineken Final
100% (2)
Being A Mentor in Heineken Final
7 pages
Teachers Resource Catalogue Grade 12
No ratings yet
Teachers Resource Catalogue Grade 12
42 pages
Compliance Officer & Audit Expert Resume
No ratings yet
Compliance Officer & Audit Expert Resume
3 pages
Ôn Tập Và Kiểm Tra Đánh Giá Tiếng Anh 7
No ratings yet
Ôn Tập Và Kiểm Tra Đánh Giá Tiếng Anh 7
60 pages
Public Speaking Course Guide
No ratings yet
Public Speaking Course Guide
4 pages
Pracres1 Chapter 4 Powerpoint 2021
No ratings yet
Pracres1 Chapter 4 Powerpoint 2021
12 pages
Invitation Letter of Maths Competition To The Other Schools - 2025
No ratings yet
Invitation Letter of Maths Competition To The Other Schools - 2025
5 pages
Physics For Scientists and Engineers With Modern Physics 4th Edition Full Download
No ratings yet
Physics For Scientists and Engineers With Modern Physics 4th Edition Full Download
411 pages
如何撰寫文章摘要
100% (1)
如何撰寫文章摘要
5 pages
Unit 2-Art Vs Craft
No ratings yet
Unit 2-Art Vs Craft
5 pages
Class X Result 2021
No ratings yet
Class X Result 2021
1 page

Visual Object Tracking

Uploaded by

Visual Object Tracking

Uploaded by

Visual Object Tracking

Instructor: Seunghoon Hong

Target Tracking over

2 3 Where is the target

Where is the object

Sequential Bayesian ﬁltering

z1:T: object locations in frame 1 to T

Integration over all object locations!

Likelihood Transition Posterior upto

Tracking procedure (simpliﬁed):

3. Select the most probable sample as the

4. Update the target appearance model

Training Training data

How do we solve it?

We should update this classiﬁer for every frames

Can we make it faster?

+30 +15 -15 -30

Consider base sample x as n-dimensional array

Plug into ridge

ridge regression with

Fortunately, it has been

ﬁgure credit: Li et al., A survey of appearance models in visual object tracking

● But why is it so prune to temporal drift?

You might also like