Administrivia: CMPSCI 370: Introduction To Computer Vision

This document provides administrative information for the CMPSCI 370: Introduction to Computer Vision course. It includes details about lectures, the instructor, grading policy, homework assignments, textbooks, necessary background, and an introduction to the topic of computer vision. The instructor begins with some experiment examples asking students to determine if an image contains an animal or not. The document then discusses how human vision works and some optical illusions that can trick our vision system. It introduces several visual cues that human vision uses to interpret images like linear perspective, aerial perspective, occlusion ordering, and texture gradients.

CMPSCI 370: Introduction to Computer Vision
Instructor: Subhransu Maji
University of Massachusetts, Amherst
January 19, 2016

Administrivia
Lectures
Tuesday/Thursday, 11:30 - 12:45, Hasbrouck 113
Honors section: Tuesday 4:00 - 5:00, CS 142
Instructor: Subhransu Maji
Office hours: Monday, 3:00 - 5:00, CS 274
Website: http://www-edlab.cs.umass.edu/~smaji/cmpsci370/
News, lecture slides, etc. (check regularly)
Homework submission via Moodle

Administrivia
Grading policy:
370: homework (60%), mid-term (15%), final (25%)
370HH: homework (45%), mid-term (10%), final (15%), project (20%)
Homework
5 in total (expect one every two weeks)
First one will be posted this Thursday
Course textbooks (recommended)
Richard Szeliski, Computer Vision: Algorithms and Applications (available online as pdf) - readings will be from this
Necessary background: Linear algebra, calculus, probability, programming in Matlab (image toolbox needed)

Homework #0
1. Figure out a way to run Matlab
Obtain a student copy (Matlab suite, $99)
Your lab machines might have it
2. Learn how to program in Matlab
Plenty of online resources (the course website lists some)
Alternatives: Python, Octave, Java, C++, ...
Question: How many of you are familiar with Matlab?
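
The 370 grading policy above amounts to a weighted average of three components. A minimal sketch of that arithmetic follows; the function name and the example scores are hypothetical, and only the 370 weights from the slide are used.

```python
# Weights for CMPSCI 370 as listed on the grading slide.
WEIGHTS_370 = {"homework": 0.60, "midterm": 0.15, "final": 0.25}


def course_grade(scores, weights):
    """Weighted average of component scores (each on a 0-100 scale)."""
    if abs(sum(weights.values()) - 1.0) > 1e-9:
        raise ValueError("weights must sum to 1")
    return sum(weights[name] * scores[name] for name in weights)


# Hypothetical scores, for illustration only.
example_scores = {"homework": 90.0, "midterm": 80.0, "final": 70.0}
grade = course_grade(example_scores, WEIGHTS_370)
print(round(grade, 2))
```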

Before we start
Are there any questions?

Why Vision?
It is how we see other people, navigate our environment, communicate ideas, entertain, and measure the world around us.
Source: A. Berg

Light! Why is light good for measurement?
Remote sensing, microscopy, surveillance, 3D analysis / navigation
Plentiful, sometimes free
Interacts with many things, but not too many
Goes generally straight over distance
Very small → high spatial resolution
Fast, but not too fast → time of flight sensors
Easy to detect → cameras work, are cheap
Comes in many flavors (wavelengths)
Source: A. Berg
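
The "fast, but not too fast" point can be made concrete with a round-trip timing calculation. This is a sketch under a simple assumption that the slides do not spell out: the sensor emits a pulse, the pulse reflects off the scene, and distance is half the round-trip time multiplied by the speed of light. The function names are illustrative.

```python
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0  # exact, by definition of the metre


def tof_distance_m(round_trip_s):
    """Distance to a surface given the round-trip travel time of a light pulse."""
    return SPEED_OF_LIGHT_M_PER_S * round_trip_s / 2.0


def round_trip_time_s(distance_m):
    """Round-trip travel time of a light pulse to a surface at the given distance."""
    return 2.0 * distance_m / SPEED_OF_LIGHT_M_PER_S


# A 20 ns round trip corresponds to roughly 3 m.
print(tof_distance_m(20e-9))
# Resolving 1 cm in depth means timing differences of well under a nanosecond.
print(round_trip_time_s(0.01))
```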

The goal of computer vision
Extract properties of the world from visual data (i.e., measurements of light)
We are remarkably good at this!

An experiment #1 through #6
animal or not? (six slides, one image each)

The images
#1 #2 #3 #4 #5 #6

Human vision
Amazingly good, fast and accurate
Sometimes wrong, but often not in doubt
Huge amount of bandwidth to the brain is visual data
Large amount of the brain seems to be for processing visual data
Vision is difficult!
Source: A. Berg

But we make mistakes
Checker shadow illusion - Edward H. Adelson

Other optical illusions
Are the horizontal lines parallel? Are the purple lines straight?
Is this a spiral? Is the left circle (in the center) bigger?
Are these failures of our vision system?
http://www.illusions.org

Vision as inverse of graphics
Many possibilities: how do we solve this ambiguity?
Images are confusing, but they also reveal the structure of the world through numerous cues
Our job is to interpret the cues!
(following slides from J. Koenderink)

Cues: Linear perspective
Parallel lines merge at the horizon
Analyzing parallel lines to estimate space

Cues: Aerial (Atmospheric) perspective
Scattering of skylight by particles in the air adds to the luminosity
As the distance of the object from the viewer increases, the contrast between the object and its background decreases.

Image credits (both slides): Photo by ole Wind; http://kalisdigitalphotos.blogspot.com
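
The linear perspective cue can be sketched numerically: two scene-parallel lines, once projected into the image, meet at a vanishing point. The example below assumes the two lines have already been marked by two image points each (the coordinates are made up) and uses homogeneous coordinates, where both the line through two points and the intersection of two lines are cross products.

```python
def cross(a, b):
    """Cross product of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def line_through(p, q):
    """Homogeneous image line through two image points (x, y)."""
    return cross((p[0], p[1], 1.0), (q[0], q[1], 1.0))


def vanishing_point(line_a, line_b):
    """Intersection of two image lines, or None if they are parallel in the image."""
    x, y, w = cross(line_a, line_b)
    if abs(w) < 1e-12:
        return None
    return (x / w, y / w)


# Hypothetical image coordinates of two rails that converge toward the horizon.
left_rail = line_through((0, 0), (1, 2))
right_rail = line_through((10, 0), (9, 2))
vp = vanishing_point(left_rail, right_rail)
print(vp)
```

Lines that stay parallel in the image have no finite intersection, which is why the function returns None in that case.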

Cues: Occlusion ordering

Cues: Texture gradient

Image credits (both slides): Gustave Caillebotte, Paris Street, Rainy Day, 1877, Art Institute of Chicago; Chicago loop, image source: Wikipedia

Cues: Shading and Lighting
The four seasons sculpture set

Many other cues
Motion parallax: how things move relative to each other as we move. Objects near us move more than objects far away. Also provides grouping cues.
Familiar size: the size of known things, e.g. faces, gives us an estimate of the depth.
Defocus blur: far away objects are blurrier than nearer ones. Commonly used in photographs to create a perception of depth.
Elevation: distance from the horizon. Objects closer to the horizon are perceived to be farther.
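
The familiar-size cue has a simple quantitative form under a pinhole camera model, which is an assumption made here rather than something stated on the slide. An object of known real height H that appears h pixels tall in a camera with focal length f (in pixels) is at depth Z = f * H / h. The focal length and face height below are made-up values.

```python
def depth_from_familiar_size(focal_length_px, real_height_m, image_height_px):
    """Depth of an object of known size under a pinhole model: Z = f * H / h."""
    if image_height_px <= 0:
        raise ValueError("image height must be positive")
    return focal_length_px * real_height_m / image_height_px


# Hypothetical numbers: an 800 px focal length and a face about 0.2 m tall.
near = depth_from_familiar_size(800.0, 0.2, 80.0)  # face spans 80 px
far = depth_from_familiar_size(800.0, 0.2, 40.0)   # face spans 40 px
print(near, far)
```

Halving the apparent size doubles the estimated depth, which matches the intuition behind the cue.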

The study of computer vision
Lots of tasks: detection, classification, segmentation, pose estimation, depth estimation, etc.
Problems are often ill-posed. Most of the hard work is in crisply defining the problem you wish to solve.
It is hard, ad-hoc. There are few theorems, but we rely on those from many other areas: optics, geometry, physics, etc.
You are in good company:
Euclid, Alhazen, da Vinci, Kepler, Galileo, Descartes, Newton, Huygens, Maxwell, Helmholtz, Mach, Hering, Cajal, Minkowski, Hubel & Wiesel, Wald
If that is not enough, there are many applications
(following slides from Charless Fowlkes)

Optical character recognition (OCR)
Digit recognition: yann.lecun.com
License plate readers (google street view)
Sudoku grabber: http://sudokugrab.blogspot.com/
Automatic cheque readers (most bank ATMs)
Source: S. Seitz, N. Snavely

Biometrics
Fingerprint scanners are now on many new laptops and other devices
Face recognition systems are beginning to appear more widely: http://www.sensiblevision.com
Source: S. Seitz

Face detection
Face detection is on many cameras these days
Source: S. Seitz

Smile detection
Source: S. Seitz

Face recognition
http://www.apple.com/ilife/iphoto
Source: S. Seitz

Instance recognition
Source: S. Seitz

Automotive safety
Mobileye: Vision systems on high end BMW, GM, Volvo models
Pedestrian collision warning
Forward collision warning
Lane departure warning
Headway monitoring and warning
Source: A. Shashua, S. Seitz

Self-driving cars
Source: L. Lazebnik

Interactive interfaces
Microsoft Kinect depth sensors
Source: L. Lazebnik

Large-scale 3D reconstruction
Photo Tourism: Exploring Photo Collections in 3D
YouTube link
Source: S. Seitz, N. Snavely

Vision for robotics, space exploration
NASA's Curiosity Rover has 17 cameras as a part of its sensing system
http://en.wikipedia.org/wiki/Curiosity_(rover)

What is this course about?
Course overview
I. Early vision: image formation, sensing, light and shading, filtering
II. Mid-level vision: grouping, perceptual organization
III. Multi-view geometry
IV. Recognition
V. Additional topics (time permitting)
Goal: To develop vision researchers. You can come up with a reasonable solution to various vision problems (and implement it yourself).
We are not going to cover:
Graphics: Physics of light transport, material properties, rendering
Computational photography: design of sensing devices, etc.
How the human vision system works

I. Early vision
Basic image formation and processing
Cameras and sensors, image formation
Light and color
Linear filtering
Edge detection
Feature extraction: corner and blob detection, key-point detection
Source: L. Lazebnik
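
As a preview of the linear filtering and edge detection topics listed under early vision, here is a minimal sketch in plain Python rather than Matlab. It implements cross-correlation over the "valid" region only (true convolution would flip the kernel first), and the tiny step-edge image and difference kernel are made up for illustration.

```python
def filter2d(image, kernel):
    """Cross-correlate a 2D image (list of rows) with a 2D kernel, 'valid' region only."""
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    out = []
    for r in range(out_h):
        row = []
        for c in range(out_w):
            # Each output pixel is a weighted sum of the input pixels under the kernel.
            acc = 0
            for i in range(kh):
                for j in range(kw):
                    acc += kernel[i][j] * image[r + i][c + j]
            row.append(acc)
        out.append(row)
    return out


# A dark-to-bright vertical step edge, three rows tall.
step_edge = [[0, 0, 0, 9, 9, 9] for _ in range(3)]
# A horizontal difference kernel responds where intensity changes from left to right.
horizontal_diff = [[-1, 0, 1]]
edges = filter2d(step_edge, horizontal_diff)
print(edges[0])
```

The response is zero in the flat regions and large only around the step, which is the basic idea behind gradient-based edge detection.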

II. Mid-level vision
Model fitting and grouping
Fitting: Least squares, Hough transform, RANSAC
Alignment
Source: L. Lazebnik

III. Multi-view geometry
Stereo
Epipolar geometry
Structure from motion: Tomasi & Kanade (1993)
Affine structure from motion
Projective structure from motion: Here be dragons!
Source: L. Lazebnik
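
Two of the fitting methods named under mid-level vision, least squares and RANSAC, can be contrasted on a toy line-fitting problem. This is a sketch, not the course's implementation: the data, threshold, iteration count, and seed are made up, and the points are assumed to have distinct x values so that no sampled pair is vertical.

```python
import random


def fit_line_least_squares(points):
    """Closed-form least-squares fit of y = m*x + b to (x, y) points."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    m = sxy / sxx
    return m, mean_y - m * mean_x


def fit_line_ransac(points, threshold=0.5, iterations=50, seed=0):
    """Fit a line robustly: sample point pairs, keep the largest consensus set, refit."""
    rng = random.Random(seed)
    best_inliers = []
    for _ in range(iterations):
        (x1, y1), (x2, y2) = rng.sample(points, 2)
        if x1 == x2:
            continue  # skip vertical pairs, which y = m*x + b cannot represent
        m = (y2 - y1) / (x2 - x1)
        b = y1 - m * x1
        inliers = [(x, y) for x, y in points if abs(y - (m * x + b)) <= threshold]
        if len(inliers) > len(best_inliers):
            best_inliers = inliers
    return fit_line_least_squares(best_inliers), best_inliers


# Eight points on y = 2x + 1 plus two gross outliers (made-up data).
points = [(x, 2 * x + 1) for x in (0, 1, 2, 4, 5, 6, 8, 9)] + [(3, 20), (7, -5)]
plain_m, plain_b = fit_line_least_squares(points)
(robust_m, robust_b), inliers = fit_line_ransac(points)
print(plain_m, plain_b)
print(robust_m, robust_b, len(inliers))
```

Plain least squares is pulled away from the true line by the two outliers, while the RANSAC consensus set excludes them before the final refit.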

IV. Recognition
bag-of-words models
part-based models
learning

V. Additional topics
Deep learning
Human-centric vision
Optical flow
Tracking

For next class
Familiarize yourself with MATLAB (more information is on the course page)
Student copy is $99 from Matlab's page
UMASS IT (100% free): https://www.it.umass.edu/support/software

Readings:
The speed of processing in the human visual system, Thorpe et al., Letters to Nature, 1996
Chapter 1 in RS textbook
