Unit - 2 Computer Vision

The document provides an overview of Computer Vision (CV), a branch of Artificial Intelligence that enables computers to interpret visual information. It covers basic techniques, applications in various fields such as healthcare and autonomous vehicles, popular libraries and tools for implementation, and ethical considerations surrounding CV. Key topics include image processing, facial recognition, and the implications of privacy and bias in CV technologies.

Uploaded by

shokatbavaliya11


Unit 2

COMPUTER VISION
INDEX

🞭 Introduction
🞭 Basic techniques of Computer Vision
🞭 Applications of Computer Vision
🞭 Computer Vision Libraries and Tools
🞭 Ethical Considerations in Computer Vision
INTRODUCTION
🞭 Computer Vision (CV) is a branch of
Artificial Intelligence (AI) that helps
computers to interpret and understand
visual information much like humans.
🞭 This unit is aimed at both beginners and
experienced professionals and covers key
concepts such as Image Processing, Feature
Extraction, Object Detection, Image
Segmentation and other core techniques in CV.
BASIC TECHNIQUES OF COMPUTER
VISION
🞭 Although the basics of computer vision seem easy,
processing and understanding an image by machine
is quite difficult. Here is why:
🞭 An image consists of many pixels, a pixel being
the smallest unit into which the image can be
divided.
🞭 Computers process images in the form of an array
of pixels, where each pixel has a set of values,
representing the presence and intensity of the
three primary colors: red, green, and blue.
🞭 All pixels come together to form a digital image.
THIS IS HOW A COMPUTER “SEES” AN IMAGE
🞭 Each value is the pixel intensity at a
particular coordinate in the image, with
255 representing pure white and 0
representing pure black.
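The pixel-array view described above can be sketched in a few lines of plain Python. This is only an illustration: the tiny images, the helper names `brightest` and `to_gray`, and the channel-averaging rule for converting colour to grayscale are assumptions made for the example, not something the slides specify.

```python
# A tiny 3x3 grayscale "image": each entry is one pixel intensity in 0..255.
gray = [
    [0, 128, 255],
    [64, 128, 192],
    [255, 128, 0],
]

# A colour image stores three values per pixel: (red, green, blue).
rgb = [
    [(255, 0, 0), (0, 255, 0)],
    [(0, 0, 255), (255, 255, 255)],
]


def brightest(image):
    """Return (row, col, value) of the first brightest pixel in a grayscale grid."""
    return max(
        ((r, c, v) for r, row in enumerate(image) for c, v in enumerate(row)),
        key=lambda t: t[2],
    )


def to_gray(pixel):
    """Average the three channels (a simple conversion assumed for this sketch)."""
    r, g, b = pixel
    return (r + g + b) // 3


height, width = len(gray), len(gray[0])
print(height, width)                                # image dimensions
print(gray[0][2])                                   # top-right pixel intensity
print(brightest(gray))                              # location of a white pixel
print([[to_gray(p) for p in row] for row in rgb])   # colour image as grayscale
```

In practice an image library would hand back the same structure as a multi-dimensional array rather than nested lists, but the indexing idea (row, column, channel) is the same.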
🞭 Some operations commonly used in computer vision from a
deep learning perspective include:
🞭 Convolution: Convolution in computer vision is an operation in which
a learnable kernel is “convolved” with the image. In other words, the
kernel is slid across the image pixel by pixel; at every position, the
kernel and the underlying pixel group are multiplied element-wise and
the products are summed to give one output value.
🞭 Pooling: Pooling is an operation used to reduce the dimensions of an
image. A pooling kernel slides across the image and each pixel group
is reduced to a single value for further processing, thus reducing the
image size, e.g. Max Pooling (keep the largest value) or Average
Pooling (take the mean).
🞭 Non-Linear Activations: Non-linear activations introduce
non-linearity to the neural network. Without them, a stack of
convolutions would collapse into a single linear operation, so they
are what makes stacking multiple convolution and pooling blocks to
increase model depth worthwhile.
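The three operations above can be sketched without any library. This is a minimal illustration under stated assumptions: the kernel here is a hand-picked vertical-edge filter rather than a learned one, there is no padding, the stride is 1, ReLU stands in for "a non-linear activation", and the function names `conv2d`, `relu` and `max_pool` are made up for the example. As in most deep learning libraries, the kernel is not flipped, so strictly speaking this computes a cross-correlation.

```python
def conv2d(image, kernel):
    """Slide the kernel over the image (no padding, stride 1).

    At each position, multiply element-wise and sum the products.
    """
    kh, kw = len(kernel), len(kernel[0])
    out_h = len(image) - kh + 1
    out_w = len(image[0]) - kw + 1
    return [
        [
            sum(image[i + a][j + b] * kernel[a][b]
                for a in range(kh) for b in range(kw))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


def relu(feature_map):
    """Non-linear activation: negative responses become 0."""
    return [[max(0, v) for v in row] for row in feature_map]


def max_pool(feature_map, size=2):
    """Non-overlapping size x size max pooling (stride equals size)."""
    out_h = len(feature_map) // size
    out_w = len(feature_map[0]) // size
    return [
        [
            max(feature_map[i * size + a][j * size + b]
                for a in range(size) for b in range(size))
            for j in range(out_w)
        ]
        for i in range(out_h)
    ]


# 6x6 image: a bright vertical stripe on a dark background.
image = [[0, 0, 255, 255, 0, 0] for _ in range(6)]

# Hand-picked kernel that responds to dark-to-bright vertical edges.
kernel = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]

features = conv2d(image, kernel)   # 4x4 feature map
activated = relu(features)         # negative edge responses removed
pooled = max_pool(activated)       # 2x2 summary

print(features[0])   # [765, 765, -765, -765]
print(activated[0])  # [765, 765, 0, 0]
print(pooled)        # [[765, 0], [765, 0]]
```

The output shrinks at each stage (6x6 to 4x4 to 2x2) while keeping the information that a left-hand edge is present, which is the size reduction the pooling bullet describes.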
FACE AND PERSON RECOGNITION

🞭 Facial recognition is a subtask of object
detection in which the primary object being
detected is the human face.
🞭 While similar to object detection as a task,
where features are detected and localized,
facial recognition performs not only
detection, but also recognition of the
detected face, that is, deciding whose
face it is.
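One common way to picture the recognition step is as a comparison between a numeric description ("embedding") of the detected face and embeddings of known people. The sketch below assumes that approach; the slides do not say how recognition is done. The names, the three-dimensional vectors, the `identify` helper and the 0.8 threshold are all hypothetical, and a real system would obtain much longer embeddings from a trained network.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors (1.0 means same direction)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


def identify(face_embedding, gallery, threshold=0.8):
    """Return the best-matching name, or None if nothing is similar enough."""
    best_name, best_score = None, threshold
    for name, known in gallery.items():
        score = cosine_similarity(face_embedding, known)
        if score >= best_score:
            best_name, best_score = name, score
    return best_name


# Hypothetical embeddings of two enrolled people.
gallery = {
    "alice": [0.9, 0.1, 0.0],
    "bob": [0.0, 0.2, 0.9],
}

print(identify([0.8, 0.2, 0.1], gallery))  # close to alice's vector
print(identify([0.5, 0.5, 0.5], gallery))  # not close to anyone
```

Detection answers "is there a face, and where?"; the comparison above is the extra step that answers "whose face is it?", with the threshold controlling how readily an unknown face is rejected.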
IMAGE RESTORATION

🞭 Image restoration refers to the restoration
or reconstruction of old, faded hard-copy
images that were captured or stored
improperly, leading to a loss of image
quality.
FEATURE MATCHING
🞭 The applications of feature matching are found in computer vision
tasks like object identification and camera calibration. The task of
feature matching is generally performed in the following order:
🞭 Detection of features: Detection of regions of interest is generally
performed by image processing algorithms such as Harris Corner
Detection.
🞭 Formation of local descriptors: After features are detected, the
region surrounding each keypoint is captured and the local
descriptors of these regions of interest are obtained. A local
descriptor is the representation of a point’s local neighborhood and
thus can be helpful for feature matching.
🞭 Feature matching: The features and their local descriptors are
matched in the corresponding images to complete the feature
matching step.
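The final matching step can be sketched as a brute-force nearest-neighbour search over descriptors. This is an illustration under assumptions: the descriptors are made-up four-element vectors standing in for whatever a real detector would produce, and the filter that drops ambiguous matches is the widely used ratio test, which is a swapped-in choice and not something the slides prescribe. The names `squared_distance` and `match_descriptors` are hypothetical.

```python
def squared_distance(d1, d2):
    """Squared Euclidean distance between two descriptors."""
    return sum((x - y) ** 2 for x, y in zip(d1, d2))


def match_descriptors(desc_a, desc_b, ratio=0.75):
    """Return (index_in_a, index_in_b) pairs for unambiguous nearest neighbours.

    A match is kept only if the best candidate is clearly closer than the
    second best (ratio test, applied here to squared distances).
    """
    matches = []
    for i, d in enumerate(desc_a):
        ranked = sorted(range(len(desc_b)),
                        key=lambda j: squared_distance(d, desc_b[j]))
        best = ranked[0]
        if len(ranked) > 1:
            best_dist = squared_distance(d, desc_b[best])
            second_dist = squared_distance(d, desc_b[ranked[1]])
            if best_dist > (ratio ** 2) * second_dist:
                continue  # ambiguous: two candidates are about equally close
        matches.append((i, best))
    return matches


# Hypothetical descriptors for keypoints found in two images.
desc_a = [
    [1.0, 0.0, 0.0, 0.0],
    [0.0, 1.0, 0.0, 0.0],
    [0.5, 0.5, 0.5, 0.5],   # resembles several candidates equally
]
desc_b = [
    [0.0, 0.9, 0.1, 0.0],
    [0.9, 0.0, 0.0, 0.1],
    [0.0, 0.0, 1.0, 0.0],
]

print(match_descriptors(desc_a, desc_b))  # [(0, 1), (1, 0)]
```

The third descriptor in `desc_a` is left unmatched because its two nearest candidates are equally far away, which is the kind of unreliable pairing the filter is meant to drop.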
Applications of Computer Vision
1. Healthcare
● Medical Imaging Analysis: Detecting diseases in X-rays, MRIs, and CT scans (e.g., tumors,
fractures).

● Surgical Assistance: Real-time guidance during surgery using visual data.

● Skin Cancer Detection: Using image classification to identify malignancies from skin images.
Figure: A chest X-ray of a pneumothorax case. The AI overlays a
red-yellow heatmap on the air-pocket region, which corresponds to
the physician-confirmed abnormality.
2. Autonomous Vehicles
🞭 Self-Driving Cars: Computer vision is used
for object detection, lane detection, traffic
sign recognition, pedestrian tracking, and
obstacle avoidance.
🞭 Drone Navigation: Drones use CV to
detect and avoid obstacles in real-time
while navigating.
🞭 Image: counting cars on a road.
3. Retail and E-commerce
🞭 Application: Enhances the shopping
experience through image-based search,
recommendation systems, and even
checkout-less stores.
🞭 Example: Amazon Go stores use
computer vision to track what customers
pick up, allowing them to leave without
manually checking out.
Computer Vision Libraries and Tools
🞭 1. OpenCV (Open Source Computer Vision
Library)
🞭 Description: OpenCV is one of the most popular
and comprehensive open-source libraries for
computer vision tasks. It provides tools for image
processing, object detection, face recognition, and
real-time video processing.
🞭 Languages Supported: C++, Python, Java, and
others.
🞭 Key Features: Image filtering, feature detection,
image transformations, machine learning
integration, real-time video analysis.
🞭 2. TensorFlow & TensorFlow.js
🞭 Description: TensorFlow, developed by
Google, is a popular machine learning
framework that also has strong support for
computer vision tasks. TensorFlow.js brings
machine learning to JavaScript for real-time
computer vision in the browser.
🞭 Languages Supported: Python, JavaScript.
🞭 Key Features: Object detection, image
segmentation, neural networks for visual
tasks, support for deep learning.
🞭 3. PyTorch
🞭 Description: PyTorch is a deep learning
library that is widely used for computer vision
tasks. It’s known for its flexibility, ease of use,
and support for dynamic computation
graphs.
🞭 Languages Supported: Python.
🞭 Key Features: Deep learning for vision tasks
like image classification, segmentation, and
object detection. Popular models such as
ResNet are available pretrained through the
companion torchvision package.
🞭 4. Keras
🞭 Description: Keras is a high-level neural
networks API that runs on top of TensorFlow
(Keras 3 can also run on JAX and PyTorch),
making it easier to develop deep learning
models for computer vision tasks.
🞭 Languages Supported: Python.
🞭 Key Features: Simplified implementation of
deep learning models for image
classification, object detection, and
segmentation.
Ethical Considerations in Computer Vision
🞭 1. Privacy Invasion
🞭 Description: CV often captures images/video in public/private spaces without consent.
🞭 Example: CCTV systems in public spaces or facial recognition in retail stores.
🞭 2. Bias and Discrimination
🞭 Description: Training datasets may lack diversity, leading to biased outputs.
🞭 Example: Face recognition works better on lighter skin tones than darker ones.
🞭 3. Consent and Data Use
🞭 Description: Individuals are often unaware their images are being used or analyzed.
🞭 Example: Using social media photos to train facial recognition algorithms.
🞭 4. Misuse and Dual-Use
🞭 Description: CV can be used for unethical or harmful purposes.
🞭 Example: Military drones using CV for autonomous targeting.
🞭 5. Lack of Transparency
🞭 Description: CV systems (especially deep learning) are often “black boxes.”
🞭 Example: Inability to explain why an algorithm flagged a person as suspicious.
🞭 6. Accountability
🞭 Description: It’s unclear who is liable when CV systems fail or cause harm.
🞭 Example: Who is responsible if a self-driving car hits a pedestrian?
🞭 7. Deepfakes & Misinformation
🞭 Description: CV enables creation of fake videos/images that can deceive and manipulate.
🞭 Example: Political deepfakes spreading misinformation during elections.
