Computer Vision Mid

The document discusses computer vision and its applications. Computer vision aims to replicate aspects of human vision using algorithms, but has limitations compared to flexible human vision. Object detection and recognition are described, involving identifying and localizing objects in images using deep learning models and extracting features. Common algorithms and techniques are also outlined.

Uploaded by

Huzair Nadeem


COMPUTER VISION & HUMAN VISION:

Computer Vision:
Computer vision is a field of study focused on enabling computers to interpret and understand visual information from the real world. It involves the development of algorithms and techniques to extract meaningful information from images or videos.

Human Vision:
Human vision refers to the visual perception and processing capabilities of the human eye and brain. It is incredibly complex and sophisticated, allowing us to perceive depth, color, motion, and shape effortlessly. Our visual system consists of the eyes, which capture light and form images, and the brain, which processes these images to create our visual experience.

While computer vision aims to replicate certain aspects of human vision, there are significant differences between the two:

Capabilities:
Human vision is highly adaptive and versatile, capable of recognizing a vast range of objects and scenes in various conditions. Computer vision systems, while powerful, are often specialized for specific tasks and may struggle to generalize across diverse scenarios.
Processing Speed:
Human vision operates at remarkable speeds, allowing us to process complex visual scenes in real time effortlessly. Computer vision systems depend on computational resources and may be slower.
Accuracy and Reliability:
While computer vision systems can achieve impressive accuracy in specific tasks, they are still prone to errors, especially in challenging conditions such as poor lighting or occlusions.
Flexibility and Adaptability:
Human vision can adapt to new environments and tasks rapidly, often without explicit training. Computer vision systems typically require extensive training data and optimization to perform well on specific tasks, making them less flexible in some regards.

OBJECT DETECTION/RECOGNITION:

Object Detection:
Object detection involves identifying and localizing objects within an image or a video frame. The goal is not only to recognize what objects are present but also to locate them precisely with bounding boxes. Here's how object detection typically works:
Input Image/Frame:
The process begins with an input image or a frame from a video stream.
Feature Extraction:
Features are extracted from the input image using techniques like convolutional neural networks (CNNs). These features capture important patterns and details from the image.
Object Localization:
The algorithm predicts bounding boxes around objects of interest. Techniques like sliding window, region proposal networks (RPN), or anchor boxes are commonly used for this purpose.
Object Classification:
Each localized object is assigned a class label. This is typically done using classification models, often built on top of the same CNN architecture used for feature extraction.

Object Recognition:
Object recognition, also known as object classification, is the task of identifying objects within an image or a video frame without necessarily localizing them. It focuses on determining what objects are present in the scene. Here's how object recognition typically works:
Input Image/Frame:
Similar to object detection, the process begins with an input image or a frame from a video stream.
Feature Extraction:
Features are extracted from the input image using techniques like CNNs. These features capture important patterns and details relevant to object recognition.
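The sliding-window technique named under Object Localization above can be illustrated with a minimal sketch. The function name, the (x_min, y_min, x_max, y_max) box convention, and the window and stride values are assumptions made for illustration; a real detector would also score each window with a classifier and usually repeat the scan at several scales.

```python
from typing import Iterator, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


def sliding_windows(image_w: int, image_h: int,
                    win_w: int, win_h: int, stride: int) -> Iterator[Box]:
    """Yield candidate boxes by sliding a fixed-size window over the image."""
    for y in range(0, image_h - win_h + 1, stride):
        for x in range(0, image_w - win_w + 1, stride):
            yield (x, y, x + win_w, y + win_h)


# Hypothetical 8x6 image scanned with a 4x4 window and a stride of 2.
boxes = list(sliding_windows(image_w=8, image_h=6, win_w=4, win_h=4, stride=2))
print(len(boxes), boxes[0], boxes[-1])
```

Each yielded box is a candidate region that would be passed to the classification step; a smaller stride gives denser coverage at a higher computational cost.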

DIFFERENCE BETWEEN COMPUTER VISION & HUMAN VISION:

Aspect | Computer Vision (CV) | Human Vision (HV)
Capabilities | Specialized for specific tasks; can excel in certain domains | Highly adaptable and versatile; capable of generalization
Processing Speed | Requires computational resources; may be slower than HV | Operates at remarkable speeds; processes complex scenes in real time
Accuracy & Reliability | Achieves high accuracy in specific tasks; prone to errors | Reliable and robust; handles diverse environments with ease
Flexibility & Adaptability | Requires extensive training data; less flexible | Adapts rapidly to new environments and tasks without explicit training
Interaction | Can assist humans in various tasks (e.g., image analysis) | Influences the design and development of CV algorithms and interfaces
Collaboration | Enables advancements in areas like medical diagnosis | Provides insights into human perception for improved CV systems

APPLICATIONS OF COMPUTER VISION:

Autonomous Vehicles:
Computer vision helps vehicles perceive their surroundings, detect obstacles, recognize traffic signs, and navigate safely.
Surveillance and Security:
It is used for monitoring public spaces, identifying suspicious activities, facial recognition, and tracking individuals.
Medical Imaging:
Computer vision aids in diagnosing diseases from medical images (X-rays, MRI scans, CT scans), assisting in surgery, and analyzing cellular structures.
Augmented Reality (AR):
AR applications overlay digital information onto the real world, enhancing user experiences in gaming, education, navigation, and interior design.
Retail and E-commerce:
Computer vision is used for inventory management, shelf stocking, product recommendation, and visual search to improve the shopping experience.
Industrial Automation:
It assists in quality control, defect detection, object tracking, and robotic assembly in manufacturing processes.
Agriculture:
Computer vision is applied in crop monitoring, yield estimation, disease detection in plants, and precision farming techniques.
Gesture Recognition:
It enables devices to interpret human gestures, facilitating natural interaction in virtual environments, gaming consoles, and smart homes.

ALGORITHMS & TECHNIQUES OF OBJECT DETECTION/RECOGNITION:

Deep Learning-based Approaches:

Convolutional Neural Networks (CNNs):
CNNs are widely used for object detection due to their ability to learn hierarchical features directly from raw pixel data.
Region-based CNNs:
These approaches, such as R-CNN and Faster R-CNN, use region proposal algorithms to identify potential object locations before classifying and refining them.
Single Shot Detectors (SSDs):
SSDs directly predict object bounding boxes and class probabilities for multiple predefined aspect ratios and scales in a single pass through the network.

Feature-based Approaches:
Histogram of Oriented Gradients (HOG):
HOG extracts feature descriptors based on gradient orientations in localized portions of the image.
Speeded-Up Robust Features (SURF):
Algorithms such as SURF detect and describe local features invariant to scale and rotation, useful for object recognition and matching.

Hybrid Approaches:
Feature Pyramid Networks (FPN):
FPNs combine low-resolution, semantically strong features with high-resolution, semantically weak features to improve object detection accuracy at different scales.
Cascade Classifiers:
These employ a series of classifiers, each focusing on a specific aspect of the object, to improve detection accuracy while minimizing false positives.
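The gradient-orientation idea behind HOG, mentioned above, can be sketched for a single image cell. This is a simplified illustration under stated assumptions, not a full HOG descriptor: the function name, the 9-bin choice, the central-difference gradients, and the toy cell are all assumptions, and the bin interpolation and block normalization that full HOG implementations typically apply are omitted.

```python
import math
from typing import List


def orientation_histogram(cell: List[List[float]], n_bins: int = 9) -> List[float]:
    """Magnitude-weighted histogram of unsigned gradient orientations (0-180 degrees)."""
    height, width = len(cell), len(cell[0])
    hist = [0.0] * n_bins
    bin_width = 180.0 / n_bins
    # Central differences are only defined for interior pixels.
    for y in range(1, height - 1):
        for x in range(1, width - 1):
            gx = cell[y][x + 1] - cell[y][x - 1]
            gy = cell[y + 1][x] - cell[y - 1][x]
            magnitude = math.hypot(gx, gy)
            angle = math.degrees(math.atan2(gy, gx)) % 180.0
            # The final modulo guards against an angle that rounds to exactly 180.
            hist[int(angle // bin_width) % n_bins] += magnitude
    return hist


# Toy 4x4 cell with a vertical edge: dark left half, bright right half.
cell = [[0, 0, 1, 1] for _ in range(4)]
hist = orientation_histogram(cell)
print(hist)
```

For this toy cell every interior gradient points horizontally, so all of the weight falls into the first orientation bin.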

Data Augmentation and Preprocessing:

Techniques such as random cropping, rotation, scaling, and flipping are
used to augment the training data, making the model more robust to
variations in object appearance and background clutter.
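Two of the augmentations listed above, flipping and random cropping, can be sketched on an image stored as a nested list. The function names, the toy image, and the (x_min, y_min, x_max, y_max) box convention are assumptions for illustration; practical pipelines usually rely on an image library rather than hand-written loops.

```python
import random
from typing import List, Tuple

Image = List[List[int]]
# Assumed box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[int, int, int, int]


def horizontal_flip(image: Image) -> Image:
    """Mirror the image left to right."""
    return [row[::-1] for row in image]


def flip_box(box: Box, image_w: int) -> Box:
    """Mirror a bounding box so it still covers the same object after the flip."""
    x_min, y_min, x_max, y_max = box
    return (image_w - x_max, y_min, image_w - x_min, y_max)


def random_crop(image: Image, crop_h: int, crop_w: int, rng: random.Random) -> Image:
    """Cut a crop_h x crop_w patch at a random position inside the image."""
    height, width = len(image), len(image[0])
    top = rng.randint(0, height - crop_h)
    left = rng.randint(0, width - crop_w)
    return [row[left:left + crop_w] for row in image[top:top + crop_h]]


image = [[1, 2, 3],
         [4, 5, 6],
         [7, 8, 9]]
flipped = horizontal_flip(image)
flipped_box = flip_box((0, 0, 1, 2), image_w=3)
crop = random_crop(image, crop_h=2, crop_w=2, rng=random.Random(0))
print(flipped, flipped_box, crop)
```

For detection data, any geometric change to the image must be applied to its bounding boxes as well, which is why the flip comes with a matching box transform.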

Post-processing:
Non-maximum suppression (NMS) is a common technique used to
remove duplicate or highly overlapping bounding boxes by retaining only
the most confident predictions.
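A minimal sketch of greedy non-maximum suppression, as described above, is shown here. The (x_min, y_min, x_max, y_max) box convention, the 0.5 overlap threshold, and the sample boxes and scores are assumptions for illustration; overlap is measured with intersection over union (IoU).

```python
from typing import List, Sequence, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    inter_w = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    inter_h = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = inter_w * inter_h
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0


def nms(boxes: Sequence[Box], scores: Sequence[float],
        iou_threshold: float = 0.5) -> List[int]:
    """Return indices of the boxes to keep, most confident first."""
    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep: List[int] = []
    for i in order:
        # Keep a box only if it does not overlap too much with any kept box.
        if all(iou(boxes[i], boxes[j]) <= iou_threshold for j in keep):
            keep.append(i)
    return keep


# Hypothetical detections: the first two boxes overlap heavily.
boxes = [(0, 0, 10, 10), (1, 1, 11, 11), (20, 20, 30, 30)]
scores = [0.9, 0.8, 0.7]
kept = nms(boxes, scores)
print(kept)
```

In this example the second box overlaps the first with an IoU of 81/119 (about 0.68), so it is suppressed, while the distant third box is kept.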
