[go: up one dir, main page]

0% found this document useful (0 votes)
4 views14 pages

Computer Vision

The document discusses various applications of deep learning in feature extraction, including audio classification using CNNs, image classification, and object detection. It highlights the significance of the ImageNet dataset and advancements in computer vision techniques like YOLO and RCNN. Additionally, it covers edge detection methods and their role in pre-processing for image analysis.

Uploaded by

yogini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
4 views14 pages

Computer Vision

The document discusses various applications of deep learning in feature extraction, including audio classification using CNNs, image classification, and object detection. It highlights the significance of the ImageNet dataset and advancements in computer vision techniques like YOLO and RCNN. Additionally, it covers edge detection methods and their role in pre-processing for image analysis.

Uploaded by

yogini
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 14

Deep learning ~ feature extraction

Example no 1 : Signals
Example 2 : audio classification
By convert audio streams into spectrogram images, which provide visual representations of spectrums
of frequencies as they vary over time, then use convolutional neural networks (CNNs) to classify the
spectrograms.
Example no 3 : Images
Example 4 : Artificial Noses
a) The algorithm : TFA-CNN

Gas type recognition and accurate concentration


prediction are achieved using the innovative
pattern recognition method of time–frequency
attention convolutional neural network.

b) The data format : Hasse matrix

Characterisation of electronic nose data has been


studied by means of a novel chemometric approach
based on the partial ordering technique and the Hasse
matrix.
Computer Vision

By the 2000s, the focus of Computer Vision has been shifted


to much more complex topics, including:

● Object identification
● Facial recognition
● Image Segmentation
● Image Classification

The year 2010 saw the birth of the ImageNet dataset with
millions of labeled images freely available for research.
This led to the formation of the AlexNet architecture two
years later— making it one of the biggest breakthroughs
in Computer Vision, cited over 82K times.
Image classification

Image classification is one of the most studied topics ever since the ImageNet dataset was released in 2010.

Being the most popular computer vision task taken up by both beginners and experts, image classification as a problem
statement is quite simple.

Given a group of images, the task is to classify them into a set of predefined classes using solely a set of sample images
that have already been classified.

As opposed to complex topics like object detection and image segmentation, which have to localize (or give positions for)
the features they detect, image classification deals with processing the entire image as a whole and assigning a specific
label to it.

This popular domain of Computer Vision has been studied widely both with the use of traditional image processing
algorithms like watershed algorithms, clustering-based segmentation and with the use of popular modern-day deep learning
architectures like PSPNet, FPN, U-Net, SegNet, etc.
Object detection

Object detection, as the name suggests, refers to detection and localization of objects using bounding boxes.

Object detection looks for class-specific details in an image or a video and identifies them whenever they appear. These
classes can be cars, animals, humans, or anything on which the detection model has been trained.

Previously methods of object detection used Haar Features, SIFT, and HOG Features to detect features in an image and
classify them based on classical machine learning approaches.This process, other than being time-consuming and largely
inaccurate, has severe limitations on the number of objects that can be detected.

As such, Deep Learning models like YOLO, RCNN, SSD that use millions of parameters to break through these limitations
are popularly employed for this task.

Object detection is often accompanied by Object Recognition, also known as Object Classification.
Edge detection

Edge detection is the task of detecting boundaries in objects.

It is algorithmically performed with the help of mathematical methods that help detect sharp changes or discontinuities in the brightness
of the image. Often used as a data pre-processing step for many tasks, edge detection is primarily done by traditional image
processing-based algorithms like Canny Edge detection and by convolutions with specially designed edge detection filters.

Furthermore, edges in an image give us paramount information about the image contents, resulting in all deep learning methods
performing edge detection internally for the capture of global low-level features with the help of learnable kernels.

You might also like