[go: up one dir, main page]

0% found this document useful (0 votes)
77 views33 pages

ML Unit-2 - RTU

Unsupervised Machine Learning

Uploaded by

vishakhasahu0001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
77 views33 pages

ML Unit-2 - RTU

Unsupervised Machine Learning

Uploaded by

vishakhasahu0001
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 33

UNSUPERVISED MACHINE LEARNING

As the name suggests, unsupervised learning is a machine learning technique in which


models are not supervised using training dataset.
• Instead, models itself find the hidden patterns and insights from the given data. It can be
compared to learning which takes place in the human brain while learning new things.
• It can be defined as:
• Unsupervised learning is a type of machine learning in which models are trained using
unlabeled dataset and are allowed to act on that data without any supervision.
EXAMPLE:
• Suppose the unsupervised learning algorithm is given an input dataset containing images
of different types of cats and dogs.
• The algorithm is never trained upon the given dataset, which means it does not have any
idea about the features of the dataset.
• The task of the unsupervised learning algorithm is to identify the image features on their
own.
• Unsupervised learning algorithm will perform this task by clustering the image dataset into
the groups according to similarities between images.

Why use Unsupervised Learning?


• Below are some main reasons which describe the importance of Unsupervised Learning:
• Unsupervised learning is helpful for finding useful insights from the data.
• Unsupervised learning is much similar as a human learns to think by their own experiences,
which makes it closer to the real AI.
• Unsupervised learning works on unlabeled and uncategorized data which make
unsupervised learning more important.
• In real-world, we do not always have input data with the corresponding output so to solve
such cases, we need unsupervised learning.
Working of Unsupervised Learning:

Here, we have taken an unlabeled input data, which means it is not categorized and
corresponding outputs are also not given. Now, this unlabeled input data is fed to the machine
learning model in order to train it. Firstly, it will interpret the raw data to find the hidden
patterns from the data and then will apply suitable algorithms such as k-means clustering,
Decision tree, etc.
Once it applies the suitable algorithm, the algorithm divides the data objects into groups
according to the similarities and difference between the objects.
Types of Unsupervised Learning Algorithm:
The unsupervised learning algorithm can be further categorized into two types of problems:

Clustering:
Clustering is an important concept when it comes to unsupervised learning. It mainly deals
with finding a structure or pattern in a collection of uncategorized data.
Clustering is a method of grouping the objects into clusters such that objects with most
similarities remains into a group and has less or no similarities with the objects of another
group. Cluster analysis finds the commonalities between the data objects and categorizes them
as per the presence and absence of those commonalities.
Association:

An association rule is an unsupervised learning method which is used for finding the
relationships between variables in the large database.
It determines the set of items that occurs together in the dataset.
Association rule makes marketing strategy more effective.
Such as people who buy X item (suppose a bread) are also tend to purchase Y (Butter/Jam)
item. A typical example of Association rule is Market Basket Analysis.
Clustering v/s Association
In machine learning, both clustering and association algorithms—also referred to as
unsupervised learning algorithms—are used to make predictions by finding patterns in data
and using unlabelled datasets. However, they possess some differences:
Clustering Association
Using clustering techniques, groups are Establishes relationships using co-
determined considering similarities. occurrences as a basis
Clustering is the process of arranging a The goal of association rules is to identify
collection of items so that those within the relationships between items in huge
same group resemble each other more than commercial datasets.
items outside the group.
It's the process of breaking down a large It is a method of determining how closely
number of objects into smaller groupings that any two elements are related.
have similar qualities.
By identifying patterns among the data It is used to identify the relationships
objects, cluster analysis groups them into between numerous variables in the large
groups based on whether or not such patterns database.
exist.
Find coherent subsets of items given a large Determine which items in a basket predict
number of items (they might be text another item in the basket given a number of
documents, pictures, people, you name it). baskets (which could be text documents, real
grocery baskets, or other semi-structured
objects).

Advantages:
• Unsupervised learning is used for more complex tasks as compared to supervised learning
because, in unsupervised learning, we don't have labeled input data.
• Unsupervised learning is preferable as it is easy to get unlabeled data in comparison to
labeled data.
Disadvantages:
• Unsupervised learning is intrinsically more difficult than supervised learning as it does not
have corresponding output.
• The result of the unsupervised learning algorithm might be less accurate as input data is not
labeled, and algorithms do not know the exact output in advance.
Applications:
Anomaly detection- Through the use of unsupervised learning, anomalies such as fraud,
intrusions, or system malfunctions can be found in data by spotting odd patterns or departures
from expected behaviour.
Scientific discovery- This type of learning may uncover patterns and relationships that are
buried in data, which can result in new theories and understandings across a range of scientific
domains.

Systems for making recommendations- Unsupervised learning can spot trends and parallels in
user behaviour and tastes to suggest goods, tunes, or films that suit their interests.

Customer segmentation- Businesses can more effectively focus marketing efforts and enhance
customer service by using unsupervised learning to recognize groups of customers having
similar features.

Picture analysis- Tasks like object identification, picture retrieval, and image classification can
be made easier by unsupervised learning, which groups images according to their content.

You might also like