What is
Supervised
Learning?
Supervised Learning
There are three main types of machine learning:
1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning
In supervised learning, the model is trained on
“labeled” data, where the input and output data
are known. The goal is to use the labeled dataset
to learn so that it can predict the output for new
inputs.
Categories of
Supervised Learning
Classification Regression
A classification A regression
problem is where problem is where
the output variable the output
is a category, such variable is a real
as “green” or “red” , value, such as
“disease” or “no “amount” or
disease”. “weight”.
Types of Supervised
Learning Algorithms
Linear Regression
It is used to identify the relationship between a
dependent variable and one or more
independent variables and is typically used to
make predictions about future outcomes. E.g.
predicting the price of a house based on its size
and location.
Logistic Regression
It is used for binary classification problems, where
the output variable is categorical with two
possible values. For example, predicting whether
a customer will buy a product or not based on
their demographics and purchase history.
Neural Networks
This algorithm is designed to cluster raw input,
recognize patterns, or interpret sensory data.
Despite their multiple advantages, neural
networks require significant computational
resources. It can get complicated to fit a neural
network when there are thousands of
observations.
Support Vector Machines
A support vector machine is typically used for
classification problems by constructing a
hyperplane where the distance between two
classes of data points is at its maximum. This
hyperplane is known as the decision boundary,
separating the classes of data points (e.g.,
oranges vs. apples) on either side of the plane.
K-Nearest Neighbor
K-nearest neighbor, also known as the KNN
algorithm, is a simple classification algorithm that
assigns a new data point to the class that is most
common among its k-nearest neighbors in the
training set. KNN is typically used for
recommendation engines and image recognition.
Random Forest
Random forest is another flexible supervised
machine learning algorithm used for both
classification and regression purposes. It is an
ensemble learning algorithm that combines
multiple decision trees to improve accuracy and
reduce overfitting.
Naive Bayes
Naive Bayes assumes that the features (input
variables) are conditionally independent of each
other given the class label. This is a "naive"
assumption because in reality, features may be
correlated with each other. The three main types
of Naive Bayes algorithms: Gaussian Naive Bayes,
Multinomial Naive Bayes and Bernoulli Naive
Bayes
Challenges of
Supervised Learning
1. Supervised learning models require certain
expertise levels to structure accurately.
2. Training supervised learning models can be
very time intensive.
3. Datasets can have a higher likelihood of human
error, resulting in algorithms learning incorrectly.
4. Unlike unsupervised learning models,
supervised learning cannot cluster or classify data
on its own.
Follow #DataRanch on LinkedIn
for more...
Follow #DataRanch on LinkedIn
for more...
info@dataranch.org
linkedin.com/company/dataranch