[go: up one dir, main page]

0% found this document useful (0 votes)
80 views10 pages

Chapter Five

The document discusses different types of machine learning including supervised learning, unsupervised learning, and reinforcement learning. It describes the basic concepts and components of a learning agent and provides examples to illustrate machine learning classification and techniques like clustering and association.

Uploaded by

junedijoasli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
80 views10 pages

Chapter Five

The document discusses different types of machine learning including supervised learning, unsupervised learning, and reinforcement learning. It describes the basic concepts and components of a learning agent and provides examples to illustrate machine learning classification and techniques like clustering and association.

Uploaded by

junedijoasli
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 10

Chapter Five

Learning Agents
5.1. Learning agents and its components

An agent is learning if it improves its performance on future tasks after making observations
about the world. Why would we want an agent to learn? If the design of the agent can be
improved, why wouldn’t the designers just program in that improvement to begin with? There
are three main reasons. First, the designers cannot anticipate all possible situations that the agent
might find itself in. For example, a robot designed to navigate mazes must learn the layout of
each new maze it encounters. Second, the designers cannot anticipate all changes over time; a
program designed to predict tomorrow’s stock market prices must learn to adapt when conditions
change from boom to bust. Third, sometimes human programmers have no idea how to program
a solution themselves. For example, most people are good at recognizing the faces of family
members, but even the best programmers are unable to program a computer to accomplish that
task, except by using learning algorithms.

A learning agent in AI is the type of agent that can learn from its past experiences or it has
learning capabilities. It starts to act with basic knowledge and then is able to act and adapt
automatically through learning. A learning agent has mainly four conceptual components,
which are:

1. Learning element: It is responsible for making improvements by learning from the


environment.
2. Critic: The learning element takes feedback from critics which describes how well the
agent is doing with respect to a fixed performance standard.
3. Performance element: It is responsible for selecting external action.
4. Problem Generator: This component is responsible for suggesting actions that will lead to
new and informative experiences.

You can revise the detail of the learning agent components in chapter 2.
Figure 5. 1 A general model of learning agent

5.2. Machine Learning?


In the real world, we are surrounded by humans who can learn everything from their experiences
with their learning capability, and we have computers or machines which work on our
instructions. But can a machine also learn from experiences or past data like a human does? So
here comes the role of Machine Learning.

Machine learning is a subfield of artificial intelligence (AI) that focuses on developing


algorithms and statistical models that enable computers to learn patterns and make predictions or
decisions from data without being explicitly programmed to do so.

A machine learning algorithm is an algorithm that is able to learn from data. But what do we
mean by learning? In 1997 Tom Mitchell gave a definition for Machine Learning: ''Machine
Learning is a computer program, that is said to learn from experience E with respect to some
class of tasks T and performance measure P, if its performance at tasks in T, as measured by P,
improves with experience E.''
5.3. How does Machine Learning work
A machine learning system builds prediction models, learns from previous data, and predicts the output of
new data whenever it receives it. The amount of data helps to build a better model that accurately predicts
the output, which in turn affects the accuracy of the predicted output. Suppose we have a complex
problem, where we need to perform some predictions, so instead of writing a code for it, we just
need to feed the data to generic algorithms, and with the help of these algorithms, machine builds
the logic as per the data and predict the output.

The below block diagram explains the working of Machine Learning algorithm:

5.4. Classification of Machine Learning


Most machine learning algorithms can be divided into the categories of:

1. Supervised learning
2. Unsupervised learning
3. Reinforcement learning
1. Supervised Learning:
Supervised learning is a type of machine learning in which the algorithm is trained on the labeled
dataset. It learns to map input features to targets based on labeled training data. In supervised
learning, the algorithm is provided with input features and corresponding output labels, and it
learns to generalize from this data to make predictions on new, unseen data.
Let's understand supervised learning with an example. Suppose we have an input dataset of cats
and dog images. So, first, we will provide the training to the machine to understand the images,
such as the shape & size of the tail of cat and dog, Shape of eyes, color, height (dogs are
taller, cats are smaller), etc. After completion of training, we input the picture of a cat and ask
the machine to identify the object and predict the output. Now, the machine is well trained, so it
will check all the features of the object, such as height, shape, color, eyes, ears, tail, etc., and find
that it's a cat. So, it will put it in the Cat category. This is the process of how the machine
identifies the objects in Supervised Learning.

The main goal of the supervised learning technique is to map the input variable(x) with the
output variable(y). Some real-world applications of supervised learning are Risk Assessment,
Fraud Detection, Spam filtering, etc.
Supervised machine learning can be classified into two types of problems, which are:

 Classification
 Regression

 Classification:
Classification is a type of supervised learning where the algorithm learns to assign input data to a
specific category or class based on input features. The output labels in classification are discrete
values. Classification algorithms can be binary, where the output is one of two possible classes,
or multiclass, where the output can be one of several classes.
Examples of supervised learning algorithms include:
 Linear Regression
 Logistic Regression
 Decision Trees
 Support Vector Machines (SVM)
 Neural Networks
2. Unsupervised Machine Learning:
Unsupervised learning is different from the supervised learning technique; as its name suggests,
there is no need for supervision. It means, in unsupervised machine learning, the machine is
trained using the unlabeled dataset, and the machine predicts the output without any supervision.
In unsupervised learning, the models are trained with the data that is neither classified nor
labelled, and the model acts on that data without any supervision.
The main aim of the unsupervised learning algorithm is to group or categories the unsorted
dataset according to the similarities, patterns, and differences. Machines are instructed to
find the hidden patterns from the input dataset.
Let's take an example to understand it more preciously; suppose there is a basket of fruit images,
and we input it into the machine learning model. The images are totally unknown to the model,
and the task of the machine is to find the patterns and categories of the objects.

So, now the machine will discover its patterns and differences, such as color difference, shape
difference, and predict the output when it is tested with the test dataset.

Unsupervised Learning can be further classified into two types, which are:

 Clustering
 Association
 Clustering
The clustering technique is used when we want to find the inherent groups from the data. It is a
way to group the objects into a cluster such that the objects with the most similarities remain in
one group and have fewer or no similarities with the objects of other groups. An example of the
clustering algorithm is grouping the customers by their purchasing behavior.

Some of the popular clustering algorithms are:

 K-Means Clustering algorithm


 Mean-shift algorithm
 Principal Component Analysis
 Independent Component Analysis
 Association
Association rule learning is an unsupervised learning technique, which finds interesting relations among
variables within a large dataset. The main aim of this learning algorithm is to find the dependency of one
data item on another data item and map those variables accordingly so that it can generate maximum
profit. This algorithm is mainly applied in Market Basket analysis, Web usage mining, continuous
production, etc.

Some popular algorithms of Association rule learning are:

 Apriori Algorithm,
 Eclat,
 FP-growth algorithm.
3. Reinforcement Machine Learning
Reinforcement learning works on a feedback-based process, in which an AI agent (A
software component) automatically explore its surrounding by hitting & trail, taking
action, learning from experiences, and improving its performance. Agent gets rewarded for
each good action and get punished for each bad action; hence the goal of reinforcement learning
agent is to maximize the rewards.

The reinforcement learning process is similar to a human being; for example, a child learns
various things by experiences in his day-to-day life. An example of reinforcement learning is to
play a game, where the Game is the environment, moves of an agent at each step define states,
and the goal of the agent is to get a high score. Agent receives feedback in terms of punishment
and rewards.

Due to its way of working, reinforcement learning is employed in different fields such as Game
theory, Operation Research, Information theory, multi-agent systems.

A reinforcement learning problem can be formalized using Markov Decision Process


(MDP). In MDP, the agent constantly interacts with the environment and performs actions; at
each action, the environment responds and generates a new state.
5.5. Artificial Neural Networks (ANN)
The term "Artificial neural network" refers to a biologically inspired sub-field of artificial
intelligence modeled after the brain. An Artificial neural network is usually a computational
network based on biological neural networks that construct the structure of the human brain.
Similar to a human brain has neurons interconnected to each other, artificial neural networks also
have neurons that are linked to each other in various layers of the networks. These neurons are
known as nodes.

The given figure illustrates the typical diagram of Biological Neural Network.

The typical Artificial Neural Network looks something like the given figure.

Dendrites from Biological Neural Network represent inputs in Artificial Neural Networks, cell
nucleus represents Nodes, synapse represents Weights, and Axon represents Output.
Relationship between Biological neural network and artificial neural network:
Biological Neural Network Artificial Neural Network

Dendrites Inputs
Cell nucleus Nodes
Synapse Weights
Axon Output

5.5. Neural network structures

Neural networks are composed of nodes or units (see Figure 18.19) connected by directed links.
A link from unit i to unit j serves to propagate the activation ai from i to j. also has a numeric
weight wi,j associated with it, which determines the strength and sign of the connection. Just as in
linear regression models, each unit has a dummy input a0 = 1 with an associated weight w0,j .
Each unit j first computes a weighted sum of its inputs:
Then it applies an activation function g to this sum to derive the output:

5.6. Application areas of ANN


Followings are some of the areas, where ANN is being used.

 Aerospace − Autopilot aircrafts, aircraft fault detection.


 Automotive − Automobile guidance systems.
 Military − Weapon orientation and steering, target tracking, object discrimination, facial
recognition, signal/image identification.
 Electronics − Code sequence prediction, IC chip layout, chip failure analysis, machine
vision, voice synthesis.
 Financial − Real estate appraisal, loan advisor, mortgage screening, corporate bond
rating, portfolio trading program, corporate financial analysis, currency value prediction,
document readers, credit application evaluators.
 Industrial − Manufacturing process control, product design and analysis, quality
inspection systems, welding quality analysis, paper quality prediction, chemical product
design analysis, dynamic modeling of chemical process systems, machine maintenance
analysis, project bidding, planning, and management.
 Medical − Cancer cell analysis, EEG and ECG analysis, prosthetic design, transplant
time optimizer.
 Speech − Speech recognition, speech classification, text to speech conversion.
 Telecommunications − Image and data compression, automated information services,
real-time spoken language translation.
 Transportation − Truck Brake system diagnosis, vehicle scheduling, routing systems.
 Software − Pattern Recognition in facial recognition, optical character recognition, etc.
 Time Series Prediction − ANNs are used to make predictions on stocks and natural
calamities.
 Signal Processing − Neural networks can be trained to process an audio signal and filter
it appropriately in the hearing aids.
 Control − ANNs are often used to make steering decisions of physical vehicles.
 Anomaly Detection − As ANNs are expert at recognizing patterns, they can also be
trained to generate an output when something unusual occurs that misfits the pattern.

You might also like