
MODULE 4

 Introduction to Neural Networks:


- Neural Networks, abbreviated as NN or ANN (Artificial Neural
Network), represent a crucial paradigm in the field of artificial
intelligence and machine learning.
- It is a network of simple processing units that can solve artificial
intelligence problems.
- An ANN is a supervised learning system built from a large number of
simple elements, called neurons or perceptrons.
- Each neuron makes simple decisions and feeds those decisions to
other neurons, organized in interconnected layers.
- Basic Terms (a short code sketch after this list ties several of them together):

1. Neuron (Node):
 The fundamental unit of a neural network.
 Neurons receive inputs, apply transformations, and
produce outputs.

2. Input Layer:
 First layer receiving external inputs.
 Passes inputs to subsequent layers without applying
operations.
 Its size depends on the shape of the training data.
3. Output Layer:
 Final layer producing network predictions.
 Outputs are generated based on inputs processed by
hidden layers.
 Number of neurons in the output layer depends on the
task.
4. Hidden Layer:
 Intermediate layers between the input and output layers.
 Hidden layers process inputs through complex
transformations.
 They are essential for capturing intricate patterns in data.
5. Connections:
 Links between neurons that transmit information.
 Each connection possesses a weight, influencing the
signal's strength.
6. Bias:
 An additional input to a neuron, with its value usually fixed to 1.
 Bias lets a neuron activate even when all other inputs are zero,
shifting the activation threshold.
 (This bias term is distinct from the statistical bias of the
bias–variance trade-off.)
7. Weights:
 Numeric values associated with connections.
 Weights determine the influence of inputs on neuron
outputs.
 They are adjusted during training to minimize errors.
8. Activation Function:
 Function applied to neuron outputs, determining their
activation.
 Common functions include sigmoid, TanH, and ReLU.
 Activation functions introduce non-linearity, crucial for
learning complex patterns.
9. Error Function:
 Measures the disparity between actual and predicted
outputs.
 Loss functions like Mean Squared Error (MSE) quantify
prediction errors.
 Minimizing error functions guides the network towards
accurate predictions.
10. Variance:
 Reflects how sensitive the model is to fluctuations in the training
data, and hence how well it generalizes to new, unseen data.
 High variance may lead to overfitting, while high bias may cause
underfitting.
11. Hyperparameters:
 Settings influencing the network's architecture and
behavior.
 Examples include learning rate, number of hidden layers,
and neurons per layer.
 Tuning hyperparameters optimizes network performance
for specific tasks.
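To tie several of these terms together, here is a minimal sketch (in Python, with made-up inputs, weights, bias, and target chosen purely for illustration) of one neuron: a weighted sum plus bias, a sigmoid activation, and a squared-error measure against a target.

```python
import math

def sigmoid(z):
    # Squashes any real input into the (0, 1) range.
    return 1.0 / (1.0 + math.exp(-z))

# Illustrative values: two inputs, two weights, and a bias.
inputs = [0.5, -1.2]
weights = [0.8, 0.3]
bias = 1.0  # the extra input that lets the neuron activate even for zero inputs

# Weighted sum of inputs plus bias, then the activation function.
z = sum(w * x for w, x in zip(weights, inputs)) + bias
output = sigmoid(z)

# Error function: squared error against a (made-up) target.
target = 1.0
error = (target - output) ** 2
print(f"output={output:.4f}, squared error={error:.4f}")
```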

 Activation functions
 Functions applied to neuron outputs, determining their
activation.
 Common functions include sigmoid, tanh, and ReLU.
 Activation functions introduce non-linearity, which is crucial
for learning complex patterns.

Types of activation functions (each is implemented in the code sketch after this list)


1. Step Function

 Equation:
A = 1 if Y > threshold, 0 otherwise.
 It's a binary activation function where a neuron is activated if the
output exceeds a threshold.
 Drawback: Limited applicability in multi-class classification
due to its binary nature. Multiple activated neurons may
complicate decision-making.
2. Linear Function

 Equation: A = cx.
 Activation is directly proportional to input, offering a range of
activations.
 Problem: The gradient is the constant c, independent of the
input, so backpropagation cannot adapt updates to the error;
moreover, a stack of linear layers collapses to a single linear layer.
3. Sigmoid Function

 Equation: A = 1/(1 + e^(-x)).
 Nonlinear function suitable for binary classification tasks.
 Smooth gradient facilitates stable training.
 Advantages: Provides analog activations, bound within (0,1)
range, and allows stacking layers.
 Limitation: Suffers from vanishing gradient problem at extreme
ends, slowing down or halting learning.
4. Tanh Function

 Equation: A = 2/(1 + e^(-2x)) − 1, i.e., A = tanh(x).
 Similar to sigmoid but scaled to (-1,1) range, suitable for time
series data.
 Stronger gradient than sigmoid, aiding in faster learning.
 Limitation: Still susceptible to vanishing gradient problem.

5. ReLU (Rectified Linear Unit)

 Equation: A(x) = max(0,x).


 Nonlinear function with efficient computation.
 Offers sparse activations, reducing computational load.
 Drawback: Suffers from "dying ReLU" problem where neurons
become inactive, hindering learning in affected regions.
 Variations like Leaky ReLU mitigate this issue by maintaining a
non-zero gradient for negative inputs.
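The functions above, plus the Leaky ReLU variation, sketched in Python; the threshold and leak slope are arbitrary illustrative choices.

```python
import math

def step(x, threshold=0.0):
    # Binary: fires only above the threshold.
    return 1.0 if x > threshold else 0.0

def linear(x, c=1.0):
    # Activation proportional to input; the gradient is the constant c.
    return c * x

def sigmoid(x):
    # Smooth, bounded in (0, 1); gradient vanishes for large |x|.
    return 1.0 / (1.0 + math.exp(-x))

def tanh(x):
    # Scaled sigmoid, bounded in (-1, 1); stronger gradient near 0.
    return 2.0 / (1.0 + math.exp(-2.0 * x)) - 1.0

def relu(x):
    # Zero for negative inputs, identity otherwise; cheap and sparse.
    return max(0.0, x)

def leaky_relu(x, slope=0.01):
    # Keeps a small non-zero gradient for negative inputs.
    return x if x > 0 else slope * x

for f in (step, linear, sigmoid, tanh, relu, leaky_relu):
    print(f.__name__, [round(f(x), 3) for x in (-2.0, 0.0, 2.0)])
```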

 Perceptron
Introduction to Perceptron:
 A perceptron is a basic unit in an artificial neural network,
mimicking the functionality of biological neurons.
 It's capable of learning and solving simple binary classification
problems.
 Inputs are weighted, summed, and passed through an activation
function to produce an output.

Types of Perceptrons:
1. Single Layer Perceptron:
 Learns only linearly separable patterns.
 A single layer of weights cannot solve non-linearly separable
problems such as XOR.
2. Multilayer Perceptron (Feedforward Neural Network):
 Comprises multiple layers of perceptrons, allowing it to solve
problems that are not linearly separable.
 Typically includes an input layer, one or more hidden
layers, and an output layer (a hand-built example follows this list).
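As a hand-built illustration of why extra layers matter, the sketch below computes XOR, a problem no single-layer perceptron can represent, using two fixed hidden neurons; the weights and thresholds are chosen by hand for illustration, not learned.

```python
def step(s, threshold=0.0):
    # Fires when the weighted sum exceeds the threshold.
    return 1 if s > threshold else 0

def xor_mlp(x1, x2):
    # Hidden layer: one neuron computes OR, another computes AND.
    h_or = step(x1 + x2, 0.5)    # fires if at least one input is 1
    h_and = step(x1 + x2, 1.5)   # fires only if both inputs are 1
    # Output layer: OR but not AND, i.e., XOR.
    return step(h_or - h_and, 0.5)

for a, b in [(0, 0), (0, 1), (1, 0), (1, 1)]:
    print(a, b, "->", xor_mlp(a, b))
```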

Perceptron Learning:
 Automatically learns optimal weight coefficients through
training.
 Inputs are multiplied by weights, and if the sum exceeds a
threshold, the neuron fires.
 Each full pass through the training set is an epoch; training
continues until the error ceases to improve (see the sketch below).
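A sketch of this learning rule on the AND function, a linearly separable problem; the learning rate, initial weights, and epoch cap are arbitrary illustrative choices.

```python
# Perceptron learning on the AND function (linearly separable).
data = [([0, 0], 0), ([0, 1], 0), ([1, 0], 0), ([1, 1], 1)]

weights = [0.0, 0.0]
bias = 0.0
lr = 0.1  # learning rate

def predict(x):
    # Fire (output 1) if the weighted sum plus bias exceeds 0.
    s = sum(w * xi for w, xi in zip(weights, x)) + bias
    return 1 if s > 0 else 0

for epoch in range(20):  # each full pass over the training set is an epoch
    errors = 0
    for x, target in data:
        error = target - predict(x)
        if error != 0:
            errors += 1
            # Nudge weights and bias toward the correct answer.
            for i in range(len(weights)):
                weights[i] += lr * error * x[i]
            bias += lr * error
    if errors == 0:  # stop once the training set is classified correctly
        break

print(weights, bias, [predict(x) for x, _ in data])
```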
 Applications of ANN:
1. Image Recognition and Computer Vision
2. Natural Language Processing (NLP)
3. Speech Recognition and Synthesis
4. Healthcare (Medical Image Analysis, Patient Outcome
Prediction)
5. Finance and Trading (Financial Forecasting, Algorithmic
Trading)
6. Autonomous Vehicles (Object Detection, Path Planning)
7. Recommendation Systems
8. Manufacturing and Industry 4.0 (Predictive Maintenance,
Quality Control)
9. Cybersecurity (Intrusion Detection, Malware Detection)
10. Environmental Monitoring (Weather Forecasting, Pollution
Prediction)

 Gradient Descent :
- Gradient Descent is a widely used optimization algorithm for
training machine learning models, including neural networks.
- It iteratively adjusts model parameters to minimize the error
between actual and predicted outputs. Let's delve into its
workings and variations:
Working Principle
 Slope Measurement: Measures how the output of a function
changes with respect to small changes in inputs.
 Local Minimum/Maximum: Moving in the direction of the
negative gradient leads toward a local minimum, while moving
along the positive gradient leads toward a local maximum;
descent therefore follows the negative gradient to converge
toward an optimal solution.
Formula
b=a−α⋅∇F(a)
 b: New value
 a: Current value
 α: Learning rate
 ∇F(a): Gradient of F at a (the direction of steepest ascent; subtracting it steps in the direction of steepest descent)
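For example, with made-up numbers: take F(a) = a², so ∇F(a) = 2a. Starting from a = 2 with learning rate α = 0.1, the update gives b = 2 − 0.1 × (2 × 2) = 1.6; repeating the update moves a by a factor of 0.8 each step toward the minimum at a = 0.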

Minimization Algorithm
 Start with initializing weights (W) and bias (b).
 Calculate the slope (gradient) using the derivative.
 Adjust parameters by stepping down the slope (along the tangent
line) to reduce the cost function.
 Update weights and bias iteratively until convergence, where
the cost function is minimized (sketched in code below).
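A minimal sketch of this loop, minimizing the one-parameter cost F(w) = (w − 3)²; the cost function, starting point, learning rate, and stopping tolerance are all made-up illustrations.

```python
# Minimize F(w) = (w - 3)^2 by gradient descent; the minimum is at w = 3.
def grad(w):
    # Derivative of the cost: dF/dw = 2 * (w - 3).
    return 2.0 * (w - 3.0)

w = 0.0      # initial parameter
alpha = 0.1  # learning rate
for step in range(100):
    g = grad(w)
    if abs(g) < 1e-6:  # convergence: the slope is essentially zero
        break
    w = w - alpha * g  # the update b = a - alpha * grad F(a)

print(f"w = {w:.4f} after {step} steps")
```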
 Types of Gradient Descent
1. Batch Gradient Descent (BGD):
 Calculates error for all training points before updating
parameters.
 Updates the model after evaluating all training examples.
 Suitable for small datasets but computationally expensive
for large datasets due to evaluating all examples at once.
2. Stochastic Gradient Descent (SGD):
 Processes each training example individually within a
dataset.
 Updates parameters one example at a time.
 Faster computation but noisy updates, leading to erratic
convergence.

3. Mini-Batch Gradient Descent:


 Hybrid approach combining BGD and SGD.
 Divides training datasets into small batches and updates
parameters on these batches separately.
 Strikes a balance between computational efficiency and
convergence stability (all three variants are compared in the
sketch below).
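The three variants differ only in how many examples feed each parameter update. The sketch below fits a toy linear model y = w·x by minimizing squared error; the data, learning rate, and epoch count are made up, and the batch size alone switches between the three variants.

```python
import random

# Toy data generated from y = 2x; the fitted w should approach 2.
data = [(x, 2.0 * x) for x in [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0]]

def grad(w, batch):
    # Average gradient of (w*x - y)^2 over the batch: 2 * x * (w*x - y).
    return sum(2.0 * x * (w * x - y) for x, y in batch) / len(batch)

def train(batch_size, epochs=50, lr=0.01):
    w = 0.0
    for _ in range(epochs):
        random.shuffle(data)
        # batch_size = len(data): batch GD (one update per epoch)
        # batch_size = 1:         SGD (one update per example)
        # otherwise:              mini-batch GD
        for i in range(0, len(data), batch_size):
            w -= lr * grad(w, data[i:i + batch_size])
    return w

print("batch     :", round(train(len(data)), 4))
print("stochastic:", round(train(1), 4))
print("mini-batch:", round(train(4), 4))
```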

 Learning a neural network is a minimization problem:

Training a neural network is fundamentally a minimization problem.
Justification:
1. Objective Function: In neural network training, the
primary goal is to minimize the error or loss function.
The error function quantifies the disparity between the
predicted outputs of the neural network and the actual
outputs. The aim is to minimize this error, ensuring that
the network's predictions are as close as possible to the
ground truth.
2. Gradient Descent: The optimization algorithm
commonly used to train neural networks, such as
gradient descent, operates by iteratively adjusting the
model parameters (weights and biases) in the direction
that minimizes the error function. This process involves
computing the gradient of the error function with
respect to the model parameters and updating them to
move towards the direction of decreasing error.
3. Local Minimum: The objective of the minimization
problem is to find a minimum of the error function,
ideally the global minimum, though in practice training
typically converges to a local minimum. At such a
minimum, further small adjustments to the model
parameters would not significantly reduce the error.
Therefore, training a neural network involves iteratively
updating the parameters until convergence to a (local)
minimum of the error function is achieved.
