0% found this document useful (0 votes)

59 views23 pages

Machine Learning For Anomaly Detection

Uploaded by

sheikarafat.resume

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

59 views23 pages

Machine Learning For Anomaly Detection

Uploaded by

sheikarafat.resume

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 23

Machine learning for anomaly

detection
December 2024
1. Understanding techniques, applications, and best practices
Agenda

2. Case studies

3. Points to remember

4. Resources and further reading

5. Questions and discussion

01
UNDERSTANDING TECHNIQUES, APPLICATIONS, AND
BEST PRACTICES
Artificial Intelligence vs Machine Learning

AI vs ML?

Artificial intelligence (AI) is a Machine learning (ML) is a

broad concept that describes specific application of AI that
a machine's ability to mimic teaches machines to perform
human intelligence. tasks by learning from data.
WHAT IS MACHINE LEARNING?

Machine Learning Overview

• Machine Learning is a subset of AI that .

enables systems to learn and improve from
experience without explicit programming.

• Key Focus Patterns, predictions, and decision-

making
Process
WHAT IS ANOMALY DETECTION?

Anomaly detection refers to

identifying patterns in data that do
not conform to expected behavior.

Significant in applications like

fraud detection, network security,
and predictive maintenance.

Helps mitigate risks and improve

decision- making processes.

Anomaly detection identifies suspicious activity that falls outside of your established normal
patterns of behavior. A solution protects your system in real-time from instances that could result
in significant financial losses, data breaches, and other harmful events
TYPES OF ANOMALIES

Point Anomalies
Data points significantly
different from the majority (e.g., Contextual Anomalies
a sudden spike in network
traffic). Unusual only within a specific
context (e.g., high temperature
during winter).

Collective Anomalies
A collection of related data
points that deviate as a group
(e.g., a distributed denial- of-
service attack).
SUPERVISED ANOMALY DETECTION UNSUPERVISED ANOMALY DETECTION
• Supervised machine learning builds a • Unsupervised methods do not demand
predictive model using a labeled training manual labeling of training data. Instead,
set with normal and anomalous samples they operate based on the presumption

• The most common supervised methods • The most popular unsupervised anomaly
include Bayesian networks, k-nearest detection algorithms include Autoencoders,
neighbors, decision trees, supervised neural K-means, GMMs, hypothesis tests-based
networks, and SVMs analysis, and PCAs.

• The advantage of supervised models is that • These techniques thus assume collections
they may offer a higher rate of detection of frequent, similar instances are normal
and flag infrequent data groups as
malicious.

SEMI SUPERVISED ANOMALY DETECTION

• Semi-supervised anomaly detection may refer to an approach to creating a model for normal data
based on a data set that contains both normal and anomalous data, but is unlabelled

• The most common semi supervised methods include Linear regression, Outlier detection,Graph-
based.

• A semi-supervised anomaly detection algorithm might also work with a data set that is partially
flagged. It will then build a classification algorithm on just that flagged subset of data, and use that
model to predict the status of the remaining data.
WHY USE MACHINE LEARNING FOR ANOMALY DETECTION?

Advantages of ML Challenges

• Data imbalance Anomalies are

• Handles complex and rare compared to normal data
large datasets effectively.
• . Learns from data to • Dynamic and non- stationary
adapt to new patterns data.Data evolves over time,
dynamically requiring adaptive models

• Provides superior • High dimensionality Complex

accuracy compared to data structures make anomalies
traditional statistical harder to detect
methods.
COMMON ALGORITHMS IN ANOMALY DETECTION

Algorithm Types Anomaly Detection Algorithm Techniques To Know

• Supervised Random Forest, SVM for • Isolation Forest

binary classification. • Local Outlier Factor (LOF)
• Unsupervised: PCA, k- Means, • Robust Covariance
Isolation Forest for detecting • One- class support vector machine
patterns. (SVM)
• Deep Learning: Autoencoders, RNNs • One- class SVM with stochastic
for complex data types like time gradient descent (SGD)
series. • K- means clustering
• Long short- term memory (LSTM)
• Angle- based outlier detection
Techniques

One-Class Support Vector

Isolation Forest Local Outlier Factor Robust Covariance
Machine (SVM)
Isolation Forest isolates LOF identifies anomalies by Robust covariance is a statistical A One-Class SVM creates a
anomalies by creating random comparing the local density of a method that computes the boundary around normal data
partitions in the data. Anomalies point to its neighbors. Points with covariance matrix to identify points in a high-dimensional
are isolated faster than normal significantly lower density than data points deviating from the space, classifying points outside
points due to their distinct their neighbors are flagged as multivariate distribution. the boundary as anomalies.
properties. outliers .

Long Short-Term Memory Angle-Based Outlier

One-Class SVM with SGD K-Means Clustering
(LSTM) Detection
This method optimizes One- K-Means groups data into LSTMs are a type of recurrent This method calculates the
Class SVM using Stochastic clusters, and points far from any neural network that learns angle between points in high-
Gradient Descent to handle cluster center are considered temporal dependencies in dimensional space to detect
large-scale datasets efficiently. anomalies. sequential data. They identify anomalies. Anomalies are
. anomalies by analyzing identified based on deviations
deviations from learned patterns. from expected angular
distributions.
One-Class Support Vector
Isolation Forest Local Outlier Factor(LOF) Machine (SVM)

Long Short-Term Memory (LSTM)

K-Means Clustering
EXAMPLES OF ALGORITHM APPLICATIONS

One-Class Support Vector Machine

Isolation Forest Example Local Outlier Factor (LOF) Example Robust Covariance Example (SVM) Example
Detecting fraudulent Identifying unusual behavior in Detecting unusual patterns in Detecting abnormal network
transactions in credit card data user activity logs for multivariate sensor data in traffic in IT infrastructure.
using an Isolation Forest cybersecurity. manufacturing processes.
algorithm.

Long Short-Term Memory (LSTM) Angle-Based Outlier Detection

One-Class SVM with SGD Example K-Means Clustering Example Example Example
Detecting outliers in massive Identifying rare diseases in Detecting anomalies in time- Detecting outliers in large, high-
customer behavior datasets in patient medical records by series data, such as server logs dimensional datasets like gene
e- commerce. analyzing cluster distances. or stock market fluctuations. expression data.
INFERENCE

Key Inference

Anomaly detection techniques are vital for

uncovering irregularities in various domains.
Choosing the right algorithm depends on the

1. Dataset
2. Scale,
3. Application requirements.
PRACTICAL WORKFLOW FOR ANOMALY DETECTION

Step 1 Data preprocessing: Handle

missing data, outliers, and normalization.

Step 2Algorithm selection based on the

data and problem type.

Workflow
Step 3 Model evaluation using key metrics
like F1- score.

Step 4 Deploy the model and monitor its

performance.
02
CASE STUDIES
03 Resources and Further Reading

1. Books: 'Anomaly Detection Principles and Algorithms' by Aggarwal.

2. Courses: 'Machine Learning for All’, AL/ML at IIT
3. Datasets: UCI Machine Learning Repository
Q&A
Thank you

Scsa1619 Ids Unit 2
No ratings yet
Scsa1619 Ids Unit 2
20 pages
CS L06 MachineLearning AnomalyDetection
No ratings yet
CS L06 MachineLearning AnomalyDetection
61 pages
Anomaly Detection For Web Log Based Data
No ratings yet
Anomaly Detection For Web Log Based Data
5 pages
Unit 3
No ratings yet
Unit 3
37 pages
10 - Anomaly Detection
No ratings yet
10 - Anomaly Detection
12 pages
WSDM21 Tutorial DLAD Slides
No ratings yet
WSDM21 Tutorial DLAD Slides
110 pages
Anomoly Detection - Ensemble - Classifiers
No ratings yet
Anomoly Detection - Ensemble - Classifiers
68 pages
The Ultimate Guide To Anomaly Detection: Key Use Cases, Techniques, and Autoencoder Machine Learning Models
No ratings yet
The Ultimate Guide To Anomaly Detection: Key Use Cases, Techniques, and Autoencoder Machine Learning Models
9 pages
Anomaly Detection Class
No ratings yet
Anomaly Detection Class
24 pages
Mausumi Doi - Org.10.32010.26166127.2020.3.2.196.206
No ratings yet
Mausumi Doi - Org.10.32010.26166127.2020.3.2.196.206
12 pages
Anomaly Detection Using ML
No ratings yet
Anomaly Detection Using ML
30 pages
Anomaly Detection in Log Files Based On Machine Le
No ratings yet
Anomaly Detection in Log Files Based On Machine Le
13 pages
02 - 03 - Anomaly Detection Survey
No ratings yet
02 - 03 - Anomaly Detection Survey
27 pages
Introduction To Anomaly Detection With Machine Learning
No ratings yet
Introduction To Anomaly Detection With Machine Learning
12 pages
Anomaly Detection 2
No ratings yet
Anomaly Detection 2
8 pages
1 s2.0 S0952197622004936 Main
No ratings yet
1 s2.0 S0952197622004936 Main
8 pages
Anomaly Detection
No ratings yet
Anomaly Detection
3 pages
Ecmlpkdd08 Lazarevic Dmfa
No ratings yet
Ecmlpkdd08 Lazarevic Dmfa
116 pages
References
No ratings yet
References
6 pages
Anomaly Detection and Curve Fitting
No ratings yet
Anomaly Detection and Curve Fitting
72 pages
Faizah
No ratings yet
Faizah
11 pages
Anomaly Detection Guide for Beginners
No ratings yet
Anomaly Detection Guide for Beginners
12 pages
Anomaly Detection
No ratings yet
Anomaly Detection
7 pages
Anomaly Detection
No ratings yet
Anomaly Detection
7 pages
Anomaly Detection Unit 5
No ratings yet
Anomaly Detection Unit 5
9 pages
Ahmed PDF
No ratings yet
Ahmed PDF
6 pages
Network Anomaly Detection Methods
No ratings yet
Network Anomaly Detection Methods
6 pages
Introtoanomalydetection 170421012904
No ratings yet
Introtoanomalydetection 170421012904
53 pages
14 Pages 22 July
No ratings yet
14 Pages 22 July
5 pages
6anomaly Fraud Detection
No ratings yet
6anomaly Fraud Detection
5 pages
Unit 4
No ratings yet
Unit 4
17 pages
Reverse Accessible in Local Outlier Factor Density Based Recognition
No ratings yet
Reverse Accessible in Local Outlier Factor Density Based Recognition
10 pages
WP S-Ax Key Steps To Detect An Anomaly in Real-time-JAN10
No ratings yet
WP S-Ax Key Steps To Detect An Anomaly in Real-time-JAN10
10 pages
Anomaly Detection: A Tutorial: Arindam Banerjee, Varun Chandola, Vipin Kumar, Jaideep Srivastava
No ratings yet
Anomaly Detection: A Tutorial: Arindam Banerjee, Varun Chandola, Vipin Kumar, Jaideep Srivastava
101 pages
Anomaly Detection Using SOM and Particle Swarm Optimization
No ratings yet
Anomaly Detection Using SOM and Particle Swarm Optimization
9 pages
ff12 Deep Learning For Anomaly Detection
No ratings yet
ff12 Deep Learning For Anomaly Detection
71 pages
Anamoly Detection
0% (1)
Anamoly Detection
20 pages
Module 11 (C)
No ratings yet
Module 11 (C)
4 pages
Outlier Analysis for Data Scientists
No ratings yet
Outlier Analysis for Data Scientists
18 pages
Anomaly Detection Presentation With Charts
No ratings yet
Anomaly Detection Presentation With Charts
8 pages
Anomaly Detection Insights
No ratings yet
Anomaly Detection Insights
7 pages
Anomaly Detection
No ratings yet
Anomaly Detection
13 pages
Anomaly Detection
No ratings yet
Anomaly Detection
49 pages
Pattern Recognition & Anomaly Detection
No ratings yet
Pattern Recognition & Anomaly Detection
2 pages
Anomaly Detection: Jing Gao
No ratings yet
Anomaly Detection: Jing Gao
51 pages
Network Anomaly Detection
No ratings yet
Network Anomaly Detection
18 pages
Distance Based Outlier Detection
No ratings yet
Distance Based Outlier Detection
40 pages
Anomaly Detection Tutorial
No ratings yet
Anomaly Detection Tutorial
101 pages
Anomaly Detection in Cybersecurity
No ratings yet
Anomaly Detection in Cybersecurity
31 pages
MBA Analytics For Finance 08
No ratings yet
MBA Analytics For Finance 08
9 pages
Paper 6 CN
No ratings yet
Paper 6 CN
32 pages
Anomaly Detection On Industrial Electrical Systems Using Deep Learning
No ratings yet
Anomaly Detection On Industrial Electrical Systems Using Deep Learning
6 pages
Measuring of Data Quality in KYC Using Anomaly Det
No ratings yet
Measuring of Data Quality in KYC Using Anomaly Det
7 pages
17 dm2 Anomaly Detection 2022 23
No ratings yet
17 dm2 Anomaly Detection 2022 23
113 pages
Machine Learning For Time Series Anomaly Detection: Ihssan Tinawi
No ratings yet
Machine Learning For Time Series Anomaly Detection: Ihssan Tinawi
55 pages
Ai 05 00143
No ratings yet
Ai 05 00143
17 pages
10.anomaly Detection
No ratings yet
10.anomaly Detection
24 pages
Anomaly Detection
No ratings yet
Anomaly Detection
19 pages
Ogre User Manuel 1 7 A4
No ratings yet
Ogre User Manuel 1 7 A4
185 pages
CSM PSM CAM Difference
No ratings yet
CSM PSM CAM Difference
1 page
Exercises Read Sentence Completion
No ratings yet
Exercises Read Sentence Completion
11 pages
Statistical Analysis and Histogram
No ratings yet
Statistical Analysis and Histogram
8 pages
SystemVerilog Adder Testbench Guide
No ratings yet
SystemVerilog Adder Testbench Guide
7 pages
Looker Admin & Developer Guide
No ratings yet
Looker Admin & Developer Guide
16 pages
Pecoff v8
No ratings yet
Pecoff v8
69 pages
Plan, Design, and Compete in Robo Rally!
No ratings yet
Plan, Design, and Compete in Robo Rally!
63 pages
Military History - Wikipedia
No ratings yet
Military History - Wikipedia
8 pages
CV Template ByJeremy 2024
No ratings yet
CV Template ByJeremy 2024
1 page
Alex Blyth - Brilliant Online Marketing - How To Use The Internet To Market Your Business (Brilliant Business) - Prentice Hall (2010)
No ratings yet
Alex Blyth - Brilliant Online Marketing - How To Use The Internet To Market Your Business (Brilliant Business) - Prentice Hall (2010)
177 pages
Amare CV
No ratings yet
Amare CV
3 pages
Deedy
No ratings yet
Deedy
9 pages
Addition Means - These Are The Steps On How To Do Addition Within 100
No ratings yet
Addition Means - These Are The Steps On How To Do Addition Within 100
4 pages
5 AZ-104 Manage Identiies and Policy, RBAC
No ratings yet
5 AZ-104 Manage Identiies and Policy, RBAC
15 pages
Startup and Shutdown Container Databases (CDB) and Pluggable Databases (PDB)
No ratings yet
Startup and Shutdown Container Databases (CDB) and Pluggable Databases (PDB)
3 pages
U3L07 - Activity Guide - Robot Face Planning
No ratings yet
U3L07 - Activity Guide - Robot Face Planning
2 pages
Contoh Application Job
No ratings yet
Contoh Application Job
4 pages
Dray: Gigabit Switch Deployment
No ratings yet
Dray: Gigabit Switch Deployment
5 pages
Topic 1 - Information Security Governance
No ratings yet
Topic 1 - Information Security Governance
33 pages
SML Loop
No ratings yet
SML Loop
4 pages
Ict Reviewer 3RD QTR
No ratings yet
Ict Reviewer 3RD QTR
6 pages
ML Logcat 1742825998561
No ratings yet
ML Logcat 1742825998561
64 pages
Firefly5 16-5 22 PDF
No ratings yet
Firefly5 16-5 22 PDF
83 pages
3D World 304 2023 11
100% (1)
3D World 304 2023 11
100 pages
RL3-NAC Info
No ratings yet
RL3-NAC Info
11 pages
51talk Next Step Reminders
No ratings yet
51talk Next Step Reminders
3 pages
JAVA - Unit 1
No ratings yet
JAVA - Unit 1
36 pages
RT3602AH
No ratings yet
RT3602AH
49 pages
Plumes - Delineation & Transport - D. James Benton
No ratings yet
Plumes - Delineation & Transport - D. James Benton
140 pages

Machine Learning For Anomaly Detection

Uploaded by

Machine Learning For Anomaly Detection

Uploaded by

Machine learning for anomaly

4. Resources and further reading

5. Questions and discussion

Artificial intelligence (AI) is a Machine learning (ML) is a

Machine Learning Overview

• Machine Learning is a subset of AI that .

• Key Focus Patterns, predictions, and decision-

Anomaly detection refers to

Significant in applications like

Helps mitigate risks and improve

SEMI SUPERVISED ANOMALY DETECTION

• Data imbalance Anomalies are

• Provides superior • High dimensionality Complex

Algorithm Types Anomaly Detection Algorithm Techniques To Know

• Supervised Random Forest, SVM for • Isolation Forest

One-Class Support Vector

Long Short-Term Memory Angle-Based Outlier

Long Short-Term Memory (LSTM)

One-Class Support Vector Machine

Long Short-Term Memory (LSTM) Angle-Based Outlier Detection

Anomaly detection techniques are vital for

Step 1 Data preprocessing: Handle

Step 2Algorithm selection based on the

Step 4 Deploy the model and monitor its

1. Books: 'Anomaly Detection Principles and Algorithms' by Aggarwal.

You might also like