Formal Methods in AI/ML Safety

This document discusses formal methods in machine learning. It provides an overview of safety issues in AI/ML systems and the need to ensure these systems behave properly, especially in applications with serious consequences. It outlines two perspectives on this: machine learning, which focuses on reducing accidents through better design and data, and formal methods, which aim to mathematically prove properties or find counterexamples. The document discusses challenges in applying formal methods to complex neural networks and opportunities to use formal analysis to help verify, design and explain machine learning components.

Uploaded by

Geethika Chamana

Formal Methods in Machine Learning

A 30,000-foot view
(Why CS781?)

Supratik Chakraborty
IIT Bombay
Safety in AI/ML
● AI/ML based systems
○ Computational systems that try to mimic (and improve upon?) human reasoning
● Applications span entire spectrum of consequences
○ Benign
■ Auto completion in chat, game of chess, recommendation of restaurants, …
○ Potentially serious, but recoverable
■ Approval of bank loans, bail applications, ...
○ Serious irrecoverable consequences
■ Collision avoidance in unmanned drones, self-driving cars, malware detection, …

● Can we trust decisions by AI/ML based systems in applications where cost of errors is extraordinarily large?
○ Human lives, breach of privacy, security gaps, loss of critical infrastructure …
Something Requires Attention …
Centrality of Machine Learning based Decisions
A Typical Setup
[Figure: an application with an ML component, showing the ML component under study, the application’s environment, and the other components in the application]
Does this do the right thing in all corner cases?
Different perspectives
● Machine learning perspective
○ “Accidents”
■ Unintended, harmful behaviour stemming from “bad” design of ML components?
■ Wrong objective function design?
■ Training based on insufficient or poorly curated data?
■ Errors due to distributional shift of inputs?

○ Core machine learning techniques can reduce “accidents”

■ Scalable, works in a large spectrum of real-world settings
■ Are all corner cases covered? Do we have proofs of correctness?
Different perspectives
Can we depend on trained/designed complex networks to always work as desired in previously unseen corner cases, when the cost of an error is huge?

ML based techniques to mitigate the problem are important and must be used

But are these sufficient?

Different perspectives
● Formal methods perspective
○ System: E.g., Neural net in self-driving car
■ Mathematical model of system’s behaviour (S)
○ Environment: E.g., Road, weather, traffic, driver interventions, ...
■ Mathematical model of environment’s behaviour (E)
○ Property: A precise formulation (F) of acceptable behaviour of S operating in E

○ Algorithmic search of proof space

■ Either obtain a proof that the system satisfies the property
● (S || E) ⊨ F
■ Or find a counterexample (network inputs) that demonstrates violation of the property
● Model of (S || E) ∧ ¬F
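The proof-or-counterexample search described above can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy network `system`, the finite input grid `environment`, and the property built by `output_at_most` are invented, and plain enumeration stands in for the algorithmic search a real verifier would perform.

```python
from itertools import product

def relu(v):
    return max(0, v)

def system(x1, x2):
    """S: a toy network with two hidden ReLU units (weights are made up)."""
    h1 = relu(2 * x1 - x2)
    h2 = relu(x2 - 3)
    return h1 - 2 * h2

def environment():
    """E: every input the environment may produce (a small integer grid)."""
    return product(range(6), range(6))

def output_at_most(bound):
    """F: build the property 'the output never exceeds bound'."""
    return lambda x1, x2, y: y <= bound

def verify(system, environment, prop):
    """Return (True, None) if every behaviour of S in E satisfies F,
    otherwise (False, inputs) for the first violating inputs found."""
    for x1, x2 in environment():
        y = system(x1, x2)
        if not prop(x1, x2, y):
            return False, (x1, x2)
    return True, None

print(verify(system, environment, output_at_most(10)))  # property holds
print(verify(system, environment, output_at_most(9)))   # counterexample exists
```

Enumeration is only feasible here because E is finite and tiny. For realistic networks the same question is typically posed to a constraint solver or answered on an abstraction of S.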
Different perspectives
[Figure: S and E are composed and verified against F; the outcome is either Yes (+ proof) or No (+ counter-example)]

Ref: Towards Verified Artificial Intelligence, Seshia, Sadigh and Sastry

Different perspectives
Formal methods perspective

● Hugely successful in hardware industry, software industry

○ Every processor from Intel/AMD has parts of the design formally verified
○ Every time you fly an Airbus aircraft, large parts of the auto-pilot software have been formally verified
○ Every time you insert a USB device into a Windows machine, formal verification of downloaded drivers happens

● Can we make the technology work for AI/ML based systems?

Different perspectives
FM in ML goes beyond proofs/counterexamples of safety properties

Can we use formal methods based reasoning to

● Verify correctness of algorithms used to train?

● Do correct-by-construction design of ML components?
● Provide explanations based on formal models?
● Fish out adversarial inputs for well-trained ML components?
● Analyze robustness, fairness, privacy, security, transparency etc.?
Some common problems

System S
High-dimensional input space and parameter space: scalability of analysis?
Some common problems
Environment E
How do we model the application’s environment and the other components in the application?
Some common problems
Property F: (Vehicle within 5m on left) ⟹ ¬ (Steer left)
Some common problems

What is the corresponding property for S?
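As a rough illustration, property F can be written as a Boolean predicate over application-level observations and checked on a recorded trace. The `Step` record, its field names, and the reading of "within 5m" as a gap of at most 5.0 metres are assumptions introduced for this sketch. Note that the predicate talks about vehicles and steering commands, not about the raw inputs of the network S, which is exactly the gap the question above points at.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass(frozen=True)
class Step:
    # Hypothetical application-level observation at one time step.
    left_gap_m: Optional[float]  # gap to nearest vehicle on the left, None if no vehicle
    steer: str                   # "left", "right" or "straight"

def satisfies_f(step):
    """F: (vehicle within 5 m on left) implies not (steer left)."""
    vehicle_close = step.left_gap_m is not None and step.left_gap_m <= 5.0
    return (not vehicle_close) or step.steer != "left"

def first_violation(trace):
    """Return the index of the first step violating F, or None."""
    for i, step in enumerate(trace):
        if not satisfies_f(step):
            return i
    return None

trace = [
    Step(12.0, "left"),      # vehicle far away: steering left is allowed
    Step(4.0, "straight"),   # vehicle close, but no left steer
    Step(3.5, "left"),       # vehicle close and steering left: violates F
]
print(first_violation(trace))
```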

Realistic expectations
● Given scale and complexity of today’s AI/ML based systems
○ Challenging, if not impossible, to design correct-by-construction ML system, or formally verify overall correct operation without restrictive/unrealistic assumptions
○ Nascent area, lots of promising ideas in literature
● Therefore,
○ Core ML techniques, Formal Analysis/Verification AND Run-Time Assurance needed
Formal Methods and Machine Learning must help each other
Some Buzzing Research Topics
● Specifying properties for ML components
● Modeling environments and neural networks
● Abstract interpretation for analyzing deep neural networks
● Customized constraint solvers
● Verified Reinforcement Learning
● Robustness analysis through formal methods lens
● Explainability of ML components: logic based approach
Some Additional Details
Modeling the system

● Very high dimensional input space

● Need abstraction mechanisms suitable for scale of ML component complexity
○ Walking a tight rope -- computational efficiency vs precision of analysis
● Use logical formalisms to “explain” ML components
○ Some of these can be used as models
● Model systems in context
○ Perhaps not necessary to model arbitrary behaviours
Modeling environment
● Uncertainty omnipresent: First class entity in reasoning
● Some things are inherently hard to model
○ Human behaviour, traffic conditions
● Need to combine probabilistic and non-deterministic modeling intelligently
● Markov Decision Processes (MDPs), probabilistic programs, …
● Abstractions in environment modeling
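One way to see how probabilistic and non-deterministic modeling combine in an MDP is the sketch below. The states, actions, and probabilities are invented for illustration. Actions are the non-deterministic choices, each action has a probabilistic outcome, and value iteration computes the largest and smallest probability of eventually reaching the "crash" state over all ways of resolving the choices.

```python
# mdp[state][action] = list of (probability, next_state); all numbers are made up.
MDP = {
    "cruise": {
        "keep": [(0.9, "cruise"), (0.1, "near")],
        "exit": [(1.0, "parked")],
    },
    "near": {
        "brake": [(0.8, "cruise"), (0.2, "crash")],
        "swerve": [(0.5, "cruise"), (0.5, "crash")],
    },
    "crash": {},   # absorbing
    "parked": {},  # absorbing
}

def reach_probability(mdp, target, choose, iterations=1000):
    """Value iteration for the probability of eventually reaching target.
    choose=max resolves non-determinism in the worst case, choose=min in the best."""
    p = {s: (1.0 if s == target else 0.0) for s in mdp}
    for _ in range(iterations):
        new = {}
        for s, actions in mdp.items():
            if s == target or not actions:
                new[s] = p[s]
            else:
                new[s] = choose(
                    sum(pr * p[t] for pr, t in outcomes)
                    for outcomes in actions.values()
                )
        p = new
    return p

worst = reach_probability(MDP, "crash", max)
best = reach_probability(MDP, "crash", min)
print(worst["cruise"], best["cruise"], best["near"])
```

The gap between the two results shows why the choice of modeling matters: the same probabilistic transitions give very different guarantees depending on who is assumed to resolve the non-determinism.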
Specification of what is desired behaviour
● Often hard to formalize
○ Significant chunk of time spent on this even in software/hardware verification
● “Data as specification” vs “formal specification”
○ Can this gap be bridged?
○ Specification mining from behaviours, traces?
● Quantitative vs Boolean specifications
○ Quantitative specs often have an optimization flavour
○ Does a system satisfy/fail a property or get a formal score for property satisfaction?
● Run-time monitors
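The difference between a Boolean and a quantitative specification can be sketched on a single requirement, "the gap to the vehicle ahead is always at least a threshold". The function names and the sample traces are assumptions for illustration. The quantitative score is the worst-case margin over the trace, and the Boolean verdict is recovered by checking its sign.

```python
def robustness_always_at_least(signal, threshold):
    """Quantitative score: smallest margin (value - threshold) over the trace.
    Positive means satisfied with that much slack, negative means violated."""
    return min(v - threshold for v in signal)

def holds_always_at_least(signal, threshold):
    """Boolean verdict derived from the quantitative score."""
    return robustness_always_at_least(signal, threshold) >= 0

safe_trace = [8.0, 6.5, 5.5]    # gaps in metres, made-up values
unsafe_trace = [8.0, 4.0, 6.0]

print(robustness_always_at_least(safe_trace, 5.0), holds_always_at_least(safe_trace, 5.0))
print(robustness_always_at_least(unsafe_trace, 5.0), holds_always_at_least(unsafe_trace, 5.0))
```

A run-time monitor could evaluate the same score incrementally by keeping the running minimum as new samples arrive.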
Practically “efficient” computational techniques
● Hardware & software verification settings
○ Symbolic model checking, SAT/SMT solvers, numerical simulation techniques, …
● AI/ML context
○ Data generation, satisfying soft, hard, distributional constraints (realism)
○ Efficient constraint solving techniques with ReLUs, sigmoids, etc.
○ New abstraction/refinement techniques for ReLUs, sigmoids for sound analysis
○ Compositional reasoning
■ Assume-guarantee reasoning for Boolean models/specifications relatively mature
■ Similar reasoning for probabilistic/quantitative models/specifications?
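A simple example of a sound abstraction for ReLU networks is interval analysis: each neuron is tracked by a lower and an upper bound instead of a concrete value. The sketch below is one such abstraction, not the specific technique the slides have in mind. The two-layer network and its weights are made up for illustration.

```python
def affine_bounds(lo, hi, weights, bias):
    """Propagate interval bounds through y = W x + b, one output row at a time."""
    out_lo, out_hi = [], []
    for row, b in zip(weights, bias):
        # A positive weight takes its minimum at lo[i], a negative one at hi[i].
        out_lo.append(b + sum(w * (lo[i] if w >= 0 else hi[i]) for i, w in enumerate(row)))
        out_hi.append(b + sum(w * (hi[i] if w >= 0 else lo[i]) for i, w in enumerate(row)))
    return out_lo, out_hi

def relu_bounds(lo, hi):
    """ReLU is monotone, so it can be applied to each bound separately."""
    return [max(0.0, v) for v in lo], [max(0.0, v) for v in hi]

def network_bounds(lo, hi, layers):
    """Bounds on the outputs of a ReLU network (no ReLU after the last layer)."""
    for k, (weights, bias) in enumerate(layers):
        lo, hi = affine_bounds(lo, hi, weights, bias)
        if k < len(layers) - 1:
            lo, hi = relu_bounds(lo, hi)
    return lo, hi

# Hypothetical network: y = relu(x1 - x2) - relu(x1 + x2)
layers = [
    ([[1.0, -1.0], [1.0, 1.0]], [0.0, 0.0]),
    ([[1.0, -1.0]], [0.0]),
]
print(network_bounds([0.0, 0.0], [1.0, 1.0], layers))
```

For inputs in the unit square the true output of this network never exceeds 0, yet the computed upper bound is larger. The analysis is sound, since every real output lies inside the reported interval, but it is imprecise, which is the efficiency versus precision trade-off that refinement techniques try to improve.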
