Adversarial Search and Game Playing
CHAPTER 5
ICE 3201
Bangladesh University of Professionals
Environment Type Discussed in This Lecture
[Decision diagram: the environments considered here are fully observable, turn-taking (semi-dynamic), multi-agent, sequential, and discrete, and may be deterministic or non-deterministic; these properties lead to game tree search, as opposed to game matrices or continuous action games.]
Adversarial Search
Examine the problems that arise when we try to
plan ahead in a world where other agents are
planning against us.
A good example is in board games.
Adversarial games, while much studied in AI, are a
small part of game theory in economics.
Typical AI assumptions
Two agents whose actions alternate
Utility values for each agent are the opposite of the other's
this creates the adversarial situation
Fully observable environments
In game theory terms: Zero-sum games of perfect
information.
We’ll relax these assumptions later.
Search versus Games
Search – no adversary
Solution is (heuristic) method for finding goal
Heuristic techniques can find optimal solution
Evaluation function: estimate of cost from start to goal through given node
Examples: path planning, scheduling activities
Games – adversary
Solution is a strategy (a strategy specifies a move for every possible opponent
reply).
Optimality depends on opponent. Why?
Time limits force an approximate solution
Evaluation function: evaluate “goodness” of game position
Examples: chess, checkers, Othello, backgammon
Types of Games
                                               Deterministic                   Chance moves
Perfect information                            Chess, checkers, go, othello    Backgammon, monopoly
Imperfect information (initial chance moves)   Bridge, Skat                    Poker, scrabble, blackjack

(The slide also linked on-line versions of backgammon, chess, and tic-tac-toe.)
• Theorem of Nobel Laureate Harsanyi: Every game with
chance moves during the game has an equivalent representation
with initial chance moves only.
• A deep result, but computationally it is more tractable to
consider chance moves as the game goes along.
• This is basically the same as the issue of full observability +
nondeterminism vs. partial observability + determinism.
Game Setup
Two players: MAX and MIN
MAX moves first and they take turns until the game is over
Winner gets a reward, loser gets a penalty.
Games as search:
Initial state: e.g. board configuration of chess
Successor function: list of (move,state) pairs specifying legal moves.
Terminal test: Is the game finished?
Utility function: Gives numerical value of terminal states. E.g. win (+1), lose
(-1) and draw (0) in tic-tac-toe or chess
MAX uses search tree to determine next move.
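As a concrete illustration of this setup, here is a minimal sketch of tic-tac-toe as a search problem in Python; the class and method names are my own, chosen to mirror the formulation above, not anything prescribed by the lecture:

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple

@dataclass(frozen=True)
class TicTacToe:
    board: Tuple[str, ...] = tuple(" " * 9)  # initial state: empty 3x3 board
    to_move: str = "X"                       # MAX plays X and moves first

    def successors(self) -> List[Tuple[int, "TicTacToe"]]:
        """Legal (move, state) pairs: place to_move's mark in any empty cell."""
        nxt = "O" if self.to_move == "X" else "X"
        return [(i, TicTacToe(self.board[:i] + (self.to_move,) + self.board[i + 1:], nxt))
                for i, c in enumerate(self.board) if c == " "]

    def winner(self) -> Optional[str]:
        lines = [(0, 1, 2), (3, 4, 5), (6, 7, 8), (0, 3, 6),
                 (1, 4, 7), (2, 5, 8), (0, 4, 8), (2, 4, 6)]
        for a, b, c in lines:
            if self.board[a] != " " and self.board[a] == self.board[b] == self.board[c]:
                return self.board[a]
        return None

    def terminal_test(self) -> bool:
        return self.winner() is not None or " " not in self.board

    def utility(self) -> int:
        """Win (+1), lose (-1), draw (0) from MAX's perspective, as on this slide."""
        return {"X": 1, "O": -1, None: 0}[self.winner()]
```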
Size of search trees
b = branching factor
d = number of moves by both players
Search tree is O(b^d)
Chess
b ≈ 35
d ≈ 100
- search tree is ~10^154 (!!)
- completely impractical to search this exhaustively
Game-playing emphasizes being able to make optimal decisions in a finite amount of time
Somewhat realistic as a model of a real-world agent
Even if games themselves are artificial
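A quick back-of-the-envelope check of that 10^154 figure, using the b and d values above:

```python
import math

b, d = 35, 100
# b^d spans roughly d * log10(b) orders of magnitude
print(d * math.log10(b))  # ~154.4, i.e. the chess game tree has ~10^154 nodes
```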
Partial Game Tree for Tic-Tac-Toe
Game tree (2-player, deterministic, turns)
How do we search this tree to find the optimal move?
Minimax strategy: Look ahead and reason backwards
Find the optimal strategy for MAX assuming an
infallible MIN opponent
Need to compute this all the way down the tree
Game Tree Search Demo
Assumption: Both players play optimally!
Given a game tree, the optimal strategy can be
determined by using the minimax value of each
node.
Zermelo 1912.
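For reference, the minimax value of a state can be written as the standard recurrence (stated here in the notation of the pseudocode that follows):

$$
\mathrm{MINIMAX}(s) =
\begin{cases}
\mathrm{UTILITY}(s) & \text{if } \mathrm{TERMINAL\text{-}TEST}(s) \\
\max_{(a,\,s') \in \mathrm{SUCCESSORS}(s)} \mathrm{MINIMAX}(s') & \text{if MAX is to move in } s \\
\min_{(a,\,s') \in \mathrm{SUCCESSORS}(s)} \mathrm{MINIMAX}(s') & \text{if MIN is to move in } s
\end{cases}
$$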
Two-Ply Game Tree
Minimax maximizes the utility for the worst-case outcome for max
The minimax decision
Pseudocode for Minimax Algorithm
function MINIMAX-DECISION(state) returns an action
  inputs: state, current state in game
  v ← MAX-VALUE(state)
  return the action in SUCCESSORS(state) with value v

function MAX-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← −∞
  for a, s in SUCCESSORS(state) do
    v ← MAX(v, MIN-VALUE(s))
  return v

function MIN-VALUE(state) returns a utility value
  if TERMINAL-TEST(state) then return UTILITY(state)
  v ← +∞
  for a, s in SUCCESSORS(state) do
    v ← MIN(v, MAX-VALUE(s))
  return v
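A minimal runnable translation of this pseudocode in Python. The successors, terminal_test, and utility arguments are stand-ins for a concrete game's implementation; the TicTacToe sketch from the Game Setup slide could supply them:

```python
import math

def minimax_decision(state, successors, terminal_test, utility):
    """Return the action in SUCCESSORS(state) whose resulting state
    has the highest MIN-VALUE, mirroring MINIMAX-DECISION above."""
    return max(successors(state),
               key=lambda pair: min_value(pair[1], successors, terminal_test, utility))[0]

def max_value(state, successors, terminal_test, utility):
    if terminal_test(state):
        return utility(state)
    v = -math.inf
    for _action, s in successors(state):
        v = max(v, min_value(s, successors, terminal_test, utility))
    return v

def min_value(state, successors, terminal_test, utility):
    if terminal_test(state):
        return utility(state)
    v = math.inf
    for _action, s in successors(state):
        v = min(v, max_value(s, successors, terminal_test, utility))
    return v
```

For instance, minimax_decision(TicTacToe(), lambda g: g.successors(), lambda g: g.terminal_test(), lambda g: g.utility()) would (slowly, by full tree expansion) compute an optimal opening move for X.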
Minimax algorithm
Example of Algorithm Execution
MAX to move
Minimax Algorithm
Complete depth-first exploration of the game tree
Assumptions:
Max depth = d, b legal moves at each point
E.g., Chess: d ≈ 100, b ≈ 35

Criterion   Minimax
Time        O(b^d)
Space       O(bd)   (depth-first, so linear in depth)
Practical problem with minimax search
Number of game states is exponential in the number of
moves.
Solution: Do not examine every node
=> pruning
Remove branches that do not influence final decision
Revisit example …
Alpha-Beta Example
Do DF-search until first leaf
Range of possible values: [−∞, +∞] at the root and at its first successor
Alpha-Beta Example (continued)
After the first leaf, the first MIN node's range narrows to [−∞, 3]
Alpha-Beta Example (continued)
The remaining leaves confirm the first MIN node's value, [3, 3]; the root's range becomes [3, +∞]
Alpha-Beta Example (continued)
The second MIN node's first leaf gives [−∞, 2]: this node is worse for MAX, so its remaining successors are pruned
Alpha-Beta Example (continued)
The third MIN node's first leaf (14) gives [−∞, 14]; the root's range is [3, 14]
Alpha-Beta Example (continued)
Its next leaf (5) tightens that node to [−∞, 5] and the root to [3, 5]
Alpha-Beta Example (continued)
Its last leaf (2) settles that node at [2, 2]; the children are now [3, 3], [−∞, 2], [2, 2], so the root's value is [3, 3]
Alpha-beta Algorithm
Depth first search – only considers nodes along a single
path at any time
α = highest-value choice that we can guarantee for MAX
so far in the current subtree.
β = lowest-value choice that we can guarantee for MIN so
far in the current subtree.
Update the values of α and β during the search, and prune the remaining
branches at a node as soon as its value is known to be worse than the
current α or β value for MAX or MIN.
Alpha-beta Demo.
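A minimal runnable sketch of the algorithm in Python, with the same hedges as the minimax sketch above (successors, terminal_test, and utility are stand-ins for a concrete game's implementation):

```python
import math

def alpha_beta_decision(state, successors, terminal_test, utility):
    """Choose MAX's move; identical result to minimax, but with pruning."""
    best_action, best_value = None, -math.inf
    for action, s in successors(state):
        v = min_value(s, best_value, math.inf, successors, terminal_test, utility)
        if v > best_value:
            best_action, best_value = action, v
    return best_action

def max_value(state, alpha, beta, successors, terminal_test, utility):
    if terminal_test(state):
        return utility(state)
    v = -math.inf
    for _, s in successors(state):
        v = max(v, min_value(s, alpha, beta, successors, terminal_test, utility))
        if v >= beta:          # MIN above already has a better (lower) option: prune
            return v
        alpha = max(alpha, v)  # only MAX updates alpha
    return v

def min_value(state, alpha, beta, successors, terminal_test, utility):
    if terminal_test(state):
        return utility(state)
    v = math.inf
    for _, s in successors(state):
        v = min(v, max_value(s, alpha, beta, successors, terminal_test, utility))
        if v <= alpha:         # MAX above already has a better (higher) option: prune
            return v
        beta = min(beta, v)    # only MIN updates beta
    return v
```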
Effectiveness of Alpha-Beta Search
Worst-Case
branches are ordered so that no pruning takes place. In this case alpha-beta
gives no improvement over exhaustive search
Best-Case
each player’s best move is the left-most alternative (i.e., evaluated first)
in practice, performance is closer to best rather than worst-case
In practice often get O(b^(d/2)) rather than O(b^d)
this is the same as having a branching factor of sqrt(b),
since (sqrt(b))^d = b^(d/2)
i.e., we have effectively gone from b to square root of b
e.g., in chess go from b ~ 35 to b ~ 6
this permits much deeper search in the same amount of time
Typically twice as deep.
Example
- which nodes can be pruned?
[Tree figure: a MAX root over a MIN layer over MAX nodes, with leaf values 3, 4, 1, 2, 5, 6, 7, 8.]
Alpha Beta Pruning
Alpha-beta pruning is a modified version of the minimax
algorithm: an optimization technique for minimax search.
As we have seen, the number of game states the minimax search
algorithm has to examine is exponential in the depth of the tree.
We cannot eliminate the exponent, but we can effectively cut it in
half. There is a technique by which we can compute the correct
minimax decision without checking every node of the game tree;
this technique is called pruning.
Alpha Beta Pruning
Alpha-beta pruning can be applied at any depth of a tree, and
sometimes it prunes not only leaves but entire subtrees.
The two parameters are defined as:
Alpha: The best (highest-value) choice we have found so far at
any point along the path of Maximizer. The initial value of
alpha is -∞.
Beta: The best (lowest-value) choice we have found so far at
any point along the path of Minimizer. The initial value of beta
is +∞.
Condition for Alpha Beta Pruning
The main condition required for alpha-beta pruning is:
α ≥ β
Key Points
The Max player will only update the value of alpha.
The Min player will only update the value of beta.
While backtracking the tree, node values (not alpha and beta
values) are passed up to parent nodes.
Only the alpha and beta values are passed down to child nodes.
Example
Step 1: The Max player makes the first move from node A, where
α = −∞ and β = +∞. These values of alpha and beta are passed
down to node B, where again α = −∞ and β = +∞, and node B
passes the same values to its child D.
Example
Step 2: At node D, the value of α is calculated, as it is Max's
turn. α is compared first with 2 and then with 3; max(2, 3) = 3
becomes the value of α at node D, and the node's value is also 3.
Step 3: The algorithm now backtracks to node B, where the value
of β changes, as it is Min's turn. β = +∞ is compared with the
available successor's value: min(+∞, 3) = 3, hence at node B,
α = −∞ and β = 3.
Example
In the next step, the algorithm traverses the next successor of
node B, which is node E, and the values α = −∞ and β = 3 are
passed down.
Step 4: At node E, Max takes its turn, and the value of alpha
changes. The current value of alpha is compared with 5, so
max(−∞, 5) = 5; hence at node E, α = 5 and β = 3. Since α ≥ β,
the right successor of E is pruned: the algorithm does not
traverse it, and the value at node E is 5.
Example
Step 5: Next, the algorithm backtracks the tree from node B to
node A. At node A, the value of alpha changes to the maximum
available value, 3, since max(−∞, 3) = 3, with β = +∞. These two
values are now passed to the right successor of A, which is node C.
At node C, α = 3 and β = +∞, and the same values are passed on
to node F.
Step 6: At node F, the value of α is compared with the left
child, 0: max(3, 0) = 3. It is then compared with the right
child, 1: max(3, 1) = 3. α remains 3, but the node value of F
becomes 1.
Example
Step 7: Node F returns the node value 1 to node C. At C, α = 3
and β = +∞; here the value of beta changes: it is compared with
1, so min(+∞, 1) = 1. Now at C, α = 3 and β = 1, which again
satisfies the condition α ≥ β, so the next child of C, which is
G, is pruned, and the algorithm does not compute the entire
subtree under G.
Example
Step 8: C now returns the value 1 to A. The best value for A is
max(3, 1) = 3. The final game tree shows which nodes were
computed and which were never computed. Hence the optimal value
for the maximizer is 3 in this example.
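This walkthrough can be checked mechanically. In the hedged sketch below, the tree encodes the example above: leaves 2 and 3 under D, 5 under E, 0 and 1 under F. E's second leaf and both of G's leaves were never shown, so placeholder values are used; the point is precisely that pruning never reads them:

```python
import math

def alpha_beta(node, alpha, beta, is_max, visited):
    """Generic alpha-beta over a nested-list tree; records every leaf it evaluates."""
    if isinstance(node, int):              # leaf
        visited.append(node)
        return node
    v = -math.inf if is_max else math.inf
    for child in node:
        cv = alpha_beta(child, alpha, beta, not is_max, visited)
        if is_max:
            v, alpha = max(v, cv), max(alpha, cv)
        else:
            v, beta = min(v, cv), min(beta, cv)
        if alpha >= beta:                  # the pruning condition from the slides
            break
    return v

# A = MAX(B, C); B = MIN(D, E); C = MIN(F, G)
# D = MAX(2, 3); E = MAX(5, 9*); F = MAX(0, 1); G = MAX(7*, 5*)   (* = placeholder)
tree = [[[2, 3], [5, 9]], [[0, 1], [7, 5]]]
visited = []
print(alpha_beta(tree, -math.inf, math.inf, True, visited))  # 3, as in Step 8
print(visited)  # [2, 3, 5, 0, 1]: E's second leaf and all of G were pruned
```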
Final Comments about Alpha-Beta Pruning
Pruning does not affect final results
Entire subtrees can be pruned.
Good move ordering improves effectiveness of
pruning
Repeated states are again possible.
Store them in memory: a transposition table
Practical Implementation
How do we make these ideas practical in real game trees?
Standard approach:
cutoff test: (where do we stop descending the tree)
depth limit
better: iterative deepening
cutoff only when no big changes are expected to occur next (quiescence search).
evaluation function
When the search is cut off, we evaluate the current state
by estimating its utility using an evaluation function.
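A hedged sketch of this standard approach in Python: the terminal test is replaced by a cutoff test, and the utility by a heuristic evaluation. cutoff_test and eval_fn are illustrative names, not a specific library's API:

```python
import math

def h_alpha_beta(state, depth, alpha, beta, is_max, successors, cutoff_test, eval_fn):
    """Depth-limited alpha-beta: when the cutoff fires, estimate with eval_fn."""
    if cutoff_test(state, depth):      # e.g. terminal state OR depth limit reached
        return eval_fn(state)
    if is_max:
        v = -math.inf
        for _, s in successors(state):
            v = max(v, h_alpha_beta(s, depth + 1, alpha, beta, False,
                                    successors, cutoff_test, eval_fn))
            if v >= beta:
                return v
            alpha = max(alpha, v)
    else:
        v = math.inf
        for _, s in successors(state):
            v = min(v, h_alpha_beta(s, depth + 1, alpha, beta, True,
                                    successors, cutoff_test, eval_fn))
            if v <= alpha:
                return v
            beta = min(beta, v)
    return v
```

A simple cutoff_test would be lambda s, d: d >= LIMIT or s.terminal_test(); quiescence search would additionally keep expanding positions that still look unstable.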
Static (Heuristic) Evaluation Functions
An Evaluation Function:
estimates how good the current board configuration is for a player.
Typically, one estimates how good the position is for the player and how good it is
for the opponent, and subtracts the opponent's score from the player's.
Othello: Number of white pieces - Number of black pieces
Chess: Value of all white pieces - Value of all black pieces
Typical values from -infinity (loss) to +infinity (win) or [-1, +1].
If the board evaluation is X for a player, it’s -X for the opponent.
Many clever ideas about how to use the evaluation function.
e.g. null move heuristic: let opponent move twice.
Example:
Evaluating chess boards,
Checkers
Tic-tac-toe
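For example, a material-count evaluation for chess might look like this sketch. The piece letters and board encoding are my own assumptions (uppercase for the player, lowercase for the opponent); the weights are the traditional textbook piece values:

```python
# Traditional piece values; kings are not counted
PIECE_VALUES = {"P": 1, "N": 3, "B": 3, "R": 5, "Q": 9}

def material_eval(board):
    """board: iterable of piece letters, one per piece still on the board.
    Returns (player's material) - (opponent's material), as described above."""
    score = 0
    for piece in board:
        value = PIECE_VALUES.get(piece.upper(), 0)
        score += value if piece.isupper() else -value
    return score

print(material_eval(["K", "Q", "P", "k", "r", "r"]))  # 9 + 1 - 5 - 5 = 0
```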
Iterative (Progressive) Deepening
In real games, there is usually a time limit T on making a
move
How do we take this into account?
using alpha-beta we cannot use “partial” results with any confidence
unless the full breadth of the tree has been searched
So, we could be conservative and set a conservative depth-limit
which guarantees that we will find a move in time < T
disadvantage is that we may finish early, could do more search
In practice, iterative deepening search (IDS) is used
IDS runs depth-first search with an increasing depth-limit
when the clock runs out we use the solution found at the previous
depth limit
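A minimal sketch of this scheme, assuming a depth-limited search routine is available (depth_limited_best_move is a placeholder name; a real engine would also poll the clock inside the search, not only between depths):

```python
import time

def iterative_deepening_decision(state, depth_limited_best_move, time_limit):
    """Search with increasing depth limits; when time is up, return the move
    found by the deepest search that completed."""
    deadline = time.monotonic() + time_limit
    best_move, depth = None, 1
    while time.monotonic() < deadline:
        # This whole depth must finish; only then do we trust its result.
        best_move = depth_limited_best_move(state, depth)
        depth += 1
    return best_move
```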
The State of Play
Checkers:
Chinook ended 40-year-reign of human world champion
Marion Tinsley in 1994.
Chess:
Deep Blue defeated human world champion Garry Kasparov in
a six-game match in 1997.
Othello:
human champions refuse to compete against computers: they
are too good.
Go:
human champions refuse to compete against computers: the
programs are too bad; b > 300 (!)
See (e.g.) http://www.cs.ualberta.ca/~games/ for more information
Deep Blue
1957: Herbert Simon
“within 10 years a computer will beat the world chess champion”
1997: Deep Blue beats Kasparov
Parallel machine with 30 processors for “software” and 480
VLSI processors for “hardware search”
Searched 126 million nodes per second on average
Generated up to 30 billion positions per move
Reached depth 14 routinely
Uses iterative-deepening alpha-beta search with
transposition tables
Can explore beyond depth-limit for interesting moves
Summary
Game playing can be effectively modeled as a search problem
Game trees represent alternate computer/opponent moves
Evaluation functions estimate the quality of a given board
configuration for the Max player.
Minimax is a procedure which chooses moves by assuming that
the opponent will always choose the move which is best for them
Alpha-Beta is a procedure which can prune large parts of the
search tree and allow search to go deeper
For many well-known games, computer algorithms based on
heuristic search match or out-perform human world experts.