AI Game Strategy Basics

The document discusses adversarial search algorithms for game playing. It introduces the minimax algorithm, which uses recursion to determine the best move for a player (MAX) assuming a perfect opponent (MIN). It then describes alpha-beta pruning, which improves upon minimax by pruning branches that are known to be suboptimal. Alpha-beta pruning carries alpha and beta values down the tree and prunes when alpha exceeds beta. This allows games to be solved more efficiently than with minimax alone.

Uploaded by

Talha Anjum

Game Playing: Adversarial Search
Outline

Game Playing: Adversarial Search

- Minimax Algorithm
- α-β Pruning Algorithm
Game Playing: Adversarial Search
- Introduction
  - So far, problem solving has meant single-agent search:
    - The machine "explores" the search space by itself.
    - There are no opponents or collaborators.
  - Games generally require multiagent (MA) environments:
    - Each agent needs to consider the actions of the other agents and how they affect its own success.
    - A distinction should be made between cooperative and competitive MA environments.
    - Competitive environments give rise to adversarial search: playing a game against an opponent.
Game Playing: Adversarial Search

- Introduction
  - Why study games?
    - Game playing is fun and is an interesting meeting point for human and computational intelligence.
    - They are hard.
    - They are easy to represent.
    - Agents are restricted to a small number of actions.

- Interesting question: does winning a game absolutely require human intelligence?
Game Playing: Adversarial Search
- Introduction
  - Different kinds of games:

                           Deterministic                  Chance
    Perfect information    Chess, Checkers, Go, Othello   Backgammon, Monopoly
    Imperfect information  Battleship                     Bridge, Poker, Scrabble

  - Deterministic games: no randomness is involved.
  - Games of chance: random factors (dice, shuffled cards) are part of the game.
  - Perfect information: every player sees the complete game state. With imperfect information, part of the state is hidden from a player.
Searching in a two player game
- Traditional (single-agent) search methods only consider how close the agent is to the goal state (e.g. best-first search).
- In two-player games, the decisions of both agents have to be taken into account: a decision made by one agent affects the search space that the other agent needs to explore.
- Question: do we have randomness here, since the decision made by the opponent is NOT known in advance?
  - No, not if the moves or choices that the opponent can make are finite and can be known in advance.
Searching in a two player game

To formalize a two-player game as a search problem, one agent is called MAX and the opponent is called MIN.

Problem Formulation:

- Initial state: the board configuration and the player to move.
- Successor function: a list of (move, state) pairs specifying the legal moves and their resulting states. (The initial state together with the legal moves defines the game tree.)
- Terminal test: decides whether the game has finished.
- Utility function: produces a numerical value for (only) the terminal states. Example: in chess, the outcome is win/loss/draw, with values +1, -1, 0 respectively.

- Players need the search tree to determine their next move.
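The four components of the formulation can be sketched in Python for a tiny stand-in game. The game is an assumption chosen only for brevity and is not from the slides: a pile of sticks, each player removes 1 to 3 sticks, and whoever takes the last stick wins. All function names are hypothetical.

```python
from typing import List, Tuple

# A state is (sticks left, player to move), with player "MAX" or "MIN".
State = Tuple[int, str]

def initial_state(sticks: int = 5) -> State:
    """Initial state: the pile size and the player to move (MAX starts)."""
    return (sticks, "MAX")

def successors(state: State) -> List[Tuple[int, State]]:
    """Successor function: (move, resulting state) pairs for each legal move."""
    sticks, player = state
    other = "MIN" if player == "MAX" else "MAX"
    return [(take, (sticks - take, other)) for take in (1, 2, 3) if take <= sticks]

def is_terminal(state: State) -> bool:
    """Terminal test: the game is over when no sticks remain."""
    return state[0] == 0

def utility(state: State) -> int:
    """Utility from MAX's point of view, defined only for terminal states.

    The player to move in a terminal state did NOT take the last stick,
    so that player has lost.
    """
    sticks, player = state
    if sticks != 0:
        raise ValueError("utility is only defined for terminal states")
    return -1 if player == "MAX" else +1

print(successors(initial_state(2)))  # [(1, (1, 'MIN')), (2, (0, 'MIN'))]
```

Expanding `successors` repeatedly from `initial_state()` until `is_terminal` holds yields the game tree that the search procedures operate on.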

Searching in a two player game
- The search space in game playing is potentially huge, hence the need for optimal strategies.
- The goal is to find the sequence of moves that leads to a win for MAX.
- How do we find the best strategy for MAX, assuming that MIN is an infallible opponent?
- Given a game tree, the optimal strategy can be determined from the MINIMAX-VALUE of each node n. It returns:
  1. The utility value of n, if n is a terminal state.
  2. The maximum of the MINIMAX-VALUEs of all successor nodes s of n, if n is a MAX node.
  3. The minimum of the MINIMAX-VALUEs of all successor nodes s of n, if n is a MIN node.
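The three cases of MINIMAX-VALUE map directly onto a short recursive function. The sketch below assumes a hypothetical tree encoding (an integer is a terminal state carrying its utility, a list is an internal node holding its successors) and that MAX and MIN alternate levels.

```python
from typing import List, Union

# Hypothetical encoding: int = terminal utility, list = internal node.
Tree = Union[int, List["Tree"]]

def minimax_value(node: Tree, is_max: bool) -> int:
    # Case 1: terminal state, return its utility.
    if isinstance(node, int):
        return node
    # The successors belong to the other player.
    values = [minimax_value(child, not is_max) for child in node]
    # Case 2: MAX node takes the maximum; case 3: MIN node takes the minimum.
    return max(values) if is_max else min(values)

# MAX root over three MIN nodes, each with two terminal successors.
tree = [[3, 9], [0, 7], [2, 6]]
print(minimax_value(tree, is_max=True))  # 3
```

The MIN nodes back up min(3, 9) = 3, min(0, 7) = 0 and min(2, 6) = 2, and the MAX root picks the largest of these.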
Minimax Algorithm
- Minimax algorithm
  - Perfect for deterministic, two-player games
  - One player tries to maximize the score (MAX)
  - The other player tries to minimize the score (MIN)
  - Goal: move to the position of highest minimax value
  - Identifies the best achievable payoff against best play
Minimax Algorithm (cont’d)
[Figure: example game tree with MAX and MIN nodes. Leaf utility values: 3 9 0 7 2 6. Values computed by minimax at the MIN nodes: 3 0 2.]
Minimax Algorithm (cont’d)
- Properties of the minimax algorithm (b = branching factor, m = maximum depth of the tree):
  - Complete? Yes (if the tree is finite)
  - Optimal? Yes (against an optimal opponent)
  - Time complexity? O(b^m)
  - Space complexity? O(bm) (depth-first exploration)

Note: For chess, b ≈ 35 and m ≈ 100 for a “reasonable game.”

- An exact solution is completely infeasible

Actually there are only about 10^40 board positions, not 35^100
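To get a feel for these magnitudes, the size of 35^100 can be checked with a few lines of Python. This is only arithmetic on the figures quoted above (b = 35, m = 100).

```python
import math

b, m = 35, 100

# log10(b^m) = m * log10(b) gives the order of magnitude of the tree size.
exponent = m * math.log10(b)
print(f"35^100 is roughly 10^{exponent:.1f}")

# Python integers are exact, so the digits can also be counted directly.
print(len(str(b ** m)), "decimal digits")
```

The exponent comes out a little above 154, so the full game tree is more than a hundred orders of magnitude larger than the roughly 10^40 distinct board positions.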

Minimax Algorithm (cont’d)
- Limitations
  - Not always feasible to traverse the entire tree
  - Time limitations
- Improvements
  - Depth-limited search (cutting the search off below a fixed depth) improves speed
  - Use an evaluation function instead of the utility function
    - The evaluation function provides an estimate of the utility at a given position
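A minimal sketch of this improvement: minimax with a depth limit that falls back on an estimate at the cutoff. The tree encoding and the `evaluate` heuristic are assumptions made for illustration. In particular, this toy heuristic averages the leaves below a node, which a real program could not do cheaply; a real evaluation function would use features of the position instead.

```python
from typing import List, Union

# Hypothetical encoding: int = terminal utility, list = internal node.
Tree = Union[int, List["Tree"]]

def leaves(node: Tree) -> List[int]:
    """Collect all terminal utilities below a node (used only by the toy heuristic)."""
    if isinstance(node, int):
        return [node]
    return [leaf for child in node for leaf in leaves(child)]

def evaluate(node: Tree) -> float:
    """Toy stand-in for an evaluation function: mean utility below the node."""
    values = leaves(node)
    return sum(values) / len(values)

def h_minimax(node: Tree, depth: int, is_max: bool) -> float:
    if isinstance(node, int):   # terminal state: exact utility
        return node
    if depth == 0:              # cutoff reached: use the estimate
        return evaluate(node)
    values = [h_minimax(child, depth - 1, not is_max) for child in node]
    return max(values) if is_max else min(values)

tree = [[3, 9], [0, 7], [2, 6]]
print(h_minimax(tree, 2, True))  # 3   (deep enough to reach every leaf)
print(h_minimax(tree, 1, True))  # 6.0 (cutoff after one ply: max of 6.0, 3.5, 4.0)
```

The two results differ, which illustrates why the quality of the estimate matters: with a shallow cutoff the search is only as good as the evaluation function.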
Problem of Minimax search

- The number of game states is exponential in the number of moves.
  - Solution: do not examine every node ==> alpha-beta pruning
- Alpha = value of the best (highest-value) choice found so far at any choice point along the path for MAX.
- Beta = value of the best (lowest-value) choice found so far at any choice point along the path for MIN.
Alpha-beta Game Playing
Basic idea:

“If you have an idea that is surely bad, don't take the time to see how truly awful it is.” -- Pat Winston

Some branches will never be played by rational players since they include sub-optimal decisions (for either player).

[Figure: MAX root with value >= 2 over two MIN nodes. The first MIN node has leaves 2 and 7 and value = 2. The second MIN node has first leaf 1, so its value is <= 1; its second leaf is marked "?".]
• We don’t need to compute the value at the "?" node.
• No matter what it is, it can’t affect the value of the root node.
α-β Pruning Algorithm
- Principle
  - If a move is determined to be worse than another move already examined, then further examination is pointless
Alpha-Beta Pruning (αβ prune)
- Rules of Thumb
  - α is the highest MAX value found so far
  - β is the lowest MIN value found so far
  - If MIN is on top: alpha prune
  - If MAX is on top: beta prune
  - You will only have alpha prunes at MIN levels
  - You will only have beta prunes at MAX levels
Properties of α-β Prune
- Pruning does not affect the final result
- Good move ordering improves the effectiveness of pruning
- With "perfect ordering," time complexity = O(b^(m/2))
  - This doubles the depth of search
General description of α-β pruning algorithm

- Traverse the search tree in depth-first order
- At each MAX node n, alpha(n) = maximum value found so far
  - Starts at -infinity and only increases.
  - Increases if a child of n returns a value greater than the current alpha.
  - Serves as a tentative lower bound on the final payoff.
- At each MIN node n, beta(n) = minimum value found so far
  - Starts at +infinity and only decreases.
  - Decreases if a child of n returns a value less than the current beta.
  - Serves as a tentative upper bound on the final payoff.
- beta(n) for a MAX node n: the smallest beta value of its MIN ancestors.
- alpha(n) for a MIN node n: the greatest alpha value of its MAX ancestors.
General description of α-β pruning algorithm

- Carry alpha and beta values down during the search
  - alpha can be changed only at MAX nodes
  - beta can be changed only at MIN nodes
- Pruning occurs whenever alpha >= beta
- beta cutoff:
  - Given a MAX node n, cut off the search below n (i.e., don't generate any more of n's children) if alpha(n) >= beta(n) (alpha increases and passes beta from below)
- alpha cutoff:
  - Given a MIN node n, cut off the search below n (i.e., don't generate any more of n's children) if beta(n) <= alpha(n) (beta decreases and passes alpha from above)
α-β Pruning Algorithm

function ALPHA-BETA-SEARCH(state) returns an action
    inputs: state, current state in game
    v ← MAX-VALUE(state, -∞, +∞)
    return the action in SUCCESSORS(state) with value v

function MAX-VALUE(n, alpha, beta) returns a utility value
    if n is a leaf node then return f(n)
    for each child n’ of n do
        alpha := max{alpha, MIN-VALUE(n’, alpha, beta)}
        if alpha >= beta then return beta   /* pruning */
    end{do}
    return alpha

function MIN-VALUE(n, alpha, beta) returns a utility value
    if n is a leaf node then return f(n)
    for each child n’ of n do
        beta := min{beta, MAX-VALUE(n’, alpha, beta)}
        if beta <= alpha then return alpha   /* pruning */
    end{do}
    return beta
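The pseudocode above translates almost line for line into Python. The sketch below keeps the same return-the-bound behaviour on a cutoff. The tree encoding (an integer is a leaf with value f(n), a list is an internal node) and the `visited` list, which records the leaves actually examined, are assumptions added for illustration.

```python
import math
from typing import List, Union

# Hypothetical encoding: int = leaf value f(n), list = children of n.
Tree = Union[int, List["Tree"]]

def max_value(n: Tree, alpha: float, beta: float, visited: List[int]) -> float:
    if isinstance(n, int):          # leaf node: return f(n)
        visited.append(n)
        return n
    for child in n:
        alpha = max(alpha, min_value(child, alpha, beta, visited))
        if alpha >= beta:           # pruning: MIN above will never allow this
            return beta
    return alpha

def min_value(n: Tree, alpha: float, beta: float, visited: List[int]) -> float:
    if isinstance(n, int):          # leaf node: return f(n)
        visited.append(n)
        return n
    for child in n:
        beta = min(beta, max_value(child, alpha, beta, visited))
        if beta <= alpha:           # pruning: MAX above already has better
            return alpha
    return beta

tree = [[3, 9], [0, 7], [2, 6]]
visited: List[int] = []
root = max_value(tree, -math.inf, math.inf, visited)
print(root, visited)  # 3 [3, 9, 0, 2]
```

After the first MIN node returns 3, alpha at the root is 3. In the second MIN node the leaf 0 already makes beta <= alpha, so 7 is never examined; likewise 6 is skipped after 2 is seen.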
Evaluating Alpha-Beta algorithm

- Alpha-beta is guaranteed to compute the same value for the root node as minimax.
- Worst case: NO pruning; O(b^d) leaf nodes are examined, where each node has b children and a d-ply search is performed.
- Best case: only O(b^(d/2)) leaf nodes are examined. You can search twice as deep as minimax! Equivalently, the branching factor is b^(1/2) rather than b.
- The best case occurs when each player's best move is the leftmost alternative, i.e. at MAX nodes the child with the largest value is generated first, and at MIN nodes the child with the smallest value is generated first.
- In Deep Blue, they found empirically that alpha-beta pruning meant that the average branching factor at each node was about 6 instead of about 35-40.
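The effect of move ordering can be seen even on a two-ply tree. The sketch below is a compact alpha-beta that also counts examined leaves; it returns the best value found rather than the bound (a "fail-soft" formulation, named here because it differs slightly from the pseudocode style used earlier). The two trees are hypothetical and contain the same leaves in different orders.

```python
import math
from typing import List, Tuple, Union

Tree = Union[int, List["Tree"]]

def alphabeta(node: Tree, alpha: float = -math.inf, beta: float = math.inf,
              is_max: bool = True) -> Tuple[float, int]:
    """Return (minimax value, number of leaf nodes examined)."""
    if isinstance(node, int):
        return node, 1
    best = -math.inf if is_max else math.inf
    count = 0
    for child in node:
        value, examined = alphabeta(child, alpha, beta, not is_max)
        count += examined
        if is_max:
            best = max(best, value)
            alpha = max(alpha, best)
        else:
            best = min(best, value)
            beta = min(beta, best)
        if alpha >= beta:   # remaining children cannot change the result
            break
    return best, count

good_order = [[3, 9], [0, 7], [2, 6]]   # best moves generated first
bad_order = [[7, 0], [6, 2], [9, 3]]    # best moves generated last
print(alphabeta(good_order))  # (3, 4)
print(alphabeta(bad_order))   # (3, 6)
```

Both orderings give the same root value, but the well-ordered tree needs 4 leaf evaluations while the badly ordered one examines all 6, which is the no-pruning worst case.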
Evaluation Function
- Evaluation function
  - Performed at the search cutoff point
  - Must agree with the utility function on terminal/goal states
  - Tradeoff between accuracy and time → reasonable complexity
  - Accurate
    - Performance of a game-playing system depends on the accuracy/goodness of its evaluation
    - Evaluation of nonterminal states should be strongly correlated with the actual chances of winning
Evaluation functions
- For chess, typically a linear weighted sum of features
  - Eval(s) = w1 f1(s) + w2 f2(s) + … + wn fn(s)
  - e.g., w1 = 9 with f1(s) = (number of white queens) – (number of black queens), etc.
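A minimal sketch of such a weighted sum, restricted to material features. Only the queen weight of 9 comes from the slide; the other weights are the conventional textbook piece values and are an assumption here, as are the piece-count dictionaries used to describe a position.

```python
from typing import Dict

# w_i for each piece type. Queen = 9 is from the slide; the rest are
# conventional values assumed for illustration.
WEIGHTS: Dict[str, int] = {"Q": 9, "R": 5, "B": 3, "N": 3, "P": 1}

def material_eval(white: Dict[str, int], black: Dict[str, int]) -> int:
    """Eval(s) = sum_i w_i * f_i(s), with f_i = (white count) - (black count)."""
    return sum(
        weight * (white.get(piece, 0) - black.get(piece, 0))
        for piece, weight in WEIGHTS.items()
    )

# Hypothetical position: White is up a rook but down a bishop and a pawn.
white = {"Q": 1, "R": 2, "P": 6}
black = {"Q": 1, "R": 1, "B": 1, "P": 7}
print(material_eval(white, black))  # 9*0 + 5*1 + 3*(-1) + 3*0 + 1*(-1) = 1
```

A positive score favours White. Positional features such as king safety or pawn structure would enter as further f_i terms with their own weights.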

Key challenge – find a good evaluation function:

Isolated pawns are bad.
How well protected is your king?
How much maneuverability do you have?
Do you control the center of the board?
Strategies change as the game proceeds
State-of-the-Art
Checkers: Tinsley vs. Chinook

Name: Marion Tinsley
Profession: Teaches mathematics
Hobby: Checkers
Record: Over 42 years, loses only 3 games of checkers
World champion for over 40 years

Mr. Tinsley suffered his 4th and 5th losses against Chinook
Chinook

First computer to become official world champion of Checkers!

Chess: Kasparov vs. Deep Blue

               Kasparov              Deep Blue
Height         5’10”                 6’5”
Weight         176 lbs               2,400 lbs
Age            34 years              4 years
Computers      50 billion neurons    32 RISC processors + 256 VLSI chess engines
Speed          2 pos/sec             200,000,000 pos/sec
Knowledge      Extensive             Primitive
Power Source   Electrical/chemical   Electrical
Ego            Enormous              None

1997: Deep Blue wins the match 3.5-2.5 (2 wins, 1 loss, and 3 draws)

Chess: Kasparov vs. Deep Junior

Deep Junior

8 CPU, 8 GB RAM, Win 2000
2,000,000 pos/sec
Available at $100

February 2003: Match ends in a 3/3 tie!

Secrets
- Many game programs are based on alpha-beta + iterative deepening + extended/singular search + transposition tables + huge databases + ...
- For instance, Chinook searched all checkers configurations with 8 pieces or fewer and created an endgame database of 444 billion board configurations
- The methods are general, but their implementation is dramatically improved by many specifically tuned-up enhancements (e.g., the evaluation functions), like an F1 racing car
Summary

- A game can be defined by the initial state, the operators (legal moves), a terminal test and a utility function (outcome of the game).
- In a two-player game, the minimax algorithm can determine the best move by enumerating the entire game tree.
- The alpha-beta pruning algorithm produces the same result but is more efficient because it prunes away irrelevant branches.
- Usually, it is not feasible to construct the complete game tree, so the utility value of some states must be estimated by an evaluation function.
Game Playing: Alpha-beta pruning example
[Figures: step-by-step alpha-beta pruning example on a game tree]
