A Bayesian Game Theory Decision Model of
A Bayesian Game Theory Decision Model of
Harris Corporation,
Government Communications Systems Division
Melbourne, Florida 32904
Abstract─ We describe a system model for determining decision Economic factors (e.g. unemployment rates, prices for
making strategies based upon the ability to perform data mining food, such as bread, or fuel), Political factors (freedoms, type
and pattern discovery utilizing open source information to of government), Religious factors (type of religions, religious
prepare for specific events or situations from multiple
information sources. Within this paper, we discuss the
tensions) combined with trend information such as sentiment
development of a method for determining actionable information. analysis on social media, open source data, news, etc. can
We have integrated open source information linking to human provide indicators of areas undergoing stress or at risk.
sentiment and manipulated other user selectable interlinking Current situational awareness requires efforts to seek to
relative probabilities for events based upon current knowledge. incorporate not only geospatial features and forces structures,
Probabilistic predictions are critical in practice on many decision but also the human element, especially in urban settings. An
making applications because optimizing the user experience attempt to predict the likelihood of reaction to a future event
requires being able to compute the expected utilities of mutually
exclusive pieces of content. Hierarchy game theory for decision
will be based on correct situation analysis. Efforts to combine
making is valuable where two or more agents seek their own the information required for these predictions are time
goals, possibilities of conflicts, competition and cooperation. The consuming and labor intensive. The availability of open source
quality of the knowledge extracted from the information available social media information and implementation of artificial
is restricted by complexity of the model. Hierarchy game theory intelligence (AI) methodologies makes this problem tractable.
framework enables complex modeling of data in probabilistic Our GlobalSite system, shown in Fig 1, can also be used as a
modeling. However, applicability to big data is complicated by the method for asset management and reduce cost of analyses.
difficulties of inference in complex probabilistic models, and by
computational constraints. We focus on applying probabilistic
models to resource distribution for emergency response.
Hierarchical game theory models interactions where a situation
affects players at multiple levels. Our paper discusses the effect of
optimizing the selection of specific areas to help first responders
and determine optimal supply route planning. Additionally we
discuss two levels of hierarchies for decision making including
entry decisions and quantitative Bayes modeling based on
incomplete information.
Index Terms—Game theory, Resource Management, Decision
Making, Operations Research
I. INTRODUCTION
Game theory is the study of strategic decision making. It is
the study of mathematical models of conflict and cooperation
between intelligent rational decision-makers and is often
thought of as an interactive decision theory. It has been applied
to economics, political science, psychology, logic, biology and
other complex issues. Modern game theory began with the idea Fig1. Overview
regarding the existence of mixed-strategy equilibrium in two-
person zero-sum games, applied to economics. Later this As an example, consider the recent case of Typhoon
evolved to provide a theory of expected utility, which allowed Haiyan, which devastated portions of the Philippines in early
mathematicians and economists to treat decision-making with November 2013. Weather data and hurricane/typhoon forecast
uncertainty. The notion of probabilistic predictions utilizing models could be used to project the path of the storm, and
game theory is critical in practice to many decision making anticipate areas that may be affected. This could lead to
applications because optimizing user experience requires being enriching Foundation GEOINT content for the Philippines in
able to compute the expected utilities of mutually exclusive anticipation of the event (landfall of Typhoon Haiyan), as well
pieces of data which is critical to geospatial analytics. as collection of additional data after the event to detect
changes, assess damage, and support Disaster quicker than if relief workers just ran into the Philippines with
Relief/Humanitarian Aid. For instance, change detection may no preparation or information [9].
reveal roads have been washed out, presenting logistics The crowdsourcing involved people from all around the
problems for the delivery of aid to folks in need. world who viewed satellite images from space and provided
The Philippines has strategic importance to the U.S. as part relief agencies with their knowledge of the changes that had
of the strategy plans to counterbalance China’s rising military occurred on the ground after the storm passed. Officials from
influence with strong American allies in the region. The U.S. the United Nations Office for the Coordination of
and the Philippines are in the middle of negotiating an Humanitarian Affairs (OCHA) coordinated the effort to get
increased American military presence in the country [8]. volunteers to help with the aid relief. Doctors Without Borders
received updated maps generated by over 1,000
II. OPEN SOURCE DATA OpenStreetMap volunteers in 82 countries. They identified
The internet has forever changed the way people are able hospital locations, which buildings were intact and which were
to respond to a disaster. Now, a person, business, damaged, blocked roads, and other key infrastructure [10].
or organization can create a call to action that generates Technological advances in sensing, computation, storage,
millions of dollars’ worth of donations in money, food, and and communications will turn the near-ubiquitous mobile
even volunteer power in a matter of minutes. This can happen phone into a global mobile sensing device. People-centric
via an email, a button on a website, or a YouTube video that sensing will help drive this trend by enabling a different way to
goes viral. We have seen this during disastrous events like sense, learn, visualize, and share information about ourselves,
Hurricane Katrina, the 2010 earthquake in Haiti, or the recent friends, communities, the way we live, and the world we live
typhoon in the Philippines. The word, “crowdsourcing,” is a in. It juxtaposes the traditional view of mesh sensor networks
combination of two words, crowd and outsourcing. Thus with one in which people, carrying mobile devices, enable
crowdsourcing, as it applies to disaster response, is the process opportunistic sensing coverage [3].
Since people centric sensing began, content provided by
of gathering work or funding via the internet to benefit a
ordinary people, so-called "citizen journalists" or individuals
particular person, organization, or event [9].
with particular agendas that is posted or shared on Social
What makes crowdsourcing so important is the belief that Networks such as Twitter, YouTube, Facebook, MySpace or
more heads are better than one. Using the canned food drive as Flickr, to name but a few, has increasingly made it into the
an example, if you were to do the work without the internet, channels and services of traditional information providers such
you would have to run around town to various homes and as news organizations. New and affordable publishing and
businesses and ask individuals if they would like to participate. distribution tools for ordinary citizens such as Social
This would take up too much time and man power. The Networks, blogs, or services have made this possible. Social
internet can be used to send email to friends, who would then Networks have more and more become an integral part of the
pass the word on to their friends. An online donation campaign communication mix for all kinds of aims, for example
can be created where one can make a short video as to why (political) campaigning, and awareness-raising [4]. See for
people should donate to a cause [9]. example, Fox News revamped its newsroom for Shephard
The recent typhoon in the Philippines has seen an exciting Smith Reporting on breaking news, such as December 2013
change in how crowdsourcing can assist in disaster response. shooting at Arapahoe High School in Colorado. Open source
Rather than sit and wait for heads of organizations and data is valuable in order to populate the reward matrices for
governments to dictate what is needed on the ground, people game theory applications.
are able to assist first responders in the very work of saving
lives, both directly and indirectly. Through the use of powerful III. GAME THEORY
technology, people are able to track weather patterns that are Current situational awareness efforts seek to incorporate
more accurate than anything you will find on the evening not only geospatial features and structures, but also the human
news. Geography buffs are able to use satellite imaging element, especially in urban settings. An attempt to predict the
technology to create maps and locate where people are likelihood of human reactions to a future event should be based
stranded and in desperate need of food and water. There are on correct situational analysis. Development of tools for more
even examples of people who have been able to locate others rapid refinement of flexible plans is required for adapting to a
who were buried under debris. This kind of response is a much changing operational environment.
more aggressive response to a disaster [9]. Our solution populates a reward matrix in near real time
Social media tools like Twitter and Facebook, traditionally through powerful game theory analysis. Once data accuracy is
looked upon as a game for kids has been useful to relief proven through sensitivity analysis, the information is can
workers as well. The group Standby Task Force has been able either be used as training data or populated into a reward
to gather over a million tweets, text messages, and other social matrix in real time for resource allocation and adversarial
media updates to track the extent of the damage in near real planning utilizing game theory analysis. Our techniques enable
time. They were able to create a map using the assistance of a methodical approach to intelligent planning and reaction
hashtags that allowed them to gather the information much based upon construction and analysis of a decision model
resulting in a structure of the most probable solution. This IV. EMERGENCY RESPONSE EXAMPLE
technique is useful for a number of applications ranging from
A. Resource Planning
behavioral economics, war fighter planning, and analysis of
information, messaging, and risk management. Our system In our example, there are several resource management stages
supports an artificial intelligence (AI) supervised learning or hierarchies as shown in Fig 2. These stages include information
needs, collection objectives, observables, tasks and plans. The
approach to quantify information based on user selectable
resource management process seeks to decompose information
attributes and deriving probabilistic decision outcomes. Our
needed to satisfy mission objectives into one or more tasks. The
approach trains with near real time execution.
essence of resource management is uncertainty management [13].
Our solution integrates multiple data sources into efficient Resource allocation problems in which limited resources must
intent analysis processes and uses training data to build the be allocated among several activities are often solved by
decision trees to predict categories for new events based upon dynamic or linear programming. Operations Research is a
classifiers created for the use case scenario. Given an event, we branch of mathematics that studies decision making to obtain
predict a category and then determine sentiment based on the best decision. Game theory can help determine the optimal
trained data. This information could then be applied during investment strategy [24].
planning in support of course of action (COA) development in
the military decision making process (MDMP).
The approach combines the following input: open
(unstructured) source, and/or direct user input/modification. In
particular, we capture and model “sentiment” and other
situational factors through the assignment of positive, neutral
and negative values. A reward matrix is then populated using
game theoretic concepts such as in a competitive game model.
GlobalSite utilizes game theory which permits the ability to
solve for iterative solutions, instantaneous visual feedback, and
interactions by the user on demand. Our output enables a
methodical approach to intelligent planning and reaction
including interaction of variables, parameters and attributes by
user resulting in updated probabilities. Game theory is useful
for resource management of manpower, equipment, and
warnings, etc., since it shows optimal decision for deployment. Fig 2. Bayesian Hierarchy
In many situations, the opponents know the strategy that
they are following. We assume that the players know what Our solution populates a reward matrix in near real time
actions are available. A maximin equilibrium often is the through powerful game theory analysis. Once data accuracy is
strategy and is called the Nash theory application of zero or proven through sensitivity analysis, the information can either
constant sum strategy game. We also consider a constant sum be used as training data or populated into a reward matrix for
game in which for both player’s strategies, the two player’s resource allocation and adversarial planning utilizing game
reward add up to a constant value. This means, while both theory concepts such as in a competitive or cooperative game
players are in conflict, that there is more to gain than simply model. Much of the current focus is on human geography and
having one player’s reward equaling the other player’s loss. terrain as well as population based sentiment analysis [17].
We can find optimal strategies for this two-person zero-
sum game [24]. For example, if a reward matrix exists, then
the equilibrium point is the one where the reward is the
smallest value in its row and the largest number in its column.
A pure strategy provides a complete definition of how a player
will play a game. A player's strategy set is the set of pure
strategies available to that player. A mixed strategy is an
assignment of a probability to each pure strategy. This
equilibrium is also known as the Nash Equilibrium [15].
Game theory is divided into two branches, non-
cooperative and cooperative [2]. Algorithms for computing
Nash equilibrium are well-studied. N-player games are
computationally harder than 2-player games, in important ways Fig 3. Reward Matrix
such as visualization of the solution [11].
Figure 3 shows populated example values for a resource
planning game. We use the Nash equilibrium to solve for the
mixed solutions in a repeatable and methodical manner to weights to a file allows for peer review in order to check and
determine optimal choices. In our example, open source data is validate decisions. Our approach is modeled, so that the
used to create a cost function. In our example using the reward process can be repeated to allow for new or higher quality
matrix, we show the linear programming solution for the data/information to be inserted into the process to generate
constant sum game as follows: updated results.
(1−p)q
P(¬U|i) =
(1−p)q + r(1−q)
pq (1−p)q
≤z ≤
𝑝𝑞 + (1−𝑟)(1−𝑞) (1−𝑝)𝑞 + 𝑟(1−𝑞)
If another country is not motivated to give, then a country may Fig 9. Adversarial Planning
not be as motivated to give (or give as much):
V. CONCLUSION [7] Huang, H., “Introduction to Game Theory Lecture Note 7:
Bayesian Games”, University of California, Merced, Fall 2011.
No decision is ever 100% correct; however, understanding [8]http://world.time.com/2013/11/18/typhoon-haiyan-u-s-pledges-
the effects of algorithmic decisions based upon multiple more-aid-to-philippines-recovery/
variables, attributes, or factors and strategies with probability [9]http://www.innocentive.com/blog/2013/11/25/the-impact-of-
assignments can increase the probability for the best decision crowdsourcing-on-typhoon-haiyan-response/
for a particular situation or event. GlobalSite can perform open [10]http://www.21stcentech.com/communications-update-
source discovery and data mining activities to parse crowdsourcing-relief-agencies-typhoon-haiyan/
information found from disparate, non-obvious, and previously [11] Kearns, Michael, Michael L. Littman, and Satinder Singh.
"Graphical models for game theory." Proceedings of the Seventeenth
unknown data sources and allows for the user to dial the conference on Uncertainty in artificial intelligence. Morgan
weighting factors based upon their knowledge or expertise. Kaufmann Publishers Inc., 2001.
We discussed a method for modeling asset management [12] Li, J., Learning average reward irreducible stochastic games:
with limited resources. We realize that solution presented is analysis and applications, PhD thesis, University of South Florida,
Department of Industrial and Mgmt System Engineering, 2003.
only a guide and is not intended to replace the human brain in
[13] Liggins, Hall, Llinas, “Handbook of Multisensor Data Fusion,
decision making. We offer a user assisted means of Theory, and Practice”, 2nd Edition, 2009.
prioritization to make agent and resources more effective. [14] Lo, Chih-Yao, Yu-Teng Chang, “Strategic analysis and model
Automated game theory is promising for automatically solving construction on conflict resolution with motion game theory,” Journal
real world strategies and helps the security analyst make of Information and Organizational Sciences, 34.1, 2010, 117–132.
optimal decisions for target tracking and detection activities. [15] Nash, John (1951) "Non-Cooperative Games" The Annals of
Automated processing techniques are needed to augment Mathematics 54(2):286-295.
tactical intelligence-analysis capabilities by identifying and [16] Picard, Rick, Todd Graves, and Jane Booker, “Stability modeling
and game-theoretic considerations,” Los Alamos National Laboratory
recognizing features of obstruction. Technical Report, 1999.
We have identified a mathematical application using linear [17] Rahmes, M., Wilder, K., Yates, H., Fox, K., “Near Real Time
programming optimization. Our solution provides the ability to Discovery and Conversion of Open Source Information to a Reward
populate a reward matrix from remotely sensed data. We Matrix”, WMSCI 2013, 12 July 2013.
calculate optimal strategies for path optimization which [18] Razavian, Adam A., and Junping Sun. "Cognitive based adaptive
path planning algorithm for autonomous robotic vehicles." In
increases likelihood of best decision available using game SoutheastCon, 2005. Proceedings. IEEE, pp. 153-160. IEEE, 2005.
theory in a constant sum game. We combine a number of [19] Sermanet, Pierre, Raia Hadsell, Marco Scoffier, Urs Muller, and
technologies for data fusion/ visualization. Our solution is a Yann LeCun. "Mapping and planning under uncertainty in mobile
multi-use application: course of action (COA) planning, robots with long-range perception." In Intelligent Robots and Systems,
strategies, resource management, risk assessment, etc. 2008. IROS 2008. IEEE/RSJ International Conference on, pp. 2525-
2530. IEEE, 2008.
Automated processing techniques are needed to augment
[20] Singh, S. P., V. Soni, M. P. Wellman, “Computing approximate
tactical intelligence-analysis capabilities by identifying and bayes-nash equilibria in tree-games of incomplete information”;
recognizing patterns, weighting them appropriately, providing Proceedings of 5th ACM Conference on Electronic Commerce, 2004,
near real time objective decisions where the user can interact pp. 81–90.
with the information based upon their experiences and [21] Skoglar, Per. "UAV path and sensor planning methods for
multiple ground target search and tracking-A literature survey."
knowledge base. GlobalSite is a probabilistic decision solution Department of Electrical Engineering, Linköping University, Tech.
which allows for users to interact with information in near real Rep (2007).
time using game theory to provide a reward matrix of the best [22] Slantchev, B., “Game Theory: Static and dynamic games of
possible outcomes. incomplete information,” Dept of Political Science, Univ San Diego,
May 15, 2008.
REFERENCES [23] Strode, Christopher. "Optimising multistatic sensor locations
[1] Aghassi, M., and D.Bertsimas, “Robust game theory,” using path planning and game theory." Computational Intelligence for
Mathematical Programing, Ser. B, vol. 107, 2006, pp. 231–273. Security and Defense Applications (CISDA), 2011 IEEE Symposium
on. IEEE, 2011.
[2] Brandenburger, Adam. "Cooperative Game Theory." Teaching
Materials at New York University (2007). [24] Wayne Winston, Operations Research Applications and
Algorithms 4th. Edition, 2003.
[3] Campbell, A. T., Eisenman, S. B., Lane, N. D., Miluzzo, E.,
Peterson, R. A., Lu, H., ... & Ahn, G. S. (2008). “The rise of people- [25] Zamir, Shmuel. Bayesian games: Games with incomplete
centric sensing”. Internet Computing, IEEE, 12(4), 12-21. information. Springer New York, 2009.
[4] Diplaris, S., Papadopoulos, S., Kompatsiaris, I., Goker, A.,
Macfarlane, A., Spangenberg, J., ... & Klusch, M. (2012, April).
SocialSensor: sensing user generated input for improved media
discovery and experience. In Proceedings of 21st intl conference
companion on World Wide Web (pp. 243-246). ACM.
[5] Ganzfried, S., and T. Sandholm, “Game theory-based opponent
modeling in large imperfect-information games,” International
Conference on Autonomous Agents and Multi-Agent Systems
(AAMAS), 2011.
[6] Hu, Hong, and Harborne Stuart, Jr., “An epistemic analysis of the
Harsanyi transformation,” Intl Journal Game Theory, 2002, pp. 517–
525.