Sensitivity of Bayesian Networks to Errors in Their Structure
<p>The BN models learned from the <span class="html-italic">Breast Cancer</span> data set: using the BSA algorithm (<b>left</b>) and using the ANB algorithm (<b>right</b>).</p> "> Figure 2
<p>The ACC of the <span class="html-small-caps">Breast Cancer</span>, <span class="html-small-caps">Cardiotocography</span>, <span class="html-small-caps">Dermatology</span>, <span class="html-small-caps">HCV</span>, <span class="html-small-caps">Hepatitis</span>, <span class="html-small-caps">Lymphography</span>, <span class="html-small-caps">Primary Tumor</span>, and <span class="html-small-caps">SPECT Heart</span> models as a function of the percentage of nodes removed. The colors indicate the ascending, random, and descending order of the cross-entropy between the class node and the removed nodes.</p> "> Figure 3
<p>The ROC curves for the <span class="html-small-caps">Cardiotocography</span> model when 0%, 20%, and 40% of the nodes have been removed in a descending order.</p> "> Figure 4
<p>The AUC of the <span class="html-small-caps">Breast Cancer</span>, <span class="html-small-caps">Cardiotocography</span>, <span class="html-small-caps">Dermatology</span>, <span class="html-small-caps">HCV</span>, <span class="html-small-caps">Hepatitis</span>, <span class="html-small-caps">Lymphography</span>, <span class="html-small-caps">Primary Tumor</span>, and <span class="html-small-caps">SPECT Heart</span> models as a function of the percentage of nodes removed. The colors indicate the ascending, random, and descending order of the cross-entropy between the class node and the removed nodes.</p> "> Figure 5
<p>The AUC of the <span class="html-small-caps">Breast Cancer</span>, <span class="html-small-caps">Cardiotocography</span>, <span class="html-small-caps">Dermatology</span>, <span class="html-small-caps">HCV</span>, <span class="html-small-caps">Hepatitis</span>, <span class="html-small-caps">Lymphography</span>, <span class="html-small-caps">Primary Tumor</span>, and <span class="html-small-caps">SPECT Heart</span> models as a function of the percentage of edges removed. The colors indicate the ascending, random, and descending order of the strengths of the removed edges.</p> "> Figure 6
<p>The AUC of the <span class="html-small-caps">Breast Cancer</span>, <span class="html-small-caps">Cardiotocography</span>, <span class="html-small-caps">Dermatology</span>, <span class="html-small-caps">HCV</span>, <span class="html-small-caps">Hepatitis</span>, <span class="html-small-caps">Lymphography</span>, <span class="html-small-caps">Primary Tumor</span>, and <span class="html-small-caps">SPECT Heart</span> models as a function of the percentage of edges reversed. The colors indicate the ascending, random, and descending order of the strengths of the reversed edges.</p> "> Figure 6 Cont.
<p>The AUC of the <span class="html-small-caps">Breast Cancer</span>, <span class="html-small-caps">Cardiotocography</span>, <span class="html-small-caps">Dermatology</span>, <span class="html-small-caps">HCV</span>, <span class="html-small-caps">Hepatitis</span>, <span class="html-small-caps">Lymphography</span>, <span class="html-small-caps">Primary Tumor</span>, and <span class="html-small-caps">SPECT Heart</span> models as a function of the percentage of edges reversed. The colors indicate the ascending, random, and descending order of the strengths of the reversed edges.</p> ">
Abstract
:1. Introduction
2. Materials and Methods
2.1. Medical Data Sets
2.2. Bayesian Network Models
2.3. Experimental Design
2.4. Node Removal
2.5. Edge Removal
2.6. Edge Reversal
3. Discussion
Author Contributions
Funding
Institutional Review Board Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
ACC | Diagnostic accuracy (the proportion of correct diagnoses and a measure of a model’s quality) |
ANB | Augmented Naive Bayes (a structure learning algorithm) |
AI | Artificial Intelligence |
API | Application Programming Interface |
AUC | Area Under the (ROC) Curve (a measure of model’s ability to detect a single class) |
BN | Bayesian Network |
BSA | Bayesian Search (a structure learning algorithm) |
CPT | Conditional Probability Table |
VOI | Value Of Information (a measure of the worth/importance of a potential observation variable) |
References
- Henrion, M.; Breese, J.S.; Horvitz, E.J. Decision Analysis and Expert Systems. AI Mag. 1991, 12, 64–91. [Google Scholar]
- Morgan, M.G.; Henrion, M. Uncertainty: A Guide to Dealing with Uncertainty in Quantitative Risk and Policy Analysis; Cambridge University Press: Cambridge, UK, 1990. [Google Scholar]
- Pearl, J. Probabilistic Reasoning in Intelligent Systems: Networks of Plausible Inference; Morgan Kaufmann Publishers, Inc.: San Mateo, CA, USA, 1988. [Google Scholar]
- Henrion, M.; Pradhan, M.; del Favero, B.; Huang, K.; Provan, G.; O’Rorke, P. Why Is Diagnosis Using Belief Networks Insensitive to Imprecision In Probabilities? In Proceedings of the Twelfth Conference on Uncertainty in Artificial Intelligence (UAI-1996), Portland, OR, USA, 1–4 August 1996; pp. 307–314. [Google Scholar]
- Oniśko, A.; Druzdzel, M.J. Sensitivity of Bayesian Networks to Noise in Their Parameters. Entropy 2024, 26, 963. [Google Scholar] [CrossRef]
- Pradhan, M.; Henrion, M.; Provan, G.; del Favero, B.; Huang, K. The Sensitivity of Belief Networks to Imprecise Probabilities: An Experimental Investigation. Artif. Intell. 1996, 85, 363–397. [Google Scholar] [CrossRef]
- Oniśko, A.; Druzdzel, M.J. Impact of Bayesian network model structure on the accuracy of medical diagnostic systems. In Artificial Intelligence and Soft Computing, Proceedings of the 13th International Conference (ICAISC 2014), Zakopane, Poland, 1–5 June 2014; Rutkowski, L., Korytkowski, M., Scherer, R., Tadeusiewicz, R., Zadeh, L.A., Zurada, J.M., Eds.; Reprinted in Springer Lecture Notes in Computer Science; Lecture Notes in Artificial Intelligence, LNAI 8468; Springer: Berlin/Heidelberg, Germany, 2014; pp. 167–178. [Google Scholar]
- Druzdzel, M.J.; van der Gaag, L.C. Building probabilistic networks: ‘Where do the numbers come from?’ Guest editors’ introduction. IEEE Trans. Knowl. Data Eng. 2000, 12, 481–486. [Google Scholar] [CrossRef]
- Oniśko, A.; Druzdzel, M.J. Impact of precision of Bayesian networks parameters on accuracy of medical diagnostic systems. Artif. Intell. Med. 2013, 57, 197–206. [Google Scholar] [CrossRef] [PubMed]
- Kelly, M.; Longjohn, R.; Nottingham, K. The UCI Machine Learning Repository. Available online: https://archive.ics.uci.edu (accessed on 1 October 2024).
- Oniśko, A.; Druzdzel, M.J.; Wasyluk, H. An experimental comparison of methods for handling incomplete data in learning parameters of Bayesian networks. In Intelligent Information Systems 2002, Proceedings of the IIS’2002 Symposium, Sopot, Poland, 3–6 June 2002; Klopotek, M., Wierzchon, S.T., Michalewicz, M., Eds.; Advances in Soft Computing Series; Physica-Verlag (A Springer-Verlag Company): Berlin/Heidelberg, Germany, 2002; pp. 351–360. [Google Scholar]
- Zwitter, M.; Soklic, M. Breast Cancer [Dataset]; UCI Machine Learning Repository. 1988. Available online: https://archive.ics.uci.edu/dataset/14/breast+cancer (accessed on 1 October 2024).
- Campos, D.; Bernardes, J. Cardiotocography [Dataset]; UCI Machine Learning Repository. 2000. Available online: https://archive.ics.uci.edu/dataset/193/cardiotocography (accessed on 1 October 2024).
- Ilter, N.; Guvenir, H. Dermatology [Dataset]; UCI Machine Learning Repository. 1998. Available online: https://archive.ics.uci.edu/dataset/33/dermatology (accessed on 1 October 2024).
- Lichtinghagen, R.; Klawonn, F.; Hoffmann, G. HCV Data [Dataset]; UCI Machine Learning Repository. 2020. Available online: https://archive.ics.uci.edu/dataset/571/hcv+data (accessed on 1 October 2024).
- Hepatitis [Dataset]; UCI Machine Learning Repository. 1983. Available online: https://archive.ics.uci.edu/dataset/46/hepatitis (accessed on 1 October 2024).
- Kononenko, I. Inductive and Bayesian learning in medical diagnosis. Appl. Artif. Intell. 1993, 7, 317–337. [Google Scholar] [CrossRef]
- Kurgan, L.; Cios, K.J.; Tadeusiewicz, R.; Ogiela, M.R.; Goodenday, L.S. Knowledge discovery approach to automated cardiac SPECT diagnosis. Artif. Intell. Med. 2001, 23, 149–169. [Google Scholar] [CrossRef] [PubMed]
- Cooper, G.F.; Herskovits, E. A Bayesian Method for the Induction of Probabilistic Networks from Data. Mach. Learn. 1992, 9, 309–347. [Google Scholar] [CrossRef]
- Koiter, J.R. Visualizing Inference in Bayesian Networks. Master’s Thesis, Delft University of Technology, Delft, The Netherlands, 2006. [Google Scholar]
- Henrion, M. Some practical issues in constructing belief networks. In Uncertainty in Artificial Intelligence 3; Kanal, L.N., Levitt, T.S., Lemmer, J.F., Eds.; Elsevier Science Publishers: Amsterdam, The Netherlands, 1989; pp. 161–173. [Google Scholar]
Data Set | Citation | Instances | Variables | Variable Types | Classes | Missing Values |
---|---|---|---|---|---|---|
Breast Cancer | [12] | 286 | 10 | Categorical | 2 | 0.31% |
Cardiotocography | [13] | 2126 | 22 | Categorical, real | 3 | — |
Dermatology | [14] | 366 | 35 | Categorical, real | 6 | 0.06% |
HCV | [15] | 615 | 13 | Categorical, real | 5 | 0.2% |
Hepatitis | [16] | 155 | 20 | Categorical, real | 2 | 5.4% |
Lymphography | [17] | 148 | 19 | Categorical, integer | 4 | — |
Primary Tumor | [17] | 339 | 18 | Categorical, integer | 20 | 3.7% |
SPECT Heart | [18] | 267 | 23 | Categorical | 2 | — |
Model | Number of Nodes | Average Number of States | Mean in-Degree | Number of Edges | Number of Parameters |
---|---|---|---|---|---|
Breast Cancer | 10 | 4.50 | 1.40 | 14 | 200 |
Cardiotocography | 22 | 2.91 | 2.86 | 63 | 13,347 |
Dermatology | 35 | 3.94 | 0.83 | 29 | 2032 |
HCV | 13 | 3.15 | 1.38 | 18 | 312 |
Hepatitis | 20 | 2.50 | 1.90 | 38 | 465 |
Lymphography | 19 | 3.00 | 1.05 | 20 | 300 |
Primary Tumor | 18 | 3.17 | 1.83 | 33 | 877 |
SPECT Heart | 23 | 2.00 | 2.26 | 52 | 290 |
Model | Number of Classes | Prevalence of Various Classes in the Data Set |
---|---|---|
Breast Cancer | 2 | (70.3%, 29.7%) |
Cardiotocography | 3 | (77.8%, 13.9%, 8.3%) |
Dermatology | 6 | (30.6%, 19.7%, 16.7%, 14.2%, 13.4%, 5.5%) |
HCV | 5 | (86.7%, 4.9%, 3.9%, 3.4%, 1.1%) |
Hepatitis | 2 | (79.2%, 20.8%) |
Lymphography | 4 | (54.5%, 41.1%, 2.9%, 1.5%) |
Primary Tumor | 20 | (24.8%, 11.5%, 8.6%, 8.3%, 7.1%, 7.1%, 5.9%, 4.7%, 4.1%, 4.1%, 2.9%, 2.7%, 2.4%, 2.1%, 1.8%, 0.6%, 0.6%, 0.3%, 0.3%, 0.3%) |
SPECT Heart | 2 | (79.4%, 20.6%) |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Onisko, A.; Druzdzel, M.J. Sensitivity of Bayesian Networks to Errors in Their Structure. Entropy 2024, 26, 975. https://doi.org/10.3390/e26110975
Onisko A, Druzdzel MJ. Sensitivity of Bayesian Networks to Errors in Their Structure. Entropy. 2024; 26(11):975. https://doi.org/10.3390/e26110975
Chicago/Turabian StyleOnisko, Agnieszka, and Marek J. Druzdzel. 2024. "Sensitivity of Bayesian Networks to Errors in Their Structure" Entropy 26, no. 11: 975. https://doi.org/10.3390/e26110975