Abstract
This paper deals with the problem of learning prognostic models from medical survival data, where the sole prediction of probability of event (and not its probability dependency on time) is of interest. To appropriately consider the follow-up time and censoring — both characteristic for survival data — we propose a weighting technique that lessens the impact of data from patients for which the event did not occur and have short follow-up times. A case study on prostate cancer recurrence shows that by incorporating this weighting technique the machine learning tools stand beside or even outperform modern statistical methods and may, by inducing symbolic recurrence models, provide further insight to relationships within the modeled data.
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
R. Agrawal, T. Imielinski, and A. Swami. Mining association rules between sets of items in large databases. In Proc. ACM SIGMOD Conference on Management of Data, pages 207–216, Washington, D. C., 1993.
E. Biganzoli, P. Boracchi, and L. Mariani, et al. Feed forward neural networks for the analysis of censored survival data: a partial logistic regression approach. Statis Med, 1998.
D. R. Cox. Regression models and life-tables. J R Statist Soc B, 34:187–220, 1972.
F. E. Harrell, R. M. Califf, D. B. Pryor, K. L. Lee, and R. A. Rosati. Evaluating the yield of medical tests. Journal of American Medical Association, 247(18):2543–2546, 1982.
M. W. Kattan, H. Ishida, P. T. Scardino, and J. R. Beck. Applying a neural network to prostate cancer survival data. In N. Lavrač, E. Keravnou, and B. Zupan, editors, Intelligent data analysis in medicine and pharmacology, pages 295–306. Kluwer, Boston, 1997.
J. Lubsen, J. Pool, and E. van der Does. A practical device for the application of a diagnostic or prognostic function. Methods of Information in Medicine, 17:127–129, 1978.
D. Michie, D. J. Spiegelhalter, and C. C. Taylor, editors. Machine learning, neural and statistical classification. Ellis Horwood, 1994.
T. Niblett and I. Bratko. Learning decision rules in noisy domains. In Expert Systems 86, pages 15–18. Cambridge University Press, 1986. (Proc. EWSL 1986, Brighton).
M. Ohori, T. M. Wheeler, and P. T. Scardino. The new american joint committee on cancer and international union against cancer tnm classification of prostate cancer: Clinicopathologic correlations. Cancer, 74:104–114, 94.
R. Quinlan. Induction of decision trees. Machine Learning, 1(1):81–106, 1986.
B. D. Ripley and R. M. Ripley. Neural networks as statistical methods in survival analysis. In R. Dybowski and V. Gant, editors, Artificial Neural Networks: Prospects for Medicine. Landes Biosciences Publishers, 1998.
T. M. Therneau, P. M. Grambsch, and T. R. Fleming. Martingale-based residuals for survival models. Biometrika, 77(1):147–160, 1990.
W. N. Venables and B. D. Ripley. Modern applied statistics with S-PLUS. Springer, New York, second edition edition, 1997.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1999 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zupan, B., Demšar, J., Kattan, M.W., Beck, J.R., Bratko, I. (1999). Machine Learning for Survival Analysis: A Case Study on Recurrence of Prostate Cancer. In: Horn, W., Shahar, Y., Lindberg, G., Andreassen, S., Wyatt, J. (eds) Artificial Intelligence in Medicine. AIMDM 1999. Lecture Notes in Computer Science(), vol 1620. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48720-4_37
Download citation
DOI: https://doi.org/10.1007/3-540-48720-4_37
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-66162-7
Online ISBN: 978-3-540-48720-3
eBook Packages: Springer Book Archive