[go: up one dir, main page]

Skip to main content

Scheduling Fault-Tolerant Programs on Multiple Processors to Maximize Schedule Reliability

  • Conference paper
  • First Online:
Computer Safety, Reliability and Security (SAFECOMP 1999)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 1698))

Included in the following conference series:

Abstract

The paper proposes to manage complexity and costs issues of the fault-tolerant programs not at a single program level but rather from the point of view of the whole set of such programs, which are to be run under hard time constraints. A concept of the multiple processor programs is used to model a fault-tolerant program structure. This model, in turn, is used to formulate the fault-tolerant programs scheduling problem under hard time constraints. Since the discussed problem is computationally difficult, three scheduling algorithms, based on three different metaheuristics, have been proposed. To evaluate the proposed algorithms computational experiment has been carried. The proposed global approach has been also compared with scheduling without search for the global optimum. Experiment results prove that the approach could be advantageous by producing more reliable schedules within hard time constraints.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

6. References

  1. Ashrafi, N., Berman, O.: Optimization Models for Selection of Programs, Considering Cost & Reliability, IEEE Transactions on Reliability 41(2) (1992) 59–65

    Article  Google Scholar 

  2. Avizienis, A., Chen, L.: On the implementation of the N-version programming for software fault tolerance during execution, Proc. IEEE COMPSAC 77 (1977) 149–155

    Google Scholar 

  3. Belli, F., Jędrzejowicz, P.: An Approach to the Reliability Optimization of Software with Redundancy, IEEE Transactions on Software Engineering, 17(3) (1991) 310–312

    Article  Google Scholar 

  4. Błażewicz, J., Drabowski, M.,: Węglarz, J.: Scheduling Multiprocessor tasks to minimize schedule length, IEEE Transactions on Computers 35 (1986) 389–393.

    Google Scholar 

  5. Błażewicz, J., Ecker, K. H., Pesch, E., Schmidt, G.,: Węglarz, J.: Scheduling Computer and Manufacturing Processes, Springer, Berlin (1996)

    MATH  Google Scholar 

  6. Bondavalli, A., Giandomenico, F. Di., Xu, J.: Cost-effective and flexible scheme for software fault tolerance, Computer System Science & Engineering, 4 (1993) 234–244

    Google Scholar 

  7. Garey, M. R., Johnson, D. S.: Computers and Intractability: A Guide to the Theory of NPCompleteness, W.H.Freeman, New York (1979)

    Google Scholar 

  8. Glover, F., Laguna, M.: Tabu Search, Kluver, Boston (1987)

    Google Scholar 

  9. Ghosh, S., Melham, R., Mosse, D.: Fault-Tolerance through Scheduling of Aperiodic Tasks in Hard Real-Time Multiprocessor Systems, IEEE Transactions on Parallel and Distributed Systems 8(13) (1997) 272–284

    Article  Google Scholar 

  10. Gutjahr, W.: Reliability Optimization of Redundant Software with Correlated Failures, Proc. IX Int. Symposium on Software Reliability Engineering, Paderborn (1998) 293–303

    Google Scholar 

  11. Hecht, M., Agron, J., Hochhauser, S.: A distributed fault tolerant architecture for nuclear reactor and safety functions, Proc. Real-Time System Symposium, Santa Monica (1989) 214–221

    Google Scholar 

  12. Jedrzejowicz, P.: Social Learning Algorithm, Technical Report nr 7/98, Computer Science Dept. Gdynia Maritime Academy, Gdynia (1998)

    Google Scholar 

  13. Kim, K. H.: Distributed execution of recovery blocks: an approach to uniform treatment of hardware and software faults, Proc. 4th International Conference on Distributed Computing Systems, IEEE Computer Society Press (1984) 526–532

    Google Scholar 

  14. Krishna, C. M., Shin, K. G.: On Scheduling Tasks with a Quick Recovery from Failure, IEEE Transactions on Computers 35(5) (1986) 448–455

    Article  MATH  Google Scholar 

  15. Laprie, J. C., Arlat, J., Beounes, C., Kanoun, K.: Definition and Analysis of Hardware-and-Software Fault-Tolerant Architectures, IEEE Computer 23(7) (1990) 39–51

    Google Scholar 

  16. Liestman, A. L., Campbell, R. H.: A Fault-tolerant Scheduling Problem, IEEE Transactions on Software Engineering, SE-12(11) (1988) 1089–1095

    Google Scholar 

  17. Melliar-Smith, P. M., Randell, B.: Software reliability: the role of programmed exception handling, SIGPLAN Notices 12(3) (1977) 95–100

    Article  Google Scholar 

  18. Michalewicz, Z.: Genetic Algorithms + Data Structures = Evolution Programs, Springer, 2nd ed., Berlin (1996)

    Google Scholar 

  19. Ramamritham, K., Stankovic, J. A., Zhao, W.: Distributed Scheduling of Tasks with Deadlines and Resource Requirements, IEEE Transactions on Computers 38(8) (1989) 1110–1123

    Article  Google Scholar 

  20. Ramamritham, K., Stankovic, J. A., Shiah, P. F.: Efficient Scheduling Algorithms for Real-Time Systems, IEEE Transactions on Parallel and Distributed Systems, 1(2) (1990) 184–194

    Article  Google Scholar 

  21. Randell, B., Xu, J.: The Evolution of the Recovery Block Concept, in: M. R. Lyu (ed.): Software Fault-Tolerance, J. Wiley, Chichester (1995) 1–22

    Google Scholar 

  22. Scott R. K., J. W. Gault, D. F. Mc Allister: Fault tolerant software reliability modelling, IEEE Transactions on Software Engineering, 13(5) (1987) 582–592

    Article  Google Scholar 

  23. Shirazi, B., Wang, M., Pathak, G.: Analysis and Evaluation of Heuristic Methods for Static Task Scheduling, Journal of Parallel and Distributed Computing, 10 (1990) 222–232

    Article  Google Scholar 

  24. Shirazi, B., Hurson, A. R., Kavi, K. M.: Scheduling and Load Balancing in Parallel and Distributed Systems, IEEE Computer Society Press, Los Alamitos (1995)

    Google Scholar 

  25. Tso, K. S., Avizienis, A.: Community error recovery in N-version software: a design study with experimentation, Digest of 17th FTCS, Pittsburgh (1987) 127–133

    Google Scholar 

  26. Xu, J., Randell, B.: Software Fault Tolerance: t/(n-1)-Variant Programming, IEEE Transactions on Reliability, 46(1) (1997) 60–67

    Article  Google Scholar 

  27. Yau, S. S., Cheung, R. C.: Design of Self-Checking Software, Proc. Int. Conf. on Reliable Software, IEEE Computer Society Press (1975) 450–457

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1999 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Czarnowski, I., Jedrzejowicz, P., Ratajczak1, E. (1999). Scheduling Fault-Tolerant Programs on Multiple Processors to Maximize Schedule Reliability. In: Felici, M., Kanoun, K. (eds) Computer Safety, Reliability and Security. SAFECOMP 1999. Lecture Notes in Computer Science, vol 1698. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48249-0_33

Download citation

  • DOI: https://doi.org/10.1007/3-540-48249-0_33

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-66488-8

  • Online ISBN: 978-3-540-48249-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics