Skip to main content

Controlling memory access concurrency in efficient fault-tolerant parallel algorithms (extended abstract)

  • Conference paper
  • First Online:
Distributed Algorithms (WDAG 1993)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 725))

Included in the following conference series:

Abstract

The crcw pram with dynamic fail-stop errors is a faultprone multiprocessor model for which it is possible to control memory access redundancy while guaranteeing the reliability of efficient algorithms. Concurrent common reads and writes are necessary to handle dynamic faults and in this paper we show how to significantly decrease this concurrency and how to bound it in terms of the number of processor faults. We describe a low concurrency, efficient, and fault-tolerant algorithm for the Write- All primitive: “using ≤ N processors, write 1's into N locations”. This primitive serves as the basis for efficient faulttolerant simulations of algorithms written for fault-free prams on faultprone prams. For any dynamic failure pattern F, our algorithm has total write concurrency ≤¦F¦ and total read concurrency ≤7 ¦F¦log N, where ¦F¦ is the number of processor faults (e.g. no concurrency in a run without failures). Previous algorithms used Ω(N log N) concurrency even in the absence of faults. We also present an optimal fault-tolerant erew pram algorithm for Write-All when all processor faults are initial.

Research supported by ONR grant N00014-91-J-1613.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Similar content being viewed by others

References

  1. Anderson, R., Woll, H.: Wait-Free Parallel Algorithms for the Union-Find Problem. Proc. 23rd ACM STOC (1991) 370–380

    Google Scholar 

  2. Aumann, Y., Rabin, M. O.: Clock Construction in Fully Asynchronous Parallel Systems and PRAM Simulation. Proc. 34th IEEE FOCS (1992) 147–156

    Google Scholar 

  3. Beame, P., Kik, M., Kutylowski, M.: Information Broadcast by Exclusive Read PRAMs. Manuscript (1992)

    Google Scholar 

  4. Buss, J., Kanellakis, P. C., Ragde, P., Shvaitsman, A. A.: Parallel Algorithms with Processor Failures and Delays. Brown Univ. TR CS-91-54 (1991) (prel. version Kanellakis, P. C., Shvartsman, A. A.: Efficient Parallel Algorithms On Restartable Fail-Stop Processors. Proc. 10th ACM PODC (1991) 23–36)

    Google Scholar 

  5. Cole, R., Zajicek, O.: The APRAM: Incorporating Asynchrony into the PRAM Model. Proc. 1st ACM SPAA (1989) 170–178

    Google Scholar 

  6. Cole, R., Zajicek, O.: The Expected Advantage of Asynchrony. Proc. 2nd ACM SPAA (1990) 85–94

    Google Scholar 

  7. Eppstein, D., Galil, Z.: Parallel Techniques for Combinatorial Computation. Annual Computer Science Review 3 (1988) 233–283

    Google Scholar 

  8. Fortune, S., Wyllie, J.: Parallelism in Random Access Machines. Proc. 10th ACM STOC (1978) 114–118

    Google Scholar 

  9. Gibbons, P.: A More Practical PRAM Model. Proc. 1st SPAA (1989) 158–168

    Google Scholar 

  10. Kanellakis, P. C., Shvartsman, A. A.: Efficient Parallel Algorithms Can Be Made Robust. Distributed Computing 5 (1992) 201–217 (prel. version in Proc. 8th ACM PODC (1989) 138–148)

    Google Scholar 

  11. Karp, R. M., Ramachandran, V.: A Survey of Parallel Algorithms for Shared-Memory Machines. Handbook of Theoretical Computer Science, (ed. J. van Leeuwen) 1 (1990) North-Holland

    Google Scholar 

  12. Kedem, Z. M., Palem, K. V., Spirakis, P.: Efficient Robust Parallel Computations. Proc. 22nd ACM STOC (1990) 138–148

    Google Scholar 

  13. Kedem, Z. M., Palem, K. V., Rabin, M. O., Raghunathan, A.: Program Transformations for Resilient Parallel Computation via Randomization. Proc. 24th ACM STOC (1992) 306–318

    Google Scholar 

  14. Kedem, Z. M., Palem, K. V., Raghunathan, A., Spirakis, P.: Combining Tentative and Definite Executions for Dependable Parallel Computing. Proc. 23d ACM STOC (1991) 381–390

    Google Scholar 

  15. Martel, C.: Personal communication (1991)

    Google Scholar 

  16. Martel, C., Park, A., Subramonian, R.: Work-optimal Asynchronous Algorithms for Shared Memory Parallel Computers. SIAM Journal on Computing 21 (1992) 1070–1099

    Google Scholar 

  17. Martel, C., Subramonian, R., Park, A.: Asynchronous PRAMs are (Almost) as Good as Synchronous PRAMs. Proc. 32d IEEE FOCS (1990) 590–599

    Google Scholar 

  18. Nishimura, N.: Asynchronous Shared Memory Parallel Computation. Proc. 2nd ACM SPAA (1990) 76–84

    Google Scholar 

  19. Shvartsman, A. A.: Optimal CRCW PRAM Fault-Tolerance. Information Processing Letters 39 (1991) 59–66

    Google Scholar 

  20. Shvartsman, A. A.: Efficient Write-All Algorithm for Fail-Stop PRAM without Initialized Memory. Information Processing Letters 44 (1992) 223–231

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

André Schiper

Rights and permissions

Reprints and permissions

Copyright information

© 1993 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Kanellakis, P.C., Michailidis, D., Shvartsman, A.A. (1993). Controlling memory access concurrency in efficient fault-tolerant parallel algorithms (extended abstract). In: Schiper, A. (eds) Distributed Algorithms. WDAG 1993. Lecture Notes in Computer Science, vol 725. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-57271-6_30

Download citation

  • DOI: https://doi.org/10.1007/3-540-57271-6_30

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-57271-8

  • Online ISBN: 978-3-540-48029-7

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics