Abstract
We present a randomized parallel list ranking algorithm for distributed memory multiprocessors, using a BSP type model. We first describe a simple version which requires, with high probability, log(3p)+log ln(n)=Õ(logp+log logn) communication rounds (h-relations withh=Õ(n/p)) andÕ(n/p)) local computation. We then outline an improved version that requires high probability, onlyr⩽(4k+6) log(2/3p)+8=Õ(k logp) communication rounds wherek=min{i⩾0 |ln(i+1)n⩽(2/3p)2i+1}. Notek<ln*(n) is an extremely small number. Forn andp⩾4, the value ofk is at most 2. Hence, for a given number of processors,p, the number of communication rounds required is, for all practical purposes, independent ofn. Forn⩽1, 500,000 and 4⩽p⩽2048, the number of communication rounds in our algorithm is bounded, with high probability, by 78, but the actual number of communication rounds observed so far is 25 in the worst case. Forn⩽10010100 and 4⩽p⩽2048, the number of communication rounds in our algorithm is bounded, with high probability, by 118; and we conjecture that the actual number of communication rounds required will not exceed 50. Our algorithm has a considerably smaller member of communication rounds than the list ranking algorithm used in Reid-Miller’s empirical study of parallel list ranking on the Cray C-90.(1) To our knowledge, Reid-Miller’s algorithm(1) was the fastest list ranking implementation so far. Therefore, we expect that our result will have considerable practical relevance.
Similar content being viewed by others
References
M. Reid-Miller, List Ranking and List Scan on the Cray C-90,Proc. ACM Symp. on Parallel Algorithms and Architectures, 104–113 (1994).
R. J. Anderson and L. Snyder, A Comparison of Shared and Nonshared Memory Models of Computation,Proc. of the IEEE,79(4):480–487.
L. Snyder, Type Architectures, Shared Memory and the Corollary of Modest Potential,Ann. Rev. Comput. Sci.,1:289–317 (1986).
L. G. Valiantet al., General Purpose Parallel Architectures, inHandbook of Theoretical Computer Science, J. van Leeuwen, ed., MIT Press/Elsevier, pp. 943–972 (1990).
L. G. Valiant, A Bridging Model for Parallel Computation,Comm. ACM,33:103–111 (1990).
A. V. Gerbessiotis and L. G. Valiant, Direct Bulk-Synchronous Parallel Algorithms,Proc. 3rd Scandinavian Workshop on Algorithm Theory, Lecture Notes in Computer Science,621:1–18 (1992).
F. Dehne, A. Fabri, and A. Rau-Chaplin, Scalable Parallel Geometric Algorithms for Coarse Grained Multicomputers, inProc. ACM Symp. Computational Geometry, pp. 298–307 (1993).
F. Dehne, A. Fabri, and C. Kenyon, Scalable and Architecture Independent Parallel Geometric Algorithms with High Probability Optimal Time,Proc. 6th IEEE Symposium on Parallel and Distributed Processing, pp. 586–593 (1994).
F. Dehne, X. Deng, P. Dymond, A Fabri, and A. A. Kokhar, A Randomized Parallel 3D Convex Hull Algorithm for Coarse Grained Parallel Multicomputers,Proc ACM Symp. on Parallel Algorithms and Architectures (1995).
K. Mulmuley,Computational Geometry: An Introduction Through Randomized Algorithms, Prentice Hall, New York, (1993).
X. Deng and P. Dymond, Efficient Routing and Message Bounds for Optimal Parallel Algorithms,Proc. Int. Parallel Proc. Symp. (1995).
G. E. Blelloch, C. E. Leisersson, B. M. Maggs, and C. G. Plaxton, A Comparison of Sorting Algorithms for the Connection Machine CM-2,Proc. ACM Symp. on Parallel Algorithms and Architectures, pp. 3–16 (1991).
X. Deng and N. Gu, Good Programming Style on Multiprocessors,Proc. IEEE Symposium on Parallel and Distributed Processing, pp. 538–543 (1994).
X. Deng, A Convex Hull Algorithm for Coarse Grained Multiprocessors,Proc. 5th International Symposium on Algorithms and Computation (1994).
Hui Li and K. C. Sevcik, Parallel Sorting by Overpartitioning,Proc. ACM Symp. On Parallel Algorithms and Architectures, pp. 46–56 (1994).
J. JàJà,An Introduction to Parallel Algorithms. Addison Wesley, 1992.
M. Reid-Miller, C. L. Miller, and F. Modugno, List Ranking and Parallel Tree Compaction, J. H. Reif, ed.,Synthesis of Parallel Algorithms, Morgan Kaufmann Publisher (1993).
J. C. Wyllie, The Complexity of Parallel Computation, Technical Report TR 79-387, Department of Computer Science, Cornell University (1979).
R. Cole and U. Vishkin, Approximate Parallel Scheduling, Part I: the basic technique with Applications to optimal Parallel list Ranking in Logarithmic Time,SIAM J. Computing,17(1):128–142 (1988).
J. R. Anderson and G. L. Miller, Deterministic Parallel List Ranking. InVLSI Algorithms and Architectures: 3rd Aegean Workshop on Computing, J. H. Reif, ed., AWOC’88, Springer Verlag, Lecture Notes in Computer Science,319:81–90, (1988).
G. L. Miller and J. H. Reif, Parallel Tree Contraction Part 1: Fundamentals,Advances in Computing Research,5:47–72 (1989).
G. L. Miller and J. H. Reif, Parallel Tree Contraction Part 1: Further Applications,SIAM J. Computing,20(6):1128–1147 (December 1991).
J. R. Anderson and G. L. Miller, A Simple Randomized Parallel Algorithm for list Ranking,Information Processing Letters,33(5):269–273, (January 1990).
M. J. Atallah and S. E. Hambrusch, Solving tree problems on a Mesh-Connected Processor Array,Information and Control,69:168–187, (1986).
S. Baase, Introduction to parallel Connectivity List Ranking, and Euler Tour Techniques,Synthesis of Parallel Algorithms, J. H. Reif, ed., Morgan Kaufmann Publisher, (1993).
Author information
Authors and Affiliations
Additional information
Research partially supported by Natural Sciences and Engineering Research Council of Canada (NSERC), Fundação de Amparo à Pesquisa do Estado de São Paulo (FAPESP), Proc. No. 95/0767-0, 95/1367-5, Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq), Proc. No. 523112/94-7 and PROTEM II, and the Commission of the European Community (Project ITDC-207).
Rights and permissions
About this article
Cite this article
Dehne, F., Song, S.W. Randomized parallel list ranking for distributed memory multiprocessors. Int J Parallel Prog 25, 1–16 (1997). https://doi.org/10.1007/BF02700044
Issue Date:
DOI: https://doi.org/10.1007/BF02700044