Skip to main content

ParaDualMiner: An Efficient Parallel Implementation of the DualMiner Algorithm

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 3056))

Abstract

Constraint based mining finds all itemsets that satisfy a set of predicates. Many constraints can be categorised as being either monotone or antimonotone. Dualminer was the first algorithm that could utilise both classes of constraint simultaneously to prune the search space. In this paper, we present two parallel versions of DualMiner. The ParaDualMiner with Simultaneous Pruning efficiently distributes the task of expensive predicate checking among processors with minimum communication overhead. The ParaDualMiner with Random Polling makes further improvements by employing a dynamic subalgebra partitioning scheme and a better communication mechanism. Our experimental results indicate that both algorithms exhibit excellent scalability.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agrawal, R., Shafer, J.C.: Parallel mining of association rules. IEEE Trans. On Knowledge And Data Engineering 8, 962–969 (1996)

    Article  Google Scholar 

  2. Agrawal, R., Srikant, R.: Fast algorithms for mining association rules in large databases. In: Proceedings of VLDB 1994, pp. 487–499 (1994)

    Google Scholar 

  3. Bayardo, R., Agrawal, R., Gunopulos, D.: Constraint-based rule mining in large, dense databases. In: Proceedings of ICDE 1999, pp. 188–197 (1999)

    Google Scholar 

  4. Bucila, C., Gehrke, J., Kifer, D., White, W.: Dualminer: a dual-pruning algorithm for itemsets with constraints. In: Proceedings of ACM SIGKDD 2002, pp. 42–51 (2002)

    Google Scholar 

  5. Burdick, D., Calimlim, M., Gehrke, J.: Mafia: A maximal frequent itemset algorithm for transactional databases. In: Proceedings of ICDE 2001, pp. 443–452 (2001)

    Google Scholar 

  6. Eager, D.L., Lazowska, E.D., Zahorjan, J.: A comparison of receiver-initiated and sender-initiated adaptive load sharing. In: Proceedings of ACM SIGMETRICS 1985, pp. 1–3 (1985)

    Google Scholar 

  7. Han, E.H., Karypis, G., Kumar, V.: Scalable parallel data mining for association rules. In: Proceedings of SIGMOD 1997, pp. 277–288 (1997)

    Google Scholar 

  8. Ng, R.T., Lakshmanan, L.V., Han, J., Pang, A.: Exploratory mining and pruning optimizations of constrained association rules. In: Proceedings of SIGMOD 1998, pp. 13–24 (1998)

    Google Scholar 

  9. Orlando, S., Palmerini, P., Perego, R., Silvestri, F.: Adaptive and resource-aware mining of frequent sets. In: Proceedings of ICDM 2002, p. 338 (2002)

    Google Scholar 

  10. Pei, J., Han, J., Lakshmanan, L.: Mining frequent item sets with convertible constraints. In: Proceedings of ICDE 2001, pp. 433–442 (2001)

    Google Scholar 

  11. Sanders, P.: A detailed analysis of random polling dynamic load balancing. In: Proceedings of ISPAN 1994, pp. 382–389 (1994)

    Google Scholar 

  12. Sanders, P.: Asynchronous random polling dynamic load balancing. In: Aggarwal, A.K., Pandu Rangan, C. (eds.) ISAAC 1999. LNCS, vol. 1741, p. 39. Springer, Heidelberg (1999)

    Google Scholar 

  13. Raedt, L.D., Kramer, S.: The levelwise version space algorithm and its application to molecular fragment finding. In: Proceedings of IJCAI 2001, pp. 853–8622 (2001)

    Google Scholar 

  14. Zaiane, O., El-Hajj, M., Lu, P.: Fast parallel association rule mining without candidacy generation. In: Proceedings of ICDM 2001, pp. 665–668 (2001)

    Google Scholar 

  15. Zaki, M., Li, W., Parthasarathy, S.: Customized dynamic load balancing for a network of workstations. Journal of Parallel and Distributed Computing 43(2), 156–162 (1997)

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ting, R.M.H., Bailey, J., Ramamohanarao, K. (2004). ParaDualMiner: An Efficient Parallel Implementation of the DualMiner Algorithm. In: Dai, H., Srikant, R., Zhang, C. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2004. Lecture Notes in Computer Science(), vol 3056. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24775-3_12

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24775-3_12

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-22064-0

  • Online ISBN: 978-3-540-24775-3

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics