Skip to main content

A Data Mining Architecture for Distributed Environments

  • Conference paper
  • First Online:

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 2346))

Abstract

Data mining offers tools for the discovery of relationship, patterns and knowledge from a massive database in order to guide decisions about future activities. Applications from various domains have adopted this technique to perform data analysis efficiently. Several issues need to be addressed when such techniques apply on data these are bulk at size and geographically distributed at various sites. In this paper we describe system architecture for a scalable and a portable distributed data mining application. The system contains modules for secure distributed communication, database connectivity, organized data management and efficient data analysis for generating a global mining model. Performance evaluation of the system is also carried out and presented.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. David W. Cheung, Vincent T. Ng, Ada W. Fu, and Yongjian fu “Efficient Mining of Association rules in Distributed Databases”. IEEE Transaction on Knowledge and Data Mining, Vol. 8, No 6, December 1996.

    Google Scholar 

  2. Chen M.S., Han J., and Yu P.S. “Data mining: An overview from a database perspective”. IEEE Transactions on Knowledge and Data Engineering, Vol 8, No 6, pages 866–883, 1996.

    Article  Google Scholar 

  3. Stolfo S.J., Prodromidis A.L., Tselepis S., Lee W., Fan D.W., and Chan P.K. “Jam: Java agents for meta-learning over distributed databases”. Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pages 74–81, Newport Beach, CA, August 1997. AAAI Press.

    Google Scholar 

  4. Bailey S.M., Grossman R. L., Sivakumar H., and Turinsky A.L., “Papyrus: A System for Data Mining over Local and Wide Area Clusters and Super-Clusters”. Technical report, University of Illinois at Chicago.

    Google Scholar 

  5. Rana O., Walker D., Li M., Lynden S., and Ward M.,”PaDDMAS: Parallel and Distributed Data Mining Application Suit”. Proceedings of the Fourteenth International Parallel and Distributed Processing Symposium pages 387–392.

    Google Scholar 

  6. Kusiak A.,”Decomposition in Data Mining: An Industrial Case Study”. IEEE Transaction on Electronics Packaging Manufacturing, Vol. 23 No 4 October 2000.

    Google Scholar 

  7. Kargupta H., Hamzaoglu I., and Stafford B. “Scalable, Distributed Data Mining An Agent Based Application”. Proceedings of Knowledge Discovery And Data Mining, August, 1997.

    Google Scholar 

  8. Kargupta, H., Park, B., Hershberger, D., and Johnson, E., (1999), “Collective Data Mining: A New Perspective Toward Distributed Data Mining”. Advances in Distributed and Parallel Knowledge Discovery, 1999. MIT/AAAI Press.

    Google Scholar 

  9. Prodromidis, A., Chan, P., and Stolfo, S. (2000). “Meta-Learning in Distributed Data Mining Systems: Issues and Approaches”. Advances in Distributed and Parallel Knowledge Discovery, AAAI/MITPress.

    Google Scholar 

  10. Zaki M. “Parallel and Distributed Association Mining: A survey”. IEEE Concurrency, special issue on Parallel Mechanisms for Data Mining, 7(4):14–25, December.

    Google Scholar 

  11. Lee W., Salvatore J. S., Philip K. C., Eleazar E., Wei F., Matthew M., Shlomo H., and Junxin Z., ”Real Time Data Mining-based Intrusion Detection.” Proceedings of DISCEX II. June 2001.

    Google Scholar 

  12. Sally M., “Distributed Data Mining”. Proceeding of Intelligence in Industry, Issue 3, 2001.

    Google Scholar 

  13. Chattratichat J., Darlington J, Guo Y., Hedvall S., Kohler M., and Syed J., “An Architecture for Distributed Enterprise Data Mining”. HPCN, Amsterdam, 1999.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Ashrafi, M.Z., Taniar, D., Smith, K. (2002). A Data Mining Architecture for Distributed Environments. In: Unger, H., Böhme, T., Mikler, A. (eds) Innovative Internet Computing Systems. IICS 2002. Lecture Notes in Computer Science, vol 2346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48080-3_3

Download citation

  • DOI: https://doi.org/10.1007/3-540-48080-3_3

  • Published:

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-43790-1

  • Online ISBN: 978-3-540-48080-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics