Abstract
Data mining offers tools for the discovery of relationship, patterns and knowledge from a massive database in order to guide decisions about future activities. Applications from various domains have adopted this technique to perform data analysis efficiently. Several issues need to be addressed when such techniques apply on data these are bulk at size and geographically distributed at various sites. In this paper we describe system architecture for a scalable and a portable distributed data mining application. The system contains modules for secure distributed communication, database connectivity, organized data management and efficient data analysis for generating a global mining model. Performance evaluation of the system is also carried out and presented.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
David W. Cheung, Vincent T. Ng, Ada W. Fu, and Yongjian fu “Efficient Mining of Association rules in Distributed Databases”. IEEE Transaction on Knowledge and Data Mining, Vol. 8, No 6, December 1996.
Chen M.S., Han J., and Yu P.S. “Data mining: An overview from a database perspective”. IEEE Transactions on Knowledge and Data Engineering, Vol 8, No 6, pages 866–883, 1996.
Stolfo S.J., Prodromidis A.L., Tselepis S., Lee W., Fan D.W., and Chan P.K. “Jam: Java agents for meta-learning over distributed databases”. Proceedings of the 3rd International Conference on Knowledge Discovery and Data Mining, pages 74–81, Newport Beach, CA, August 1997. AAAI Press.
Bailey S.M., Grossman R. L., Sivakumar H., and Turinsky A.L., “Papyrus: A System for Data Mining over Local and Wide Area Clusters and Super-Clusters”. Technical report, University of Illinois at Chicago.
Rana O., Walker D., Li M., Lynden S., and Ward M.,”PaDDMAS: Parallel and Distributed Data Mining Application Suit”. Proceedings of the Fourteenth International Parallel and Distributed Processing Symposium pages 387–392.
Kusiak A.,”Decomposition in Data Mining: An Industrial Case Study”. IEEE Transaction on Electronics Packaging Manufacturing, Vol. 23 No 4 October 2000.
Kargupta H., Hamzaoglu I., and Stafford B. “Scalable, Distributed Data Mining An Agent Based Application”. Proceedings of Knowledge Discovery And Data Mining, August, 1997.
Kargupta, H., Park, B., Hershberger, D., and Johnson, E., (1999), “Collective Data Mining: A New Perspective Toward Distributed Data Mining”. Advances in Distributed and Parallel Knowledge Discovery, 1999. MIT/AAAI Press.
Prodromidis, A., Chan, P., and Stolfo, S. (2000). “Meta-Learning in Distributed Data Mining Systems: Issues and Approaches”. Advances in Distributed and Parallel Knowledge Discovery, AAAI/MITPress.
Zaki M. “Parallel and Distributed Association Mining: A survey”. IEEE Concurrency, special issue on Parallel Mechanisms for Data Mining, 7(4):14–25, December.
Lee W., Salvatore J. S., Philip K. C., Eleazar E., Wei F., Matthew M., Shlomo H., and Junxin Z., ”Real Time Data Mining-based Intrusion Detection.” Proceedings of DISCEX II. June 2001.
Sally M., “Distributed Data Mining”. Proceeding of Intelligence in Industry, Issue 3, 2001.
Chattratichat J., Darlington J, Guo Y., Hedvall S., Kohler M., and Syed J., “An Architecture for Distributed Enterprise Data Mining”. HPCN, Amsterdam, 1999.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Ashrafi, M.Z., Taniar, D., Smith, K. (2002). A Data Mining Architecture for Distributed Environments. In: Unger, H., Böhme, T., Mikler, A. (eds) Innovative Internet Computing Systems. IICS 2002. Lecture Notes in Computer Science, vol 2346. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-48080-3_3
Download citation
DOI: https://doi.org/10.1007/3-540-48080-3_3
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-43790-1
Online ISBN: 978-3-540-48080-8
eBook Packages: Springer Book Archive