Abstract
This paper focuses on the problem of tagging quality evaluation in collaborative tagging systems. By investigating the dynamics of tagging process, we find that high frequency tags almost cover the main aspects of a resource content and can be determined stable much earlier than a whole tag set. Motivated by this finding, we design the swapping index and smart moving index on tagging quality. We also study the correlations in tag usage and propose the semantic measurement on tagging quality. The proposed methods are evaluated against real datasets and the results show that they are more efficient than previous methods, which are appropriate for a large number of web resources. The effectiveness is justified by the results in tag based applications. The light weight metrics bring a little loss on the performance, while the semantic metric is better than current methods.
This work is supported by NSF China (61173140), SAICT Experts Program, Independent Innovation & Achievements Transformation Program (2014ZZCX03301) and Science & Technology Development Program of Shandong Province (2014GGX101046).
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Bi, B., Lee, S.D., Kao, B., Cheng, R.: Cubelsi: an effective and efficient method for searching resources in social tagging systems. In: Proceedings of the 27th IEEE International Conference on Data Engineering (ICDE), pp. 27–38. IEEE (2011)
Bischoff, K., Firan, C.S., Nejdl, W., Paiu, R.: Can all tags be used for search? In: Proceedings of the 17th ACM Conference on Information and knowledge Management, pp. 193–202. ACM (2008)
Chi, E.H., Mytkowicz, T.: Understanding the efficiency of social tagging systems using information theory. In: Proceedings of the 19th ACM Conference on Hypertext and Hypermedia, pp. 81–88. ACM (2008)
Durao, F., Dolog, P.: A personalized tag-based recommendation in social web systems. In: Adaptation and Personalization for Web 2.0, p. 40 (2009)
Golder, S.A., Huberman, B.A.: Usage patterns of collaborative tagging systems. J. Inf. Sci. 32(2), 198–208 (2006)
Halpin, H., Robu, V., Shepherd, H.: The complex dynamics of collaborative tagging. In: Proceedings of the 16th International Conference on World Wide Web, pp. 211–220. ACM (2007)
Heymann, P., Koutrika, G., Garcia-Molina, H.: Can social bookmarking improve web search? In: Proceedings of the 2008 International Conference on Web Search and Data Mining, pp. 195–206. ACM (2008)
Kendall, M.G.: Rank Correlation Methods. Griffin, London (1948)
Lei, S., Yang, X.S., Mo, L., Maniu, S., Cheng, R.: Itag: incentive-based tagging. In: Proceedings of 30th IEEE International Conference on Data Engineering (ICDE), pp. 1186–1189. IEEE (2014)
Ramage, D., Heymann, P., Manning, C.D., Garcia-Molina, H.: Clustering the tagged web. In: Proceedings of the Second ACM International Conference on Web Search and Data Mining, pp. 54–63. ACM (2009)
Shepitsen, A., Gemmell, J., Mobasher, B., Burke, R.: Personalized recommendation in social tagging systems using hierarchical clustering. In: Proceedings of the 2008 ACM Conference on Recommender Systems, pp. 259–266. ACM (2008)
Trushkowsky, B., Kraska, T., Franklin, M.J., Sarkar, P.: Crowdsourced enumeration queries. In: Proceedings of the 29th IEEE International Conference on Data Engineering (ICDE), pp. 673–684. IEEE (2013)
Van Damme, C., Hepp, M., Coenen, T.: Quality metrics for tags of broad folksonomies. In: Proceedings of International Conference on Semantic Systems (I-SEMANTICS), pp. 118–125 (2008)
Wagner, C., Singer, P., Strohmaier, M., Huberman, B.A.: Semantic stability in social tagging streams. In: Proceedings of the 23rd International Conference on World Wide Web, pp. 735–746. ACM (2014)
Xu, H., Zhou, D., Sun, Y., Sun, H.: Quality based dynamic incentive tagging. Distrib. Parallel Databases 33(1), 69–93 (2015)
Yang, X.S., Cheng, R., Mo, L., Kao, B., Cheung, D.W.-l.: On incentive-based tagging. In: Proceedings of the 29th IEEE International Conference on Data Engineering (ICDE), pp. 685–696. IEEE (2013)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Sun, Y., Sun, H., Cheng, R. (2016). Fast and Semantic Measurements on Collaborative Tagging Quality. In: Bailey, J., Khan, L., Washio, T., Dobbie, G., Huang, J., Wang, R. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2016. Lecture Notes in Computer Science(), vol 9652. Springer, Cham. https://doi.org/10.1007/978-3-319-31750-2_29
Download citation
DOI: https://doi.org/10.1007/978-3-319-31750-2_29
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-31749-6
Online ISBN: 978-3-319-31750-2
eBook Packages: Computer ScienceComputer Science (R0)