ABSTRACT
Recommender systems are emerging as an interesting application scenario for Linked Data (LD). By exploiting the knowledge encoded in LD datasets, a new generation of semantics-aware recommendation engines has been developed in recent years. Because Linked Data is often very rich and contains much information that may be irrelevant or noisy for a recommendation task, an initial feature selection step is always required to select the most meaningful portion of the original dataset. Many feature selection approaches proposed in the literature exploit different statistical dimensions of the original data. In this paper we investigate the role of the semantics encoded in an ontological hierarchy when it is exploited to select the most relevant properties for a recommendation task. In particular, we compare an approach based on schema summarization with a "classical" one, i.e., Information Gain. We evaluate the performance of the two methods in terms of accuracy and aggregate diversity, using an experimental testbed built on the Movielens dataset.
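To make the "classical" baseline concrete, the following is a minimal sketch of Information Gain used to rank item properties. It is not the authors' implementation: the property names (`director`, `country`), the binary like/dislike labels, and the helper `rank_properties` are assumptions chosen for illustration, and the abstract does not specify how the paper discretizes property values or builds the labels.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def information_gain(feature_values, labels):
    """IG = H(labels) - sum over values v of P(v) * H(labels | feature = v)."""
    total = len(labels)
    by_value = {}
    for value, label in zip(feature_values, labels):
        by_value.setdefault(value, []).append(label)
    conditional = sum(len(ys) / total * entropy(ys) for ys in by_value.values())
    return entropy(labels) - conditional


def rank_properties(items, labels, k):
    """Return the k property names with the highest Information Gain.

    `items` is a list of dicts mapping a property name to its value
    (hypothetical representation; a missing property is treated as None).
    """
    properties = sorted({p for item in items for p in item})
    scores = {
        p: information_gain([item.get(p) for item in items], labels)
        for p in properties
    }
    return sorted(properties, key=lambda p: scores[p], reverse=True)[:k]


# Toy data (assumed): four movies, label 1 = liked, 0 = not liked.
items = [
    {"director": "A", "country": "US"},
    {"director": "A", "country": "IT"},
    {"director": "B", "country": "US"},
    {"director": "B", "country": "IT"},
]
labels = [1, 1, 0, 0]
top = rank_properties(items, labels, k=1)
```

In this toy case `director` separates liked from non-liked items perfectly while `country` carries no information, so only `director` survives a top-1 selection. A schema-summarization approach would instead rank properties using patterns extracted from the ontology, which this sketch does not attempt to reproduce.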
Index Terms
Schema-summarization in linked-data-based feature selection for recommender systems