Abstract
The semantic similarity measure between biomedical terms or concepts is a crucial task in biomedical information extraction and knowledge discovery. Most of the existing similarity approaches measure the similarity degree based on the path length between concept nodes as well as the depth of the ontology tree or hierarchy. These measures do not work well in case of the “primitive concepts” which are partially defined and have only few relations in the ontology structure. Namely, they cannot give the desired similarity results against human expert judge on the similarity among primitive concepts. In this paper, the existing two ontology-based measures are introduced and analyzed in order to determine their limitations with respect to the considered knowledge base. After that, a new similarity measure based on concept name analysis is proposed to solve the weakness of the existing similarity measures for primitive concepts. Using SNOMED CT as the input ontology, the accuracy of our proposal is evaluated and compared against other approaches with the human expert results based on different types of ontology concepts. Based on the correlation between the results of the evaluated measures and the human expert ratings, this paper analyzes the strength and weakness of each similarity measure for all ontology concepts.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Resnik, P.: Semantic similarity in a taxonomy. An Information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999)
Hliaoutakis, A., Varelas, G., Voutsakis, E., Petrakis, E.G.M., Milios, E.: Information retrieval by semantic similarity. Int. J. Semant. Web Inf. Syst. 2, 55–73 (2006)
Pedersen, T., Pakhomov, S., Patwardhan, S., Chute, C.: Measures of semantic similarity and relatedness in the biomedical domain. J. Biomed. Inf. 40, 288–299 (2007)
Tongphu, S., Suntisrivaraporn, B., Uyyanonvara, B., Dailey, M.N.: Ontology-based object recognition of car sides. In: Proceedings of 9th International Conference on Electrical Engineering/Electronics Computer Telecommunications and Information Technology (ECTI) (2012)
Snomed, C.T.: Systematized Nomenclature of Medicine - Clinical Terminology. http://www.snomed.org/snomedct/index.html
Garla, V.N., Brandt, C.: Semantic similarity in the biomedical domain: an evaluation across knowledge sources. J. BMC Bioinf. 13, 261 (2012)
Abdelrahman, A., Kayed, A.: A survey on semantic similarity measures between concepts in health domain. Am. J. Comput. Math. 5, 204–214 (2015)
Zare, M., Pahl, C., Nilashi, M., Salim, N., Ibrahim, O.: A review of semantic similarity measures in biomedical domain using SNOMED CT. J. Soft Comput. Decis. Support Syst. 2(6), 1–13 (2015)
Choi, I., Kim, M.: Topic distillation using hierarchy concept tree. In: Proceedings of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, pp. 371–372 (2003)
AI-Mubaid, H., Nguyen, H.: A cluster-based approach for semantic similarity in the biomedical domain. In: Proceedings of the 28th IEEE EMBS Annual International Conference. New York City, USA, 3 September 2006
Batet, M., Sanchez, D., Valls, A.: An ontology-based measure to compute semantic similarity in biomedicine. J. Biomed. Inf. 44, 118–125 (2011)
SNOMED CT Basics. https://www.scribd.com/Snomed-Ct/Basics/Nehta/Taping/Aug08
Tongphu, S., Suntisrivaraporn, B.: Algorithms for measuring similarity between ELH concept descriptions: a case study on SNOMED CT. J. Comput. Inf. 36(4), 733–764 (2017)
Liberman, M., Sproat, R.: The Stress and Structure of Modified Noun Phrases in English. Stanford University (1992)
Gomaa, W.H., Fahmy, A.A.: A survey of text similarity approaches. Int. J. Comput. Appl. 68(13), 13–18 (2013)
Petrakis, E.G.M., Hliaoutakis, G.V.A., Raftopoulou, P.: X-Similarity: computing semantic similarity between concepts from different ontologies. J. Digital Inf. Manag. 4, 233–237 (2006)
Ko, S., Han, Y., Salomma, K.: approximate matching between a context-free grammar and a finite-state automaton. Information and Computation, pp. 278–289, February 2016
Acknowledgments
The first author is supported by Sirindhorn International Institute of Technology (SIIT) under an Excellent Foreign Student (EFS) scholarship. We are thankful to all those anonymous experts who spent their valuable time to evaluate the similarity between diseases.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2017 Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Htun, H.H., Sornlertlamvanich, V. (2017). Concept Name Similarity Measure on SNOMED CT. In: Chen, J., Theeramunkong, T., Supnithi, T., Tang, X. (eds) Knowledge and Systems Sciences. KSS 2017. Communications in Computer and Information Science, vol 780. Springer, Singapore. https://doi.org/10.1007/978-981-10-6989-5_7
Download citation
DOI: https://doi.org/10.1007/978-981-10-6989-5_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-10-6988-8
Online ISBN: 978-981-10-6989-5
eBook Packages: Computer ScienceComputer Science (R0)