Skip to main content
Log in

A Survey on Retrieval of Mathematical Knowledge

  • Published:
Mathematics in Computer Science Aims and scope Submit manuscript

Abstract

We present a survey of the literature on indexing and retrieval of mathematical knowledge, with pointers to 77 papers and tentative taxonomies of both retrieval problems and recurring techniques.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Adeel, M., Cheung, H.S., Khiyal, S.H.: Math GO! prototype of a content based mathematical formula search engine. J. Theor. Appl. Inf. Technol. 4(10), 1002–1012 (2008)

    Google Scholar 

  2. Ahmadi, S.A., Youssef, A.: Lexical error compensation in handwritten-based mathematical information retrieval. In: DML 2008. Towards digital mathematics library, Birmingham, UK, July 27th, 2008. Proceedings, pp. 43–54. Masaryk University, Brno (2008)

  3. Aizawa, A., Kohlhase, M., Ounis, I.: NTCIR-10 math pilot task overview. In: Proceedings of the 10th NTCIR conference, Tokyo, pp. 654–661 (2013)

  4. Aizawa, A., Kohlhase, M., Ounis, I.: NTCIR-11 math 2 task overview. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 88–98 (2014)

  5. Altamimi, M.E., Youssef, A.: A more canonical form of content MathML to facilitate math search. In: Proc extreme markup languages (2007)

  6. Altamimi, M.E., Youssef, A.S.: Wildcards in math search, implementation issues. In: Proceedings of the ISCA 20th international conference on computer applications in industry and engineering, CAINE 2007, November 7–9, 2007, San Francisco, pp. 90–96 (2007)

  7. Altamimi, M.E., Youssef, A.S.: A math query language with an expanded set of wildcards. Math. Comput. Sci. 2(2), 305–331 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  8. Asperti, A., Guidi, F., Coen, C.S., Tassi, E., Zacchiroli, S.: A content based mathematical search engine: Whelp. In: Types for proofs and programs, pp. 17–32. Springer (2006)

  9. Asperti, A., Selmi, M.: Efficient retrieval of mathematical statements. In: Mathematical knowledge management, third international conference, MKM 2004, Bialowieza, September 19–21, 2004, Proceedings, pp. 17–31 (2004)

  10. Asperti, A., Zacchiroli, S.: Searching mathematics on the web: state of the art and future developments. In: FIZ, editor, Proceedings of New Developments in Electronic Publishing of Mathematics, pp. 9–18 (2004)

  11. Bancerek, G.: Information retrieval and rendering with MML Query. In: Mathematical knowledge management, pp. 266–279. Springer (2006)

  12. Bancerek, G., Rudnicki, P.: Information retrieval in MML. In: Mathematical knowledge management, second international conference, MKM 2003, Bertinoro, February 16–18, 2003, Proceedings, pp. 119–132 (2003)

  13. Bancerek, G., Urban, J.: Integrated semantic browsing of the mizar mathematical library for authoring mizar articles. In: Mathematical knowledge management, third international conference, MKM 2004, Bialowieza, September 19–21, 2004, Proceedings, pp. 44–57 (2004)

  14. Baumgartner, P., Furbach, U.: Automated deduction techniques for the management of personalized documents. Annals of Mathematics and Artificial Intelligence 38(1–3), 211–228 (2003)

    Article  MATH  Google Scholar 

  15. Cairns, P.A.: Informalising formal mathematics: searching the mizar library with latent semantics. In: Mathematical knowledge management, third international conference, MKM 2004, Bialowieza, September 19–21, 2004, Proceedings, pp. 58–72 (2004)

  16. Caprotti, O., Dewar, M., Turi, D.: Mathematical service matching using description Logic and OWL. In: Mathematical knowledge management, 3rd international conference, MKM 2004, Bialowieza, September 19–21, 2004, Proceedings, pp. 73–87 (2004)

  17. Delahaye, D.: Information retrieval in a Coq proof library using type isomorphisms. In: Types for proofs and programs, pp. 131–147. Springer (2000)

  18. Formánek, D., Líška, M., Růžička, M., Sojka, P.: Normalization of digital mathematics library content. In: Davenport, J., Jeuring, J., Lange, C., Libbrecht, P. (eds.) Joint Proceedings of the 24th OpenMath Workshop, the 7th workshop on mathematical user interfaces (MathUI), and the work in progress section of the conference on intelligent computer mathematics, number 921 in CEUR Workshop Proceedings, pp. 91–103, Aachen (2012)

  19. Gao, L., Wang, Y., Hao, L., Tang, Z.: ICST math retrieval system for NTCIR-11 Math-2 Task. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 99–102 (2014)

  20. Gauthier, T., Kaliszyk, C.: Matching concepts across HOL Libraries. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, July 7–11, 2014. Proceedings, pp. 267–281 (2014)

  21. González Pinto, J.M., Barthel, S., Balke, W.-T.: QUALIBETA at the NTCIR-11 math 2 task: an attempt to query math collections. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 103–107 (2014)

  22. Guidi, F., Coen, C.S.: A survey on retrieval of mathematical knowledge. In: Kerber, M., Carette, J., Kaliszyk, C., Rabe, F., Sorge, V., (eds.) Intelligent computer mathematics—international conference, CICM 2015, Washington, DC, USA, July 13–17, 2015, Proceedings, Lecture Notes in Computer Science, vol. 9150, pp. 296–315. Springer (2015)

  23. Guidi, F., Schena, I.: A query language for a metadata framework about mathematical resources. In: Mathematical knowledge management, second international conference, MKM 2003, Bertinoro, February 16–18, 2003, Proceedings, pp. 105–118 (2003)

  24. Hagino, H., Saito, H.: Partial-match retrieval with structure-reflected indices at the NTCIR-10 math task. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 692–695 (2013)

  25. Hambasan, R., Kohlhase, M., Prodescu, C.: MathWebSearch at NTCIR-11. In: Proc. 10th NTCIR Conference (Tokyo, Japan), pp. 114–119 (2014)

  26. Haralambous, Y., Quaresma, P.: Querying geometric figures using a controlled language, ontological graphs and dependency lattices. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, Portugal, July 7–11, 2014. Proceedings, pp. 298–311 (2014)

  27. Hashimoto, H., Hijikata, Y., Nishida, S.: Incorporating breadth first search for indexing MathML objects. In: Systems, Man and Cybernetics, 2008. SMC 2008. IEEE international conference on, pp. 3519–3523 (2008)

  28. Kamali, S., Tompa, F.W.: Improving mathematics retrieval. In: DML 2009. Towards digital mathematics library, Grand Bend, Ontario, Canada, July 8–9th 2009. Proceedings, pp. 37–48. Masaryk University, Brno (2009)

  29. Kamali, S., Tompa, F.W.: A new mathematics retrieval system. In: Proceedings of the 19th ACM international conference on information and knowledge management. CIKM ‘10, pp. 1413–1416. ACM, New York (2010)

  30. Kamali, S., Tompa, F.W.: Structural similarity search for mathematics retrieval. In: Intelligent computer mathematics—MKM, calculemus, DML, and systems and projects 2013, Held as Part of CICM 2013, Bath, July 8–12, 2013. Proceedings, pp. 246–262 (2013)

  31. Kohlhase, A.: Search interfaces for mathematicians. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, July 7–11, 2014. Proceedings, pp. 153–168 (2014)

  32. Kohlhase, A., Kohlhase, M.: Re examining the MKM value proposition: from math web search to math web re search. In: Towards mechanized mathematical assistants, 14th Symposium, Calculemus 2007, 6th International Conference, MKM 2007, Hagenberg, June 27–30, 2007, Proceedings, pp. 313–326 (2007)

  33. Kohlhase, M., Prodescu, C.: MathWebSearch at NTCIR-10. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 675–679 (2013)

  34. Kristianto, G.Y., Topić, G., Ho, F., Aizawa, A.: The MCAT math retrieval system for NTCIR-11 math track. In: Proc. 11th NTCIR conference (Tokyo, Japan), pp. 120–126 (2014)

  35. Larson, R.R., Reynolds, C.J., Gey, F.C.: The abject failure of keyword IR for mathematics search: Berkeley at NTCIR-10 Math. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 662–666 (2013)

  36. Libbrecht, P.: Escaping the trap of too precise topic queries. In: Intelligent computer mathematics—MKM, calculemus, DML, and systems and projects 2013, Held as Part of CICM 2013, Bath, July 8–12, 2013. Proceedings, pp. 296–309 (2013)

  37. Libbrecht, P., Desmoulins, C., Mercat, C., Laborde, C., Dietrich, M., Hendriks, M.: Cross-curriculum search for intergeo. In: Intelligent computer mathematics, 9th international conference, AISC 2008, 15th Symposium, Calculemus 2008, 7th international conference, MKM 2008, Birmingham, July 28–August 1, 2008. Proceedings, pp. 520–535 (2008)

  38. Libbrecht, P., Melis, E.: Methods to access and retrieve mathematical content in activemath. In: Mathematical software-ICMS 2006, pp. 331–342. Springer (2006)

  39. Lipani, A., Andersson, L., Piroi, F., Lupu, M., Hanbury, A.: TUW-IMP at the NTCIR-11 Math-2. In: Proc. 11th NTCIR conference (Tokyo, Japan), pp. 143–146 (2014)

  40. Líška, M.: Searching Mathematical Texts. Bachelor’s thesis, Masaryk University, Faculty of Informatics, Brno (2010)

  41. Líška, M.: Evaluation of Mathematics Retrieval. Master’s thesis, Masaryk University, Faculty of Informatics, Brno (2013)

  42. Líška, M., Sojka, P., Líška, M., Mravec, P.: Web interface and collection for mathematical retrieval: WebMIaS and MREC. In: DML 2011. Towards digital mathematics library, Bertinoro, July 20–21st, 2011. Proceedings, pp. 77–84. Masaryk University, Brno (2011)

  43. Líška, M., Sojka, P., Růžička, M.: Similarity search for mathematics: Masaryk University team at the NTCIR-10 Math Task. In: Proc. 10th NTCIR Conference (Tokyo, Japan), pp. 686–691 (2013)

  44. Líška, M., Sojka, P., Růžička, M.: Math indexer and searcher web interface—towards fulfillment of mathematicians’ information needs. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, Portugal, July 7–11, 2014. Proceedings, pp. 444–448 (2014)

  45. Líška, M., Sojka, P.: Růžička, M.: Combining text and formula queries in math information retrieval: evaluation of query results merging strategies. In: Proceedings of the first international workshop on novel web search interfaces and systems. NWSearch ’15, pp. 7–9. ACM, New York (2015)

  46. Miller, B.R.: Three years of DLMF: Web, Math and Search. In: Intelligent computer mathematics—MKM, Calculemus, DML, and systems and projects 2013, Held as Part of CICM 2013, Bath, July 8–12, 2013. Proceedings, pp. 288–295 (2013)

  47. Miller, B.R., Youssef, A.: Technical Aspects of the digital library of mathematical functions. Ann. Math. Artif. Intell. 38(1–3), 121–136 (2003)

    Article  MathSciNet  MATH  Google Scholar 

  48. Miller, B.R., Youssef, A.: Augmenting presentation MathML for Search. In: Intelligent computer mathematics, 9th international conference, AISC 2008, 15th symposium, calculemus 2008, 7th international conference, MKM 2008, Birmingham, July 28–August 1, 2008. Proceedings, pp. 536–542 (2008)

  49. Miner, R., Munavalli, R.: An approach to mathematical search through query formulation and data normalization. In: Towards mechanized mathematical assistants, 14th symposium, calculemus 2007, 6th international conference, MKM 2007, Hagenberg, June 27–30, 2007, Proceedings, pp. 342–355 (2007)

  50. Misutka, J., Galambos, L.: Mathematical extension of full text search engine indexer. In: Information and communication technologies: from theory to applications, 2008. ICTTA 2008. 3rd international conference on, pp. 1–6 (2008)

  51. Mišutka, J., Galamboš, L.: Extending full text search engine for mathematical content. In: DML 2008. Towards digital mathematics library, Birmingham, July 27th, 2008. Proceedings, pp. 55–67. Masaryk University, Brno (2008)

  52. Munavalli, R., Miner, R.: Mathfind: a math-aware search engine. In: Proceedings of the 29th annual international ACM SIGIR conference on research and development in information retrieval, pp. 735–735. ACM (2006)

  53. Nghiem, M.-Q., Kristianto, G.Y., Topić, G., Aizawa, A.: Which one is better: presentation-based or content-based math search? In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, July 7–11, 2014. Proceedings, pp. 200–212 (2014)

  54. Nguyen, T.T., Chang, K., Hui, S.C.: A math-aware search engine for math question answering system. In: Proceedings of the 21st ACM international conference on information and knowledge management, pp. 724–733. ACM (2012)

  55. Normann, I., Kohlhase, M.: Extended formula normalization for epsilon-retrieval and sharing of mathematical knowledge. In: Towards mechanized mathematical assistants, 14th symposium, Calculemus 2007, 6th international conference, MKM 2007, Hagenberg, June 27–30, 2007, Proceedings, pp. 356–370 (2007)

  56. Pattaniyil, N., Zanibbi, R.: Combining TF-IDF text retrieval with an inverted index over symbol pairs in math expressions: the tangent math search engine at NTCIR 2014. In: Proc. 11th NTCIR conference (Tokyo, Japan), pp. 135–142 (2014)

  57. Rabe, F.: A query language for formal mathematical libraries. In: Intelligent computer mathematics—11th international conference, AISC 2012, 19th symposium, Calculemus 2012, 5th international workshop, DML 2012, 11th international conference, MKM 2012, systems and projects, Held as Part of CICM 2012, Bremen, July 8–13, 2012. Proceedings, pp. 143–158 (2012)

  58. Růžička, M., Sojka, P., Líška, M.: Math indexer and searcher under the hood: history and development of a winning strategy. In: Proc. 11th NTCIR conference (Tokyo, Japan), pp. 127–134 (2014)

  59. Schöneberg, U., Sperber, W.: POS tagging and its applications for mathematics—text analysis in mathematics. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbral, July 7–11, 2014. Proceedings, pp. 213–223 (2014)

  60. Schubotz, M., Leich, M., Markl, V.: Querying large collections of mathematical publications: NTCIR10 math task. In: Proc. 10th NTCIR Conference (Tokyo, Japan), pp. 667–674 (2013)

  61. Schubotz, M., Youssef, A., Markl, V., Cohl, H.S., Li, J.J.: Evaluation of similarity-measure factors for formulae based on the NTCIR-11 Math Task. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 108–113 (2014)

  62. Shatnawi, M., Youssef, A.: Equivalence detection using parse-tree normalization for math search. In: Second IEEE international conference on digital information management (ICDIM). December 11–13, 2007, pp. 643–648. Lyon, Proceedings (2007)

  63. Sojka, P., Líška, M.: The art of mathematics retrieval. In: Proceedings of the 11th ACM symposium on document engineering, pp. 57–60. ACM (2011)

  64. Topić, G., Kristianto, G.Y., Nghiem, M.-Q., Aizawa, A.: The MCAT math retrieval system for NTCIR-10 math track. In: Proc. 10th NTCIR conference (Tokyo, Japan), pp. 680–685 (2013)

  65. Wolska, M., Grigore, M.: Symbol declarations in mathematical writing. In: DML 2010. Towards digital mathematics library, Paris, July 7–8th, 2010. Proceedings, pp. 119–127. Masaryk University, Brno (2010)

  66. Yokoi, K., Aizawa, A.: An approach to similarity search for mathematical expressions using MathML. In: DML 2009. Towards digital mathematics library, Grand Bend, Ontario, Canada, July 8–9th 2009. Proceedings, pp. 27–35. Brno: Masaryk University (2009)

  67. Youssef, A.: Search of mathematical contents: issues and methods. In: Proceedings of the ISCA 14th international conference on intelligent and adaptive systems and software engineering. July 20–22, 2005, pp. 100–105. Novotel Toronto Centre, Toronto (2005)

  68. Youssef, A.: Roles of math search in mathematics. In: Mathematical knowledge management, 5th international conference, MKM 2006, Wokingham, August 11–12, 2006, Proceedings, pp. 2–16 (2006)

  69. Youssef, A.: Methods of relevance ranking and hit-content generation in math search. In: Towards mechanized mathematical assistants, 14th Symposium, Calculemus 2007, 6th international conference, MKM 2007, Hagenberg, June 27–30, 2007, Proceedings, pp. 393–406 (2007)

  70. Youssef, A., Shatnawi, M.: Math search with equivalence detection using parse-tree normalization. In: The 4th international conference on computer science and information technology (2006)

  71. Youssef, A.S.: Relevance ranking and hit description in math search. Math. Comput. Sci. 2(2), 333–353 (2008)

    Article  MathSciNet  MATH  Google Scholar 

  72. Youssef, A.S., Altamimi, M.E.: An extensive math query language. In: 16th international conference on software engineering and data engineering (SEDE-2007). July 9–11, 2007, pp. 57–63. Imperial Palace Hotel Las Vegas, Las Vegas, Proceedings (2007)

  73. Zanibbi, R., Blostein, D.: Recognition and retrieval of mathematical expressions. Int. J. Doc. Anal. Recognit. (IJDAR) 15(4), 331–357 (2012)

    Article  Google Scholar 

  74. Zanibbi, R., Orakwue, A.: Math search for the masses: multimodal search interfaces and appearance-based retrieval. In: Kerber, M., Carette, J., Kaliszyk, C., Rabe, F., Sorge, V., (eds.), Intelligent computer mathematics—international conference, CICM 2015, Washington, DC, July 13–17, 2015, Proceedings, Lecture notes in computer science, vol. 9150, pp. 18–36. Springer (2015)

  75. Zhang, Q., Youssef, A.: An approach to math-similarity search. In: Intelligent computer mathematics—international conference, CICM 2014, Coimbra, July 7–11, 2014. Proceedings, pp. 404–418 (2014)

  76. Zhang, Q., Youssef, A.: Performance evaluation and optimization of math-similarity search. In: Kerber, M., Carette, J., Kaliszyk, C., Rabe, F., Sorge, V., (eds.) Intelligent computer mathematics—international conference, CICM 2015, Washington, DC, July 13–17, 2015, Proceedings, Lecture notes in computer science, vol. 9150, pp. 243–257. Springer (2015)

  77. Zhao, J., Kan, M.-Y., Theng, Y.L.: Math information retrieval: user requirements and prototype implementation. In: Proceedings of the 8th ACM/IEEE-CS joint conference on digital libraries, pp. 187–196. ACM (2008)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Claudio Sacerdoti Coen.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Guidi, F., Sacerdoti Coen, C. A Survey on Retrieval of Mathematical Knowledge. Math.Comput.Sci. 10, 409–427 (2016). https://doi.org/10.1007/s11786-016-0274-0

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11786-016-0274-0

Navigation