Abstract
We propose a method that identifies, from Web pages, pairs of keywords in which one word describes the other, and uses these relations to modify the query. When counting word occurrences, it takes into account the positions of the words in the page structure, and it applies statistical tests to examine the differences between word co-occurrence rates. Regardless of word type, it finds related keywords more robustly than conventional methods, which do not consider page structure. It can also identify subject and description keywords in the user’s input and find additional keywords for detailing the query. By considering document structure, our method can construct queries that are more focused on the user’s topic of interest.
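The abstract does not say which statistical test is used or exactly which co-occurrence rates are compared. As a rough illustration only, the sketch below assumes a pooled two-proportion z-test and one plausible reading of the idea: a candidate word is treated as describing a subject word if it appears in page bodies at a higher rate when the subject is in the title than when the subject appears only in the body. The function names, the choice of test, the significance level, and the counts are all hypothetical and are not taken from the paper.

```python
import math


def two_proportion_z_test(hits_a, total_a, hits_b, total_b):
    """Pooled two-sided z-test for the difference between two proportions.

    Returns (z, p_value). This particular test is an assumption; the
    abstract only says that "statistical tests" are applied.
    """
    rate_a = hits_a / total_a
    rate_b = hits_b / total_b
    pooled = (hits_a + hits_b) / (total_a + total_b)
    std_err = math.sqrt(pooled * (1 - pooled) * (1 / total_a + 1 / total_b))
    if std_err == 0:
        # Both samples are all hits or all misses: no evidence of a difference.
        return 0.0, 1.0
    z = (rate_a - rate_b) / std_err
    # Two-sided p-value under the standard normal distribution.
    p_value = math.erfc(abs(z) / math.sqrt(2))
    return z, p_value


def looks_like_description(hits_title, pages_title, hits_body, pages_body,
                           alpha=0.05):
    """Hypothetical decision rule built on the test above.

    hits_title / pages_title: pages with the subject word in the title, and
        how many of them contain the candidate word in the body.
    hits_body / pages_body: the same counts for pages where the subject word
        appears only in the body.
    """
    z, p_value = two_proportion_z_test(hits_title, pages_title,
                                       hits_body, pages_body)
    return z > 0 and p_value < alpha


if __name__ == "__main__":
    # Illustrative counts only, not data from the paper.
    z, p = two_proportion_z_test(120, 400, 60, 400)
    print(f"z = {z:.2f}, p = {p:.2g}")
    print(looks_like_description(120, 400, 60, 400))
```

A structure-blind baseline would pool both groups into a single co-occurrence count, so the asymmetry between title and body positions that this test looks for would not be visible.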
Copyright information
© 2004 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Oyama, S., Tanaka, K. (2004). Query Modification by Discovering Topics from Web Page Structures. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds) Advanced Web Technologies and Applications. APWeb 2004. Lecture Notes in Computer Science, vol 3007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24655-8_60
DOI: https://doi.org/10.1007/978-3-540-24655-8_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21371-0
Online ISBN: 978-3-540-24655-8
eBook Packages: Springer Book Archive