Query Modification by Discovering Topics from Web Page Structures

Oyama, Satoshi; Tanaka, Katsumi

doi:10.1007/978-3-540-24655-8_60

Satoshi Oyama¹⁶ &
Katsumi Tanaka¹⁶

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3007))

Included in the following conference series:

Asia-Pacific Web Conference

522 Accesses
10 Citations

Abstract

We propose a method that identifies from Web pages pairs of keywords in which one word describes the other and uses these relations to modify the query. It takes into account the positions of the words in the page structures when counting their occurrences and applies statistical tests to examine the differences between word co-occurrence rates. It finds related keywords more robustly regardless of the word type than the conventional methods, which do not consider page structures. It can also identify subject and description keywords in the user’s input and find additional keywords for detailing the query. By considering the document structures, our method can construct queries that are more focused on the user’s topic of interest.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Butler, D.: Souped-up search engines. Nature 405, 112–115 (2000)
Article Google Scholar
Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)
Google Scholar
Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)
Google Scholar
Oyama, S., Ishida, T.: Applying assocation rules to information navigation. Systems and Computers in Japan 34(4), 12–20 (2003)
Article Google Scholar
Snedecor, G., Cochran, W.G.: Statistical Methods. Iowa State University Press, Iowa (1989)
MATH Google Scholar
Harary, F., Norman, R.Z., Cartwright, D.: Structural Models: An Introduction to the Theory of Directed Graphs. John Wiley & Sons, Chichester (1965)
MATH Google Scholar
Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: Proceedings of the 22nd ACM SIGIR Conference (SIGIR 1999), pp. 206–213 (1999)
Google Scholar
Glover, E., Pennock, D.M., Lawrence, S., Krovetz, R.: Inferring hierarchical descriptions. In: Proceedings of the 11th International Conference on Information and Knowledge Management (CIKM 2002), pp. 507–514 (2002)
Google Scholar
Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (COLING 1992), pp. 539–545 (1992)
Google Scholar
Liu, B., Chin, C.W., Ng, H.T.: Mining topic-specific concepts and definitions on the web. In: Proceedings of the 12th international conference on World Wide Web (WWW 2003), pp. 251–260 (2003)
Google Scholar
Calado, P., da Silva, A.S., Vieira, R.C., Laender, A.H.F., Ribeiro-Neto, B.A.: Searching web databases by structuring keywordbased queries. In: Proceedings of the 11th International Conference on Information and Knowledge Management (CIKM 2002), pp. 26–33 (2002)
Google Scholar
Oyama, S., Tanaka, K.: Exploiting document structures for comparing and exploring topics on the web. In: The 12th International World Wide Web Conference (WWW 2003), Poster Session (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Social Informatics, Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, Kyoto, 606-8501, Japan
Satoshi Oyama & Katsumi Tanaka

Authors

Satoshi Oyama
View author publications
You can also search for this author in PubMed Google Scholar
Katsumi Tanaka
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Chinese University of Hong Kong, Hong Kong, China
Jeffrey Xu Yu
The University of News South Wales, NSW 2052, Australia
Xuemin Lin
Department of Computer Science, Tsinghua University, 100084, Beijing, P.R. China
Hongjun Lu
Victoria University, Australia
Yanchun Zhang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Oyama, S., Tanaka, K. (2004). Query Modification by Discovering Topics from Web Page Structures. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds) Advanced Web Technologies and Applications. APWeb 2004. Lecture Notes in Computer Science, vol 3007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24655-8_60

Download citation

DOI: https://doi.org/10.1007/978-3-540-24655-8_60
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-21371-0
Online ISBN: 978-3-540-24655-8
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics