Skip to main content

Query Modification by Discovering Topics from Web Page Structures

  • Conference paper
Advanced Web Technologies and Applications (APWeb 2004)

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 3007))

Included in the following conference series:

Abstract

We propose a method that identifies from Web pages pairs of keywords in which one word describes the other and uses these relations to modify the query. It takes into account the positions of the words in the page structures when counting their occurrences and applies statistical tests to examine the differences between word co-occurrence rates. It finds related keywords more robustly regardless of the word type than the conventional methods, which do not consider page structures. It can also identify subject and description keywords in the user’s input and find additional keywords for detailing the query. By considering the document structures, our method can construct queries that are more focused on the user’s topic of interest.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Butler, D.: Souped-up search engines. Nature 405, 112–115 (2000)

    Article  Google Scholar 

  2. Baeza-Yates, R., Ribeiro-Neto, B.: Modern Information Retrieval. Addison-Wesley, Reading (1999)

    Google Scholar 

  3. Quinlan, J.R.: Induction of decision trees. Machine Learning 1, 81–106 (1986)

    Google Scholar 

  4. Oyama, S., Ishida, T.: Applying assocation rules to information navigation. Systems and Computers in Japan 34(4), 12–20 (2003)

    Article  Google Scholar 

  5. Snedecor, G., Cochran, W.G.: Statistical Methods. Iowa State University Press, Iowa (1989)

    MATH  Google Scholar 

  6. Harary, F., Norman, R.Z., Cartwright, D.: Structural Models: An Introduction to the Theory of Directed Graphs. John Wiley & Sons, Chichester (1965)

    MATH  Google Scholar 

  7. Sanderson, M., Croft, B.: Deriving concept hierarchies from text. In: Proceedings of the 22nd ACM SIGIR Conference (SIGIR 1999), pp. 206–213 (1999)

    Google Scholar 

  8. Glover, E., Pennock, D.M., Lawrence, S., Krovetz, R.: Inferring hierarchical descriptions. In: Proceedings of the 11th International Conference on Information and Knowledge Management (CIKM 2002), pp. 507–514 (2002)

    Google Scholar 

  9. Hearst, M.A.: Automatic acquisition of hyponyms from large text corpora. In: Proceedings of the 14th International Conference on Computational Linguistics (COLING 1992), pp. 539–545 (1992)

    Google Scholar 

  10. Liu, B., Chin, C.W., Ng, H.T.: Mining topic-specific concepts and definitions on the web. In: Proceedings of the 12th international conference on World Wide Web (WWW 2003), pp. 251–260 (2003)

    Google Scholar 

  11. Calado, P., da Silva, A.S., Vieira, R.C., Laender, A.H.F., Ribeiro-Neto, B.A.: Searching web databases by structuring keywordbased queries. In: Proceedings of the 11th International Conference on Information and Knowledge Management (CIKM 2002), pp. 26–33 (2002)

    Google Scholar 

  12. Oyama, S., Tanaka, K.: Exploiting document structures for comparing and exploring topics on the web. In: The 12th International World Wide Web Conference (WWW 2003), Poster Session (2003)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2004 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Oyama, S., Tanaka, K. (2004). Query Modification by Discovering Topics from Web Page Structures. In: Yu, J.X., Lin, X., Lu, H., Zhang, Y. (eds) Advanced Web Technologies and Applications. APWeb 2004. Lecture Notes in Computer Science, vol 3007. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-24655-8_60

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-24655-8_60

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-21371-0

  • Online ISBN: 978-3-540-24655-8

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics