skip to main content
article

Revisiting Document Length Hypotheses: A Comparative Study of Japanese Newspaper and Patent Retrieval

Authors Info & Claims
Published:01 June 2005Publication History
First page image

References

  1. BERGER, A., AND LAFFERTY, J. 1999. Information retrieval as statistical translation. In Proceedings of the 1999 ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 222-229. Google ScholarGoogle Scholar
  2. CALLAN, J. P., LU, Z., AND CROFT, W. B. 1995. Searching distributed collections with inference networks. In Proceedings of the 1995 ACM SIGIR Conference on Research and Development in Information Retrieval, Seattle, WA, 21-28. Google ScholarGoogle Scholar
  3. CHEN, K. H., CHEN, H. H., KISHIDA, K., KURIYAMA, K., KANDO, N., LEE, S., MYAENG, S. H., EGUCHII, K., AND KIM, H. 2002. Overview of CLIR task at the third NTCIR workshop. In Working Notes of the Third NTCIR Workshop Meeting, Part I: Overview, 23-60.Google ScholarGoogle Scholar
  4. EVANS, D. A., AND LEFFERTS, R. 1993. Design and evaluation of the CLARIT-TREC-2 System. In NIST Special Publication 500-215: The Second Text REtrieval Conference (TREC 2), 137-150.Google ScholarGoogle Scholar
  5. FANG, H. TAO, T., AND ZHAI, C. 2003. An exploration of formalized information retrieval heuristics. In Proceedings of the ACM SIGIR 2003 Workshop on Mathematical/Formal Methods in IR, Toronto, Canada. Google ScholarGoogle Scholar
  6. FUJII, A., IWAYAMA, M., AND KANDO, N. 2004. Overview of Patent Retrieval Task at NTCIR-4. In Working notes of the fourth NTCIR workshop meeting, 225-232.Google ScholarGoogle Scholar
  7. FUJITA, S. 2000. Reflections on "Aboutness"-TREC-9 Evaluation experiments at Justsystem. In NIST Special Publication 500-249: The Ninth Text REtrieval Conference (TREC 9), 281-288.Google ScholarGoogle Scholar
  8. FUJITA, S. 2001. More reflections on "Aboutness"-TREC-2001 Evaluation experiments at Justsystem. In NIST Special Publication 500-250: The Tenth Text REtrieval Conference (TREC 2001), 331-338.Google ScholarGoogle Scholar
  9. HIEMSTRA, D., AND KRAAIJ, W. 1998. Twenty-one at TREC-7: Ad-hoc and cross-language track. In NIST Special Publication 500-242: The Seventh Text REtrieval Conference (TREC 7), 227-238.Google ScholarGoogle Scholar
  10. INTERNATIONAL PATENT CLASSIFICATION (IPC). http://www.wipo.int/classifications/fulltext/new_ipc/Google ScholarGoogle Scholar
  11. IWAYAMA, M., FUJII, A., KANDO, N., AND TAKANO, A. 2002. Overview of Patent Retrieval Task at NTCIR-3. In Working notes of the third NTCIR workshop meeting, Part I: Overview, 67-76.Google ScholarGoogle Scholar
  12. IWAYAMA, M., FUJII, A., KANDO, N., AND MARUKAWA, Y. 2003. An empirical study on retrieval models for different document genres: patents and newspaper articles. In Proceedings of the 2003 ACM SIGIR Conference on Research and Development in Information Retrieval, Toronto, Canada, 251-258. Google ScholarGoogle Scholar
  13. KANDO, N. 2004. Overview of the Fourth NTCIR Workshop. In Working notes of the fourth NTCIR workshop meeting, i-viii.Google ScholarGoogle Scholar
  14. KISHIDA, K., CHEN, K. H., LEE, S., KURIYAMA, K., KANDO, N., CHEN, H. H., MYAENG, S. H., AND EGUCHI, K. 2004. Overview of CLIR Task at the Fourth NTCIR Workshop. In Working notes of the fourth NTCIR workshop meeting, 1-59.Google ScholarGoogle Scholar
  15. KRAAIJ, W., WESTERVELD, T., AND HIEMSTRA, D. 2002. The importance of prior probabilities for entry page search. In Proceedings of the Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), Tampere, Finland, 27-34. Google ScholarGoogle Scholar
  16. KWOK, K. L., PAPADOPOLOUS, L., AND KWAN, K. Y. Y. 1992. Retrieval experiments with a large collection using PIRCS, NIST Special Publication 500-207: The First Text REtrieval Conference (TREC-1), 153-172.Google ScholarGoogle Scholar
  17. LAFFERTY, J., AND ZHAI, C. 2001. Document language models, query models, and risk minimization for information retrieval. In Proceedings of the 2001 ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 111-119. Google ScholarGoogle Scholar
  18. LARKEY, L. S. 1999. A patent search and classification system. In Digital Libraries 99 - The Fourth ACM Conference on Digital Libraries, Berkeley, CA, Aug 1999, 79-87. Google ScholarGoogle Scholar
  19. LARKEY, L. S., CONNELL, M., AND CALLAN, J. 2000. Collection selection and results merging with topically organized U. S. Patents and TREC Data. In Proceedings of the Ninth International Conference on Information Knowledge and Management, Washington D.C., 2000, 282-289. Google ScholarGoogle Scholar
  20. MILLER, D. H., LEEK, T., AND SCHWARTZ, R. 1999. A hidden Markov model information retrieval system, In Proceedings of the 1999 ACM SIGIR Conference on Research and Development in Information Retrieval, Berkeley, CA, 214-221. Google ScholarGoogle Scholar
  21. OGILVIE, O., AND CALLAN, J. 2002. Experiments using the lemur toolkit. In NIST Special Publication 500-250: The Tenth Text REtrieval Conference (TREC 2001), 103-108.Google ScholarGoogle Scholar
  22. PONTE, J., AND COFT, W. B. 1998. A language modeling approach to information retrieval. In Proceedings of the 1998 ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 275-281. Google ScholarGoogle Scholar
  23. ROBERTSON, S. E., AND WALKER S. 1994. Some simple effective approximations to the 2-poisson model for probabilistic weighted retrieval. In Proceedings of the 1994 ACM SIGIR Conference on Research and Development in Information Retrieval, Dublin, Ireland, 232-241. Google ScholarGoogle Scholar
  24. ROBERTSON, S. E., WALKER, S., JONES, S. M., HANCOCK-BEAULIEU, M., AND GATFORD, M. 1995. Okapi at TREC-3. In NIST Special Publication 500-226: Overview of the Third Text REtrieval Conference (TREC-3), 109-126.Google ScholarGoogle Scholar
  25. ROBERTSON, S. E., AND WALKER S. 1997. On relevance weights with little relevance information. In Proceedings of the 1997 ACM SIGIR Conference on Research and Development in Information Retrieval, Philadelphia, PA, 16-24. Google ScholarGoogle Scholar
  26. ROCCHIO, J. J. 1971. Relevance feedback in information retrieval, In The SMART Retrieval System: Experiments in Automatic Document Processing, G. SALTON, ed., Prentice-Hall, Englewood Cliffs, NJ, 313-323.Google ScholarGoogle Scholar
  27. SALTON, G. 1988. Automatic Text Processing--The Transformation, Analysis, and Retrieval of Information by Computer, Addison-Wesley Publishing Company, Reading, MA. Google ScholarGoogle Scholar
  28. SINGHAL, A., BUCKLEY, C., AND MITRA, M. 1996. Pivoted document length normalization. In Proceedings of the 1996 ACM SIGIR Conference on Research and Development in Information Retrieval, Zurich, Switzerland, 21-29. Google ScholarGoogle Scholar
  29. WESTERVELD, T., KRAAIJ, W., AND HIEMSTRA, D. 2002. Retrieving Web pages using content, links, URLs and anchors. In NIST Special Publication 500-250: The Tenth Text REtrieval Conference (TREC 2001), 663- 672.Google ScholarGoogle Scholar
  30. ZHAI, C., AND LAFFERTY, J. 2001a. Model-based feedback in the KL-divergence retrieval model. In Proceedings of the Tenth International Conference on Information and Knowledge Management (CIKM 2001), Atlanta, GA, 403-410. Google ScholarGoogle Scholar
  31. ZHAI, C., AND LAFFERTY, J. 2001b. A study of smoothing methods for language models applied to ad hoc information retrieval. In Proceedings of the 2001 ACM SIGIR Conference on Research and Development in Information Retrieval, New Orleans, LA, 334-342. Google ScholarGoogle Scholar

Index Terms

  1. Revisiting Document Length Hypotheses: A Comparative Study of Japanese Newspaper and Patent Retrieval

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Asian Language Information Processing
          ACM Transactions on Asian Language Information Processing  Volume 4, Issue 2
          June 2005
          179 pages
          ISSN:1530-0226
          EISSN:1558-3430
          DOI:10.1145/1105696
          Issue’s Table of Contents

          Copyright © 2005 ACM

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 1 June 2005
          Published in talip Volume 4, Issue 2

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • article

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader