Domain-Controlled Title Generation with Human Evaluation

  • Conference paper
International Conference on Innovative Computing and Communications

Abstract

We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title helps research get the attention it deserves. A title can be viewed as a highly compressed description of a document that conveys the process it implements. To generate domain-controlled titles, we use a pre-trained text-to-text transformer model together with the additional-token technique. Title tokens are sampled from a local distribution over the domain-specific vocabulary (a subset of the global vocabulary) rather than from the global vocabulary, which yields a catchy title that is closely linked to its abstract. The generated titles looked realistic, convincing, and very close to the ground truth. We performed automated evaluation using the ROUGE metric and human evaluation on five parameters to compare human-written and machine-generated titles. The generated titles were considered acceptable and received higher metric ratings than the original titles. We conclude that the proposed method is promising for domain-controlled title generation.
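As an illustration of two ideas named in the abstract, the following minimal Python sketch shows how a domain tag might be prepended to an abstract (the additional-token technique) and how a unigram-overlap ROUGE-1 F1 score could be computed between a reference title and a generated one. The tag format `<cs>`, the function names `add_domain_token` and `rouge1_f1`, and the whitespace tokenization are assumptions made for this example; they are not taken from the paper, and the official ROUGE package applies its own tokenization and optional stemming.

```python
from collections import Counter


def add_domain_token(abstract: str, domain: str) -> str:
    """Prepend a hypothetical domain tag such as '<cs>' to the model input.

    The tag format is an assumption for illustration only.
    """
    return f"<{domain}> {abstract.strip()}"


def rouge1_f1(reference: str, candidate: str) -> float:
    """Simplified ROUGE-1 F1: unigram overlap on lowercased whitespace tokens."""
    ref_counts = Counter(reference.lower().split())
    cand_counts = Counter(candidate.lower().split())
    # Clipped overlap: a word counts at most as often as it occurs in both titles.
    overlap = sum((ref_counts & cand_counts).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand_counts.values())
    recall = overlap / sum(ref_counts.values())
    return 2 * precision * recall / (precision + recall)
```

In a setup like this, the tagged string would be passed to the text-to-text model as its input, and `rouge1_f1` would be averaged over pairs of original and generated titles.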

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

Cite this paper

Waheed, A., Goyal, M., Mittal, N., Gupta, D. (2022). Domain-Controlled Title Generation with Human Evaluation. In: Khanna, A., Gupta, D., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1388. Springer, Singapore. https://doi.org/10.1007/978-981-16-2597-8_39
