Abstract
We study automatic title generation and present a method for generating domain-controlled titles for scientific articles. A good title allows you to get the attention that your research deserves. A title can be interpreted as a high-compression description of a document containing information on the implemented process. For domain-controlled titles, we used the pre-trained text-to-text transformer model and the additional token technique. Title tokens are sampled from a local distribution (which is a subset of global vocabulary) of the domain-specific vocabulary and not global vocabulary, thereby generating a catchy title and closely linking it to its corresponding abstract. Generated titles looked realistic, convincing, and very close to the ground truth. We have performed automated evaluation using ROUGE metric and human evaluation using five parameters to make a comparison between human and machine-generated titles. The titles produced were considered acceptable with higher metric ratings in contrast to the original titles. Thus we concluded that our research proposes a promising method for domain-controlled title generation.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
J.W. Putra, M.L. Khodra, Automatic title generation in scientific articles for authorship assistance: a summarization approach. J. ICT Res. Appl. 11, 253–267 (2017)
R. Hamid, M. Jamali, M. Nikzad, Article title type and its relation with the number of downloads and citations. Scientometrics 88, 653–661 (2011)
H. Xu, E. Martin, A. Mahidadia, Extractive summarisation based on keyword profile and language model, in HLT-NAACL (2015)
C. Paiva et al., Articles with short titles describing the results are cited more often. Clinics 67, 509–513 (2012)
R. Jin, A. Hauptmann, Headline Generation Using a Training Corpus. Carnegie Mellon University (Journal Contribution). https://doi.org/10.1184/R1/6606059.v1
M. Witbrock, V. Mittal, Ultra-summarization (poster abstract): a statistical approach to generating highly condensed non-extractive summaries, in SIGIR’99 (1999)
J. Kupiec, et al., A trainable document summarizer, in SIGIR’95 (1995)
J.W.G. Putra, K. Fujita, Scientific paper title validity checker utilizing vector space model and topics model, in Proceedings of Konferensi Nasional Informatika (KNIF) (2015), pp. 69–74
L. Shao, J. Wang, DTATG: an automatic title generator based on dependency trees, in KDIR (2016)
J.W.G. Putra, M.L. Khodra, Rhetorical sentence classification for automatic title generation in scientific article. TELKOMNIKA Telecommun. Comput. Electron. Control 15, 656–664 (2017)
W. Liu, et al., Multi-lingual wikipedia summarization and title generation on low resource corpus, in Proceedings of the Multiling 2019 Workshop, Co-located with the RANLP 2019 Conference, pp. 17–25
F.R. Chen, Y.-Y. Chen, Adversarial domain adaptation using artificial titles for abstractive title generation, in ACL (2019)
S. Gehrmann, et al., Improving human text comprehension through semi-Markov CRF-based neural section title generation (2019), arXiv:1904.07142
D. Bahdanau, K. Cho, Y. Bengio, Neural machine translation by jointly learning to align and translate, in ICLR (2015)
A. Vaswani, N. Shazeer, N. Parmar, J. Uszkoreit, L. Jones, A.N. Gomez, I. Polosukhin, Attention is all you need, in Advances in Neural Information Processing Systems (2017)
A.M. Dai, Q.V. Le, Semi-supervised sequence learning, in Advances in Neural Information Processing Systems NIPS’15 (2015)
C. Raffel, et al., Exploring the limits of transfer learning with a unified text-to-text transformer (2019), arXiv:1910.10683
C. Kobus, et al., Domain control for neural machine translation, in RANLP (2017)
R. Sennrich, et al., Controlling politeness in neural machine translation via side constraints, in HLT-NAACL (2016)
M. Johnson et al., Google’s multilingual neural machine translation system: enabling zero-shot translation. Trans. Assoc. Comput. Linguist. 5, 339–351 (2017)
C.B. Clement, et al., On the use of ArXiv as a dataset (2019), arXiv:1905.00075
C.-Y. Lin, ROUGE: a package for automatic evaluation of summaries, in Proceedings of the ACL Workshop: Text Summarization Branches Out (2004)
C. Lee, et al., Best practices for the human evaluation of automatically generated text, in INLG (2019)
G.A. Miller, The magical number seven plus or minus two: some limits on our capacity for processing information. Psychol. Rev. 63(2), 81–97 (1956)
P. Green, V. Rao, Rating scales and information recovery-how many scales and response categories to use? J. Mark. 34, 33–39 (1970)
R.W. Lissitz, S. Green, Effect of the number of scale points on reliability: a Monte Carlo approach. J. Appl. Psychol. 60, 10–13 (1975)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Waheed, A., Goyal, M., Mittal, N., Gupta, D. (2022). Domain-Controlled Title Generation with Human Evaluation. In: Khanna, A., Gupta, D., Bhattacharyya, S., Hassanien, A.E., Anand, S., Jaiswal, A. (eds) International Conference on Innovative Computing and Communications. Advances in Intelligent Systems and Computing, vol 1388. Springer, Singapore. https://doi.org/10.1007/978-981-16-2597-8_39
Download citation
DOI: https://doi.org/10.1007/978-981-16-2597-8_39
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-2596-1
Online ISBN: 978-981-16-2597-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)