Abstract
Question generation (QG) from technical text has multiple important applications such as creation of question-banks for examinations, interviews as well as in intelligent tutoring systems. However, much of the existing work for QG has focused on open-domain and not specifically on technical domain. We propose to generate technical questions using semantic templates. We also focus on ensuring that a large fraction of the generated questions are long-form, i.e., they require longer answers spanning multiple sentences. This is in contrast with existing work which has predominantly focused on generating factoid questions which have a few words or phrases as answers. Using the technical topics selected from undergraduate and graduate-level courses in Computer Science, we show that the proposed approach is able to generate questions with high acceptance rate. Further, we also show that the proposed template-based approach can be effectively leveraged using the distant supervision paradigm to finetune and significantly improve the existing sequence-to-sequence deep learning models for generating long-form, technical questions.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Chen, W., Aist, G., Mostow, J.: Generating questions automatically from informational text. In: 2nd Workshop on Question Generation (2009)
Curto, S., Mendes, A.C., Coheur, L.: Exploring linguistically-rich patterns for question generation. In: UCNLG+Eval: Language Generation and Evaluation Workshop (2011)
Du, X., Shao, J., Cardie, C.: Learning to ask: neural question generation for reading comprehension. In: 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2017)
Fan, Z., Wei, Z., Wang, S., Liu, Y., Huang, X.: A reinforcement learning framework for natural question generation using bi-discriminators. In: 27th International Conference on Computational Linguistics (COLING) (2018)
Ferragina, P., Scaiella, U.: Fast and accurate annotation of short texts with Wikipedia pages. IEEE Softw. 29(1), 70–75 (2012)
Heilman, M., Smith, N.A.: Good question! statistical ranking for question generation. In: Human Language Technologies: 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics (2010)
Hosking, T., Riedel, S.: Evaluating rewards for question generation models. In: NAACL-HLT (2019)
Kalady, S., Elikkottil, A., Das, R.: Natural language question generation using syntax and keywords. In: 3rd Workshop on Question Generation (2010)
Kilgarriff, A.: Comparing corpora. Int. J. Corpus Linguist. 6, 97–133 (2001)
Kumar, V., Ramakrishnan, G., Li, Y.F.: Putting the horse before the cart: a generator-evaluator framework for question generation from text. In: 23rd Conference on Computational Natural Language Learning (2019)
Kurdi, G., Leo, J., Parsia, B., Sattler, U., Al-Emari, S.: A systematic review of automatic question generation for educational purposes. Int. J. Artif. Intell. Educ. 30(1), 121–204 (2019). https://doi.org/10.1007/s40593-019-00186-y
Lewis, M., et al.: BART: denoising sequence-to-sequence pre-training for natural language generation, translation, and comprehension. In: Proceedings of the 58th Annual Meeting of the ACL (2020)
Liu, B., et al.: Learning to generate questions by learning what not to generate. In: WWW (2019)
Liu, B., Wei, H., Niu, D., Chen, H., He, Y.: Asking questions the human way: scalable question-answer generation from text corpus. In: WWW (2020)
Lopez, L.E., Cruz, D.K., Cruz, J.C.B., Cheng, C.: Transformer-based end-to-end question generation (2021)
Mannem, P., Prasad, R., Joshi, A.: Question generation from paragraphs at UPenn: QGSTEC system description. In: 3rd Workshop on Question Generation, QG 2000 (2000)
Mishra, S.K., Goel, P., Sharma, A., Jagannatha, A., Jacobs, D., Daume, H.: Towards automatic generation of questions from long answers. arXiv:2004.05109 (2020)
Pawar, S., Palshikar, G.K., Bhattacharyya, P.: Relation extraction: a survey. arxiv:1712.05191 (2017)
Rajpurkar, P., Zhang, J., Lopyrev, K., Liang, P.: SQuAD: 100,000+ questions for machine comprehension of text. In: EMNLP (2016)
Rao, S., Daume, H.: Answer-based adversarial training for generating clarification questions. In: NAACL-HLT (2019)
Šajatović, A., Buljan, M., Šnajder, J., Dalbelo Bašić, B.: Evaluating automatic term extraction methods on individual documents. In: Joint Workshop on Multiword Expressions and WordNet, MWE-WN 2019 (2019)
Serban, I.V., et al.: Generating factoid questions with recurrent neural networks: the 30M factoid question-answer corpus. In: 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2016)
Stefano, F., Finocchi, I., Ponzetto, S.P., Paola, V.: Efficient pruning of large knowledge graphs. In: IJCAI (2018)
Wyse, B., Piwek, P.: Generating questions from OpenLearn study units. In: 2nd Workshop on Question Generation (2009)
Yao, K., Zhang, L., Luo, T., Tao, L., Wu, Y.: Teaching machines to ask questions. In: IJCAI (2018)
Zhao, Y., Ni, X., Ding, Y., Ke, Q.: Paragraph-level neural question generation with maxout pointer and gated self-attention networks. In: EMNLP (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Pal, S., Singh, A., Datta, S., Patil, S., Bhattacharya, I., Palshikar, G. (2021). Semantic Templates for Generating Long-Form Technical Questions. In: EkÅ¡tein, K., Pártl, F., KonopÃk, M. (eds) Text, Speech, and Dialogue. TSD 2021. Lecture Notes in Computer Science(), vol 12848. Springer, Cham. https://doi.org/10.1007/978-3-030-83527-9_20
Download citation
DOI: https://doi.org/10.1007/978-3-030-83527-9_20
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-83526-2
Online ISBN: 978-3-030-83527-9
eBook Packages: Computer ScienceComputer Science (R0)