Skip to main content

Advertisement

Log in

Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory

  • Published:
Cognitive Computation Aims and scope Submit manuscript

Abstract

With the rise in popularity of artificial intelligence, the technology of verbal communication between man and machine has received an increasing amount of attention, but generating a good conversation remains a difficult task. The key factor in human-machine conversation is whether the machine can give good responses that are appropriate not only at the content level (relevant and grammatical) but also at the emotion level (consistent emotional expression). In our paper, we propose a new model based on long short-term memory, which is used to achieve an encoder-decoder framework, and we address the emotional factor of conversation generation by changing the model’s input using a series of input transformations: a sequence without an emotional category, a sequence with an emotional category for the input sentence, and a sequence with an emotional category for the output responses. We perform a comparison between our work and related work and find that we can obtain slightly better results with respect to emotion consistency. Although in terms of content coherence our result is lower than those of related work, in the present stage of research, our method can generally generate emotional responses in order to control and improve the user’s emotion. Our experiment shows that through the introduction of emotional intelligence, our model can generate responses appropriate not only in content but also in emotion.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Notes

  1. http://tcci.ccf.org.cn/conference/2017/cfpt.php?from=groupmessage&isappinstalled=0

References

  1. Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G, Davis A, Dean J, Devin M. 2016. Tensorflow: large-scale machine learning on heterogeneous distributed systems.

  2. André E, Rehm M, Minker W, Bühler D. Endowing spoken language dialogue systems with emotional intelligence. Affective dialogue systems, tutorial and research workshop, ADS 2004, Kloster Irsee, Germany, June 14-16, 2004, proceedings, p. 178–187; 2004.

  3. Asghar MZ, Khan A, Bibi A, Kundi FM, Ahmad H. 2017. Sentence-level emotion detection framework using rule-based classification. Cognitive Computation. https://doi.org/https://doi.org/10.1007/s12559-017-9503-3.

  4. Bahdanau D, Cho K, Bengio Y. 2014. Neural machine translation by jointly learning to align and translate. Computer Science.

  5. Bosse T, Duell R, Memon ZA, Treur J, Wal CN. Agent-based modeling of emotion contagion in groups. Cogn Comput 2015;7(1):111. https://doi.org/10.1007/s12559-014-9277-9.

    Article  Google Scholar 

  6. Cassell J, Pelachaud C, Badler N, Steedman M, Achorn B, Becket T, Douville B, Prevost S, Stone M. Animated conversation: rule-based generation of facial expression, gesture & spoken intonation for multiple conversational agents. Conference on computer graphics and interactive techniques; 1994. p. 413–420.

  7. Chen YW, Zhou Q, Luo W, Du JX. Classification of Chinese texts based on recognition of semantic topics. Cogn Comput 2016;8(1):114. https://doi.org/10.1007/s12559-015-9346-8.

    Article  Google Scholar 

  8. Cho K, Merrienboer BV, Gulcehre C, Bahdanau D, Bougares F, Schwenk H, Bengio Y. 2014. Learning phrase representations using rnn encoder-decoder for statistical machine translation. Computer Science.

  9. Frederking RE. A rule-based conversation participant. Meeting of the association for computational linguistics; 2007.

  10. Graves A. Long short-term memory. Berlin: Springer; 2012.

    Book  Google Scholar 

  11. Graves A, Ndez S, Schmidhuber J. Rgen bidirectional LSTM networks for improved phoneme classification and recognition. Artificial neural networks: formal MODELS and their applications—ICANN 2005, International conference, warsaw, poland, September 11-15, 2005, proceedings, p. 799–804; 2005.

  12. Hao W, Fan J, Zhang Z, Zhu G. 2017. End-to-end lifelong learning: a framework to achieve plasticities of both the feature and classifier constructions. Cognitive Computation. https://doi.org/10.1007/s12559-017-9514-0.

  13. Hochreiter S, Schmidhuber J. 1996. LSTM can solve hard long time lag problems. Advances in neural information processing systems p. 473–479.

  14. Ji Z, Lu Z, Li H. 2014. An information retrieval approach to short text conversation. Computer Science.

  15. Kadish D, Kummer N, Dulic A, Najjaran H. The empathy machine. International conference on entertainment computing; 2012. p. 461–464.

  16. Khan FH, Qamar U, Bashir S. Multi-objective model selection (moms)-based semi-supervised framework for sentiment analysis. Cogn Comput 2016;8(4):614–628. https://doi.org/10.1007/s12559-016-9386-8.

  17. Li J, Galley M, Brockett C, Gao J, Dolan B. 2015. A diversity-promoting objective function for neural conversation models. Computer Science.

  18. Mayer JD, Salovey P. 1997. What is emotional intelligence?.

  19. Mikolov T, Chen K, Corrado G. 2014. word2vec.

  20. Oneto L, Bisio F, Cambria E, Anguita D. Semi-supervised learning for affective common-sense reasoning. Cogn Comput 2017;9(1):18–42. https://doi.org/10.1007/s12559-016-9433-5.

    Article  Google Scholar 

  21. PASCHKE A, BOLEY H. Rule responder: rule-based agents for the semantic-pragmatic web. Int J Artif Intell Tools 2012;20(06):1043–1081.

    Article  Google Scholar 

  22. Pascual B, Gurruchaga M, Ginebra MP, Gil FJ, Planell JA, Goñi I. A neural network approach to context-sensitive generation of conversational responses. Trans R Soc Trop Med Hyg 2015;51(6):502–504.

    Google Scholar 

  23. Peng H, Cambria E, Hussain A. A review of sentiment analysis research in chinese language. Cogn Comput 2017;9(4):423–435. https://doi.org/10.1007/s12559-017-9470-8.

    Article  Google Scholar 

  24. Pham NV, Terada T, Tsukamoto M, Nishio S. An information retrieval system for supporting casual conversation in wearable computing environments. International workshop on smart appliances and wearable computing; 2005. p. 477–483.

  25. Picard RW. Affective computing. Igi Global 1997;1(1):71–73.

    Google Scholar 

  26. Planet S, Iriondo I. Children’s emotion recognition from spontaneous speech using a reduced set of acoustic and linguistic features. Cogn Comput 2013; 5 (4): 526. https://doi.org/10.1007/s12559-012-9174-z.

  27. Polzin T, Waibel A. 2000. Emotion - sensitive human - computer interfaces. The Isca Workshop on Speech & Emotion.

  28. Prendinger H, Mori J, Ishizuka M. Using human physiology to evaluate subtle expressivity of a virtual quizmaster in a mathematical game. Int J Hum Comput Stud 2005;62(2):231–245.

    Article  Google Scholar 

  29. Ptaszynski M, Dybala P, Shi W, Rzepka R, Araki K. Towards context aware emotional intelligence in machines: computing contextual appropriateness of affective states. IJCAI 2009, Proceedings of the international joint conference on artificial intelligence, Pasadena, California, USA; 2009. p. 1469–1474.

  30. R L, N P, IV S, L C, J P. 2015b. Incorporating unstructured textual knowledge sources into neural dialogue systems.

  31. Salovey P, Mayer JD. Emotional intelligence. Imagin Cogn Personality 1990;9(6):217–236.

    Google Scholar 

  32. Shang L, Lu Z, Li H. 2015. Neural responding machine for short-text conversation. Computer Science.

  33. Skowron M. Affect listeners: acquisition of affective states by means of conversational systems. Development of multimodal interfaces: active listening and synchrony, second COST 2102 international training school, Dublin, Ireland, March 23-27, 2009, revised selected papers; 2010. p. 169–181.

  34. Sun R, Wilson N, Lynch M. Emotion: a unified mechanistic interpretation from a cognitive architecture. Cogn Comput 2016;8(1):1–14. https://doi.org/10.1007/s12559-015-9374-4.

    Article  CAS  Google Scholar 

  35. Sundermeyer M, Schlüter R, Ney H. LSTM neural networks for language modeling. Interspeech; 2012. p. 601–608.

  36. Sutskever I, Vinyals O, Le Q. Sequence to sequence learning with neural networks. Adv Neural Inf Proces Syst 2014;4:3104– 3112.

    Google Scholar 

  37. Takahashi T, Mera K, Kurosawa Y, Takezawa T. Natural language dialog system considering speaker’s emotion for open-ended conversation. J Acoust Soc Am 2016;140(4):3400–3400.

    Article  Google Scholar 

  38. Treur J. On human aspects in ambient intelligence. Berlin: Springer; 2008.

    Book  Google Scholar 

  39. Vinyals O, Le Q. 2015. A neural conversational model. Computer Science.

  40. Wen TH, Vandyke D, Mrksic N, Gasic M, Rojasbarahona LM, Su PH, Ultes S, Young S. 2016. A network-based end-to-end trainable task-oriented dialogue system.

  41. Xing C, Wu W, Wu Y, Liu J, Huang Y, Zhou M, Ma WY. 2016. Topic aware neural response generation.

  42. Xiong K, Cui A, Zhang Z, Li M. 2016. Neural contextual conversation learning with labeled question-answering pairs.

  43. Yan R, Song Y, Wu H. Learning to respond with deep neural networks for retrieval-based human-computer conversation system. International ACM SIGIR conference on research and development in information retrieval; 2016. p. 55–64.

  44. Zhou H, Huang M, Zhang T, Zhu X, Liu B. 2017. Emotional chatting machine: emotional conversation generation with internal and external memory.

Download references

Funding

This work is supported by the Natural Science Foundation of Anhui Province (1508085QF119) and the State Key Program of the National Natural Science of China (61432004, 71571058, 61461045). This work was partially supported by the China Postdoctoral Science Foundation funded project (2015M580532, 2017T100447). This research has been partially supported by the National Natural Science Foundation of China under Grant No.61472117.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Xiao Sun.

Ethics declarations

Conflict of Interests

The authors declare that they have no conflict of interest.

Ethical Approval

This article does not contain any studies that used human participants or animals.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Sun, X., Peng, X. & Ding, S. Emotional Human-Machine Conversation Generation Based on Long Short-Term Memory. Cogn Comput 10, 389–397 (2018). https://doi.org/10.1007/s12559-017-9539-4

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12559-017-9539-4

Keywords

Navigation