Skip to main content
Log in

Development of a computer-assisted Japanese functional expression learning system for Chinese-speaking learners

  • Cultural and Regional Perspectives
  • Published:
Educational Technology Research and Development Aims and scope Submit manuscript

Abstract

Because a large number of Chinese characters are commonly used in both Japanese and Chinese, Chinese-speaking learners of Japanese as a second language (JSL) find it more challenging to learn Japanese functional expressions than to learn other Japanese vocabulary. To address this challenge, we have developed Jastudy, a computer-assisted language learning (CALL) system designed specifically for Chinese-speaking learners studying Japanese functional expressions. Given a Japanese sentence as an input, the system automatically detects Japanese functional expressions using a character-based bidirectional long short-term memory with a conditional random field (BiLSTM-CRF) model. The sentence is then segmented and the parts of speech (POS) are tagged (word segmentation and POS tagging) by a Japanese morphological analyzer, MeCab (http://taku910.github.io/mecab/), trained using a CRF model. In addition, the system provides JSL learners with appropriate example sentences that illustrate Japanese functional expressions. The system uses a ranking system, which gives easier sentences a higher rank, when selecting example sentences. A support vector machine for ranking (SVMRank) algorithm estimates the readability of example sentences, using Japanese-Chinese common words as an important feature. A k-means clustering algorithm is used to cluster example sentences that contain functional expressions with the same meanings, based on part-of-speech, conjugation form, and semantic attributes. Finally, to evaluate the usefulness of the system, we have conducted experiments and reported on a preliminary user study involving Chinese-speaking JSL learners.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Notes

  1. https://www.cs.cornell.edu/people/tj/svm_light/svm_rank.html.

  2. http://scikitlearn.org/stable/modules/clustering.html#clustering.

  3. https://jreadability.net/ja/.

  4. http://language.tiu.ac.jp/.

  5. https://hinoki-project.org/asunaro/.

  6. http://www.rikai.com/perl/Home.pl.

  7. http://nihongo.monash.edu/cgi-bin/wwwjdic?9T.

  8. https://tatoeba.org/eng/.

  9. http://www.hiraganatimes.com/.

  10. http://pj.ninjal.ac.jp/corpus_center/bccwj/en/.

  11. http://cl.naist.jp/nldata/lang-8/.

  12. https://sourceforge.net/projects/mecab/files/mecab-ipadic/2.7.0-20070801/mecab-ipadic-2.7.0

  13. http://osdn.net/project/unidic/.

  14. https://ja.wiktionary.org/wiki/メインページ.

  15. http://cjjc.weblio.jp.

  16. https://baike.baidu.com.

  17. https://github.com/Determined22/zh-NER-TF.

  18. https://taku910.github.io/crfpp/.

  19. Here, N0 implies unknown words in the JLPT vocabulary list.

  20. http://www.tanos.co.uk/jlpt/.

  21. http://japanesetest4you.com.

References

  • Chall, J. S., & Dale, E. (1995). Readability revisited: The new Dale-Chall readability formula. Brookline, MA: Brookline Books.

    Google Scholar 

  • Flesch, F. (1948). A new readability yardstick. Journal of Applied Psychology, 32(3), 221–233.

    Article  Google Scholar 

  • Group Jamashi, & Xu, Y. (2001). Chubunban Nihongo Kukei Jiten-Nihongo Bunkei Jiten (in Chinese and Japanese). Tokyo: Kuroshio Publishers.

    Google Scholar 

  • Han, D., & Song, X. (2011). Japanese sentence pattern learning with the use of illustrative examples extracted from the web. IEEJ Transactions on Electrical and Electronic Engineering, 6(5), 490–496.

    Article  Google Scholar 

  • Hasebe, Y. & Lee, J. (2015). Introducing a Readability Evaluation System for Japanese Language Education. In Proceedings of the 6th international conference on computer assisted systems for teaching & learning Japanese, pp. 19–22.

  • Hazelbeck, G., & Saito, H. (2009). A corpus-based E-learning System for Japanese vocabulary. Journal of Natural Language Processing, 16(4), 3–27.

    Article  Google Scholar 

  • Hyodo, Y., Murakami, Y. & Ikeda, T. (2000). A dictionary of long-unit functional words for analyzing bunsetsu. In Proceedings of the 6th annual meeting of the association for natural language processing, pp. 407–410.

  • Joachims, T. (2006). Training Linear SVMs in Linear Time. In Proceedings of the ACM conference on knowledge discovery and data mining (KDD), pp. 217–226.

  • Kincaid, JP., Fishburne, RP., Rogers, RL. & Chissom, BS. (1975). Derivation of new readability formulas (Automated Readability Index, Fog Count and Flesch Reading Ease Formula) for Navy-enlisted personnel. Research Branch Report 875. Millington, TN: Naval Technical Training, U.S. Naval Air Station, Memphis, TN.

  • Kudo, T., Yamamoto, K. & Matsumoto, Y. (2004). Applying Conditional Random Fields to Japanese Morphological Analysis. In Proceedings of the 2004 conference on empirical methods in natural language processing, pp. 230–237.

  • Liu, J. & Matsumoto, Y. (2016). Simplification of Example Sentences for Learners of Japanese Functional Expressions. In Proceedings of the 3rd workshop on natural language processing techniques for educational applications, pp. 1–5.

  • Matsuyoshi, S., Sato, S. & Utsuro, T. (2006). Compilation of a dictionary of Japanese functional expressions with hierarchical organization. In Proceedings of international conference on computer processing of oriental languages (ICCPOL), LNAI: vol. 4285, pp. 395–402.

  • National Institute for Japanese Language and Linguistics. (2004). Bunrui goihyo zouhokaiteiban (in Japanese). Tokyo: Dainihontosho Press.

    Google Scholar 

  • Ohno, T., Edani, Z., Inoue, A. & Han, D. (2013). A Japanese Learning Support System Matching Individual Abilities. In Proceedings of the PACLIC 27 workshop on computer-assisted language learning, pp. 556–562.

  • Omura, M. & Matsumoto, Y. (2014). Second Language Learner Composition Error Correction. In Proceedings of the 20th annual conference of the society of language processing, pp. 242–245.

  • Pereira, L., Manguilimotan, E., & Matsumoto, Y. (2016). Leveraging a large learner corpus for automatic suggestion of collocations for learners of Japanese as a second language. CALICO Journal, 33(3), 311–333.

    Google Scholar 

  • Sato, S., Matsuyoshi, S., & Kondoh, Y. (2008). Automatic Assessment of Japanese Text Readability Based on a Textbook Corpus. In Proceedings of the sixth international language resources and evaluation, pp. 654–660.

  • Shibasaki, H. & Sawai, Y. (2007). Study for constructing a readability formula of Japanese texts using a corpus of language school textbooks. IEICE Technical Report NCL2007-vol. 32, pp. 19–24.

  • Shibasaki, H., & Tamaoka, K. (2010). Constructing a formula to predict school grades 1–9 based on Japanese language school textbooks. Japan Journal of Educational Technology, 33(4), 449–458.

    Google Scholar 

  • Shime, T., Tsuchiya, M., Matsuyoshi, S., Utsuro, T., & Sato, S. (2007). Automatic detection of Japanese compound functional expressions and its application to statistical dependency analysis. Journal of Natural Language Processing, 14(5), 167–196.

    Article  Google Scholar 

  • Shinmura, I. (Ed.). (1998). Kojien fifth Edition (in Japanese). Tokyo: Iwanani Press.

    Google Scholar 

  • Shudo, K., Tanabe, T., Takahashi, M. & Yoshimura, K. (2004). MWEs as non-propositional content indicators. In Proceedings of the 2nd ACL workshop on multiword expressions: Integrating processing (MWE-2004), pp. 32–39.

  • Suzuki, T., Abe, Y, Toyota, Y, Utsuro, T., Matsuyoshi, S. & Tsuchiya, M. (2012). Detecting Japanese Compound Functional Expressions using Canonical/Derivational Relation. In Proceedings of the eighth international conference on language resources and evaluation (LREC-2012).

  • Tateishi, Y., Ono, Y. & Yamada, H. (1988a). A computer readability formula of Japanese texts for machine scoring. In Proceedings of the 12th conference on computational linguistics, vol. 2, pp. 649–654.

  • Tateishi, Y., Ono, Y. & Yamada, H. (1988b). Derivation of readability formula of Japanese texts. IPSJSIG Note 88-DPHI-18-4, Information Processing Society of Japan.

  • Tolmachev, A. & Kurohashi, S. (2017). Automatic Extraction of High-Quality Example Sentences for Word Learning Using a Determinantal Point Process. In Proceedings of the 12th workshop on innovative use of NLP for building educational applications, pp. 133–142.

  • Tomomastu, E., Miyamoto, J., & Wakuki, M. (2016). Japanese expressions dictionary. Tokyo: ALC Press Inc.

    Google Scholar 

  • Toyoda, E. (2016). Evaluation of computerized reading-assistance systems for reading Japanese texts—from a linguistic point of view. Australasian Journal of Educational Technology, 32(5), 94–97.

    Google Scholar 

  • Tsuchiya, M., Shime, T., Takagi, T., Utsuro, T., Uchimoto, K., Matsuyoshi, S, Sato, S. & Nakagawa, S. (2006). Chunking Japanese compound functional expressions by machine learning. In Proceedings of the workshop on multi-word-expressions in a multilingual context, pp. 25–32.

  • Wang, S. (2001). A comparative study of vocabulary in Chinese and Japanese. Chengdu: Sichuan Literature and Art Publishing House.

    Google Scholar 

  • Wang, S. & Andersen, E. (2016). Grammatical Templates: Improving Text Difficulty Evaluation for Language Learners. In Proceedings of the 26th international conference on computational linguistics: technical papers (COLING 2016), pp. 1692–1702.

  • Wang, L., & Peng, X. (2010a). A “Must have” reference for the New JLPT N2 vocabulary. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

  • Wang, L., & Peng, X. (2010b). A “Must have” reference for the New JLPT N1 vocabulary. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

  • Wang, L., & Peng, X. (2011a). A “Must have” reference for the New JLPT N4-N5 vocabulary. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

  • Wang, L., & Peng, X. (2011b). A “Must have” reference for the New JLPT N3 vocabulary. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

  • Xu, X., & Reika, (2013a). Detailed introduction to the New JLPT N1–N5 grammar. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

  • Xu, X., & Reika, (2013b). Detailed introduction to the New JLPT N1–N5 vocabulary. Shanghai: East China University of Science and Technology Press.

    Google Scholar 

Download references

Acknowledgements

This study was supported by the China Scholarship Council (CSC) and the Japan Ministry of Education, Culture, Sports, Science and Technology (MEXT). The authors would like to thank the editor and the reviewers for their valuable comments and suggestions.

Funding

This study has not received funding.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Jun Liu.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Liu, J., Shindo, H. & Matsumoto, Y. Development of a computer-assisted Japanese functional expression learning system for Chinese-speaking learners. Education Tech Research Dev 67, 1307–1331 (2019). https://doi.org/10.1007/s11423-019-09669-0

Download citation

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11423-019-09669-0

Keywords

Navigation