Chinese Address Similarity Calculation Based on Auto Geological Level Tagging

Liu, Jing; Wang, Jianbin; Zhang, Changqing; Yang, Xiubo; Deng, Jianbo; Zhu, Ruihe; Nan, Xiaojie; Chen, Qinghua

doi:10.1007/978-3-030-22808-8_42

Part of the book series: Lecture Notes in Computer Science (LNTCS, volume 11555)

Included in the following conference series:

International Symposium on Neural Networks

Abstract

Quickly measuring the similarity of addresses has become an urgent need in many fields, including financial anti-fraud. Traditional string-based similarity methods fall short on this task. Taking the hierarchical nature of addresses into account, we constructed a framework for calculating the similarity of Chinese addresses. First, each whole address string is split into substrings and annotated with the proper level by an LM-LSTM-CRF model; then substring-level similarities are calculated. Finally, the similarity scores are combined by a BP neural network. This framework has achieved good results in practice on financial anti-fraud tasks.
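The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the tagging step has already produced a mapping from level to substring, the level names in `LEVELS` are hypothetical (the abstract does not list the tag set), normalized edit distance is only one possible substring similarity, and a fixed weighted average stands in for the BP neural network that the paper trains to combine the scores.

```python
from typing import Dict, List, Sequence

# Hypothetical level names; the paper's actual tag set is not given in the abstract.
LEVELS: List[str] = ["province", "city", "district", "road", "number"]


def levenshtein(a: str, b: str) -> int:
    """Edit distance computed with the standard dynamic-programming recurrence."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1, curr[j - 1] + 1, prev[j - 1] + cost))
        prev = curr
    return prev[-1]


def level_similarity(a: str, b: str) -> float:
    """Similarity in [0, 1]; two empty segments are treated as identical (an assumption)."""
    longest = max(len(a), len(b))
    if longest == 0:
        return 1.0
    return 1.0 - levenshtein(a, b) / longest


def level_scores(x: Dict[str, str], y: Dict[str, str]) -> List[float]:
    """One similarity score per level, in the order given by LEVELS."""
    return [level_similarity(x.get(lv, ""), y.get(lv, "")) for lv in LEVELS]


def combine(scores: Sequence[float], weights: Sequence[float]) -> float:
    """Weighted average; a stand-in for the learned BP-network combination."""
    return sum(s * w for s, w in zip(scores, weights)) / sum(weights)


# Two already-tagged addresses that differ only in the house number.
addr_a = {"province": "北京市", "city": "北京市", "district": "海淀区",
          "road": "双清路", "number": "1号"}
addr_b = {"province": "北京市", "city": "北京市", "district": "海淀区",
          "road": "双清路", "number": "3号"}

scores = level_scores(addr_a, addr_b)
overall = combine(scores, [1.0] * len(LEVELS))
```

Scoring each level separately lets a mismatch at a coarse level (for example, the city) be weighted differently from a mismatch at a fine level (for example, the house number), which a single whole-string distance cannot express.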

This work is supported by a joint project of Beijing Normal University and Credit Harmony Research, and in part by the National Natural Science Foundation of China under grant 71701018. Jing Liu, Jianbin Wang and Changqing Zhang contributed equally to this work.

Author information

Authors and Affiliations

School of Systems Science, Beijing Normal University, Beijing, 100875, China
Jing Liu & Qinghua Chen
Credit Harmony Research, Building 3 District 3 Hanwei International, Beijing, 100071, China
Jianbin Wang, Changqing Zhang, Xiubo Yang & Xiaojie Nan
Swarma Club, 4059 Building 1, 1 Shuangqing Road, Beijing, 100085, China
Jianbo Deng & Ruihe Zhu

Corresponding authors

Correspondence to Jianbin Wang or Qinghua Chen.

Editor information

Editors and Affiliations

Dalian University of Technology, Dalian, China
Huchuan Lu
Sichuan University, Chengdu, China
Huajin Tang
Northeastern University, Shenyang, China
Zhanshan Wang

About this paper

Cite this paper

Liu, J. et al. (2019). Chinese Address Similarity Calculation Based on Auto Geological Level Tagging. In: Lu, H., Tang, H., Wang, Z. (eds) Advances in Neural Networks – ISNN 2019. ISNN 2019. Lecture Notes in Computer Science, vol 11555. Springer, Cham. https://doi.org/10.1007/978-3-030-22808-8_42

DOI: https://doi.org/10.1007/978-3-030-22808-8_42
Published: 26 June 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-22807-1
Online ISBN: 978-3-030-22808-8
eBook Packages: Computer Science, Computer Science (R0)
