Abstract
Unmarked event data is increasingly popular in temporal modeling, containing only the timestamp of each event occurrence without specifying the class or description of the events. A sequence of event is usually modeled as the realization from a latent intensity series. When the intensity varies, the events follow the Non-Homogeneous Poisson Process (NHPP). To analyze a sequence of such kind of events, an important task is to measure the similarity between two sequences based on their intensities. To avoid the difficulties of estimating the latent intensities, we measure the similarity using timestamps by Dynamic Time Warping (DTW), which can also resolve the issue that observations between two sequences are not aligned in time. Furthermore, real event data always has superposed noise, e.g. when comparing the purchase behaviour of two customers, we can be mislead if one customer visits market more often because of some occasional shopping events. We shall recover the DTW distance between two noise-superposed NHPP sequences to evaluate the similarity between them. We proposed two strategies, which are removing noise events on all possibilities before calculating the DTW distance, and integrating the noise removal into the DTW calculation in dynamic programming. We compare empirical performance of all the methods and quantitatively show that the proposed methods can recover the DTW distance effectively and efficiently.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
References
Bacry, E., Muzy, J.F.: First-and second-order statistics characterization of Hawkes processes and non-parametric estimation. IEEE Trans. Inf. Theory 62(4), 2184–2202 (2016)
Bokhari, S., Geltner, D., van de Minne, A.: A Bayesian structural time series approach to constructing rent indexes: an application to Indian office markets (2017)
Chen, Y., et al.: The UCR time series classification archive. www.cs.ucr.edu/~eamonn/time_series_data (2015)
Del Giudice, V., De Paola, P., Forte, F., Manganelli, B.: Real estate appraisals with bayesian approach and Markov chain hybrid Monte Carlo method: an application to a central urban area of Naples. Sustainability 9(11), 2138 (2017)
Du, N., Farajtabar, M., Ahmed, A., Smola, A.J., Song, L.: Dirichlet-Hawkes processes with applications to clustering continuous-time document streams. In: Proceedings of the 21th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 219–228. ACM (2015)
Ganeshapillai, G., Guttag, J., Lo, A.: Learning connections in financial time series. In: International Conference on Machine Learning, pp. 109–117 (2013)
Ghitany, M., Al-Awadhi, F., Alkhalfan, L.: Marshall-Olkin extended Lomax distribution and its application to censored data. Commun. Stat.-Theory Methods 36(10), 1855–1866 (2007)
Guan, X., Huang, C., Liu, G., Meng, X., Liu, Q.: Mapping rice cropping systems in Vietnam using an NDVI-based time-series similarity measurement based on DTW distance. Remote Sens. 8(1), 19 (2016)
Lin, P., Zhang, B., Guo, T., Wang, Y., Chen, F.: Interaction point processes via infinite branching model. In: AAAI, pp. 1853–1859 (2016)
Nakagawa, S., Nakanishi, H.: Speaker-independent English consonant and Japanese word recognition by a stochastic dynamic time warping method. IETE J. Res. 34(1), 87–95 (1988)
Nasrabadi, N.M.: Pattern recognition and machine learning. J. Electron. Imaging 16(4), 049901 (2007)
Rakthanmanon, T., et al.: Searching and mining trillions of time series subsequences under dynamic time warping. In: Proceedings of the 18th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pp. 262–270. ACM (2012)
Sakoe, H., Chiba, S.: Dynamic programming algorithm optimization for spoken word recognition. IEEE Trans. Acoust. Speech Sig. Process. 26(1), 43–49 (1978)
Wang, X., Mueen, A., Ding, H., Trajcevski, G., Scheuermann, P., Keogh, E.: Experimental comparison of representation methods and distance measures for time series data. Data Min. Knowl. Discov. 26(2), 275–309 (2013)
Wu, J.: Reliability analysis for small wind turbines using Bayesian hierarchical modelling (2017)
Xu, H., Luo, D., Zha, H.: Learning Hawkes processes from short doubly-censored event sequences. arXiv preprint arXiv:1702.07013 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Chang, Y. et al. (2019). Recovering DTW Distance Between Noise Superposed NHPP. In: Yang, Q., Zhou, ZH., Gong, Z., Zhang, ML., Huang, SJ. (eds) Advances in Knowledge Discovery and Data Mining. PAKDD 2019. Lecture Notes in Computer Science(), vol 11440. Springer, Cham. https://doi.org/10.1007/978-3-030-16145-3_18
Download citation
DOI: https://doi.org/10.1007/978-3-030-16145-3_18
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-16144-6
Online ISBN: 978-3-030-16145-3
eBook Packages: Computer ScienceComputer Science (R0)