Skip to main content

Detecting Drift from Event Streams of Unpredictable Business Processes

  • Conference paper
  • First Online:
Conceptual Modeling (ER 2016)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 9974))

Included in the following conference series:

Abstract

Existing business process drift detection methods do not work with event streams. As such, they are designed to detect inter-trace drifts only, i.e. drifts that occur between complete process executions (traces), as recorded in event logs. However, process drift may also occur during the execution of a process, and may impact ongoing executions. Existing methods either do not detect such intra-trace drifts, or detect them with a long delay. Moreover, they do not perform well with unpredictable processes, i.e. processes whose logs exhibit a high number of distinct executions to the total number of executions. We address these two issues by proposing a fully automated and scalable method for online detection of process drift from event streams. We perform statistical tests over distributions of behavioral relations between events, as observed in two adjacent windows of adaptive size, sliding along with the stream. An extensive evaluation on synthetic and real-life logs shows that our method is fast and accurate in the detection of typical change patterns, and performs significantly better than the state of the art.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The G-test is a non-parametric hypothesis statistical test which assumes no a-priori knowledge of the statistical distributions. The G-test is a better approximation to the theoretical chi-squared distribution than the chi-squared test [12].

  2. 2.

    The typical value of the threshold, i.e. significance level, for the G-test is 0.05 [13].

  3. 3.

    Available at http://apromore.org/platform/tools.

  4. 4.

    http://cpntools.org.

  5. 5.

    All the CPN models used for this simulation, the resulting synthetic logs, and the detailed evaluation results are available with the software distribution.

  6. 6.

    http://dx.doi.org/10.4121/uuid:d9769f3d-0ab0-4fb8-803b-0d1120ffcf54.

  7. 7.

    In streaming settings, online noise filters such as the Kalman filter [16] could be used instead.

References

  1. Carmona, J., Gavaldà, R.: Online techniques for dealing with concept drift in process mining. In: Hollmén, J., Klawonn, F., Tucker, A. (eds.) IDA 2012. LNCS, vol. 7619, pp. 90–102. Springer, Heidelberg (2012). doi:10.1007/978-3-642-34156-4_10

    Chapter  Google Scholar 

  2. Accorsi, R., Stocker, T.: Discovering workflow changes with time-based trace clustering. In: Aberer, K., Damiani, E., Dillon, T. (eds.) SIMPDA 2011. LNBIP, vol. 116, pp. 154–168. Springer, Heidelberg (2012)

    Chapter  Google Scholar 

  3. Bose, R.P.J.C., van der Aalst, W.M.P., Zliobaite, I., Pechenizkiy, M.: Dealing with concept drifts in process mining. IEEE Trans. NNLS 25(1), 154–171 (2014)

    Google Scholar 

  4. Martjushev, J., Bose, R.P.J.C., Aalst, W.M.P.: Change point detection and dealing with gradual and multi-order dynamics in process mining. In: Matulevičius, R., Dumas, M. (eds.) BIR 2015. LNBIP, vol. 229, pp. 161–178. Springer, Heidelberg (2015). doi:10.1007/978-3-319-21915-8_11

    Chapter  Google Scholar 

  5. Maaradji, A., Dumas, M., Rosa, M., Ostovar, A.: Fast and accurate business process drift detection. In: Motahari-Nezhad, H.R., Recker, J., Weidlich, M. (eds.) BPM 2015. LNCS, vol. 9253, pp. 406–422. Springer, Heidelberg (2015). doi:10.1007/978-3-319-23063-4_27

    Chapter  Google Scholar 

  6. Weber, B., Reichert, M., Rinderle-Ma, S.: Change patterns and change support features-enhancing flexibility in process-aware information systems. DKE 66(3), 438–466 (2008)

    Article  Google Scholar 

  7. Burattin, A., Cimitile, M., Maggi, F.M., Sperduti, A.: Online discovery of declarative process models from event streams. IEEE Trans. Serv. Comput. 8, 833–846 (2015)

    Article  Google Scholar 

  8. Burattin, A., Sperduti, A., van der Aalst, W.M.P.: Control-flow discovery from event streams. In: IEEE Congress on Evolutionary Computation (CEC), pp. 2420–2427. IEEE (2014)

    Google Scholar 

  9. Gama, J., Žliobaitė, I., Bifet, A., Pechenizkiy, M., Bouchachia, A.: A survey on concept drift adaptation. ACM Comput. Surv. (CSUR) 46(4), 1–37 (2014)

    Article  MATH  Google Scholar 

  10. Leemans, S.J.J., Fahland, D., Aalst, W.M.P.: Discovering block-structured process models from event logs - a constructive approach. In: Colom, J.-M., Desel, J. (eds.) PETRI NETS 2013. LNCS, vol. 7927, pp. 311–329. Springer, Heidelberg (2013). doi:10.1007/978-3-642-38697-8_17

    Chapter  Google Scholar 

  11. de Medeiros, A.A., van Dongen, B.F., Van der Aalst, W.M.P., Weijters, A.: Process mining: extending the \(\alpha \)-algorithm to mine short loops. Technical report, BETA Working Paper Series, WP 113, Eindhoven University of Technology, Eindhoven (2004)

    Google Scholar 

  12. Harremoës, P., Tusnády, G.: Information divergence is more \(\chi ^{2} \)-distributed than the \(\chi ^{2} \)-statistics. In: IEEE ISIT, pp. 533–537 (2012)

    Google Scholar 

  13. Nuzzo, R.: Statistical errors. Nature 506(13), 150–152 (2014)

    Article  Google Scholar 

  14. Ho, S.S.: A martingale framework for concept change detection in time-varying data streams. In: Proceedings of ICML, pp. 321–327. ACM (2005)

    Google Scholar 

  15. Conforti, R., La Rosa, M., ter Hofstede, A.H.: Noise filtering of process execution logs based on outliers detection (2015)

    Google Scholar 

  16. Bifet, A., Gavaldà, R.: Kalman filters and adaptive windows for learning in data streams. In: Todorovski, L., Lavrač, N., Jantke, K.P. (eds.) DS 2006. LNCS (LNAI), vol. 4265, pp. 29–40. Springer, Heidelberg (2006). doi:10.1007/11893318_7

    Chapter  Google Scholar 

  17. Beest, N.R.T.P., Dumas, M., García-Bañuelos, L., Rosa, M.: Log delta analysis: interpretable differencing of business process event logs. In: Motahari-Nezhad, H.R., Recker, J., Weidlich, M. (eds.) BPM 2015. LNCS, vol. 9253, pp. 386–405. Springer, Heidelberg (2015). doi:10.1007/978-3-319-23063-4_26

    Chapter  Google Scholar 

  18. Pika, A., Wynn, M.T., Fidge, C.J., Hofstede, A.H.M., Leyer, M., Aalst, W.M.P.: An extensible framework for analysing resource behaviour using event logs. In: Jarke, M., Mylopoulos, J., Quix, C., Rolland, C., Manolopoulos, Y., Mouratidis, H., Horkoff, J. (eds.) CAiSE 2014. LNCS, vol. 8484, pp. 564–579. Springer, Heidelberg (2014). doi:10.1007/978-3-319-07881-6_38

    Google Scholar 

Download references

Acknowledgments

This research is partly funded by the Australian Research Council (grant DP150103356).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Alireza Ostovar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2016 Springer International Publishing AG

About this paper

Cite this paper

Ostovar, A., Maaradji, A., La Rosa, M., ter Hofstede, A.H.M., van Dongen, B.F.V. (2016). Detecting Drift from Event Streams of Unpredictable Business Processes. In: Comyn-Wattiau, I., Tanaka, K., Song, IY., Yamamoto, S., Saeki, M. (eds) Conceptual Modeling. ER 2016. Lecture Notes in Computer Science(), vol 9974. Springer, Cham. https://doi.org/10.1007/978-3-319-46397-1_26

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-46397-1_26

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-46396-4

  • Online ISBN: 978-3-319-46397-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics