1 Introduction

Considering the intense usage of business process modeling in all types of business contexts, the relevance of process models has become obvious. However, actual process models display a wide range of problems [20] falling into the quality dimensions of syntactic, semantic, and pragmatic quality of a model [17]. Syntactic and semantic quality relate to model construction and address the correct use of the modeling language and the extent to which the model truthfully represents the real-world behavior, respectively. Pragmatic quality addresses the extent to which a model supports its usage for purposes such as understanding behavior and system development. Considering process models whose purpose is to develop an understanding of real-world behavior, pragmatic quality is typically related to the understandability of the model [15]. Clearly, an in-depth understanding of the factors influencing the various quality dimensions of process models is in demand.

Most research in this area puts a strong emphasis on the product or outcome of the process modeling act (e.g., [10, 48]). For this category of research, the resulting model is the object of analysis. The objective, for example, is to determine how structural characteristics of the model relate to its pragmatic quality. Instead of dealing with the quality of individual models, many other works focus on the characteristics of modeling languages (e.g., [26, 42]). Recently, researchers have begun to explore another dimension that presumably affects the quality of business process models by incorporating the process of creating a process model into their investigations (e.g., [33, 37, 43]). In particular, the focus has been put on the formalization phase, in which a process modeler faces the challenge of constructing a syntactically correct model reflecting a given domain description (cf. [14]). Our research can be positioned within the latter stream of research.

Earlier works observed the existence of genuinely different process modeling styles [37]. Moreover, it has been shown that certain characteristics of how a modeler creates a model correlate with the quality of the created model [6]. What has not taken place is a systematic investigation of which distinct modeling styles can be observed in reality, what characterizes these modeling styles, and which factors influence whether a particular modeling style is followed. Answers to these questions form a prerequisite to a systematic understanding of how modeling influences model quality and how it can be improved, for instance, by providing adequate modeling environments and by addressing quality concerns when teaching how to model.

This paper identifies distinct modeling styles, together with the factors that presumably influence which particular modeling style is followed. In an explorative study, we conducted modeling sessions with 115 students solving two different modeling tasks. We recorded each modeler’s interactions with a modeling tool that captures all details of how the actual modeling was done. We then applied data mining techniques to identify different modeling styles; a cluster analysis suggests the existence of three modeling styles. The modeling styles were subsequently analyzed using a series of measures for quantifying the process of process modeling (PPM) to validate differences between the three groups and between different tasks.

Our main findings are that three modeling styles can be distinguished in terms of a few simple measures. With these measures, we can characterize (1) modeling with high efficiency, (2) modeling that emphasizes a good layout of the model but proceeds less efficiently, and (3) modeling that is neither very efficient nor very focused on layouting. We found that modelers may change their modeling style subject to modeler- and task-specific characteristics. As modeler-specific characteristics, we could identify modeling speed, the time needed to develop an understanding of the modeling task, and the inherent desire to invest in a good layout of the model. We observed that repairing mistakes introduced during modeling is a separate issue that correlates with the perceived complexity of the modeling task. Also, we found that modelers who invest in a good layout persist in this intent even when they perceive the modeling task as difficult.

This paper extends the results of [34] in several ways. Most notably, we have reproduced the results of [34] in a new modeling task, thus confirming the existence of three genuinely distinct modeling styles. Further, we develop more refined measures to describe the modeling styles and factors that influence modeling styles. We have aggregated these into a first model explaining process modeling styles and their influence factors.

The remainder of this paper is organized as follows. Section 2 presents related work. Section 3 presents the PPM and how it can be measured. Section 4 develops the setup of our exploratory study based on insights into the PPM gained in earlier studies. The execution of the study is presented in Sect. 5. In Sect. 6, we describe insights into modeling styles gained by data mining; these insights are used in Sect. 7 to develop a number of hypotheses on influence factors on modeling styles. We test the hypotheses in Sect. 8 and compile the results into a model of process modeling styles and their influence factors in Sect. 9. In this section, we also discuss limitations. We conclude and discuss future work in Sect. 10.

2 Related work

Our work is essentially related to model quality frameworks and process model quality (cf. Sect. 2.1), research into the process of modeling (cf. Sect. 2.2), and the process of programming (cf. Sect. 2.3).

2.1 Quality frameworks and process model quality

Different frameworks and guidelines have been developed that define quality aspects in the context of process models. The SEQUAL framework uses semiotic theory for identifying dimensions of process model quality [15], including semantic, syntactic, pragmatic, and other types of issues. The Guidelines of Modeling (GoM) also elaborate on quality considerations for process models [2] and prescribe principles such as correctness and clarity that should be considered during model creation. The ‘Seven Process Modeling Guidelines’ (7PMG) comprise a set of actions a process modeler may want to undertake to avoid issues with respect to the understandability of a process model and its logical correctness [22]. The 7PMG accumulate the insights from various empirical studies on the quality of process models [23, 25]. Other studies have proposed, applied, and validated alternative, yet similar metrics to assess the quality of the model artifact itself, e.g., [1, 5, 10, 40]. In addition, pragmatic quality, i.e., understandability, has been investigated based on insights from cognitive psychology, e.g., [51, 53, 54].

All of the mentioned works have in common that they start from an analysis or reflection on the quality of the model itself. Through the focus on both desirable and actual properties of the process model, prescriptive measures for the process modeler are derived. In our work, we aim to extend this perspective by including the viewpoint of the modeling act itself, i.e., the PPM. The idea is that by understanding the PPM, it will become possible to develop insights into why process models lack the desired level of quality.

2.2 Process of modeling

Research into the process of modeling typically focuses on the interaction between different parties. In a classical setting, a system analyst interacts with a domain expert through a structured discussion, covering the stages of elicitation, modeling, verification, and validation [8, 14]. The procedure of developing process models in a team is analyzed in [39] and characterized as a negotiation process. Interpretation tasks and classification tasks are identified on the semantic level of modeling. Participative modeling is discussed in [44].

These works build on the observation of modeling practice and distill normative procedures for steering the process of modeling toward a good completion. The focus is on the effective interaction between the involved stakeholders. Our work is complementary to this perspective through its focus on the formalization part of the modeling process. In other words, we are interested in the modeler’s interactions with the modeling environment when creating the formal business process model.

2.3 Process of programming

A stream of research related to the PPM is conducted in the realm of understanding the process of computer programming, e.g., [4, 11, 19, 46]. The development of a program can be considered a problem-solving task with an external representation, i.e., the source code, being a central artifact of the process [3]. Also, the process of software design can be seen as highly iterative, interleaved, and loosely ordered [12]. Researchers have identified three phases of comprehension, decomposition, and solution specification in this process [3, 11, 46].

These works support the idea that an insight into the PPM is valuable. We adopt the notion of process modeling as a problem-solving task in which an artifact, i.e., the process model, is created. Indeed, we have already observed phases similar to the ones in the programming process [37]. At the same time, it is still relevant to study the specific act of process modeling, instead of relying on existing insights from the area of programming. After all, writing a program in textual form and developing a process model using a graphical notation are different matters. In addition, process models—especially when they serve as a means for communication—should be understood not only by developers, as is the case in programming, but also by various stakeholders with varying backgrounds.

3 Backgrounds

We aim at establishing the existence of different styles in creating a process model and investigating the factors that influence the selection of a style. This section describes the necessary backgrounds in terms of cognitive foundations of the PPM (cf. Sect. 3.1) as well as its phases (cf. Sect. 3.2). Moreover, it explains how the PPM can be captured (cf. Sect. 3.3) and quantified using a series of measures (cf. Sect. 3.4).

3.1 Cognitive foundations of the process of process modeling

When creating a process model, the human brain as a “truly generic problem solver” [47] comes into play. Three different problem-solving “programs” or “processes” are known from cognitive psychology: search, recognition, and inference [16]. Search and recognition identify information of rather low complexity, e.g., locating an object or recognizing a pattern. Most conceptual models go well beyond the complexity that can be handled by search and recognition and require “true” problem solving in terms of inference. Cognitive psychology differentiates between working memory, which contains information that is currently being processed, and long-term memory, in which information is stored for a long period of time [31]. Most severe, and thus of high relevance, are the limitations of the working memory. As reported in [24], the working memory cannot hold more than \(7 \pm 2\) items, referred to as chunks, at the same time. Due to these limits, problem-solving tasks are typically not solved as a whole, but rather broken down into smaller parts and addressed chunk-wise. How problem-solving tasks are addressed, thus, depends on the problem-solving capacity of the problem solver.

By suitable organization of information, the span of working memory can be increased [9]. For example, when asked to repeat the sequence “U N O C B S N F L”, most people miss a character or two as the number of characters exceeds the working memory’s span. However, people familiar with acronyms might recognize and remember the sequence “UNO CBS NFL”, effectively reducing the working memory’s load from nine to three “chunks” [7, 9, 28]. As modeling is related to problem solving [7], modelers with a better understanding of the modeling tool, the notation, or a superior ability to extract information from requirements can utilize their working memory more efficiently when creating process models [41].

Moreover, the problem-solving task itself also influences the development of the solution (cf. Cognitive Load Theory [45]). This influence is described as cognitive load for the person solving the task. The cognitive load of a task is determined by its intrinsic load, i.e., the inherent difficulty associated with the problem-solving task, and its extraneous load, i.e., the load generated by the manner in which the task is presented [31]. The amount of working memory used to solve a task is referred to as mental effort [31]. As soon as a mental task, e.g., creating a process model, overstrains the capacity of the modeler’s working memory, errors are likely to occur [45] and may affect the modeler’s style.

3.2 The process of process modeling

The PPM refers to the formalization of a business process from a domain description. During the formalization phase, process modelers create a syntactically correct process model reflecting a given domain description by interacting with the process modeling environment [14]. This modeling process can be described as an iterative and highly flexible process [7, 27], dependent on the individual modeler and the modeling task at hand [50]. At an operational level, the modeler’s interactions with the modeling environment typically consist of a cycle of three successive phases: (1) comprehension (i.e., the modeler forms a mental model of domain behavior), (2) modeling (i.e., the modeler maps the mental model to modeling constructs), and (3) reconciliation (i.e., the modeler reorganizes the process model) [37, 43].

3.2.1 Comprehension

According to [29], when facing a task, the problem solver first formulates a mental representation of the problem and then uses it for reasoning about the solution and the selection of problem-solving methods. In process modeling, the task is to create a model which represents the behavior of a domain. The process of forming mental models and applying methods for achieving the task is not done in one step for the entire problem. Rather, due to the limited capacity of working memory, the problem is broken into pieces that are addressed sequentially, chunk by chunk [37, 43].

3.2.2 Modeling

Using the problem and solution developed during the previous comprehension phase, a modeler materializes the solution by creating or changing a process model [37, 43]. The modeler’s utilization of working memory influences the number of executed modeling steps before the modeler is forced to revisit the problem for acquiring more information [37].

3.2.3 Reconciliation

After modeling, modelers typically reorganize the process model (e.g., rename activities) and utilize the process model’s secondary notation (e.g., the layout, typographic cues) to enhance the process model’s understandability [21, 32]. However, the amount of reconciliation in a PPM instance is influenced by a modeler’s ability to place elements correctly when creating them, alleviating the need for additional layouting [37].

3.3 Capturing events of the process of process modeling

To investigate the PPM, actions taken during modeling have to be recorded and mapped to the phases described above. Process modeling with dedicated tools consists of adding nodes and edges to the process model, naming or renaming activities, and adding conditions to edges. In addition, a modeler can influence the process model’s secondary notation, e.g., by laying out the process model using move operations for nodes or by utilizing bendpoints to influence the routing of edges (cf. [37]). To capture modeling activities and obtain insights on how process models are created, we instrument a basic process modeling editor in the following way: each user interaction is captured together with the corresponding time stamp in an event log, thereby describing the process model creation step by step. By capturing all interactions with the modeling environment, we are able to replay a recorded modeling process at any point in time without interfering with the modeler or her problem-solving efforts. Cheetah Experimental Platform (CEP) [35] provides the features for model editing, event recording, and replay.
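To make the recording concrete, the following is a minimal sketch of how such an event log could be represented; the record fields, class names, and event-type strings are our own illustration and do not reflect CEP's internal data model.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional

@dataclass
class ModelingEvent:
    """One user interaction with the modeling editor (illustrative schema, not CEP's)."""
    timestamp: datetime          # when the interaction occurred
    event_type: str              # e.g., "CREATE_NODE", "CREATE_EDGE", "MOVE_NODE", "RENAME_ACTIVITY"
    element_id: Optional[str]    # the model element affected by the interaction, if any

class EventRecorder:
    """Collects events in order of occurrence so that a PPM instance can be replayed later."""
    def __init__(self) -> None:
        self.log: List[ModelingEvent] = []

    def record(self, event_type: str, element_id: Optional[str] = None) -> None:
        self.log.append(ModelingEvent(datetime.now(), event_type, element_id))

# Usage: the editor calls record() on every interaction.
recorder = EventRecorder()
recorder.record("CREATE_NODE", "activity_1")
recorder.record("MOVE_NODE", "activity_1")
```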

3.4 Quantifying the process of process modeling

Having recorded actions taken during model creation, the resulting log of modeling events allows for a quantitative analysis of PPM instances. As described in [37], comprehension (C), modeling (M), and reconciliation (R) phases are identified by grouping events. The PPM instance can then be divided into modeling iterations. One iteration is assumed to comprise a comprehension (C), modeling (M), and reconciliation (R) phase in this order. The iterations of a modeling process are identified by aligning its phases to the CMR-pattern. If a phase of this pattern is not present, the respective phase is skipped and the process is considered to continue with the next phase of the pattern. We use five measures to quantify the PPM.
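As an illustration of the alignment step, the following sketch groups an already classified sequence of phases into CMR iterations. It encodes one possible reading of the skipping rule described above; the function name and the exact tie-breaking are our own choices.

```python
from typing import List, Tuple

def align_to_cmr(phases: List[str]) -> List[Tuple[str, ...]]:
    """Group a sequence of phase labels ('C', 'M', 'R') into CMR iterations.

    Each iteration follows the order C, M, R; a phase that would move backwards
    in this order starts a new iteration, so missing phases are simply skipped
    (illustrative reading of the alignment described in the text).
    """
    iterations: List[Tuple[str, ...]] = []
    current: List[str] = []
    order = {'C': 0, 'M': 1, 'R': 2}
    for phase in phases:
        if current and order[phase] <= order[current[-1]]:
            iterations.append(tuple(current))
            current = []
        current.append(phase)
    if current:
        iterations.append(tuple(current))
    return iterations

# Example: C M R C M C R  ->  (C, M, R), (C, M), (C, R)
print(align_to_cmr(['C', 'M', 'R', 'C', 'M', 'C', 'R']))
```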

3.4.1 Number of PPM iterations

This measure counts the modeling iterations in a PPM instance reflecting how often a modeler had to interrupt modeling for comprehension or reconciliation.

3.4.2 Iteration chunk size

Modelers can be assumed to conduct modeling in chunks of different sizes. The iteration chunk size is the average number of create and delete operations per PPM iteration and reflects the ability to model large parts of a model without the need to comprehend or reconcile.

3.4.3 Share of comprehension

In comprehension phases, a mental model of the problem and a corresponding solution is developed. Differences in the time spent on comprehension can be expected to influence modeling styles and the modeling result. We quantify this aspect as the ratio of the average length of a comprehension phase in a process to the average length of an iteration. We neglect the initial comprehension phase to avoid a bias from the time needed for reading the task description.
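Stated as a formula in our own notation (the text defines the ratio only verbally; in particular, whether the denominator also excludes the first iteration is our reading):

\[
\text{share of comprehension} \;=\; \frac{\frac{1}{n-1}\sum_{i=2}^{n} \mathrm{dur}(C_i)}{\frac{1}{m}\sum_{j=1}^{m} \mathrm{dur}(I_j)},
\]

where \(C_1,\ldots,C_n\) denote the comprehension phases of a PPM instance (the initial phase \(C_1\) is excluded) and \(I_1,\ldots,I_m\) its iterations.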

3.4.4 Reconciliation breaks

A steady process of modeling should be a sequence of iterations of the CMR-pattern. Reconciliation can sometimes be skipped if the modeler places all model elements directly at the right spot. However, we may observe iterations of CR-patterns, i.e., an iteration without a modeling phase, where a modeler interrupts the common flow of modeling for further reconciliation. We quantify this aspect as the relative share of iterations that comprise unexpected reconciliation (without modeling).

3.4.5 Delete iterations

From time to time, modelers are required to remove content from the process model. This might happen when modelers identify errors in the model that are resolved by removing modeling constructs and implementing the desired functionality. This measure is the ratio of the number of iterations in a PPM instance that contain delete operations to the total number of iterations in that PPM instance.
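The five measures can be computed directly from the iterations of a PPM instance. The sketch below assumes that phases have already been grouped into iterations as described above; the field names, and the choice to include all iterations in the denominator of the share of comprehension, are our own.

```python
from dataclasses import dataclass
from typing import Dict, List

@dataclass
class Iteration:
    """One CMR iteration of a PPM instance (illustrative fields, our own naming)."""
    creates: int               # number of create operations in the modeling phase
    deletes: int               # number of delete operations in the modeling phase
    comprehension_secs: float  # duration of the comprehension phase (0 if skipped)
    duration_secs: float       # total duration of the iteration
    has_modeling: bool         # whether a modeling phase occurred
    has_reconciliation: bool   # whether a reconciliation phase occurred

def ppm_measures(iterations: List[Iteration]) -> Dict[str, float]:
    """Compute the five measures of Sect. 3.4 for one PPM instance (sketch, assumes >= 2 iterations)."""
    n = len(iterations)
    # 3.4.1: number of PPM iterations
    num_iterations = float(n)
    # 3.4.2: average number of create/delete operations per iteration
    chunk_size = sum(i.creates + i.deletes for i in iterations) / n
    # 3.4.3: avg comprehension phase length / avg iteration length,
    #        neglecting the initial comprehension phase
    later = iterations[1:]
    share_comprehension = (sum(i.comprehension_secs for i in later) / len(later)) / \
                          (sum(i.duration_secs for i in iterations) / n)
    # 3.4.4: share of iterations with reconciliation but no modeling (CR-pattern)
    reconciliation_breaks = sum(1 for i in iterations
                                if i.has_reconciliation and not i.has_modeling) / n
    # 3.4.5: share of iterations containing delete operations
    delete_iterations = sum(1 for i in iterations if i.deletes > 0) / n
    return {
        "num_iterations": num_iterations,
        "chunk_size": chunk_size,
        "share_comprehension": share_comprehension,
        "reconciliation_breaks": reconciliation_breaks,
        "delete_iterations": delete_iterations,
    }
```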

4 Building a model for understanding modeling styles

When comparing the PPM instances of different modelers, who were creating a formal process model from the same informal process description, we observed that groups of PPM instances exposed similar characteristics and that different modelers exhibit genuinely distinct modeling styles [37]. However, it remained unclear what modeling styles can be found in practice, and more importantly, how the selection of a particular style is influenced.

Given the lack of an in-depth understanding of both the modeling styles and the influencing variables, we follow an explorative approach. Rather than addressing a defined set of hypotheses, our aim is to investigate whether distinct modeling styles exist, to explore what distinguishes them from one another, and to discover relations between them. The findings may form the basis for a model that ties together influence factors and modeling styles.

Building on the backgrounds introduced in Sect. 3, we summarize the most important aspects influencing process model creation as follows:

  1. Task-intrinsic characteristics, the factual properties of the process that shall be modeled,

  2. Task-extraneous characteristics, the way the factual properties of the process are presented and properties of the modeling tool and notation,

  3. Modeler-specific characteristics, the modeler’s cognitive abilities, but also preferences in terms of modeling and tool usage.

We discuss the first two categories in Sect. 4.1 and the modeler-specific characteristics in Sect. 4.2. In Sect. 4.3, we will then derive a setup that is suitable for building a model for understanding modeling styles.

4.1 Task-intrinsic and task-extraneous characteristics

Creating a formal process model from a given process description is influenced by characteristics of the concrete task. Section 3 discussed that the cognitive load of a task is determined by its intrinsic load and its extraneous load [31].

In our context, intrinsic load is determined by the model to be created. It can be characterized by the size (e.g., number of activities or control flow constructs) and complexity of the model structure and constructs. Yet, it is independent of the presentation of the modeling task to the modeler.

Extraneous load, by contrast, concerns the presentation of the task to the modeler. For instance, in [36], restructuring the informal task description significantly influenced the modelers’ performance, even though no changes were made to the intrinsic load of the modeling assignment. If the cognitive load exceeds the modeler’s working memory capacity, errors are likely to occur [45] and may affect the modeler’s style. The extraneous load is part of the task-extraneous properties, which also include the properties of the modeling tool and notation that constrain the modeling process.

4.2 Modeler-specific characteristics

Modeler-specific characteristics comprise cognitive characteristics and preferences regarding model creation and tool usage. The former are related to the capacity of the working memory, which can be expected to affect the cognitive load imposed by the task. This category also includes the modeler’s expertise, e.g., the modeler’s experience with the modeling notation, the modeling domain [31], or the modeling tool. In addition to cognitive and task-specific characteristics, distinct preferences of a modeler on how to create a model in terms of layouting and tool usage play a role. For instance, [37] describes, on the one hand, modelers who carefully place and arrange nodes and edges of a model to achieve an appealing layout. On the other hand, the study reports on modelers who carelessly put nodes on the canvas and draw straight connecting edges, mostly without attending to the visual appearance of the resulting process model. It was also recognized that several modelers seemed to dislike activities disappearing from sight. More specifically, when a model is about to get larger than what can be shown on the display, many modelers spend much time on reconciliation to free up space on the visible canvas and prevent model elements from disappearing. Most notably, reconciliation to free up space on the canvas seems to be independent of whether the modeler is interested in an appealing layout or not.

4.3 Designing an exploratory study for building a model

As outlined above, we believe that several factors influence the modeling style, namely the intrinsic and the extraneous load of a modeling task as well as modeler-specific characteristics. When designing the setup for the modeling sessions, we have to assume that these factors have mutually independent influences on the modeling styles. For a first exploratory study, we control two factors (task-intrinsic load and modeler-specific characteristics) and keep the remaining factor (task-extraneous characteristics) constant.

  1. We control modeler-specific characteristics by conducting the exploratory study with a large number of participants (\(>\)100). Hence, it is reasonable to assume that the subjects are representative of the general population in terms of cognitive characteristics. The subjects’ expertise (both modeling and domain knowledge) turned out to be quite uniform (cf. Sect. 5).

  2. We control task-intrinsic load by giving each participant modeling tasks of two different processes in the form of a textual description. These processes are to be sufficiently distinct to ensure that the influence of task-specific characteristics materializes.

  3. We keep the task-extraneous characteristics constant. Textual descriptions for both modeling tasks are given in the same style with respect to the process to be modeled. Also, the influence of tool and notation is kept constant by letting all participants model the process in the same editor featuring limited BPMN syntax and modeling functionality.

5 Data collection

Section 5.1 presents the planning of the exploratory study to investigate modeling styles. The execution of the study is described in Sect. 5.2.

5.1 Definition and planning

This section contains requirements regarding the subjects of the exploratory study as well as information on the developed materials and the data to be collected in this exploratory study.

5.1.1 Subjects

When investigating the PPM, one of the key challenges is to balance the difficulty of the modeling task to be executed with the knowledge of the participants. If the modeling task is too complicated, hardly any conclusions on modeling style can be drawn since most modelers would experience serious difficulties. By contrast, if the task is too easy, hardly any differences can be observed since challenging situations are a key ingredient of problem solving. Hence, the targeted subjects should be moderately familiar with business process management and imperative process modeling notations to avoid problems with the modeling notation, but still encounter some challenges when creating the process models of the given difficulty.

5.1.2 Objects

The study was designed to collect PPM instances of students with moderate process modeling skills creating a formal process model in BPMN from an informal description. Each student was asked to create two models. To control task-intrinsic load and observe task-specific characteristics, the objects have to be sufficiently different. We accounted for this aspect by considering processes of different domains, sizes, and structures.

The first modeling assignment is a process describing the activities a pilot has to execute prior to taking off with an aircraft. The process model consists of 12 activities and contains basic control flow patterns, such as sequence, parallel split, synchronization, exclusive choice, and simple merge [49].

The second process model to be created describes the process followed by the scouting department of a National Football League (NFL) team to acquire new players through the so-called NFL Draft. The process model is considerably smaller, consisting of eight activities, while still incorporating the basic control flow patterns of sequence, parallel split, synchronization, exclusive choice, simple merge, and structured loop [49].

5.1.3 Response variables

To collect PPM instances of all participants, all details of the modeling process have been recorded. Further, we measured the modelers’ perceived mental effort for each modeling task since mental effort provides a fine-grained measure for the modeler’s performance [52]. The collected PPM instances are analyzed with data mining techniques to identify modeling styles (Sect. 6) and to reveal relevant response variables that govern modeling styles and their interplay with influence factors (Sect. 7).

5.1.4 Instrumentation and data collection

CEP was utilized for recording and analyzing PPM instances. CEP provides support for conducting experiments and case studies by providing means to define an experimental workflow for each participant. This reduces the risk of students accidentally deviating from the intended research design [35]. To limit extraneous cognitive load by complicated tools or notations [7], we used a subset of BPMN. In this way, modelers were confronted with a minimal number of distractions, but the essence of how process models are created could still be captured. Based on a pretest at the University of Innsbruck, minor updates have been applied to CEP’s functionality and the task descriptions.

5.2 Performing the exploratory study

This section describes the execution of the exploratory study.

5.2.1 Execution of exploratory study

The modeling sessions were conducted in November 2010 with students of a graduate course on Business Process Management at Eindhoven University of Technology and in January 2011 with students from Humboldt-Universität zu Berlin following a similar course. The modeling session at each university started with a demographic survey, followed by a modeling tool tutorial explaining the basic features of CEP. After that, the actual modeling task was presented, in which the students had to model the “Pre-Flight” process described above. After completing the first modeling task, students were asked to create the process model for the “NFL Draft” process. This was done by 102 students in Eindhoven and 13 students in Berlin. By conducting the modeling sessions during class and closely monitoring the students, we mitigated the risk of falsely identifying comprehension phases due to external distractions. Each modeling task was followed by a self-rating of the mental effort required for completing the modeling task on a seven-point Likert scale ranging from Very Low through Medium to Very High. Self-rating scales for mental effort have been shown to reliably measure mental effort and are thus widely adopted [30]. Students were not instructed about the research questions to be answered in the exploratory study prior to performing the modeling task. No time restrictions were imposed on the students. Participation was voluntary; data collection was performed anonymously.

5.2.2 Data validation

Similar to [21], we screened the subjects for familiarity with BPMN by asking them whether they would consider themselves to be very familiar with BPMN, using a Likert scale with values ranging from Strongly disagree (1) through Neutral (4) to Strongly agree (7). The familiarity with BPMN was slightly below Neutral (\(M = 3.47\), SD = 1.45). For confidence in understanding BPMN models, the students reported a mean value slightly above Neutral (\(M = 4.05\), SD = 1.49). Finally, for perceived competence in creating BPMN models, a mean value slightly below Neutral was reported (\(M = 3.65\), SD = 1.41). We conclude that the subjects constituted a rather homogeneous group, reporting a familiarity close to average. Thus, the participants are well suited for investigating their modeling style when translating an informal description into a formal BPMN model.

Similarly, participants indicated their familiarity with Pre-Flight processes and the NFL on the same Likert scale (Pre-Flight: \(M = 2.40\), SD = 1.27; NFL Draft: \(M = 3.45\), SD = 1.91). For the NFL Draft modeling task, modelers indicated a slightly higher domain knowledge. Still, for both tasks, the average familiarity is below Neutral, indicating that modelers could hardly rely on prior domain knowledge for performing the task.

When investigating the mental effort data, we observed a lower mental effort for the second modeling task (Pre-Flight: \(M = 4.01\), SD = 1.047; NFL Draft: \(M = 3.77\), SD = 0.974). The difference turned out to be statistically significant (Wilcoxon signed-rank test, \(Z = -2.54\), \(p = 0.011\)), indicating that modelers perceived the second modeling task to be easier than the first one. This is consistent with the smaller size of the second modeling task. These results indicate that the two processes to be modeled are indeed different and, thus, allow for controlling task-intrinsic load (cf. Sect. 4).
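Such a paired comparison of mental effort ratings could be reproduced along the following lines with SciPy; the rating lists below are placeholders for illustration only, not the study data.

```python
from scipy import stats

# Paired mental effort ratings (7-point scale) per participant -- placeholder values.
effort_preflight = [5, 4, 5, 3, 6, 4, 4, 6, 3, 5]
effort_nfl_draft = [4, 3, 4, 2, 4, 3, 3, 4, 2, 4]

# Wilcoxon signed-rank test for paired, ordinal data (the test used in the paper).
result = stats.wilcoxon(effort_preflight, effort_nfl_draft)
print(f"W = {result.statistic}, p = {result.pvalue:.3f}")
```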

6 Clustering

To investigate the existence of different modeling styles, we apply cluster analysis to the collected PPM instances and analyze whether groups of PPM instances exhibiting similar characteristics can be identified. The applied clustering procedure is described in Sects. 6.1 and 6.2. The identified clusters are then visualized and analyzed to determine whether they indeed represent different modeling styles. To check whether the identified modeling styles persist over tasks with different characteristics, clustering is applied to two tasks with different characteristics. Results of clustering the Pre-Flight task are discussed in Sect. 6.3, while the clustering results of the NFL Draft task are discussed in Sect. 6.4.

6.1 PPM profile for clustering

First and foremost, we need a representation of all collected PPM instances that is suited for clustering. Based on our previous experience, we decided to focus on four aspects: the addition of content, the removal of content, reconciliation of the model, and comprehension time, i.e., the time when the modeler does not work on the process model. To also reflect that modeling is a time-dependent process, we do not just look at the total amount of modeling actions and comprehension, but at their distribution over time. We sampled every process into segments of 10 s length. For each segment, we compute its profile \((a,d,r,c)\), i.e., the numbers \(a, d\), and \(r\) of add, delete, and reconciliation events, and the time \(c\) spent on comprehension. The profile of one PPM instance is the sequence \((a_1,d_1,r_1,c_1)(a_2,d_2,r_2,c_2)\ldots \) of its segments’ profiles. The values \(a, d\), and \(r\) are obtained per segment by classifying each event according to Table 1. Adding a condition to an edge was considered part of creating an edge. The comprehension time \(c\) was computed as follows. First, events were grouped into intervals, i.e., sequences of events in which two consecutive events are \(\le \)1 s apart. Second, the interval duration was calculated as the time difference between its first and its last event (intervals comprising a single event were assigned a duration of 1 s). Comprehension time \(c\) is the length of the segment (10 s) minus the duration of all intervals in the segment. For example, if the modeler moved activity A after 3 s, activity B after 3.5 s, and activity C after 4.2 s, the comprehension time would be 8.8 s. To give all PPM profiles equal length, we normalized profiles by extending them with segments of no interaction.
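The sampling described above can be sketched as follows, assuming events have already been classified as add, delete, or reconciliation according to Table 1 and that timestamps are given in seconds from the start of the PPM instance; all names and the interval grouping details are our own.

```python
import math
from typing import List, Tuple

SEGMENT_LEN = 10.0   # segment length in seconds
GAP_THRESHOLD = 1.0  # consecutive events <= 1 s apart belong to the same interval

def segment_profiles(events: List[Tuple[float, str]],
                     total_duration: float) -> List[Tuple[int, int, int, float]]:
    """Compute one (a, d, r, c) profile per 10 s segment of a PPM instance (sketch).

    `events` is a list of (timestamp_in_seconds, kind) pairs with kind in
    {'add', 'delete', 'reconciliation'}, i.e., already classified per Table 1.
    """
    n_segments = max(1, math.ceil(total_duration / SEGMENT_LEN))
    profiles = []
    for s in range(n_segments):
        lo, hi = s * SEGMENT_LEN, (s + 1) * SEGMENT_LEN
        seg = [(t, k) for t, k in events if lo <= t < hi]
        a = sum(1 for _, k in seg if k == 'add')
        d = sum(1 for _, k in seg if k == 'delete')
        r = sum(1 for _, k in seg if k == 'reconciliation')
        # group the segment's events into intervals of consecutive events <= 1 s apart
        times = sorted(t for t, _ in seg)
        busy, i = 0.0, 0
        while i < len(times):
            j = i
            while j + 1 < len(times) and times[j + 1] - times[j] <= GAP_THRESHOLD:
                j += 1
            busy += (times[j] - times[i]) if j > i else 1.0  # single-event intervals count as 1 s
            i = j + 1
        profiles.append((a, d, r, SEGMENT_LEN - busy))  # c = segment length minus busy time
    return profiles

# Example from the text: moves at 3 s, 3.5 s, and 4.2 s yield a comprehension time of ~8.8 s.
print(segment_profiles([(3.0, 'reconciliation'), (3.5, 'reconciliation'),
                        (4.2, 'reconciliation')], 10.0))
```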

Table 1 Classification of CEP’s user interactions

6.2 Performing the clustering

The PPM profiles were exported from CEP [35] and subsequently clustered using Weka. The K-Means algorithm [18] utilizing a Euclidean distance measure was chosen for clustering as it constitutes a well-known means for cluster analysis. As K-Means might converge to a local minimum [13], the obtained clustering has to be validated. If the identified clusters exhibit significant differences with regard to the measures described in Sect. 3, we conclude that different modeling styles were identified. K-Means requires the number of clusters to be known a priori. Thus, we started with two expected clusters, gradually increasing the number of expected clusters. Similarly, several different values for the seed of the clustering were investigated.
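The authors used Weka for this step; an equivalent exploration could look as follows with scikit-learn. This is a sketch: the profile matrix is random placeholder data, and the ranges of cluster counts and seeds are illustrative (Euclidean distance is scikit-learn's default for K-Means).

```python
import numpy as np
from sklearn.cluster import KMeans

# Each row is one PPM profile: the (a, d, r, c) segment profiles flattened and
# padded to equal length with zero-interaction segments (random data for illustration).
rng = np.random.default_rng(0)
profiles = rng.random((115, 4 * 240))  # 115 participants, 240 segments of 4 values each

# Gradually increase the number of expected clusters and vary the seed,
# then inspect cluster sizes and validate with the measures of Sect. 3.4.
for k in range(2, 6):
    for seed in (1, 10, 20, 30):
        km = KMeans(n_clusters=k, random_state=seed, n_init=10).fit(profiles)
        sizes = np.bincount(km.labels_)
        print(f"k={k}, seed={seed}, cluster sizes={sizes.tolist()}")
```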

6.3 Clustering of Pre-Flight task

For the first modeling task, we start by presenting the result of clustering. Then, we illustrate the clusters visually, conduct a statistical validation of the clustering, interpret their differences, and report on findings from replaying representative PPM instances.

6.3.1 Result of clustering

Setting the number of expected clusters to 2 resulted in only one major cluster. For a value of 3, we obtained two major clusters and one cluster of 2 PPM instances. The most promising results were achieved with 4 expected clusters and a seed of 10, returning three major clusters and one small cluster of 2 PPM instances. We considered these three major clusters for further analysis; increasing the number of expected clusters only generated further small clusters. The three major clusters comprise 42, 22, and 49 instances, called C1, C2, and C3 in the sequel.

6.3.2 Cluster visualization

In order to visualize the obtained clusters, we calculate the average number of adding, deleting, and reconciliation operations per segment for each cluster. To obtain a smooth representation, we also calculate a moving average over six segments, presented in Figs. 1, 2, and 3 for clusters C1, C2, and C3. The horizontal axis denotes the segments derived by sampling the PPM instances. The vertical axis indicates the average number of operations that were performed per segment. For example, a value of 0.8 for segment 9 (cf. Fig. 2) indicates that the modelers in C2 averaged 0.8 adding operations within this 10 s segment.
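The smoothing is a plain moving average over six segments; a short sketch follows (the handling of the window at the series boundaries is our own choice, the original does not specify it).

```python
import numpy as np

def moving_average(values, window=6):
    """Moving average over `window` segments, same length as the input ('same' mode)."""
    kernel = np.ones(window) / window
    return np.convolve(values, kernel, mode='same')

# Example: average adding operations per segment for one cluster (placeholder values).
adding_per_segment = [0.0, 0.2, 0.5, 0.8, 0.7, 0.6, 0.9, 0.4, 0.3, 0.1]
print(moving_average(adding_per_segment).round(2))
```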

Fig. 1 Cluster C1 Pre-Flight task

Fig. 2 Cluster C2 Pre-Flight task

Fig. 3 Cluster C3 Pre-Flight task

C1 (cf. Fig. 1) is characterized by long PPM instances, as the first time the adding series reaches 0 is after about 205 segments. Additionally, the delete series indicates more delete operations compared to the other clusters. Several fairly large spikes of reconciliation activity can be observed, the most prominent one after about 117 segments.

C2, as illustrated in Fig. 2, is characterized by a fast start, as a peak in adding activity is reached after 13 segments. In general, the adding series lies between 0.5 and 0.9 operations most of the time, which is higher compared to the other two clusters. The fast modeling behavior results in short PPM instances, as the adding series reaches 0 for the first time after about 110 segments.

At first sight, C3 (cf. Fig. 3) seems to lie somewhere between C1 and C2. The adding curve is mostly situated between 0.4 and 0.7, a little lower than for C2, but still higher compared to C1. Similar values can be observed for the reconciliation curve. The deleting curve remains below 0.1. The duration of the PPM instances also lies between the durations of C1 and C2, as the adding series reaches 0 for the first time after about 137 segments.

6.3.3 Cluster validation

Next, we validated the clusters by testing whether they indeed expose significant differences.

Table 2 presents general statistics on the number of adding operations, the number of deleting operations, and the number of reconciliation operations for each cluster. Modelers in C1 carried out more add and delete operations and, most notably, almost twice as many reconciliation operations compared to C2 and C3. The numbers for C2 and C3 appear to be similar.

Table 2 Statistics per cluster Pre-Flight task

We conducted the statistical analysis as follows. If the data were normally distributed and homogeneity of variances was given, we used one-way ANOVA to test for differences between the groups. Pairwise comparisons were done using the Bonferroni post hoc test. Note that the Bonferroni post hoc test uses an adapted significance level, so that \(p\) values \(< 0.05\) are considered to be significant; i.e., there is no need to divide the significance level by the number of groups. In case a normal distribution or homogeneity of variance was not given, a nonparametric alternative to ANOVA, i.e., Kruskal–Wallis, was utilized to test for differences between the groups. Pairwise comparisons were done using the t test for (un)equal variances (depending on the data) if a normal distribution was given. If no normal distribution could be identified, the Mann–Whitney test was utilized. In either case, i.e., t test or Mann–Whitney test, the Bonferroni correction was applied; i.e., the significance level was divided by the number of clusters.
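The decision procedure can be summarized in SciPy terms as follows. This is a sketch: the normality and homogeneity checks use Shapiro–Wilk and Levene tests as one possible instantiation (the paper does not state which checks were used), and the Bonferroni correction is applied uniformly by dividing the significance level, which is equivalent to the adjusted post hoc p values reported for the ANOVA case.

```python
from itertools import combinations
from scipy import stats

def compare_clusters(groups, alpha=0.05):
    """Omnibus and pairwise tests across clusters, following the procedure above (sketch).

    `groups` is a list of per-cluster value lists for one measure.
    """
    normal = all(stats.shapiro(g).pvalue > alpha for g in groups)
    homogeneous = stats.levene(*groups).pvalue > alpha
    if normal and homogeneous:
        stat, p = stats.f_oneway(*groups)          # one-way ANOVA
        print(f"ANOVA: F={stat:.2f}, p={p:.3f}")
    else:
        stat, p = stats.kruskal(*groups)           # Kruskal-Wallis
        print(f"Kruskal-Wallis: H={stat:.2f}, p={p:.3f}")
    # Pairwise comparisons with a Bonferroni-corrected significance level
    corrected_alpha = alpha / len(groups)
    for (i, g1), (j, g2) in combinations(enumerate(groups), 2):
        if normal:
            _, pp = stats.ttest_ind(g1, g2, equal_var=homogeneous)
        else:
            _, pp = stats.mannwhitneyu(g1, g2)
        verdict = 'significant' if pp < corrected_alpha else 'n.s.'
        print(f"C{i+1} vs C{j+1}: p={pp:.3f} ({verdict} at {corrected_alpha:.3f})")

# Usage with placeholder data for three clusters:
compare_clusters([[3.1, 2.9, 3.5, 3.2], [4.0, 4.2, 3.9, 4.1], [3.6, 3.4, 3.8, 3.5]])
```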

As shown in Table 3, we observe significant differences between C1 and C2 and C1 and C3, but not between C2 and C3. Only significant differences are reported in this and all following tables.

Table 3 Significant differences for statistics Pre-Flight task

To further distill the properties of the three clusters, we calculated the measures described in Sect. 3.4 for each PPM instance. Table 4 provides an overview of the obtained average values. As indicated in Fig. 1, C1 has the highest number of PPM iterations. Tightly connected to this observation is the average iteration chunk size: modelers in C2 added by far the most content per iteration to the process model. Also, the share of iterations containing delete operations is higher for C1 than for the other clusters. The amount of time spent on comprehending the task description and developing a plan for incorporating it into the process model seems to be far larger for C1 compared to C2, which has the lowest share of comprehension, but also larger compared to C3. When considering reconciliation breaks, C3 sets itself apart, posting the lowest number of reconciliation breaks, whereas C1 has the highest number.

Table 4 Measures per cluster Pre-Flight task

The results of a statistical analysis of the differences between the groups are presented in Table 5. In contrast to the statistics presented in Table 3, we were able to identify significant differences between C2 and C3.

Table 5 Significant differences for measures Pre-Flight task

6.3.4 Interpretation of clusters

Our results clearly indicate that C1 can be distinguished from C2 and C3. Modelers in C1 had rather long PPM instances (cf. number of PPM iterations), spent more time on comprehension compared to C2, started rather slowly (cf. number of adding operations and chunk size), and showed a high amount of delete and reconciliation operations. This suggests that modelers in C1 were not as goal-oriented as their colleagues in other clusters, since they spent a great amount of time on comprehension, added more modeling elements which were subsequently removed, and put significantly more effort into improving the visual appearance of the model.

Focusing on C2, we observe a very steep start of the adding curve in Fig. 2, indicating that modelers started creating the process model right away. The measures described in Sect. 3.4 further indicate high chunk sizes, a low number of PPM iterations, and little comprehension time. Thus, modelers of C2 appear to be focused and goal-oriented when creating the model. They are quick in making decisions about how to proceed and only slow down from time to time for some reconciliation.

The PPM instances of C3 are shorter compared to C1 and longer compared to C2. The reconciliation curve is close to the adding curve. Notably, there is no reconciliation spike once the number of adding operations decreases. Albeit close to C2, C3 is characterized by slower and more balanced model creation (smaller chunk size, higher number of iterations, more comprehension time). Thus, C3 follows a rather structured approach to modeling.

6.3.5 Analysis of cluster representatives

We gained further insights into the cluster differences by manually comparing representative PPM instances. Clustering with K-Means yields cluster centroids, i.e., the mean values for add, delete, reconciliation, and comprehension over all PPM profiles inside a cluster. For each cluster, we chose the PPM instance with the smallest distance to this centroid as a representative and compared these representatives using the replay functionality of CEP [35]. Then, we repeated the procedure with the PPM instances showing the second-smallest distance to the centroids.
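Selecting representatives by distance to the centroid could be sketched as follows, reusing the fitted K-Means model and profile matrix from the clustering sketch above; variable names are ours.

```python
import numpy as np

def cluster_representatives(profiles, labels, centroids, n_best=2):
    """Return, per cluster, the indices of the PPM instances closest to the centroid."""
    representatives = {}
    for c in range(len(centroids)):
        members = np.where(labels == c)[0]
        dists = np.linalg.norm(profiles[members] - centroids[c], axis=1)
        representatives[c] = members[np.argsort(dists)[:n_best]].tolist()
    return representatives

# Usage with a fitted KMeans model `km` and the profile matrix `profiles`:
# reps = cluster_representatives(profiles, km.labels_, km.cluster_centers_)
```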

The representative for C1 is very volatile in terms of speed and locality of modeling. Adding elements is done in an unsteady way with intermediate layouting, conducted in short phases. The aspect of locality relates primarily to reconciliation. The modeler frequently touched not only the last elements added, but also distant parts of the process model. These observations are largely confirmed by the second representative for C1, which further shows long reconciliation phases to gain space on the canvas.

The representative for C2 follows a rather straight, steady, and quick modeling approach. A group of elements is placed first and only later connected by edges. There is little reconciliation since the layout appears to be considered when adding elements. If applied, reconciliation refers to the last added elements only. The second representative follows the same approach until two-thirds of the model have been created. Then, it deviates by relayouting the model to gain space on the canvas.

For C3, the representative PPM instance is also steady, but slower than those investigated for C2. At most two elements are added at a time before they get connected. Reconciliation is done continuously, but restricted locally. Model parts that are distant from the last added elements are not changed. These observations are confirmed by the second representative.

In essence, the representatives of the clusters appear to be distinguished by two aspects in particular, the steadiness of the PPM instance in terms of adding elements, and the characteristics of the reconciliation phases. The latter are characterized by their length and their locality.

6.4 Clustering of NFL Draft task

To test whether the identified clusters persist over different modeling tasks, we repeated the cluster analysis procedure for the second modeling task.

6.4.1 Result of clustering

Again, we conducted the clustering by gradually increasing the number of expected clusters and investigating different seeds. The most promising results were obtained with a seed of 30 and 5 expected clusters. We obtained three major clusters of 30, 31, and 42 PPM instances. Two smaller clusters of 4 and 8 PPM instances were not considered further.

6.4.2 Cluster visualization

The cluster visualizations are presented in Figs. 4, 5, and 6, respectively.

Fig. 4 Cluster C1 NFL Draft task

Fig. 5 Cluster C2 NFL Draft task

Fig. 6 Cluster C3 NFL Draft task

Figure 4 depicts cluster C1, which is characterized by long PPM instances, exhibiting a slow start and a low adding curve. The adding curve is closely followed by the reconciliation curve, indicating several spikes of reconciliation and much reconciliation after the adding curve starts to decrease. The deleting curve is generally higher compared to the other clusters.

Cluster C2 (cf. Fig. 5) shows short PPM instances and a high adding curve, showing a decrease after 60 segments before reaching 0 after 77 segments. Also, there is a fast increase right at the beginning of the modeling process. The reconciliation curve follows the adding curve with some additional reconciliation at the end. The deleting curve is rather low.

Cluster C3 (cf. Fig. 6) seems to be situated between cluster C1 and cluster C2. It does not exhibit the fast start of cluster C2, but shares similarities for the deleting curve. The PPM instances in C3 are considerably shorter than those in C1, but not as short as in cluster C2. Modelers in C3 show a rather slow start. After 10 segments, the adding curve is close to 0.2, which is similar to C1, but not to C2. Afterward, C3 outperforms C1 in terms of adding elements to the process model. The reconciliation curve follows the adding curve, not showing any major spikes in reconciliation activity.

6.4.3 Cluster validation

The average number of adding operations, the average number of deleting operations, and the average number of reconciliation operations are presented in Table 6. As for the first modeling task, cluster C2 and cluster C3 exhibit similar values, while cluster C1 sets itself apart by the adding, deleting, and reconciliation operations. The statistical analysis illustrated in Table 7 supports this observation by indicating significant differences between C1 and C2 and C1 and C3, but not between C2 and C3.

Table 6 Statistics per cluster NFL Draft task
Table 7 Significant differences for statistics NFL Draft task

The average values retrieved by calculating the measures introduced in Sect. 3.4 are listed in Table 8. The three clusters seem to be different when it comes to chunk size and the number of PPM iterations. C2 has the lowest number of PPM iterations and the highest chunk size. C1 is on the opposite side of the spectrum posting the highest number of PPM iterations and the lowest chunk size. The average share of comprehension is similar for all clusters. In terms of reconciliation breaks, C1 has the highest value and C2 posts the lowest value. Delete iterations do not hint at any difference.

Table 8 Measures per cluster NFL Draft task

The corresponding statistical analysis is illustrated in Table 9, revealing significant differences between all clusters in terms of the number of PPM iterations. Similarly, chunk size is significantly different when comparing C1 and C2 and when comparing C2 and C3.

Table 9 Significant differences for measures NFL Draft task

6.4.4 Interpretation of clusters

Similar to the clusters identified for the Pre-Flight process, C1 can be distinguished from C2 and C3 (adding operations, reconciliation operations, number of PPM iterations). Again, modelers in C1 seem to be less goal-oriented and spent a lot of time on reconciliation. However, we could not identify the significant differences in terms of share of comprehension that we had observed for the first modeling task.

As for cluster C2, we do obtain significant differences regarding C3 only for iteration chunk size and the number of PPM iterations. This is in line with the first modeling task and suggests that modelers in C2 were very focused on executing the modeling task.

The PPM instances in C3 are longer compared to C2, but not as long as the PPM instances in C1. Modelers in C3 do not share the high number of reconciliation operations and the high number of deleting operations with C1. The overall picture drawn for C3 is similar to the Pre-Flight task. Thus, modelers in C3 can be seen as following a balanced modeling approach that is situated between the other two clusters.

6.4.5 Analysis of cluster representatives

Analyzing the representative PPM instance for C1 showed that it is structured by phases in which a certain model part is added and phases in which parts of a model are reconciled. We observed long phases of layouting that mainly relate to edges. Also, at the end, the model is refactored and layouting is improved. Long adding and reconciliation phases are also visible in the second representative.

The representative for C2 showed a very quick model creation. Also, the process was steady and the rate of adding elements appears to be constant. The PPM instance features only sparse reconciliation. Reconciliation seems to be avoided by considering the model layout when adding an element. If applied, layouting focuses on the elements last added. The second representative for C2 shows very similar characteristics. The only difference is that large sets of elements are added before they get connected.

For C3, the representative PPM instance follows a steady approach, but slower than the one for cluster C2. Also, reconciliation is more prominent than for C2, whereas the reconciliation phases are shorter than observed for C1. Also, reconciliation relates to a rather large area of the canvas. The second representative follows the same approach.

These observations are largely in line with those obtained for cluster representatives for the Pre-Flight process. Again, the locality of operations appears to be important.

In sum, we were able to identify three significantly different clusters representing different modeling styles for each modeling task. Further, the cluster characteristics were similar in terms of number of adding operations, number of deleting operations, and the number of reconciliation operations for the two modeling tasks. Differences among the clusters in the number of iterations and chunk size were consistent over both modeling tasks.

7 Identification of variables/generation of hypotheses

In this section, we pick up the observations made during the analysis presented in the previous section to further characterize the three different modeling styles. Some of our observations are already covered by the existing measures. For instance, we observed modelers who were considerably faster in adding elements to the process model than others, which relates to measuring the iteration chunk size since it reflects the number of added elements per PPM iteration. Other observations, in turn, point to potential additional factors characterizing modeling styles.

Below, we present six measures to further discriminate modeling styles on a statistical basis. They are explicitly derived from the reported observations and complement the set of measures needed to characterize modeling styles.

Adding rate. Our analysis showed that clusters deviate from each other in the number of adding operations, see Tables 4 and 8. Also, the steepness of the curves for adding operations in relation to PPM segments (Figs. 1–6) is different for the clusters. Thus, to consider differences between modelers in the speed of adding elements to the canvas, we define the adding rate. It is calculated by counting the number of adding operations within modeling phases, i.e., Create Node and Create Edge, and dividing it by the total duration of modeling phases in seconds within a PPM instance.

Avg. iteration duration. When replaying the PPM instances, we observed differences between modelers in terms of modeling speed. To further relate the modeling style to the actual time spent, and to distinguish quick from slow modeling, we consider the average iteration duration. It indicates how long an average PPM iteration takes. Modelers largely ignoring reconciliation phases or modelers who are particularly fast in adding elements should have shorter iterations. For this purpose, all durations of a modeler’s PPM iterations are measured and the mean value is calculated.

Initial comprehension duration. When replaying the PPM instances using CEP, we observed differences in the time it took modelers to start working on the process model. Some started right away adding the first elements, while others invested more time in gaining an understanding of the modeling task. To investigate the respective differences in modeling style, we defined the measure of initial comprehension duration. It captures the duration between opening the modeling editor and the beginning of the first modeling phase in milliseconds.

Reconciliation phase size. In both modeling sessions, cluster C1 sets itself apart from the other clusters in terms of reconciliation. Therefore, we consider this aspect further by the reconciliation phase size. It is calculated by counting the number of operations within a reconciliation phase. The individual reconciliation phase sizes are then aggregated by calculating the average size and the maximum size of a reconciliation phase. These two measures are motivated by the replay of PPM instances in CEP. We observed that reconciliation may be done rather continuously or in a very focused manner at a certain point in time, e.g., for gaining additional space on the canvas or resolving a major problem. The former is addressed by the average reconciliation phase size, since it reflects the number of reconciliation operations throughout the PPM. The latter is captured by the maximum size, which indicates a large chunk of reconciliation.

Number of reconciliation phases. This measure also aims at gaining insights into the modelers’ reconciliation behavior. C1 showed a higher number of reconciliation operations, but we did not know whether this was caused by many smaller reconciliation phases or by a few larger ones. Therefore, this measure complements the reconciliation phase size measure by counting the number of reconciliation phases in a PPM instance.

Number of moves per node. The replay of PPM instances close to the cluster centroids also hinted at modelers placing model elements at strategic places, alleviating the need for additional reconciliation. This aspect can be assumed to be reflected in the number of moves per node. We derive this measure by counting the number of move operations, i.e., Move Node, for each node within the process model and calculating the average number of move operations per node. The number of moves per node indicates how often a modeler touched a specific element. If modelers placed elements at strategic places, the average number of move operations should be considerably lower compared to modelers placing elements carelessly on the canvas and performing layout operations later on.
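The six additional measures can be derived from the same event log and phase information used in Sect. 3.4; the sketch below illustrates one possible computation. All field and event-type names are our own, the sketch assumes at least one modeling phase and iteration, and durations are kept in seconds (the paper reports the initial comprehension duration in milliseconds).

```python
from collections import Counter
from typing import Dict, List, Tuple

def additional_measures(events: List[Tuple[float, str, str]],
                        modeling_phases: List[Tuple[float, float]],
                        reconciliation_phases: List[List[Tuple[float, str, str]]],
                        iteration_durations: List[float],
                        editor_opened_at: float) -> Dict[str, float]:
    """Compute the additional measures of Sect. 7 for one PPM instance (sketch).

    `events` are (timestamp_in_seconds, event_type, element_id) triples; modeling
    phases, reconciliation phases, and iteration durations are assumed to be
    derived beforehand as in Sect. 3.4.
    """
    adds = [e for e in events if e[1] in ('CREATE_NODE', 'CREATE_EDGE')]
    modeling_time = sum(end - start for start, end in modeling_phases)
    moves = Counter(e[2] for e in events if e[1] == 'MOVE_NODE')
    nodes = {e[2] for e in events if e[1] == 'CREATE_NODE'}
    rec_sizes = [len(phase) for phase in reconciliation_phases]
    return {
        # adding operations per second spent in modeling phases
        "adding_rate": len(adds) / modeling_time if modeling_time else 0.0,
        "avg_iteration_duration": sum(iteration_durations) / len(iteration_durations),
        # time from opening the editor to the start of the first modeling phase
        "initial_comprehension_duration": modeling_phases[0][0] - editor_opened_at,
        "avg_reconciliation_phase_size": sum(rec_sizes) / len(rec_sizes) if rec_sizes else 0.0,
        "max_reconciliation_phase_size": float(max(rec_sizes, default=0)),
        "num_reconciliation_phases": float(len(rec_sizes)),
        "moves_per_node": sum(moves.values()) / len(nodes) if nodes else 0.0,
    }
```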

8 Analysis: influencing factors and distinct modeling styles

Equipped with the additional measures introduced in Sect. 7, this section presents a statistical analysis of the influences on the PPM. First, Sect. 8.1 focuses on further characterizing the different modeling styles identified in Sect. 6. Subsequently, Sect. 8.2 addresses the question of which factors influence the modeling style.

8.1 Distinct modeling styles

Below, we apply the measures defined in Sect. 7 to each of the clusters of the two modeling tasks.

8.1.1 Pre-Flight

Table 10 illustrates the mean values for each cluster. C2 sets itself apart in terms of adding rate, the amount of time spent on initial comprehension, and the avg. duration of PPM iterations. For these particular measures, hardly any differences can be identified between C1 and C3. In terms of reconciliation measures, i.e., number of moves per node, avg. reconciliation phase size, max. reconciliation phase size, and number of reconciliation phases, C1 posts the highest values. In each case, C2 has the second highest value, followed by C3. The differences between C2 and C3 are relatively small, though.

Table 10 Additional measures Pre-Flight

The statistical analysis presented in Table 11 supports most of the observations. C2 is indeed significantly different from C1 and C3 in terms of adding rate and initial comprehension duration. For average iteration duration, the difference is only significant when comparing C2 to C3. In terms of reconciliation measures, i.e., number of moves per node, max. reconciliation phase size, and number of reconciliation phases, the statistical analysis confirms the observation that C1 sets itself apart from C2 and C3. No differences were observed in terms of reconciliation behavior between C2 and C3. No significant differences could be identified in terms of avg. reconciliation phase size.

Table 11 Significant differences for additional measures Pre-Flight

8.1.2 NFL Draft

For the second modeling task, the statistics presented in Table 12 draw a similar picture. In terms of adding rate, C2 has the highest value. Similarly, C2 has the shortest initial comprehension phase. When considering the differences between C1 and C3, we observe a deviation from the first modeling task, namely a considerable gap between C1 and C3 in terms of initial comprehension duration and adding rate. C3 seems to lie between C1 and C2, a familiar picture throughout the data analysis. For iteration duration, C2 sets itself apart, while C1 and C3 post relatively similar values. In terms of the reconciliation statistics, similarities to the Pre-Flight modeling task can be identified, even though the differences are smaller, which might be caused by the smaller modeling task. Still, C1 posts the highest values in all reconciliation statistics.

Table 12 Additional measures NFL Draft

The statistical analysis for adding rate shows significant differences between C1 and C2 (cf. Table 13). When using a t test for pairwise comparison, the difference between C1 and C3 is also significant. The difference between C2 and C3 narrowly misses significance (\(t(70)=2.10, p=0.039\)), since the Bonferroni correction dictates a significance level of \(0.05/3 \approx 0.017\). Interestingly, when using the nonparametric Mann–Whitney test, the picture changes, indicating a significant difference between C2 and C3, but a narrowly nonsignificant difference between C1 and C3 (\(U=438, p=0.017\)). For initial comprehension duration, the results of the first modeling task are replicated: C2 is significantly different from C1 and C3. The difference observed for the mean initial comprehension duration between C1 and C3 is not statistically significant. In terms of average iteration duration and the number of moves per node, the differences were not statistically significant. For avg. reconciliation phase size, max. reconciliation phase size, and the number of reconciliation phases, the results of the Pre-Flight task are replicated.
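The pairwise comparisons reported above could be reproduced along the following lines, assuming the per-modeler values of a measure (e.g., adding rate) are available as one list per cluster; the Bonferroni correction divides the significance level by the number of pairwise comparisons, here \(0.05/3 \approx 0.017\). This is a minimal sketch, not the exact analysis script of the study.

```python
from itertools import combinations
from scipy import stats


def pairwise_comparisons(samples: dict, alpha: float = 0.05) -> None:
    """Pairwise t tests and Mann-Whitney U tests with a Bonferroni-corrected alpha."""
    pairs = list(combinations(samples, 2))
    corrected_alpha = alpha / len(pairs)  # 0.05 / 3 ~ 0.017 for three clusters
    for a, b in pairs:
        _, p_t = stats.ttest_ind(samples[a], samples[b])
        _, p_u = stats.mannwhitneyu(samples[a], samples[b], alternative="two-sided")
        print(f"{a} vs {b}: t test p={p_t:.3f}, Mann-Whitney p={p_u:.3f}, "
              f"t test significant at {corrected_alpha:.3f}: {p_t < corrected_alpha}")


# Hypothetical usage, with per-cluster lists of adding rates:
# pairwise_comparisons({"C1": c1_rates, "C2": c2_rates, "C3": c3_rates})
```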

Table 13 Significant differences for additional measures NFL Draft

8.1.3 Interpretation

The statistical analysis for both tasks revealed several differences that complement the picture of the cluster characteristics.

We observe significant differences in terms of adding rate, meaning that adding elements is done differently not only in absolute terms (number of iterations), but also relative to time. Modelers in cluster C2 seem to be faster in adding elements since they added more content in shorter modeling phases. Also, they started adding content sooner, since their initial comprehension phases were significantly shorter compared to C1 and C3. Apparently, modelers in C2 were quick to plan how to create the process model and to use the modeling tool to convert the informal description into the formal model. No difference in initial comprehension duration and adding rate could be identified between C1 and C3.

Further, when investigating the reconciliation measures, differences in max. reconciliation phase size and the number of reconciliation phases could be identified for both modeling tasks, pointing toward more reconciliation in C1. The measures presented in Sect. 7 provide us with additional insights into reconciliation differences that go beyond reconciliation breaks (cf. Sect. 3.4). Interestingly, the high number of reconciliation operations cannot be traced back to the average size of reconciliation phases, since no significant differences could be identified there. Rather, modelers in C1 had at least one significantly larger reconciliation phase compared to C3. This indicates phases of extensive layouting in the modeling process, which might have been caused by difficulties when creating the process model. The high number of reconciliation operations in C1 thus seems to be caused by a combination of longer PPM instances and phases of extensive layouting.

8.2 Factors influencing the modeling style

To understand which factors influence the modeling style and to establish to what extent certain factors are task-specific or modeler-specific, we first investigate the movement of modelers between different clusters across both modeling tasks. Second, we look at correlations of measures between the two modeling tasks to identify measures that are rather modeler-specific.

8.2.1 Cluster movement

When clustering the Pre-Flight process and the NFL Draft process, we obtained clusters with similar properties. Therefore, the question arises whether modelers in a specific cluster for the Pre-Flight process can be found in the corresponding cluster for the NFL Draft process. If all modelers are assigned to the same cluster for both modeling tasks, we can conclude that the modeler’s style is entirely dependent on the modeler’s personal preferences without any influence of the modeling task at hand.

Table 14 illustrates the number of modelers who stayed in the same cluster; e.g., 50.00 % of the modelers who were in C2 for the Pre-Flight process were also in C2 for the NFL Draft process. Overall, 42.57 % of the modelers remained in the same cluster. To test whether the cluster moves reflect a random assignment or whether they are influenced by modeler-specific factors, we compute the expected number of moves under a null hypothesis of random cluster assignment and use the chi-square test for goodness of fit, rejecting the null hypothesis (\(p=0.009\)). This points toward a combination of modeler- and task-specific factors influencing the modeler's style. For instance, the modeling style might be influenced by modelers experiencing difficulties during the first modeling task. In the second task, a modeler might not face the same difficulties, resulting in a different modeling style and, thus, a different cluster assignment.
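The goodness-of-fit test could be carried out as sketched below, assuming the observed counts of modelers per movement cell and the expected counts under random cluster assignment are given (how the expected counts are derived from the cluster sizes is omitted here).

```python
import numpy as np
from scipy.stats import chisquare


def movement_goodness_of_fit(observed, expected):
    """Chi-square goodness-of-fit test of observed vs. expected cluster moves."""
    observed = np.asarray(observed, dtype=float)
    expected = np.asarray(expected, dtype=float)
    # scipy requires observed and expected frequencies to sum to the same total
    expected = expected * observed.sum() / expected.sum()
    return chisquare(f_obs=observed, f_exp=expected)
```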

Table 14 Modelers in the same cluster

Figure 7 illustrates the movement of modelers among the clusters. Modelers tended to move toward C2 for the second modeling task, which gained 19 additional modelers and lost only 10. In contrast, C1 lost 25 modelers and gained only 18. For C3, the numbers of gained and lost modelers are similar, i.e., 21 gained and 23 lost. This could indicate that fewer modelers had problems with the second modeling task, which would be consistent with our finding that no significant differences among the clusters could be identified for the share of comprehension and delete iterations.

Fig. 7 Cluster movement

Going back to the measures defined in Sects. 3 and 7, we further investigate the cluster movements. The individual groups for cluster movement are relatively small, e.g., only four modelers moved from C2 in the Pre-Flight task to C1 in the NFL Draft task, making a detailed analysis difficult. Hence, for analyzing cluster movement, we aggregate modelers into the groups described in the sequel.

Our analysis indicated the following characteristics for the clusters.

  • Cluster C1: more reconciliation/slower modeling

  • Cluster C2: less reconciliation/faster modeling

  • Cluster C3: less reconciliation/slower modeling

Since the largest differences in terms of our measures could be observed between C1 and C2, we assume them to be located toward the ends of a spectrum of modeling styles, while C3 can be placed in between. Based on this assumption, we perform the following aggregation of cluster movements.

  • Toward less reconciliation/faster modeling. Modelers changing their modeling style toward faster modeling, i.e., C1 to C2, C1 to C3, and C3 to C2, were considered in this group. This group contains modelers who spent less time on reconciliation and might have experienced fewer difficulties in the second modeling task.

  • Toward more reconciliation/slower modeling. This group contains modelers who slowed down their modeling endeavor during the second modeling task, i.e., C2 to C1, C2 to C3, and C3 to C1. Modelers in this group spent more time on reconciliation. Some of them might have experienced more difficulties in the second task.

  • Same. This group contains modelers who were in the same cluster for both tasks.
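A minimal sketch of this aggregation is given below; it assumes cluster labels per modeler and task and encodes the spectrum C2 < C3 < C1 described above (identifier names are ours, chosen for illustration).

```python
SPECTRUM = {"C2": 0, "C3": 1, "C1": 2}  # faster/less reconciliation ... slower/more


def movement_group(pre_flight_cluster: str, nfl_draft_cluster: str) -> str:
    """Map a modeler's pair of cluster assignments to one of the three groups."""
    before, after = SPECTRUM[pre_flight_cluster], SPECTRUM[nfl_draft_cluster]
    if after < before:
        return "toward less reconciliation/faster modeling"
    if after > before:
        return "toward more reconciliation/slower modeling"
    return "same"
```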

For each modeler, we calculated the difference between the Pre-Flight task and the NFL Draft task for each measure. Table 15 displays the results. Negative values indicate that the value of this measure decreased compared to the first modeling task. For example, we established significant differences for the number of PPM iterations among all three groups in Sect. 6, with C2 posting the lowest values and C1 the highest, creating a spectrum of modeling styles in terms of PPM iterations. The aggregated cluster movement supports this impression, since modelers who moved toward less reconciliation/faster modeling showed an average decrease of 10.06 PPM iterations. In contrast, modelers moving toward more reconciliation/slower modeling had only a mild average decrease of 0.5 PPM iterations (the NFL Draft modeling task was considerably smaller, making a decrease in the number of PPM iterations likely). The measures in Table 15 draw a consistent picture of cluster movement. Modelers who moved toward less reconciliation/faster modeling needed fewer adding operations in a smaller number of PPM iterations to create the process model in larger chunks. For modelers who moved toward more reconciliation/slower modeling, the number of reconciliation operations was even higher than in the first modeling task, even though the second task was smaller. Mental effort indicates that modelers moving toward less reconciliation/faster modeling perceived the second task to be easier than the first one. Modelers moving toward more reconciliation/slower modeling perceived the second task to be equally difficult as the first one, even though we observe a significant difference between both tasks for the whole population (cf. Sect. 5).

Table 15 Measures for cluster movement
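The per-group differences summarized in Table 15 could be derived as in the following sketch, assuming two hypothetical pandas DataFrames with one row per modeler and one column per measure, plus a Series holding each modeler's movement group (e.g., produced by the grouping function above).

```python
import pandas as pd


def movement_group_differences(pre_flight: pd.DataFrame, nfl_draft: pd.DataFrame,
                               groups: pd.Series) -> pd.DataFrame:
    """Mean per-measure difference (NFL Draft minus Pre-Flight) per movement group."""
    differences = nfl_draft.sub(pre_flight)  # aligns rows (modelers) and columns (measures)
    return differences.groupby(groups).mean()
```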

In summary, we observed a considerable number of modelers moving to different clusters when comparing the two modeling tasks. The initial set of measures (cf. Sect. 3) and the measures developed based on our observations (cf. Sect. 7) support placing the identified clusters on a spectrum of modeling styles. C1 represents more reconciliation and slower modeling, while C2 represents faster modeling and less reconciliation. Modelers in C3 seem to work slower with fewer reconciliation operations, representing a mixture of the characteristics of C1 and C2. The observed cluster movement points to the presence of task-specific factors influencing the modeling style; if the modeling style could be entirely attributed to the modeler's preferences, no cluster movement would be present. However, a considerable number of modelers remained in the same cluster for both modeling tasks, pointing to task-independent factors.

8.2.2 Correlations

To understand modeler-specific factors influencing the modeler's style, we introduce the notion of stability of measures across the two tasks. If a specific measure shows a high stability over the two modeling tasks, this indicates only a limited influence of the modeling task and, therefore, a modeler-specific factor influencing the modeling style. For assessing the stability of the measures defined previously, we use correlational analysis. More specifically, we correlate each measure of the Pre-Flight task with the corresponding measure of the NFL Draft task. The results are shown in Table 16. It is interesting to note that several variables are highly correlated between both tasks. The number of reconciliation operations, adding rate, and average number of moves per node are strongly and significantly correlated. The same holds for the initial comprehension duration. Significantly, but less strongly, correlated are the number of adding operations, average iteration chunk size, number of reconciliation phases, and average iteration duration. All these variables can be considered stable across the two modeling tasks.
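The stability analysis could be implemented as sketched below, again assuming two hypothetical DataFrames of per-modeler measures, one per task, indexed by modeler.

```python
import pandas as pd
from scipy.stats import pearsonr


def measure_stability(pre_flight: pd.DataFrame, nfl_draft: pd.DataFrame) -> pd.DataFrame:
    """Pearson correlation of each measure between the two modeling tasks."""
    rows = []
    for measure in pre_flight.columns.intersection(nfl_draft.columns):
        paired = pd.concat(
            [pre_flight[measure].rename("task1"), nfl_draft[measure].rename("task2")],
            axis=1, join="inner",
        ).dropna()  # keep only modelers with values for both tasks
        r, p = pearsonr(paired["task1"], paired["task2"])
        rows.append({"measure": measure, "r": r, "p": p})
    return pd.DataFrame(rows)
```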

Table 16 Pearson correlations for measures between task one and task two

Beyond that, we were interested in how the measures relate to the mental effort perceived by the modelers. Correlating mental effort with the measures reveals that a significant correlation exists only for the average number of delete iterations. Note that the correlation was almost equally strong in both tasks: 0.235 (0.012) for the first and 0.209 (0.025) for the second. This observation suggests that modelers perceive a modeling task as more difficult when the complexity of the task forces them to conduct delete operations.

In sum, we identified a considerable amount of movement among the clusters over the two modeling tasks, indicating that several characteristics of modeling style are indeed influenced by the modeling task. However, several measures showed strong and highly significant correlations between the two modeling tasks, pointing toward factors related to the individual modeler rather than to the modeling task.

9 Discussion and model building

Based on the presented analysis, Sect. 9.1 presents a first attempt to define a model that describes modeling styles and the factors that affect them. Then, we reflect on limitations of our study in Sect. 9.2.

9.1 Building a model

As discussed in Sect. 4, the PPM is influenced by task-specific characteristics and modeler-specific characteristics. The design of our exploratory study kept task-extraneous factors constant and focused on the modeler characteristics, i.e., cognitive properties and preferences, and on the task-intrinsic characteristics. We aimed at answering the following questions:

  1. What aspects of the PPM constitute distinct modeling styles?

  2. What aspects of modeling styles are affected by which factors and how?

Considering the first question, the cluster analysis revealed three modeling styles for both tasks, which can be distinguished by three main aspects of the modeling process. First, the layout behavior: an emphasis on layout was pursued by modelers in C1 and resulted in considerably slower PPM instances, whereas no such emphasis was observed for modelers in C2 and C3. Second, the extent to which the adding of content was streamlined and undisturbed, which we refer to as the efficiency of the modeling process. Modelers in C2 efficiently utilized their cognitive resources in large iteration chunks, resulting in a focused and fast PPM. Finally, the PPM clusters were also distinguished by evidence of difficulties encountered while modeling. These were mainly reflected when a modeler removed model parts (delete operations) and remodeled them (additional adding operations). Even though we observed delete operations in all clusters, C1 had a significantly higher number of delete operations compared to C2 and C3, indicating that modelers in C1 experienced more difficulties. Issues while modeling also entail spending more time on comprehension (larger share of comprehension). Following this analysis, we have grouped the measures used in the data analysis into three aspects of modeling style:

Layout/Tool Behavior: operationalized by the measures number of reconciliation operations, number of reconciliation phases, avg. number of moves per node, and max. reconciliation phase size.

Efficiency: the associated measures include avg. number of PPM iterations, iteration chunk size, share of comprehension, avg. iteration duration, adding rate, initial comprehension duration, number of reconciliation phases, max. reconciliation phase size, and reconciliation breaks.

Troubles: reflected by the number of delete operations, number of adding operations, share of comprehension, and delete iterations.

Considering the second question, it seems reasonable to expect that some modeler-specific factors consistently affect the modeling style, regardless of the task at hand, while others affect the modeling style in interaction with the task characteristics. A first look into this question is based on the cluster movement analysis. It was established that, while modelers did not move arbitrarily between clusters, considerable movement took place, implying that the modeling style of a modeler is not fully consistent across different tasks. The cluster movement analysis indicated some relation between the modeling style and the perceived mental effort: more modelers were in C2 for the task whose mental effort was lower than for the task with the higher mental effort. The cluster movement entailed consistent changes in measures of efficiency and troubles, as well as in layout behavior.

A better understanding of the consistency of specific aspects of the modeling style and the factors that might affect it is gained through the correlation analysis. It was established that several of the measures attributed to the reconciliation behavior show highly significant correlations across the two tasks. The only exception is max. reconciliation phase size, whose correlation is not significant; this measure is also part of the efficiency group. This might imply that reconciliation behavior is typical for an individual modeler, directly affected by the reconciliation preferences and independent of the modeling task at hand. In contrast, the measures related to the efficiency aspect of the modeling style exhibit different levels of correlation, if any (e.g., adding rate was highly correlated, iteration chunk size was correlated to a medium extent, and share of comprehension was not correlated at all).

This partial consistency, along with the findings from the cluster movement analysis, suggests that efficiency is affected by both the properties of the modeler and the properties of the task. The interaction of the task and the modeler’s properties can be considered as the cognitive load imposed on the modeler by the specific task. It can be operationalized by the mental effort measure, which can explain some of the cluster movement findings. Cognitive load should also affect the trouble aspect of the modeling style. For the measures that reflect trouble, we did not find a significant correlation between the tasks. This seems reasonable, since modeling troubles are usually not consistently encountered. Furthermore, a significant correlation between number of delete operations (indicating troubles) and mental effort was found for both tasks, indicating that mental effort was perceived to be higher when troubles were encountered.

Summarizing this discussion, the model that emerges from our findings is depicted in Fig. 8. The model includes the three aspects of modeling style with their associated measures. The cognitive characteristics of the modeler, the intrinsic task characteristics, and the extraneous task characteristics affect the cognitive load (operationalized by the mental effort measure), which in turn affects the efficiency and trouble aspects of the modeling style, i.e., if the cognitive load exceeds the modeler's working memory capacity, errors are likely to occur [45]. In contrast, the modeler's interface preferences directly affect both the layout/tool behavior and the efficiency aspect.

Fig. 8 A model of factors influencing the modeling style

Several notes should be made about the proposed model. First, we designed the exploratory study to keep extraneous task characteristics constant. Hence, the effect of this factor on the cognitive load is merely an assumption that seems reasonable considering insights from cognitive load theory [31], yet it is currently not supported by the findings in this paper. Second, the effect of the interface preferences on efficiency is implied by the cluster analysis and is quite obvious, since extensive reconciliation operations reduce the efficiency of modeling. Third, the model does not include a relationship between the interface preferences and the cognitive load. Our findings suggest direct relationships between the interface preferences and the layout behavior and efficiency aspects. Still, there might also be an indirect relationship through cognitive load: it is possible that pronounced interface preferences cause increased cognitive load and thus an additional effect on the efficiency and trouble aspects of the modeling style. However, establishing such an effect, as well as gaining a full understanding of the effects of the modeler's interface preferences, requires additional research efforts.

Finally, emerging from exploratory findings, the proposed model cannot be considered a fully established theory. Rather, it serves as a research agenda and a platform for the derivation of hypotheses for further studies. Such studies can address factors that were kept constant in our study, such as modeling notation and tool, modeling expertise and domain knowledge of the modeler, and task description. Besides, future research should explore the individual parts of the model. For instance, troubles were only touched upon in this paper by considering the number of delete operations in a PPM instance. Research on problems arising during the creation of process models can be combined with cognitive load theory for the development of teaching materials. For this purpose, an in-depth understanding not only of the errors occurring during the PPM, but also of the intrinsic and extraneous task characteristics and the modeler's cognitive characteristics is in demand. For example, we might be able to establish the perceived difficulty of the various modeling constructs in order to focus on the most challenging parts when instructing our students in the craft of modeling. Future studies may also address possible correlations among the modeling style aspects and specific measures. The cluster analysis suggests, for instance, that there could be correlations among metrics such as iteration chunk size, share of comprehension, and number of moves per node; these correlations are not readily explained. Future studies can establish what connections exist among different properties of the modeling process and offer theoretical explanations for them.

9.2 Limitations

The interpretation of our findings is presented with the explicit acknowledgment of a number of limitations to our study. First of all, our respondents represented a rather homogeneous and inexperienced group. Although relative differences in experience were notable, the group is not representative of the modeling community at large. In particular, the question can be raised whether experienced modelers exhibit the same modeling styles as skillful yet inexperienced modelers. In other words, will experienced modelers display similar characteristics of modeling style, or can other styles be observed in their approaches? Therefore, we explicitly included the three factors of modeling expertise, domain knowledge, and tool knowledge in the model explaining the differences in modeling styles. The actual influence of these factors on the observed modeling style was beyond the scope of this work and has to be determined in future work. Note that we are mildly optimistic about the usefulness of the presented modeling styles, which are based on the modeling behavior of graduate students, since we have established in previous work that such subjects perform as well in process modeling tasks as some professional modelers [38].

Second, we cannot rule out that K-Means converged to a local minimum, resulting in a suboptimal clustering. To counter this threat, we validated the clustering using a series of measures quantifying the PPM and identified significant differences among the three groups.

Third, our approach of using cluster analysis for identifying distinct modeling styles is based on the assumption that there is one modeling style per PPM instance. Since it seems reasonable to assume that modelers may change their modeling style, e.g., when facing difficulties, this is a considerable limitation of our work. Still, the presented approach allowed us to gain initial insights into different modeling styles and can be extended toward including changes in modeling style during the PPM in future work.

10 Summary

This paper contributes to our understanding of how process models are created, as it constitutes the first systematic attempt to identify different modeling styles in the domain of business process modeling. We recorded and analyzed PPM instances of 115 students of courses on business process management in two modeling tasks. Using data mining techniques, we were able to identify three distinct modeling styles that occurred independently of the concrete modeling task. Each modeling style has specific characteristics that can be measured in terms of how the modeler acts on the modeling canvas.

Within the bounds of this exploratory study, we were able to observe three different modeling styles. We could distinguish (1) an “efficient modeling style” characterized by a limited time needed to think about the modeling task and a fast rate of adding elements to the model; (2) a “layout-driven modeling style”, which involves investing much time in creating a comprehensible layout while being less efficient in creating the model; and (3) an “intermediate modeling style” that is neither particularly efficient nor invests particularly in model layout. In addition, we found the choice of a particular modeling style to be subject to various factors. We observed that, regardless of the modeling style, a modeler may face problems during modeling and may have to correct parts of the model. However, modelers following a “layout-driven modeling style” invested more work into correcting the model than modelers following other styles.

We observed modelers sticking to the same modeling style in both tasks, and we saw modelers following different modeling styles in different tasks. Thus, we contend that a particular modeling style depends on both modeler- and task-specific characteristics. We identified that (i) the time needed to think about the modeling task and (ii) the rate of adding elements to the model are more strongly related to the modeler than to the task. Also, the amount of layouting invested during modeling is more related to the modeler than to the task.

These modeler-specific characteristics meet task-specific characteristics, which together determine the modeling style followed. Here, we found that the amount of layouting invested during modeling is independent of the perceived complexity of the task. This suggests that a modeler who prefers a good model layout will invest in this aspect even if the modeling task is difficult. We found that a modeler's perception of a task as hard was correlated with the probability of facing trouble during modeling and having to rework parts of the model. In contrast, the efficiency with which a model is created was largely independent of the perceived complexity of the task. All these insights are backed up by a number of concrete measures on the PPM, as formulated in a first model of the factors influencing process modeling. This model serves as a basis for deriving hypotheses that can be investigated in future studies. Such studies might investigate factors influencing the modeling style that have not been addressed in this work, e.g., the modeler's expertise. Additionally, researchers can use the model for identifying research areas demanding an in-depth understanding, e.g., troubles during the PPM. Therefore, the proposed model provides an agenda for future research on the PPM.

We believe these first insights into the PPM will be beneficial for future process modeling environments and will support teachers in mentoring their students on their way to becoming proficient process modelers by allowing them to measure differences in modeling styles. Additionally, this paper presented a viable experimental design for further investigating the PPM, providing the ground for new investigations and for testing the hypotheses identified in this work.

The results of our study give rise to various directions for future work. In addition to testing our model in additional experiments, we aim at including changes in modeling style in our model, a more detailed investigation of the layouting behavior of modelers, a more fine-grained analysis of the influence of the concrete modeling task on the modeling style, and ultimately the influence of modeling style on the modeling outcome.