A Method Based on a New Word Embedding Approach for Process Model Matching

A Method Based on a New Word Embedding Approach for Process Model Matching

Mostefai Abdelkader, Mekour Mansour
DOI: 10.4018/IJAIML.2021010101
Article PDF Download
Open access articles are freely available for download

Abstract

This paper proposes a method based on a new word embedding approach for matching business process model. The proposed method aligns two process models in four steps. First activity labels are extracted and pre-processed to remove meaningless words, then each word composing an activity label and using a semantic similarity metric based on WordNet is represented with an n-dimensional vector in the space of the vocabulary of the two labels to be compared. Based on these representations, a vector representation of each activity label is computed by averaging the vectors representing words found in the activity label. Finally, the two activity labels are reported as similar if their similarity score computed using the cosine metric is greater than some predefined threshold. An experiment was conducted on well-known dataset to assess the performance of the proposed method. The results showed that the proposed method shared the first place with RMM/NHCM and OPBOT tools and can be effective in matching process models.
Article Preview
Top

Introduction

Process model matching(i.e., PMM)is an activity widely conducted in organizations to align process models. This alignment is critical for many business process (i.e., BP) management tasks such as storing, merging, clustering or querying Business Process models (i.e., BPs) (La Rosa et al. 2013; Goncalo et al. 2015; Weidlich et al. 2012).

Technically an alignment is a set of correspondences between activities of two process models. Each correspondence is a pair of two semantically similar sets of activities. The first set of a pair contains activities from the first BP and the second set of this pair contains activities from the second BP. Formally, an alignment is a set of not overlapped matches pairs (i.e., correspondences) {(A11, A21),(A1.2,A22),...,(A1n,A2n)}. Each pair defines a match (i.e., correspondence) between a set of activities, A1i, from BP1 and a set of activities, A2i, from BP2. Two pairs (A1i, A2i) and (A1j, A2j) does not overlap iffA1iIJAIML.2021010101.m01A1j=IJAIML.2021010101.m02 and A2iIJAIML.2021010101.m03A2j=IJAIML.2021010101.m04. A correspondence (A1i, A2i) between a set of activities A1i from one BP and a set of activities A2i means that the activities A1i and A2i refer to the same activity in the domain. Figure 1 presents of an example of an alignment between two process models.

Figure 1.

An example of an alignment

IJAIML.2021010101.f01

Correspondences between the activities are presented using the grey shades. Examples of correspondence is between the set {prepare answer, applicant registration, send notification} in the process A and the set {reject students, accept student} in the business process B.

The objective of any process model matching technique is to find automatically such correspondences.

Many approaches have been proposed to achieve this objective(Goncalo et al. 2015; Daniel et al. 2017, Xue, 2019; Khurram et al. 2019). The proposed approaches are widely based on a combination of lexical, syntactic and semantic similarity metrics proposed in different fields such as the NLP (i.e., Natural language Processing) community (Goncalo et al. 2015; Daniel et al. 2017).

Complete Article List

Search this Journal:
Reset
Volume 13: 1 Issue (2024)
Volume 12: 2 Issues (2022)
Volume 11: 2 Issues (2021)
Volume 10: 2 Issues (2020)
Volume 9: 2 Issues (2019)
View Complete Journal Contents Listing