Batch processing in workflow schedules activity instances in multiple workflow cases of the same workflow type to run as a group. It can optimize business processes execution dynamically. To achieve this goal, it is necessary to define a dataflow operation language to group and ungroup the data in multiple cases of a workflow. Though our previous work has preliminarily investigated the model and its implementation, there is still lack of a formally defined model. In this paper, we first propose a method that is based on a partial vector space to model the dataflow in multiple workflow cases. Based on this model, the data operation primitives for batch processing are specified and defined formally. Since most WfMSs (Workflow Management Systems) use RDBMS (relational database management system) to store their data currently, an SQL (Structured Query Language)-like implementation language, namely DBOL (Data Batch Operation Language), is proposed. Evaluation experiments have also been done to show its performance.
- batch processing
- partial vector space model