ORIGINAL ARTICLE

Tool to Optimize the Production Process of the Pre-Cooked Whole Lobster

MSc. Yaimi Barcenas Mompeller^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.*✉:yaimi@unah.edu.cu

MSc. Adanay Núñez González^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

Dr.C. Alexander Sánchez Díaz^IIUnión de Informáticos de Cuba, San José de las Lajas, Mayabeque, Cuba

Dr.C. Yusney Marrero García^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

^IIUnión de Informáticos de Cuba, San José de las Lajas, Mayabeque, Cuba

^*Author for correspondence: Yaimi Barcenas Mompeller, e-mail: yaimi@unah.edu.cu

ABSTRACT

Today, in the competitive business world, organizations and companies need to manipulate their business processes. The real key to success in these organizations lies in the design, the proper management of the business processes and the alignment of IT (Information Technology) with the objectives of the organization. The research carried out has the objective of developing a computer tool that allows optimizing the process of the production of the pre-cooked whole lobster belonging to Batabanó Company in the province of Mayabeque. To achieve this objective, the NSGAII and SPEA2 multiobjective optimization algorithms are used with the JMetal framework. For modeling of the process in the BPMN Notation, the Bonita Studio and Yasper tools are used to convert the process into a Petri network, exporting it in a PNML file, which is used by the tool to carry out its optimization. As a result, a computing application is obtained that allows business analysts of the company to obtain possible process improvements in terms of cost and completeness criteria, thus facilitating its better performance in the company.

Keywords:

Business Process; Multiobjective Optimization; Petri Net.

INTRODUCTION

In the modern competitive business world, many companies need to modify their business process designs to become more competitive in the enterprise market. They focus their main attention on the optimization and continuous improvement of their processes.

Business Process Management (BPM) and process mining allow companies to analyze and identify possible process improvements through the use of a set of techniques and tools. This research focuses on the last stage of the two disciplines related to the optimization and continuous improvement of business processes, since they offer new alternatives for the optimal performance and control of the processes that are presented daily in companies.

Optimization is the process of finding the best possible solution to a given problem. It can be seen as the search for the values of decision variables for which a certain objective function $(fo)$ achieves its extreme value (Chong and Zak, 2004CHONG, K.; ZAK, H.S.: An introduction to optimization, Ed. John Wiley & Sons, New York, USA, 2004, ISBN: 0-471-65400-0., 2013CHONG, E.; ZAK, S.: “An introduction to optimization”, En: An introduction to optimization 4th ed., Ed. John Wiley & Sons, second ed., New York, USA, p. 15, 2013.). Multiobjective optimization applied to business processes can be a good option to improve them since more than one optimization criterion can be selected and satisfied simultaneously.

The Industrial Fishing Company "Camilo Cienfuegos" located in Batabanó Municipality, Mayabeque Province has as objective the capture, industrialization and commercialization of fresh or frozen species of the platform, being the lobster its main exportable line. The constant increase of information to be analyzed in this company brings with it the need to employ analytical techniques for the extraction of knowledge in large volumes of data, necessary to diagnose problems and identify possible areas of improvement in the process. Its use would allow the implementation of a strategy to optimize the production processes of lobster as the key to reduce their production costs and expand in their environment.

This supports the interest of this research in the need that currently exists in the company to have computer tools that allow them to optimize their production processes of lobster, taking into account several objectives such as production cost and completeness of the process.

METHODS

Business Process Management and Process Mining

Business Process Management enables companies to manage their business processes more efficiently by using methods, techniques and tools created to support the design, improvement, management and analysis of these processes (Van der Aalst, 2011VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162., 2013VAN DER AALST, W.: Using Process Mining to Bridge the Gap between BI and BPM, Inst. IEEE Computer Society, 2013.; Van Der Aalst et al., 2011VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162.). Process mining is a discipline that aims to discover, monitor and improve processes through the extraction of knowledge from the event records available in current information systems (Van der Aalst, 2011VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162.; Van Der Aalst et al., 2011VAN DER AALST, W.; ADRIANSYAH, A.; DE MEDEIROS, A.A.K.; ARCIERI, F.; BAIER, T.; BLICKLE, T.; BOSE, J.C.; AGUIAR, P.K.; ABRAHÃO, R.F.; BUIJS, J.: “Process mining manifesto”, En: International Conference on Business Process Management, Ed. Springer, Germany, pp. 169-194, 2011.; Van der Aalst et al., 2011VAN DER AALST, W.; ADRIANSYAH, A.; MEDEIROS, A.F.: Process Mining Manifiesto, Ed. Springer-Verlag, Business Process Management Workshops ed., vol. 99, Springer-Verlag, Germany, 5-20 p., 2011.)..

To support these disciplines, different standards have been created that include symbolic notations for the definition of business processes, such as the Business Process Management Notation (BPMN) by Ter Hofstede and Weske (2003)TER HOFSTEDE, A.H.M.; WESKE, M.: “Business process management: A survey”, [en línea], En: Proceedings of the 1st International Conference on Business Process Management , volume 2678 of LNCS, Ed. Citeseer, 2003, Disponible en:http://www.omg.org/spec/BPMN/2.0/PDF/ , [Consulta: 24 de abril de 2018]. and the Petri dish networks Murata (1989)MURATA, T.: “Petri nets: Properties, analysis and applications”, Proceedings of the IEEE, 77(4): 541-580, 1989, ISSN: 0018-9219.. In this research, BPMN is used to model the production process of the pre-cooked whole lobster using the Bonita Studio tool and then, the BPMN process is converted to the mathematical model Petri network, to which the multiobjective optimization will be applied to improve the process taking into account two objectives at the same time.

Multiobjective Optimization

Multitarget optimization (MOP) problems are separated from conventional single-target optimization, as the former usually does not deliver a single solution. Instead, MOP generates a set of possible solutions from which decision makers must select which to adopt, based on an evaluation of the performance of the solution across all objectives (Miettinen, 2008MIETTINEN, K.: “Introduction to multiobjective optimization: Noninteractive approaches”, En: Multiobjective optimization, Ed. Springer, Springer Berlin Heidelberg ed., vol. 52, Berlin, Germany, pp. 1-26, 2008.).

In a multiobjective problem, it is possible to choose as many objectives as the business analyst (user) wishes, but in this work, it is necessary to limit the problem to be studied to the optimization of $k = 2$ objectives: production cost and completeness of the process to be optimized. The problem is formulated as follows:

Optimize (Maximize/Minimize)

y = f_{(x)} = {(f}_{1} (x), f_{2} (x))

(1)

Subject to e_{1} (x) > 0

Where:

f_{1} (x) :

is the function objective of production cost of the process;

f_{2} (x) :

is the function objective of completeness of the process.

The function objective of production cost of the process is the sum of the cost of each transition, i.e., the activity in the process; triggered during the parsing of the traces. Only if a transition triggered by this way does not have the necessary tokens for its triggering then its cost will not be added to the total cost. This ensures that the poorer a solution (as calculated by the completeness) for event registering, the lower its cost may be.

The objective completeness function of the process is represented by its model, so a model will be more complete to the extent that it is capable of processing a larger number of traces in the event log. One of the basic ways to obtain completeness would be to divide the number of traces executed correctly by the total number of traces in the event log. The completeness formula proposed by De Medeiros et al. (2007)DE MEDEIROS, A.A.K.; WEIJTERS, J.M.M.A.; VAN DER AALST, M.P.W.: “Genetic process mining: an experimental evaluation”, Data Mining and Knowledge Discovery, 14(2): 245-304, 2007, ISSN: 1384-5810. was taken into account for the development of the computer application of this research, which is described below:

{P F}_{C o m p} = \frac{a c t . E j e c t (A P, C M) - p e n a l i d a d}{t o t a l E v t (Q)}

p e n a l i d a d = \frac{t o t M a r c A u s e n t (A P, C M)}{t o t a l T r a z a s (A P) - t r a z a s M a r c A u s e n t (A P, C M) + 1} + \frac{t o t M a r c A b a n d (A P, C M)}{t o t a l T r a z a s (A P) - t r a z a s M a r c A b a n d (A P, C M) + 1}

Where:

AP :

traces,

CM :

causal matrix (coding selected to represent each individual or solution),

act . Eject (AP, CM) :

correctly executed activities,

total Evt (Q) :

total of events in the event log (XES of the process),

totMarcAusent (AP, CM)

total of tokens needed to trigger an activity (transition in the Petri network) but that were not in the entry square of that activity,

totalTrazas (AP)

total of traces in the event log,

trazasMarcAusent (AP, CM)

number of traces where tokens were lost,

totMarcAband (AP, CM) :

number of tokens left in the network when the execution of each trace was finished,

totalTrazas (AP)

total of traces in the event log,

trazasMarcAband (AP, CM)

number of traces where tokens were left during its execution.

Multi-Target Algorithms in Multi-Target Optimization

Evolutionary Algorithms refer to search and optimization techniques inspired by the model of evolution proposed by Darwin (1859)DARWIN, C.R.: On the Origin of Species by Means of Natural Selection Or the Preservation of Favoured Races in the Struggle for Life., Ed. H. Milford; Oxford University Press, New York, USA, 1859.. According to Back (1996)BACK, T.: Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms, Ed. Oxford university press, New York, USA, 1996, ISBN: 0-19-535670-5., they are methods of optimization and search for solutions based on the postulates of biological evolution. In them, a group called population is maintained, whose elements represent possible solutions, which are mixed, and compete with each other, in such a way that the most suitable ones are able to prevail over time, evolving towards better solutions. The population, in the context of evolutionary computing in a general way, refers to a set of possible solutions (feasible solutions) of the problem to be solved.

There are several types of evolutionary algorithms, among them, the most prominent are genetic algorithms and these have proven to be general, robust and powerful tools. In this research, genetic algorithms are used for multi-target optimization because they are less susceptible to the shape and continuity of the Pareto frontier, require little information from the domain and are relatively easy to use and implement. The multi-target genetic algorithms considered for this research are the NSGAII proposed by Zitzler et al. (2001)ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001. and Deb et al. (2002)DEB, K.; PRATAP, A.; AGARWAL, S.; MEYARIVAN, T.: “A fast and elitist multiobjective genetic algorithm: NSGA-II”, IEEE transactions on evolutionary computation, 6(2): 182-197, 2002, ISSN: 1089-778X., since these are the most representative algorithms to solve multi-target optimization problems.

Non-Dominated Sorting Genetic Algorithm II (NSGAII)

The NSGAII uses a rapid procedure to organize the population by non-dominance, an approach to preserve elitism, and a non-nichemical operator to disperse the individuals on the Pareto border. In this algorithm, the descendant population $Q_{t}$ (size $N$ ) is created in first instance using the parent population $P_{t}$ (size $N$ ). After this, the two populations are combined to form $R_{t}$ of size $2 N$ . After that, by means of a non-dominated sorting, the $R_{t}$ population is classified in different Pareto fronts. Once the non-dominated sorting process has been completed, the new population is generated from the configurations of the non-dominated fronts. This new population starts to be built with the best non-dominated front $(F_{1})$ , continues with the solutions of the second front $(F_{2})$ , third $(F_{3})$ and so on. Since the $R_{t}$ population is $2 N$ in size, and there are only $N$ configurations that make up the descendant population, not all the configurations of the fronts belonging to the $R_{t}$ population will be able to be accommodated in the new population. Those fronts that cannot be accommodated disappear (Deb et al., 2002DEB, K.; PRATAP, A.; AGARWAL, S.; MEYARIVAN, T.: “A fast and elitist multiobjective genetic algorithm: NSGA-II”, IEEE transactions on evolutionary computation, 6(2): 182-197, 2002, ISSN: 1089-778X.).

Strength Pareto Evolutionary Algorithm 2 (SPEA2)

The SPEA2 of Zitzler et al. (2001)ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001., focuses on improving the skill allocation, parent selection, truncation operator and setting the external file size for all generations. In this, the skill allocation function is improved by taking into account for each individual, the number of individuals it dominates and the number of individuals that it is dominated by. This scheme also adds an estimate of population density. The $N_{E m a x}$ size of the external $P_{E}$ population (used for elitism) is fixed, unlike the SPEA, in which the $P_{E}$ size is variable but limited. $P_{E}$ is made up only of non-dominated individuals as long as the number of these is greater than or equal to $N_{E m a x}$ . In the case where the number of non-dominated individuals is less than $N_{E m a x}$ , dominated individuals are included within $P_{E}$ until the size of $P_{E}$ is equal to a $N_{E m a x}$ .

The clustering technique, which is responsible for maintaining the diversity of the population in SPEA, is replaced by a truncation method, which avoids eliminating the extreme solutions from the set of non-dominated solutions. The selection is made by means of a binary tournament, taking as a criterion of comparison the fitness of each one of the individuals. SPEA2 assumes fitness minimization; therefore, the individual with the lowest fitness value wins the tournament (Zitzler et al., 2001ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001.).

Technologies and Tools Used

The following technologies and tools were used to implement the computer application to optimize the process of Pre-Cooked Whole Lobster Production:

Java as the programming language and Netbeans as the development environment (Peñarrieta, 2017PEÑARRIETA, R.: Programacion Java y Netbeans, 2017.).

JMetal as a framework proposed by Nebro & Durillo (2014)NEBRO, A.; DURILLO, J.: jMetal 4.5 User Manual, jMetal, 2014., for the implementation of the NSGAII and SPEA2 algorithms.

Bonita Studio modeling tool available in Castillo (2011)CASTILLO, A.P.A.: BONITA SOFT: Gestor de procesos de negocios BPM, [en línea], Inst. Universidad Nacional de Colombia, 2011, Disponible en:http://www.bonitasoft.com , [Consulta: 6 de abril de 2018]., for the modeling of the process through BPMN notation. Yasper for the import of the process in BPMN format and convert it into a Petri network, exporting the process in the PNML file that is used to perform the optimization through the computer application.

RESULTS AND DISCUSSION

The computer application implemented consists of optimizing the production process of the pre-cooked whole lobster taking into account the production cost and completeness of it. To do this, it is necessary to have the process in a Petri Network and from its representation in the PNML file and its event log in its XES file, the process is loaded and the input parameters to be considered for each of the algorithms to be used to carry out the optimization are configured.

For the process under study, the following input parameters were taken into account for both algorithms: the size of the initial population (10), the number of evaluations (1000), the crossover factor (90%), the mutation factor (50%) and in the case of the SPEA2 algorithm a new parameter is required which is the size of the file (10). Figure 1 shows the application after loading the process and setting the input parameters for the NSGAII algorithm and Figure 2 for the SPEA2 algorithm.

FIGURE 1. Configuration of the input parameters for the NSGAII algorithm.

FIGURE 2. Configuration of the input parameters for the SPEA2 algorithm.

Once the input parameters have been configured, the Optimize option is performed and then Figure 3 shows the set of optimal solutions for the optimized process according to the selected algorithm. In this case, the possible solutions are shown using the NSGAII algorithm. From the set of optimal solutions proposed, the one that is desired to be visualized can be selected and the optimized process design is shown according to the selected solution and the legend of the activities eliminated for that design are shown to the user. Then, the business analyst (user) is in charge of choosing the best one to apply to the process performance.

FIGURE 3. Optimized process result

Experimental Results

For the evaluation of the algorithms, the same input parameters were used and 5 runs were made for each of them. To establish a preliminary comparison between the two, four metrics were taken into account: the generation of non-dominated vectors, the ratio of non-dominated vector generation, the actual generation of non-dominated vectors and the generation distance used in Duarte (2001)DUARTE, S.F.: optimización multiobjetivo, Asunción, Paraguay, 2001.. Applying these metrics to the results of the algorithms used in this work, the results shown in Tables 1 and 2 were obtained.

TABLE 1. Results of the metrics for the NSGAII algorithm

$N^{o}$	GVND	RGVND	GRVND	G	Time (ms)
1	10	0.4347	2	1028.686057553229	25767
2	10	0.4347	1	742.4495985176429	38939
3	10	0.4347	3	1330.966674735487	24997
4	10	0.4347	1	1056.8096230922495	17115
5	10	0.4347	1	877.1880986361423	25116
Average				1007.212	26386.8

TABLE 2. Results of the metrics for the SPEA2 algorithm

$N^{o}$	GVND	RGVND	GRVND	G	Time (ms)
1	10	0.4347	3	1385.388867565802	30389
2	10	0.4347	6	349.1493484256466	26650
3	10	0.4347	2	457.9861558632601	36227
4	10	0.4347	0	1493.777450719814	19238
5	10	0.4347	4	951.6957906146208	39574
Average				927.592	30415.6

Analyzing the data of Tables 1 and 2, it can be observed that, for both algorithms, the value of the non-dominated vector generation metric is 10 for each of the executions; this is due to the size of the initial population in both algorithms and, in the case of SPEA2 algorithm, by the size of the file used. In the non-dominated vector generation ratio metric, both algorithms present the same value (0.4347) because they depend on the previous metric. Both metrics do not show feasible results to compare both algorithms because there are no differences between them. However, with the real generation of non-dominated vectors metric, it is different. It is clear how the SPEA2 algorithm manages to find more solutions in the optimal Pareto front. And this statement is demonstrated by the generation distance $(G)$ between $Y_{k n o w n}$ and $Y_{t r u e}$ . For the SPEA2 algorithm executions, the generation distance is lower in average than for the NSGAII algorithm executions.

In this case, it is possible to conclude that the SPEA2 algorithm obtains better solutions than the NSGAII algorithm, although, the SPEA2 algorithm consumes, on average, more time to find the solutions than the NSGAII algorithm. However, it is left to the user's consideration to select the most optimal possible solutions offered since it is the user who knows which alternative or objective could improve the process of producing pre-cooked whole lobster at the company in Batabanó.

CONCLUSIONS

The development of this work allowed the implementation of a computer application through which the process of the Production of Pre-Cooked Whole Lobster was optimized, facilitating the business analysts to obtain models of the optimized process and, thus, achieving its better performance in the Industrial Fishing Company "Camilo Cienfuegos" of Batabanó. The NSGAII and SPEA2 algorithms used in the optimization of the process were evaluated taking into account the metrics of generation of non-dominant vectors, the ratio of generation of non-dominant vectors, the real generation of non-dominant vectors and the generation distance. These showed that the SPEA2 algorithm manages to find better solutions on the optimal Pareto front, but sacrificing a little more runtime compared to the NSGAII algorithm.

REFERENCES

BACK, T.: Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms, Ed. Oxford university press, New York, USA, 1996, ISBN: 0-19-535670-5.

CASTILLO, A.P.A.: BONITA SOFT: Gestor de procesos de negocios BPM, [en línea], Inst. Universidad Nacional de Colombia, 2011, Disponible en:http://www.bonitasoft.com , [Consulta: 6 de abril de 2018].

CHONG, E.; ZAK, S.: “An introduction to optimization”, En: An introduction to optimization 4th ed., Ed. John Wiley & Sons, second ed., New York, USA, p. 15, 2013.

CHONG, K.; ZAK, H.S.: An introduction to optimization, Ed. John Wiley & Sons, New York, USA, 2004, ISBN: 0-471-65400-0.

DARWIN, C.R.: On the Origin of Species by Means of Natural Selection Or the Preservation of Favoured Races in the Struggle for Life., Ed. H. Milford; Oxford University Press, New York, USA, 1859.

DE MEDEIROS, A.A.K.; WEIJTERS, J.M.M.A.; VAN DER AALST, M.P.W.: “Genetic process mining: an experimental evaluation”, Data Mining and Knowledge Discovery, 14(2): 245-304, 2007, ISSN: 1384-5810.

DEB, K.; PRATAP, A.; AGARWAL, S.; MEYARIVAN, T.: “A fast and elitist multiobjective genetic algorithm: NSGA-II”, IEEE transactions on evolutionary computation, 6(2): 182-197, 2002, ISSN: 1089-778X.

DUARTE, S.F.: optimización multiobjetivo, Asunción, Paraguay, 2001.

MIETTINEN, K.: “Introduction to multiobjective optimization: Noninteractive approaches”, En: Multiobjective optimization, Ed. Springer, Springer Berlin Heidelberg ed., vol. 52, Berlin, Germany, pp. 1-26, 2008.

MURATA, T.: “Petri nets: Properties, analysis and applications”, Proceedings of the IEEE, 77(4): 541-580, 1989, ISSN: 0018-9219.

NEBRO, A.; DURILLO, J.: jMetal 4.5 User Manual, jMetal, 2014.

PEÑARRIETA, R.: Programacion Java y Netbeans, 2017.

TER HOFSTEDE, A.H.M.; WESKE, M.: “Business process management: A survey”, [en línea], En: Proceedings of the 1st International Conference on Business Process Management , volume 2678 of LNCS, Ed. Citeseer, 2003, Disponible en:http://www.omg.org/spec/BPMN/2.0/PDF/ , [Consulta: 24 de abril de 2018].

VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162.

VAN DER AALST, W.: Using Process Mining to Bridge the Gap between BI and BPM, Inst. IEEE Computer Society, 2013.

VAN DER AALST, W.; ADRIANSYAH, A.; DE MEDEIROS, A.A.K.; ARCIERI, F.; BAIER, T.; BLICKLE, T.; BOSE, J.C.; AGUIAR, P.K.; ABRAHÃO, R.F.; BUIJS, J.: “Process mining manifesto”, En: International Conference on Business Process Management, Ed. Springer, Germany, pp. 169-194, 2011.

VAN DER AALST, W.; ADRIANSYAH, A.; MEDEIROS, A.F.: Process Mining Manifiesto, Ed. Springer-Verlag, Business Process Management Workshops ed., vol. 99, Springer-Verlag, Germany, 5-20 p., 2011.

ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001.

NOTES

The mention of trademarks of specific equipment, instruments or materials is for identification purposes, there being no promotional commitment in relation to them, neither by the authors nor by the publisher.

Received: 10/12/2019

Accepted: 25/09/2020

Yaimi Barcenas Mompeller, Profesor Asistente, Universidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba, CP:32700, e-mail: yaimi@unah.edu.cu

Adanay Núñez González, profesora, Universidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba, CP:32700, e-mail: adanay@unah.edu.cu

Alexander Sánchez Díaz, Profesor Titular, Unión de Informáticos de Cuba, San José de las Lajas, Mayabeque, Cuba, CP:32700, e-mail: sanchez@uic.cu

Yusney Marrero García, Profesor Titular, Universidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba, CP:32700, e-mail: yusneym@unah.edu.cu

The authors of this work declare no conflict of interests.

This is an open-access article distributed under the terms of the Creative Commons Attribution License

ARTÍCULO ORIGINAL

Herramienta para optimizar el proceso de producción de la langosta entera precocinada

MSc. Adanay Núñez González^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

Dr.C. Alexander Sánchez Díaz^IIUnión de Informáticos de Cuba, San José de las Lajas, Mayabeque, Cuba

Dr.C. Yusney Marrero García^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

^IUniversidad Agraria de La Habana, Facultad de Ciencias Técnicas, Departamento de Ingeniería Informática, San José de las Lajas, Mayabeque, Cuba.

^IIUnión de Informáticos de Cuba, San José de las Lajas, Mayabeque, Cuba

^*Author for correspondence: Yaimi Barcenas Mompeller, e-mail: yaimi@unah.edu.cu

RESUMEN

En el mundo empresarial competitivo de hoy, las organizaciones y empresas necesitan manipular sus procesos de negocio. La verdadera clave para tener éxito en estas organizaciones radica en el diseño, la gestión adecuada de sus procesos de negocio y la alineación Tecnologías de Información (TI) con los objetivos de la organización. La investigación realizada tiene como objetivo desarrollar una herramienta informática que permita optimizar el proceso de la Producción de la Langosta Entera Precocinada perteneciente a la empresa de Batabanó de la provincia de Mayabeque. Para alcanzar dicho objetivo se emplean los algoritmos de optimización multiobejtivos NSGAII y SPEA2 mediante el framework JMetal. Para la modelación del proceso en la notación BPMN se utiliza la herramienta Bonita Studio y Yasper para la conversión del proceso en una red de Petri, exportándolo así en un archivo PNML, el cual es utilizado por la herramienta para efectuar la optimización del mismo. Como resultado se obtiene una aplicación informática que le permite a los analistas del negocio de la empresa, la obtención de posibles mejoras del proceso en cuanto a los criterios de costo y completitud facilitándoles así lograr un mejor desempeño del mismo en la empresa.

Palabras clave:

procesos de negocio; optimización multiobjetivo; red de Petri.

INTRODUCCIÓN

En el mundo moderno competitivo de los negocios son muchas las empresas que necesitan modificar los diseños de sus procesos de negocio, para volverse más competitivas en el mercado empresarial. Estas enfocan su principal atención en la optimización y la mejora continua de sus procesos.

La Gestión de Procesos de Negocio (BPM: Business Process Management, por sus siglas en inglés), y la minería de procesos permiten analizar e identificar las posibles mejoras a los procesos en las empresas, por medio del uso de un grupo de técnicas y herramientas. La presente investigación se centra en la última etapa de las dos disciplinas referentes a la optimización y mejora continua de los procesos de negocio ya que estas ofrecen nuevas alternativas para el óptimo desempeño y control de los mismos que hoy se presentan a diario en las empresas.

La optimización es el proceso de búsqueda de la mejor solución posible para un determinado problema. Puede verse como la búsqueda de los valores de las variables de decisión para los cuales cierta función objetivo $(fo)$ logra su valor extremo (Chong y Zak, 2004CHONG, K.; ZAK, H.S.: An introduction to optimization, Ed. John Wiley & Sons, New York, USA, 2004, ISBN: 0-471-65400-0., 2013CHONG, E.; ZAK, S.: “An introduction to optimization”, En: An introduction to optimization 4th ed., Ed. John Wiley & Sons, second ed., New York, USA, p. 15, 2013.). La optimización multiobjetivo aplicada a los procesos de negocio puede resultar una buena opción para mejorar estos ya que más de un criterio de optimización puede ser seleccionado y satisfecho simultáneamente.

La Empresa Pesquera Industrial “Camilo Cienfuegos” situada en el municipio de Batabanó, provincia de Mayabeque tiene como objetivo la captura, industrialización y comercialización de forma mayorista, de especies de la plataforma frescas o congeladas, siendo la langosta su principal renglón exportable. El incremento constante de información a analizar en esta empresa trae consigo la necesidad de emplear técnicas analíticas para la extracción del conocimiento en grandes volúmenes de datos, necesarias para diagnosticar problemas e identificar posibles áreas de mejora del proceso. Su uso permitiría aplicar una estrategia de optimización de los procesos de producción de la langosta como la clave para reducir sus costos de producción y expandirse en su entorno.

Esto sustenta el interés de esta investigación en la necesidad que existe actualmente en la empresa de disponer de herramientas informáticas que les permitan optimizar sus procesos de producción de la langosta teniendo en cuenta varios objetivos como son el costo de producción y la completitud del proceso.

MÉTODOS

Gestión de Procesos de Negocio y Minería de Procesos

La Gestión de Procesos de Negocio permite a las empresas gestionar de una manera más eficiente sus procesos de negocio mediante métodos, técnicas y herramientas creadas para apoyar el diseño, la mejora, la gestión y el análisis de los mismos (Van der Aalst, 2011VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162., 2013VAN DER AALST, W.: Using Process Mining to Bridge the Gap between BI and BPM, Inst. IEEE Computer Society, 2013.; Van Der Aalst et al., 2011VAN DER AALST, W.; ADRIANSYAH, A.; DE MEDEIROS, A.A.K.; ARCIERI, F.; BAIER, T.; BLICKLE, T.; BOSE, J.C.; AGUIAR, P.K.; ABRAHÃO, R.F.; BUIJS, J.: “Process mining manifesto”, En: International Conference on Business Process Management, Ed. Springer, Germany, pp. 169-194, 2011.). La minería de procesos es una disciplina que tiene como objetivo descubrir, monitorear y mejorar procesos a través de la extracción de conocimiento de los registros de eventos disponibles en los actuales sistemas de información (Van der Aalst, 2011VAN DER AALST, W.: “Using process mining to bridge the gap between BI and BPM”, Computer, (12): 77-80, 2011, ISSN: 0018-9162.; Van Der Aalst et al., 2011VAN DER AALST, W.; ADRIANSYAH, A.; DE MEDEIROS, A.A.K.; ARCIERI, F.; BAIER, T.; BLICKLE, T.; BOSE, J.C.; AGUIAR, P.K.; ABRAHÃO, R.F.; BUIJS, J.: “Process mining manifesto”, En: International Conference on Business Process Management, Ed. Springer, Germany, pp. 169-194, 2011.; Van der Aalst et al., 2011VAN DER AALST, W.; ADRIANSYAH, A.; MEDEIROS, A.F.: Process Mining Manifiesto, Ed. Springer-Verlag, Business Process Management Workshops ed., vol. 99, Springer-Verlag, Germany, 5-20 p., 2011.).

Como apoyo a estas disciplinas se han creado distintos estándares que incluyen notaciones simbólicas para la definición de los procesos de negocio como son el Business Process Management Notation (BPMN) por la OMG Ter Ter Hofstede y Weske (2003)TER HOFSTEDE, A.H.M.; WESKE, M.: “Business process management: A survey”, [en línea], En: Proceedings of the 1st International Conference on Business Process Management , volume 2678 of LNCS, Ed. Citeseer, 2003, Disponible en:http://www.omg.org/spec/BPMN/2.0/PDF/ , [Consulta: 24 de abril de 2018]. y las redes de Petri por Murata (1989)MURATA, T.: “Petri nets: Properties, analysis and applications”, Proceedings of the IEEE, 77(4): 541-580, 1989, ISSN: 0018-9219.. En esta investigación se hace uso del BPMN para la modelación del proceso de producción de la langosta entera precocinada mediante la herramienta Bonita Studio y posteriormente se procede a convertir el proceso BPMN hacia el modelo matemático redes de Petri al cual se le aplicará la optimización multiobjetivo.

Optimización Multiobjetivo

Los problemas de optimización multiobjetivo (MOP, por sus siglas en inglés), se separan de la optimización convencional de un único objetivo, ya que, en la primera generalmente no se entrega una única solución. En vez de esto, el MOP genera un conjunto de posibles soluciones sobre las cuales los decisores deben seleccionar cual adoptar, basado en una evaluación del desempeño de la misma en todos los objetivos (Miettinen, 2008MIETTINEN, K.: “Introduction to multiobjective optimization: Noninteractive approaches”, En: Multiobjective optimization, Ed. Springer, Springer Berlin Heidelberg ed., vol. 52, Berlin, Germany, pp. 1-26, 2008.).

En un problema multiobjetivo se pueden escoger tantos objetivos como se desee, pero en este trabajo surge la necesidad de limitar el problema que se va a tratar en cuestión a la optimización de $k = 2$ objetivos: costo de producción y completitud del proceso que se quiere optimizar. El problema queda formulado de la siguiente manera:

Optimizar (Maximizar/Minimizar)

y = f_{(x)} = {(f}_{1} (x), f_{2} (x))

(1)

Sujeto a e_{1} (x) > 0

donde:

f_{1} (x) :

es la función objetivo de costo de producción del proceso;

f_{2} (x) :

es la función objetivo de completitud del proceso.

La función objetivo costo de producción del proceso consiste en sumar el costo de cada transición, es decir, la actividad en el proceso; disparada durante el parseo de las trazas. Solo que si una transición disparada por esta vía no tiene los tokens necesarios para su disparo entonces su costo no se añadirá al costo total. De esta forma se asegura que mientras más mala sea una solución (según el cálculo de la completitud) para el registro de eventos, su costo puede ser menor.

La función objetivo completitud del proceso está representada mediante su modelo, por lo que, un modelo será más completo en la medida que sea capaz de procesar un mayor número de trazas en el registro de eventos. Una de las formas básicas de obtener la completitud sería dividir la cantidad de trazas ejecutadas correctamente entre el total de trazas existentes en el registro de eventos. Para el desarrollo de la aplicación informática de la presente investigación se tuvo en cuenta la fórmula de completitud propuesta por De Medeiros et al. (2007)DE MEDEIROS, A.A.K.; WEIJTERS, J.M.M.A.; VAN DER AALST, M.P.W.: “Genetic process mining: an experimental evaluation”, Data Mining and Knowledge Discovery, 14(2): 245-304, 2007, ISSN: 1384-5810.. la cual se describe a continuación:

{P F}_{C o m p} = \frac{a c t . E j e c t (A P, C M) - p e n a l i d a d}{t o t a l E v t (Q)}

p e n a l i d a d = \frac{t o t M a r c A u s e n t (A P, C M)}{t o t a l T r a z a s (A P) - t r a z a s M a r c A u s e n t (A P, C M) + 1} + \frac{t o t M a r c A b a n d (A P, C M)}{t o t a l T r a z a s (A P) - t r a z a s M a r c A b a n d (A P, C M) + 1}

donde

AP:

representa las trazas,

CM:

matriz causal (codificación seleccionada para representar a cada individuo o solución),

act . Eject (AP, CM)

a las actividades ejecutadas correctamente

total Evt (Q)

al total de eventos en el registro de eventos (XES del proceso)

totMarcAusent (AP, CM)

al total de tokens necesarios para disparar una actividad (transición en la red de Petri) pero que no se encontraban en la plaza de entrada de dicha actividad

totalTrazas (AP)

al total de trazas en el registro de eventos

trazasMarcAusent (AP, CM)

al número de trazas donde se perdieron tokens

totMarcAband (AP, CM)

al número de tokens que quedaron en la red cuando terminó la ejecución de cada traza

totalTrazas (AP)

al total de trazas en el registro de eventos

trazasMarcAband (AP, CM)

al número de trazas donde quedaron tokens durante su ejecución

Algoritmos multiobjetivos en la optimización multiobjetivo

Los Algoritmos Evolutivos se refieren a técnicas de búsqueda y optimización inspiradas en el modelo de la evolución Los Algoritmos Evolutivos se refieren a técnicas de búsqueda y optimización inspiradas en el modelo de la evolución propuesto por Darwin (1859)DARWIN, C.R.: On the Origin of Species by Means of Natural Selection Or the Preservation of Favoured Races in the Struggle for Life., Ed. H. Milford; Oxford University Press, New York, USA, 1859.. Según Back (1996)BACK, T.: Evolutionary algorithms in theory and practice: evolution strategies, evolutionary programming, genetic algorithms, Ed. Oxford university press, New York, USA, 1996, ISBN: 0-19-535670-5., son métodos de optimización y búsqueda de soluciones basados en los postulados de la evolución biológica. En ellos se mantiene un conjunto llamado población, cuyos elementos representan posibles soluciones, las cuales se mezclan, y compiten entre sí, de tal manera que las más aptas son capaces de prevalecer a lo largo del tiempo, evolucionando hacia mejores soluciones. La población, en el contexto de la computación evolutiva de forma general se refiere a un conjunto de posibles soluciones (soluciones factibles) del problema que se desea resolver.

Existen diversos tipos de algoritmos evolutivos dentro de ellos los que más se desatacan son los algoritmos genéticos, estos han probado ser herramientas generales, robustas y potentes. En esta investigación se utilizan los algoritmos genéticos para la optimización multiobjetivo ya que estos son menos susceptibles a la forma y continuidad de la frontera de Pareto, requieren de poca información del dominio y son relativamente fáciles de usar e implementar. Los algoritmos genéticos multiobjetivos considerados para esta investigación son el NSGAII propuesto por el SPEA2 de Zitzler et al. (2001)ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001. y Deb et al. (2002)DEB, K.; PRATAP, A.; AGARWAL, S.; MEYARIVAN, T.: “A fast and elitist multiobjective genetic algorithm: NSGA-II”, IEEE transactions on evolutionary computation, 6(2): 182-197, 2002, ISSN: 1089-778X., ya que estos son los algoritmos más representativos para resolver problemas de optimización multiobjetivo.

Algoritmo Genético de Clasificación No Dominada II (NSGAII)

El NSGAII usa un procedimiento rápido para organizar la población por no dominancia, un enfoque para preservar el elitismo y un operador diferente al nicho, para dispersar los individuos en la frontera de Pareto. En este algoritmo, la población descendiente $Q_{t}$ (tamaño $N$ ) es creada en primera instancia usando la población de padres $P_{t}$ (tamaño $N$ ). Después de esto, las dos poblaciones son combinadas para formar $R_{t}$ de tamaño $2 N$ . Después de lo anterior, mediante un ordenamiento no dominado se clasifica la población $R_{t}$ en diferentes frentes de Pareto. Una vez el proceso de ordenamiento no dominado ha finalizado, la nueva población es generada a partir de las configuraciones de los frentes no dominados. Esta nueva población empieza a ser construida con el mejor frente no dominado $(F_{1})$ , continúa con las soluciones del segundo frente $(F_{2})$ , tercero $(F_{3})$ y así sucesivamente. Como la población $R_{t}$ es de tamaño $2 N$ , y solamente existen $N$ configuraciones que conforman la población descendiente, no todas las configuraciones de los frentes pertenecientes a la población $R_{t}$ podrán ser acomodadas en la nueva población. Aquellos frentes que no pueden ser acomodados desaparecen (Deb et al., 2002DEB, K.; PRATAP, A.; AGARWAL, S.; MEYARIVAN, T.: “A fast and elitist multiobjective genetic algorithm: NSGA-II”, IEEE transactions on evolutionary computation, 6(2): 182-197, 2002, ISSN: 1089-778X.).

Algoritmo Evolutivo Pareto de Fuerza 2 (SPEA2)

El SPEA2 de Zitzler et al. (2001)ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001., se centra en mejorar la asignación de la aptitud, la selección de los padres, el operador de truncamiento y en fijar el tamaño del archivo externo para todas sus generaciones. En este, la función de asignación de aptitud se mejora teniendo en cuenta para cada individuo el número de individuos a los que domina y el número de individuos por los que es dominado. Este esquema también añade una estimación de densidad poblacional. El tamaño $N_{E m a x}$ de la población externa $P_{E}$ (utilizada para elitismo) es fijo, a diferencia del SPEA, en el cual el tamaño de $P_{E}$ es variable pero acotado. $P_{E}$ está conformada sólo por individuos no dominados siempre y cuando el número de estos sea mayor o igual que $N_{E m a x}$ . En el caso en que el número de individuos no dominados sea menor que $N_{E m a x}$ se incluyen individuos dominados dentro de $P_{E}$ hasta que el tamaño de $P_{E}$ sea igual a $N_{E m a x}$ .

La técnica de agrupamiento (clustering), encargada de mantener la diversidad de la población en SPEA, es sustituida por un método de truncamiento, el cual evita eliminar las soluciones extremas del conjunto de soluciones no dominadas. La selección se realiza mediante torneo binario, tomando como criterio de comparación el fitness de cada uno de los individuos. SPEA2 asume minimización de fitness, por lo tanto, gana el torneo aquel individuo que tenga un menor valor de fitness (Zitzler et al., 2001ZITZLER, E.; LAUMANNS, M.; THIELE, L.: “SPEA2: Improving the strength Pareto evolutionary algorithm”, ComputerEngineering and Networks Laboratory (TIK)-report, 103, 2001.).

Tecnologías y herramientas utilizadas

Para llevar a cabo la implementación de la aplicación informática para optimizar el proceso de la Producción de la Langosta Entera Precocinada se utilizaron las siguientes tecnologías y herramientas: Java como lenguaje de programación y Netbeans como entorno de desarrollo (Peñarrieta, 2017PEÑARRIETA, R.: Programacion Java y Netbeans, 2017.). JMetal como framework propuesto por Nebro y Durillo (2014)NEBRO, A.; DURILLO, J.: jMetal 4.5 User Manual, jMetal, 2014., para la implementación de los algoritmos NSGAII y SPEA2. La herramienta de modelado Bonita Studio disponible en Castillo (2011)CASTILLO, A.P.A.: BONITA SOFT: Gestor de procesos de negocios BPM, [en línea], Inst. Universidad Nacional de Colombia, 2011, Disponible en:http://www.bonitasoft.com , [Consulta: 6 de abril de 2018]., para la modelación del proceso mediante la notación BPMN y Yasper para la importación del mismo en formato BPMN y convertirlo en una red de Petri, exportando así el proceso en el archivo PNML que es usado para realizar la optimización mediante la aplicación informática.

RESULTADOS Y DISCUSIÓN

La aplicación informática implementada consiste en optimizar el proceso de la producción de la langosta entera precocinada teniendo en cuenta el costo de producción y completitud del mismo. Para ello es necesario tener el proceso en una red de Petri y a partir de su representación en el archivo PNML y su registro de eventos en su archivo XES, se procede a cargar el proceso y a configurar los parámetros de entrada a considerar para cada uno de los algoritmos a emplear para efectuar la optimización.

Para el proceso que se estudia se tuvieron en cuenta los siguientes parámetros de entrada para ambos algoritmos: el tamaño de la población inicial (10), el número de evaluaciones (1000), el factor de cruzamiento (90%), el factor de mutación (50%) y en el caso del algoritmo SPEA2 se requiere de un nuevo parámetro que es el tamaño del archivo (10). La Figura 1 muestra la aplicación luego de haber cargado el proceso y configurado los parámetros de entrada para el algoritmo NSGAII y la Figura 2 para el algoritmo SPEA2.

FIGURA 1. Configuración de los parámetros de entrada para el algoritmo NSGAII.

FIGURA 2. Configuración de los parámetros de entrada para el algoritmo SPEA2.

Una vez configurado los parámetros de entrada se procede a realizar la opción de Optimizar y posteriormente en la Figura 3 se muestra el conjunto de soluciones óptimas del proceso optimizado según el algoritmo seleccionado, en este caso se muestran las posibles soluciones mediante el algoritmo NSGAII. Del conjunto de las soluciones óptimas propuestas se puede seleccionar la que desee visualizar y se muestra el diseño de proceso optimizado según la solución seleccionada y se le muestran al usuario la leyenda de las actividades eliminadas para ese diseño. Luego el analista del negocio (usuario) es el encargado de escoger cual es la mejor para aplicar en el desempeño del proceso.

FIGURA 3. Resultado del proceso optimizado.

Para la evaluación de los algoritmos se utilizaron los mismos parámetros de entrada y se realizaron 5 ejecuciones por cada uno de ellos. Para establecer una comparación preliminar entre ambos se tuvieron en cuenta 4 métricas: la generación de vectores no dominados, la razón de generación de vectores no dominados, la generación real de vectores no dominados y la distancia generacional utilizadas en Duarte (2001)DUARTE, S.F.: optimización multiobjetivo, Asunción, Paraguay, 2001..

Al aplicar estas métricas a los resultados de las ejecuciones de los algoritmos empleados en este trabajo se obtuvieron los resultados mostrados en las Tablas 1 y 2.

TABLA 1. Resultado de las métricas para el algoritmo NSGAII

$N^{o}$	GVND	RGVND	GRVND	G	Tiempo (ms)
1	10	0.4347	2	1028.686057553229	25767
2	10	0.4347	1	742.4495985176429	38939
3	10	0.4347	3	1330.966674735487	24997
4	10	0.4347	1	1056.8096230922495	17115
5	10	0.4347	1	877.1880986361423	25116
Promedio				1007.212	26386.8

TABLA 2. Resultado de las métricas para el algoritmo SPEA2

$N^{o}$	GVND	RGVND	GRVND	G	Tiempo (ms)
1	10	0.4347	3	1385.388867565802	30389
2	10	0.4347	6	349.1493484256466	26650
3	10	0.4347	2	457.9861558632601	36227
4	10	0.4347	0	1493.777450719814	19238
5	10	0.4347	4	951.6957906146208	39574
Promedio				927.592	30415.6

Analizando los datos de las Tablas 1 y 2, se puede observar que para ambos algoritmos el valor de la métrica de generación de vectores no dominados es 10 para cada una de las ejecuciones, esto está dado por el tamaño de la población inicial en ambos algoritmos y en el caso del algoritmo SPEA2 por el tamaño del archivo utilizado. En la métrica razón de generación de vectores no dominados ambos algoritmos presentan los mismos valores: 0.4347 porque dependen de la métrica anterior. Ambas métricas no muestran resultados factibles para comparar ambos algoritmos porque no existen diferencias entre ellos en cuanto a estas. Sin embargo, con la métrica generación real de vectores no dominados es diferente. Se ve claramente como el algoritmo SPEA2 logra encontrar más soluciones en el frente óptimo de Pareto. Y esta afirmación la demuestra la distancia generacional $(G)$ entre $Y_{k n o w n}$ y $Y_{t r u e}$ . Para las ejecuciones con el algoritmo SPEA2, la distancia generacional es menor en promedio que para las ejecuciones con el algoritmo NSGAII.

En este caso se puede llegar a la conclusión que el algoritmo SPEA2 obtiene mejores soluciones con respecto al algoritmo NSGAII, aunque el algoritmo SPEA2 consume en promedio más tiempo para hallar las soluciones que el algoritmo NSGAII. No obstante, se deja a consideración del usuario la selección de las posibles soluciones más óptimas que ofrecen estos pues es este el que conoce qué alternativa u objetivo pudiera mejorar el proceso de la Producción de la Langosta Entera Precocinada en la empresa de Batabanó.

CONCLUSIONES

El desarrollo de este trabajo permitió la implementación de una aplicación informática mediante la cual se optimizó el proceso de la Producción de la Langosta Entera Precocinada facilitándole a los analistas del negocio la obtención de modelos del proceso optimizado y así lograr un mejor desempeño de este en la Empresa Pesquera Industrial “Camilo Cienfuegos” de Batabanó. Se evaluaron los algoritmos NSGAII y SPEA2 empleados en la optimización del proceso teniendo en cuenta las métricas de generación de vectores no dominados, la razón de generación de vectores no dominados, la generación real de vectores no dominados y la distancia generacional. Estas arrojaron que el algoritmo SPEA2 logra encontrar mejores soluciones en el frente óptimo de Pareto, pero sacrificando un poco más de tiempo de ejecución comparado con el algoritmo NSGAII.