An architecture for processes analysis in smart factories based on big data, process mining, and machine learning techniques
Alireza Olyai
1
(
1- Department of Computer Engineering, Gorgan Branch, Islamic Azad University, Gorgan, Iran
)
Shideh Saraeian
2
(
)
Ali Nodehi
3
(
Department of Computer Engineering, Gorgan Branch, Islamic Azad University, Gorgan, Iran
)
Keywords: Smart Factory, Processes Analysis, Big Data, Process Mining, Machine Learning,
Abstract :
Due to the nature of smart factories and the use of new technologies such as cyber-physical systems, cloud computing, the Internet of Things, etc. in such environments, the volume of data generated has increased exponentially. Therefore, real-time processing of large volumes of high-speed data for process analysis is a difficult and challenging problem. In this case, big data analysis technologies, as a powerful tool, can play an important role in controlling processes. In this research, an architecture based on a combination of big data, process mining, and machine learning techniques for analyzing processes in smart factories is presented, which enables accurate and real-time analysis of processes in such environments. In fact, this architecture utilizes powerful big data analysis tools and new techniques such as process mining and by employing the logistic regression algorithm as a machine learning tool, it is able to extract valuable insights from the data generated in these environments. The results of the performance evaluation of the proposed architecture show that this architecture provides high capabilities in real-time process analysis through the combination of advanced data analysis techniques
[1] Mabkhot. M, Al-Ahmari. A, Salah. B, and Alkhalefah. H, “Requirements of the smart factory system: a survey and perspective,” Machines, vol. 6, no. 2, pp. 23, 2018.
[2] Chen. M, Mao. S, and Liu. Y, "Big data: A survey,” Mobile networks and applications, vol. 19, no. 2, pp. 171-209, 2014.
[3] Lee. J, Ardakani. H. D., Yang. S, and Bagheri. B, “Industrial big data analytics and cyber-physical systems for future maintenance & service innovation,” Procedia Cirp, vol. 38, pp. 3-7, 2015.
[4] Olyai. A, Saraeian. S, and Nodehi. A, “Process mining-based business process management architecture: A case study in smart factories,” Scientia Iranica, vol. 31, no. 14, 2024.
[5] Zur Muehlen. M, Workflow-based Process Controlling: Foundation, Design and Application of workflow-driven Process Information Systems, Logos Velrag Berlin, 2004.
[6] Polyvyanyy. A, Ouyang. C, Barros. A, and van der Aalst. W. M, “Process querying: Enabling business intelligence through query-based process analytics,” Decision Support Systems, vol. 100, pp. 41-56, 2017.
[7] Liu. X, Iftikhar. N, and Xie. X, “Survey of real-time processing systems for big data,” In Proceedings of the 18th International Database Engineering & Applications Symposium, Porto, Portugal, July 7-9, 2014, ACM, 2014, pp. 356-361.
[8] Nagdive. A S and Tugnayat. R M, “A review of Hadoop ecosystem for bigdata,” Int. J. Comput. Appl, vol. 180, no.14, pp. 35-40, 2018.
[9] Shaikh. E, Mohiuddin. I, Alufaisan. Y, and Nahvi. I, “Apache spark: A big data processing engine,” In 2019 2nd IEEE Middle East and North Africa COMMunications Conference (MENACOMM), Manama, Bahrain, November 19-21, 2019, IEEE ,2019, pp. 1-6.
[10] Salloum. S, Dautov. R, Chen. X, Peng. P X, and Huang. J Z, “Big data analytics on Apache Spark,” International Journal of Data Science and Analytics, vol. 1, pp.145-164 ,2016.
[11] Sahal. R, Breslin. J G, and Ali. M I, “Big data and stream processing platforms for Industry 4.0 requirements mapping for a predictive maintenance use case,” Journal of Manufacturing Systems, vol. 54, pp. 138-151, 2020.
[12] Vora. M N, “Hadoop-HBase for large-scale data,” In Proceedings of 2011 International Conference on Computer Science and Network Technology, Harbin, China, December 24-26, 2011, IEEE, 2011, pp. 601-605.
[13] Thusoo. A and et al., “Hive: a warehousing solution over a map-reduce framework,” Proceedings of the VLDB Endowment, vol. 2, no.2, pp.1626-1629, 2009.
[14] Bansal. K, Chawla. P, and Kurle. P, “Analyzing performance of apache pig and apache hive with Hadoop,” In Engineering Vibration, Communication and Information Processing: ICoEVCI, India, Springer Singapore, 2019, pp. 41-51.
[15] Alexakis. T, Peppes. N, Demestichas. K, and Adamopoulou. E, “ A distributed big data analytics architecture for vehicle sensor data,” Sensors, vol. 23, no. 1, pp. 357, 2022.
[16] Manogaran. G and et al., “A new architecture of Internet of Things and big data ecosystem for secured smart healthcare monitoring and alerting system,” Future Generation Computer Systems, vol. 82, pp. 375-387, 2018.
[17] Biswas. S and Sen. J, “A proposed architecture for big data driven supply chain analytics,” arXiv preprint arXiv:1705.04958, pp. 7-34, 2017.
[18] Yang. H, Park. M, Cho. M, Song. M, and Kim. S, ”A system architecture for manufacturing process analysis based on big data and process mining techniques,” In 2014 IEEE international conference on big data (big data), Washington, DC, USA, October 27-30, 2014, IEEE, 2014, pp. 1024-1029.
[19] Bashir. M R and Gill. A Q, “Towards an IoT big data analytics framework: smart buildings systems,” In 2016 IEEE 18th International Conference on High Performance Computing and Communications; IEEE 14th International Conference on Smart City; IEEE 2nd International Conference on Data Science and Systems (HPCC/SmartCity/DSS), Sydney, NSW, Australia, December 12-14, 2016, IEEE, 2016, pp. 1325-1332.
[20] Gohar. M and et al., “A big data analytics architecture for the internet of small things,” IEEE Communications Magazine, vol. 56, no. 2, pp.128-133, 2018.
[21] Constante-Nicolalde. F V, Pérez-Medina. J L, and Guerra-Terán. P, “A proposed architecture for iot big data analysis in smart supply chain fields,” In The international conference on advances in emerging trends and technologies, Cham: Springer International Publishing, 2019, pp. 361-374.
[22] Salierno. G, Morvillo. S, Leonardi. L, and Cabri. G, “An architecture for predictive maintenance of railway points based on big data analytics,” In International Conference on Advanced Information Systems Engineering, Cham: Springer International Publishing, 2020, pp. 29-40.
[23] Simaković. M N, Cica. Z G, and Masnikosa. I B. "Big Data architecture for mobile network operators,” In 2021 15th International Conference on Advanced Technologies, Systems and Services in Telecommunications (TELSIKS), Nis, Serbia, October 20-22, 2021, IEEE, 2021, pp. 283-286.
[24] Raif. M, Chehri. A, and Saadane. R, “Data architecture and big data analytics in smart cities,” Procedia Computer Science, vol. 207, pp. 4123-4131.2022.
[25] Ahaidous. K, Tabaa. M, and Hachimi. H, “Towards IoT-Big Data architecture for future education,” Procedia Computer Science, vol. 220, pp. 348-355.2023.
[26] Mills. N and et al., “A cloud-based architecture for explainable Big Data analytics using self-structuring Artificial Intelligence,” Discover Artificial Intelligence, vol.4, no. 1, pp.33, 2024.
[27] Çınar. Z M and et al., “Machine learning in predictive maintenance towards sustainable smart manufacturing in industry 4.0,” Sustainability, vol.12, no. 19, pp. 8211, 2020.
[28] Maier. A, Schriegel. S, and Niggemann. O, “Big data and machine learning for the smart factory—Solutions for condition monitoring, diagnosis and optimization,” Industrial Internet of Things: Cyber manufacturing Systems, pp. 473-485, 2017.
[29] Cho. S and et al., “A hybrid machine learning approach for predictive maintenance in smart factories of the future,” In Advances in Production Management Systems. Smart Manufacturing for Industry 4.0: IFIP WG 5.7 International Conference, APMS 2018, Seoul, Korea, August 26-30, 2018, Proceedings, Part II, Springer International Publishing, 2018, pp. 311-317.
[30] Halimaa. A and Sundarakantham. K, “Machine learning based intrusion detection system,” In 2019 3rd International conference on trends in electronics and informatics (ICOEI), Tirunelveli, India, April 23-25, 2019, IEEE, 2019, pp. 916-920.
[31] Saraeian. S, Shirazi. B, and Motameni. H, “Optimal autonomous architecture for uncertain processes management,” Information Sciences, vol. 501, pp. 84-99, 2019.
[32] Saraeian. S, and Shirazi. B, “Process mining-based anomaly detection of additive manufacturing process activities using a game theory modeling approach,” Computers & Industrial Engineering, vol. 146, pp. 106584, 2020.
[33] Theis. J, Galanter. W L, Boyd. A D, and Darabi. H, “Improving the in-hospital mortality prediction of diabetes ICU patients using a process mining/deep learning architecture,” IEEE Journal of Biomedical and Health Informatics, vol. 26, no.1, pp. 388-399, 2021.
[34] Ehsani. M and et al., “Machine learning for predicting concrete carbonation depth: A comparative analysis and a novel feature selection,” Construction and Building Materials, vol. 417, pp. 135331, 2024.
[35] Alizadeh. S, and Norani. A, “ICMA: a new efficient algorithm for process model discovery,” Applied Intelligence, vol. 48, no.11, pp. 4497-4514, 2018.
[36] Van der Aalst. W, Weijters. T, and Maruster. L, “Workflow mining: Discovering process models from event logs,” IEEE transactions on knowledge and data engineering, vol.16, no. 9, pp. 1128-1142, 2004.
[37] Weijters. AJMM, Van der Aalst. WMP, Medeiros. AK, “Process mining with the heuristics miner algorithm,” TU Eindhoven: BETA Working Paper Series, 2006.
[38] Van der Werf. J M E, van Dongen. B F, Hurkens. C A, and Serebrenik. A, “Process discovery using integer linear programming,” In International conference on applications and theory of petri nets, Springer, Berlin, Heidelberg, 2008, pp. 368-387.
[39] Günther. C W, and Van Der Aalst. W M, “Fuzzy mining–adaptive process simplification based on multi-perspective metrics,” In International conference on business process management, Springer Berlin Heidelberg, 2007, pp. 328-343.
[40] Van Der Aalst. W M P, Process Mining-Data Science in Action, 2rd ed., Springer, Berlin, Heidelberg, 2016.
[41] Leemans. S J, Fahland. D, and Van Der Aalst. W M, “Discovering block-structured process models from event logs containing infrequent behavior,” In Business Process Management Workshops: BPM 2013 International Workshops, Beijing, China, August 26, 2013, Revised Papers 11, Springer international publishing, 2014, pp. 66-78.
[42] Leemans. S J, Fahland. D, and Van der Aalst. W M, “Scalable process discovery and conformance checking,” Software & Systems Modeling, vol. 17, pp. 599-631, 2018.
[43] Genkin. M, Dehne. F, Shahmirza. A, Navarro. P, and Zhou. S, “Autonomic Architecture for Big Data Performance Optimization”, In Intelligent Systems Conference, Cham: Springer Nature Switzerland, 2024, pp. 475-496.
[44] Bettacchi. A, Polzonetti. A, and Re. B, “Understanding production chain business process using process mining: a case study in the manufacturing scenario”, In Advanced Information Systems Engineering Workshops: CAiSE 2016 International Workshops, Ljubljana, Slovenia, June 13-17, 2016, Proceedings 28, Springer International Publishing, 2016, pp. 193-203.
[45] Pohar. M, Blas. M, and Turk. S, “Comparison of logistic regression and linear discriminant analysis: a simulation study,” Metodoloski zvezki, vol.1, no. 1, pp.143, 2004.
[46] Kumar. V, “Evaluation of computationally intelligent techniques for breast cancer diagnosis,” Neural Computing and Applications, vol. 33, no.8, pp. 3195-3208, 2021.