In collaboration with Payame Noor University and the Iranian Society of Instrumentation and Control Engineers

Document Type : Research Article

Authors

1 Department of Computer Engineering and Information Technology‎, ‎Payame Noor University‎, ‎Tehran‎, ‎Iran

2 Department of Computer Engineering‎, ‎Neyshabur Branch‎, ‎Islamic Azad University‎, ‎Neyshabur‎, ‎Iran

3 Department of Computer Engineering‎, ‎Mashhad Branch‎, ‎Islamic Azad University‎, ‎Mashhad‎, ‎Iran

4 Aviation Engineer‎, ‎Mashhad Airport‎, ‎Mashhad‎, ‎Iran.

Abstract

With the advancements in science and technology‎, ‎the industrial and aviation sectors have witnessed a significant increase in data‎. ‎A vast amount of data is generated and utilized continuously‎. ‎It is imperative to employ data mining techniques to extract and uncover knowledge from this data‎. ‎Data mining is a method that enables the extraction of valuable information and hidden relationships from datasets‎. ‎However‎, ‎the current aviation data presents challenges in effectively extracting knowledge due to its large volume and diverse structures‎. ‎Air Traffic Management (ATM) involves handling Big data‎, ‎which exceeds the capacity of conventional acquisition‎, ‎matching‎, ‎management‎, ‎and processing within a reasonable timeframe‎. ‎Aviation Big data exists in batch forms and streaming formats‎, ‎necessitating the utilization of parallel hardware and software‎, ‎as well as stream processing‎, ‎to extract meaningful insights‎. ‎Currently‎, ‎the map-reduce method is the prevailing model for processing Big data in the aviation industry‎. ‎This paper aims to analyze the evolving trends in aviation Big data processing methods‎, ‎followed by a comprehensive investigation and discussion of data analysis techniques‎. ‎We implement the map-reduce optimization of the K-Means algorithm in the Hadoop and Spark environments‎. ‎The K-Means map-reduce is a crucial and widely applied clustering method‎. ‎Finally‎, ‎we conduct a case study to analyze and compare aviation Big data related to air traffic management in the USA using the K-Means map-reduce approach in the Hadoop and Spark environments‎. ‎The analyzed dataset includes flight records‎. ‎The results demonstrate the suitability of this platform for aviation Big data‎, ‎considering the characteristics of the aviation dataset‎. ‎Furthermore‎, ‎this study presents the first application of the designed program for air traffic management‎.

Keywords

Main Subjects

[1] Bonyani, M., Soleymani, M. (2022). “Towards improving workers' safety and progress monitoring of construction sites through construction site understanding”, arXiv preprint arXiv:2210.15760.
[2] Bonyani, M., Ghanbari, M., Rad, A. (2022). “Different gaze direction (dgnet) collaborative learning for iris segmentation”, Available at SSRN 4237124.
[3] Bonyani, M., Rahmanian, M., Jahangard, S., Rezaei, M. (2023). “Dipnet: Driver intention prediction for a safe takeover transition in autonomous vehicles”, IET Intelligent Transport Systems.
[4] Borne, K. (2014). “Top 10 big data challenges a serious look at 10 big data v’s”, Blog Post, 11.
[5] Burmester, G., Ma, H.,Steinmetz, D., Hartmannn, S. (2018). “Big data and data analytics in aviation”, Advances in Aeronautical Informatics: Technologies Towards Flight, 4.0, 55-65.
[6] Del Río, S., López, V., Benítez, J.M., Herrera, F. (2014). “On the use of mapreduce for imbalanced big data using random forest”, Information Sciences, 285, 112-137.
[7] Fahad, A., Alshatri, N., Tari, Z., Alamri, A., Khalil, I., Zomaya, A.Y., Foufou, S., Bouras, A. (2014). “A survey of clustering algorithms for big data: Taxonomy and empirical analysis”, IEEE Transactions on Emerging Topics in Computing, 2(3), 267-279.
[8] Hashem, I.A.T., Yaqoob, I., Anuar, N.B., Mokhtar, S., Gani, A., Khan, S.U. (2015). “The rise of “big data” on cloud computing: Review and open research issues”, Information Systems, 47, 98-115.
[9] Kasturi, E., Devi, S.P., Kiran, S.V., Manivannan, S. (2016). “Airline route profitability analysis and optimization using big data analyticson aviation data sets under heuristic techniques”, Procedia Computer Science, 87, 86-92.
[10] Li, R., Hu, H., Li, H., Wu, Y., Yang, J. (2016). “Mapreduce parallel programming model: A state-of-the-art survey”, International Journal of Parallel Programming, 44, 832-866.
[11] Minelli, M., Chambers, M., Dhiraj, A. (2013). “Big data, big analytics: Emerging business intelligence and analytic trends for today’s businesses”, 578, John Wiley & Sons.
[12] White, T. (2012). “Hadoop: The definitive guide”, O’Reilly Media, Inc.
[13] Zikopoulos, P., Eaton, C. (2011). “Understanding big data: Analytics for enterprise class Hadoop and streaming data”, McGraw-Hill Osborne Media.