Data Retrieval Method for Efficient Parallel Skyline Computing
Xiaofei Li
DOI: https://doi.org/10.59429/esta.v10i5.1422
Keywords: Data Mining; Skyline; Data Retrieval
Abstract
Skyline retrieval is a widely used data processing method, particularly for multi-keyword retrieval. Based on an analysis of the deficiencies of existing Skyline algorithms, a data retrieval method for efficient parallel Skyline computing is proposed. A database index structure, Par-Tree, is constructed, and signature information is added to it to reduce the probability of bit collisions during retrieval and to filter out retrieval areas irrelevant to the keywords, so that irrelevant data points are pruned. Based on the Par-Tree index structure, a multi-keyword mining algorithm, PSkyline, is proposed. The experimental results show that the method improves the execution efficiency of data mining and can effectively solve the multi-keyword Skyline retrieval problem.
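To make the two ideas in the abstract concrete, the following is a minimal sequential sketch of signature-based keyword filtering followed by a skyline (dominance) scan. It is not the Par-Tree index or the PSkyline algorithm, whose details the abstract does not give. The 64-bit signature width, the CRC32 hash, the "smaller is better" convention, and all function names are assumptions made for illustration only.

```python
import zlib
from typing import Iterable, List, Sequence, Set, Tuple

# Assumed record layout: (numeric attribute vector, keyword set).
Point = Tuple[Sequence[float], Set[str]]

SIG_BITS = 64  # assumed signature width; wider signatures collide less often


def signature(keywords: Iterable[str], bits: int = SIG_BITS) -> int:
    """Superimpose one hashed bit per keyword into an integer bitmap."""
    sig = 0
    for kw in keywords:
        sig |= 1 << (zlib.crc32(kw.encode("utf-8")) % bits)
    return sig


def may_contain(item_sig: int, query_sig: int) -> bool:
    """True if every query bit is set; false positives are possible, false negatives are not."""
    return item_sig & query_sig == query_sig


def dominates(a: Sequence[float], b: Sequence[float]) -> bool:
    """a dominates b if a is no worse in every dimension and strictly better in at least one."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


def keyword_skyline(points: Iterable[Point], query: Set[str]) -> List[tuple]:
    """Return the skyline of the points whose keyword sets cover the query."""
    query_sig = signature(query)
    skyline: List[tuple] = []
    for vec, kws in points:
        # Cheap bitmap test first; the exact subset test removes collisions.
        if not may_contain(signature(kws), query_sig) or not query <= kws:
            continue
        if any(dominates(s, vec) for s in skyline):
            continue
        # Drop current skyline members that the new point dominates.
        skyline = [s for s in skyline if not dominates(vec, s)]
        skyline.append(tuple(vec))
    return skyline


if __name__ == "__main__":
    data = [
        ((1, 5), {"hotel", "wifi"}),
        ((2, 2), {"hotel", "wifi", "pool"}),
        ((3, 3), {"hotel", "wifi"}),
        ((0, 0), {"hostel"}),
    ]
    print(keyword_skyline(data, {"hotel", "wifi"}))
```

In an index such as the one the abstract describes, a signature would presumably be stored per tree node so that whole subtrees failing the bitmap test can be skipped; here the test is applied per point only to keep the sketch short.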