Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce

Page view(s)
0
Checked on
Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce
Title:
Parallelizing Structural Joins to Process Queries over Big XML Data Using MapReduce
Journal Title:
Lecture Notes in Computer Science
OA Status:
closed
Publication URL:
Keywords:
Publication Date:
07 May 2021
Citation:
Volume 8645 of the series Lecture Notes in Computer Science pp 183-190
Abstract:
Processing XML queries over big XML data using MapReduce has been studied in recent years. However, the existing works focus on partitioning XML documents and distributing XML fragments into different compute nodes. This attempt may introduce high overhead in XML fragment transferring from one node to another during MapReduce execution. Motivated by the structural join based XML query processing approach, which uses only related inverted lists to process queries in order to reduce I/O cost, we propose a novel technique to use MapReduce to distribute labels in inverted lists in a computing cluster, so that structural joins can be parallelly performed to process queries. We also propose an optimization technique to reduce the computing space in our framework, to improve the performance of query processing. Last, we conduct experiment to validate our algorithms.
License type:
PublisherCopyrights
Funding Info:
Description:
ISSN:
0302-9743
Files uploaded:

File Size Format Action
dexa14-final.pdf 403.08 KB PDF Open