Query Processing and Optimization in Distributed Database Systems

Summary 

Query processing is an important concern in the field of distributed databases. The main problem is: if a query can be 

decomposed into subqueries that require operations at geographically separated databases, determine the sequence and 

the sites for performing this set of operations such that the operating cost (communication cost and processing cost) for 

processing this query is minimized. The problem is complicated by the fact that query processing not only depends on the 

operations of the query, but also on the parameter values associated with the query. Distributed query processing is an 

important factor in the overall performance of a distributed database system. 

Query optimization is a difficult task in a distributed client/server environment as data location becomes a major factor. In order to optimize queries accurately, sufficient information must be available to determine which data access techniques are most effective (for example, table and column cardinality, organization information, and index availability). Optimization algorithms have an important impact on the performance of distributed query processing. 

In this paper, we describe the distributed query optimization problem in detail. We then present a (ARRQ) technique to process queries with a minimum quantity of intersite data transfer. The technique can be used to process the query where all of the relations referenced by a query are non-fragmented but distributed in different sites. The proposed technique is used to determine which relations are to be partitioned into fragments, and where the fragments are to be sent for processing. The technique is efficient compared to other techniques, as it generally chooses more than one relation to 

remain fragmented which exploits parallelism, while replicating the other relations (excluding the fragmented relations) to the sites of the fragmented relations. Thus the communication costs and local processing costs can be reduced due to the reduced size of the fragmented relations and the response time of queries can be improved. 


Key words: 

Join, Semijoin, Query, FRS, PRS, LR, and Optimization.

Subscribe to access this work and thousands more
Overall Rating

0

5 Star
(0)
4 Star
(0)
3 Star
(0)
2 Star
(0)
1 Star
(0)
APA

Kilonzo, J. (2022). Query Processing and Optimization in Distributed Database Systems. Afribary. Retrieved from https://afribary.com/works/query-processing-and-optimization-in-distributed-d

MLA 8th

Kilonzo, James "Query Processing and Optimization in Distributed Database Systems" Afribary. Afribary, 09 Nov. 2022, https://afribary.com/works/query-processing-and-optimization-in-distributed-d. Accessed 26 Apr. 2024.

MLA7

Kilonzo, James . "Query Processing and Optimization in Distributed Database Systems". Afribary, Afribary, 09 Nov. 2022. Web. 26 Apr. 2024. < https://afribary.com/works/query-processing-and-optimization-in-distributed-d >.

Chicago

Kilonzo, James . "Query Processing and Optimization in Distributed Database Systems" Afribary (2022). Accessed April 26, 2024. https://afribary.com/works/query-processing-and-optimization-in-distributed-d

Document Details
Field: Information Technology Type: Paper 11 PAGES (7756 WORDS) (pdf)