CN101093501A - Method for querying high performance, transparent distributed spatial database - Google Patents
Method for querying high performance, transparent distributed spatial database Download PDFInfo
- Publication number
- CN101093501A CN101093501A CN 200710052872 CN200710052872A CN101093501A CN 101093501 A CN101093501 A CN 101093501A CN 200710052872 CN200710052872 CN 200710052872 CN 200710052872 A CN200710052872 A CN 200710052872A CN 101093501 A CN101093501 A CN 101093501A
- Authority
- CN
- China
- Prior art keywords
- fragment
- space
- spatial
- query
- connection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
本发明公开了一种高效、透明的分布式空间数据库查询方法,以Open GIS标准定义的空间拓扑关系和空间分析操作为基础,将空间拓扑关系操作和空间分析操作各分成两类:Disjoint操作和非Disjoint操作;Buffer操作和非Buffer操作,并根据空间拓扑关系操作和空间分析操作将空间片段的连接方式划分成四类,提出四类空间片段跨边界连接操作满足的三条规律,并在这三条规律的基础上,进一步提出跨边界空间片段连接优化方法的7个规则,包括3个去除规则和4个连接转化规则,通过对空间片段连接的优化实现了对分布式空间数据库高效、透明的查询。The invention discloses an efficient and transparent query method for a distributed spatial database. Based on the spatial topological relation and spatial analysis operation defined by the Open GIS standard, the spatial topological relation operation and the spatial analysis operation are divided into two categories: Disjoint operation and Non-Disjoint operation; Buffer operation and non-Buffer operation, and according to the spatial topological relationship operation and spatial analysis operation, the connection methods of space segments are divided into four types, and three rules are proposed for the cross-boundary connection operation of four types of space segments, and in these three On the basis of the regularity, 7 rules of cross-boundary spatial segment connection optimization method are further proposed, including 3 removal rules and 4 connection transformation rules. Through the optimization of spatial segment connection, efficient and transparent query of distributed spatial database is realized. .
Description
技术领域technical field
本发明涉及一种高效、透明的分布式空间数据库查询方法,属于信息领域。The invention relates to an efficient and transparent query method for a distributed spatial database, which belongs to the field of information.
背景技术Background technique
目前,实现分布式空间数据库跨边界无缝查询主要有两种方法:其一是按传统分布式数据库连接的分配规则将全局查询连接操作转化为各片段的连接操作,但查询处理的效率低;另一种方法是由用户来确定场地之间的相关性,即由用户来确定将查询发送到哪些相关的局部场地,这种方式不符合分布式数据库透明性的要求,而且由用户自己来确定场地间数据相关性,应用复杂,实际上也不太现实。At present, there are two main methods to achieve cross-boundary seamless query of distributed spatial databases: one is to convert the global query connection operation into the connection operation of each segment according to the distribution rules of traditional distributed database connections, but the efficiency of query processing is low; Another method is for the user to determine the relevance between the sites, that is, the user determines which relevant local sites to send the query to, this method does not meet the requirements of the transparency of the distributed database, and it is determined by the user himself The data correlation between venues is complicated to apply, and it is actually not realistic.
发明内容Contents of the invention
本发明的目的是提供一种高效、透明的分布式空间数据库查询方法,通过将提出的跨边界连接优化规则用于分布式空间数据的查询分解,自动生成局部空间数据库可以执行的空间子查询,达到高效、透明的无缝查询。The object of the present invention is to provide a kind of highly efficient, transparent distributed spatial database query method, by using the proposed cross-boundary connection optimization rule for the query decomposition of distributed spatial data, automatically generate the spatial sub-query that can be executed by the local spatial database, Achieve efficient, transparent and seamless query.
为实现本发明目的所采用的技术方案是:一种高效、透明的分布式空间数据库查询方法,通过查询分解、数据本地化、全局查询优化、局部查询优化四个层次实现,针对分割分片的分布式空间数据库跨边界无缝查询问题,先对空间片段的连接操作进行分类,以Open GIS标准定义的空间拓扑关系操作和空间分析操作为基础,将空间拓扑关系操作分成相离操作(Disjoint操作)和非相离操作(非Disjoint操作),将空间分析操作分成缓冲操作(Buffer操作)和非缓冲操作(非Buffer操作),并根据空间拓扑关系操作和空间分析操作将空间片段的连接方式划分成四类:第一类空间片段连接是非缓冲查询的非相离连接;第二类空间片段连接是非缓冲查询的相离连接;第三类空间片段连接是带缓冲查询的非相离连接;第四类空间片段连接是带缓冲查询的相离连接,这种分类初步理顺了空间拓扑关系操作、空间分析操作与空间片段连接的关联关系,为空间段片连接优化奠定了基础。The technical solution adopted for realizing the purpose of the present invention is: a highly efficient and transparent distributed spatial database query method, which is realized through four levels of query decomposition, data localization, global query optimization, and local query optimization. For the problem of cross-boundary seamless query of distributed spatial databases, first classify the connection operations of spatial segments, based on the spatial topological relational operations and spatial analysis operations defined by the Open GIS standard, divide the spatial topological relational operations into separation operations (Disjoint operations ) and non-disjoint operation (non-Disjoint operation), divide the spatial analysis operation into buffer operation (Buffer operation) and non-buffer operation (non-Buffer operation), and divide the connection mode of the spatial segment according to the spatial topology relation operation and the spatial analysis operation into four categories: the first type of spatial segment join is non-disjoint join with non-buffered query; the second type of spatial segment join is non-disjoint join with non-buffered query; the third type of spatial segment join is non-disjoint join with buffered query; The four types of spatial segment connections are separated connections with buffered queries. This classification preliminarily straightens out the relationship between spatial topology operations, spatial analysis operations, and spatial segment connections, and lays the foundation for the optimization of spatial segment connections.
分割分片的跨边界无缝查询问题实质上是跨边界空间片段连接问题,因此可以通过空间片段连接的优化来实现。跨边界片段连接优化的基本思想:一是减少空间片段连接数量;二是减少参与空间片段连接的对象个数,这样减少连接操作的时间,同时也减少场地间空间数据的传输量。根据所述的两种基本思想,不同类型的空间片段跨边界连接操作满足如下规律:I.满足第一类连接方式的两个空间片段必定与这两个空间片段分割边界的最小外包矩形(MBR)交矩形相交;II.一个空间片段关系的d缓冲区扩展的最小外包矩形不超出这个空间片段分割边界的最小外包矩形的d扩展,所谓的最小外包矩形的d扩展定义如下:设一个空间对象的最小外包矩形的左下角和右上角坐标分别为(xmin,ymin)、(xmax,ymax),则这个对象的最小外包矩形的d扩展是左下角和右上角坐标分别为(xmin-d,ymin-d)、(xmax+d,ymax+d)的矩形,其中d为正实数;III.两个空间片段间第三类空间连接,如果主片段对象和辅片段对象间满足空间片段的第三类连接方式,则主片段中满足第三类空间拓扑关系的对象与辅片段最小外包矩形的d扩展或最小外包矩形的双d扩展相交,其中主片段、辅片段、双d扩展的定义分别如下:参与连接的两个片段中包含缓冲操作的片段称为主片段,则另一个片段称辅片段,若参与连接的两个片段上都有缓冲操作,设缓冲区分别为d1、d2,则任选一个作为主片段,另一个作为辅片段,此时对任何一个片段进行最小外包矩形的d1+d2扩展,称为片段的双d扩展;The cross-boundary seamless query problem of partitioned shards is essentially a cross-boundary spatial segment connection problem, so it can be realized by optimizing the spatial segment connection. The basic idea of cross-boundary segment connection optimization: first, reduce the number of spatial segment connections; second, reduce the number of objects participating in the spatial segment connection, so as to reduce the connection operation time and reduce the amount of spatial data transmission between sites. According to the two basic ideas described, the cross-boundary connection operations of different types of space segments meet the following rules: ) intersection rectangle intersects; II. the minimum enclosing rectangle of the d buffer extension of a space segment relation does not exceed the d expansion of the minimum enclosing rectangle of this space segment division boundary, the so-called d expansion of the minimum enclosing rectangle is defined as follows: set a space object The coordinates of the lower left corner and upper right corner of the minimum enclosing rectangle of the object are (x min , y min ), (x max , y max ), then the d extension of the minimum enclosing rectangle of this object is that the coordinates of the lower left corner and upper right corner are (x The rectangle of min -d, y min -d), (x max +d, y max +d), where d is a positive real number; III. The third type of spatial connection between two spatial segments, if the main segment object and the auxiliary segment Objects satisfy the third type of connection mode of spatial segments, then the objects satisfying the third type of spatial topological relationship in the main segment intersect with the d extension of the smallest enclosing rectangle of the auxiliary segment or the double d extension of the smallest enclosing rectangle, where the main segment and the auxiliary segment The definition of the double-d extension is as follows: among the two fragments participating in the connection, the fragment containing the buffer operation is called the main fragment, and the other fragment is called the secondary fragment. If there are buffer operations on the two fragments participating in the connection, set the buffer are d1 and d2 respectively, then choose one as the main segment and the other as the auxiliary segment, at this time, carry out the d1+d2 extension of the smallest enclosing rectangle to any segment, which is called the double-d extension of the segment;
以上述的三个规律为基础,提出空间片段跨边界连接优化的方法,包括3个去除规则:(1)如果参与连接的两个空间片段的分割边界的最小外包矩形不相交,则这两个空间片段间第一类连接去除;(2)不相邻的空间片段第一类连接去除;(3)如果分割分片的两个空间片段的广义d扩展不相交,则这两个空间片段间第三类连接去除,所谓连接片段的广义d扩展定义如下:参与缓冲连接的一个片段X,它的广义d扩展表示为MBR(X)_dE,如果X中的对象先进行缓冲操作后再做片段连接,则d等于缓冲区距离,即MBR(X)_dE=MBR(X)_d,如果X中的对象直接进行片段连接,则看成d=0,此时MBR(X)_dE=MBR(X);以及下面4个连接转化规则:(4)在两个空间片段X、Y上进行第一类跨边界θsp-1 ed连接,转换为先对X、Y上进行基于X、Y分割边界最小外包矩形交矩形求交过滤,然后再进行第一类空间连接操作θsp-1,即:
本发明提供的一种高效、透明的分布式空间数据库查询方法,通过查询分解、数据本地化、全局查询优化、局部查询优化四个层次实现,步骤如下:An efficient and transparent distributed spatial database query method provided by the present invention is realized through four levels of query decomposition, data localization, global query optimization, and local query optimization. The steps are as follows:
查询分解:将查询问题转换为一个定义在全局关系上的关系代数表达式,查询分解为四步:第一步是将查询写成规范化形式,第二步是对规范化查询语句进行正确性分析,第三步是进行查询化简,第四步是将查询语句重写成一个代数查询,查询重写分为两个子步骤:一是将查询转换成关系代数,二是关系代数查询重构;Query decomposition: convert the query problem into a relational algebra expression defined on the global relationship. The query decomposition is divided into four steps: the first step is to write the query into a normalized form, the second step is to analyze the correctness of the normalized query statement, and the second step is to The third step is to simplify the query, and the fourth step is to rewrite the query statement into an algebraic query. The query rewriting is divided into two sub-steps: one is to convert the query into relational algebra, and the other is to reconstruct the relational algebraic query;
数据本地化:通过数据分布信息定位查询数据,确定查询中涉及的片段,把一个全局关系上的查询具体化,具体到本地化或进地化上的空间片段上进行查询,将全局关系上的关系代数表达式变换为在相应片段上的关系表达式,利用空间片段跨边界连接优化的7个规则来优化空间查询,根据空间片段跨边界连接优化方法中的去除规则(1)和(2)去掉多余的不相邻空间片段第一类连接,根据空间片段连接优化规则(3)去掉片段间第三类连接,并通过从空间全局数据字典中得到片段是否相邻、片段的最小外包矩形等信息判断运用四种连接转化规则中的何种优化规则进行空间片段连接的优化;Data localization: use data distribution information to locate the query data, determine the segments involved in the query, specify a query on a global relationship, and perform a query on a localized or evolutionary spatial segment, and integrate the global relationship The relational algebra expression is transformed into the relational expression on the corresponding segment, and the spatial query is optimized by using the 7 rules of the cross-boundary connection optimization of the space segment, according to the removal rules (1) and (2) in the space segment cross-boundary connection optimization method Remove redundant non-adjacent space segments of the first type of connection, according to the space segment connection optimization rule (3), remove the third type of connection between segments, and obtain whether the segments are adjacent to each other, the minimum enclosing rectangle of the segment, etc. from the spatial global data dictionary The information judges which optimization rule to use among the four connection transformation rules to optimize the connection of space segments;
全局查询优化:通过寻找一个近于最优化的执行策略,即找出分片查询的最佳操作次序,包括代价函数最小,代价函数一般是I/O、CPU和通信代价之和,输出一个优化的、片段上的关系代数查询;Global query optimization: By looking for a near-optimized execution strategy, that is, to find the best operation order of fragmented queries, including the minimum cost function, the cost function is generally the sum of I/O, CPU and communication costs, and output an optimized relational algebra queries on fragments;
局部查询优化:由拥有查询片段的各个站点在每一个站点上执行子查询,采用集中式系统算法通过该站点上的DBMS进行优化。Local query optimization: Each site with query fragments executes subqueries on each site, and uses a centralized system algorithm to optimize through the DBMS on the site.
本发明的有益效果是:在传统的透明查询处理基础上,通过提出的跨边界片段连接优化规则实现了空间片断间的连接优化,实现了高效、透明地查询分布式空间数据库。The beneficial effects of the invention are: on the basis of traditional transparent query processing, the connection optimization between space segments is realized through the proposed cross-boundary segment connection optimization rule, and efficient and transparent query of distributed spatial databases is realized.
具体实施方式Detailed ways
查询分解、数据本地化、全局查询优化、局部查询优化四个层次的具体步骤如下:The specific steps of the four levels of query decomposition, data localization, global query optimization, and local query optimization are as follows:
查询分解:将查询问题转换为一个定义在全局关系上的关系代数表达式。查询分解分为四步:第一步是将查询写成规范化形式,以便后续处理;第二步是对规范化查询语句进行正确性分析;第三步是进行查询化简,消除冗余谓词;第四步是将查询语句进行重写成一个代数查询,查询重写分为两个子步骤:一是将查询转换成关系代数,二是关系代数查询重构,用查询树来表述。在关系代数查询的重构时,通过传统数据库的转换规则来实现部分优化,包括消除冗余等。本层转换所需的信息在全局概念模式中获得。Query Decomposition: Converts a query problem into a relational algebra expression defined on global relations. The query decomposition is divided into four steps: the first step is to write the query into a standardized form for subsequent processing; the second step is to analyze the correctness of the normalized query statement; the third step is to simplify the query and eliminate redundant predicates; the fourth step The first step is to rewrite the query statement into an algebraic query. The query rewriting is divided into two sub-steps: one is to convert the query into a relational algebra, and the other is to reconstruct the relational algebra query and express it with a query tree. In the reconstruction of relational algebra query, some optimizations are realized through the conversion rules of traditional databases, including elimination of redundancy, etc. The information needed for this level of transformation is obtained in the global conceptual schema.
数据本地化:本层输入是分布式全局关系上的代数查询,通过数据分布信息定位查询数据,确定查询中涉及哪个片段,把一个全局关系上的查询进行具体化,落实到合适的片段上的查询。将全局关系上的关系代数表达式,变换为在相应片段上的关系表达式。Data localization: The input of this layer is an algebraic query on a distributed global relationship. The query data is located through the data distribution information to determine which segment is involved in the query, and the query on a global relationship is concretized and implemented on the appropriate segment. Inquire. Transforms a relational algebra expression on a global relation into a relational expression on the corresponding fragment.
利用空间片段连接优化的7个规则来优化空间查询。在传统的分布式数据库中,水平分片片段之间的连接存在许多多余的连接,按同一分割集分割得到的分割片段,同样存在多余的连接。根据空间片段连接优化方法中的去除规则(1)和(2)去掉多余的不相邻空间片段第一类连接,根据去除规则(3)去掉片段间第三类连接,并通过从空间全局数据字典中得到片段是否相邻、片段的MBR等信息判断运用四种连接转化规则中的何种规则进行空间片段连接的优化。Optimizing Spatial Queries Using 7 Rules for Spatial Fragment Join Optimization. In a traditional distributed database, there are many redundant connections between horizontally sharded fragments, and there are also redundant connections among the partitioned fragments obtained by splitting the same partition set. According to the removal rules (1) and (2) in the space segment connection optimization method, the redundant first-type connection of non-adjacent space segments is removed, and the third-type connection between segments is removed according to the removal rule (3), and the spatial global data Whether the fragments are adjacent, the MBR of the fragments and other information is obtained in the dictionary to determine which of the four connection transformation rules to use to optimize the spatial fragment connection.
全局查询优化:全局优化的输入是片段查询,即在片段上的代数查询,目的在于寻找一个近于最优化的执行策略,也就是找分片查询的最佳操作次序,包括代价函数最小,代价函数一般是I/O、CPU和通信代价之和。全局查询的输出是一个优化的、片段上的关系代数查询。本层所需的信息来自数据库统计信息,包括各场地片段统计信息、资源信息和通信信息等。Global query optimization: the input of global optimization is fragment query, that is, algebraic query on fragments. Functions are generally the sum of I/O, CPU, and communication costs. The output of a global query is an optimized, relational algebra query on fragments. The information required for this layer comes from the statistical information of the database, including the statistical information of each field segment, resource information and communication information.
局部查询优化:由拥有查询片段的各个站点在每一个站点上执行子查询,它采用集中式系统算法通过该站点上的DBMS进行优化,所需信息取自局部模式。Local query optimization: Sub-queries are executed on each site by each site that owns the query fragment. It uses a centralized system algorithm to optimize through the DBMS on the site, and the required information is taken from the local schema.
下面结合实施例对本发明做进一步说明。The present invention will be further described below in conjunction with embodiment.
实施例1Example 1
一个实验系统的具体执行步骤为:The specific execution steps of an experimental system are:
(1)系统调用:用户在查询界面上输入SQL语句后,将该全局SQL语句传递给分布式空间数据访问引擎。(1) System call: After the user enters the SQL statement on the query interface, the global SQL statement is passed to the distributed spatial data access engine.
(2)全局查询分解:分布式查询分解分为两步,第一步实现SQL查询语句的解析,得到语法树;第二步对语法树进行变换,得到归并连接树。(2) Global query decomposition: Distributed query decomposition is divided into two steps. The first step is to realize the parsing of SQL query statements to obtain the syntax tree; the second step is to transform the syntax tree to obtain the merge connection tree.
(3)数据本地化与查询优化:根据场地分片等信息对归并连接树进行本地化与优化,调用全局数据字典服务,用来得到数据本地化与查询优化过程中必须的全局信息,如片段场地、统计信息、连接操作的选择性因子等,利用去除规则和连接转化规则实现了空间片段连接的优化。根据去除规则(1)和(2)去掉多余的不相邻空间片段第一类连接;根据去除规则(3)去掉片段间第三类连接,并通过从空间全局数据字典中得到片段是否相邻、片段的MBR等信息判断运用四种连接转化规则中的何种规则进行空间片段连接优化。(3) Data localization and query optimization: localize and optimize the merged join tree according to site fragmentation and other information, and call the global data dictionary service to obtain the necessary global information in the process of data localization and query optimization, such as fragments Site, statistical information, selectivity factors of connection operations, etc., use the removal rules and connection conversion rules to realize the optimization of spatial segment connections. According to the removal rules (1) and (2), remove the redundant first-type connection of non-adjacent space segments; according to the removal rule (3), remove the third type of connection between segments, and obtain whether the segments are adjacent to each other from the spatial global data dictionary , fragment MBR and other information to determine which of the four connection transformation rules to use for space fragment connection optimization.
(4)查询调度;分布式空间数据访问引擎得到查询执行计划树后,将查询执行计划树和与之对应的调度脚本发送到调度服务器上,调度服务器通过分布式脚本引擎来执行调度脚本,对查询执行树进行遍历,将每个执行结点发送到相关的执行场地上的代理服务器进行执行。全部完成后,将最终查询结果集返回给分布式空间数据访问引擎。(4) Query scheduling; after the distributed spatial data access engine obtains the query execution plan tree, it sends the query execution plan tree and the corresponding scheduling script to the scheduling server, and the scheduling server executes the scheduling script through the distributed script engine, The query execution tree is traversed, and each execution node is sent to the proxy server on the relevant execution site for execution. After all are completed, the final query result set is returned to the distributed spatial data access engine.
(5)执行结点的执行:代理服务器获得执行结点后,根据自身数据源的特点,进行片段查询转换,并调用相关的局部空间数据库引擎和基础数据访问驱动进行查询,在得到数据集后,按照本系统规定的同一格式进行序列化,最后将结果发送到指定的位置。(5) Execution of the execution node: After obtaining the execution node, the proxy server performs segment query conversion according to the characteristics of its own data source, and invokes the relevant local spatial database engine and basic data access driver to query. After obtaining the data set , serialize according to the same format specified by this system, and finally send the result to the specified location.
Claims (2)
- One kind efficient, transparent distributed spatial database querying method, pass through query decomposition, the data localization, global query optimization, four levels of local query optimization are realized, it is characterized in that: at the cross-border seamless inquiry problem of the distributed spatial database of cutting apart burst, earlier the attended operation of space fragment is classified, spatial topotaxy operation and spatial analysis operation based on the OpenGIS standard definition, the spatial topotaxy operation is divided into from operating with non-from operation, spatial analysis operation is divided into buffer operation and non-buffer operation, and the connected mode of space fragment is divided into four classes according to spatial topotaxy operation and spatial analysis operation: first kind space fragment be connected be non-buffering inquire about non-from connection; The second space-like fragment connect be non-buffering inquiry from connection; It is the non-from connection of band buffering inquiry that the 3rd space-like fragment connects; The 4th space-like fragment connect be the inquiry of band buffering from connection, this classification has been set up spatial topotaxy operation, spatial analysis operation and the incidence relation between the space fragment is connected, on this basis, following rule is satisfied in the cross-border attended operation of dissimilar space fragment:I. satisfy two space fragments of first kind connected mode and must hand over rectangle intersection with the minimum outsourcing rectangle of these two space fragment partitioning boundaries;II. the minimum outsourcing rectangle of the d buffer zone expansion of a space fragment relation does not exceed the d expansion of the minimum outsourcing rectangle of this space fragment partitioning boundary, and the d expanded definition of so-called minimum outsourcing rectangle is as follows: the lower left corner and the upper right corner coordinate of establishing the minimum outsourcing rectangle of a spatial object are respectively (x Min, y Min), (x Max, y Max), then the d of the minimum outsourcing rectangle of this object expansion is that the lower left corner and upper right corner coordinate are respectively (x Min-d, y Min-d), (x Max+ d, y Max+ d) rectangle, wherein d is an arithmetic number;III. two intersegmental the 3rd space-likes of spatial pieces connect, if satisfy the 3rd class connected mode of space fragment between main leaf section object and auxilliary fragment objects, the object that then satisfies the 3rd space-like topological relation in the main leaf section intersects with the d expansion of the minimum outsourcing rectangle of auxilliary fragment or two d expansions of minimum outsourcing rectangle, main leaf section wherein, auxilliary fragment, the definition of two d expansions is as follows respectively: the fragment that comprises buffer operation in two fragments that participate in connecting is called the main leaf section, then another fragment claims auxilliary fragment, if on two fragments that participate in connecting buffer operation is arranged all, if buffer zone is respectively d1, d2, then choose one wantonly as the main leaf section, another is as auxilliary fragment, carry out the d1+d2 expansion of minimum outsourcing rectangle to any one fragment this moment, is called two d expansions of fragment;Based on these three rules, 7 rules that fragment cross-border connection in space is optimized are proposed, remove rule comprising following 3:(1) if the minimum outsourcing rectangle of the partitioning boundary of two space fragments that participation connects is non-intersect, then the intersegmental first kind of these two spatial pieces connects removal;(2) non-conterminous space fragment first kind connection is removed;(3) expand non-intersect if cut apart the broad sense d of two space fragments of burst, then intersegmental the 3rd class of these two spatial pieces connects removal, the broad sense d expanded definition of so-called junction fragment is as follows: participate in the fragment X that buffering connects, its broad sense d expansion table is shown MBR (X) _ d EIf, do fragment again after the advanced row buffering operation of the object among the X and connect, then d equals the buffer zone distance, i.e. MBR (X) _ d E=MBR (X) _ d connects if the object among the X directly carries out fragment, then regards d=0 as, at this moment MBR (X) _ d E=MBR (X);And following 4 connection transformation rules:(4) on two space fragment X, Y, carry out the cross-border θ of the first kind Sp-1 EdConnect, be converted to earlier and hand over rectangle to ask friendship to filter, and then carry out first kind spatial join operation θ based on X, the minimum outsourcing rectangle of Y partitioning boundary carrying out on X, the Y Sp-1, that is:(5) if the minimum outsourcing rectangle of the partitioning boundary of two space fragments that participation connects is non-intersect, then the second intersegmental class of these two spatial pieces connects the cartesian product that is converted into these two fragments;(6) on two space fragment X, Y, carry out the cross-border connection of the 3rd class θ Sp-3 Ed, be converted to and on X, carry out earlier the 3rd space-like topology filtration θ MBR (Y) _ d f(X), filter result is carried out the d buffering, and then carry out the 3rd space-like attended operation θ Sp-3, promptly(7) the broad sense d expansion of two space fragments of participation connection is non-intersect, and then these two intersegmental connections of spatial pieces are converted into the cartesian product of these two space fragments.
- 2. a kind of efficient, transparent distributed spatial database querying method according to claim 1 is characterized in that: the concrete steps that query decomposition, data localization, global query optimization, four levels of local query optimization are realized are as follows:Query decomposition: the inquiry problem is converted to a relational algebra expression formula that is defined on the holotopy, query decomposition was four steps: the first step is write inquiry as normalized form, second step was that the standardization query statement is carried out the correctness analysis, the 3rd step was to inquire about abbreviation, the 4th step was that query statement is rewritten as an algebraically inquiry, query rewrite is divided into two sub-steps: the one, query conversion is become relational algebra, and the 2nd, relational algebra inquiry reconstruct;Data localization: by DATA DISTRIBUTION information locating query data, the fragment that relates in determining to inquire about, inquiry on the holotopy is specialized, inquire about specific to localization or on the space fragment on changing with advancing, relational algebra expression formula on the holotopy is transformed to relational expression on respective segments, utilize 7 rules of the cross-border connection optimization of space fragment to optimize space querying, removing the unnecessary non-conterminous space fragment first kind according to the cross-border connection principle of optimality of space fragment (1) and (2) is connected, connect the principle of optimality (3) according to the space fragment and remove intersegmental the 3rd class connection of sheet, and whether adjacent by from the global catalog of space, obtaining fragment, information such as the minimum outsourcing rectangle of fragment are judged the optimization of using four kinds of which kind of principles of optimality that connects in the transformation rule to carry out space fragment connection;Global query optimization: be bordering on optimized implementation strategy by seeking one, promptly find out the optimum operation order of burst inquiry, a relational algebra inquiry optimization of output, on the fragment;Local query optimization: on each website, carry out subquery by each website that has query fragment, adopt the integrated system algorithm to be optimized by the DBMS on this website.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2007100528722A CN100573524C (en) | 2007-07-31 | 2007-07-31 | A kind of efficient, transparent distributed spatial database querying method |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNB2007100528722A CN100573524C (en) | 2007-07-31 | 2007-07-31 | A kind of efficient, transparent distributed spatial database querying method |
Publications (2)
Publication Number | Publication Date |
---|---|
CN101093501A true CN101093501A (en) | 2007-12-26 |
CN100573524C CN100573524C (en) | 2009-12-23 |
Family
ID=38991765
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2007100528722A Expired - Fee Related CN100573524C (en) | 2007-07-31 | 2007-07-31 | A kind of efficient, transparent distributed spatial database querying method |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN100573524C (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101984433A (en) * | 2010-11-12 | 2011-03-09 | 浙江大学 | Convexity based multiple spots far and near querying method |
CN102362276A (en) * | 2009-04-01 | 2012-02-22 | 赛贝斯股份有限公司 | Testing efficiency and stability of a database query engine |
CN103324724A (en) * | 2013-06-26 | 2013-09-25 | 华为技术有限公司 | Method and device for processing data |
CN104903894A (en) * | 2013-01-07 | 2015-09-09 | 脸谱公司 | System and method for distributed database query engines |
CN105608086A (en) * | 2014-11-17 | 2016-05-25 | 中兴通讯股份有限公司 | Transaction processing method and device of distributed database system |
CN105809735A (en) * | 2016-03-11 | 2016-07-27 | 武汉大学 | Topology maintenance method based on three-dimensional geometric body combination |
WO2017049913A1 (en) * | 2015-09-23 | 2017-03-30 | 中兴通讯股份有限公司 | Database execution method and device |
CN103714073B (en) * | 2012-09-29 | 2017-04-12 | 国际商业机器公司 | Method and device for querying data |
CN106886674A (en) * | 2016-08-24 | 2017-06-23 | 阿里巴巴集团控股有限公司 | A kind of geographical position Distance Batch computational methods and device |
CN108228663A (en) * | 2016-12-21 | 2018-06-29 | 杭州海康威视数字技术股份有限公司 | A kind of paging search method and device |
CN110968615A (en) * | 2018-09-30 | 2020-04-07 | 北京国双科技有限公司 | Data query method and device |
CN111522816A (en) * | 2020-04-16 | 2020-08-11 | 云和恩墨(北京)信息技术有限公司 | Data processing method, device, terminal and medium based on database engine |
CN113761093A (en) * | 2021-01-27 | 2021-12-07 | 京东城市(北京)数字科技有限公司 | Method, apparatus, computer equipment and storage medium for determining spatial binary groups |
-
2007
- 2007-07-31 CN CNB2007100528722A patent/CN100573524C/en not_active Expired - Fee Related
Cited By (25)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102362276A (en) * | 2009-04-01 | 2012-02-22 | 赛贝斯股份有限公司 | Testing efficiency and stability of a database query engine |
CN102362276B (en) * | 2009-04-01 | 2015-04-22 | 赛贝斯股份有限公司 | Testing efficiency and stability of a database query engine |
CN101984433B (en) * | 2010-11-12 | 2012-11-21 | 浙江大学 | Convexity based multiple spots far neighbor querying method |
CN101984433A (en) * | 2010-11-12 | 2011-03-09 | 浙江大学 | Convexity based multiple spots far and near querying method |
CN103714073B (en) * | 2012-09-29 | 2017-04-12 | 国际商业机器公司 | Method and device for querying data |
CN104903894B (en) * | 2013-01-07 | 2018-12-28 | 脸谱公司 | System and method for distributed networks database query engine |
CN104903894A (en) * | 2013-01-07 | 2015-09-09 | 脸谱公司 | System and method for distributed database query engines |
US11347761B1 (en) | 2013-01-07 | 2022-05-31 | Meta Platforms, Inc. | System and methods for distributed database query engines |
US10698913B2 (en) | 2013-01-07 | 2020-06-30 | Facebook, Inc. | System and methods for distributed database query engines |
US10210221B2 (en) | 2013-01-07 | 2019-02-19 | Facebook, Inc. | System and method for distributed database query engines |
CN103324724A (en) * | 2013-06-26 | 2013-09-25 | 华为技术有限公司 | Method and device for processing data |
CN103324724B (en) * | 2013-06-26 | 2017-02-08 | 华为技术有限公司 | Method and device for processing data |
CN105608086A (en) * | 2014-11-17 | 2016-05-25 | 中兴通讯股份有限公司 | Transaction processing method and device of distributed database system |
CN105608086B (en) * | 2014-11-17 | 2021-07-27 | 中兴通讯股份有限公司 | Transaction processing method and device for distributed database system |
WO2016078423A1 (en) * | 2014-11-17 | 2016-05-26 | 中兴通讯股份有限公司 | Transaction processing method and apparatus for distributed database system |
WO2017049913A1 (en) * | 2015-09-23 | 2017-03-30 | 中兴通讯股份有限公司 | Database execution method and device |
CN105809735A (en) * | 2016-03-11 | 2016-07-27 | 武汉大学 | Topology maintenance method based on three-dimensional geometric body combination |
CN106886674A (en) * | 2016-08-24 | 2017-06-23 | 阿里巴巴集团控股有限公司 | A kind of geographical position Distance Batch computational methods and device |
CN106886674B (en) * | 2016-08-24 | 2019-07-23 | 阿里巴巴集团控股有限公司 | A kind of geographical location Distance Batch calculation method and device |
CN108228663A (en) * | 2016-12-21 | 2018-06-29 | 杭州海康威视数字技术股份有限公司 | A kind of paging search method and device |
CN110968615A (en) * | 2018-09-30 | 2020-04-07 | 北京国双科技有限公司 | Data query method and device |
CN110968615B (en) * | 2018-09-30 | 2023-05-23 | 北京国双科技有限公司 | Data query method and device |
CN111522816A (en) * | 2020-04-16 | 2020-08-11 | 云和恩墨(北京)信息技术有限公司 | Data processing method, device, terminal and medium based on database engine |
CN111522816B (en) * | 2020-04-16 | 2021-04-30 | 云和恩墨(北京)信息技术有限公司 | Data processing method, device, terminal and medium based on database engine |
CN113761093A (en) * | 2021-01-27 | 2021-12-07 | 京东城市(北京)数字科技有限公司 | Method, apparatus, computer equipment and storage medium for determining spatial binary groups |
Also Published As
Publication number | Publication date |
---|---|
CN100573524C (en) | 2009-12-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101093501A (en) | Method for querying high performance, transparent distributed spatial database | |
US7092954B1 (en) | Optimizing an equi-join operation using a bitmap index structure | |
US7730055B2 (en) | Efficient hash based full-outer join | |
CN109614432B (en) | A system and method for obtaining data blood relationship based on syntax analysis | |
CN105955999B (en) | A ThetaJoin Query Processing Method for Large-scale RDF Graphs | |
CN107169033A (en) | Relation data enquiring and optimizing method with parallel framework is changed based on data pattern | |
CN101436192A (en) | Method and apparatus for optimizing inquiry aiming at vertical storage type database | |
CN105975488A (en) | Method for querying keyword based on topic cluster unit in relational database | |
JPH07319923A (en) | Method and equipment for processing of parallel database of multiprocessor computer system | |
CN109710638A (en) | A multi-query optimization method on federated distributed RDF database | |
CN105320679A (en) | Data table index set generation method and device | |
US20110022581A1 (en) | Derived statistics for query optimization | |
CN108052635A (en) | A kind of heterogeneous data source unifies conjunctive query method | |
WO2015074466A1 (en) | Data search method and apparatus | |
CN105630881A (en) | Data storage method and query method for RDF (Resource Description Framework) | |
CN104778277A (en) | RDF (radial distribution function) data distributed type storage and querying method based on Redis | |
CN112732752A (en) | Query statement optimization method, device, equipment and storage medium | |
CN104809168A (en) | Partitioning and parallel distribution processing method of super-large scale RDF graph data | |
CN108073641B (en) | Method and device for querying data table | |
CN101710336A (en) | Method for accelerating data processing by using relational middleware | |
US7945560B2 (en) | Technique for removing subquery in group by—having clauses using window functions | |
CN107247738A (en) | A kind of extensive knowledge mapping semantic query method based on spark | |
Jin et al. | Making RDBMSs efficient on graph workloads through predefined joins | |
CN112487015A (en) | Distributed RDF system based on incremental repartitioning and query optimization method thereof | |
US8150865B2 (en) | Techniques for coalescing subqueries |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
ASS | Succession or assignment of patent right |
Owner name: CHANGSHU ZIJIN INTELLECTUAL PROPERTY SERVICE CO., Free format text: FORMER OWNER: WUHAN UNIVERSITY Effective date: 20121225 |
|
C41 | Transfer of patent application or patent right or utility model | ||
COR | Change of bibliographic data |
Free format text: CORRECT: ADDRESS; FROM: 430072 WUHAN, HUBEI PROVINCE TO: 215500 SUZHOU, JIANGSU PROVINCE |
|
TR01 | Transfer of patent right |
Effective date of registration: 20121225 Address after: 215500 Changshou City South East Economic Development Zone, Jiangsu, Jin Road, No. 8 Patentee after: Changshu Zijin Intellectual Property Service Co., Ltd. Address before: 430072 Hubei city of Wuhan province Wuchang Luojiashan Patentee before: Wuhan University |
|
C41 | Transfer of patent application or patent right or utility model | ||
TR01 | Transfer of patent right |
Effective date of registration: 20160406 Address after: 215500 No. 8, Jin Du Road, Changshou City hi tech Industrial Development Zone, Jiangsu, China Patentee after: Changshu Nanjing Normal University Development Research Academy Institute Co., Ltd. Address before: 215500 Changshou City South East Economic Development Zone, Jiangsu, Jin Road, No. 8 Patentee before: Changshu Zijin Intellectual Property Service Co., Ltd. |
|
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20091223 Termination date: 20180731 |
|
CF01 | Termination of patent right due to non-payment of annual fee |