CN103927346B - Query connection method on basis of data volumes - Google Patents

Query connection method on basis of data volumes Download PDF

Info

Publication number
CN103927346B
CN103927346B CN201410124531.1A CN201410124531A CN103927346B CN 103927346 B CN103927346 B CN 103927346B CN 201410124531 A CN201410124531 A CN 201410124531A CN 103927346 B CN103927346 B CN 103927346B
Authority
CN
China
Prior art keywords
statistical information
data
row
query
inquiry
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201410124531.1A
Other languages
Chinese (zh)
Other versions
CN103927346A (en
Inventor
陈岭
周强
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University ZJU
Original Assignee
Zhejiang University ZJU
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University ZJU filed Critical Zhejiang University ZJU
Priority to CN201410124531.1A priority Critical patent/CN103927346B/en
Publication of CN103927346A publication Critical patent/CN103927346A/en
Application granted granted Critical
Publication of CN103927346B publication Critical patent/CN103927346B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2455Query execution
    • G06F16/24553Query execution of query operations
    • G06F16/24558Binary matching operations
    • G06F16/2456Join operations

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a query connection method on the basis of data volumes. Characteristics such as line file reading are taken into deep consideration during real-time query on big data by the aid of the query connection method, so that costs can be estimated, and the optimal connection sequences can be assuredly generated. The query connection method mainly includes constructing metadata servers; collecting statistical information; querying the metadata servers and acquiring relevant statistical information of various tables participating in connection; estimating the selectivity and relevant parameters such as the data volumes according to the statistical information; computing the corresponding costs of various execution plans to find out the optimal connection sequences. The query connection method has the advantages that the cost estimation accuracy can be improved by the aid of the query connection method, accordingly, the optimal execution plans can be assuredly found out, and the integral query efficiency can be effectively improved.

Description

Inquiry method of attachment based on data volume
Technical field
The present invention relates to big data real-time query optimisation technique field, more particularly, to a kind of inquiry connection based on data volume Method.
Background technology
Big data real-time query is important big data technology, existing big data inquiry system have Google Dremel, Cloudera Impala, Berkeley Shark, Apache Drill etc..Big data real-time query typically adopts distributed meter Calculate framework, due to weakening the support to functions such as affairs, so there is higher expanding with respect to relevant database cluster Malleability.Simultaneously because big data real-time query is well positioned to meet the user's request of real-time query, therefore it is in internet, wisdom There is wide application space in the fields such as city.
Multi-link sequential query optimization is the important component part of data base management system, in big data real-time query technology It is likewise supplied with irreplaceable importance in field.It passes through using certain optimization method, constantly travels through searching of executive plan Rope space, finds out the optimal order of connection, to generate optimal executive plan, thus lifting the performance of big data inquiry system, Meet the real-time demand of user's inquiry.
Estimate it is very important part in multi-link sequential query optimization process due to cost, can be given a kind of effective Result size estimation method be the key that query optimization is effectively realized.Traditional cost method of estimation is a kind of based on table radix Method, conventional cost can effectively be solved by the method and estimate problem, thus ensureing to find and meet Cost Model Good executive plan.But in distributed data base system or data warehouse, exist with the tables of data of column stored in file format, should Formatted file is to optimize I/O performance and minimizing data transfer rate amount when bottom data is read out, with RCFile literary composition As a example part, this document is a kind of first file format by row transversally cutting and then by the longitudinal cutting of row, and it will only read and transmit Required data row.When participating in connecting to the tables of data having column stored in file format, using the generation based on table radix for the tradition When valency method of estimation is estimated, its result may produce serious deviation, and then leads to order of connection optimized algorithm to be found out The executive plan meeting Cost Model is not optimal, that is, the order of connection finding is not optimum, so that making whole inquiry Postpone higher.
Content of the invention
The technical problem to be solved in the present invention is how to guarantee that big data real time inquiry system carries out multi-link sequential optimization The accuracy that its cost of Shi Tisheng is estimated, thus lift the overall efficiency of inquiry.Entered based on table radix to solve above-mentioned tradition The problem that row cost estimation exists, the present invention proposes the multi-join query cost method of estimation based on data volume it is contemplated that using The part relations participating in the inquiry that family is submitted to connecting may be stored with column file, by deeply considering column file reading etc. Characteristic, increases more fine-grained statistical information, and the average length using each field is big with the connection intermediate result of estimating inquiry Little, thus effectively guaranteeing the accuracy of cost estimation.
A kind of inquiry method of attachment based on data volume, including:
Step 1, submits inquiry request to meta data server, obtains the statistical information corresponding to each table participating in connecting;
Step 2, the statistical information estimation according to getting obtains the data volume of all tables in current executive plan;
Step 3, repeat step 1 and step 2, until the search space of traversal executive plan, find out with suitable data amount Make the minimum executive plan of Query Cost, carry out the connection of table by the order of connection in this executive plan.
The search space of wherein executive plan refers to the set of the table order of connection obtained by all executive plans.
The present invention to determine, as Query Cost, the order connecting in multi-join query using data volume, so that it is guaranteed that big data Real time inquiry system carries out lifting the accuracy of its cost estimation during multi-link sequential optimization, thus lifting the overall effect of inquiry Rate.
Wherein, meta data server building mode is, chooses relevant database and designs the table schema of row rank, according to The table schema designing creates metadatabase and table relation in corresponding relevant database, obtains meta data server.
In order to for inquiry system provide table level not, the statistical information of three kinds of granularities such as subregion rank and row rank, if Counting corresponding table schema needs to meet suitable normal form, and simultaneously on the premise of can completing cost estimation, reduce need not as far as possible The storage overhead wanted.
Statistical information in meta data server is the statistical information corresponding to every table, and described statistical information is according to design Table schema statistics carried out to table obtain.
The fine granularity of statistical information obtains due to table schema being row class pattern according to the fine granularity of table schema, therefore counts Information includes the statistical information of row rank.
Described relevant database is:MYSQL database, Derby database or oracle database.
According to the actual demand of enterprise customer and system, choose suitable relevant database as big data real-time query The meta data server of system.
Statistical information includes:The upper bound of data value in the lower bound of data value in row name, row, row, table midrange are according to being empty Quantity, table midrange according to different value quantity, row in field data average length and row in field data maximum length, Table or total line number of view.
Meta data server and statistical information all complete in the storage in meta data server under off-line state.
Due to the structure of meta data server and the collection of statistical information be all complete offline so that actual inquired about when The return carrying out statistical information does not need expended how many run-time overheads, significantly reduces the time delay of cost estimation.
In step 2, selectance according to corresponding to this table for the data volume of each table, field average amount and table is total Line number is calculated.
Selectance according to the statistical information such as upper bound of data value in the lower bound of data value in the row of statistical information gained, row with And correlated condition in Connection inquiring, wherein typically represented with selectivity.
The evaluation method of selectance is accordingly to be calculated according to the querying condition in inquiry and statistical information, obtain table The shared ratio in object set to be inquired about of the middle row meeting querying condition.
Object set therein be table, view or intermediate result set.
The computing formula of data volume size is as follows:
Selectivity represents the selectance of inquiry, and numsOfTableLine is total line number of table or view, avgColSizeiRepresent the average amount of the i-th row field in the table needing to return, j is the columns of table.
Compared to the evaluation method based on table radix for the tradition, it is big that it depends not only upon the line number of inquiry intermediate result generation Little, and also the data volume of estimation is taken into account, thus lifting the accuracy of cost estimation simultaneously.
Advantages of the present invention includes:
Exist for the cost method based on table radix for the tradition and estimate inaccurate problem, deeply consider column file and read Etc. characteristic, increase more fine-grained statistical information, effectively improve the accuracy of cost estimation.
By meta data server storage and maintenance table associated statistical information, it is to avoid repeatedly carry out substantial amounts of analysis work, Decrease run-time overhead, improve the efficiency of cost estimation.
Brief description
Fig. 1 is the inquiry method of attachment overview flow chart based on data volume for one embodiment of the inventive method;
The query processing Organization Chart that Fig. 2 is adopted by present example of the present invention;
Fig. 3 is that in present example of the present invention, meta data server builds flow chart;
Fig. 4 is that in present example of the present invention, statistical information collects flow chart;
Fig. 5 is statistical information querying flow figure in present example of the present invention;
Fig. 6 is data volume estimation flow chart in present example of the present invention;
Fig. 7 is order of connection product process figure in present example of the present invention.
Specific embodiment
The present invention proposes the inquiry method of attachment based on data volume, is carried out cost when inquiring about to multi-join query Estimate, the overall procedure of cost method of estimation is as shown in Figure 1.It carries out the construction work of meta data server first;Then complete Become the collection of statistical information;Obtain the associated statistical information of each table participating in connecting secondly by query metadata server;Connect The estimation work carrying out the relevant parameters such as selectance and data volume according to statistical information;Finally adopt the estimation based on data volume Method calculates the corresponding cost of each executive plan and finds out the optimal order of connection.
In order to more intuitively introduce effect in query optimization for the method proposed by the present invention, now provide the frame of query processing Structure is as shown in Fig. 2 it elaborates the relation between cost estimation module and order of connection generation module based on data volume.Its In, carried out the work of executive plan search in order of connection generation module by related optimization, and the cost based on data volume Estimation module is mainly made up of Cost Model and MetaStore two parts, to complete the work of cost estimation.User is submitted to Inquiry, through parsing after by the work to complete sequential optimization by multi-link sequential query optimization method, it is being executed Plan search during, need the estimation work calling associated costs estimation module to carry out cost, with guarantee to find meet to Determine the Best link order of Cost Model.
The step of the multi-join query cost method of estimation based on data volume proposed by the present invention includes:
Carrying out before inquiry connects firstly the need of building meta data server and by institute in the table in meta data server The statistical information of storage.
Relevant database simultaneously designs table schema, builds meta data server.
In order to the cost method of estimation based on data volume can efficiently be realized it is necessary first to carry out meta data server Construction work, its flow process is as shown in figure 3, comprise the following steps that:
According to the actual demand of enterprise customer and system, choose suitable relevant database(As MYSQL database, Derby database)Meta data server as big data real time inquiry system;
In order to for inquiry system provide table level not, the statistical information of three kinds of granularities such as subregion rank and row rank, if Counting corresponding table schema needs to meet suitable normal form, and simultaneously on the premise of can completing cost estimation, reduce need not as far as possible The storage overhead wanted;
Metadatabase and table relation are created in corresponding database server according to the table schema designing, for follow-up Step uses.
According to designed good table schema, analyze the relation in every table and corresponding statistical information is stored metadata clothes To complete the collection of statistical information in business device;
In order to be attached sequential optimization work to the inquiry after parsing, need to complete after creating meta data server The work that statistical information is collected, its flow process is as shown in figure 4, comprise the following steps that:
Estimate to obtain the expense of statistical information to reduce cost in order of connection optimization process, first pass through corresponding point Analysis sentence or instrument are analyzed work to the table being often attached inquiring about;
Table after analysis is carried out with the collection work of associated statistical information, and this statistical information is stored Metadata Service In the respective table of device, the cost in order to preferably complete based on data volume is estimated, needs collection to include field average length AVG_ The statistical information of the row rank such as COL_LEN, it is given during carrying out table schema design.Wherein statistical information includes:Row The upper bound of data value in the lower bound of data value in name, row, row, table midrange according to for empty quantity, table midrange according to different value Quantity, row in field data average amount and row in field data maximum amount of data, table or view total line number.
Meta data server(I.e. metadatabase)Establishment and the collection of statistical information be and complete offline, then carry out Inquiry.
Step 1, by submitting to inquiry request to obtain the ASSOCIATE STATISTICS letter of each table participating in connecting to meta data server Breath;
This step mainly complete associated statistical information inquiry and obtain work, its flow process as shown in figure 5, concrete steps such as Under:
In order to participate in the corresponding statistical information of each table connecting in being inquired about, need from query optimization module to corresponding unit Data server submits inquiry request to;
The corresponding statistical information of each table relation is returned by meta data server, to complete the acquisition work of statistical information, from And it is used for the calculating of next stage relevant parameter.
Because the structure of meta data server and the collection of statistical information are all to complete offline, therefore this step does not need consuming many Few run-time overhead, greatly reduces the time delay of cost estimation.
Step 2, the statistical information estimation according to getting obtains the data volume of all tables in current executive plan.
Wherein executive plan refers to the inquiry carrying out with the different table order of connection.
Before the corresponding cost carrying out executive plan is estimated, need to complete the estimation work of relevant parameter, relevant parameter Including selectance data amount its flow process of calculating as shown in fig. 6, comprising the following steps that:
By the associated statistical information getting in previous step, carry out first participating in the meter of each table selectance of connection Calculate, step 2-1, accordingly calculated according to the querying condition in Connection inquiring and statistical information, the row being met condition exists Shared ratio in object set to be inquired about.
For any two querying condition comprising in inquiry, the corresponding computing formula of different relations of satisfaction is different:
Inquiry meets selectance selectivity during querying condition A and querying condition B simultaneously(AandB)Computing formula For:
selectivity(AandB)=selectivity(A)×selectivity(B)(1)
Wherein, selectivity(A)Represent the selectance of single query condition A, selectivity(B)Represent single query The selectance of condition B;
Inquiry meets selectance selevtivity during querying condition A or querying condition B(AorB)Computing formula is:
selevtivity(AorB)=P (A)+P (B)-selectivity(AandB)(2)
P(A)Represent the probability of occurrence of querying condition A, P(B)Represent the probability of occurrence of querying condition B;
Selectance selectivity when inquiry meets exclusion querying condition A(notA)Computing formula:
selectivity(ntoA)=1-selectivity(A)(3)
The relation being met between any two querying condition A and B is:Meet simultaneously, meet A or meet B, querying condition It is also possible to as not comprising A.When comprising multiple queries condition and comprising multiple relation between querying condition, can be respectively according to above-mentioned public affairs Formula carries out combination of two to querying condition therein, is calculated according to the relation that each combination is met, obtains final choosing Degree of selecting.
Step 2-2, calculates the data volume of each table according to the selectance of step 2-1 gained, and computing formula is as follows:
Selectivity represents that step 2-1 calculates gained selectance, and numsOfTableLine is the head office of table or view Number, avgColSizeiRepresent the average amount of the i-th row field in the table needing to return, j is the columns of table.
Formula (4) is calculated each table data volume input Cost Model of gained, carry out the cost estimation of multi-join query, from And obtain the cost of different executive plan gained.Compared to the evaluation method based on table radix for the tradition, it depends not only upon inquiry The line number size that intermediate result produces, and also the data volume of estimation is taken into account, thus lifting the standard of cost estimation simultaneously Really property.
Step 3, repeat step 1 and step 2, until the search space of traversal executive plan, find out the minimum table of data volume The order of connection is attached.
In order to find the optimal order of connection, need using base proposed by the present invention in the search procedure of executive plan In the cost method of estimation of data volume, its flow process is as shown in fig. 7, comprise the following steps that:
Order of connection optimization method according to being adopted carries out the space search work of executive plan(I.e. repeat step 1 and Step 2), it passes through to consider the characteristic of real time inquiry system and increase corresponding technology of prunning branches simultaneously to optimize executive plan search Performance, reduce algorithm itself execute query latency;
Obtain the estimated value of the data volume of corresponding executive plan by step 2, find out the execution meeting given Cost Model Plan, and stored;
The Optimum Implementation Plan found out according to above-mentioned steps, to generate the optimal order of connection, due to employing the present invention The cost estimation method proposing, thus effectively increase the accuracy of cost estimation.

Claims (5)

1. a kind of inquiry method of attachment based on data volume is it is characterised in that include:
Step 1, submits inquiry request to meta data server, obtains the statistical information corresponding to each table participating in connecting;
Step 2, the statistical information estimation according to getting obtains the data volume of all tables in current queries executive plan;
Step 3, repeat step 1 and step 2, until the search space of traversal queries executive plan, find out with suitable data amount Make the minimum executive plan of Query Cost, carry out the connection of table by the order of connection in this executive plan;
In step 1, meta data server building mode is to choose relevant database and design the table schema of row rank, according to The table schema designing creates metadatabase and table relation in corresponding relevant database, builds meta data server;
In step 2, total line number meter of the data volume of each table selectance, field average amount and table according to corresponding to this table Obtain;The evaluation method of selectance is accordingly to be calculated according to the querying condition in inquiry and statistical information, obtain in table Meet the shared ratio in object set to be inquired about of the row of querying condition;The computing formula of every table data volume size is such as Under:
s i z e = s e l e c t i v i t y × n u m O f T a b l e L i n e × Σ i = 1 j avgColSize i
Selectivity represents the selectance of inquiry, and numsOfTableLine is total line number of table or view, avgColSizei Represent the average amount of the i-th row field in the table needing to return, j is the columns of table.
2. as claimed in claim 1 the inquiry method of attachment based on data volume it is characterised in that storage in meta data server Statistical information is the statistical information corresponding to every table, and described statistical information carries out counting to table according to the table schema of design Arrive.
3. the inquiry method of attachment based on data volume as claimed in claim 1 is it is characterised in that described relevant database is: MYSQL database, Derby database or oracle database.
4. the inquiry method of attachment based on data volume as claimed in claim 1 is it is characterised in that statistical information includes:Row name, row The upper bound of data value in the lower bound of middle data value, row, table midrange according to be empty quantity, table midrange according to different value quantity, In row the average amount of field data and row in field data maximum amount of data, table or view total line number.
5. as claimed in claim 1 the inquiry method of attachment based on data volume it is characterised in that wherein, meta data server with And statistical information all completes in the storage in meta data server under off-line state.
CN201410124531.1A 2014-03-28 2014-03-28 Query connection method on basis of data volumes Expired - Fee Related CN103927346B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410124531.1A CN103927346B (en) 2014-03-28 2014-03-28 Query connection method on basis of data volumes

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410124531.1A CN103927346B (en) 2014-03-28 2014-03-28 Query connection method on basis of data volumes

Publications (2)

Publication Number Publication Date
CN103927346A CN103927346A (en) 2014-07-16
CN103927346B true CN103927346B (en) 2017-02-15

Family

ID=51145567

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410124531.1A Expired - Fee Related CN103927346B (en) 2014-03-28 2014-03-28 Query connection method on basis of data volumes

Country Status (1)

Country Link
CN (1) CN103927346B (en)

Families Citing this family (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107193813B (en) 2016-03-14 2021-05-14 阿里巴巴集团控股有限公司 Data table connection mode processing method and device
CN106250567A (en) * 2016-08-31 2016-12-21 天津南大通用数据技术股份有限公司 In distributed data base system, table connects system of selection and the device of data distribution mode
CN106446170A (en) * 2016-09-27 2017-02-22 努比亚技术有限公司 Data querying method and device
CN108268536A (en) * 2016-12-30 2018-07-10 北京国双科技有限公司 Database aggregation processing method and device
CN108491516B (en) * 2018-03-26 2021-09-14 哈工大大数据(哈尔滨)智能科技有限公司 Distributed multi-table connection selection method and device based on mixed integer linear programming
CN111625557B (en) * 2020-04-07 2023-04-14 上海熙菱信息技术有限公司 Method for quickly estimating result of multi-condition billion-level data volume
CN112395372A (en) * 2020-12-10 2021-02-23 四川长虹电器股份有限公司 Quick statistical method based on two-dimensional table of relational database system
CN112905591B (en) * 2021-02-04 2022-08-26 成都信息工程大学 Data table connection sequence selection method based on machine learning
CN113010547B (en) * 2021-05-06 2023-04-07 电子科技大学 Database query optimization method and system based on graph neural network
CN113656437B (en) * 2021-07-02 2023-10-03 阿里巴巴新加坡控股有限公司 Model construction method for predicting execution cost stability of reference
CN114090695A (en) * 2022-01-24 2022-02-25 北京奥星贝斯科技有限公司 Query optimization method and device for distributed database
CN114461677B (en) * 2022-04-12 2022-07-26 天津南大通用数据技术股份有限公司 Method for transmitting and adjusting connection sequence based on selection degree
CN117056361A (en) * 2023-07-03 2023-11-14 杭州拓数派科技发展有限公司 Data query method and device for distributed database

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101739451A (en) * 2009-12-03 2010-06-16 南京航空航天大学 Joint query adaptive processing method for grid database
CN102929996A (en) * 2012-10-24 2013-02-13 华南理工大学 XPath query optimization method and system
CN103164495A (en) * 2011-12-19 2013-06-19 中国人民解放军63928部队 Half-connection inquiry optimizing method based on periphery searching and system thereof

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7343369B2 (en) * 2004-11-18 2008-03-11 International Business Machines Corporation Method and apparatus for predicting selectivity of database query join conditions using hypothetical query predicates having skewed value constants

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101739451A (en) * 2009-12-03 2010-06-16 南京航空航天大学 Joint query adaptive processing method for grid database
CN103164495A (en) * 2011-12-19 2013-06-19 中国人民解放军63928部队 Half-connection inquiry optimizing method based on periphery searching and system thereof
CN102929996A (en) * 2012-10-24 2013-02-13 华南理工大学 XPath query optimization method and system

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
基于改进DPhyp算法的Impala查询优化;周强等;《计算机研究与发展》;20131215;全文 *
数据库基于值的查询优化的研究与实践;孟凡辉;《中国优秀硕士学位论文全文数据库 信息科技辑》;20050715;全文 *

Also Published As

Publication number Publication date
CN103927346A (en) 2014-07-16

Similar Documents

Publication Publication Date Title
CN103927346B (en) Query connection method on basis of data volumes
CN110618983B (en) JSON document structure-based industrial big data multidimensional analysis and visualization method
CN109669934B (en) Data warehouse system suitable for electric power customer service and construction method thereof
US8655861B2 (en) Query metadata engine
CN103064875B (en) A kind of spatial service data distributed enquiring method
CN102722531B (en) Query method based on regional bitmap indexes in cloud environment
US8386508B2 (en) System and method for parallel query evaluation
US9244974B2 (en) Optimization of database queries including grouped aggregation functions
US8682875B2 (en) Database statistics for optimization of database queries containing user-defined functions
CN104778540A (en) BOM (bill of material) management method and management system for building material equipment manufacturing
CN104834754A (en) SPARQL semantic data query optimization method based on connection cost
US20110022581A1 (en) Derived statistics for query optimization
CN103019728A (en) Effective complex report parsing engine and parsing method thereof
JP6530829B2 (en) Data multidimensional model generation system and data multidimensional model generation method
WO2015074477A1 (en) Path analysis method and apparatus
CN109739882B (en) Big data query optimization method based on Presto and Elasticissearch
CN105159971A (en) Cloud platform data retrieval method
WO2021012861A1 (en) Method and apparatus for evaluating data query time consumption, and computer device and storage medium
CN103294791A (en) Extensible markup language pattern matching method
CN112800083A (en) Government decision-oriented government affair big data analysis method and equipment
CN111522918A (en) Data aggregation method and device, electronic equipment and computer readable storage medium
CN117520386A (en) Index query method, system, electronic device and storage medium
CN117390064B (en) Database query optimization method based on embeddable subgraph
CN112883063A (en) SPARQL query processing method on partition-based distributed RDF system
Ming et al. Research on multidimensional analysis method of drilling information based on Hadoop

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20170215

Termination date: 20200328

CF01 Termination of patent right due to non-payment of annual fee