CN104123288A - Method and device for inquiring data - Google Patents

Method and device for inquiring data Download PDF

Info

Publication number
CN104123288A
CN104123288A CN201310146187.1A CN201310146187A CN104123288A CN 104123288 A CN104123288 A CN 104123288A CN 201310146187 A CN201310146187 A CN 201310146187A CN 104123288 A CN104123288 A CN 104123288A
Authority
CN
China
Prior art keywords
inquiry
inquiry clause
clause
data
metadata
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201310146187.1A
Other languages
Chinese (zh)
Other versions
CN104123288B (en
Inventor
乔平
夏晨
傅跃兵
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang Tmall Technology Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201310146187.1A priority Critical patent/CN104123288B/en
Publication of CN104123288A publication Critical patent/CN104123288A/en
Application granted granted Critical
Publication of CN104123288B publication Critical patent/CN104123288B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2453Query optimisation

Abstract

The invention discloses a method and device for inquiring data. The method comprises the steps that an inquiry statement is decomposed into one or more inquiry clauses; traversal is carried out on the inquiry clauses, the inquiry clauses are matched with metadata, and the inquiry clauses are replaced with execution results stored in the metadata; otherwise, the inquiry clauses are executed, and the metadata are updated; the final execution result is used for main inquiry to obtain an inquiry result. The executed inquiry clauses and the execution results of the executed inquiry clauses are used as the metadata to be managed, the identical inquiry clauses in the current session or different sessions are directly replaced with the execution results stored in the metadata, the inquiry clauses are prevented from being executed repeatedly, and therefore the inquiry performance of the whole system is improved to a greater extent. Meanwhile, compared with optimization of the existing inquiry clause level, the inquiry optimization granularity is thinner, the method and device are more suitable for inquiring large data in data warehouse application, and therefore the inquiry performance of the whole system is further improved.

Description

A kind of data enquire method and device
Technical field
The application relates to data storage and analytical technology, espespecially a kind of in data warehouse applications, for magnanimity, read to write few data more and carry out data enquire method and device.
Background technology
In many applications such as scientific research, Computer Simulation, internet, applications, ecommerce, data volume increases at a terrific speed.The explosive increase of information explosion, the especially unstructured data of large data age, just whole data storage and analysis field in profound influence.
In order to meet emerging business demand, start the data processing method of abandoning tradition gradually, then attempt new pattern, the various types of data including unstructured data is conducted interviews, processed and analyzes.At data storage and analysis field, MapReduce is undoubtedly technology of new generation of greatest concern, utilize MapReduce programming framework, developer can develop across program processor distributed type assemblies or stand-alone computer, can parallel processing magnanimity unstructured data.Wherein, MapReduce is a kind of programming model, and for the concurrent operation of large-scale data, Map represents mapping, and Reduce represents abbreviation.
At large data age, traditional Structured Query Language (SQL) (SQL, Structure Query Language) can not meet all business demands, although and the application of MapReduc is very extensive, but, the DLL (dynamic link library) that MapReduc framework comes out is still more rudimentary, develop more consuming time, and code be difficult to multiplexing.Then, turn to the SQL that user can be write to be converted into corresponding MapReduce program, utilize MapReduce framework to carry out the program of these tasks so that the mass data in Hadoop distributed file system (HDFS) is processed.At present most widely used is the Hive of Facebook contribution.Hive based on MapReduce is having powerful advantage aspect extendability and fault-tolerance.Wherein, Hive is a Tool for Data Warehouse based on Hadoop, structurized data file can be mapped as to a database table, and complete SQL query function is provided, and SQL statement can be converted to MapReduce task and move.
In business data mining process, conventionally the mining task of carrying out every day all can be hundreds and thousands of, even reach up to ten thousand, so also just there is unavoidably the problem of double counting in different mining tasks, such as: slip-stick artist A need to filter out the data of a collection of condition W from table S1 and table S2, then carries out with table S3 the data that combination regeneration becomes slip-stick artist A to want.Meanwhile, slip-stick artist B also need to filter out identical data with the same terms W from table S1 and table S2, then with Table X combination, the data that regeneration slip-stick artist B wants.Between these two slip-stick artists, conventionally and do not know the demand that has each other part identical, so in production task, identical calculating can carry out twice, identical common query information is repeated accessing.Undoubtedly, reduced like this query performance of total system.
Summary of the invention
In order to solve the problems of the technologies described above, the application provides a kind of data enquire method and device, can avoid reruning of common query information, thereby improves the query performance of total system.
In order to reach the application's object, the application provides a kind of data enquire method, it is characterized in that, comprising: query statement is decomposed into one or more inquiry clauses;
Traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata, otherwise, execution inquiry clause more new metadata;
The execution result finally obtaining is obtained to Query Result for main inquiry.
Before the method, also comprise: obtain query statement, the validity of revene lookup statement and correctness.
The method also comprises: described query statement is standardized and query optimization.
Described traversal queries clause specifically comprises:
The inquiry clause operator tree that described inquiry clause is formed is carried out postorder traversal, utilizes the triplet information of inquiry clause to mate with metadata,
If the triplet information of described inquiry clause exists in metadatabase, the triplet information of this inquiry clause and meta data match, replaces with corresponding execution result by described inquiry clause; The out-of-service time of this metadata of resetting;
If the triplet information of described inquiry clause does not exist in metadatabase, the triplet information of this inquiry clause is not mated with metadata, carries out described inquiry clause, persistence execution result, and in metadata, add corresponding triplet information; The out-of-service time of this metadata is set, and this inquiry clause is in the level of inquiry clause operator tree.
When the triplet information of described inquiry clause and meta data match, also comprise: record corresponding described execution result is added to one by access times.
Described, when query statement is decomposed into multiple queries clause, the method also comprises: optimize described inquiry clause.
The method also comprises: set in advance inquiry clause optimisation strategy;
Described Optimizing Queries clause comprises: be optimized decomposing the inquiry clause obtaining according to described inquiry clause optimisation strategy, reject unnecessary inquiry clause.
Described inquiry clause optimisation strategy comprises:
In the time that the input set of inquiry clause collects consistent with output, delete described inquiry clause; And/or,
Difference when between inquiry clause is output row, and the output of the first inquiry clause row are while being the subset of output row of the second inquiry clause, carry out the second inquiry clause, and the Output rusults collection of the second inquiry clause is incorporated to the first inquiry clause; And/or,
Difference when between inquiry clause is output row, and the output of the first inquiry clause row and the output of the second inquiry clause are while being listed as each other supplementary set, merge the first inquiry clause and the second inquiry clause; And/or,
When the input set of inquiry clause identical, when filtercondition is different, extract described input set identical, the public part of the different each inquiry clause of filtercondition forms new inquiry clause, carry out this new inquiry clause, and it is identical that its Output rusults collection is incorporated to respectively to described input set, each inquiry clause that filtercondition is different; And/or,
The inquiry clause of described existence or relation is split as to two or more inquiry clauses.
Described metadata store is in metadatabase; The method also comprises described metadatabase is managed:
The out-of-service time corresponding when described metadata expires, or the User-Defined Functions UDF that generates described inquiry clause result data is expired, or generate that in the input data set of described inquiry clause result data, data change occurs for one of them, or the output collection of described inquiry clause is while being reclaimed by force by user, deletes the inquiry clause losing efficacy;
And/or,
According to scanning metadata the interval time of original setting, delete the inquiry clause information losing efficacy.
The inquiry clause that described deletion was lost efficacy comprises: the inquiry clause of deleting all execution results that depend on described inquiry clause;
If the execution result that described inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
Described metadata comprises: out-of-service time, the inquiry clause of inquiry clause corresponding to triplet information, the triplet information corresponding with each inquiry clause are identified at level, the Query Result of query node tree by access times, inquiry clause.
Described tlv triple comprises input set, output collection, and completes the operational order collection that is input to output conversion.
The application provides a kind of data query device, at least comprises metadatabase, decomposing module, processing module and output module, wherein,
Metadatabase, for storing metadata, comprises the triplet information corresponding with each inquiry clause, and out-of-service time of inquiry clause corresponding to triplet information, inquiry clause be level, the Query Result identification information in query node tree by access times, inquiry clause;
Decomposing module, for being decomposed into query statement one or more inquiry clauses;
Processing module, for traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata; Otherwise, execution inquiry clause more new metadata;
Output module, for obtaining Query Result by the execution result finally obtaining for main inquiry.
This device also comprises: acquisition module, and for obtaining query statement, the validity of revene lookup statement and correctness.
Described acquisition module, also for standardizing and query optimization to described query statement.
Described processing module, mate with metadata specifically for the triplet information of utilizing inquiry clause:
In the time that the triplet information of inquiry clause exists in metadatabase, inquiry clause is replaced with to execution result; The out-of-service time of this metadata of resetting;
In the time that the triplet information of inquiry clause does not exist in metadatabase, carry out current inquiry clause, persistence execution result, and in metadata, add corresponding triplet information; The out-of-service time of this metadata is set, and this inquiry clause is in the level of inquiry clause operator tree.
Described processing module also for, in the time of the triplet information of described inquiry clause and meta data match, record corresponding described execution result is added to one by access times.
Described output module, specifically for carrying out the execution result of subquery in described inquiry clause iteration and for main inquiry, obtain Query Result.
In described decomposing module, set in advance inquiry clause optimisation strategy;
Described decomposing module, also for being optimized decomposing the inquiry clause obtaining according to optimisation strategy.
Described processing module, also for metadatabase is managed, remove the inquiry clause losing efficacy:
The out-of-service time corresponding when described metadata expires, or the UDF that generates described inquiry clause result data is expired, or generate that in the input data set of described inquiry clause result data, data change occurs for one of them, or the output collection of described inquiry clause is while being reclaimed by force by user, deletes the inquiry clause losing efficacy;
And/or,
According to scanning metadata the interval time of original setting, delete the inquiry clause losing efficacy;
Wherein, the inquiry clause that described deletion was lost efficacy is the inquiry clause of deleting all execution results that depend on described inquiry clause; If the execution result that described inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
Compared with prior art, the application comprises query statement is decomposed into one or more inquiry clauses; Traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata, otherwise, execution inquiry clause more new metadata; The execution result finally obtaining is obtained to Query Result for main inquiry.The application manages the inquiry clause of having carried out and execution result thereof as metadata, for same queries clause in current sessions or different sessions, directly inquiry clause is replaced with to the execution result of preserving in metadata, avoid repeating of inquiry clause, thereby improved to a greater extent the query performance of total system.Simultaneously, the application's query statement is decomposed into multiple queries clause, compared with the optimal way of existing query statement aspect, the more refinement of query optimization granularity of the application's data enquire method, be more suitable for the large data query in data warehouse applications, thereby further improved the query performance of total system.
The application's further feature and advantage will be set forth in the following description, and, partly from instructions, become apparent, or understand by implementing the application.The application's object and other advantages can be realized and be obtained by specifically noted structure in instructions, claims and accompanying drawing.
Brief description of the drawings
Accompanying drawing is used to provide the further understanding to present techniques scheme, and forms a part for instructions, is used from the application's embodiment mono-technical scheme of explaining the application, does not form the restriction to present techniques scheme.
Fig. 1 is the process flow diagram of the application's data enquire method;
Fig. 2 is the composition structural representation of the application's data query device.
Embodiment
For making the application's object, technical scheme and advantage clearer, hereinafter in connection with accompanying drawing, the application's embodiment is elaborated.It should be noted that, in the situation that not conflicting, the combination in any mutually of the feature in embodiment and embodiment in the application.
In typical configuration of the application, computing equipment comprises one or more processors (CPU), input/output interface, network interface and internal memory.
Internal memory may comprise the volatile memory in computer-readable medium, and the forms such as random access memory (RAM) and/or Nonvolatile memory, as ROM (read-only memory) (ROM) or flash memory (flashRAM).Internal memory is the example of computer-readable medium.
Computer-readable medium comprises that permanent and impermanency, removable and non-removable media can realize information storage by any method or technology.Information can be module or other data of computer-readable instruction, data structure, program.The example of the storage medium of computing machine comprises, but be not limited to phase transition internal memory (PRAM), static RAM (SRAM), dynamic RAM (DRAM), the random access memory (RAM) of other types, ROM (read-only memory) (ROM), Electrically Erasable Read Only Memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc ROM (read-only memory) (CD-ROM), digital versatile disc (DVD) or other optical memory, magnetic magnetic tape cassette, the storage of tape magnetic rigid disk or other magnetic storage apparatus or any other non-transmission medium, can be used for the information that storage can be accessed by computing equipment.According to defining herein, computer-readable medium does not comprise non-temporary computer readable media (transitory media), as data-signal and the carrier wave of modulation.
Can in the computer system such as one group of computer executable instructions, carry out in the step shown in the process flow diagram of accompanying drawing.And, although there is shown logical order in flow process, in some cases, can carry out shown or described step with the order being different from herein.
Inquiry clause herein refers to, in the time that an inquiry is the condition of another inquiry, claims that this inquiry is subquery, and wherein, outer query is referred to as main inquiry.Subquery is better than main inquiry first to be carried out, and the execution result of subquery is for main inquiry.Subquery and main inquiry are referred to as to inquiry clause herein.An inquiry clause has definite input set Pin and output collection Pout, and completes the operational order collection Φ that is input to output conversion operatordeng three part compositions, and referred to as inquiry clause tlv triple, be designated as τ={ Pin, Pout, Φ operator, for unique identification inquiry clause.The input source of an inquiry clause, or be another one inquiry clause, or be exactly from table Table or table subregion Partition, be designated as Pin={dom| τ, Table, Partition}.
Fig. 1 is the process flow diagram of the application's data enquire method, as shown in Figure 1, mainly comprises:
Step 100: query statement is decomposed into one or more inquiry clauses.
In this step, before inquiry, first query statement is decomposed into one or more inquiry clauses, and the inquiry clause that utilizes decomposition to obtain is constructed definite inquiry clause operator tree.Each inquiry clause is made up of with operation operator tree the input set of determining, output collection, and wherein, operation operator is set by one group of operational order collection Φ operatorform; Common operation Operator instruction is as shown in table 1:
Operation Operator instruction Implication
TableScanOperator Scan table data
SelectOperator Select output row
FilterOperator Filter input data
FileSinkOperator Set up result data, export file to
ReduceSinkOperator Create right to<Key, the Value>that send to Reducer end
JoinOperator Multi-source data JOIN
GroupByOperator GroupBy statement
LimitOperator Limit statement
UnionOperator Union statement
?
Table 1
In this step, the specific implementation of the method that decomposition query statement is inquiry clause belongs to those skilled in the art's conventional techniques means, and its specific implementation is also not used in the protection domain that limits the application.This step is emphasized, before inquiry, query statement is decomposed into one or more inquiry clauses.Like this, compared with the existing inquiry mode that is positioned at query statement layer, more refinement of the granularity of query of the application's data enquire method, is more suitable for the large data query in data warehouse applications.
Before step 100, also comprise: obtain query statement, the validity of revene lookup statement and correctness, also further comprise that the query statement to obtaining standardizes and query optimization.Wherein, standardization is exactly normalization with the object of query optimization, according to format rule and the optimisation strategy of known set, the query statement of realizing identical function is expressed by a kind of mode of common the best.Conventional optimisation strategy has that row are reduced, subregion is reduced, push away under predicate etc.Specific implementation belongs to those skilled in the art's conventional techniques means, repeats no more here, is also not used in the protection domain that limits the application.
Step 101: traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata; Otherwise, execution inquiry clause more new metadata.
Wherein, metadata store is in the metadatabase setting in advance, metadata comprises the triplet information corresponding with each inquiry clause, also comprises the out-of-service time of inquiry clause corresponding to triplet information, the information such as level, Query Result mark that inquiry clause is set at query node by access times, inquiry clause.Wherein, the out-of-service time, for the out-of-service time then, remove metadata, reclaim the shared resource of long-term no inquiry clause; By access times, for the execution priority of the angle Optimizing Queries clause from entire system, be can be used as by access times and query node number (level) weight that priority is calculated, such as inquiry times is more, priority is more high, and the higher inquiry clause of priority is enjoyed more computational resource and temporal right of priority; The level of inquiry clause is that the maximal value of each inquiry clause level in input set adds 1, if input set is table or table subregion, inquiring about level is 0; Query Result mark can be understood as table name and claims, if inquiry clause root node and user have specified table name to claim, uses this table name to claim; Otherwise the table name of getting a temporary table claims, user is not needed transparent, whether the existence of not perception of user temporary table to be.According to this table name claim can locating query clause result set data.
This step specifically comprises: inquiry clause operator tree definite in step 100 traveled through as postorder traversal, utilizes the triplet information of inquiry clause to mate with metadata,
If the triplet information of current inquiry clause exists in metadatabase, the triplet information of this inquiry clause and meta data match, now, show to carry out before this inquiry clause, and the data of execution result are effective, so, directly inquiry clause is replaced with to the table relevant information that the execution result of preserving in metadata is corresponding record; The out-of-service time (if session parameter setting does not exist, can Use Defaults as 3 days) of resetting this metadata is set according to already-existing session parameter, further, this record can be added to one by access times;
If the triplet information of current inquiry clause does not exist in metadatabase, the triplet information of this inquiry clause is not mated with metadata, now, show not carry out before this inquiry clause, so, carry out current inquiry clause, persistence execution result is kept in distributed file storage system by execution result, and in metadata, adds corresponding triplet information; The out-of-service time (if session parameter setting does not exist, can Use Defaults as 3 days) of this metadata is set according to already-existing session parameter, and this inquiry clause is in the degree of depth (level) of inquiry clause operator tree.It should be noted that, if inquiry clause is subquery, output data set title identifies taking TMP_ as prefix, and concrete title can generate at random, ensures that title is that the overall situation is unique; If inquiry clause is main inquiry, if there is definition output table name to claim, use this title, the juxtaposition out-of-service time is never lost efficacy (table that respective user creates, only just invalid in the time that user deletes); Otherwise method to set up is consistent with subquery.
Step 102: the execution result finally obtaining is obtained to Query Result for main inquiry.
Traveling through after all inquiry clauses, the execution result of subquery in inquiry clause is carried out iteration by this step, is used for main inquiry by the execution result of inquiry clause operator tree root node inquiry clause, finally obtains Query Result.
To sum up, the application specifically comprises: query statement is decomposed into one or more inquiry clauses, output inquiry clause operator tree; Traversal queries clause operator tree, inquiry clause and meta data match, replace with by the inquiry clause of coupling the execution result of preserving in metadata; Carry out unmatched inquiry clause and execution result information is updated to metadata; The execution result of inquiry clause operator tree root node inquiry clause is obtained to Query Result for main inquiry.
Can see from the matching process of the application's step 101, the application manages the inquiry clause of having carried out and execution result thereof as metadata, for inquiry clause identical in current sessions or different sessions, directly inquiry clause is replaced with to the execution result of preserving in metadatabase, avoid repeating of inquiry clause, thereby improved to a greater extent the query performance of total system.Simultaneously, the application's method was decomposed into multiple queries clause by query statement before inquiry, compared with the optimization of existing query statement aspect, the more refinement of query optimization granularity of the application's data enquire method, be more suitable for the large data query in data warehouse applications, thereby further improved the query performance of total system.
Further, in the application's step 100, in the time that query statement is decomposed into multiple queries clause, can also comprise: Optimizing Queries clause.By the further optimization to inquiry clause, weed out unnecessary inquiry clause, reduce unnecessary inquiry, thereby further improved the query performance of total system.
Optimizing Queries clause specifically comprise: set in advance inquiry clause optimisation strategy, be optimized decomposing the inquiry clause obtaining according to optimisation strategy, reject unnecessary inquiry clause.Wherein, optimisation strategy can include, but are not limited to following several:
1) input set of inquiry clause and output collection consistent (Pin=Pout), it is the situation that inquiry clause is directly equal to input set, the result data collection of this inquiry clause itself does not have any help to optimizing, do not need synchronously to add in the metadatabase of inquiry clause, delete this inquiry clause;
2) difference between inquiry clause is only output row, and the output of inquiry clause row are subsets of another one inquiry clause output row, such as:
Inquiry clause is a): SELECT name FROM employee WHERE title=' TeamLeader ';
Inquiry clause is b): SELECT*FROM employee WHERE title=' TeamLeader ';
Wherein, inquiry clause a completes the inquiry to employee table name field, takes out the value that meets title field and equal the record of TeamLeader; Inquiry clause b completes the inquiry of employee being shown to all fields, takes out the value that meets title field and equal the record of TeamLeader.
That is to say, inquiry clause data result collection a) is inquiry clause subset b); Now, the method for optimization is: first carry out inquiry clause b), suppose that Output rusults collection is: TMP_20130318_012345, so, inquiry clause can be optimized for a): SELECT name FROM TMP_20130318_012345.This situation is equivalent to inquiry clause Output rusults collection b) to be incorporated to inquiry clause a), thereby has also deleted inquiry clause b) in a) having simplified inquiry clause.
3) difference between inquiry clause is only output row, and the output row of an inquiry clause are listed as and have each other supplementary set with another one output, such as:
Inquiry clause is a): SELECT name FROM employee WHERE title=' TeamLeader ';
Query statement is b): SELECT addr FROM employee WHERE title=' TeamLeader ';
Wherein, inquiry clause a) completes the inquiry to employee table name field, and the value of taking out all title of meeting fields equals the record of TeamLeader; Inquiry clause b) completes the inquiry to employee table addr field, and the value of taking out all title of meeting fields equals the record of TeamLeader.
Now, the method for optimization is: these two inquiry clauses are merged, the inquiry clause after merging be inquiry clause c): SELECT name, addr FROM employee WHERE title=' 11 '.
4) two inquiry clause input sets are identical, filtercondition difference, such as:
Inquiry clause is a): SELECT name FROM employee WHERE title=' TeamLeader ';
Inquiry clause is b): SELECT addr FROM employee WHERE title=' TeamLeader ' and name=' Smich ';
Wherein, inquiry clause a) completes the inquiry to employee table name field, and the value of taking out all title of meeting fields equals the record of TeamLeader; Inquiry clause b) completes the inquiry to employee table addr field, and the value that the value of taking out all title of meeting fields equals TeamLeader and name field equals the record of Smich;
Now, the method for optimization is: extract the public part composition inquiry clause of inquiry clause c), such as:
Inquiry clause is c): SELECT name, addr FROM employee WHERE title=' TeamLeader ';
Wherein, inquiry clause c) completes employee table name, the inquiry of addr field, and the value of taking out all title of meeting fields equals the record of TeamLeader.
Suppose that inquiry clause Output rusults collection c) is: TMP_20130318_222222, so, can simplify inquiry clause according to the Output rusults collection obtaining, particularly:
Inquiry clause a) can be optimized for: SELECT name FROM TMP_20130318_222222;
Inquiry clause b) is optimized for: SELECT addr FROM TMP_20130318_222222 WHERE name=' Smich ';
5) inquiry clause of existence or relation is split as to more than two or two inquiry clause, can improves like this common query clause's hit rate.Such as for
Inquiry clause is a): SELECT*FROM employee WHERE title=' TeamLeader ' or title=' Manager ';
Wherein, inquiry clause a) completes the inquiry of employee being shown to all fields, and the value of taking out all title of meeting fields equals the record of TeamLeader or Manager.
Can be split as:
Inquiry clause is b): SELECT*FROM employee WHERE title=' TeamLeader ';
Inquiry clause is c): SELECT*FROM employee WHERE title=' Manager '
Wherein, inquiry clause b) completes the inquiry of employee being shown to all fields, and the value of taking out all title of meeting fields equals the record of TeamLeader; Inquiry clause c) completes the inquiry of employee being shown to all fields, and the value of taking out all title of meeting fields equals the record of Manager.
The application's method also comprises: metadatabase is managed, i.e. certain interval of time scanning metadata, removes expired inquiry clause.Wherein, can be set in advance as interval time day or hour;
Except normal condition is as expire cause expired and removing corresponding inquiry clause of out-of-service time, also have several situations also can cause the result data of inquiry clause expired, such as: the User-Defined Functions (UDF) that generates certain inquiry clause result data is expired; And for example: generate that in the input data set of certain inquiry clause result data, data change has occurred for one of them; For another example: the output collection of certain inquiry clause is reclaimed by force by user.When inquiry clause lost efficacy, mean that the inquiry clause of all execution results that depend on this inquiry clause lost efficacy simultaneously; If the execution result that inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
The application also provides a kind of data query device, as shown in Figure 2, at least comprises metadatabase, decomposing module, processing module and output module, wherein,
Metadatabase, for storing metadata, comprise the triplet information corresponding with each inquiry clause, also comprise the out-of-service time of inquiry clause corresponding to triplet information, the information such as level, Query Result mark that inquiry clause is set at query node by access times, inquiry clause.
Decomposing module, for being decomposed into query statement one or more inquiry clauses;
Processing module, for traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata; Otherwise, execution inquiry clause more new metadata;
Output module, for obtaining Query Result by the execution result finally obtaining for main inquiry.
The application's data query device also comprises: acquisition module, and for obtaining query statement, the validity of revene lookup statement and correctness; Be further used for, the query statement obtaining is standardized and query optimization.
Wherein,
Processing module, mate with metadata specifically for the triplet information of utilizing inquiry clause:
In the time that the triplet information of inquiry clause exists in metadatabase, inquiry clause is replaced with to the table relevant information that execution result is corresponding record; The out-of-service time of this metadata of resetting, further, this record is added to one by access times;
In the time that the triplet information of inquiry clause does not exist in metadatabase, carry out current inquiry clause, persistence execution result, and in metadata, add corresponding triplet information; The out-of-service time of this metadata is set, and this inquiry clause is in the degree of depth (level) of inquiry clause operator tree.
Output module, specifically for carrying out the execution result of subquery in inquiry clause iteration and for main inquiry, finally obtaining Query Result.
Outside, in decomposing module, also set in advance inquiry clause optimisation strategy, decomposing module also for, be optimized decomposing the inquiry clause that obtains according to optimisation strategy.
Processing module, also for metadatabase is managed, remove the inquiry clause losing efficacy:
The out-of-service time corresponding when described metadata expires, or the UDF that generates described inquiry clause result data is expired, or generate that in the input data set of described inquiry clause result data, data change occurs for one of them, or the output collection of described inquiry clause is while being reclaimed by force by user, deletes the inquiry clause losing efficacy;
And/or,
According to scanning metadata the interval time of original setting, delete the inquiry clause losing efficacy;
Wherein, the inquiry clause that described deletion was lost efficacy is the inquiry clause of deleting all execution results that depend on described inquiry clause; If the execution result that described inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
From the application's device, the application manages the inquiry clause of having carried out and execution result thereof as metadata, for inquiry clause identical in current sessions or different sessions, directly inquiry clause is replaced with to the execution result of preserving in metadatabase, avoid repeating of inquiry clause, thereby improved to a greater extent the query performance of total system.Simultaneously, the application's method was decomposed into multiple queries clause by query statement before inquiry, compared with the optimization of existing query statement aspect, the more refinement of query optimization granularity of the application's data enquire method, be more suitable for the large data query in data warehouse applications, thereby further improved the query performance of total system.
In addition, the application, by the further optimization to inquiry clause, weeds out unnecessary inquiry clause, has reduced unnecessary inquiry, thereby has further improved the query performance of total system.
It is apparent to those skilled in the art that each ingredient of the device that above-mentioned the embodiment of the present application provides, and each step in method, they can concentrate on single calculation element, or are distributed on the network that multiple calculation elements form.Alternatively, they can be realized with the executable program code of calculation element.Thereby, they can be stored in memory storage and be carried out by calculation element, or they are made into respectively to each integrated circuit modules, or the multiple modules in them or step are made into single integrated circuit module realize.Like this, the application is not restricted to any specific hardware and software combination.
Although the disclosed embodiment of the application as above, the embodiment that described content only adopts for ease of understanding the application, not in order to limit the application.Those of skill in the art under any the application; do not departing under the prerequisite of the disclosed spirit and scope of the application; can in the form of implementing and details, carry out any amendment and variation; but the application's scope of patent protection, still must be as the criterion with the scope that appending claims was defined.

Claims (20)

1. a data enquire method, is characterized in that, comprising:
Query statement is decomposed into one or more inquiry clauses;
Traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata, otherwise, execution inquiry clause more new metadata;
The execution result finally obtaining is obtained to Query Result for main inquiry.
2. data enquire method according to claim 1, is characterized in that, also comprises: obtain query statement, the validity of revene lookup statement and correctness before the method.
3. data enquire method according to claim 2, is characterized in that, the method also comprises: described query statement is standardized and query optimization.
4. according to the data enquire method described in claim 1~3 any one, it is characterized in that, described traversal queries clause specifically comprises:
The inquiry clause operator tree that described inquiry clause is formed is carried out postorder traversal, utilizes the triplet information of inquiry clause to mate with metadata,
If the triplet information of described inquiry clause exists in metadatabase, the triplet information of this inquiry clause and meta data match, replaces with corresponding execution result by described inquiry clause; The out-of-service time of this metadata of resetting;
If the triplet information of described inquiry clause does not exist in metadatabase, the triplet information of this inquiry clause is not mated with metadata, carries out described inquiry clause, persistence execution result, and in metadata, add corresponding triplet information; The out-of-service time of this metadata is set, and this inquiry clause is in the level of inquiry clause operator tree.
5. data enquire method according to claim 4, is characterized in that, when the triplet information of described inquiry clause and meta data match, also comprises: record corresponding described execution result is added to one by access times.
6. according to the data enquire method described in claim 1~3 any one, it is characterized in that, described, when query statement is decomposed into multiple queries clause, the method also comprises: optimize described inquiry clause.
7. data enquire method according to claim 6, is characterized in that, the method also comprises: set in advance inquiry clause optimisation strategy;
Described Optimizing Queries clause comprises: be optimized decomposing the inquiry clause obtaining according to described inquiry clause optimisation strategy, reject unnecessary inquiry clause.
8. data enquire method according to claim 7, is characterized in that, described inquiry clause optimisation strategy comprises:
In the time that the input set of inquiry clause collects consistent with output, delete described inquiry clause; And/or,
Difference when between inquiry clause is output row, and the output of the first inquiry clause row are while being the subset of output row of the second inquiry clause, carry out the second inquiry clause, and the Output rusults collection of the second inquiry clause is incorporated to the first inquiry clause; And/or,
Difference when between inquiry clause is output row, and the output of the first inquiry clause row and the output of the second inquiry clause are while being listed as each other supplementary set, merge the first inquiry clause and the second inquiry clause; And/or,
When the input set of inquiry clause identical, when filtercondition is different, extract described input set identical, the public part of the different each inquiry clause of filtercondition forms new inquiry clause, carry out this new inquiry clause, and it is identical that its Output rusults collection is incorporated to respectively to described input set, each inquiry clause that filtercondition is different; And/or,
The inquiry clause of described existence or relation is split as to two or more inquiry clauses.
9. according to the data enquire method described in claim 1~3 any one, it is characterized in that, described metadata store is in metadatabase; The method also comprises described metadatabase is managed:
The out-of-service time corresponding when described metadata expires, or the User-Defined Functions UDF that generates described inquiry clause result data is expired, or generate that in the input data set of described inquiry clause result data, data change occurs for one of them, or the output collection of described inquiry clause is while being reclaimed by force by user, deletes the inquiry clause losing efficacy;
And/or,
According to scanning metadata the interval time of original setting, delete the inquiry clause information losing efficacy.
10. data enquire method according to claim 9, is characterized in that, the inquiry clause that described deletion was lost efficacy comprises: the inquiry clause of deleting all execution results that depend on described inquiry clause;
If the execution result that described inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
11. require the data enquire method described in 10 according to claim, it is characterized in that, described metadata comprises: out-of-service time, the inquiry clause of inquiry clause corresponding to triplet information, the triplet information corresponding with each inquiry clause are identified at level, the Query Result of query node tree by access times, inquiry clause.
12. data enquire methods according to claim 11, is characterized in that, described tlv triple comprises input set, output collection, and completes the operational order collection that is input to output conversion.
13. 1 kinds of data query devices, is characterized in that, at least comprise metadatabase, decomposing module, processing module and output module, wherein,
Metadatabase, for storing metadata, comprises the triplet information corresponding with each inquiry clause, and out-of-service time of inquiry clause corresponding to triplet information, inquiry clause be level, the Query Result identification information in query node tree by access times, inquiry clause;
Decomposing module, for being decomposed into query statement one or more inquiry clauses;
Processing module, for traversal queries clause, inquiry clause and meta data match, replaced with the execution result of preserving in metadata; Otherwise, execution inquiry clause more new metadata;
Output module, for obtaining Query Result by the execution result finally obtaining for main inquiry.
14. data query devices according to claim 13, is characterized in that, this device also comprises: acquisition module, and for obtaining query statement, the validity of revene lookup statement and correctness.
15. data query devices according to claim 14, is characterized in that, described acquisition module, also for standardizing and query optimization to described query statement.
16. according to the data query device described in claim 13~15 any one, it is characterized in that described processing module is mated with metadata specifically for the triplet information of utilizing inquiry clause:
In the time that the triplet information of inquiry clause exists in metadatabase, inquiry clause is replaced with to execution result; The out-of-service time of this metadata of resetting;
In the time that the triplet information of inquiry clause does not exist in metadatabase, carry out current inquiry clause, persistence execution result, and in metadata, add corresponding triplet information; The out-of-service time of this metadata is set, and this inquiry clause is in the level of inquiry clause operator tree.
17. data query devices according to claim 16, is characterized in that, described processing module also for, in the time of the triplet information of described inquiry clause and meta data match, record corresponding described execution result is added to one by access times.
18. according to the data query device described in claim 13~15 any one, it is characterized in that, described output module, specifically for carrying out the execution result of subquery in described inquiry clause iteration and for main inquiry, obtain Query Result.
19. according to the data query device described in claim 13~15 any one, it is characterized in that, has set in advance inquiry clause optimisation strategy in described decomposing module;
Described decomposing module, also for being optimized decomposing the inquiry clause obtaining according to optimisation strategy.
20. according to the data query device described in claim 13~15 any one, it is characterized in that described processing module, also for metadatabase is managed, is removed the inquiry clause losing efficacy:
The out-of-service time corresponding when described metadata expires, or the UDF that generates described inquiry clause result data is expired, or generate that in the input data set of described inquiry clause result data, data change occurs for one of them, or the output collection of described inquiry clause is while being reclaimed by force by user, deletes the inquiry clause losing efficacy;
And/or,
According to scanning metadata the interval time of original setting, delete the inquiry clause losing efficacy;
Wherein, the inquiry clause that described deletion was lost efficacy is the inquiry clause of deleting all execution results that depend on described inquiry clause; If the execution result that described inquiry clause is corresponding is intermediate result, from distributed file storage system, delete this intermediate result simultaneously.
CN201310146187.1A 2013-04-24 2013-04-24 A kind of data query method and device Active CN104123288B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201310146187.1A CN104123288B (en) 2013-04-24 2013-04-24 A kind of data query method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201310146187.1A CN104123288B (en) 2013-04-24 2013-04-24 A kind of data query method and device

Publications (2)

Publication Number Publication Date
CN104123288A true CN104123288A (en) 2014-10-29
CN104123288B CN104123288B (en) 2018-06-01

Family

ID=51768701

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201310146187.1A Active CN104123288B (en) 2013-04-24 2013-04-24 A kind of data query method and device

Country Status (1)

Country Link
CN (1) CN104123288B (en)

Cited By (34)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462317A (en) * 2014-12-01 2015-03-25 苏州朗米尔照明科技有限公司 Data screening method based on tag
CN104866608A (en) * 2015-06-05 2015-08-26 中国人民大学 Query optimization method based on join index in data warehouse
CN105930407A (en) * 2016-04-18 2016-09-07 北京思特奇信息技术股份有限公司 Cross-database associated query method and system for distributed database
CN106325832A (en) * 2015-06-18 2017-01-11 阿里巴巴集团控股有限公司 Data generation method and device
CN106411826A (en) * 2015-08-03 2017-02-15 阿里巴巴集团控股有限公司 Data access method and equipment thereof
CN106598963A (en) * 2015-10-14 2017-04-26 五八同城信息技术有限公司 Method and device for optimizing query statement
CN106776639A (en) * 2015-11-24 2017-05-31 腾讯科技(深圳)有限公司 Data processing method and data processing equipment based on SQL
CN106802927A (en) * 2016-12-22 2017-06-06 上海大汉三通数据通信有限公司 A kind of date storage method and querying method
CN107239710A (en) * 2016-03-29 2017-10-10 北京明略软件系统有限公司 A kind of data base authority method and system
CN107315790A (en) * 2017-06-14 2017-11-03 腾讯科技(深圳)有限公司 A kind of optimization method and device of irrelevant subquery
WO2018045610A1 (en) * 2016-09-06 2018-03-15 北京百度网讯科技有限公司 Method and device for executing distributed computing task
CN108089915A (en) * 2016-11-22 2018-05-29 北京京东尚科信息技术有限公司 The method and system of business controlization processing based on message queue
CN108268589A (en) * 2017-12-05 2018-07-10 北京百度网讯科技有限公司 Aggregate query method, apparatus, computer equipment and the readable medium of time series data
CN108388606A (en) * 2018-02-06 2018-08-10 福建星瑞格软件有限公司 A kind of method and computer equipment verifying library literary name section name in Sql sentences
CN108431805A (en) * 2015-10-23 2018-08-21 甲骨文国际公司 It is parallel to execute the inquiry with recursion clause
WO2018218504A1 (en) * 2017-05-31 2018-12-06 华为技术有限公司 Method and device for data query
CN109308300A (en) * 2018-09-27 2019-02-05 上海达梦数据库有限公司 A kind of processing method of logical operation, device, conversion plug-in unit and storage medium
CN109508339A (en) * 2018-11-12 2019-03-22 上海达梦数据库有限公司 A kind of data query method, apparatus, terminal device and storage medium
CN109947804A (en) * 2019-03-20 2019-06-28 上海达梦数据库有限公司 Optimization method, device, server and the storage medium of data acquisition system inquiry
CN110008244A (en) * 2019-03-29 2019-07-12 国家计算机网络与信息安全管理中心 A kind of data query method and data query device
CN110019394A (en) * 2017-11-28 2019-07-16 阿里巴巴集团控股有限公司 Data query method and apparatus
CN110096489A (en) * 2019-04-30 2019-08-06 阿里巴巴集团控股有限公司 A kind of data query method, system, device and electronic equipment
CN110109946A (en) * 2019-04-23 2019-08-09 北京小米移动软件有限公司 Problem querying method and device
CN110297710A (en) * 2018-03-23 2019-10-01 畅想科技有限公司 Public precedence information for more resource arbitrations
CN110727672A (en) * 2018-06-28 2020-01-24 北京京东尚科信息技术有限公司 Data mapping relation query method and device, electronic equipment and readable medium
CN110968615A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Data query method and device
CN111125155A (en) * 2018-10-31 2020-05-08 北京国双科技有限公司 Data query method and device based on access path, storage medium and processor
CN111159218A (en) * 2019-12-31 2020-05-15 中科曙光国际信息产业有限公司 Data processing method and device and readable storage medium
CN111488326A (en) * 2019-01-25 2020-08-04 阿里巴巴集团控股有限公司 MapReduce job execution method, node device and storage medium
CN111597214A (en) * 2019-02-21 2020-08-28 北京京东尚科信息技术有限公司 Method, apparatus, computer system and medium for data query
CN111694943A (en) * 2020-05-29 2020-09-22 刘韩 Accurate information query method and system
CN111949684A (en) * 2020-08-06 2020-11-17 上海达梦数据库有限公司 SQL statement execution method, device, equipment and storage medium
CN113010539A (en) * 2019-12-20 2021-06-22 北京奇艺世纪科技有限公司 Data processing method and device
WO2022127417A1 (en) * 2020-12-16 2022-06-23 中兴通讯股份有限公司 Data query method, and electronic device and storage medium

Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030046073A1 (en) * 2001-08-24 2003-03-06 International Business Machines Corporation Word predicting method, voice recognition method, and voice recognition apparatus and program using the same methods
KR100413784B1 (en) * 1997-04-29 2004-02-14 삼성전자주식회사 Device for translating english-korean having clause unit separator and method for the same
US20060122993A1 (en) * 2004-12-06 2006-06-08 International Business Machines Corporation Abstract query plan
CN101075255A (en) * 2007-06-08 2007-11-21 北京神舟航天软件技术有限公司 Method for removing SQL query statement constant condition
CN102045652A (en) * 2009-10-21 2011-05-04 深圳市彩讯科技有限公司 Garbage short message interception method based on characteristic similarity
CN102073719A (en) * 2011-01-10 2011-05-25 复旦大学 Interval coding-based geography markup language (GML) document index method
CN102682047A (en) * 2011-10-18 2012-09-19 国网电力科学研究院 Mixed structured query language (SQL) injection protection method
CN102789475A (en) * 2012-05-08 2012-11-21 华为技术有限公司 Data synchronization method and device for result set cache at server end of database
CN102799622A (en) * 2012-06-19 2012-11-28 北京大学 Distributed structured query language (SQL) query method based on MapReduce expansion framework

Patent Citations (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100413784B1 (en) * 1997-04-29 2004-02-14 삼성전자주식회사 Device for translating english-korean having clause unit separator and method for the same
US20030046073A1 (en) * 2001-08-24 2003-03-06 International Business Machines Corporation Word predicting method, voice recognition method, and voice recognition apparatus and program using the same methods
US20060122993A1 (en) * 2004-12-06 2006-06-08 International Business Machines Corporation Abstract query plan
CN101075255A (en) * 2007-06-08 2007-11-21 北京神舟航天软件技术有限公司 Method for removing SQL query statement constant condition
CN102045652A (en) * 2009-10-21 2011-05-04 深圳市彩讯科技有限公司 Garbage short message interception method based on characteristic similarity
CN102073719A (en) * 2011-01-10 2011-05-25 复旦大学 Interval coding-based geography markup language (GML) document index method
CN102682047A (en) * 2011-10-18 2012-09-19 国网电力科学研究院 Mixed structured query language (SQL) injection protection method
CN102789475A (en) * 2012-05-08 2012-11-21 华为技术有限公司 Data synchronization method and device for result set cache at server end of database
CN102799622A (en) * 2012-06-19 2012-11-28 北京大学 Distributed structured query language (SQL) query method based on MapReduce expansion framework

Cited By (55)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104462317A (en) * 2014-12-01 2015-03-25 苏州朗米尔照明科技有限公司 Data screening method based on tag
CN104866608A (en) * 2015-06-05 2015-08-26 中国人民大学 Query optimization method based on join index in data warehouse
CN104866608B (en) * 2015-06-05 2018-01-09 中国人民大学 Enquiring and optimizing method based on join index in a kind of data warehouse
CN106325832A (en) * 2015-06-18 2017-01-11 阿里巴巴集团控股有限公司 Data generation method and device
CN106411826A (en) * 2015-08-03 2017-02-15 阿里巴巴集团控股有限公司 Data access method and equipment thereof
CN106411826B (en) * 2015-08-03 2019-06-18 阿里巴巴集团控股有限公司 A kind of method and apparatus of data access
CN106598963A (en) * 2015-10-14 2017-04-26 五八同城信息技术有限公司 Method and device for optimizing query statement
CN108431805B (en) * 2015-10-23 2021-08-17 甲骨文国际公司 Parallel execution of queries with recursive clauses
CN108431805A (en) * 2015-10-23 2018-08-21 甲骨文国际公司 It is parallel to execute the inquiry with recursion clause
CN106776639A (en) * 2015-11-24 2017-05-31 腾讯科技(深圳)有限公司 Data processing method and data processing equipment based on SQL
CN106776639B (en) * 2015-11-24 2020-04-10 腾讯科技(深圳)有限公司 Data processing method and data processing device based on structured query language
CN107239710B (en) * 2016-03-29 2020-06-16 北京明略软件系统有限公司 Database permission implementation method and system
CN107239710A (en) * 2016-03-29 2017-10-10 北京明略软件系统有限公司 A kind of data base authority method and system
CN105930407A (en) * 2016-04-18 2016-09-07 北京思特奇信息技术股份有限公司 Cross-database associated query method and system for distributed database
CN105930407B (en) * 2016-04-18 2019-05-17 北京思特奇信息技术股份有限公司 A kind of inter-library relation query method of distributed data base and system
JP2019531539A (en) * 2016-09-06 2019-10-31 ベイジン バイドゥ ネットコム サイエンス テクノロジー カンパニー リミテッド Method and apparatus for performing distributed computing tasks
US11379499B2 (en) 2016-09-06 2022-07-05 Beijing Baidu Netcom Science And Technology Co., Ltd. Method and apparatus for executing distributed computing task
WO2018045610A1 (en) * 2016-09-06 2018-03-15 北京百度网讯科技有限公司 Method and device for executing distributed computing task
CN108089915A (en) * 2016-11-22 2018-05-29 北京京东尚科信息技术有限公司 The method and system of business controlization processing based on message queue
CN106802927A (en) * 2016-12-22 2017-06-06 上海大汉三通数据通信有限公司 A kind of date storage method and querying method
WO2018218504A1 (en) * 2017-05-31 2018-12-06 华为技术有限公司 Method and device for data query
US11269878B2 (en) 2017-06-14 2022-03-08 Tencent Technology (Shenzhen) Company Limited Uncorrelated subquery optimization method and apparatus, and storage medium
CN107315790A (en) * 2017-06-14 2017-11-03 腾讯科技(深圳)有限公司 A kind of optimization method and device of irrelevant subquery
CN107315790B (en) * 2017-06-14 2021-07-06 腾讯科技(深圳)有限公司 Optimization method and device for non-relevant sub-queries
CN110019394A (en) * 2017-11-28 2019-07-16 阿里巴巴集团控股有限公司 Data query method and apparatus
CN108268589A (en) * 2017-12-05 2018-07-10 北京百度网讯科技有限公司 Aggregate query method, apparatus, computer equipment and the readable medium of time series data
CN108388606A (en) * 2018-02-06 2018-08-10 福建星瑞格软件有限公司 A kind of method and computer equipment verifying library literary name section name in Sql sentences
CN108388606B (en) * 2018-02-06 2022-01-25 福建星瑞格软件有限公司 Method for checking base table field names in Sql sentences and computer equipment
CN110297710A (en) * 2018-03-23 2019-10-01 畅想科技有限公司 Public precedence information for more resource arbitrations
CN110297710B (en) * 2018-03-23 2022-05-10 畅想科技有限公司 Common priority information for multiple resource arbitration
CN110727672A (en) * 2018-06-28 2020-01-24 北京京东尚科信息技术有限公司 Data mapping relation query method and device, electronic equipment and readable medium
CN109308300A (en) * 2018-09-27 2019-02-05 上海达梦数据库有限公司 A kind of processing method of logical operation, device, conversion plug-in unit and storage medium
CN110968615B (en) * 2018-09-30 2023-05-23 北京国双科技有限公司 Data query method and device
CN110968615A (en) * 2018-09-30 2020-04-07 北京国双科技有限公司 Data query method and device
CN111125155A (en) * 2018-10-31 2020-05-08 北京国双科技有限公司 Data query method and device based on access path, storage medium and processor
CN111125155B (en) * 2018-10-31 2023-10-27 北京国双科技有限公司 Access path-based data query method, device, storage medium and processor
CN109508339A (en) * 2018-11-12 2019-03-22 上海达梦数据库有限公司 A kind of data query method, apparatus, terminal device and storage medium
CN109508339B (en) * 2018-11-12 2020-10-09 上海达梦数据库有限公司 Data query method and device, terminal equipment and storage medium
CN111488326A (en) * 2019-01-25 2020-08-04 阿里巴巴集团控股有限公司 MapReduce job execution method, node device and storage medium
CN111488326B (en) * 2019-01-25 2023-04-07 阿里巴巴集团控股有限公司 MapReduce job execution method, node device and storage medium
CN111597214A (en) * 2019-02-21 2020-08-28 北京京东尚科信息技术有限公司 Method, apparatus, computer system and medium for data query
CN109947804A (en) * 2019-03-20 2019-06-28 上海达梦数据库有限公司 Optimization method, device, server and the storage medium of data acquisition system inquiry
CN109947804B (en) * 2019-03-20 2021-04-06 上海达梦数据库有限公司 Data set query optimization method and device, server and storage medium
CN110008244A (en) * 2019-03-29 2019-07-12 国家计算机网络与信息安全管理中心 A kind of data query method and data query device
CN110109946A (en) * 2019-04-23 2019-08-09 北京小米移动软件有限公司 Problem querying method and device
CN110109946B (en) * 2019-04-23 2021-09-14 北京小米移动软件有限公司 Question query method and device
CN110096489A (en) * 2019-04-30 2019-08-06 阿里巴巴集团控股有限公司 A kind of data query method, system, device and electronic equipment
CN113010539A (en) * 2019-12-20 2021-06-22 北京奇艺世纪科技有限公司 Data processing method and device
CN111159218B (en) * 2019-12-31 2023-10-31 中科曙光国际信息产业有限公司 Data processing method, device and readable storage medium
CN111159218A (en) * 2019-12-31 2020-05-15 中科曙光国际信息产业有限公司 Data processing method and device and readable storage medium
CN111694943B (en) * 2020-05-29 2021-03-23 刘韩 Accurate information query method and system
CN111694943A (en) * 2020-05-29 2020-09-22 刘韩 Accurate information query method and system
CN111949684B (en) * 2020-08-06 2023-08-11 上海达梦数据库有限公司 SQL sentence execution method, device, equipment and storage medium
CN111949684A (en) * 2020-08-06 2020-11-17 上海达梦数据库有限公司 SQL statement execution method, device, equipment and storage medium
WO2022127417A1 (en) * 2020-12-16 2022-06-23 中兴通讯股份有限公司 Data query method, and electronic device and storage medium

Also Published As

Publication number Publication date
CN104123288B (en) 2018-06-01

Similar Documents

Publication Publication Date Title
CN104123288A (en) Method and device for inquiring data
JP6617117B2 (en) Scalable analysis platform for semi-structured data
CN105122243B (en) Expansible analysis platform for semi-structured data
US8700605B1 (en) Estimating rows returned by recursive queries using fanout
CN111026874A (en) Data processing method and server of knowledge graph
CN111382226A (en) Database query retrieval method and device and electronic equipment
CN106294695A (en) A kind of implementation method towards the biggest data search engine
CN111221791A (en) Method for importing multi-source heterogeneous data into data lake
DE112011101200T5 (en) Column-oriented memory representations of data records
WO2019161679A1 (en) Data processing method and device for use in online analytical processing
Alarabi et al. TAREEG: A MapReduce-based system for extracting spatial data from OpenStreetMap
US20140229429A1 (en) Database management delete efficiency
US20200226130A1 (en) Vertical union of feature-based datasets
CN112434027A (en) Indexing method and device for multi-dimensional data, computer equipment and storage medium
US10678789B2 (en) Batch data query method and apparatus
Kricke et al. Graph data transformations in Gradoop
Mehmood et al. Distributed real-time ETL architecture for unstructured big data
US11567969B2 (en) Unbalanced partitioning of database for application data
Mittal et al. Efficient random data accessing in MapReduce
CN110928963B (en) Column-level authority knowledge graph construction method for operation and maintenance service data table
CN116894035A (en) Multi-source heterogeneous data blood-edge relation construction method, system, equipment and medium
CN115658680A (en) Data storage method, data query method and related device
US20220215021A1 (en) Data Query Method and Apparatus, Computing Device, and Storage Medium
Albahli et al. TripleFCA: FCA-based approach to enhance semantic web data management
CN113297252A (en) Data query service method with mode being unaware

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right
TR01 Transfer of patent right

Effective date of registration: 20211109

Address after: Room 507, floor 5, building 3, No. 969, Wenyi West Road, Wuchang Street, Yuhang District, Hangzhou City, Zhejiang Province

Patentee after: ZHEJIANG TMALL TECHNOLOGY Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: ALIBABA GROUP HOLDING Ltd.