CN107122358A - Mix querying method and equipment - Google Patents

Mix querying method and equipment Download PDF

Info

Publication number
CN107122358A
CN107122358A CN201610102101.9A CN201610102101A CN107122358A CN 107122358 A CN107122358 A CN 107122358A CN 201610102101 A CN201610102101 A CN 201610102101A CN 107122358 A CN107122358 A CN 107122358A
Authority
CN
China
Prior art keywords
data
degree
data object
correlation
dimension
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201610102101.9A
Other languages
Chinese (zh)
Other versions
CN107122358B (en
Inventor
李睿
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Advanced New Technologies Co Ltd
Advantageous New Technologies Co Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201610102101.9A priority Critical patent/CN107122358B/en
Publication of CN107122358A publication Critical patent/CN107122358A/en
Application granted granted Critical
Publication of CN107122358B publication Critical patent/CN107122358B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/283Multi-dimensional databases or data warehouses, e.g. MOLAP or ROLAP

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The purpose of the application is to provide a kind of method and apparatus for mixing inquiry, main dimension is determined in multiple queries dimension first, and multiple data sources are divided based on main dimension, then candidate collection is added by the weighting degree of correlation highest data object that all inquiry dimensions under respective data sources are taken out in each data source, and update the local correlation degree of the data object of each in candidate collection, further according to the upper bound of the local correlation degree of data object and lower bound in candidate collection, Query Result object is determined in the data object of candidate collection.Compared with prior art, the technical scheme of the application is based on main dimension and divides data source, each data source concurrent processing obtains corresponding candidate target, and introduce the concept of local correlation degree, by the local correlation degree for comparing data object in candidate collection, the object for meeting mixing search request can be returned in advance, the efficiency thus, it is possible to improve mixing inquiry.

Description

Mix querying method and equipment
Technical field
The application is related to computer realm, more particularly to a kind of mixing querying method and equipment.
Background technology
Inquiry includes certain pass in the screening criteria or condition of simple queries only one of which dimension, such as database The record of key word, and the screening conditions superposition of inquiry generally two or more dimensions is mixed, one More complicated inquiry target is completed in secondary implementation procedure.For example, being inquired about on map closest Preceding k include " chafing dish ", the restaurant of " dry pot " two keywords, include two spaces (distance) With two inquiry dimensions of text (keyword).
Scene is inquired about in mixing for more than, and the process step of existing mixing querying method is as schemed at present Shown in 1, initial query radius r is set first, is set using position as the center of circle, using r as radius Query context, then searched in the query context including " chafing dish ", " dry pot " two pass The data object of keyword, if qualified data object quantity is less than k, expands radius follow-up It is continuous to search, until more than or equal to k.It is more than or equal to the individual qualified data objects of k when inquiring When, all data objects for meeting querying condition are ranked up according to the degree of correlation of all dimensions, and The final degree of correlation most forward k that filters out is used as the result of mixing inquiry according to object.
Due to be determined as mixing inquiry result when, it is necessary to obtain all qualified data pair As and according to the degrees of correlation of all dimensions is ranked up to it, therefore there is on search efficiency performance bottle Neck.
Apply for content
The purpose of the application is to provide a kind of mixing querying method and equipment, existing mixed to solve Close the problem of querying method has bottleneck on search efficiency.
To achieve the above object, querying method is mixed this application provides one kind, this method includes:
Main dimension is determined in multiple queries dimension, and multiple data sources are divided based on the main dimension;
By taking out the weighting degree of correlation of all inquiry dimensions under respective data sources in each data source most High data object adds candidate collection, and updates the part of each data object in the candidate collection The degree of correlation;
According to the upper bound of the local correlation degree of data object and lower bound in the candidate collection, waited by described Query Result object is determined in the data object that selected works are closed, wherein the upper bound of the local correlation degree is with Boundary is the maximum and minimum value of the weighting degree of correlation of the data object under all inquiry dimensions.
Another aspect based on application, additionally provides a kind of mixing query facility, and the equipment includes:
Data divide device, for determining main dimension in multiple queries dimension, and based on the main dimension Degree divides multiple data sources;
Candidate's updating device, for taking out all inquiries dimension under respective data sources in each data source The weighting degree of correlation highest data object of degree adds candidate collection, and updates each in the candidate collection The local correlation degree of individual data object;
Object determining device, for according to the local correlation degree of data object in the candidate collection Boundary and lower bound, determine Query Result object, wherein the office in the data object of the candidate collection The upper bound of portion's degree of correlation and lower bound are the weighting degree of correlation of the data object under all inquiry dimensions Maximum and minimum value.
Compared with prior art, the technical scheme of the application provides a kind of new mixing inquiry mode, Determined in multiple queries dimension after main dimension, data source is divided based on main dimension, each data source is simultaneously Hair processing obtains corresponding candidate target, and introduces the concept of local correlation degree, by comparing Candidate Set The local correlation degree of data object in conjunction, the object for meeting mixing search request, nothing can be returned in advance Screening and sequencing need to be carried out to all data objects for meeting querying condition according to the degree of correlation of all dimensions, Efficiency thus, it is possible to improve mixing inquiry, shortens query time.
Brief description of the drawings
By reading the detailed description made to non-limiting example made with reference to the following drawings, this Shen Other features, objects and advantages please will become more apparent upon:
A kind of flow chart for mixing querying method that Fig. 1 the embodiment of the present application is provided;
The mixing querying method that Fig. 2 is provided using the embodiment of the present application carries out the handling process of mixing inquiry Figure;
A kind of structural representation for mixing query facility that Fig. 3 provides for the embodiment of the present application;
A kind of structural representation for preferred mixing query facility that Fig. 4 provides for the embodiment of the present application.
Same or analogous reference represents same or analogous part in accompanying drawing.
Embodiment
The application is described in further detail below in conjunction with the accompanying drawings.
In one typical configuration of the application, terminal, the equipment of service network and trusted party include One or more processors (CPU), input/output interface, network interface and internal memory.
Internal memory potentially includes the volatile memory in computer-readable medium, random access memory And/or the form, such as read-only storage (ROM) or flash memory (flash such as Nonvolatile memory (RAM) RAM).Internal memory is the example of computer-readable medium.
Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be with Realize that information is stored by any method or technique.Information can be computer-readable instruction, data knot Structure, the module of program or other data.The example of the storage medium of computer includes, but are not limited to phase Become internal memory (PRAM), static RAM (SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only storage (ROM), electricity It is Erasable Programmable Read Only Memory EPROM (EEPROM), fast flash memory bank or other memory techniques, read-only Compact disc read-only memory (CD-ROM), digital versatile disc (DVD) or other optical storages, Magnetic cassette tape, magnetic disk storage or other magnetic storage apparatus or any other non-transmission medium, The information that can be accessed by a computing device available for storage.Defined according to herein, it is computer-readable Medium not include non-temporary computer readable media (transitory media), such as modulation data-signal and Carrier wave.
Fig. 1 shows a kind of process chart for mixing querying method, comprises the following steps:
Step S101, determines main dimension, and divide many based on the main dimension in multiple queries dimension Individual data source.Wherein, the inquiry dimension refers to the screening conditions in mixing inquiry, for example " Closest preceding k is inquired about on map to include " chafing dish ", the restaurant of " dry pot " two keywords " Mixing inquiry in, include two inquiry dimension, i.e. Spatial Dimensions (distance) and text dimensionality (pass Key word).In addition, conventional inquiry dimension is also including time dimension etc., for example, further limit screening Condition:N month Nei Xinkai restaurant.
In step S101, using some specific inquiry dimension as main dimension, for example, tieed up with text Degree is then based on the main dimension and divides multiple data sources as main dimension.Master is being used as using text dimensionality When dimension carries out data source division, it can typically be divided according to keyword.Still inquired about with foregoing mixing Exemplified by, data object is divided into by keyword " chafing dish " and " dry pot " in text dimensionality Two data sources, correspond to " chafing dish " and " dry pot " respectively.In " chafing dish " data source, own Data object includes keyword " chafing dish ", correspondingly, in " dry pot " data source, all numbers Keyword " dry pot " is included according to object, while including " dry pot " and " chafing dish " both keyword Data object can appear in simultaneously in two data sources.As another feasible embodiment, Spatial Dimension or time dimension can also be determined to be main dimension when determining main dimension, then according to away from From or the time divide data source.By taking Spatial Dimension as an example, number can be divided according to the difference of distance According to source, multiple data sources such as dividing 0~5km, 5~10km and 10~15km.
In actual applications, text dimensionality is general divides data source, therefore each data according to keyword There is the main dimension (text dimensionality) of data object in preferable dispersion, each data source between source Degree of correlation all same, be easy to the follow-up concurrent processing to each data source.And Spatial Dimension and time According to distance or time is generally in dimension, data source division is carried out relative to according to keyword, respectively Dispersion between individual data source is relatively poor, the main dimension (space of data object in each data source Dimension or time dimension) the degree of correlation may differ, therefore it is follow-up to each data source and Hair processing is relatively complicated.
Therefore, as a preferred embodiment, inquire about dimension include text dimensionality when, preferentially It is main dimension by its determination.Specifically, the step S101 is preferably:Including text dimensionality The text dimensionality is determined to be main dimension in multiple queries dimension, and according in the text dimensionality The main dimension is divided multiple data sources by keyword.Thus, it is possible to take out phase in each data source The complexity during weighting degree of correlation highest data object of all inquiry dimensions under data source is answered, is carried The efficiency of height mixing inquiry.
Step S102, by the weighting that all inquiry dimensions under respective data sources are taken out in each data source Degree of correlation highest data object adds candidate collection, and updates each data pair in the candidate collection The local correlation degree of elephant.
Still by taking foregoing mixing inquiry scene as an example, unified number is set up for Spatial Dimension and text dimensionality It is specific as shown in table 1 according to model:
Table 1
Based on above-mentioned data model, the mixing is inquired about in scene, and data object p is in respective data sources t Under all inquiry dimensions weighting degree of correlation ζt(p, q) can be expressed as:
Wherein, the weight of α representation spaces dimension, δ (p.i, q.i) represents data object p space correlation Degree, | q.d | the quantity (i.e. the quantity of keyword) of data source is represented, for same data object p, its The component all same of spatial correlation under each data source, therefore the space correlation under data source t The component of degree is the 1/ of the real space degree of correlation | q.d |.(1- α) represents the weight of text dimensionality, θt(p.d, q.d) represents the text degrees of correlation of the data object p in data source t.
For the mixing inquiry of arbitary inquiry dimension, it is only necessary to by data object p in respective data sources t Under all inquiry dimensions weighting degree of correlation ζtThe calculation formula of (p, q) is enterprising in respective queries dimension Row is expanded, and the formula after expansion is as follows:
Wherein, t represents the data source of main dimension, and p represents data object, and q represents mixing inquiry, ζt(p,q) Represent to inquire about the data object p of the data source t of main dimension in q under the data source in mixed once The actual degree of correlation, j represents a certain inquiry dimension, and m represents to inquire about the quantity of dimension, αjRepresent corresponding Inquire about the weight of dimension, RjRepresent correlations of the data source t data object p in respective queries dimension Degree.
The mixing inquiry scene illustrated in correspondence the embodiment of the present application, m value is 2, if j=1 tables Show Spatial Dimension, thenRepresent " dry pot " data The weighting degree of correlation of Spatial Dimension under source, correspondingly, j=2 represents text dimensionality, then α2=1- α, R2t(p.d, q.d), α2·R2=(1- α) θt(p.d, q.d), is represented under " dry pot " data source The weighting degree of correlation of text dimensionality.
Described ζt(p, q) value is used for the weighting degree of correlation for representing each data object in data source t, ζt(p, q) value is higher, represents that the correlation of the data object is stronger, it is possible thereby in each data source Choose maximally related data object.Specifically, can be according to each data object in each data source t P ζt(p, q) value, is ranked up to data object, is taken according to the result of sequence in each data source Go out ζtThe maximum data object of (p, q) value.For example in " dry pot " data source, first three in ranking results Data object be respectively:Q, W, Y, in " chafing dish " data source, first three in ranking results Data object is respectively:Y, W, X, the data object difference taken out for the first time in two data sources For Q and Y, candidate collection is then added into.In processing mixing query process, by each data Take out ζ in sourcetThe processing procedure of the maximum data object of (p, q) value is concurrently carried out so that query performance On do not have excessive loss, and due to main dimension is divided into multiple data sources, therefore each data The complexity that data object sorts in source can be less than directly to be ranked up according to the degree of correlation of all dimensions, Which thereby enhance the efficiency of mixing inquiry.
Waited being updated according to ranking results after qualified data object is taken out in data source, accordingly The local correlation degree of each data object during selected works are closed.Wherein, the local correlation degree is used to represent certain The extreme case of the weighting degree of correlation of one data object under all inquiry dimensions, i.e., comprising the upper bound with One value range on boundary.
The lower bound represents that the weighting degree of correlation of the data object under all inquiry dimensions may go out Existing minimum value, by taking the mixing inquiry scene in the present embodiment as an example, corresponding to a certain data object p Only include the situation of one of keyword.For example only do not include " chafing dish " comprising " dry pot ", The data object p then now taken out in " dry pot " data source has minimum under all inquiry dimensions The weighting degree of correlation, the weighting degree of correlation of the minimum is the lower bound of local correlation degree.And the upper bound is represented The maximum that the weighting degree of correlation of the data object under all inquiry dimensions is likely to occur, with this reality Apply exemplified by the mixing inquiry scene in example, all keywords are included corresponding to a certain data object p, and It is ζ in the data source of each keywordt(p, q) value highest situation.
Specifically, the concrete mode of the local correlation degree of each data object in the candidate collection is updated It is as follows:For the lower bound of the local correlation degree, by the data object of the candidate collection in respective counts According to the weighting of the weighting degree of correlation and the non-master dimension under remainder data source of all inquiry dimensions under source Degree of correlation sum, is used as the lower bound of the local correlation degree of the data object;Accordingly for described The upper bound of local correlation degree, all under respective data sources of the data object of the candidate collection are looked into Ask the weighting degree of correlation of dimension and the highest weighting degree of correlation of all inquiry dimensions under remainder data source Sum, is used as the upper bound of the local correlation degree of the data object.
Still by taking foregoing mixing inquiry scene as an example, the data object taken out for the first time in two data sources Respectively Q and Y, the local correlation degree is designated as partial score.Q and Y in respective data sources (i.e. Dry pot and chafing dish) under the weighting degrees of correlation of all inquiry dimensions be respectively:ζt(Q, q)=0.614 He ζt(Y, q)=0.511.For the data object Q, its ζ taken out in " dry pot " data sourcet(Q, q) in value Include two parts:The weighting degree of correlation of text dimensionality (main dimension) under " dry pot " data source The weighting of Spatial Dimension (non-master dimension) under D11=0.501, and " dry pot " data source is related I12=0.113 is spent, both sums are ζt(Q, q)=0.614.Assuming that data object Q does not include keyword " chafing dish ", then text dimensionalities (main dimension) of the data object Q under " chafing dish " data source plus Power degree of correlation D21=0, and the weighting phase of the Spatial Dimension (non-master dimension) under " chafing dish " data source Pass degree I22=I21=0.113.Thus, the lower bound of data object Q local correlation degree is 0.727 (i.e. D11+I12+I22).It is determined that Q local correlation degree the upper bound when, due to data object Q " fire The weighting degree of correlation of all inquiry dimensions can not possibly exceed data object Y, therefore number under pot " data source According to object Y ζt(Y, q) value is the theoretic highest weighting degree of correlation, thus, data object Q Local correlation degree the upper bound be 1.125 (i.e. 0.614+0.511).Based on aforesaid way, according to this The data object Q and Y of candidate collection are put into, its local correlation degree is updated for Q=[0.727,1.125]. Similarly, if the weighting degree of correlation of Spatial Dimensions of the data object Y under each data source is 0.111, The upper bound that object Y local correlation degree can then be updated the data is 0.511+0.111, and lower bound is 0.511+.0614, i.e. Y=[0.622,1.125]
Step S103, according to the upper bound of the local correlation degree of data object and lower bound in the candidate collection, Query Result object is determined in the data object of the candidate collection.In actual applications, for by The data object taken out for the first time in each data source, can not directly determine inquiry knot in most cases Fruit object, only when the data object taken out for the first time in each data source is identical data object, Because the data object possesses the weighting degree of correlation of maximum in two data sources, necessarily meet bar The data object of part, therefore can be directly as Query Result object.For example, at " dry pot " In " chafing dish " two data sources, first data object is Q in ranking results, then can be by Data object Q is used as Query Result object.
In most cases, the data object of ranking results first can be differed in each data source, Such as Q and Y in precedent.Therefore, step S103, can be in candidate collection when implementing The lower bound of the local correlation degree of each data object is upper with remaining any data object local correlation degree Boundary is compared, if being more than or equal to remaining any number in the presence of the lower bound of the local correlation degree of a data object According to the local correlation degree of object the upper bound when, by local correlation degree lower bound be more than remaining any data object The data object in the local correlation degree upper bound be defined as Query Result object;Otherwise, step is returned to S102, by the weighting degree of correlation highest that all inquiry dimensions under respective data sources are taken out in each data source Data object add candidate collection determine Query Result object.
Still by taking foregoing mixing inquiry scene as an example, if the condition for determining Query Result object is not met, then When secondary return to step S102 is handled, due to data object Q and Y in previous processing Through being removed, therefore now ζ in two data sourcest(p, q) value highest is data object W respectively, Then based on the data object W newly added, the part of all data objects in candidate collection is updated The degree of correlation.For data object W, because its weighting degree of correlation in two data sources is true Determine, therefore the bound of its local correlation degree is also thereby determined that, it is assumed that W is in " dry pot " and " fire The weighting degree of correlation of all inquiry dimensions under pot " data source is respectively 0.454 and 0.504, then may be used Be 0.958 with the upper bound and lower bound for the local correlation degree for determining data object W, i.e. W=[0.958, 0.958], represent that the weighting degree of correlation of the data object W has been uniquely determined.
Meanwhile, the data object W based on newest taking-up is understood, data object Q is in " chafing dish " number The weighting degree of correlation according to all inquiry dimensions under source can not possibly exceed data object W, and data object Y The weighting degree of correlation of all inquiry dimensions is also impossible to exceed data object W under " chafing dish " data source, Thus, object Q, the upper bound of data object Y local correlation degree are updated the data.Data object Q The upper bound of local correlation degree be updated to 0.614+0.504, i.e. Q=[0.727,1.118], data object The upper bound of Y local correlation degree is updated to 0.511+0.454, i.e. Y=[0.622,0.965].
Now, according to the bound of data object Q, Y and W local correlation degree, do not meet yet Determine the condition of Query Result object, it is therefore desirable to again return to step S102 and continue in data source Take out data object.If the data object point this time taken out in " dry pot " and " chafing dish " data source Wei not Z and X, and ζs of the data object Z in " dry pot " data sourcet(Z, q) value is 0.411, and ζs of the data object X in " chafing dish " data sourcet(X, q) value is 0.25., can be with according to aforementioned manner The upper bound for determining data object Z local correlation degree is 0.411+0.25, and lower bound is 0.411+0.098 (the weighting degree of correlation of the Spatial Dimension under " chafing dish " data source), i.e. Z=[0.509,0.661], and count It is 0.25+0.411 according to the upper bound of object X local correlation degree, lower bound is 0.25+0.101 (" chafing dish " The weighting degree of correlation of Spatial Dimension under data source), i.e. X=[0.351,0.661].
Meanwhile, object Q and Y Local Phase is updated the data based on the data object Z and X newly taken out The Guan Du upper bound.Wherein, the upper bound of data object Q local correlation degree is updated to 0.614+0.25, That is the upper bound of Q=[0.727,0.864], data object Y local correlation degree is updated to 0.511+0.411, That is Y=[0.622,0.922], and the upper bound of data object W local correlation degree and lower bound are unique It is determined that, therefore be still W=[0.958,0.958] without updating.Now, data object W part The lower bound of the degree of correlation is more than or equal to the local correlation degree of remaining any data object (Q, X, Y and Z) The upper bound, meet determine Query Result object condition, therefore by data object W be defined as inquiry Result object, the Query Result object got by way of above-mentioned loop iteration necessarily all dimensions The maximum data object of the weighting degree of correlation of degree.In above-mentioned three iterative process, candidate collection In data object local correlation degree update status it is as shown in table 2.
Table 2
It is determined that during Query Result object, by way of above-mentioned loop iteration, it is only necessary to N The iterative processing of wheel, you can get the Query Result object of the condition of satisfaction, without taking out each number Screening is ranked up according to all data objects in source, therefore, it is possible to effectively improve the efficiency of mixing inquiry, Shorten query time.
Mix and inquire about for TOP-K, due to needing to obtain K Query Result object, therefore in step Rapid S103 is determined as after the data object of Query Result, is still further comprised:
Step S104, the Query Result object is taken out in the candidate collection and is added in results set, And judge whether the quantity of the Query Result object in the results set reaches preset value;If reaching pre- If value, then the result inquired about all Query Result objects in the results set as mixing;If Not up to preset value, then again by taking out all inquiry dimensions under respective data sources in each data source Weighting degree of correlation highest data object adds candidate collection and determines Query Result object.
Still by taking foregoing inquiry scene as an example, after third time iterative processing, it has been determined that data object W is Query Result object, Query Result object W is taken out in candidate collection, and add result In set, the data object now included in candidate collection is Q, Y, Z, X, and in results set Comprising data object be W.
The preset value is the fruiting quantities that TOP-K mixing inquiries need, and the present embodiment is set as 5, Because the quantity of the Query Result object in now results set only has 1, and not up to preset value, Therefore return to step S102 processing, and continue through once or multiple iterative processing Continue to obtain Query Result object, until the quantity of the Query Result object added in results set reaches Untill 5, now all Query Result objects in the results set are mixed as TOP-K and looked into The result of inquiry is returned.
For being inquired about on map, closest preceding k include " chafing dish ", " dry pot " two passes The TOP-K mixing inquiries in the restaurant of keyword, the method provided using the present embodiment is handled it Specific steps Fig. 2 shown in, including:
Step S201, using text dimensionality as main dimension, and is divided into two data sources according to keyword.
Step S202, inquiry is obtained in respective data respectively in " dry pot " and " chafing dish " two data sources Degree of correlation highest data object is weighted under source.
Step S203, takes out the data object that step S202 is obtained, calculates its local correlation degree (partial score)。
Step S204, the data object that step S202 is obtained adds candidate collection, and based on the number According to the upper bound of the local correlation degree (partial score) of data object in object renewal candidate collection with Boundary.
Step S205, the lower bound of the no local correlation degree that there is a data object of candidate collection is more than or equal to The upper bound of the local correlation degree of remaining any data object, if in the presence of continuing executing with step S206; If being not present, return to step S202.
Step S206, the lower bound that local correlation degree is taken out in candidate collection is more than or equal to remaining any number According to the data object in the upper bound of the local correlation degree of object, one of TOP-K mixing Query Results are used as Return, add in results set.
Whether step S207, the data object quantity in judged result set is equal to preset value K, if It is then to return to the final result of TOP-K mixing inquiries, terminates this inquiry;If it has not, then returning Return step S202.
To sum up, the scheme of the embodiment of the present application provides a kind of new mixing inquiry mode, is looked into multiple Ask and determined in dimension after main dimension, data source is divided based on main dimension, each data source concurrent processing is obtained Corresponding candidate target is taken, and introduces the concept of local correlation degree, by comparing data in candidate collection The local correlation degree of object, the object for meeting mixing search request can be returned in advance, without according to institute The degree of correlation for having dimension carries out screening and sequencing to all data objects for meeting querying condition, thus, it is possible to The efficiency of mixing inquiry is improved, shortens query time.
Another aspect based on the application, additionally provides offer mixing query facility, the structure of the equipment As shown in figure 3, dividing device 310, candidate's updating device 320 and object determining device including data 330.Specifically, the data divide device 310 and are used to determine main dimension in multiple queries dimension, And multiple data sources are divided based on the main dimension;Candidate's updating device 320 is used for by each number According to the weighting degree of correlation highest data object for all inquiry dimensions taken out in source under respective data sources Candidate collection is added, and updates the local correlation degree of each data object in the candidate collection;It is described Object determining device 330 is used for the upper bound according to the local correlation degree of data object in the candidate collection And lower bound, Query Result object is determined in the data object of the candidate collection, wherein the part The upper bound of the degree of correlation and lower bound are the weighting degree of correlation of the data object under all inquiry dimensions Maximum and minimum value.
Here, it will be appreciated by those skilled in the art that the mixing query facility can include but not limit Setting of being constituted is integrated by network in user equipment, the network equipment or user equipment and the network equipment It is standby.The user equipment includes but is not limited to the realization such as personal computer, touch control terminal;The network Equipment includes but is not limited to such as network host, single network server, multiple webserver collection or base Realized in set of computers of cloud computing etc..Here, cloud is by based on cloud computing (Cloud Computing) A large amount of main frames or the webserver constitute, wherein, cloud computing is one kind of Distributed Calculation, by one One virtual machine of the computer collection composition of group's loose couplings.
Wherein, the inquiry dimension refers to the screening conditions in mixing inquiry, such as " on map Inquiry closest preceding k is included " chafing dish ", the restaurant of " dry pot " two keywords " it is mixed Close in inquiry, include two inquiry dimension, i.e. Spatial Dimensions (distance) and text dimensionality (keyword). In addition, conventional inquiry dimension is also including time dimension etc., for example, further limit screening conditions:n Individual month Nei Xinkai restaurant.
In data divide device 310, using some specific inquiry dimension as main dimension, for example with Text dimensionality is then based on the main dimension and divides multiple data sources as main dimension.With text dimensionality When carrying out data source division as main dimension, it can typically be divided according to keyword.Still with foregoing mixed Close exemplified by inquiring about, keyword " chafing dish " and " dry pot " in text dimensionality, by data object Two data sources are divided into, " chafing dish " and " dry pot " are corresponded to respectively.In " chafing dish " data source, All data objects include keyword " chafing dish ", correspondingly, in " dry pot " data source, institute There is data object to include keyword " dry pot ", while including " dry pot " and " chafing dish " two passes The data object of key word can be appeared in two data sources simultaneously.As another feasible embodiment, It is determined that Spatial Dimension or time dimension can also be determined into be main dimension during main dimension, then basis Distance or time divide data source.By taking Spatial Dimension as an example, it can be divided according to the difference of distance Data source, multiple data sources such as dividing 0~5km, 5~10km and 10~15km.
In actual applications, text dimensionality is general divides data source, therefore each data according to keyword There is the main dimension (text dimensionality) of data object in preferable dispersion, each data source between source Degree of correlation all same, be easy to the follow-up concurrent processing to each data source.And Spatial Dimension and time According to distance or time is generally in dimension, data source division is carried out relative to according to keyword, respectively Dispersion between individual data source is relatively poor, the main dimension (space of data object in each data source Dimension or time dimension) the degree of correlation may differ, therefore it is follow-up to each data source and Hair processing is relatively complicated.
Therefore, as a preferred embodiment, inquire about dimension include text dimensionality when, data It is main dimension that assembly first, which is divided, by its determination.Specifically, the data divide device 310 and specifically used In the text dimensionality is determined into be main dimension, and root in the multiple queries dimension comprising text dimensionality The main dimension is divided into multiple data sources according to the keyword in the text dimensionality.Thus, it is possible to by The weighting degree of correlation highest number of all inquiry dimensions under respective data sources is taken out in each data source According to complexity during object, the efficiency of mixing inquiry is improved.
Still by taking foregoing mixing inquiry scene as an example, candidate's updating device is handled to each data source When, unified data model can be set up for Spatial Dimension and text dimensionality, it is specific as shown in table 1.
Based on the data model shown in table 1, the mixing is inquired about in scene, and data object p is in respective counts According to the weighting degree of correlation ζ of all inquiry dimensions under the t of sourcet(p, q) can be expressed as:
Wherein, the weight of α representation spaces dimension, δ (p.i, q.i) represents data object p space correlation Degree, | q.d | the quantity (i.e. the quantity of keyword) of data source is represented, for same data object p, its The component all same of spatial correlation under each data source, therefore the space correlation under data source t The component of degree is the 1/ of the real space degree of correlation | q.d |.(1- α) represents the weight of text dimensionality, θt(p.d, q.d) represents the text degrees of correlation of the data object p in data source t.
For the mixing inquiry of arbitary inquiry dimension, it is only necessary to by data object p in respective data sources t Under all inquiry dimensions weighting degree of correlation ζtThe calculation formula of (p, q) is enterprising in respective queries dimension Row is expanded, and the formula after expansion is as follows:
Wherein, t represents the data source of main dimension, and p represents data object, and q represents mixing inquiry, ζt(p,q) Represent to inquire about the data object p of the data source t of main dimension in q under the data source in mixed once The actual degree of correlation, j represents a certain inquiry dimension, and m represents to inquire about the quantity of dimension, αjRepresent corresponding Inquire about the weight of dimension, RjRepresent correlations of the data source t data object p in respective queries dimension Degree.
The mixing inquiry scene illustrated in correspondence the embodiment of the present application, m value is 2, if j=1 tables Show Spatial Dimension, thenRepresent " dry pot " data The weighting degree of correlation of Spatial Dimension under source, correspondingly, j=2 represents text dimensionality, then α2=1- α, R2t(p.d, q.d), α2·R2=(1- α) θt(p.d, q.d), is represented under " dry pot " data source The weighting degree of correlation of text dimensionality.
Described ζt(p, q) value is used for the weighting degree of correlation for representing each data object in data source t, ζt(p, q) value is higher, represents that the correlation of the data object is stronger, it is possible thereby in each data source Choose maximally related data object.Specifically, can be according to each data object in each data source t P ζt(p, q) value, is ranked up to data object, is taken according to the result of sequence in each data source Go out ζtThe maximum data object of (p, q) value.For example in " dry pot " data source, first three in ranking results Data object be respectively:Q, W, Y, in " chafing dish " data source, first three in ranking results Data object is respectively:Y, W, X, the data object difference taken out for the first time in two data sources For Q and Y, candidate collection is then added into.In processing mixing query process, by each data Take out ζ in sourcetThe processing procedure of the maximum data object of (p, q) value is concurrently carried out so that query performance On do not have excessive loss, and due to main dimension is divided into multiple data sources, therefore each data The complexity that data object sorts in source can be less than directly to be ranked up according to the degree of correlation of all dimensions, Which thereby enhance the efficiency of mixing inquiry.
Waited being updated according to ranking results after qualified data object is taken out in data source, accordingly The local correlation degree of each data object during selected works are closed.Wherein, the local correlation degree is used to represent certain The extreme case of the weighting degree of correlation of one data object under all inquiry dimensions, i.e., comprising the upper bound with One value range on boundary.
The lower bound represents that the weighting degree of correlation of the data object under all inquiry dimensions may go out Existing minimum value.By taking the mixing inquiry scene in the present embodiment as an example, corresponding to a certain data object p The situation of one of keyword is only included, for example, does not only include " chafing dish " comprising " dry pot ", The data object p then now taken out in " dry pot " data source has minimum under all inquiry dimensions The weighting degree of correlation, the weighting degree of correlation of the minimum is the lower bound of local correlation degree.And the upper bound is represented The maximum that the weighting degree of correlation of the data object under all inquiry dimensions is likely to occur, with this reality Apply exemplified by the mixing inquiry scene in example, all keywords are included corresponding to a certain data object p, and It is ζ in the data source of each keywordt(p, q) value highest situation.
Specifically, candidate's updating device updates the part of each data object in the candidate collection The concrete mode of the degree of correlation is as follows:For the lower bound of the local correlation degree, by the candidate collection Under the weighting degree of correlation of all inquiry dimensions of the data object under respective data sources and remainder data source Non-master dimension weighting degree of correlation sum, be used as the lower bound of the local correlation degree of the data object; Accordingly for the upper bound of the local correlation degree, by the data object of the candidate collection corresponding The weighting degree of correlation of all inquiry dimensions under data source and all inquiry dimensions under remainder data source Highest weighting degree of correlation sum, be used as the upper bound of the local correlation degree of the data object.
Still by taking foregoing mixing inquiry scene as an example, the data object taken out for the first time in two data sources Respectively Q and Y, the local correlation degree is designated as partial score.Q and Y in respective data sources (i.e. Dry pot and chafing dish) under the weighting degrees of correlation of all inquiry dimensions be respectively:ζt(Q, q)=0.614 He ζt(Y, q)=0.511.For the data object Q, its ζ taken out in " dry pot " data sourcet(Q, q) in value Include two parts:The weighting degree of correlation of text dimensionality (main dimension) under " dry pot " data source The weighting of Spatial Dimension (non-master dimension) under D11=0.501, and " dry pot " data source is related I12=0.113 is spent, both sums are ζt(Q, q)=0.614.Assuming that data object Q does not include keyword " chafing dish ", then text dimensionalities (main dimension) of the data object Q under " chafing dish " data source plus Power degree of correlation D21=0, and the weighting phase of the Spatial Dimension (non-master dimension) under " chafing dish " data source Pass degree I22=I21=0.113.Thus, the lower bound of data object Q local correlation degree is 0.727 (i.e. D11+I12+I22).It is determined that Q local correlation degree the upper bound when, due to data object Q " fire The weighting degree of correlation of all inquiry dimensions can not possibly exceed data object Y, therefore number under pot " data source According to object Y ζt(Y, q) value is the theoretic highest weighting degree of correlation, thus, data object Q Local correlation degree the upper bound be 1.125 (i.e. 0.614+0.511).Based on aforesaid way, according to this The data object Q and Y of candidate collection are put into, its local correlation degree is updated for Q=[0.727,1.125]. Similarly, if the weighting degree of correlation of Spatial Dimensions of the data object Y under each data source is 0.111, The upper bound that object Y local correlation degree can then be updated the data is 0.511+0.111, and lower bound is 0.511+.0614, i.e. Y=[0.622,1.125]
Object determining device 330 according in the candidate collection local correlation degree of data object it is upper Boundary and lower bound, and in the data object of the candidate collection determine Query Result object when, for by The data object taken out for the first time in each data source, can not directly determine inquiry knot in most cases Fruit object, only when the data object taken out for the first time in each data source is identical data object, Because the data object possesses the weighting degree of correlation of maximum in two data sources, necessarily meet bar The data object of part, therefore can be directly as Query Result object.For example, at " dry pot " In " chafing dish " two data sources, first data object is Q in ranking results, then can be by Data object Q is used as Query Result object.
In most cases, the data object of ranking results first can be differed in each data source, Such as Q and Y in precedent.Therefore, the object determining device 330 specifically for:When having one The lower bound of the local correlation degree of data object is more than or equal to the local correlation degree of remaining any data object The upper bound when, local correlation degree lower bound is more than to the local correlation degree upper bound of remaining any data object Data object is defined as Query Result object;Otherwise, corresponding data is taken out in each data source again The weighting degree of correlation highest data object of all inquiry dimensions adds candidate collection and determines inquiry under source Result object.
Still by taking foregoing mixing inquiry scene as an example, if not meeting the condition for determining Query Result object, by Candidate's updating device is handled again, due to data object Q and Y in previous processing Through being removed, therefore now ζ in two data sourcest(p, q) value highest is data object W respectively, Then based on the data object W newly added, the part of all data objects in candidate collection is updated The degree of correlation.For data object W, because its weighting degree of correlation in two data sources is true Determine, therefore the bound of its local correlation degree is also thereby determined that, it is assumed that W is in " dry pot " and " fire The weighting degree of correlation of all inquiry dimensions under pot " data source is respectively 0.454 and 0.504, then may be used Be 0.958 with the upper bound and lower bound for the local correlation degree for determining data object W, i.e. W=[0.958, 0.958], represent that the weighting degree of correlation of the data object W has been uniquely determined.
Meanwhile, the data object W based on newest taking-up is understood, data object Q is in " chafing dish " number The weighting degree of correlation according to all inquiry dimensions under source can not possibly exceed data object W, and data object Y The weighting degree of correlation of all inquiry dimensions is also impossible to exceed data object W under " chafing dish " data source, Thus, object Q, the upper bound of data object Y local correlation degree are updated the data.Data object Q The upper bound of local correlation degree be updated to 0.614+0.504, i.e. Q=[0.727,1.118], data object The upper bound of Y local correlation degree is updated to 0.511+0.454, i.e. Y=[0.622,0.965].
Now, according to the bound of data object Q, Y and W local correlation degree, do not meet yet Determine the condition of Query Result object, it is therefore desirable to continue to take in data source by candidate's updating device Go out data object.If the data object difference this time taken out in " dry pot " and " chafing dish " data source For Z and X, and ζs of the data object Z in " dry pot " data sourcet(Z, q) value is 0.411, and is counted According to ζs of the object X in " chafing dish " data sourcet(X, q) value is 0.25., can be true according to aforementioned manner The upper bound for determining data object Z local correlation degree is 0.411+0.25, and lower bound is 0.411+0.098 (" fire The weighting degree of correlation of Spatial Dimension under pot " data source), i.e. Z=[0.509,0.661], and data pair As X local correlation degree the upper bound be 0.25+0.411, lower bound be 0.25+0.101 (" chafing dish " data The weighting degree of correlation of Spatial Dimension under source), i.e. X=[0.351,0.661].
Meanwhile, object Q and Y Local Phase is updated the data based on the data object Z and X newly taken out The Guan Du upper bound.Wherein, the upper bound of data object Q local correlation degree is updated to 0.614+0.25, That is the upper bound of Q=[0.727,0.864], data object Y local correlation degree is updated to 0.511+0.411, That is Y=[0.622,0.922], and the upper bound of data object W local correlation degree and lower bound are unique It is determined that, therefore be still W=[0.958,0.958] without updating.Now, data object W part The lower bound of the degree of correlation is more than or equal to the local correlation degree of remaining any data object (Q, X, Y and Z) The upper bound, meet determine Query Result object condition, therefore by data object W be defined as inquiry Result object, the Query Result object got by way of above-mentioned loop iteration necessarily all dimensions The maximum data object of the weighting degree of correlation of degree.In above-mentioned three iterative process, candidate collection In data object local correlation degree update status it is as shown in table 2.
It is determined that during Query Result object, by way of above-mentioned loop iteration, it is only necessary to N The iterative processing of wheel, you can get the Query Result object of the condition of satisfaction, without taking out each number Screening is ranked up according to all data objects in source, therefore, it is possible to effectively improve the efficiency of mixing inquiry, Shorten query time.
Mix and inquire about for TOP-K, due to needing to obtain K Query Result object, therefore this Shen Please embodiment further provide a kind of preferred mixing query facility, structure such as Fig. 4 institutes of the equipment Show, except the data in Fig. 3 divide device 310, candidate's updating device 320 and object determining device 330 Outside, in addition to result return mechanism 340.The result return mechanism 340 is used to look into being determined as After the data object for asking result, the Query Result object is taken out in the candidate collection and adds knot During fruit is gathered, and judge whether the quantity of the Query Result object in the results set reaches preset value; When reaching preset value, all Query Result objects in the results set are regard as mixing inquiry As a result;And in not up to preset value, again by taking out institute under respective data sources in each data source The weighting degree of correlation highest data object for having inquiry dimension adds candidate collection and determines Query Result pair As.
Still by taking foregoing inquiry scene as an example, after third time iterative processing, it has been determined that data object W is Query Result object, Query Result object W is taken out in candidate collection, and add result In set, the data object now included in candidate collection is Q, Y, Z, X, and in results set Comprising data object be W.
The preset value is the fruiting quantities that TOP-K mixing inquiries need, and the present embodiment is set as 5, Because the quantity of the Query Result object in now results set only has 1, and not up to preset value, Therefore need to proceed processing by candidate's updating device 320, and continued by object determining device 330 By once or multiple iterative processing continues to obtain Query Result object, until adding results set In Query Result object quantity reach 5 untill, now result return mechanism 340 is by the knot All Query Result objects during fruit is gathered are returned as the TOP-K results for mixing inquiry.
To sum up, the scheme of the embodiment of the present application provides a kind of new mixing inquiry mode, is looked into multiple Ask and determined in dimension after main dimension, data source is divided based on main dimension, each data source concurrent processing is obtained Corresponding candidate target is taken, and introduces the concept of local correlation degree, by comparing data in candidate collection The local correlation degree of object, the object for meeting mixing search request can be returned in advance, without according to institute The degree of correlation for having dimension carries out screening and sequencing to all data objects for meeting querying condition, thus, it is possible to The efficiency of mixing inquiry is improved, shortens query time.
It should be noted that the application can be carried out in the assembly of software and/or software and hardware, example Such as, it can be set using application specific integrated circuit (ASIC), general purpose computer or any other similar hardware It is standby to realize.In one embodiment, the software program of the application can be realized by computing device Steps described above or function.Similarly, the software program (including related data structure) of the application Can be stored in computer readable recording medium storing program for performing, for example, RAM memory, magnetically or optically driver or Floppy disc and similar devices.In addition, some steps or function of the application can employ hardware to realize, example Such as, as coordinating with processor so as to performing the circuit of each step or function.
In addition, the part of the application can be applied to computer program product, such as computer program Instruction, when it is computer-executed, by the operation of the computer, can call or provide basis The present processes and/or technical scheme.And the programmed instruction of the present processes is called, it may be deposited Store up in fixed or moveable recording medium, and/or by broadcast or other signal bearing medias Data flow and be transmitted, and/or be stored according to the computer equipment of described program instruction operation In working storage.Here, including a device, the device bag according to one embodiment of the application The memory for storing computer program instructions and the processor for execute program instructions are included, wherein, When the computer program instructions are by the computing device, the plant running is triggered based on foregoing according to this The methods and/or techniques scheme of multiple embodiments of application.
It is obvious to a person skilled in the art that the application is not limited to the thin of above-mentioned one exemplary embodiment Section, and in the case of without departing substantially from spirit herein or essential characteristic, can be with other specific Form realizes the application.Therefore, no matter from the point of view of which point, embodiment all should be regarded as exemplary , and be nonrestrictive, scope of the present application is limited by appended claims rather than described above It is fixed, it is intended that all changes fallen in the implication and scope of the equivalency of claim are included In the application.The right that any reference in claim should not be considered as involved by limitation will Ask.Furthermore, it is to be understood that the word of " comprising " one is not excluded for other units or step, odd number is not excluded for plural number.Dress Software can also be passed through by a unit or device by putting the multiple units stated in claim or device Or hardware is realized.The first, the second grade word is used for representing title, and is not offered as any specific Order.

Claims (10)

1. one kind mixing querying method, wherein, this method includes:
Main dimension is determined in multiple queries dimension, and multiple data sources are divided based on the main dimension;
By taking out the weighting degree of correlation of all inquiry dimensions under respective data sources in each data source most High data object adds candidate collection, and updates the part of each data object in the candidate collection The degree of correlation;
According to the upper bound of the local correlation degree of data object and lower bound in the candidate collection, waited by described Query Result object is determined in the data object that selected works are closed, wherein the upper bound of the local correlation degree is with Boundary is the maximum and minimum value of the weighting degree of correlation of the data object under all inquiry dimensions.
2. according to the method described in claim 1, wherein, according to data object in the candidate collection Local correlation degree the upper bound and lower bound, determine Query Result in the data object of the candidate collection Object, including:
If being more than or equal to remaining any data object in the presence of the lower bound of the local correlation degree of a data object Local correlation degree the upper bound when, by local correlation degree lower bound be more than remaining any data object part The data object in the degree of correlation upper bound is defined as Query Result object;
Otherwise, again by taking out all weighting phases for inquiring about dimensions under respective data sources in each data source Pass degree highest data object adds candidate collection and determines Query Result object.
3. method according to claim 1 or 2, wherein, it is being determined as the number of Query Result After object, in addition to:
The Query Result object is taken out in the candidate collection to add in results set, and judges institute Whether the quantity for stating the Query Result object in results set reaches preset value;
If reaching preset value, all Query Result objects in the results set are looked into as mixing The result of inquiry;
If not up to preset value, again by taking out all inquiries under respective data sources in each data source The weighting degree of correlation highest data object of dimension adds candidate collection and determines Query Result object.
4. according to the method described in claim 1, wherein, update each data in the candidate collection The local correlation degree of object, including:
By the weighting of all inquiry dimensions of the data object of the candidate collection under respective data sources The weighting degree of correlation sum of the degree of correlation and the non-master dimension under remainder data source, is used as the data object Local correlation degree lower bound;
By the weighting of all inquiry dimensions of the data object of the candidate collection under respective data sources The highest weighting degree of correlation sum of the degree of correlation and all inquiry dimensions under remainder data source, as described The upper bound of the local correlation degree of data object.
5. according to the method described in claim 1, wherein, main dimension is determined in multiple queries dimension, And multiple data sources are divided based on the main dimension, including:
The text dimensionality is determined to be main dimension in the multiple queries dimension comprising text dimensionality, and The main dimension is divided multiple data sources by the keyword in the text dimensionality.
6. one kind mixing query facility, wherein, the equipment includes:
Data divide device, for determining main dimension in multiple queries dimension, and based on the main dimension Degree divides multiple data sources;
Candidate's updating device, for taking out all inquiries dimension under respective data sources in each data source The weighting degree of correlation highest data object of degree adds candidate collection, and updates each in the candidate collection The local correlation degree of individual data object;
Object determining device, for according to the local correlation degree of data object in the candidate collection Boundary and lower bound, determine Query Result object, wherein the office in the data object of the candidate collection The upper bound of portion's degree of correlation and lower bound are the weighting degree of correlation of the data object under all inquiry dimensions Maximum and minimum value.
7. equipment according to claim 1, wherein, the object determining device is deposited for working as It is more than or equal to the Local Phase of remaining any data object in the lower bound of the local correlation degree of a data object During the Guan Du upper bound, on the local correlation degree that local correlation degree lower bound is more than to remaining any data object The data object on boundary is defined as Query Result object;Otherwise, take out corresponding in each data source again The weighting degree of correlation highest data object of all inquiry dimensions adds candidate collection and determined under data source Query Result object.
8. the equipment according to claim 6 or 7, wherein, the equipment also includes:
As a result return mechanism, for being determined as after the data object of Query Result, is waited by described Selected works take out the Query Result object and added in results set in closing, and judge in the results set The quantity of Query Result object whether reach preset value;When reaching preset value, by the result set The result that all Query Result objects in conjunction are inquired about as mixing;And in not up to preset value, Again by taking out all weighting degree of correlation highests for inquiring about dimensions under respective data sources in each data source Data object add candidate collection determine Query Result object.
9. equipment according to claim 6, wherein, candidate's updating device, for each The weighting degree of correlation highest data object of all inquiry dimensions under respective data sources is taken out in data source Add candidate collection, and all inquiries by data object under respective data sources of the candidate collection The weighting degree of correlation sum of the weighting degree of correlation of dimension and the non-master dimension under remainder data source, is used as institute State the lower bound of the local correlation degree of data object;And the candidate collection by data object corresponding The weighting degree of correlation of all inquiry dimensions under data source and all inquiry dimensions under remainder data source Weighting degree of correlation sum, be used as the upper bound of the local correlation degree of the data object.
10. equipment according to claim 6, wherein, the data divide device, for The text dimensionality is determined to be main dimension in multiple queries dimension comprising text dimensionality, and according to institute The main dimension is divided multiple data sources by the keyword stated in text dimensionality.
CN201610102101.9A 2016-02-24 2016-02-24 Hybrid query method and device Active CN107122358B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610102101.9A CN107122358B (en) 2016-02-24 2016-02-24 Hybrid query method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610102101.9A CN107122358B (en) 2016-02-24 2016-02-24 Hybrid query method and device

Publications (2)

Publication Number Publication Date
CN107122358A true CN107122358A (en) 2017-09-01
CN107122358B CN107122358B (en) 2020-09-01

Family

ID=59716952

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610102101.9A Active CN107122358B (en) 2016-02-24 2016-02-24 Hybrid query method and device

Country Status (1)

Country Link
CN (1) CN107122358B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110245208A (en) * 2019-04-30 2019-09-17 广东省智能制造研究所 A kind of retrieval analysis method, apparatus and medium based on big data storage

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102571715A (en) * 2010-12-27 2012-07-11 腾讯科技(深圳)有限公司 Multidimensional data query method and multidimensional data query system
CN102737123A (en) * 2012-06-13 2012-10-17 北京五八信息技术有限公司 Multidimensional data distribution method
CN102890714A (en) * 2012-09-24 2013-01-23 华为技术有限公司 Method and device for indexing data
CN103631952A (en) * 2013-12-12 2014-03-12 用友软件股份有限公司 Method and device for extracting complex data analysis information from plurality of data sources by adopting separating and unifying mode
EP2866156A1 (en) * 2013-10-25 2015-04-29 GB Group PLC Method and apparatus for identity verification
WO2015118887A1 (en) * 2014-02-10 2015-08-13 日本電気株式会社 Search system, search method, and program recording medium
CN105335378A (en) * 2014-06-25 2016-02-17 富士通株式会社 Multi-data source information processing device and method, and server

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102571715A (en) * 2010-12-27 2012-07-11 腾讯科技(深圳)有限公司 Multidimensional data query method and multidimensional data query system
CN102737123A (en) * 2012-06-13 2012-10-17 北京五八信息技术有限公司 Multidimensional data distribution method
CN102890714A (en) * 2012-09-24 2013-01-23 华为技术有限公司 Method and device for indexing data
EP2866156A1 (en) * 2013-10-25 2015-04-29 GB Group PLC Method and apparatus for identity verification
CN103631952A (en) * 2013-12-12 2014-03-12 用友软件股份有限公司 Method and device for extracting complex data analysis information from plurality of data sources by adopting separating and unifying mode
WO2015118887A1 (en) * 2014-02-10 2015-08-13 日本電気株式会社 Search system, search method, and program recording medium
CN105335378A (en) * 2014-06-25 2016-02-17 富士通株式会社 Multi-data source information processing device and method, and server

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110245208A (en) * 2019-04-30 2019-09-17 广东省智能制造研究所 A kind of retrieval analysis method, apparatus and medium based on big data storage
CN110245208B (en) * 2019-04-30 2022-05-24 广东省智能制造研究所 Retrieval analysis method, device and medium based on big data storage

Also Published As

Publication number Publication date
CN107122358B (en) 2020-09-01

Similar Documents

Publication Publication Date Title
US6772163B1 (en) Reduced memory row hash match scan join for a partitioned database system
Benetis et al. Nearest and reverse nearest neighbor queries for moving objects
Pelanis et al. Indexing the past, present, and anticipated future positions of moving objects
US10068033B2 (en) Graph data query method and apparatus
CN105183735A (en) Data query method and query device
CN108959510B (en) Partition level connection method and device for distributed database
CN110597935A (en) Space analysis method and device
CN105447030B (en) A kind of index process method and apparatus
Koh et al. Finding k most favorite products based on reverse top-t queries
US20200250192A1 (en) Processing queries associated with multiple file formats based on identified partition and data container objects
CN107783980A (en) Index data generates and data query method and device, storage and inquiry system
US7035851B1 (en) Reorganization of database records using partition merging
CN111221813B (en) Database index and processing method, device and equipment for database query
CN103744998A (en) Hierarchical-division based storage and organization method for large-scale space vector data
Su et al. Indexing and parallel query processing support for visualizing climate datasets
CN109325055A (en) The screening of business association tables of data and checking method, device, electronic equipment
CN105204920A (en) Distributed calculation operation realizing method and device based on mapping and polymerizing
CN107451204B (en) Data query method, device and equipment
CN104750749B (en) Data processing method and device
GB2516501A (en) Method and system for processing data in a parallel database environment
CN111752986A (en) Data query method and device, equipment and storage medium
US11487824B2 (en) Automated database query filtering for spatial joins
CN107122358A (en) Mix querying method and equipment
CN111125090B (en) Data access method and device
CN109461012A (en) A kind of Products Show method, apparatus and terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
TR01 Transfer of patent right

Effective date of registration: 20200922

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee after: Innovative advanced technology Co.,Ltd.

Address before: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee before: Advanced innovation technology Co.,Ltd.

Effective date of registration: 20200922

Address after: Cayman Enterprise Centre, 27 Hospital Road, George Town, Grand Cayman Islands

Patentee after: Advanced innovation technology Co.,Ltd.

Address before: A four-storey 847 mailbox in Grand Cayman Capital Building, British Cayman Islands

Patentee before: Alibaba Group Holding Ltd.

TR01 Transfer of patent right