CN103914544A - Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words - Google Patents
Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words Download PDFInfo
- Publication number
- CN103914544A CN103914544A CN201410134887.3A CN201410134887A CN103914544A CN 103914544 A CN103914544 A CN 103914544A CN 201410134887 A CN201410134887 A CN 201410134887A CN 103914544 A CN103914544 A CN 103914544A
- Authority
- CN
- China
- Prior art keywords
- address
- chinese
- matching
- name
- even numbers
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/20—Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
- G06F16/29—Geographical information databases
Abstract
The invention discloses a method for quickly matching Chinese addresses in a multi-level manner on the basis of address feature words, and belongs to the field of data spatial research in the geographic information science. The method particularly includes two links of creating standard Chinese address matching dictionaries and matching the addresses. Chinese words of standard Chinese addresses are segmented on the word segmentation basis of the address feature words, the standard Chinese address matching dictionaries are created by the aid of double-array trie trees and Hash operation, database retrieval modes are replaced by bidirectional scanning and Hash operation, and geographic spatial coordinates of the Chinese addresses to be matched are acquired, so that the Chinese addresses can be matched. The method has the advantages that integral address matching procedures can be completed in memories of computers, and the address matching speed can be increased owing to bidirectional scanning and a simultaneous word segmentation and matching mode; the problem that partial Chinese addresses cannot be matched due to missing of address elements can be solved according to Chinese address classifying, layering and combining rules, and the address matching accuracy can be improved.
Description
Technical field
The invention belongs to data space research field.Relate in particular to a kind of multi-level Chinese address matching process fast based on address Feature Words.
Background technology
Digital city is the urban information system system taking spatial information as core, at the integrated of Digital city information resource with in merging, matching addresses technology, as core technology, is being born the task of the spatial positional information of a large amount of natural language descriptions of industry-by-industry being converted to geographical space coordinate.Therefore, matching addresses speed and accuracy thereof will produce important and far-reaching impact to the construction of digital city.
At present, conventional Chinese address matching process mainly contains full-text index, Chinese word segmentation, address point rank coupling, matching regular expressions and fuzzy matching addresses.Yangyang ZHAOs etc. propose the address segmenting method based on Address factor recognition mechanism, on the basis of maximum forward scan matching method, increase the recognition mechanism based on Address factor, improved the accuracy of Chinese address participle, but declining significantly appears in its rate matched.Tang Jing utilizes segmentation, combination and the priority rule of Chinese address in the research of Chinese address coding, Chinese address is carried out to segmentation coupling, these rules have reduced the matching times of Address factor to a certain extent, but because the method needs constantly to carry out alternately with database in matching addresses process, the method overall matching speed is slower.Hong Ying proposes the matching addresses method based on even numbers group trie tree and Address factor coded query.The method first adopts even numbers group trie tree to carry out Chinese word segmentation to Chinese address, and obtain the corresponding Address factor coding of Chinese address word segmentation result according to its self-defining coding rule, then in the database of normal address, inquire about concrete geographical space coordinate according to address constituent encoder.Compared with other Chinese address segmenting method, the method is very fast aspect participle speed, but also needs by normal address data base querying geographical space coordinate, and therefore, the method global rate is very restricted.Yao Xinyu attempts using main Feature Words and secondary Feature Words to carry out mark to address, and improves the matching rate of address by the account form of Chinese character similarity and pinyin similarity, but this scholar does not carry out deep research to address rate matched problem.Cheng Changxiu etc. adopt the fuzzy Chinese word cutting method of participle limit, limit coupling, have reduced the number of comparisons of address character string, but because it still inquires about geographical space coordinate in database, its rate matched is considerably slower than even numbers group trie tree participle.Zhang Qian etc. propose the hierarchical address model based on finite state machine and trie tree, have solved part address and have named lack of standardization and address jump problem, but address rate matched problem is not thoroughly discussed.More than research and propose the solution of many raising matching addresses accuracy, but to the research of address rate matched not deeply.Therefore,, in the face of the matching addresses request of current extensive quantity, how rapid batch completes the problem in science urgently to be resolved hurrily that matching addresses task is data space research field.
Summary of the invention
The object of the invention is to overcome the deficiencies in the prior art, propose a kind of multi-level Chinese address matching process fast based on address Feature Words.
Multi-level Chinese address matching process fast based on address Feature Words comprises the steps:
1) from standard Chinese address database, read in the record of all standard Chinese address, comprise geographical space coordinate x value, the y value of each standard Chinese address;
2), according to the classifying rules of Chinese address, taking address Feature Words as participle, according to forward scan Chinese word segmentation is carried out in standard Chinese address, the 5 class Address factors that Chinese word segmentation is obtained are inserted in 5 corresponding class even numbers group trie trees;
3) from 5 class even numbers group trie trees, obtain the corresponding geocoding element set in standard Chinese address, according to minimum cost principle, taking the layering of Chinese address and rule of combination as according to address code element is carried out to combination and permutation, obtain 4 geocodings of this standard Chinese address of unique expression, these 4 geocodings are carried out to Hash operation, the geographical space coordinate of this standard Chinese address is stored on position corresponding to its hash function value in Hash table, step 2 carry out successively in all standard Chinese address) operation of ~ step 3), complete standard Chinese matching addresses dictionary creation,
4) read Chinese address character string to be matched, assignment S1 and S2 respectively carries out forward scan coupling and reverse scanning coupling simultaneously;
5) judge that whether forward scan coupling and reverse scanning coupling be successful, if it fails to match for forward scan coupling or reverse scanning, return to step 4); If the match is successful for forward scan coupling and reverse scanning, obtain geocoding combination T1 and the T2 of Corresponding matching result;
6) establish geocoding T=T1+T2, T is carried out to Hash operation, in Hash table, search corresponding geographical space coordinate by hash function value, if exist, obtain corresponding geographical volume coordinate, matching addresses success, if do not exist, matching addresses failure, repeating step 4) ~ step 6), complete the matching addresses of all Chinese address to be matched.
Described step 2) be:
(1) classifying rules of Chinese address refers to that one refers to that the clear and definite standard Chinese address of meaning is made up of administrative name, lane, street name, cell name, bar location name and interest this 5 class Address factor of calling the roll, wherein Address factor refers to a relatively independent part in address character string, has clear and definite address meaning;
(2) forward scan Chinese word cutting method is a kind of from character string sequence, put first place, the method for successively character string being carried out from left to right cutting;
(3) even numbers group trie tree is made up of base array and check array, a wherein node state for each element representation trie tree of base array, array value is the base value of state transitions, forerunner's state of a state of check array representation, array value is proof test value, in the time that the value of base array and check array is 0, this state free time, 5 class even numbers group trie trees are stored respectively the 5 class Address factors that each normal address comprises, 5 class even numbers group trie trees are specially administrative division even numbers group trie tree, lane, street name even numbers group trie tree, residential quarters even numbers group trie tree, bar location even numbers group trie tree and POI even numbers group trie tree,
(4) update for even numbers group trie tree is to carry out the transfer of a next state in the time building even numbers group trie tree, in the time that state m transfers to state n, must meet following 2 conditions:
base[m]?+?c?=?n,
check[n]?=?m,
Wherein, m is the subscript of current state, and n is the subscript of transfering state, and c is the numerical value of input character.
Described step 3) comprises:
(1) from 5 class even numbers group trie trees, obtain the corresponding 5 class geocoding elements in standard Chinese address, and according to minimum cost principle, layering and rule of combination with Chinese address carry out combination and permutation to address code element, obtain 4 geocodings of the accurate Chinese address of only table indicating, wherein geocoding element refers to the array index value of last character in even numbers group trie tree in each Address factor, and geocoding is to be formed by geocoding element combinations and arrangement;
(2) hierarchical rule of Chinese address refers to according to the subordinate relation of 5 Address factors of Chinese address, can be divided into three levels, and the first level is administrative name, comprises provincial, city-level, at county level, township level, at village level; The second level Wei Jie lane name and cell name; Tri-layer is bar location name and POI name;
(3) rule of combination of Chinese address refers to the hierarchical rule according to Chinese address, the Chinese address of a standard can have 16 kinds to refer to that the clear and definite Chinese address to be matched of meaning matches with it, wherein, according to minimum cost principle, 4 kinds of address expression-forms that comprise 3 class Address factors are:
Name/bar location, lane, administrative name/street
× × province × × city × × district × × street × × road × × number;
Lane, administrative name/street name/interest is called the roll
× × province × × city × × district × × street × × road × × primary school;
Administrative name/cell name/bar location
× × province × × city × × district × × street × × community × × number;
Administrative name/cell name/interest is called the roll
× × province × × city × × district × × street × × community × × square.
Described step 4) comprises:
(1) forward scan coupling first taking administrative division Feature Words as cutting according to S1 is carried out to forward scan Chinese word segmentation, if participle failure, it fails to match in forward scan, if participle success, obtains corresponding administrative division geocoding element, in administrative division even numbers group trie tree, inquire about the coupling branch of this geocoding element, if successful inquiring, obtain the corresponding array index value of this last state of coupling branch T1, if inquire about unsuccessfully, it fails to match in forward scan;
(2) first reverse scanning coupling carries out tri-layer Address factor scan matching, if tri-layer Address factor scan matching success, carry out again the second level Address factor scan matching, if the second level Address factor scan matching success, the match is successful for reverse scanning, if tri-layer Address factor scan matching or the failure of the second level Address factor scan matching, it fails to match for reverse scanning;
First (3) tri-layer Address factor scan matchings carry out bar location Address factor scan matching, if bar location Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, directly enter the second level Address factor match query link, if bar location Address factor scan matching failure, carry out POI Address factor scan matching, if POI Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, enter the second level Address factor match query link, if POI Address factor scan matching failure, it fails to match for reverse scanning,
(4) bar location Address factor scan matching first taking bar location Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding bar location geocoding element, in this geocoding Match of elemental composition branch of bar location even numbers group trie tree query;
(5) POI Address factor scan matching first taking POI Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success is obtained corresponding POI geocoding element, in this geocoding Match of elemental composition branch of POI even numbers group trie tree query;
First (6) second level Address factor scan matchings carry out cell name Address factor scan matching, if cell name Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, carry out lane, street name Address factor scan matching, the name Address factor scan matching success of Ruo Jie lane, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, it fails to match for reverse scanning,
(7) cell name Address factor scan matching first taking cell name Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding cell name geocoding element, in this geocoding Match of elemental composition branch of cell name even numbers group trie tree query;
(8) lane, street name Address factor scan matching first Yi Jie lane name Feature Words be cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding Jie lane name geocoding element, this geocoding Match of elemental composition branch of Jie lane name even numbers group trie tree query;
(9) reverse scanning Chinese word cutting method is a kind of turning left from the right side to the method that foundation is carried out cutting to character string taking address Feature Words as participle.
The beneficial effect that the present invention compared with prior art has:
1) the present invention is directed to existing Chinese address coupling dictionary creation overlong time, the deficiency that memory headroom expense is excessive, utilize classification, layering and the rule of combination of Chinese address, improve the building mode of standard Chinese matching addresses dictionary, reduced time and the space expense of standard Chinese matching addresses dictionary creation.
2) in matching addresses process, adopt bilateral scanning and Hash operation to replace the continuous mutual mode of tradition and standard Chinese address database, improve the speed of matching addresses.
3), according to minimum cost matching principle, solved the problem that existing method cannot be mated part Address factor disappearance Chinese address, the accuracy that has improved matching addresses.
Brief description of the drawings
Fig. 1 is the trie tree construction schematic diagram that in the present invention, contains administrative division information;
Fig. 2 is the process flow diagram of matching addresses link in the present invention.
Embodiment
Introduce in detail the present invention below in conjunction with the drawings and the specific embodiments.
The multi-level Chinese address matching process fast that the present invention is based on address Feature Words, matching addresses link realization flow figure as shown in Figure 2.Now, as example specific embodiment of the invention process is described taking " No. 148, temmoku hill path, Xi Xi street, Hangzhou, Zhejiang province city ", its concrete steps are as follows:
1) from standard Chinese address database, read in the record of all standard Chinese address, Hangzhou, Zhejiang province city, comprise geographical space coordinate x value, the y value of each standard Chinese address;
2), according to the classifying rules of Chinese address, taking address Feature Words as participle, according to forward scan Chinese word segmentation is carried out in standard Chinese address, the 5 class Address factors that Chinese word segmentation is obtained are inserted in 5 corresponding class even numbers group trie trees;
3) from 5 class even numbers group trie trees, obtain the corresponding geocoding element set in standard Chinese address, according to minimum cost principle, taking the layering of Chinese address and rule of combination as according to address code element is carried out to combination and permutation, obtain 4 geocodings of the accurate Chinese address of only table indicating, these 4 geocodings are carried out to Hash operation, the geographical space coordinate of this standard Chinese address is stored on position corresponding to its hash function value in Hash table, step 2 carry out successively in all standard Chinese address) operation of ~ step 3), complete standard Chinese matching addresses dictionary creation,
4) read Chinese address character string to be matched " No. 148, temmoku hill path, Xi Xi street, Hangzhou, Zhejiang province city ", assignment S1 and S2 respectively carries out forward scan coupling and reverse scanning coupling simultaneously;
5) judge that whether forward scan coupling and reverse scanning coupling be successful, if it fails to match for forward scan coupling or reverse scanning, return to step 4); If the match is successful for forward scan coupling and reverse scanning, obtain geocoding combination T1 and the T2 of Corresponding matching result;
6) establish geocoding T=T1+T2, T is carried out to Hash operation, in Hash table, search corresponding geographical space coordinate by hash function value, if exist, obtain corresponding geographical volume coordinate, matching addresses success, if do not exist, matching addresses failure.
Described step 2) comprising:
(1) classifying rules of Chinese address refers to that one refers to that the clear and definite standard Chinese address of meaning is made up of administrative name, lane, street name, cell name, bar location name and interest this 5 class Address factor of calling the roll, wherein Address factor refers to a relatively independent part in address character string, there is clear and definite address meaning, for example: temmoku hill path, Xi Xi street, Hangzhou, Zhejiang province city is made up of 3 class Address factors for No. 148 exactly, " Xi Xi street, Hangzhou, Zhejiang province city ", " temmoku hill path ", " No. 148 " respectively;
(2) address Feature Words refers to the identical characters string suffix that each class Address factor comprises, as in " road " in " province " in administrative name Address factor, " city ", lane, street name Address factor, bar location Address factor " number " etc., specifically each class Address factor is in table 1;
Table 1
(3) forward scan Chinese word cutting method is a kind of from character string sequence, put first place, successively character string is carried out from left to right the method for cutting, to obtain administrative division Address factor as example, first taking provincial Feature Words as cutting, foundation is carried out cutting to Chinese address, if cutting success, obtain corresponding provincial Address factor, if cutting failure, enter the Chinese address cutting link taking city-level Feature Words as cutting foundation, repeat above-mentioned participle link, until complete the Chinese address cutting link taking at village level Feature Words as cutting foundation, the Address factor that above-mentioned cutting is obtained connects successively, to obtain administrative division Address factor,
(4) even numbers group trie tree is made up of base array and check array, a wherein node state for each element representation trie tree of base array, array value is the base value of state transitions, forerunner's state of a state of check array representation, array value is proof test value, in the time that the value of base array and check array is 0, this state free time, 5 class even numbers group trie trees are stored respectively the 5 class Address factors that each normal address comprises, 5 class even numbers group trie trees are specially administrative division even numbers group trie tree, lane, street name even numbers group trie tree, residential quarters even numbers group trie tree, bar location even numbers group trie tree and POI even numbers group trie tree,
(5) its essence of update of even numbers group trie tree is just carried out the transfer of a next state in the time building even numbers group trie tree, and in the time that state m transfers to state n, it must meet following 2 conditions:
base[m]?+?c?=?n,
check[n]?=?m,
Wherein, m is the subscript of current state, and n is the subscript of transfering state, and c is the numerical value of input character,
Now, taking literary composition two, literary composition three these two Address factors as example, the position of each element of how to confirm in even numbers group trie tree is described.Suppose that the array index value that character " literary composition " is corresponding is i, base[i] value must meet the following conditions:
Base [base[i]+code[bis-]]=0;
Check[base[i]+code[bis-]]=0;
Base [base[i]+code[tri-]]=0;
Check[base[i]+code[tri-]]=0;
Calculate base[i according to above-mentioned formula] value after, just can determine character " two ", " three " corresponding check value, the value of other branch node can be by that analogy.
Described step 3) comprises:
(1) from 5 class even numbers group trie trees, obtain the corresponding 5 class geocoding elements in standard Chinese address, and according to minimum cost principle, layering and rule of combination with Chinese address carry out combination and permutation to address code element, obtain 4 geocodings of the accurate Chinese address of only table indicating, wherein geocoding element refers to the array index value of last character in even numbers group trie tree in each Address factor, and geocoding is to be formed by geocoding element combinations and arrangement;
(2) hierarchical rule of Chinese address refers to according to the subordinate relation of 5 Address factors of Chinese address, can be divided into three levels, and the first level is administrative name, comprises provincial, city-level, at county level, township level, at village level; The second level Wei Jie lane name and cell name; Tri-layer is bar location name and POI name;
(3) rule of combination of Chinese address refers to the hierarchical rule according to Chinese address, the Chinese address of a standard can have 16 kinds to refer to that the clear and definite address to be matched of meaning matches with it, wherein, according to minimum cost principle, 4 kinds of address expression-forms that comprise 3 class Address factors are:
Name/bar location, lane, administrative name/street
× × province × × city × × district × × street × × road × × number;
Lane, administrative name/street name/interest is called the roll
× × province × × city × × district × × street × × road × × primary school;
Administrative name/cell name/bar location
× × province × × city × × district × × street × × community × × number;
Administrative name/cell name/interest is called the roll
× × province × × city × × district × × street × × community × × square;
Described step 4) comprises:
(1) forward scan coupling first taking administrative division Feature Words as cutting according to S1 is carried out to forward scan Chinese word segmentation, obtain administrative division Address factor " Xi Xi street, Hangzhou, Zhejiang province city ", in administrative division even numbers group trie tree, inquire about the coupling branch of this geocoding element, if successful inquiring, obtain the corresponding array index value of this last state of coupling branch " road " T1, if inquire about unsuccessfully, it fails to match in forward scan;
(2) first reverse scanning coupling carries out tri-layer Address factor scan matching, if tri-layer Address factor scan matching success, carry out again the second level Address factor scan matching, if the second level Address factor scan matching success, the match is successful for reverse scanning, if tri-layer Address factor scan matching or the failure of the second level Address factor scan matching, it fails to match for reverse scanning;
First (3) tri-layer Address factor scan matchings carry out bar location Address factor scan matching, if bar location Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, directly enter the second level Address factor match query link, if bar location Address factor scan matching failure, carry out POI Address factor scan matching, if POI Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, enter the second level Address factor match query link, if POI Address factor scan matching failure, it fails to match for reverse scanning,
(4) bar location Address factor scan matching first taking bar location Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, obtain bar location geocoding element " No. 148 ", in this geocoding Match of elemental composition branch of bar location even numbers group trie tree query;
(5) POI Address factor scan matching first taking POI Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success is obtained corresponding POI geocoding element, in this geocoding Match of elemental composition branch of POI even numbers group trie tree query;
First (6) second level Address factor scan matchings carry out cell name Address factor scan matching, if cell name Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, carry out lane, street name Address factor scan matching, the name Address factor scan matching success of Ruo Jie lane, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, it fails to match for reverse scanning,
(7) cell name Address factor scan matching first taking cell name Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding cell name geocoding element, in this geocoding Match of elemental composition branch of cell name even numbers group trie tree query;
(8) lane, street name Address factor scan matching first Yi Jie lane name Feature Words be cutting according to S2 is carried out to reverse scanning Chinese word segmentation, obtain lane, street name geocoding element " temmoku hill path ", this geocoding Match of elemental composition branch of Jie lane name even numbers group trie tree query;
(9) reverse scanning Chinese word cutting method is that a kind of turning left from the right side carried out the method for cutting to character string, to obtain bar location Address factor as example, in character string, retrieve coupling taking bar location Feature Words as retrieval foundation, obtain the index position N at Feature Words place, bar location, from this index position, turn left from the right side and successively each character character string is carried out to Feature Words judgement, until retrieve the address Feature Words of another kind of Address factor, obtain the index position M of this address Feature Words, obtain the character string of M-1 index position to N index position, this character string is bar location Address factor.
Embodiment
For verifying validity of the present invention, be analyzed by the present invention and based on even numbers group trie tree matching addresses method and based on even numbers group trie tree and Address factor coded query matching process, the present invention adopts Lucheng District, Wenzhou City and Ouhai District to amount to 37137 standard Chinese addresses and builds the test of matching addresses dictionary, and extract Lucheng District, Wenzhou City and Ouhai District and amount to 29792 public health infectious disease report cards, patient family certificate address information in each report card is carried out to matching addresses application testing, for the purpose of convenient, method 1 represents based on even numbers group trie tree matching addresses method hereinafter, method 2 represents based on even numbers group trie tree and Address factor coded query matching process, method 3 represents the multi-level Chinese address matching process fast based on address Feature Words,
Three kinds of method coupling dictionary creation time comparative unit: ms of table 2
Table 2 result shows, method 2 and method 3 are obviously better than method 1, its reason is that branch's degree of depth of method 1 and the child node number of part branch are considerably beyond method 2 and method 3, in method 1, due to normal address of a branching representation, its branch's degree of depth at least exceedes 20, and in method 2 and method 3, each branch only represents an Address factor, its mean depth just becomes 1/5 of method 1, simultaneously, build even numbers group trie tree owing to pressing Address factor, the also corresponding minimizing of the child node number of part branch, therefore, method 2 and method 3 are obviously better than method 1, and method 3 is stored in 5 Address factors respectively in 5 even numbers group trie trees, phase ratio method 2, reduce the conflict between part branch, therefore, method 3 is slightly better than method 2 in time,
Three kinds of method coupling comparative units that dictionary creation takes up space of table 3: byte
From table 2 interpretation of result, branch's degree of depth of method 1 and the child node number of branch are considerably beyond method 2 and method 3, and method 3 is slightly less than method 2, during according to structure even numbers group trie tree, node conflict is more, the feature that array utilization factor is lower, in three kinds of methods, the shared memory size sequence of array is: method 1> method 2> method 3;
Three kinds of method coupling comparative unit that dictionary creation takes up space: ms of table 4
Table 4 result shows, method 1 and method 3 are obviously better than method 2, because all adopting even numbers group trie tree, three kinds of matching process mate dictionary creation, therefore, match query address consuming time basic identical, in the end obtain address space coordinate time, 1 needs of method are carried out a next state fumction, method 3 is carried out hash function computing to the geocoding obtaining, also can obtain accordingly result, therefore, method 1 and method 3 are total consuming time basic identical, in method 2, after coupling dictionary is obtained from definition geocoding, also will be from geocoding database search space coordinate, therefore, it is total consuming time considerably beyond other two kinds of methods.
Claims (4)
1. the multi-level Chinese address matching process fast based on address Feature Words, is characterized in that comprising the steps:
1) from standard Chinese address database, read in the record of all standard Chinese address, comprise geographical space coordinate x value, the y value of each standard Chinese address;
2), according to the classifying rules of Chinese address, taking address Feature Words as participle, according to forward scan Chinese word segmentation is carried out in standard Chinese address, the 5 class Address factors that Chinese word segmentation is obtained are inserted in 5 corresponding class even numbers group trie trees;
3) from 5 class even numbers group trie trees, obtain the corresponding geocoding element set in standard Chinese address, according to minimum cost principle, taking the layering of Chinese address and rule of combination as according to address code element is carried out to combination and permutation, obtain 4 geocodings of this standard Chinese address of unique expression, these 4 geocodings are carried out to Hash operation, the geographical space coordinate of this standard Chinese address is stored on position corresponding to its hash function value in Hash table, step 2 carry out successively in all standard Chinese address) operation of ~ step 3), complete standard Chinese matching addresses dictionary creation,
4) read Chinese address character string to be matched, assignment S1 and S2 respectively carries out forward scan coupling and reverse scanning coupling simultaneously;
5) judge that whether forward scan coupling and reverse scanning coupling be successful, if it fails to match for forward scan coupling or reverse scanning, return to step 4); If the match is successful for forward scan coupling and reverse scanning, obtain geocoding combination T1 and the T2 of Corresponding matching result;
6) establish geocoding T=T1+T2, T is carried out to Hash operation, in Hash table, search corresponding geographical space coordinate by hash function value, if exist, obtain corresponding geographical volume coordinate, matching addresses success, if do not exist, matching addresses failure, repeating step 4) ~ step 6), complete the matching addresses of all Chinese address to be matched.
2. a kind of multi-level Chinese address matching process fast based on address Feature Words according to claim 1, is characterized in that described step 2) be:
(1) classifying rules of Chinese address refers to that one refers to that the clear and definite standard Chinese address of meaning is made up of administrative name, lane, street name, cell name, bar location name and interest this 5 class Address factor of calling the roll, wherein Address factor refers to a relatively independent part in address character string, has clear and definite address meaning;
(2) forward scan Chinese word cutting method is a kind of from character string sequence, put first place, the method for successively character string being carried out from left to right cutting;
(3) even numbers group trie tree is made up of base array and check array, a wherein node state for each element representation trie tree of base array, array value is the base value of state transitions, forerunner's state of a state of check array representation, array value is proof test value, in the time that the value of base array and check array is 0, this state free time, 5 class even numbers group trie trees are stored respectively the 5 class Address factors that each normal address comprises, 5 class even numbers group trie trees are specially administrative division even numbers group trie tree, lane, street name even numbers group trie tree, residential quarters even numbers group trie tree, bar location even numbers group trie tree and POI even numbers group trie tree,
(4) update for even numbers group trie tree is to carry out the transfer of a next state in the time building even numbers group trie tree, in the time that state m transfers to state n, must meet following 2 conditions:
base[m]?+?c?=?n,
check[n]?=?m,
Wherein, m is the subscript of current state, and n is the subscript of transfering state, and c is the numerical value of input character.
3. a kind of multi-level Chinese address matching process fast based on address Feature Words according to claim 1, is characterized in that described step 3) comprises:
(1) from 5 class even numbers group trie trees, obtain the corresponding 5 class geocoding elements in standard Chinese address, and according to minimum cost principle, layering and rule of combination with Chinese address carry out combination and permutation to address code element, obtain 4 geocodings of the accurate Chinese address of only table indicating, wherein geocoding element refers to the array index value of last character in even numbers group trie tree in each Address factor, and geocoding is to be formed by geocoding element combinations and arrangement;
(2) hierarchical rule of Chinese address refers to according to the subordinate relation of 5 Address factors of Chinese address, can be divided into three levels, and the first level is administrative name, comprises provincial, city-level, at county level, township level, at village level; The second level Wei Jie lane name and cell name; Tri-layer is bar location name and POI name;
(3) rule of combination of Chinese address refers to the hierarchical rule according to Chinese address, the Chinese address of a standard can have 16 kinds to refer to that the clear and definite Chinese address to be matched of meaning matches with it, wherein, according to minimum cost principle, 4 kinds of address expression-forms that comprise 3 class Address factors are:
Name/bar location, lane, administrative name/street
× × province × × city × × district × × street × × road × × number;
Lane, administrative name/street name/interest is called the roll
× × province × × city × × district × × street × × road × × primary school;
Administrative name/cell name/bar location
× × province × × city × × district × × street × × community × × number;
Administrative name/cell name/interest is called the roll
× × province × × city × × district × × street × × community × × square.
4. a kind of multi-level Chinese address matching process fast based on address Feature Words according to claim 1, is characterized in that described step 4) comprises:
(1) forward scan coupling first taking administrative division Feature Words as cutting according to S1 is carried out to forward scan Chinese word segmentation, if participle failure, it fails to match in forward scan, if participle success, obtains corresponding administrative division geocoding element, in administrative division even numbers group trie tree, inquire about the coupling branch of this geocoding element, if successful inquiring, obtain the corresponding array index value of this last state of coupling branch T1, if inquire about unsuccessfully, it fails to match in forward scan;
(2) first reverse scanning coupling carries out tri-layer Address factor scan matching, if tri-layer Address factor scan matching success, carry out again the second level Address factor scan matching, if the second level Address factor scan matching success, the match is successful for reverse scanning, if tri-layer Address factor scan matching or the failure of the second level Address factor scan matching, it fails to match for reverse scanning;
First (3) tri-layer Address factor scan matchings carry out bar location Address factor scan matching, if bar location Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, directly enter the second level Address factor match query link, if bar location Address factor scan matching failure, carry out POI Address factor scan matching, if POI Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch, assignment T2, enter the second level Address factor match query link, if POI Address factor scan matching failure, it fails to match for reverse scanning,
(4) bar location Address factor scan matching first taking bar location Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding bar location geocoding element, in this geocoding Match of elemental composition branch of bar location even numbers group trie tree query;
(5) POI Address factor scan matching first taking POI Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success is obtained corresponding POI geocoding element, in this geocoding Match of elemental composition branch of POI even numbers group trie tree query;
First (6) second level Address factor scan matchings carry out cell name Address factor scan matching, if cell name Address factor scan matching success, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, carry out lane, street name Address factor scan matching, the name Address factor scan matching success of Ruo Jie lane, obtain the corresponding array index value of this last state of coupling branch T3, make T2=T2+T3, the match is successful for reverse scanning, if cell name Address factor scan matching failure, it fails to match for reverse scanning,
(7) cell name Address factor scan matching first taking cell name Feature Words as cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding cell name geocoding element, in this geocoding Match of elemental composition branch of cell name even numbers group trie tree query;
(8) lane, street name Address factor scan matching first Yi Jie lane name Feature Words be cutting according to S2 is carried out to reverse scanning Chinese word segmentation, if participle success, obtain corresponding Jie lane name geocoding element, this geocoding Match of elemental composition branch of Jie lane name even numbers group trie tree query;
(9) reverse scanning Chinese word cutting method is a kind of turning left from the right side to the method that foundation is carried out cutting to character string taking address Feature Words as participle.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410134887.3A CN103914544A (en) | 2014-04-03 | 2014-04-03 | Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201410134887.3A CN103914544A (en) | 2014-04-03 | 2014-04-03 | Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words |
Publications (1)
Publication Number | Publication Date |
---|---|
CN103914544A true CN103914544A (en) | 2014-07-09 |
Family
ID=51040224
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201410134887.3A Pending CN103914544A (en) | 2014-04-03 | 2014-04-03 | Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103914544A (en) |
Cited By (37)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104142993A (en) * | 2014-07-30 | 2014-11-12 | 东软集团股份有限公司 | Complex snort rule classification method and system based on depth features |
CN104657486A (en) * | 2015-03-02 | 2015-05-27 | 武汉工程大学 | Method for trustworthiness computing of administrative division based on multiple factors |
CN105159949A (en) * | 2015-08-12 | 2015-12-16 | 北京京东尚科信息技术有限公司 | Chinese address word segmentation method and system |
CN105224610A (en) * | 2015-09-08 | 2016-01-06 | 方正国际软件有限公司 | The method and apparatus that a kind of address is compared |
CN105404686A (en) * | 2015-12-10 | 2016-03-16 | 湖南科技大学 | Method for matching place name and address in news event based on geographical feature hierarchical segmented words |
CN105528372A (en) * | 2014-09-30 | 2016-04-27 | 华为技术有限公司 | An address search method and apparatus |
CN105631627A (en) * | 2015-12-23 | 2016-06-01 | 拉扎斯网络科技(上海)有限公司 | Order address matching method and device |
CN105786800A (en) * | 2016-03-23 | 2016-07-20 | 苏州数字地图信息科技股份有限公司 | Police standard address acquiring method and system |
CN105786922A (en) * | 2014-12-25 | 2016-07-20 | 高德软件有限公司 | Method and equipment for determining missing electronic map data |
CN106096024A (en) * | 2016-06-24 | 2016-11-09 | 北京京东尚科信息技术有限公司 | The appraisal procedure of address similarity and apparatus for evaluating |
CN106202028A (en) * | 2015-04-30 | 2016-12-07 | 阿里巴巴集团控股有限公司 | A kind of address information recognition methods and device |
CN106502978A (en) * | 2016-09-19 | 2017-03-15 | 浪潮软件股份有限公司 | A kind of Chinese address segmenting method and device |
CN106547770A (en) * | 2015-09-21 | 2017-03-29 | 阿里巴巴集团控股有限公司 | A kind of user's classification based on address of theenduser information, user identification method and device |
CN106649286A (en) * | 2016-10-15 | 2017-05-10 | 语联网(武汉)信息技术有限公司 | Method for conducting term matching on basis of double-array lexicographic tree |
CN106709065A (en) * | 2017-01-19 | 2017-05-24 | 国家电网公司 | Standardization processing method and standardized processing device for address information |
CN107203522A (en) * | 2016-03-16 | 2017-09-26 | 北京京东尚科信息技术有限公司 | A kind of decision method and device of malice Order Address |
CN107423295A (en) * | 2016-05-24 | 2017-12-01 | 张向利 | A kind of magnanimity address date intelligence fast matching method |
CN107748745A (en) * | 2017-11-08 | 2018-03-02 | 厦门美亚商鼎信息科技有限公司 | A kind of enterprise name keyword extraction method |
CN108205625A (en) * | 2016-12-19 | 2018-06-26 | 北京京东尚科信息技术有限公司 | Information processing method and device |
CN108628811A (en) * | 2018-04-10 | 2018-10-09 | 北京京东尚科信息技术有限公司 | The matching process and device of address text |
CN108763215A (en) * | 2018-05-30 | 2018-11-06 | 中智诚征信有限公司 | A kind of address storage method, device and computer equipment based on address participle |
CN108985672A (en) * | 2017-06-01 | 2018-12-11 | 北京京东尚科信息技术有限公司 | Information output method and device |
CN109033327A (en) * | 2018-07-19 | 2018-12-18 | 浪潮软件集团有限公司 | Management system and management method of Chinese address tree |
CN109145169A (en) * | 2018-07-26 | 2019-01-04 | 浙江省测绘科学技术研究院 | A kind of address matching method based on statistics participle |
CN109344263A (en) * | 2018-08-01 | 2019-02-15 | 昆明理工大学 | A kind of address matching method |
WO2019072193A1 (en) * | 2017-10-10 | 2019-04-18 | 捷开通讯(深圳)有限公司 | Method and apparatus for intelligently retrieving information, and storage medium |
CN109978829A (en) * | 2019-02-26 | 2019-07-05 | 深圳市华汉伟业科技有限公司 | A kind of detection method and its system of object to be detected |
CN110222238A (en) * | 2019-04-30 | 2019-09-10 | 上海交通大学 | The querying method and system of character string and identifier biaxial stress structure |
CN110647832A (en) * | 2019-09-16 | 2020-01-03 | 贝壳技术有限公司 | Method and device for acquiring information in certificate, electronic equipment and storage medium |
CN110990520A (en) * | 2019-11-28 | 2020-04-10 | 中国建设银行股份有限公司 | Address coding method and device, electronic equipment and storage medium |
CN111160827A (en) * | 2019-12-09 | 2020-05-15 | 上海东普信息科技有限公司 | Express collecting method, express collecting method of handheld terminal and storage medium |
CN111427977A (en) * | 2019-01-10 | 2020-07-17 | 阿里巴巴集团控股有限公司 | Electronic eye data processing method and device |
CN111626052A (en) * | 2020-04-28 | 2020-09-04 | 北京明亿科技有限公司 | Hash dictionary-based alarm receiving and handling text item name extraction method and device |
CN112069276A (en) * | 2020-08-31 | 2020-12-11 | 平安科技(深圳)有限公司 | Address coding method and device, computer equipment and computer readable storage medium |
CN112115214A (en) * | 2019-06-20 | 2020-12-22 | 中科聚信信息技术(北京)有限公司 | Address standardization method, address standardization device and electronic equipment |
CN113656450A (en) * | 2021-07-12 | 2021-11-16 | 大箴(杭州)科技有限公司 | Address processing method and device, electronic equipment and storage medium |
CN114168705A (en) * | 2021-12-03 | 2022-03-11 | 南京大峡谷信息科技有限公司 | Chinese address matching method based on address element index |
-
2014
- 2014-04-03 CN CN201410134887.3A patent/CN103914544A/en active Pending
Non-Patent Citations (1)
Title |
---|
徐聪 等: ""基于哈希和双数组trie树的多层次地址匹配算法"", 《浙江大学学报(理学版)》 * |
Cited By (58)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104142993A (en) * | 2014-07-30 | 2014-11-12 | 东软集团股份有限公司 | Complex snort rule classification method and system based on depth features |
CN104142993B (en) * | 2014-07-30 | 2017-08-29 | 东软集团股份有限公司 | Complicated snort rule classifications method and system based on depth characteristic |
US10783171B2 (en) | 2014-09-30 | 2020-09-22 | Huawei Technologies Co., Ltd. | Address search method and device |
CN105528372A (en) * | 2014-09-30 | 2016-04-27 | 华为技术有限公司 | An address search method and apparatus |
CN105528372B (en) * | 2014-09-30 | 2019-05-24 | 华为技术有限公司 | A kind of address search method and equipment |
CN105786922B (en) * | 2014-12-25 | 2020-02-14 | 高德软件有限公司 | Method and device for determining missing electronic map data |
CN105786922A (en) * | 2014-12-25 | 2016-07-20 | 高德软件有限公司 | Method and equipment for determining missing electronic map data |
CN104657486B (en) * | 2015-03-02 | 2018-01-19 | 武汉工程大学 | A kind of method that confidence level based on polyfactorial administrative division calculates |
CN104657486A (en) * | 2015-03-02 | 2015-05-27 | 武汉工程大学 | Method for trustworthiness computing of administrative division based on multiple factors |
CN106202028A (en) * | 2015-04-30 | 2016-12-07 | 阿里巴巴集团控股有限公司 | A kind of address information recognition methods and device |
CN106202028B (en) * | 2015-04-30 | 2019-10-11 | 阿里巴巴集团控股有限公司 | A kind of address information recognition methods and device |
CN105159949A (en) * | 2015-08-12 | 2015-12-16 | 北京京东尚科信息技术有限公司 | Chinese address word segmentation method and system |
CN105224610A (en) * | 2015-09-08 | 2016-01-06 | 方正国际软件有限公司 | The method and apparatus that a kind of address is compared |
CN106547770A (en) * | 2015-09-21 | 2017-03-29 | 阿里巴巴集团控股有限公司 | A kind of user's classification based on address of theenduser information, user identification method and device |
CN105404686B (en) * | 2015-12-10 | 2018-08-31 | 湖南科技大学 | A kind of media event place name address matching method based on geographical feature level participle |
CN105404686A (en) * | 2015-12-10 | 2016-03-16 | 湖南科技大学 | Method for matching place name and address in news event based on geographical feature hierarchical segmented words |
CN105631627A (en) * | 2015-12-23 | 2016-06-01 | 拉扎斯网络科技(上海)有限公司 | Order address matching method and device |
CN107203522B (en) * | 2016-03-16 | 2020-02-07 | 北京京东尚科信息技术有限公司 | Method and device for judging malicious order address |
CN107203522A (en) * | 2016-03-16 | 2017-09-26 | 北京京东尚科信息技术有限公司 | A kind of decision method and device of malice Order Address |
CN105786800A (en) * | 2016-03-23 | 2016-07-20 | 苏州数字地图信息科技股份有限公司 | Police standard address acquiring method and system |
CN107423295A (en) * | 2016-05-24 | 2017-12-01 | 张向利 | A kind of magnanimity address date intelligence fast matching method |
CN106096024A (en) * | 2016-06-24 | 2016-11-09 | 北京京东尚科信息技术有限公司 | The appraisal procedure of address similarity and apparatus for evaluating |
CN106502978A (en) * | 2016-09-19 | 2017-03-15 | 浪潮软件股份有限公司 | A kind of Chinese address segmenting method and device |
CN106649286A (en) * | 2016-10-15 | 2017-05-10 | 语联网(武汉)信息技术有限公司 | Method for conducting term matching on basis of double-array lexicographic tree |
CN106649286B (en) * | 2016-10-15 | 2019-07-02 | 语联网(武汉)信息技术有限公司 | One kind carrying out the matched method of term based on even numbers group dictionary tree |
CN108205625A (en) * | 2016-12-19 | 2018-06-26 | 北京京东尚科信息技术有限公司 | Information processing method and device |
CN108205625B (en) * | 2016-12-19 | 2021-02-26 | 北京京东尚科信息技术有限公司 | Information processing method and device, storage medium and electronic equipment |
CN106709065B (en) * | 2017-01-19 | 2020-08-04 | 国家电网公司 | Address information standardization processing method and device |
CN106709065A (en) * | 2017-01-19 | 2017-05-24 | 国家电网公司 | Standardization processing method and standardized processing device for address information |
CN108985672A (en) * | 2017-06-01 | 2018-12-11 | 北京京东尚科信息技术有限公司 | Information output method and device |
WO2019072193A1 (en) * | 2017-10-10 | 2019-04-18 | 捷开通讯(深圳)有限公司 | Method and apparatus for intelligently retrieving information, and storage medium |
CN107748745B (en) * | 2017-11-08 | 2021-08-03 | 厦门美亚商鼎信息科技有限公司 | Enterprise name keyword extraction method |
CN107748745A (en) * | 2017-11-08 | 2018-03-02 | 厦门美亚商鼎信息科技有限公司 | A kind of enterprise name keyword extraction method |
CN108628811A (en) * | 2018-04-10 | 2018-10-09 | 北京京东尚科信息技术有限公司 | The matching process and device of address text |
CN108763215A (en) * | 2018-05-30 | 2018-11-06 | 中智诚征信有限公司 | A kind of address storage method, device and computer equipment based on address participle |
CN108763215B (en) * | 2018-05-30 | 2022-04-29 | 中智诚征信有限公司 | Address storage method and device based on address word segmentation and computer equipment |
CN109033327B (en) * | 2018-07-19 | 2021-09-28 | 浪潮软件股份有限公司 | Management system and management method of Chinese address tree |
CN109033327A (en) * | 2018-07-19 | 2018-12-18 | 浪潮软件集团有限公司 | Management system and management method of Chinese address tree |
CN109145169B (en) * | 2018-07-26 | 2021-03-26 | 浙江省测绘科学技术研究院 | Address matching method based on statistical word segmentation |
CN109145169A (en) * | 2018-07-26 | 2019-01-04 | 浙江省测绘科学技术研究院 | A kind of address matching method based on statistics participle |
CN109344263A (en) * | 2018-08-01 | 2019-02-15 | 昆明理工大学 | A kind of address matching method |
CN111427977A (en) * | 2019-01-10 | 2020-07-17 | 阿里巴巴集团控股有限公司 | Electronic eye data processing method and device |
CN111427977B (en) * | 2019-01-10 | 2023-12-19 | 阿里巴巴集团控股有限公司 | Electronic eye data processing method and device |
CN109978829A (en) * | 2019-02-26 | 2019-07-05 | 深圳市华汉伟业科技有限公司 | A kind of detection method and its system of object to be detected |
CN110222238B (en) * | 2019-04-30 | 2022-02-25 | 上海交通大学 | Query method and system for bidirectional mapping of character string and identifier |
CN110222238A (en) * | 2019-04-30 | 2019-09-10 | 上海交通大学 | The querying method and system of character string and identifier biaxial stress structure |
CN112115214A (en) * | 2019-06-20 | 2020-12-22 | 中科聚信信息技术(北京)有限公司 | Address standardization method, address standardization device and electronic equipment |
CN112115214B (en) * | 2019-06-20 | 2024-04-02 | 中科聚信信息技术(北京)有限公司 | Address standardization method, address standardization device and electronic equipment |
CN110647832A (en) * | 2019-09-16 | 2020-01-03 | 贝壳技术有限公司 | Method and device for acquiring information in certificate, electronic equipment and storage medium |
CN110990520A (en) * | 2019-11-28 | 2020-04-10 | 中国建设银行股份有限公司 | Address coding method and device, electronic equipment and storage medium |
CN110990520B (en) * | 2019-11-28 | 2023-10-20 | 中国建设银行股份有限公司 | Address coding method and device, electronic equipment and storage medium |
CN111160827A (en) * | 2019-12-09 | 2020-05-15 | 上海东普信息科技有限公司 | Express collecting method, express collecting method of handheld terminal and storage medium |
CN111626052A (en) * | 2020-04-28 | 2020-09-04 | 北京明亿科技有限公司 | Hash dictionary-based alarm receiving and handling text item name extraction method and device |
CN112069276A (en) * | 2020-08-31 | 2020-12-11 | 平安科技(深圳)有限公司 | Address coding method and device, computer equipment and computer readable storage medium |
CN112069276B (en) * | 2020-08-31 | 2024-03-08 | 平安科技(深圳)有限公司 | Address coding method, address coding device, computer equipment and computer readable storage medium |
CN113656450A (en) * | 2021-07-12 | 2021-11-16 | 大箴(杭州)科技有限公司 | Address processing method and device, electronic equipment and storage medium |
CN114168705B (en) * | 2021-12-03 | 2022-11-11 | 南京大峡谷信息科技有限公司 | Chinese address matching method based on address element index |
CN114168705A (en) * | 2021-12-03 | 2022-03-11 | 南京大峡谷信息科技有限公司 | Chinese address matching method based on address element index |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103914544A (en) | Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words | |
CN101719128B (en) | Fuzzy matching-based Chinese geo-code determination method | |
CN105069056B (en) | Identity certificate address information analytic method and system based on string matching | |
CN102395965B (en) | Method for searching objects in a database | |
CN107145577A (en) | Address standardization method, device, storage medium and computer | |
CN109684428A (en) | Spatial data building method, device, equipment and storage medium | |
CN103440311A (en) | Method and system for identifying geographical name entities | |
CN103246857B (en) | A kind of type of service dissolves the method that yard rule parsing isomery coding obtains Item Information | |
WO2019165644A1 (en) | Address error correction method and terminal | |
CN107766433A (en) | A kind of range query method and device based on Geo BTree | |
CN109933797A (en) | Geocoding and system based on Jieba participle and address dictionary | |
CN102147795A (en) | Method and device for searching points of interest as well as navigation system | |
CN103488710B (en) | The non-fixed-length data method of efficient storage in big data page | |
CN101520801B (en) | Method for storing space geometric objects to database | |
CN108009265B (en) | Spatial data indexing method in cloud computing environment | |
CN110990520B (en) | Address coding method and device, electronic equipment and storage medium | |
CN104199860A (en) | Dataset fragmentation method based on two-dimensional geographic position information | |
CN106202111A (en) | The method of calibration of database data and device | |
CN106991149B (en) | Massive space object storage method fusing coding and multi-version data | |
CN112256817A (en) | Geocoding method, system, terminal and storage medium | |
CN104391908A (en) | Locality sensitive hashing based indexing method for multiple keywords on graphs | |
CN104252507A (en) | Enterprise data matching method and device | |
CN111522892A (en) | Geographic element retrieval method and device | |
CN114048204A (en) | Beidou grid space indexing method and device based on database inverted index | |
CN109885638B (en) | Three-dimensional space indexing method and system |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20140709 |
|
WD01 | Invention patent application deemed withdrawn after publication |