CN101882163A - Fuzzy Chinese address geographic evaluation method based on matching rule - Google Patents

Fuzzy Chinese address geographic evaluation method based on matching rule Download PDF

Info

Publication number
CN101882163A
CN101882163A CN2010102219439A CN201010221943A CN101882163A CN 101882163 A CN101882163 A CN 101882163A CN 2010102219439 A CN2010102219439 A CN 2010102219439A CN 201010221943 A CN201010221943 A CN 201010221943A CN 101882163 A CN101882163 A CN 101882163A
Authority
CN
China
Prior art keywords
address
matching
administrative division
field
rule
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN2010102219439A
Other languages
Chinese (zh)
Inventor
程昌秀
于滨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Institute of Geographic Sciences and Natural Resources of CAS
Original Assignee
Institute of Geographic Sciences and Natural Resources of CAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Institute of Geographic Sciences and Natural Resources of CAS filed Critical Institute of Geographic Sciences and Natural Resources of CAS
Priority to CN2010102219439A priority Critical patent/CN101882163A/en
Publication of CN101882163A publication Critical patent/CN101882163A/en
Pending legal-status Critical Current

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to a fuzzy Chinese address geographic evaluation method based on a matching rule, which belongs to the field of the address evaluation of a geographic information system. The method comprises the following steps: firstly, reading in an address character string and a standard address base; inquiring and dividing administrative division parts in the address character string and filtering reduced target data sets; then realizing the segmentation and the matching of an address by means of a matching rule tree and a rule base in allusion to the fuzzy problems of address element incompletion, address ambiguity and the like frequently appearing in the address character string, and returning a matching record meeting requirements. The invention integrates two important links of address segmentation and database matching in geographic evaluation, realizes that the database matching of the address is completed while in segmentation, effectively solves the address matching problem of the fuzzy Chinese address and improves the accuracy and correctness of address matching.

Description

A kind of fuzzy Chinese address geographic evaluation method based on matched rule
Technical field
The invention belongs to the Geographic Information System field, involvement aspect is to the address assignment method of fuzzy Chinese address.
Background technology
Along with the application of electronic chart with popularize, all trades and professions all ubiquity a large amount of Chinese address data by natural language description, need be mapped as geographic coordinate, and navigate on the electronic chart, thereby make original non-space data obtain volume coordinate information, the integration of the data of realization all departments and each geographic range is with shared.This just need use geography (address) evaluating technology, promptly the text address translation is become the technology of geographic coordinate.Geographic evaluation method generally is divided into the address standardization, address participle, database matching, several steps such as space orientation.
External geographic evaluation technology is mature on the whole, but is still waiting research for the geographic evaluation method of Chinese address.The one, because the difference between the Chinese and English, such as the existence that does not have problems such as the space separates between the speech of Chinese address and the speech.The 2nd, because the existing place name of China, the address system is complicated unusually, the address system confusion, unordered, lack regular and unified standard.Therefore, external existing geographic evaluation technology also is not suitable for China's actual conditions, and it is infeasible directly applying mechanically external geographic evaluation technology.
At present, domestic sectors and scholar have carried out Chinese address Standardization Research Work successively, for the good data basis has been established in the foundation in storehouse, normal address.But ordinary people is in input during its address that need locate, often Shu Ru some fuzzy Chinese address often." No. 5 Dong Xinglou of Dongzhimennei Dajie " are row with the normal address, the address of ordinary people input may be multifarious, as contain " Dong Xinglou " of " No. 5 Xi Dongxinglou of Dongzhimen bridge Dongzhimennei Dajie ", the information incompleteness of redundant information, easily cause ambiguity " No. 5, Dongzhimen ", use another name " No. 5 Dong Xinglou in round-mouthed food vessel with two or four loop handles street " etc.Matching addresses how to carry out fuzzy Chinese is geographic evaluation method enters the practical stage in China a major issue.
In addition, China different regions, different industries are to the accuracy requirement difference of matching addresses.For example, in the delivery of rural area mail, matching addresses minimum row administrative division " village " gets final product, and then may need to navigate in the mail in city is delivered " street Taoist monastic name+number " or " sub-district+building number+room number " etc.Therefore, in order to improve the versatility of Chinese address assignment method, need research how the Chinese address of participle to be carried out matching addresses based on user-defined matched rule.
Summary of the invention
The technical problem to be solved in the present invention is: overcome the deficiencies in the prior art, propose a kind of rule-based fuzzy Chinese address geographic evaluation method; This method can realize the participle and the coupling of fuzzy Chinese address based on storehouse, normal address and set matching addresses rule, thus the geographic evaluation of implementation model Chinese address.
The technical scheme that the present invention is adopted for its technical matters of solution is: a kind of rule-based fuzzy Chinese address geographic evaluation method may further comprise the steps:
(1) data are prepared:
A) Input Address character string Addr;
B) read in storehouse, whole normal address, as target data set RecSet; Comprise following content in the storehouse, normal address: the administrative division of 12 coded representation; Store five fields of lowest address key element in the detailed street address, i.e. road 1, number 2, residential quarters 3, building plate 4, point of interest POI (Point of Interest) 5; The field that is used for storage space information, storage geographic coordinate, longitude and latitude or buildings coding;
C) read in the administrative division code table; Comprise following field in the table: sequence number, administrative division title, administrative division rank, 12 codes of administrative division;
D) the definition matching rule base is summarized as common address formula with the address, and according to field number corresponding in the storehouse, described normal address in the step b) in the step (1), and it is regular and store among the text Rule every address formula to change into corresponding address;
(2) read in storehouse, normal address, administrative division code table and rule base; The address character string is represented with Addr as record to be matched; The storehouse, normal address is represented with RecSet as the initial target data set;
(3) administrative division among the Addr partly is converted to 12 codes, dwindles target data set:
A) in the administrative division code table, the administrative division part is discerned and split out to the administrative division mark words that exists among the search Addr according to described administrative division mark words from Addr;
B) if the administrative division mark words that searches is a plurality of, the level attribute that then compares each administrative division mark words, determine the minimum word of administrative grade in the administrative division mark words, and in view of the above described administrative division partly is converted into and minimum corresponding 12 the administrative division codes of determining of word of administrative grade;
C) filter RecSet, 12 records that the administrative division code is not inconsistent removing and obtain;
D) be Addr with the address character string of the administrative division of removing part assignment again;
(4) Addr is carried out address participle and coupling, stores word segmentation result into array Addr_Split[i] lining, matching result is stored in the data set RecSet:
A) substring of definition Addr is Sub, at first gives Sub with whole Addr assignment, and definition ambiguity stack is Stack, and Stack is used for storing the semantic ambiguity that coupling produces, and the element of storing among the Stack is structure variable Struct (i);
B) inquire address matching rule base according to the number of times in rule searching storehouse, limits the search field scope in the step c) in the step (4); If be the n time rule searching storehouse, every rule in the access library successively then is defined as the search field scope with the intersection of n field in every rule;
C) judge whether Sub is empty:
I), continue then to check whether Stack is empty, if Stack also is empty, then it fails to match for the address participle, and entire method stops withdrawing from if Sub is empty; If Stack is not empty, then take out stack top element, each value among the Struct (i) is composed given relevant variable according to principle first-in last-out, and according to the value of storing in the matching field component of storing in the structure variable, for coupling, forward this field mark in the step (4) step e);
If ii) Sub is not empty, then call the maximum forward matching algorithm, according to the field that limits in the step b) in the step (4), in the RecSet respective field, search for record respectively with the Sub coupling;
D) judge the field number of mating with Sub:
I) if it fails to match, then continue to call the maximum forward matching algorithm and carry out participle, forward the step c) in the step (4) to;
If ii) Pi Pei field number is greater than 1, then Sub is stored into participle array Addr_Split[i] in; Owing to produced ambiguity, a plurality of components of each ambiguity situation are stored among the structure variable Struct (i), and deposit among the Stack successively; Take out stack top element, and, this field mark is coupling according to the matching field of storing in the stack top element;
If iii) Pi Pei field number equals 1, then Sub is stored into participle array Addr_Split[i], and the matching field that inquires is labeled as coupling;
E) rule searching storehouse, comparison has been labeled as field and every rule of coupling, and whether check has the rule that satisfies condition to exist:
I) if exist, then return word segmentation result array Addr_Split and matching result data set RecSet, entire method stops withdrawing from;
Ii) if there is no, then the substring Sub that inquires is removed in character string Addr, assignment is Sub=Addr-Sub again, and the step b) of returning in the step step (4) is proceeded the participle coupling.
The advantage that the present invention is compared with prior art had is as follows:
(1) two link address participles with geographic evaluation have been incorporated into matching addresses, promptly carry out the database address coupling in participle, have realized also having found when participle is finished the record that is mated.Obviously, can effectively reduce the queried access number of times of database by this method, thereby accelerate matching speed.
(2) at first partly come target data set is once filtered in the algorithm, realized, thereby improved the matching efficiency of algorithm for administrative division in the address date and detailed separating of street address two parts information by the administrative division in the identification string.Then, when calling the maximum forward matching algorithm, to the search of data set, can reach and laterally dwindle target data set during by successful participle each time, accelerate the purpose of matching speed.
The meaning of one's words ambiguity that produces when (3) mating for participle, by set up an ambiguity tree and utilize stack to store temporarily,, the ambiguity tree is traveled through visit in the algorithm then according to the depth-first principle, stop algorithm until satisfying matched rule, otherwise continue to finish whole ambiguity traversal of tree.Thereby solved the participle matching problem of the fuzzy address of the first kind.
(4) by rule tree, on the one hand can be real-time be limited in the storehouse, normal address each search in step the time alternative field scope.For example, be that the alternative field of qualification is 1,3,5 when searching for for the first time, if search matched is to field 3 for the first time, then according to rule tree, limiting field when searching for for the second time can only be 4.By that analogy, thereby realize vertically dwindling target data set, further accelerate the purpose of matching speed; On the other hand, utilize the matched rule in the tree, the fuzzy address date of second class for incomplete address key element also can improve it and be matched to power.
Description of drawings
Fig. 1 is rule-based fuzzy Chinese address geographic evaluation method realization flow figure of the present invention;
Fig. 2 is storehouse, normal address, a Beijing partial data;
Fig. 3 is the sign indicating number section meaning of administrative division code;
Fig. 4 is Beijing's administrative division code table partial data;
The rule of Fig. 5 for comprising in the rule base;
Fig. 6 is the rule tree synoptic diagram;
Fig. 7 is address participle coupling process flow diagram (is example with character string " Room 1120, Building C, No. 22 building, north side, the peaceful village, Haidian ");
Fig. 8 is the experimental result statistics;
Fig. 9 is the geographic evaluation result signal of the fuzzy Chinese address of part.
Embodiment
Introduce the present invention in detail below in conjunction with the drawings and the specific embodiments.
Rule-based fuzzy Chinese address geographic evaluation method of the present invention, realization flow figure as shown in Figure 1.Here choosing Chinese address " Room 1120, Building C, No. 22 building, north side, the peaceful village, Haidian " describes specific implementation process of the present invention.At first the address is analyzed, this address is made up of administrative division part (" Haidian ") and detailed street address part (" Room 1120, Building C, No. 22 building, north side, the peaceful village ").Wherein comprise following several respects problem:
A) administrative division is partly described imperfectly, and " Haidian " vocabulary is stated fuzzy;
B) " the peaceful village " vocabulary has been stated ambiguity, may be link name, also may be cell name;
C) there is the key element incompleteness in the address, lacks information such as number;
D) " north side " is redundant interfere information with " Room 1120, Building C ".
This shows, have common fuzzy problem in the incomplete two class addresses of semantic ambiguity and address key element in this address, very representative.Below be example (with reference to figure 7) just with above-mentioned address, describe specific implementation process of the present invention in detail, its concrete steps:
(1) data are prepared:
A) Input Address character string Addr is " Room 1120, Building C, No. 22 building, north side, the peaceful village, Haidian ";
B) read in storehouse, whole normal address, as target data set RecSet.Comprise following content in the storehouse, normal address: the administrative division of 12 coded representation; Store five fields of lowest address key element in the detailed street address, i.e. road, number, residential quarters, building plate, POI (point of interest); The field that is used for storage space information, this field can be geographic coordinate, longitude and latitude or buildings coding etc.Beijing's part of standards address base data as shown in Figure 2.
C) read in the administrative division code table.The part of relevant administrative division in each address all is converted into 12 administrative division codes of a correspondence.Its coded system and sign indicating number section meaning be (with reference to figure 3) specifically: per two of first six digits is represented one-level, represent one-level for per three in back six, if known administrative division is partly described and is not sufficiently complete, cause being converted into behind the code 12 of less thaies, then back figure place 0 polishing.Beijing's part administrative division code table as shown in Figure 4.
D) definition matching rule base, the address of Beijing area is summarized as six kinds of common address formula, and (be road (1), number (2), residential quarters (3), building plate (4), point of interest POI (Point of Interest) (5) according to each field number corresponding in the storehouse, normal address in the formula of address, it is defined as rule, for example address pattern " road+residential quarters+building plate " is defined as " rule one: 1,3,4 ", Beijing's matching rule base is with reference to figure 5
(2) judge whether there is the administrative division part in the character string, by search comparison administrative division table, find " Haidian " speech and record " Haidian District " fuzzy matching in the character string, thus it is split, and be converted to corresponding 12 codes " 110108000000 ".RecSet filters to target data set, and the record that administrative division code field and this code first six digits " 110108 " are not inconsistent removes.
(3) residue character string " Room 1120, Building C, No. 22 building, north side, the peaceful village " is imported into address participle matching module, by inquire address matched rule tree (with reference to figure 6), the limit search field is 1,3,5.Call the maximum forward matching algorithm, inquire " the peaceful village " speech respectively with two field fuzzy matching of 1 (link name) and 3 (residential quarters name), therefore produce semantic ambiguity.The word segmentation result array charged in " the peaceful village " speech, and 1 field and 3 fields is successively stacked, get stack top element then, earlier " the peaceful village " is matched 3 fields (residential quarters name), the record that meets the demands among the data set RecSet is reduced into 8 simultaneously.This moment, owing to the rule that does not satisfy condition, mated so " Room 1120, Building C, No. 22 building, north side " are proceeded participle in the rule searching storehouse.The rule searching tree determines that alternative field is 4 for the second time.Continue to call maximum matching algorithm, inquiry residue substring " Room 1120, Building C, No. 22 building, north side " in 4 fields (building plate) in RecSet, no matching result.So choose stack top element again, " the peaceful village " is matched 1 field (link name), to filter back current data set RecSet and be reduced into 11, the rule searching storehouse does not have satisfied end condition, so continue the participle coupling.Again rule searching tree and definite alternative field are 2,3,4.Call the maximum forward matching algorithm, in the qualification field of RecSet, search character string " Room 1120, Building C, No. 22 building, north side ", find " No. 22 building " speech, with 4 fields (building plate) matched record is arranged, the word segmentation result phrase charged in this speech, and the record that meets the demands among the data set RecSet is reduced into 1 simultaneously.Continue the rule searching storehouse this moment, find to have the rule that satisfies condition, be i.e. rule three (1 and 4 fields match).Return the residue record among the target data set R, the participle matching algorithm successfully stops.Operation result: for fuzzy address " Room 1120, Building C, No. 22 building, north side, the peaceful village, Haidian ", word segmentation result is " the peaceful village, No. 22 building "; In target data set RecSet, found a fuzzy matching record " No. 15 No. 22 building, East Road, the peaceful village ".
With 1827 the fuzzy enterprises in Beijing in the national economic census is example, mates with the inventive method, and the result of coupling as shown in Figure 8.In above-mentioned experiment, the address that success is mated has 1527, accounts for 83.6% of sum.Wherein, one class address descriptor is more complete, or be the normal address, or the fuzzy address of second class for itself there being fraction address key element incompleteness, they have satisfied rule request in the participle matching process of address, returned correct matching result, totally 1292 of the addresses that this class is mated fully account for 70.7% of sum; Though the first kind address that another kind of address itself exists the description ambiguity to cause is fuzzy, but intelligent inference by the PROGRAM REASONING IN TEMPORAL LOGIC machine, finally find the record of satisfied rule and returned correct matching result, totally 235 of the addresses of this type of fuzzy matching, account for 12.9% of sum, the normal address and the sterically defined result that have obtained after having provided " Jintai Fudi Building ", " No. 6 building, happy beautify-house garden " in the experiment, " the peaceful village rub No. 1 building, yard garden " 3 fuzzy addresses among Fig. 9 the match is successful.The enterprise address that can't mate for wherein 300 (account for sum 16.4%), main reason by following two aspects causes: the one, because there is mistake in address itself, for example, the area is non-test block data under the address, exist this type of wrong address less, totally 9; The 2nd, because the description of address itself is too fuzzy, such as addresses such as " Xisanqi, Haidian District, Beijing City Qiao Xi ", " the Xi Erqi word Ning Zhuan north of a road, Qinghe, Haidian District, Beijing City sides ", they cause can't matching the address record that satisfies rule request owing to too bluring in the storehouse, normal address, this class mistake is in the great majority in the address that can't mate, totally 291.As seen, method of the present invention can realize the geographic evaluation of fuzzy Chinese address, and has higher locating accuracy.
The above only is a preferred implementation of the present invention; should be pointed out that for those skilled in the art, under the prerequisite that does not break away from the principle of the invention; can also make some improvements and modifications, these improvements and modifications also should be considered as protection scope of the present invention.

Claims (5)

1. fuzzy Chinese address geographic evaluation method based on matched rule is characterized in that step is as follows:
(1) data are prepared:
A) Input Address character string Addr;
B) read in storehouse, whole normal address, as target data set RecSet; Comprise following content in the storehouse, normal address: the administrative division of 12 coded representation; Store five fields of lowest address key element in the detailed street address, i.e. road 1, number 2, residential quarters 3, building plate 4, point of interest POI (Point of Interest) 5; The field that is used for storage space information, storage geographic coordinate, longitude and latitude or buildings coding;
C) read in the administrative division code table; Comprise following field in the table: sequence number, administrative division title, administrative division rank, 12 codes of administrative division;
D) the definition matching rule base is summarized as the address formula with the address, and according to field number corresponding in the storehouse, described normal address in the step b) in the step (1), and it is regular and store among the text Rule every address formula to change into corresponding address;
(2) read in storehouse, normal address, administrative division code table and rule base; The address character string is represented with Addr as record to be matched; The storehouse, normal address is represented with RecSet as the initial target data set;
(3) administrative division among the Addr partly is converted to 12 codes, dwindles target data set:
A) in the administrative division code table, the administrative division part is discerned and split out to the administrative division mark words that exists among the search Addr according to described administrative division mark words from Addr;
B) if the administrative division mark words that searches is a plurality of, the level attribute that then compares each administrative division mark words, determine the minimum word of administrative grade in the administrative division mark words, and in view of the above described administrative division partly is converted into and minimum corresponding 12 the administrative division codes of determining of word of administrative grade;
C) filter RecSet, 12 records that the administrative division code is not inconsistent removing and obtain;
D) be Addr with the address character string of the administrative division of removing part assignment again;
(4) Addr is carried out address participle and coupling, stores word segmentation result into array Addr_Split[i] lining, matching result is stored in the data set RecSet:
A) substring of definition Addr is Sub, at first gives Sub with whole Addr assignment, and definition ambiguity stack is Stack, and Stack is used for storing the semantic ambiguity that coupling produces, and the element of storing among the Stack is structure variable Struct (i);
B) inquire address matching rule base according to the number of times in rule searching storehouse, limits the search field scope in the step c) in the step (4); If be the n time rule searching storehouse, every rule in the access library successively then is defined as the search field scope with the intersection of n field in every rule;
C) judge whether Sub is empty:
I), continue then to check whether Stack is empty, if Stack also is empty, then it fails to match for the address participle, and entire method stops withdrawing from if Sub is empty; If Stack is not empty, then take out stack top element, each value among the Struct (i) is composed given relevant variable according to principle first-in last-out, and according to the value of storing in the matching field component of storing in the structure variable, for coupling, forward this field mark in the step (4) step e);
If ii) Sub is not empty, then call the maximum forward matching algorithm, according to the field that limits in the step b) in the step (4), in the RecSet respective field, search for record respectively with the Sub coupling;
D) judge the field number of mating with Sub:
I) if it fails to match, then continue to call the maximum forward matching algorithm and carry out participle, forward the step c) in the step (4) to;
If ii) Pi Pei field number is greater than 1, then Sub is stored into participle array Addr_Split[i] in; Owing to produced ambiguity, a plurality of components of each ambiguity situation are stored among the structure variable Struct (i), and deposit among the Stack successively; Take out stack top element, and, this field mark is coupling according to the matching field of storing in the stack top element;
If iii) Pi Pei field number equals 1, then Sub is stored into participle array Addr_Split[i], and the matching field that inquires is labeled as coupling;
E) rule searching storehouse, comparison has been labeled as field and every rule of coupling, and whether check has the rule that satisfies condition to exist:
I) if exist, then return word segmentation result array Addr_Split and matching result data set RecSet, entire method stops withdrawing from;
Ii) if there is no, then the substring Sub that inquires is removed in character string Addr, assignment is Sub=Addr-Sub again, and the step b) of returning in the step step (4) is proceeded the participle coupling.
2. method according to claim 1 is characterized in that:
In storehouse, described normal address, the administrative division in the Chinese address partly is stored as 12 codes; Detailed street address partly is divided into multiple different lowest address key element to be stored respectively; And set up a field separately and be used for storage space information.
3. method according to claim 2 is characterized in that:
In described definition matching rule base, five kinds of lowest address key element fields that comprise in the storehouse, normal address are replaced by digital 1-5 respectively: road 1, number 2, residential quarters 3, building plate 4, point of interest POI (Point of Interest) 5; Sum up common address formula, it is defined as corresponding a plurality of matching addresses rule.
4. method according to claim 1 is characterized in that:
Rank, title and corresponding 12 codes of storage administrative divisions at different levels in described administrative division code table utilize this table the part of relevant administrative division in each address all to be converted into 12 administrative division codes of a correspondence.
5. method according to claim 1 is characterized in that:
All semantic ambiguity situations that produce in the storage address participle matching process in the described ambiguity stack; Each element in the stack is represented a kind of ambiguity situation, is represented by structure variable Struct (i); Among each structure variable Struct (i), stored a plurality of global variables when producing a certain ambiguity, comprised target data set RecSet, residue character string Addr-Sub, write down the array MatchField that all have been labeled as matching field.
CN2010102219439A 2010-06-30 2010-06-30 Fuzzy Chinese address geographic evaluation method based on matching rule Pending CN101882163A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2010102219439A CN101882163A (en) 2010-06-30 2010-06-30 Fuzzy Chinese address geographic evaluation method based on matching rule

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2010102219439A CN101882163A (en) 2010-06-30 2010-06-30 Fuzzy Chinese address geographic evaluation method based on matching rule

Publications (1)

Publication Number Publication Date
CN101882163A true CN101882163A (en) 2010-11-10

Family

ID=43054177

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2010102219439A Pending CN101882163A (en) 2010-06-30 2010-06-30 Fuzzy Chinese address geographic evaluation method based on matching rule

Country Status (1)

Country Link
CN (1) CN101882163A (en)

Cited By (64)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN102289467A (en) * 2011-07-22 2011-12-21 浙江百世技术有限公司 Method and device for determining target site
CN102306161A (en) * 2011-07-22 2012-01-04 浙江百世技术有限公司 Method for multi-region repeated detection and equipment
CN103279539A (en) * 2013-06-04 2013-09-04 百度在线网络技术(北京)有限公司 Interest point set displaying method, electronic map displaying method, interest point set displaying device and electronic map displaying device
CN103558926A (en) * 2013-11-12 2014-02-05 金蝶软件(中国)有限公司 Geographical name entry method and geographical name entry device
CN103605752A (en) * 2013-11-21 2014-02-26 武大吉奥信息技术有限公司 Address matching method based on semantic recognition
CN103678708A (en) * 2013-12-30 2014-03-26 小米科技有限责任公司 Method and device for recognizing preset addresses
CN103714100A (en) * 2012-10-02 2014-04-09 信义房屋仲介股份有限公司 Fuzzy address display system and display method
CN103744854A (en) * 2013-11-15 2014-04-23 北京正图数创信息技术有限公司 Address data matching mining platform based on big data storage and mining technology
CN103984735A (en) * 2014-05-21 2014-08-13 北京京东尚科信息技术有限公司 Method and device for generating recommended delivery place name
CN104281578A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Region marking method and device for data file
CN104484790A (en) * 2014-12-26 2015-04-01 清华大学深圳研究生院 Address match method and device of logistics business
CN104537062A (en) * 2014-12-29 2015-04-22 北京牡丹电子集团有限责任公司数字电视技术中心 Address information extracting method and system
CN104615782A (en) * 2015-03-02 2015-05-13 武汉工程大学 Address matching method based on sliding window maximum matching algorithm
CN104657361A (en) * 2013-11-18 2015-05-27 阿里巴巴集团控股有限公司 Data processing method and data processing device
CN104657486A (en) * 2015-03-02 2015-05-27 武汉工程大学 Method for trustworthiness computing of administrative division based on multiple factors
CN104679801A (en) * 2013-12-03 2015-06-03 高德软件有限公司 Point of interest searching method and point of interest searching device
CN104899213A (en) * 2014-03-06 2015-09-09 阿里巴巴集团控股有限公司 Method and device for resolving organization names
CN105022748A (en) * 2014-04-28 2015-11-04 北京图盟科技有限公司 Waybill address classified method and apparatus
CN105069056A (en) * 2015-07-24 2015-11-18 湖北文理学院 Character string matching based method and system for analyzing address information of identification card
CN105209858A (en) * 2013-03-15 2015-12-30 邓白氏公司 Non-deterministic disambiguation and matching of business locale data
CN105426351A (en) * 2015-11-11 2016-03-23 中国建设银行股份有限公司 Participle processing method and system for customer address information
WO2016050088A1 (en) * 2014-09-30 2016-04-07 华为技术有限公司 Address search method and device
CN105677700A (en) * 2015-12-23 2016-06-15 武汉工程大学 Chinese address administrative division analytic method based on set operation
CN105786922A (en) * 2014-12-25 2016-07-20 高德软件有限公司 Method and equipment for determining missing electronic map data
CN106407221A (en) * 2015-07-31 2017-02-15 阿里巴巴集团控股有限公司 Address data retrieval method and apparatus
WO2017063531A1 (en) * 2015-10-14 2017-04-20 阿里巴巴集团控股有限公司 Account mapping method and device based on address information
CN106599303A (en) * 2016-12-29 2017-04-26 苏碧云 Address matching method and system
CN106649803A (en) * 2016-12-29 2017-05-10 华南师范大学 Address matching method and system
CN106709065A (en) * 2017-01-19 2017-05-24 国家电网公司 Standardization processing method and standardized processing device for address information
CN106796606A (en) * 2014-10-10 2017-05-31 歌乐株式会社 searching system
CN106846166A (en) * 2016-12-08 2017-06-13 北京中电普华信息技术有限公司 A kind of power marketing customer profile improving method based on the analysis of address big data
CN106919569A (en) * 2015-12-24 2017-07-04 北京四维图新科技股份有限公司 A kind of method and device of the administrative division information for obtaining point of interest POI
CN106934036A (en) * 2017-03-15 2017-07-07 衡阳师范学院 A kind of method and system of Network Learning Resource aggregate query
CN106959961A (en) * 2016-01-11 2017-07-18 阿里巴巴集团控股有限公司 A kind of Address Recognition method and device
CN106970903A (en) * 2016-01-13 2017-07-21 阿里巴巴集团控股有限公司 The processing method and processing device of address information in logistics system
CN107967313A (en) * 2017-11-21 2018-04-27 中科宇图科技股份有限公司 A kind of method for merging different industries data based on field data and coordinate general character
CN108628811A (en) * 2018-04-10 2018-10-09 北京京东尚科信息技术有限公司 The matching process and device of address text
CN108959236A (en) * 2017-05-19 2018-12-07 百度在线网络技术(北京)有限公司 Medical literature disaggregated model training method, medical literature classification method and its device
CN108959244A (en) * 2018-06-07 2018-12-07 北京京东尚科信息技术有限公司 The method and apparatus of address participle
CN109145169A (en) * 2018-07-26 2019-01-04 浙江省测绘科学技术研究院 A kind of address matching method based on statistics participle
CN109165273A (en) * 2018-08-24 2019-01-08 安徽讯飞智能科技有限公司 General Chinese address matching method facing big data environment
CN109255458A (en) * 2018-09-26 2019-01-22 蜜小蜂智慧(北京)科技有限公司 A kind of method and apparatus of identification registration
CN109344263A (en) * 2018-08-01 2019-02-15 昆明理工大学 A kind of address matching method
CN109933797A (en) * 2019-03-21 2019-06-25 东南大学 Geocoding and system based on Jieba participle and address dictionary
CN110334162A (en) * 2019-05-09 2019-10-15 德邦物流股份有限公司 Address Recognition method and device
CN110674367A (en) * 2019-09-09 2020-01-10 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110688851A (en) * 2019-09-26 2020-01-14 税友软件集团股份有限公司 Method, device and medium for extracting key information of address text
CN110795472A (en) * 2019-11-11 2020-02-14 集奥聚合(北京)人工智能科技有限公司 Address standardization method, system, equipment and medium based on fuzzy matching
CN110909110A (en) * 2018-09-17 2020-03-24 阿里巴巴集团控股有限公司 Address standardization method and device, storage medium and processor
CN111159973A (en) * 2019-12-13 2020-05-15 中关村科技软件股份有限公司 Administrative division completion and standardization method for Chinese addresses
CN111353309A (en) * 2019-12-25 2020-06-30 北京合力亿捷科技股份有限公司 Method and system for processing communication quality complaint address based on text analysis
CN111737315A (en) * 2020-06-15 2020-10-02 中国工商银行股份有限公司 Address fuzzy matching method and device
CN111797182A (en) * 2020-05-29 2020-10-20 深圳市跨越新科技有限公司 Address code analysis method and system
CN111930829A (en) * 2020-06-18 2020-11-13 中国移动通信集团内蒙古有限公司 Standard address generation method, device, equipment and medium
CN112052407A (en) * 2020-08-28 2020-12-08 深圳市彬讯科技有限公司 Service area query method and device, computer equipment and readable storage medium
CN112069276A (en) * 2020-08-31 2020-12-11 平安科技(深圳)有限公司 Address coding method and device, computer equipment and computer readable storage medium
CN112084773A (en) * 2020-08-21 2020-12-15 国网湖北省电力有限公司电力科学研究院 Power grid power failure address matching method based on word bank bidirectional maximum matching method
CN112581252A (en) * 2020-12-03 2021-03-30 信用生活(广州)智能科技有限公司 Address fuzzy matching method and system fusing multidimensional similarity and rule set
CN112835897A (en) * 2021-01-29 2021-05-25 上海寻梦信息技术有限公司 Geographic region division management method, data conversion method and related equipment
CN112861532A (en) * 2019-11-12 2021-05-28 北京四维图新科技股份有限公司 Address standardization processing method, device and equipment and online search system
CN114676229A (en) * 2022-04-20 2022-06-28 国网安徽省电力有限公司滁州供电公司 Technical improvement major repair project file management system and management method
CN115809315A (en) * 2022-11-24 2023-03-17 中科星图智慧科技安徽有限公司 Geographical name and address standardized matching algorithm
CN116226362A (en) * 2023-05-06 2023-06-06 湖南德雅曼达科技有限公司 Word segmentation method for improving accuracy of searching hospital names

Cited By (100)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN102289467A (en) * 2011-07-22 2011-12-21 浙江百世技术有限公司 Method and device for determining target site
CN102306161A (en) * 2011-07-22 2012-01-04 浙江百世技术有限公司 Method for multi-region repeated detection and equipment
CN103714100A (en) * 2012-10-02 2014-04-09 信义房屋仲介股份有限公司 Fuzzy address display system and display method
CN105209858A (en) * 2013-03-15 2015-12-30 邓白氏公司 Non-deterministic disambiguation and matching of business locale data
CN105209858B (en) * 2013-03-15 2018-11-16 邓白氏公司 The uncertainty of business location's data disappears qi and matching
CN103279539A (en) * 2013-06-04 2013-09-04 百度在线网络技术(北京)有限公司 Interest point set displaying method, electronic map displaying method, interest point set displaying device and electronic map displaying device
CN104281578B (en) * 2013-07-02 2017-11-03 威盛电子股份有限公司 The region labeling method and device of data file
CN104281578A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Region marking method and device for data file
CN103558926A (en) * 2013-11-12 2014-02-05 金蝶软件(中国)有限公司 Geographical name entry method and geographical name entry device
CN103744854A (en) * 2013-11-15 2014-04-23 北京正图数创信息技术有限公司 Address data matching mining platform based on big data storage and mining technology
CN104657361A (en) * 2013-11-18 2015-05-27 阿里巴巴集团控股有限公司 Data processing method and data processing device
CN103605752A (en) * 2013-11-21 2014-02-26 武大吉奥信息技术有限公司 Address matching method based on semantic recognition
CN104679801A (en) * 2013-12-03 2015-06-03 高德软件有限公司 Point of interest searching method and point of interest searching device
CN104679801B (en) * 2013-12-03 2019-02-12 高德软件有限公司 A kind of interest point search method and device
CN103678708A (en) * 2013-12-30 2014-03-26 小米科技有限责任公司 Method and device for recognizing preset addresses
CN103678708B (en) * 2013-12-30 2017-01-18 小米科技有限责任公司 Method and device for recognizing preset addresses
CN104899213B (en) * 2014-03-06 2018-06-05 阿里巴巴集团控股有限公司 A kind of method and apparatus for parsing institution term
CN104899213A (en) * 2014-03-06 2015-09-09 阿里巴巴集团控股有限公司 Method and device for resolving organization names
CN105022748A (en) * 2014-04-28 2015-11-04 北京图盟科技有限公司 Waybill address classified method and apparatus
CN105022748B (en) * 2014-04-28 2019-05-07 高德软件有限公司 A kind of waybill address hierarchy method and device
CN103984735A (en) * 2014-05-21 2014-08-13 北京京东尚科信息技术有限公司 Method and device for generating recommended delivery place name
CN103984735B (en) * 2014-05-21 2017-02-15 北京京东尚科信息技术有限公司 Method and device for generating recommended delivery place name
WO2016050088A1 (en) * 2014-09-30 2016-04-07 华为技术有限公司 Address search method and device
CN105528372A (en) * 2014-09-30 2016-04-27 华为技术有限公司 An address search method and apparatus
CN105528372B (en) * 2014-09-30 2019-05-24 华为技术有限公司 A kind of address search method and equipment
US10783171B2 (en) 2014-09-30 2020-09-22 Huawei Technologies Co., Ltd. Address search method and device
CN106796606A (en) * 2014-10-10 2017-05-31 歌乐株式会社 searching system
CN106796606B (en) * 2014-10-10 2020-07-17 歌乐株式会社 Retrieval system
CN105786922A (en) * 2014-12-25 2016-07-20 高德软件有限公司 Method and equipment for determining missing electronic map data
CN105786922B (en) * 2014-12-25 2020-02-14 高德软件有限公司 Method and device for determining missing electronic map data
CN104484790A (en) * 2014-12-26 2015-04-01 清华大学深圳研究生院 Address match method and device of logistics business
CN104537062A (en) * 2014-12-29 2015-04-22 北京牡丹电子集团有限责任公司数字电视技术中心 Address information extracting method and system
CN104615782A (en) * 2015-03-02 2015-05-13 武汉工程大学 Address matching method based on sliding window maximum matching algorithm
CN104657486A (en) * 2015-03-02 2015-05-27 武汉工程大学 Method for trustworthiness computing of administrative division based on multiple factors
CN104615782B (en) * 2015-03-02 2017-10-10 武汉工程大学 Address matching process based on sliding window maximum matching algorithm
CN104657486B (en) * 2015-03-02 2018-01-19 武汉工程大学 A kind of method that confidence level based on polyfactorial administrative division calculates
CN105069056A (en) * 2015-07-24 2015-11-18 湖北文理学院 Character string matching based method and system for analyzing address information of identification card
CN106407221B (en) * 2015-07-31 2020-02-07 菜鸟智能物流控股有限公司 Address data retrieval method and device
CN106407221A (en) * 2015-07-31 2017-02-15 阿里巴巴集团控股有限公司 Address data retrieval method and apparatus
US10725737B2 (en) 2015-10-14 2020-07-28 Alibaba Group Holding Limited Address information-based account mapping method and apparatus
US10990353B2 (en) 2015-10-14 2021-04-27 Advanced New Technologies Co., Ltd. Address information-based account mapping method and apparatus
WO2017063531A1 (en) * 2015-10-14 2017-04-20 阿里巴巴集团控股有限公司 Account mapping method and device based on address information
CN105426351A (en) * 2015-11-11 2016-03-23 中国建设银行股份有限公司 Participle processing method and system for customer address information
CN105426351B (en) * 2015-11-11 2019-01-25 中国建设银行股份有限公司 A kind of participle processing method and system of customer address information
CN105677700A (en) * 2015-12-23 2016-06-15 武汉工程大学 Chinese address administrative division analytic method based on set operation
CN105677700B (en) * 2015-12-23 2018-12-14 武汉工程大学 A kind of Chinese address administrative division analytic method based on set operation
CN106919569A (en) * 2015-12-24 2017-07-04 北京四维图新科技股份有限公司 A kind of method and device of the administrative division information for obtaining point of interest POI
CN106959961A (en) * 2016-01-11 2017-07-18 阿里巴巴集团控股有限公司 A kind of Address Recognition method and device
CN106970903B (en) * 2016-01-13 2020-08-04 菜鸟智能物流控股有限公司 Method and device for processing address information in logistics system
CN106970903A (en) * 2016-01-13 2017-07-21 阿里巴巴集团控股有限公司 The processing method and processing device of address information in logistics system
CN106846166A (en) * 2016-12-08 2017-06-13 北京中电普华信息技术有限公司 A kind of power marketing customer profile improving method based on the analysis of address big data
CN106649803A (en) * 2016-12-29 2017-05-10 华南师范大学 Address matching method and system
CN106599303A (en) * 2016-12-29 2017-04-26 苏碧云 Address matching method and system
CN106709065B (en) * 2017-01-19 2020-08-04 国家电网公司 Address information standardization processing method and device
CN106709065A (en) * 2017-01-19 2017-05-24 国家电网公司 Standardization processing method and standardized processing device for address information
CN106934036A (en) * 2017-03-15 2017-07-07 衡阳师范学院 A kind of method and system of Network Learning Resource aggregate query
CN108959236A (en) * 2017-05-19 2018-12-07 百度在线网络技术(北京)有限公司 Medical literature disaggregated model training method, medical literature classification method and its device
CN108959236B (en) * 2017-05-19 2021-11-09 百度在线网络技术(北京)有限公司 Medical literature classification model training method, medical literature classification method and device thereof
CN107967313A (en) * 2017-11-21 2018-04-27 中科宇图科技股份有限公司 A kind of method for merging different industries data based on field data and coordinate general character
CN107967313B (en) * 2017-11-21 2022-02-01 中科宇图科技股份有限公司 Method for combining data of different industries based on field data and coordinate commonality
CN108628811A (en) * 2018-04-10 2018-10-09 北京京东尚科信息技术有限公司 The matching process and device of address text
CN108959244B (en) * 2018-06-07 2022-08-09 北京京东尚科信息技术有限公司 Address word segmentation method and device
CN108959244A (en) * 2018-06-07 2018-12-07 北京京东尚科信息技术有限公司 The method and apparatus of address participle
CN109145169B (en) * 2018-07-26 2021-03-26 浙江省测绘科学技术研究院 Address matching method based on statistical word segmentation
CN109145169A (en) * 2018-07-26 2019-01-04 浙江省测绘科学技术研究院 A kind of address matching method based on statistics participle
CN109344263A (en) * 2018-08-01 2019-02-15 昆明理工大学 A kind of address matching method
CN109165273A (en) * 2018-08-24 2019-01-08 安徽讯飞智能科技有限公司 General Chinese address matching method facing big data environment
CN109165273B (en) * 2018-08-24 2021-10-26 安徽讯飞智能科技有限公司 General Chinese address matching method facing big data environment
CN110909110B (en) * 2018-09-17 2023-05-30 阿里巴巴集团控股有限公司 Address standardization method and device, storage medium and processor
CN110909110A (en) * 2018-09-17 2020-03-24 阿里巴巴集团控股有限公司 Address standardization method and device, storage medium and processor
CN109255458A (en) * 2018-09-26 2019-01-22 蜜小蜂智慧(北京)科技有限公司 A kind of method and apparatus of identification registration
CN109933797A (en) * 2019-03-21 2019-06-25 东南大学 Geocoding and system based on Jieba participle and address dictionary
CN110334162A (en) * 2019-05-09 2019-10-15 德邦物流股份有限公司 Address Recognition method and device
CN110674367A (en) * 2019-09-09 2020-01-10 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110674367B (en) * 2019-09-09 2022-02-01 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110688851B (en) * 2019-09-26 2023-07-28 亿企赢网络科技有限公司 Method, device and medium for extracting key information of address text
CN110688851A (en) * 2019-09-26 2020-01-14 税友软件集团股份有限公司 Method, device and medium for extracting key information of address text
CN110795472A (en) * 2019-11-11 2020-02-14 集奥聚合(北京)人工智能科技有限公司 Address standardization method, system, equipment and medium based on fuzzy matching
CN112861532B (en) * 2019-11-12 2024-04-02 北京四维图新科技股份有限公司 Address standardization processing method, device, equipment and online searching system
CN112861532A (en) * 2019-11-12 2021-05-28 北京四维图新科技股份有限公司 Address standardization processing method, device and equipment and online search system
CN111159973A (en) * 2019-12-13 2020-05-15 中关村科技软件股份有限公司 Administrative division completion and standardization method for Chinese addresses
CN111159973B (en) * 2019-12-13 2023-06-02 中关村科技软件股份有限公司 Administrative division alignment and standardization method for Chinese addresses
CN111353309A (en) * 2019-12-25 2020-06-30 北京合力亿捷科技股份有限公司 Method and system for processing communication quality complaint address based on text analysis
CN111797182A (en) * 2020-05-29 2020-10-20 深圳市跨越新科技有限公司 Address code analysis method and system
CN111797182B (en) * 2020-05-29 2024-01-30 深圳市跨越新科技有限公司 Address code analysis method and system
CN111737315B (en) * 2020-06-15 2023-08-11 中国工商银行股份有限公司 Address fuzzy matching method and device
CN111737315A (en) * 2020-06-15 2020-10-02 中国工商银行股份有限公司 Address fuzzy matching method and device
CN111930829A (en) * 2020-06-18 2020-11-13 中国移动通信集团内蒙古有限公司 Standard address generation method, device, equipment and medium
CN112084773A (en) * 2020-08-21 2020-12-15 国网湖北省电力有限公司电力科学研究院 Power grid power failure address matching method based on word bank bidirectional maximum matching method
CN112052407A (en) * 2020-08-28 2020-12-08 深圳市彬讯科技有限公司 Service area query method and device, computer equipment and readable storage medium
CN112052407B (en) * 2020-08-28 2024-05-03 深圳市彬讯科技有限公司 Service area query method, device, computer equipment and readable storage medium
CN112069276A (en) * 2020-08-31 2020-12-11 平安科技(深圳)有限公司 Address coding method and device, computer equipment and computer readable storage medium
CN112069276B (en) * 2020-08-31 2024-03-08 平安科技(深圳)有限公司 Address coding method, address coding device, computer equipment and computer readable storage medium
CN112581252A (en) * 2020-12-03 2021-03-30 信用生活(广州)智能科技有限公司 Address fuzzy matching method and system fusing multidimensional similarity and rule set
CN112835897B (en) * 2021-01-29 2024-03-15 上海寻梦信息技术有限公司 Geographic area division management method, data conversion method and related equipment
CN112835897A (en) * 2021-01-29 2021-05-25 上海寻梦信息技术有限公司 Geographic region division management method, data conversion method and related equipment
CN114676229A (en) * 2022-04-20 2022-06-28 国网安徽省电力有限公司滁州供电公司 Technical improvement major repair project file management system and management method
CN115809315A (en) * 2022-11-24 2023-03-17 中科星图智慧科技安徽有限公司 Geographical name and address standardized matching algorithm
CN116226362A (en) * 2023-05-06 2023-06-06 湖南德雅曼达科技有限公司 Word segmentation method for improving accuracy of searching hospital names

Similar Documents

Publication Publication Date Title
CN101882163A (en) Fuzzy Chinese address geographic evaluation method based on matching rule
CN109145169B (en) Address matching method based on statistical word segmentation
CN102033954B (en) Full text retrieval inquiry index method for extensible markup language document in relational database
EP3407223B1 (en) Location based full text search
CN109359200A (en) Place name address date intelligently parsing system
CN111324679B (en) Method, device and system for processing address information
CN109101474B (en) Address aggregation method, package aggregation method and equipment
CN103605752A (en) Address matching method based on semantic recognition
CN104866593A (en) Database searching method based on knowledge graph
CN109933797A (en) Geocoding and system based on Jieba participle and address dictionary
CN112612863B (en) Address matching method and system based on Chinese word segmentation device
CN104252507B (en) A kind of business data matching process and device
CN107463711A (en) A kind of tag match method and device of data
CN111522892A (en) Geographic element retrieval method and device
CN104391908A (en) Locality sensitive hashing based indexing method for multiple keywords on graphs
CN114780680A (en) Retrieval and completion method and system based on place name and address database
CN114168705B (en) Chinese address matching method based on address element index
CN107644050A (en) A kind of querying method and device of the Hbase based on solr
CN101963993B (en) Method for fast searching database sheet table record
EP2783308B1 (en) Full text search based on interwoven string tokens
CN114153821A (en) Electric quantity graph database construction and search method based on graph theory
CN112905642B (en) Method for storing IEC61850 report data into relational database based on CSV mapping file
CN102385597B (en) The fault-tolerant searching method of a kind of POI
CN105740374A (en) Distributed memory based three-dimensional platform data fuzzy query method
CN108932351B (en) Method and system for generating route map of carbon capture and sequestration technology

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C12 Rejection of a patent application after its publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20101110