CN101719128A - Fuzzy matching-based Chinese geo-code determination method - Google Patents

Fuzzy matching-based Chinese geo-code determination method

Info

Publication number
CN101719128A
Authority
CN
China
Prior art keywords
address
matching
tree
chinese
original
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN200910156650A
Other languages
Chinese (zh)
Other versions
CN101719128B (en)
Inventor
张贵军
吴海涛
洪榛
俞立
郭海峰
何尚秋
陈宁宁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhejiang University of Technology ZJUT
Original Assignee
Zhejiang University of Technology ZJUT
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhejiang University of Technology ZJUT filed Critical Zhejiang University of Technology ZJUT
Priority to CN2009101566504A priority Critical patent/CN101719128B/en
Publication of CN101719128A publication Critical patent/CN101719128A/en
Application granted granted Critical
Publication of CN101719128B publication Critical patent/CN101719128B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

A fuzzy matching-based method for determining Chinese geocodes, comprising the following steps: A1. read in descriptive Chinese address information and, taking administrative-division levels as breakpoints, segment the original address with the forward maximum search method to obtain an array of original address elements; A2. standardize the original address elements using the address dictionary; A3. read the standard address tree and match the original address element array using a branch-and-bound algorithm. At the same time, fuzzy rules are applied to control the matching operation: after the keywords of the segmented original address are obtained, the candidate with the highest evaluation score is taken as the closest matching result, which gives a more accurate matched address. The invention provides a fuzzy matching-based Chinese geocoding determination method with a reasonable address model, a higher matching rate, and good speed.

Description

Fuzzy matching-based Chinese geocoding determination method
Technical field
The present invention relates to geographic information data processing and computer applications, and in particular to a geocoding method based on fuzzy matching.
Background art
Geocoding is the process of establishing a correspondence between address descriptions and coordinates; in other words, it is a conversion tool between the description of a place and the spatial position of that place. For a long time, because effective spatial analysis techniques were lacking, the analysis and processing of spatial data could not meet the needs of scientific decision-making and management, so the value of spatial data in decision-making and management was never realized. Address matching makes it possible to fuse geographic information systems with spatial information and to advance the informatization of urban space, so that spatial analysis and decision-making applications can be carried out more effectively and conveniently.
In recent years, with the continuous development and refinement of geographic information technology, geocoding technology has also kept improving. Research abroad in this area is relatively mature. Davis proposed a theory of multi-mode cross-positioning, but it applies only to regions that already have a geocoding standard, and its multiple spatial information databases introduce redundancy and reduce matching efficiency. Duncan proposed a unified coding scheme based on equal-area cells, but geocoding standards differ across Chinese cities and regions, and once such a complex coding standard is in place, any change triggers large-scale modifications at too high a cost. Bakshi et al. proposed a geocoding technique based on a text-token partitioning scheme; it works well for English addresses, but because the way Chinese is entered differs considerably from English, its effect on Chinese address matching is not significant. In China, address matching technology has only just started, and most of the work has been on the application side, for example the "Addressing God" product of a Beijing computer company and the Map Searcher of Founder Digit. When applied to cities and towns, however, such application systems suffer from problems such as a single address model and an insufficiently high matching rate.
Therefore, the existing technology has deficiencies in encoding Chinese addresses for cities and towns and needs to be improved.
Summary of the invention
To overcome the shortcomings of existing Chinese geocoding methods, namely a single address model, an insufficiently high matching rate, and low speed, the present invention provides a fuzzy matching-based Chinese geocoding determination method with a reasonable address model, a higher matching rate, and good speed.
The technical solution adopted by the present invention to solve this technical problem is:
A fuzzy matching-based Chinese geocoding determination method, comprising the following steps:
A1. Read in descriptive Chinese address information and, taking administrative-division levels as breakpoints, segment the original address with the forward maximum search method to obtain an array of original address elements;
A2. Standardize the original address elements using the address dictionary;
A3. Read the standard address tree and match the original address element array using a branch-and-bound algorithm: build an address database in address-tree storage format; following the hierarchical division of China's administrative regions, build a tree-shaped address store in which the highest-level administrative unit is the root node of the address tree and its subordinate administrative regions are stored as child nodes. Based on the address elements and house number obtained by segmenting the descriptive Chinese address information, the matching process first reads the standard address tree R, then matches the keyword of the highest administrative level among the segmented candidate address elements against the address nodes of the corresponding administrative level in R; after a successful match, the irrelevant branch trees are discarded and the relevant branch tree is kept for matching at the next administrative level;
At the same time, fuzzy rules are applied to control the matching operation. After the keywords of the segmented original address are obtained, the method further comprises:
Optimizing the matching operation with fuzzy matching rules, defined as follows. Let the field to be matched be the string address, of length h, and let the standard field be the string std_address, of length H. The set of std_address strings satisfying address ∩ std_address ≠ Φ is defined as the set that satisfies the matching condition, where address ∩ std_address ≠ Φ means that the string address and the standard field string std_address have a non-empty intersection; in the end, the set elements with high membership are kept. The matching rules are:
1. If the standard string std_address and the matched string address have i characters in common, the membership degree is i/H;
2. If the standard string std_address contains the matched string address, the membership degree is 1;
After the membership degree is obtained, let μ be the matching membership degree; it is converted into a quantified score according to the mapping rule f: sc → μ, with mapping function sc = f(μ) = 10 × μ, and sc is taken as the evaluation score of the candidate record;
The candidate with the highest evaluation score is taken as the closest matching result, which gives a more accurate matched address.
As a preferred scheme, the Chinese geocoding determination method further comprises:
A4. If the matched address contains a house number, perform spatial positioning. House numbers on urban roads are assumed to be distributed on the two sides of the road by the odd-even rule: either odd numbers on the left and even numbers on the right in the forward direction, or odd numbers on the right and even numbers on the left. The house numbers at the road's inflection points are recorded together with their geographic coordinates. After the house number in the original address is obtained, the two inflection points it lies between are determined. Supposing the matched house number lies between inflection points A and B, least-squares linear interpolation is carried out with A and B as reference points to obtain the specific geographic coordinates of the house number on the road, which are finally located on the map.
Further, in step A3, the candidate address array obtained by standardizing the original address is defined as address[i], 0 < i < N. The matching score between a standard address node and the candidate element of the corresponding level is denoted sc_i, where i is the level to which the node belongs and N is the depth of the initial address tree. The matching evaluation rules are as follows:
Rule 1: Does the address tree node match the candidate element exactly? Y → exact match; N → fuzzy match;
Rule 2: After an exact match, is a feasible solution found? Y → the matching algorithm moves down a level; N → return to the upper-level node to search for an approximate solution;
Rule 3: Is there a missing item? Y → keep the upper-level branch tree; N → keep the current-level branch tree;
Rule 4: If there is a missing item, sc_i = 0, where i is the level at which the item is missing;
Rule 5: The final score of a candidate record is the sum of the matching scores of its nodes at every level:
sc = ∑ sc_i
Further still, in step A3, an auxiliary place-name database is provided: important and frequently used geographic locations that have a second characteristic identity are stored in a separate database.
In step A1, taking the first character of the acquired original address as the starting point, the address database is searched for a corresponding standard address name. If one exists, the address information is read and kept, and the matched characters are cut from the original address string; otherwise the next character is read and appended to the previous characters to form a string, and the search for a corresponding standard address name in the address database continues. Reading proceeds in this way until the address elements of all administrative levels are determined.
In step A2, if the segmented candidate address array has a missing item, its higher-level address is obtained from the address database according to the address element of the next level down and written into the candidate address element array.
In step A2, a database of address abbreviations and aliases is designed: a dedicated information database that stores all current standard address information together with its aliases and abbreviations.
In step A2, typos in the segmented address elements are corrected. Suppose the entered address information contains a typo, that is, a segmented address element has no exactly corresponding standard address name in the address dictionary; the standard address name closest to the entered address information is then returned and replaces the entered address information.
The technical concept of the present invention is as follows. The originally entered address information is obtained first, and a word segmentation algorithm then segments the entered original address to obtain keywords describing the spatial position that corresponds to the original address. The city's standard address data are stored as a K-ary tree, where K is determined by the actual number of administrative units at each level. The keywords are matched in the standard address tree; during matching, a branch-and-bound algorithm optimizes the matching algorithm, while fuzzy rules precisely control the matching operation and the matching results are scored and screened, yielding at least one address that matches the original address exactly or approximately. Applying the branch-and-bound matching algorithm on top of the tree-shaped address storage model reduces the scale of the address tree, lowers the algorithmic complexity of the address matching process, and improves the efficiency and accuracy of address matching.
The beneficial effects of the present invention are mainly that it lowers the algorithmic complexity of the geocoding process and improves the efficiency and accuracy of geocoding.
Description of drawings
Fig. 1 is a flowchart of the fuzzy matching-based Chinese geocoding determination method.
Fig. 2 is a schematic diagram of the standard address tree.
Fig. 3 is a schematic diagram of the matching rules.
Fig. 4 is a schematic diagram of the odd-even distribution of house numbers along a road.
Fig. 5 is a schematic diagram of loading the initial address tree, extracting the branch tree rooted at "Zhejiang" after a successful exact match, and deleting the invalid branch trees.
Fig. 6 is a schematic diagram of evaluating address[2] = "Hangzhou" and, after a successful exact match, extracting the branch tree rooted at "Hangzhou"; then evaluating address[3] = "East Lake" and, after a successful exact match, extracting the branch tree rooted at "East Lake".
Fig. 7 is a schematic diagram of evaluating address[4] = "Liuxia": the current branch tree has no feasible solution, so the algorithm returns to the parent of the current root node "East Lake", enables fuzzy matching, obtains the branch trees that satisfy the partial matching condition, and matches the keyword "Liuxia" again.
Fig. 8 is a schematic diagram of evaluating address[5] = "Liuhe": the child nodes of the current branch tree's root cannot be matched exactly, so fuzzy matching is started and the partially matching branch trees are obtained; address[6] = "288" is then evaluated against all partially matching branch trees.
Embodiment
The present invention is further described below with reference to the accompanying drawings.
With reference to Figs. 1 to 8, a fuzzy matching-based Chinese geocoding method, as shown in Fig. 1, comprises the following steps:
A1. Read in descriptive Chinese address information and, taking administrative-division levels as breakpoints, segment the original address with the forward maximum search method to obtain an array of original address elements.
A2. Standardize the original address elements using the address dictionary, obtaining an array of address elements after normalization operations such as correcting abbreviations and aliases, fixing misspellings, and filling in missing items.
A3. Read the standard address tree and match the original address element array using a branch-and-bound algorithm, while applying fuzzy rules to control the matching operation, to obtain a more accurate matched address.
A4. If the matched address contains a house number, perform spatial positioning with an interpolation algorithm that uses inflection points as references.
In step A1 of the method, a standard entry pattern is established for Chinese address information with reference to China's administrative division standard:
Administrative address pattern: province (municipality directly under the central government) → city → district (county, county-level city). Regional address pattern: street (town) → village (road) → house number. An example of standard address information: Zhejiang Province, Hangzhou City, Xihu District, Liuxia Town, Liuhe Road, No. 288.
In step A1 of the method, taking the first character of the acquired original address as the starting point, the address database is searched for a corresponding standard address name. If one exists, the address information is read and kept, and the matched characters are cut from the original address string; otherwise the next character is read and appended to the previous characters to form a string, and the search for a corresponding standard address name in the address database continues. Reading proceeds in this way until the address elements of all administrative levels are determined.
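The segmentation in step A1 can be sketched roughly as follows. This is a minimal illustration of forward maximum matching (at each position the longest dictionary entry wins), not the patent's own implementation; the function name, the `max_len` limit, the sample dictionary, and the choice to collect unmatched characters (such as the house number) into one trailing element are all assumptions made for the example.

```python
def forward_max_segment(address, dictionary, max_len=8):
    """Split `address` into elements with forward maximum matching."""
    elements, unmatched, pos = [], "", 0
    while pos < len(address):
        match = None
        # Try the longest candidate first, then shrink it one character at a time.
        for end in range(min(len(address), pos + max_len), pos, -1):
            if address[pos:end] in dictionary:
                match = address[pos:end]
                break
        if match is None:
            # No dictionary entry starts here: buffer the character.
            unmatched += address[pos]
            pos += 1
            continue
        if unmatched:
            elements.append(unmatched)
            unmatched = ""
        elements.append(match)
        pos += len(match)
    if unmatched:
        elements.append(unmatched)
    return elements


# Hypothetical dictionary of standard address names (illustrative only).
NAMES = {"浙江省", "杭州", "杭州市", "西湖区", "留下镇", "留和路"}
print(forward_max_segment("浙江省杭州市西湖区留下镇留和路288号", NAMES))
```

Because the longest candidate is tried first, "杭州市" is preferred over the shorter entry "杭州" even though both are in the dictionary.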
In step A2 of the method, if the segmented candidate address array has a missing item, its higher-level address is obtained from the address database according to the address element of the next level down and written into the candidate address element array.
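One way to picture this fill-in step is a lookup from each address element to its parent. The sketch below assumes a hypothetical child-to-parent table standing in for the address database, and assumes that names are unique across the tree; neither assumption comes from the patent.

```python
# Hypothetical child -> parent table standing in for the address database.
PARENT = {
    "Xihu District": "Hangzhou City",
    "Hangzhou City": "Zhejiang Province",
}


def fill_missing_parents(elements, parent=PARENT):
    """Insert any higher-level elements that are missing before each element."""
    filled = []
    for elem in elements:
        # Walk upward from `elem`, collecting ancestors not yet in the output.
        chain = []
        up = parent.get(elem)
        while up is not None and up not in filled:
            chain.append(up)
            up = parent.get(up)
        filled.extend(reversed(chain))  # highest level first
        filled.append(elem)
    return filled


print(fill_missing_parents(["Zhejiang Province", "Xihu District"]))
```

In a real address database the same name can occur under several parents, so a production version would need extra context to pick the right one.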
In step A2 of the method, a database of address abbreviations and aliases is designed: a dedicated information database that stores all current standard address information together with its aliases and abbreviations. If a segmented candidate address is an alias or abbreviation, it is identified and standardized to the standard name; for example, the one-character abbreviation of Shandong is standardized to "Shandong", and that of Shanghai to "Shanghai".
In step A2 of the method, typos in the segmented address elements are corrected. Suppose the entered address information contains a typo, that is, a segmented address element has no exactly corresponding standard address name in the address dictionary; the standard address name closest to the entered address information is then returned and replaces the entered address information. For example, if a misspelled form of "Liuhe Road" is entered and the address dictionary contains only the standard "Liuhe Road", the standard name replaces the misspelled one.
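The alias lookup and typo correction of step A2 might look like the sketch below. The patent does not say how "closest" is measured, so Python's standard `difflib.get_close_matches` is used here purely as a stand-in; the alias table, the name list, and the 0.6 cutoff are illustrative assumptions.

```python
import difflib

# Hypothetical alias table and standard-name list (illustrative only).
ALIASES = {"沪": "上海", "鲁": "山东"}
STANDARD_NAMES = ["上海", "山东", "留和路", "留下镇"]


def normalize(element, aliases=ALIASES, names=STANDARD_NAMES):
    """Map an address element to a standard name where possible."""
    if element in names:
        return element           # already standard
    if element in aliases:
        return aliases[element]  # alias or abbreviation
    # Typo correction: fall back to the most similar standard name, if any.
    close = difflib.get_close_matches(element, names, n=1, cutoff=0.6)
    return close[0] if close else element


print(normalize("沪"), normalize("留合路"))
```

Elements with no sufficiently similar standard name are returned unchanged, which leaves them to the fuzzy matching in step A3.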
Step A3 of the method comprises the following: the address database is read and stored in address-tree form, with the highest-level administrative unit as the root node of the address tree and its subordinate administrative regions stored as child nodes, as shown in Fig. 2.
Step A3 of the method further comprises the following: given the tree-shaped storage of address information, a branch-and-bound algorithm is used to optimize the matching process. The keyword of the highest administrative level among the candidate address elements is first matched against the address information of the corresponding level in the address tree; if the match succeeds, the matched node and its branch tree are kept, and the other, unrelated nodes at the same level and their branch trees are discarded. The candidate address array obtained by standardizing the original address is defined as address[i], 0 < i < N. The matching score between a standard address node and the candidate element of the corresponding level is denoted sc_i, where i is the level to which the node belongs and N is the depth of the initial address tree. The matching evaluation rules are as follows:
Rule 1: Does the address tree node match the candidate element exactly? Y → exact match; N → fuzzy match;
Rule 2: After an exact match, is a feasible solution found? Y → the matching algorithm moves down a level; N → return to the upper-level node to search for an approximate solution;
Rule 3: Is there a missing item? Y → keep the upper-level branch tree; N → keep the current-level branch tree;
Rule 4: If there is a missing item, sc_i = 0, where i is the level at which the item is missing;
Rule 5: The final score of a candidate record is the sum of the matching scores of its nodes at every level:
sc = ∑ sc_i
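The five rules can be read as a pruned, level-by-level tree search. The sketch below is one possible interpretation under stated assumptions, not the patented implementation: the tree is a nested dictionary with made-up distractor nodes, a missing element is represented by `None`, and the per-level score uses a character-overlap measure scaled to 10.

```python
from collections import Counter

# Hypothetical standard address tree: keys are nodes, values are their children.
TREE = {
    "Zhejiang": {
        "Hangzhou": {
            "West Lake": {"Liuxia": {}},
            "East Lake": {"Other Town": {}},  # distractor branch
        },
        "Ningbo": {},
    },
    "Jiangsu": {"Nanjing": {}},
}


def score(element, name):
    """Per-level score sc_i: 10 if `name` contains `element`, else 10 * i / H."""
    if element in name:
        return 10.0
    shared = sum((Counter(element) & Counter(name)).values())
    return 10.0 * shared / len(name)


def match(children, elements):
    """Return (total score, path) pairs for the branches kept, best first."""
    if not elements or not children:
        return [(0.0, [])]
    head, rest = elements[0], elements[1:]

    def expand(kept):
        # Rule 5: a record's final score is the sum of its per-level scores.
        return [(sc_i + sub, [name] + path)
                for sc_i, name in kept
                for sub, path in match(children[name], rest)]

    def fuzzy(skip=None):
        # Rule 1 (N): keep only branches that partially match `head`.
        scored = [(score(head, n), n) for n in children if n != skip]
        return [(s, n) for s, n in scored if s > 0]

    if head is None:
        # Rules 3 and 4: a missing item scores 0 and prunes nothing.
        results = expand([(0.0, n) for n in children])
    elif head in children:
        # Rule 1 (Y): exact match discards the siblings. Rule 2 (N): if nothing
        # feasible lies below it, go back and try approximate siblings instead.
        results = expand([(10.0, head)]) or expand(fuzzy(skip=head))
    else:
        results = expand(fuzzy())
    return sorted(results, key=lambda r: -r[0])


print(match(TREE, ["Zhejiang", "Hangzhou", "East Lake", "Liuxia"])[0])
```

In this toy tree, "East Lake" matches exactly but has no "Liuxia" beneath it, so the search falls back to the sibling "West Lake", which mirrors the backtracking described in Rule 2.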
Step A3 of the method further comprises using fuzzy rules to control the matching operation: if no address node at the same level of the address tree matches completely, the fuzzy rules are enabled to obtain an approximate matching result. For example, if the entered county-level keyword is "East Lake" and the county-level nodes in the address tree include only "West Lake", the node "West Lake" and its branch tree are kept as the matching result, and the other nodes at the same level and their branch trees are discarded.
Step A3 of the method further comprises scoring the matching results quantitatively. Complete matches and approximate matches are given different scores; the result with the higher score is returned as the closest match, and results with lower scores are returned as fairly close matches. The quantization rules are as follows:
Let the field to be matched be the string address, of length h, and let the standard field be the string std_address, of length H. The set of std_address strings satisfying address ∩ std_address ≠ Φ is defined as the set that satisfies the matching condition, where address ∩ std_address ≠ Φ means that the string address and the standard field string std_address have a non-empty intersection; in the end, the set elements with high membership are kept. The matching rules are defined as follows (see Fig. 3):
1. If the standard string std_address and the matched string address have i characters in common, the membership degree is i/H;
2. If the standard string std_address contains the matched string address, the membership degree is 1.
After the membership degree is obtained, let μ be the matching membership degree; it is converted into a quantified score according to the mapping rule f: sc → μ, with mapping function sc = f(μ) = 10 × μ, and sc is taken as the evaluation score of the candidate record.
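As a small illustration of the two membership rules and the score mapping, the sketch below counts the characters the two strings share regardless of position. That reading of "i characters in common" is an assumption, as are the function names and the handling of empty strings.

```python
from collections import Counter


def membership(address, std_address):
    """Membership degree of `address` with respect to `std_address`."""
    if not address or not std_address:
        return 0.0
    if address in std_address:
        return 1.0  # rule 2: the standard string contains the entered string
    # Rule 1: i shared characters out of H characters in the standard string.
    shared = sum((Counter(address) & Counter(std_address)).values())
    return shared / len(std_address)


def score(address, std_address):
    """Evaluation score sc = f(mu) = 10 * mu."""
    return 10 * membership(address, std_address)


print(score("东湖", "西湖"), score("西湖", "西湖区"))
```

A membership of zero corresponds to an empty intersection, so such candidates fall outside the set that satisfies the matching condition.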
Step A3 of the method further comprises providing an auxiliary place-name database: important and frequently used geographic locations that have a second characteristic identity are stored in a separate database. For example, the second characteristic identity of "Zhejiang Province, Hangzhou City, Xihu District, Liuxia Town, Liuhe Road, No. 288" is "Zhejiang University of Technology, Pingfeng Campus"; if the original address entered is "Zhejiang University of Technology, Pingfeng Campus", it is located directly at the geographic position of "Zhejiang Province, Hangzhou City, Xihu District, Liuxia Town, Liuhe Road, No. 288".
Step A4 of the method comprises the following: after the final matching result is obtained, spatial interpolation positioning is carried out according to the house number. If there is no house number, the address is located at the geometric center of the smallest administrative unit in the original address information; for example, if the original address is accurate only to the street, the position is set to the geometric center of that street. If there is a house number, house numbers on urban roads are assumed to be distributed on the two sides of the road by the odd-even rule: either odd numbers on the left and even numbers on the right in the forward direction, or odd numbers on the right and even numbers on the left (Fig. 4). The house numbers at the road's inflection points are recorded together with their geographic coordinates. After the house number in the original address is obtained, the two inflection points it lies between are determined. Supposing the matched house number lies between inflection points A and B, least-squares linear interpolation is carried out with A and B as reference points to obtain the specific geographic coordinates of the house number on the road, and these coordinates are finally located on the map.
The average time complexity of the branch-and-bound matching algorithm based on the tree-shaped address storage model in the present invention is log_K N, where N is the number of leaf nodes of the K-ary address tree.
In this embodiment, the originally entered address is "Zhejiang Province, Hangzhou City, Donghu (East Lake) District, Liuxia Town, Liuhe Road, No. 288". Segmenting this original address gives the candidate address array address[] (Table 1).
Table 1. Candidate address array
Level    Province    City        District     Town      Road     House number
Value    Zhejiang    Hangzhou    East Lake    Liuxia    Liuhe    288
To illustrate the idea of the algorithm better, some distractor data are added to the address tree to be matched. With the branch-and-bound algorithm introduced, the matching process is as follows:
Step 1: Load the initial address tree and evaluate address[1] = "Zhejiang". After a successful exact match, extract the branch tree rooted at "Zhejiang" and delete the invalid branch trees; sc denotes the total score after each node has been matched against the candidate address segments, as shown in Fig. 5.
Step 2: Evaluate address[2] = "Hangzhou"; after a successful exact match, extract the branch tree rooted at "Hangzhou". Evaluate address[3] = "East Lake"; after a successful exact match, extract the branch tree rooted at "East Lake", as shown in Fig. 6.
Step 3: Evaluate address[4] = "Liuxia". The current branch tree has no feasible solution, so return to the parent of the current root node "East Lake", enable fuzzy matching, obtain the branch trees that satisfy the partial matching condition, and match the keyword "Liuxia" again, as shown in Fig. 7.
Step 4: Evaluate address[5] = "Liuhe". The child nodes of the current branch tree's root cannot be matched exactly, so start fuzzy matching and obtain the partially matching branch trees. Evaluate address[6] = "288" against all partially matching branch trees, as shown in Fig. 8.
After all segments in the candidate address array have been matched, the address records are sorted by final evaluation score, and the record with the highest score is returned as the final matching result, shown by the solid line in Fig. 9.
Step 5: Obtain the house number and read the geographic information of the road in the final matched address, including the house numbers at its inflection points, as shown in Fig. 9. The original house number "No. 288" is found to lie between inflection point A ("No. 268") and inflection point B ("No. 296"). Least-squares interpolation with A and B as reference points gives the spatial position of the original house number on the street; see the "*" position in Fig. 10.
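The interpolation in Step 5 can be sketched as below. With only two reference points, a least-squares line passes through both of them, so the computation reduces to ordinary linear interpolation. The sketch assumes house numbers are spaced evenly between A and B; the coordinates are invented local planar values, since the patent gives none, and the function name is hypothetical.

```python
def interpolate_house_number(number, a, b):
    """Locate `number` between inflection points a and b.

    Each point is (house_number, x, y); coordinates here are hypothetical.
    """
    (na, xa, ya), (nb, xb, yb) = a, b
    if na == nb:
        return xa, ya
    # Fraction of the way from A to B, assuming evenly spaced house numbers.
    t = (number - na) / (nb - na)
    return xa + t * (xb - xa), ya + t * (yb - ya)


A = (268, 0.0, 0.0)    # inflection point A, No. 268 (made-up coordinates)
B = (296, 28.0, 14.0)  # inflection point B, No. 296 (made-up coordinates)
x, y = interpolate_house_number(288, A, B)
print(round(x, 6), round(y, 6))
```

For No. 288 the fraction is (288 − 268) / (296 − 268) = 20/28, so the point lies about 71% of the way from A to B.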
The above describes one embodiment of the present invention and the good optimization effect it shows. Obviously, the present invention is not limited to the above embodiment; many variations can be made and implemented without departing from its essential spirit or going beyond what its substance covers.

Claims (8)

1. A fuzzy matching-based method for determining Chinese geocodes, characterized in that the method comprises the following steps:
A1. Read in descriptive Chinese address information and, taking administrative-division levels as breakpoints, segment the original address with the forward maximum search method to obtain an array of original address elements;
A2. Standardize the original address elements using the address dictionary;
A3. Read the standard address tree and match the original address element array using a branch-and-bound algorithm: build an address database in address-tree storage format; following the hierarchical division of China's administrative regions, build a tree-shaped address store in which the highest-level administrative unit is the root node of the address tree and its subordinate administrative regions are stored as child nodes; based on the address elements and house number obtained by segmenting the descriptive Chinese address information, the matching process first reads the standard address tree R, then matches the keyword of the highest administrative level among the segmented candidate address elements against the address nodes of the corresponding administrative level in R; after a successful match, the irrelevant branch trees are discarded and the relevant branch tree is kept for matching at the next administrative level;
at the same time, fuzzy rules are applied to control the matching operation; after the keywords of the segmented original address are obtained, the method further comprises:
optimizing the matching operation with fuzzy matching rules, defined as follows: let the field to be matched be the string address, of length h, and let the standard field be the string std_address, of length H; the set of std_address strings satisfying address ∩ std_address ≠ Φ is defined as the set that satisfies the matching condition, where address ∩ std_address ≠ Φ means that the string address and the standard field string std_address have a non-empty intersection, and in the end the set elements with high membership are kept; the matching rules are:
① if the standard string std_address and the matched string address have i characters in common, the membership degree is i/H;
② if the standard string std_address contains the matched string address, the membership degree is 1;
after the membership degree is obtained, let μ be the matching membership degree; it is converted into a quantified score according to the mapping rule f: sc → μ, with mapping function sc = f(μ) = 10 × μ, and sc is taken as the evaluation score of the candidate record;
the candidate with the highest evaluation score is taken as the closest matching result, which gives a more accurate matched address.
2. The fuzzy matching-based method for determining Chinese geocodes according to claim 1, characterized in that the method further comprises:
A4. If the matched address contains a house number, perform spatial positioning: house numbers on urban roads are assumed to be distributed on the two sides of the road by the odd-even rule, either odd numbers on the left and even numbers on the right in the forward direction, or odd numbers on the right and even numbers on the left; the house numbers at the road's inflection points are recorded together with their geographic coordinates; after the house number in the original address is obtained, the two inflection points it lies between are determined; supposing the matched house number lies between inflection points A and B, least-squares linear interpolation is carried out with A and B as reference points to obtain the specific geographic coordinates of the house number on the road, which are finally located on the map.
3. The fuzzy matching-based method for determining Chinese geocodes according to claim 1 or 2, characterized in that in step A3 the candidate address array obtained by standardizing the original address is defined as address[i], 0 < i < N; the matching score between a standard address node and the candidate element of the corresponding level is denoted sc_i, where i is the level to which the node belongs and N is the depth of the initial address tree; the matching evaluation rules are as follows:
Rule 1: Does the address tree node match the candidate element exactly? Y → exact match; N → fuzzy match;
Rule 2: After an exact match, is a feasible solution found? Y → the matching algorithm moves down a level; N → return to the upper-level node to search for an approximate solution;
Rule 3: Is there a missing item? Y → keep the upper-level branch tree; N → keep the current-level branch tree;
Rule 4: If there is a missing item, sc_i = 0, where i is the level at which the item is missing;
Rule 5: The final score of a candidate record is the sum of the matching scores of its nodes at every level: sc = ∑ sc_i.
4. The fuzzy matching-based method for determining Chinese geocodes according to claim 1 or 2, characterized in that in step A3 an auxiliary place-name database is provided, in which important and frequently used geographic locations that have a second characteristic identity are stored separately.
5. The fuzzy matching-based method for determining Chinese geocodes according to claim 1 or 2, characterized in that in step A1, taking the first character of the acquired original address as the starting point, the address database is searched for a corresponding standard address name; if one exists, the address information is read and kept, and the matched characters are cut from the original address string; otherwise the next character is read and appended to the previous characters to form a string, and the search for a corresponding standard address name in the address database continues; reading proceeds in this way until the address elements of all administrative levels are determined.
6. The fuzzy matching-based method for determining Chinese geocodes according to claim 1 or 2, characterized in that in step A2, if the segmented candidate address array has a missing item, its higher-level address is obtained from the address database according to the address element of the next level down and written into the candidate address element array.
7. The fuzzy matching-based method for determining Chinese geocodes according to claim 6, characterized in that in step A2 a database of address abbreviations and aliases is designed, being a dedicated information database that stores all current standard address information together with its aliases and abbreviations.
8. The fuzzy matching-based method for determining Chinese geocodes according to claim 7, characterized in that in step A2 typos in the segmented address elements are corrected: supposing the entered address information contains a typo, that is, a segmented address element has no exactly corresponding standard address name in the address dictionary, the standard address name closest to the entered address information is returned and replaces the entered address information.
a kind of Chinese geocoding determination method based on fuzzy matching as claimed in claim 7, is characterized in that: in step A2, the typo error correction of the address element after the segmentation, assumes that there is a typo in the address information entered, That is, the segmented address element cannot find a completely corresponding standard address name in the address dictionary, and returns the standard address name that is closest to the entered address information, and replaces the entered address information.
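The membership scoring of claim 1 and the house-number positioning of claim 2 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation: the function names are hypothetical, the "i identical characters" of rule ① are counted here as a multiset intersection of the two strings, and the least-squares interpolation between inflection points A and B is reduced to plain linear interpolation, since a least-squares line through two reference points passes through both of them.

```python
from collections import Counter


def membership(address: str, std_address: str) -> float:
    """Degree of membership of `address` with respect to `std_address`."""
    if not address or not std_address:
        return 0.0
    # Rule 2: the standard string contains the matching string.
    if address in std_address:
        return 1.0
    # Rule 1: i shared characters over H = len(std_address).
    # Assumption: shared characters are counted as a multiset intersection.
    shared = sum((Counter(address) & Counter(std_address)).values())
    return shared / len(std_address)


def score(mu: float) -> float:
    """Mapping function f(mu) = 10 * mu from claim 1."""
    return 10.0 * mu


def best_match(address: str, candidates: list[str]):
    """Return (candidate, score) with the highest score, or None if nothing overlaps."""
    scored = [(c, score(membership(address, c))) for c in candidates]
    scored = [pair for pair in scored if pair[1] > 0]  # non-empty intersection only
    return max(scored, key=lambda pair: pair[1], default=None)


def locate_house_number(number: int, a: tuple, b: tuple) -> tuple:
    """Interpolate coordinates of `number` between inflection points a and b.

    a and b are (house_number, x, y) tuples; `number` is assumed to lie between them.
    """
    na, xa, ya = a
    nb, xb, yb = b
    if na == nb:
        return (xa, ya)
    t = (number - na) / (nb - na)
    return (xa + t * (xb - xa), ya + t * (yb - ya))
```

For example, `best_match("西胡区", ["西湖区", "江干区"])` selects `"西湖区"`, because two of its three characters are shared with the mistyped input while the other candidate shares only one.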
CN2009101566504A 2009-12-31 2009-12-31 Fuzzy matching-based Chinese geo-code determination method Expired - Fee Related CN101719128B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009101566504A CN101719128B (en) 2009-12-31 2009-12-31 Fuzzy matching-based Chinese geo-code determination method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009101566504A CN101719128B (en) 2009-12-31 2009-12-31 Fuzzy matching-based Chinese geo-code determination method

Publications (2)

Publication Number Publication Date
CN101719128A true CN101719128A (en) 2010-06-02
CN101719128B CN101719128B (en) 2012-05-23

Family

ID=42433702

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009101566504A Expired - Fee Related CN101719128B (en) 2009-12-31 2009-12-31 Fuzzy matching-based Chinese geo-code determination method

Country Status (1)

Country Link
CN (1) CN101719128B (en)

Cited By (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101980208A (en) * 2010-11-10 2011-02-23 百度在线网络技术(北京)有限公司 Address query method and system
CN101996247A (en) * 2010-11-10 2011-03-30 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN102024024A (en) * 2010-11-10 2011-04-20 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN102289467A (en) * 2011-07-22 2011-12-21 浙江百世技术有限公司 Method and device for determining target site
CN102298585A (en) * 2010-06-24 2011-12-28 高德软件有限公司 Address splitting and level marking method and device
CN102393937A (en) * 2011-10-12 2012-03-28 深圳市络道科技有限公司 Address matching method and system of address tree based on backward production
CN102402533A (en) * 2010-09-13 2012-04-04 方正国际软件有限公司 Address matching method and system
CN102446186A (en) * 2010-10-13 2012-05-09 上海众恒信息产业股份有限公司 Chinese geographic coding and decoding method and device
CN102880650A (en) * 2012-08-27 2013-01-16 中国工商银行股份有限公司 Data matching method and device
CN102955832A (en) * 2011-08-31 2013-03-06 深圳市华傲数据技术有限公司 Correspondence address identifying and standardizing system
CN103383682A (en) * 2012-05-01 2013-11-06 刘龙 Geographic coding method, and position inquiring system and method
CN103413215A (en) * 2013-07-12 2013-11-27 广州银联网络支付有限公司 Electronic bank code matching method based on matrix similarity algorithm
CN103440311A (en) * 2013-08-27 2013-12-11 深圳市华傲数据技术有限公司 Method and system for identifying geographical name entities
CN103558926A (en) * 2013-11-12 2014-02-05 金蝶软件(中国)有限公司 Geographical name entry method and geographical name entry device
CN103593468A (en) * 2013-11-27 2014-02-19 北京金和软件股份有限公司 Audio content pushing method
CN104021184A (en) * 2014-06-10 2014-09-03 广州品唯软件有限公司 Positioning method and system
CN104092613A (en) * 2014-07-15 2014-10-08 山东超越数控电子有限公司 Rapid table lookup method based on fuzzy matching
CN104182509A (en) * 2014-08-20 2014-12-03 国家电网公司 Object-oriented address modeling method
CN104182510A (en) * 2014-08-20 2014-12-03 国家电网公司 Object-oriented address modeling method
WO2016050088A1 (en) * 2014-09-30 2016-04-07 华为技术有限公司 Address search method and device
CN105659637A (en) * 2013-09-30 2016-06-08 三星电子株式会社 Caching of locations on a device
CN105760360A (en) * 2014-12-16 2016-07-13 高德软件有限公司 Address correction method and device
WO2016165538A1 (en) * 2015-04-13 2016-10-20 阿里巴巴集团控股有限公司 Address data management method and device
CN106055635A (en) * 2016-05-30 2016-10-26 深圳市华傲数据技术有限公司 Address information searching method and address information searching device
CN106296209A (en) * 2015-06-05 2017-01-04 阿里巴巴集团控股有限公司 Address input control method and device
CN106502978A (en) * 2016-09-19 2017-03-15 浪潮软件股份有限公司 A kind of Chinese address segmenting method and device
CN106528605A (en) * 2016-09-27 2017-03-22 武汉工程大学 A rule-based Chinese address resolution method
CN106649464A (en) * 2016-09-26 2017-05-10 深圳市数字城市工程研究中心 Method of building Chinese address tree and device
CN106709065A (en) * 2017-01-19 2017-05-24 国家电网公司 Standardization processing method and standardized processing device for address information
CN106874384A (en) * 2017-01-10 2017-06-20 广东精规划信息科技股份有限公司 A kind of isomery address standard handovers and matching process
CN106875264A (en) * 2017-03-31 2017-06-20 北京京东尚科信息技术有限公司 Sequence information management method, device and order sorting system
CN107748778A (en) * 2017-10-20 2018-03-02 浪潮软件股份有限公司 A kind of method and device for extracting address
CN108369582A (en) * 2018-03-02 2018-08-03 福建联迪商用设备有限公司 A kind of address error correction method and terminal
CN108959244A (en) * 2018-06-07 2018-12-07 北京京东尚科信息技术有限公司 The method and apparatus of address participle
CN109255564A (en) * 2017-07-13 2019-01-22 菜鸟智能物流控股有限公司 Pick-up point address recommendation method and device
CN109254964A (en) * 2018-08-20 2019-01-22 中国平安人寿保险股份有限公司 Address Standardization method, apparatus, computer equipment and storage medium
CN109344213A (en) * 2018-08-28 2019-02-15 浙江工业大学 A Chinese Geocoding Method Based on Dictionary Tree
CN109784308A (en) * 2019-02-01 2019-05-21 腾讯科技(深圳)有限公司 A kind of address error correction method, device and storage medium
CN109933797A (en) * 2019-03-21 2019-06-25 东南大学 Geocoding method and system based on Jieba word segmentation and address thesaurus
CN110099246A (en) * 2019-02-18 2019-08-06 深度好奇(北京)科技有限公司 Monitoring and scheduling method, apparatus, computer equipment and storage medium
CN110515999A (en) * 2019-08-27 2019-11-29 北京百度网讯科技有限公司 General record processing method, device, electronic device and storage medium
CN110674367A (en) * 2019-09-09 2020-01-10 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110704564A (en) * 2019-09-27 2020-01-17 北京沃东天骏信息技术有限公司 Address error correction method and device
CN110895651A (en) * 2018-08-23 2020-03-20 北京京东金融科技控股有限公司 Address standardization processing method, device, equipment and computer readable storage medium
CN111144117A (en) * 2019-12-26 2020-05-12 同济大学 Knowledge Graph Chinese Address Disambiguation Method
CN111291277A (en) * 2020-01-14 2020-06-16 浙江邦盛科技有限公司 Address standardization method based on semantic recognition and high-level language search
CN111414357A (en) * 2019-01-07 2020-07-14 阿里巴巴集团控股有限公司 Address data processing method, device, system and storage medium
CN111753515A (en) * 2020-06-24 2020-10-09 广东科杰通信息科技有限公司 Address information extraction and matching method for realizing entity positioning
CN111859849A (en) * 2020-07-01 2020-10-30 邦道科技有限公司 Power utilization address management method and device
CN112052413A (en) * 2020-08-28 2020-12-08 上海谋乐网络科技有限公司 URL fuzzy matching method, device and system
CN112364113A (en) * 2020-11-13 2021-02-12 北京明略软件系统有限公司 Address error correction method and system
CN112417179A (en) * 2020-11-23 2021-02-26 杭州橙鹰数据技术有限公司 Address processing method and device
CN112925922A (en) * 2019-12-06 2021-06-08 农业农村部信息中心 Method, device, electronic equipment and medium for obtaining address
CN113204606A (en) * 2021-04-30 2021-08-03 武汉大学 A Semantic Location Network-Based Address Location Inference Method
CN113656450A (en) * 2021-07-12 2021-11-16 大箴(杭州)科技有限公司 Address processing method and device, electronic equipment and storage medium
CN114091454A (en) * 2021-11-29 2022-02-25 重庆市地理信息和遥感应用中心 Method for extracting place name information and positioning space in internet text
CN116910386A (en) * 2023-09-14 2023-10-20 深圳市智慧城市科技发展集团有限公司 Address completion method, terminal device and computer-readable storage medium
CN117874309A (en) * 2024-03-12 2024-04-12 北京全路通信信号研究设计院集团有限公司 Train control data processing method and device, electronic equipment and storage medium

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101350012B (en) * 2007-07-18 2013-01-16 北京灵图软件技术有限公司 Method and system for matching address
CN101350013A (en) * 2007-07-18 2009-01-21 北京灵图软件技术有限公司 Method and system for searching geographical information
CN100535907C (en) * 2007-08-21 2009-09-02 北京大学 Method for extracting entity address message in text context
CN101393544A (en) * 2008-10-07 2009-03-25 南京师范大学 Chinese Address Semantic Analysis Method Oriented to Address Coding

Cited By (92)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102298585B (en) * 2010-06-24 2016-01-13 高德软件有限公司 A kind of address cutting and rank mask method and address cutting and rank annotation equipment
CN102298585A (en) * 2010-06-24 2011-12-28 高德软件有限公司 Address splitting and level marking method and device
CN102402533A (en) * 2010-09-13 2012-04-04 方正国际软件有限公司 Address matching method and system
CN102446186B (en) * 2010-10-13 2016-03-30 上海众恒信息产业股份有限公司 Chinese geocoding and coding/decoding method and device
CN102446186A (en) * 2010-10-13 2012-05-09 上海众恒信息产业股份有限公司 Chinese geographic coding and decoding method and device
CN101996247B (en) * 2010-11-10 2013-02-20 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN101996247A (en) * 2010-11-10 2011-03-30 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN102024024A (en) * 2010-11-10 2011-04-20 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN102024024B (en) * 2010-11-10 2013-07-10 百度在线网络技术(北京)有限公司 Method and device for constructing address database
CN101980208A (en) * 2010-11-10 2011-02-23 百度在线网络技术(北京)有限公司 Address query method and system
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN102289467A (en) * 2011-07-22 2011-12-21 浙江百世技术有限公司 Method and device for determining target site
CN102955832A (en) * 2011-08-31 2013-03-06 深圳市华傲数据技术有限公司 Correspondence address identifying and standardizing system
CN102955832B (en) * 2011-08-31 2015-11-25 深圳市华傲数据技术有限公司 A kind of address identification, standardized system
CN102393937A (en) * 2011-10-12 2012-03-28 深圳市络道科技有限公司 Address matching method and system of address tree based on backward production
CN103383682A (en) * 2012-05-01 2013-11-06 刘龙 Geographic coding method, and position inquiring system and method
CN103383682B (en) * 2012-05-01 2017-12-26 刘龙 A kind of Geocoding, position enquiring system and method
CN102880650A (en) * 2012-08-27 2013-01-16 中国工商银行股份有限公司 Data matching method and device
CN102880650B (en) * 2012-08-27 2015-11-18 中国工商银行股份有限公司 A kind of data matching method and device
CN103413215B (en) * 2013-07-12 2017-02-08 广州银联网络支付有限公司 Electronic bank code matching method based on matrix similarity algorithm
CN103413215A (en) * 2013-07-12 2013-11-27 广州银联网络支付有限公司 Electronic bank code matching method based on matrix similarity algorithm
WO2015027836A1 (en) * 2013-08-27 2015-03-05 深圳市华傲数据技术有限公司 Method and system for place name entity recognition
CN103440311A (en) * 2013-08-27 2013-12-11 深圳市华傲数据技术有限公司 Method and system for identifying geographical name entities
CN105659637A (en) * 2013-09-30 2016-06-08 三星电子株式会社 Caching of locations on a device
CN103558926A (en) * 2013-11-12 2014-02-05 金蝶软件(中国)有限公司 Geographical name entry method and geographical name entry device
CN103593468A (en) * 2013-11-27 2014-02-19 北京金和软件股份有限公司 Audio content pushing method
CN103593468B (en) * 2013-11-27 2016-11-16 北京金和软件股份有限公司 A kind of audio content method for pushing
CN104021184B (en) * 2014-06-10 2017-07-11 广州品唯软件有限公司 A kind of localization method and system
CN104021184A (en) * 2014-06-10 2014-09-03 广州品唯软件有限公司 Positioning method and system
CN104092613A (en) * 2014-07-15 2014-10-08 山东超越数控电子有限公司 Rapid table lookup method based on fuzzy matching
CN104182509A (en) * 2014-08-20 2014-12-03 国家电网公司 Object-oriented address modeling method
CN104182510A (en) * 2014-08-20 2014-12-03 国家电网公司 Object-oriented address modeling method
WO2016050088A1 (en) * 2014-09-30 2016-04-07 华为技术有限公司 Address search method and device
US10783171B2 (en) 2014-09-30 2020-09-22 Huawei Technologies Co., Ltd. Address search method and device
CN105528372A (en) * 2014-09-30 2016-04-27 华为技术有限公司 An address search method and apparatus
CN105760360A (en) * 2014-12-16 2016-07-13 高德软件有限公司 Address correction method and device
CN105760360B (en) * 2014-12-16 2018-09-11 高德软件有限公司 A kind of address correcting method and device
WO2016165538A1 (en) * 2015-04-13 2016-10-20 阿里巴巴集团控股有限公司 Address data management method and device
CN106156145A (en) * 2015-04-13 2016-11-23 阿里巴巴集团控股有限公司 The management method of a kind of address date and device
CN106296209B (en) * 2015-06-05 2021-02-02 菜鸟智能物流控股有限公司 Address input control method and device
CN106296209A (en) * 2015-06-05 2017-01-04 阿里巴巴集团控股有限公司 Address input control method and device
CN106055635B (en) * 2016-05-30 2019-11-19 深圳市华傲数据技术有限公司 Address information lookup method and device
CN106055635A (en) * 2016-05-30 2016-10-26 深圳市华傲数据技术有限公司 Address information searching method and address information searching device
CN106502978A (en) * 2016-09-19 2017-03-15 浪潮软件股份有限公司 A kind of Chinese address segmenting method and device
CN106649464A (en) * 2016-09-26 2017-05-10 深圳市数字城市工程研究中心 Method of building Chinese address tree and device
CN106649464B (en) * 2016-09-26 2019-08-30 深圳市数字城市工程研究中心 A kind of construction method and device of Chinese address tree
CN106528605A (en) * 2016-09-27 2017-03-22 武汉工程大学 A rule-based Chinese address resolution method
CN106874384B (en) * 2017-01-10 2020-12-04 航天精一(广东)信息科技有限公司 Heterogeneous address standard conversion and matching method
CN106874384A (en) * 2017-01-10 2017-06-20 广东精规划信息科技股份有限公司 A kind of isomery address standard handovers and matching process
CN106709065A (en) * 2017-01-19 2017-05-24 国家电网公司 Standardization processing method and standardized processing device for address information
CN106709065B (en) * 2017-01-19 2020-08-04 国家电网公司 Address information standardization processing method and device
CN106875264A (en) * 2017-03-31 2017-06-20 北京京东尚科信息技术有限公司 Sequence information management method, device and order sorting system
CN109255564A (en) * 2017-07-13 2019-01-22 菜鸟智能物流控股有限公司 Pick-up point address recommendation method and device
CN107748778A (en) * 2017-10-20 2018-03-02 浪潮软件股份有限公司 A kind of method and device for extracting address
CN107748778B (en) * 2017-10-20 2021-03-23 浪潮软件股份有限公司 A method and device for extracting addresses
CN108369582B (en) * 2018-03-02 2021-06-25 福建联迪商用设备有限公司 Address error correction method and terminal
WO2019165644A1 (en) * 2018-03-02 2019-09-06 福建联迪商用设备有限公司 Address error correction method and terminal
CN108369582A (en) * 2018-03-02 2018-08-03 福建联迪商用设备有限公司 A kind of address error correction method and terminal
CN108959244B (en) * 2018-06-07 2022-08-09 北京京东尚科信息技术有限公司 Address word segmentation method and device
CN108959244A (en) * 2018-06-07 2018-12-07 北京京东尚科信息技术有限公司 The method and apparatus of address participle
CN109254964A (en) * 2018-08-20 2019-01-22 中国平安人寿保险股份有限公司 Address Standardization method, apparatus, computer equipment and storage medium
CN110895651B (en) * 2018-08-23 2024-02-02 京东科技控股股份有限公司 Address standardization processing method, device, equipment and computer readable storage medium
CN110895651A (en) * 2018-08-23 2020-03-20 北京京东金融科技控股有限公司 Address standardization processing method, device, equipment and computer readable storage medium
CN109344213B (en) * 2018-08-28 2021-06-18 浙江工业大学 A Chinese Geocoding Method Based on Dictionary Tree
CN109344213A (en) * 2018-08-28 2019-02-15 浙江工业大学 A Chinese Geocoding Method Based on Dictionary Tree
CN111414357A (en) * 2019-01-07 2020-07-14 阿里巴巴集团控股有限公司 Address data processing method, device, system and storage medium
CN109784308A (en) * 2019-02-01 2019-05-21 腾讯科技(深圳)有限公司 A kind of address error correction method, device and storage medium
CN109784308B (en) * 2019-02-01 2020-09-29 腾讯科技(深圳)有限公司 Address error correction method, device and storage medium
CN110099246A (en) * 2019-02-18 2019-08-06 深度好奇(北京)科技有限公司 Monitoring and scheduling method, apparatus, computer equipment and storage medium
CN109933797A (en) * 2019-03-21 2019-06-25 东南大学 Geocoding method and system based on Jieba word segmentation and address thesaurus
CN110515999A (en) * 2019-08-27 2019-11-29 北京百度网讯科技有限公司 General record processing method, device, electronic device and storage medium
CN110674367B (en) * 2019-09-09 2022-02-01 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110674367A (en) * 2019-09-09 2020-01-10 广州易起行信息技术有限公司 Single Chinese character retrieval method and device based on travel industry products
CN110704564A (en) * 2019-09-27 2020-01-17 北京沃东天骏信息技术有限公司 Address error correction method and device
CN112925922A (en) * 2019-12-06 2021-06-08 农业农村部信息中心 Method, device, electronic equipment and medium for obtaining address
CN111144117B (en) * 2019-12-26 2023-08-29 同济大学 Disambiguation Method for Chinese Address in Knowledge Graph
CN111144117A (en) * 2019-12-26 2020-05-12 同济大学 Knowledge Graph Chinese Address Disambiguation Method
CN111291277A (en) * 2020-01-14 2020-06-16 浙江邦盛科技有限公司 Address standardization method based on semantic recognition and high-level language search
CN111753515A (en) * 2020-06-24 2020-10-09 广东科杰通信息科技有限公司 Address information extraction and matching method for realizing entity positioning
CN111859849A (en) * 2020-07-01 2020-10-30 邦道科技有限公司 Power utilization address management method and device
CN111859849B (en) * 2020-07-01 2023-11-24 邦道科技有限公司 Management method and device for electricity utilization address
CN112052413A (en) * 2020-08-28 2020-12-08 上海谋乐网络科技有限公司 URL fuzzy matching method, device and system
CN112052413B (en) * 2020-08-28 2024-02-13 上海谋乐网络科技有限公司 URL fuzzy matching method, device and system
CN112364113A (en) * 2020-11-13 2021-02-12 北京明略软件系统有限公司 Address error correction method and system
CN112417179A (en) * 2020-11-23 2021-02-26 杭州橙鹰数据技术有限公司 Address processing method and device
CN113204606A (en) * 2021-04-30 2021-08-03 武汉大学 A Semantic Location Network-Based Address Location Inference Method
CN113656450A (en) * 2021-07-12 2021-11-16 大箴(杭州)科技有限公司 Address processing method and device, electronic equipment and storage medium
CN114091454A (en) * 2021-11-29 2022-02-25 重庆市地理信息和遥感应用中心 Method for extracting place name information and positioning space in internet text
CN116910386B (en) * 2023-09-14 2024-02-02 深圳市智慧城市科技发展集团有限公司 Address completion method, terminal device and computer-readable storage medium
CN116910386A (en) * 2023-09-14 2023-10-20 深圳市智慧城市科技发展集团有限公司 Address completion method, terminal device and computer-readable storage medium
CN117874309A (en) * 2024-03-12 2024-04-12 北京全路通信信号研究设计院集团有限公司 Train control data processing method and device, electronic equipment and storage medium
CN117874309B (en) * 2024-03-12 2024-05-24 北京全路通信信号研究设计院集团有限公司 Train control data processing method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN101719128B (en) 2012-05-23

Similar Documents

Publication Publication Date Title
CN101719128A (en) Fuzzy matching-based Chinese geo-code determination method
CN108369582B (en) Address error correction method and terminal
CN112612863A (en) Address matching method and system based on Chinese word segmentation device
US7917544B2 (en) Method and apparatus for retrieving data representing a postal address from a plurality of postal addresses
CN112528174B (en) Address trimming and complementing method based on knowledge graph and multiple matching and application
CN107145577A (en) Address standardization method, device, storage medium and computer
WO2016165538A1 (en) Address data management method and device
CN104679801B (en) A kind of interest point search method and device
KR100903961B1 (en) High-Dimensional Data Indexing and Retrieval Using Signature Files and Its System
CN105209858B (en) Uncertainty disambiguation and matching of business location data
CN107832404A (en) A kind of complementing method of POI
CN1590964A (en) Iterative logical renewal of navigable map database
CN103914544A (en) Method for quickly matching Chinese addresses in multi-level manner on basis of address feature words
CN106503223B (en) An online housing search method and device combining location and keyword information
CN113505190B (en) Address information correction method, device, computer equipment and storage medium
CN111291099B (en) Address fuzzy matching method and system and computer equipment
CN112364114A (en) Address standardization method and device, computer equipment and storage medium
CN110990520A (en) Address coding method and device, electronic equipment and storage medium
CN114168705B (en) Chinese address matching method based on address element index
CN101844135A (en) Method for sorting postal letters according to addresses driven by address information base
CN113721969A (en) Multi-scale space vector data cascade updating method
CN110390099B (en) An object relation extraction system and extraction method based on template library
CN104346444A (en) Optimum site selection method based on road network reverse spatial keyword query
CN111311173A (en) National county level unit economic arrangement and spatialization method
CN104598887B (en) Recognition methods for non-canonical format handwritten Chinese address

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120523