CN106021336A - A method for automatic administrative district division for mass address information - Google Patents
A method for automatic administrative district division for mass address information Download PDFInfo
- Publication number
- CN106021336A CN106021336A CN201610299934.9A CN201610299934A CN106021336A CN 106021336 A CN106021336 A CN 106021336A CN 201610299934 A CN201610299934 A CN 201610299934A CN 106021336 A CN106021336 A CN 106021336A
- Authority
- CN
- China
- Prior art keywords
- administrative
- information
- division
- community
- address information
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/953—Querying, e.g. by the use of web search engines
- G06F16/9537—Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/22—Matching criteria, e.g. proximity measures
Abstract
The invention provides a method for automatic administrative district division for mass address information. The method comprises the steps of: 10, a preparation stage, 20, a start stage, and 30, a completion stage. The preparation stage includes the steps of obtaining the names of administrative districts and storing the names of the administrative districts, the tree form correlation and codes into a database. The start stage includes the steps of 21, acquiring mass original address information and optimizing original addresses, the optimization including screening and missing inspection; 22, invoking a map API to obtain the longitude and latitude information of each original address; 23, invoking the map API and employing a map search function; 24, acquiring matching results according to the tree form correlation, the matching results being the names of administrative district of all levels correlating with the addresses. The completion stage includes the steps of 31, storing matching-successful results in a database; 32, outputting a log for matching-failing results; 22, counting the number of matching and calculating the hit rate. The method supports multi-level simultaneous search and mass search only based on address information, can furthest increase the hit rate of result matching and is extremely practical and efficient.
Description
Technical field
The present invention relates to a kind of method that batch address information is carried out automatic administrative division division.
Background technology
In today that the technology such as internet, applications technology, software development are flourish; usually can in large quantity the data of various samples be processed; and geographic information processing is one of which; during software design and development; based on business demand, usually can run into and obtain the needs of administrative information region, China's current administrative division belonging to address according to address of theenduser; the most provincial, region, at county level, township level, at village level, group level, wherein save,
County, three grades of township are basic row administrative division.During the exploitation of various developments or internet site construction etc.; often there is a need to address is carried out the demand of administrative area division, such as governments at all levels website, urban sanitation construction, the administrative division classification of physical distribution delivery system, the geographic classification etc. of e-commerce website.
At present, existing method is searched by input address information one by one mainly by electronic chart, or by administrative area information such as the official website input province of Ministry of Civil Affairs of the People's Republic of China, cities, after consult a map and judged by naked eyes, thus obtain developer and want administrative division information, existing method to have the disadvantage in that
(1) do not support address disposably carries out city, district, street, the administrative area inquiry of four ranks in community simultaneously, need a point different filterconditions inquiry address periphery;
(2) being manually entered inefficiency, a people once can only mate one, it is impossible to accomplishes automatization's batch coupling.
The present inventor, through further investigation, proposes a kind of method that batch address information carries out automatic administrative division division.
Summary of the invention
The present invention solves the problems referred to above, provide a kind of method that batch address information is carried out automatic administrative division division, it is aimed at below provincial, comprise city, method that district, street/town, the administrative division of modal four the little ranks in community/village divide automatically, have only to address information and just can support multistage lookup, bulk lookup simultaneously as according to condition, promote the hit rate of result coupling to greatest extent, very useful and efficient.
For achieving the above object, the technical solution used in the present invention is:
A kind of method that batch address information carries out automatic administrative division division, comprises the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
Also including in described step 10: encode each administrative area, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
The examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
The key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
The resolution rules that described step 24 uses is: choose all matching results central point at map, and according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
After using such scheme, the invention has the beneficial effects as follows:
During the present invention solves all kinds of software development, Web Hosting etc., when obtaining administrative division information at different levels, the work efficiency caused because method is limited is low, hit rate is the highest, process the problems such as data volume is little, substantially increase administrative division information matches efficiency, shoot straight.
Accompanying drawing explanation
Fig. 1 is the general flow chart of the present invention.
Detailed description of the invention
In order to make the technical problem to be solved, technical scheme and beneficial effect clearer, clear, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein, only in order to explain the present invention, is not intended to limit the present invention.
A kind of method that batch address information is carried out automatic administrative division division that the present invention discloses, it comprises the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
Also including in described step 10: encode each administrative area, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
The examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
The key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
The resolution rules that described step 24 uses is: choose all matching results central point at map, and according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
Being below an application example of the division methods according to the present invention, following table is to the original address division result coming from Xiamen City.
Described above illustrate and describes the preferred embodiments of the present invention, it is to be understood that the present invention is not limited to form disclosed herein, it is not to be taken as the eliminating to other embodiments, and can be used for other combinations various, amendment and environment, and can be modified by above-mentioned teaching or the technology of association area or knowledge in invention contemplated scope herein.And the change that those skilled in the art are carried out and change are without departing from the spirit and scope of the present invention, the most all should be in the protection domain of claims of the present invention.
Claims (5)
1. the method that batch address information is carried out automatic administrative division division, it is characterised in that comprise the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
A kind of method that batch address information is carried out automatic administrative division division, it is characterized in that: described step 10 also includes: each administrative area is encoded, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
A kind of method that batch address information carries out automatic administrative division division, it is characterised in that the examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
A kind of method that batch address information is carried out automatic administrative division division, it is characterised in that: the key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
A kind of method that batch address information is carried out automatic administrative division division, it is characterized in that: the resolution rules that described step 24 uses is: choose all matching results central point at map, according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610299934.9A CN106021336A (en) | 2016-05-09 | 2016-05-09 | A method for automatic administrative district division for mass address information |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610299934.9A CN106021336A (en) | 2016-05-09 | 2016-05-09 | A method for automatic administrative district division for mass address information |
Publications (1)
Publication Number | Publication Date |
---|---|
CN106021336A true CN106021336A (en) | 2016-10-12 |
Family
ID=57099117
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610299934.9A Pending CN106021336A (en) | 2016-05-09 | 2016-05-09 | A method for automatic administrative district division for mass address information |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106021336A (en) |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106599303A (en) * | 2016-12-29 | 2017-04-26 | 苏碧云 | Address matching method and system |
CN106649803A (en) * | 2016-12-29 | 2017-05-10 | 华南师范大学 | Address matching method and system |
CN107832441A (en) * | 2017-11-17 | 2018-03-23 | 北京锐安科技有限公司 | A kind of method and device for parsing address |
CN109426415A (en) * | 2017-08-31 | 2019-03-05 | 北京国双科技有限公司 | A kind of method and device generating cascade selector |
CN110378634A (en) * | 2018-07-09 | 2019-10-25 | 北京京东尚科信息技术有限公司 | A kind of method and apparatus generating dispatching address |
CN111639493A (en) * | 2020-05-22 | 2020-09-08 | 上海微盟企业发展有限公司 | Address information standardization method, device, equipment and readable storage medium |
CN111949706A (en) * | 2020-08-03 | 2020-11-17 | 北京吉威空间信息股份有限公司 | Land big data distributed mining analysis-oriented storage method |
CN112330281A (en) * | 2020-11-05 | 2021-02-05 | 南京师范大学 | Chinese administrative division association method for leather-following data |
CN112434863A (en) * | 2020-11-30 | 2021-03-02 | 上海富勒信息科技有限公司 | Distribution scheduling method |
CN113723654A (en) * | 2020-12-31 | 2021-11-30 | 京东城市(北京)数字科技有限公司 | Disaster relief material demand assessment method and device based on multi-source data and computer equipment |
CN117271693A (en) * | 2023-10-17 | 2023-12-22 | 中运科技股份有限公司 | Automatic judging method for arrival attribution of traffic route based on big data analysis |
CN117271693B (en) * | 2023-10-17 | 2024-04-26 | 中运科技股份有限公司 | Automatic judging method for arrival attribution of traffic route based on big data analysis |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102169498A (en) * | 2011-04-14 | 2011-08-31 | 中国测绘科学研究院 | Address model constructing method and address matching method and system |
CN104281578A (en) * | 2013-07-02 | 2015-01-14 | 威盛电子股份有限公司 | Region marking method and device for data file |
CN104537102A (en) * | 2015-01-13 | 2015-04-22 | 蔡树彬 | Positive geocoding service method and system for obtaining longitude and latitude |
CN105512121A (en) * | 2014-09-23 | 2016-04-20 | 北京汇通天下物联科技有限公司 | Address query method based on keyword |
-
2016
- 2016-05-09 CN CN201610299934.9A patent/CN106021336A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102169498A (en) * | 2011-04-14 | 2011-08-31 | 中国测绘科学研究院 | Address model constructing method and address matching method and system |
CN104281578A (en) * | 2013-07-02 | 2015-01-14 | 威盛电子股份有限公司 | Region marking method and device for data file |
CN105512121A (en) * | 2014-09-23 | 2016-04-20 | 北京汇通天下物联科技有限公司 | Address query method based on keyword |
CN104537102A (en) * | 2015-01-13 | 2015-04-22 | 蔡树彬 | Positive geocoding service method and system for obtaining longitude and latitude |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106599303A (en) * | 2016-12-29 | 2017-04-26 | 苏碧云 | Address matching method and system |
CN106649803A (en) * | 2016-12-29 | 2017-05-10 | 华南师范大学 | Address matching method and system |
CN109426415A (en) * | 2017-08-31 | 2019-03-05 | 北京国双科技有限公司 | A kind of method and device generating cascade selector |
CN107832441A (en) * | 2017-11-17 | 2018-03-23 | 北京锐安科技有限公司 | A kind of method and device for parsing address |
CN110378634A (en) * | 2018-07-09 | 2019-10-25 | 北京京东尚科信息技术有限公司 | A kind of method and apparatus generating dispatching address |
CN111639493A (en) * | 2020-05-22 | 2020-09-08 | 上海微盟企业发展有限公司 | Address information standardization method, device, equipment and readable storage medium |
CN111949706A (en) * | 2020-08-03 | 2020-11-17 | 北京吉威空间信息股份有限公司 | Land big data distributed mining analysis-oriented storage method |
CN111949706B (en) * | 2020-08-03 | 2023-11-14 | 北京吉威空间信息股份有限公司 | Storage method for land big data distributed mining analysis |
CN112330281A (en) * | 2020-11-05 | 2021-02-05 | 南京师范大学 | Chinese administrative division association method for leather-following data |
CN112434863A (en) * | 2020-11-30 | 2021-03-02 | 上海富勒信息科技有限公司 | Distribution scheduling method |
CN113723654A (en) * | 2020-12-31 | 2021-11-30 | 京东城市(北京)数字科技有限公司 | Disaster relief material demand assessment method and device based on multi-source data and computer equipment |
CN117271693A (en) * | 2023-10-17 | 2023-12-22 | 中运科技股份有限公司 | Automatic judging method for arrival attribution of traffic route based on big data analysis |
CN117271693B (en) * | 2023-10-17 | 2024-04-26 | 中运科技股份有限公司 | Automatic judging method for arrival attribution of traffic route based on big data analysis |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN106021336A (en) | A method for automatic administrative district division for mass address information | |
CN109145169B (en) | Address matching method based on statistical word segmentation | |
CN101313300B (en) | Local search | |
CN108628811B (en) | Address text matching method and device | |
CN101454748B (en) | System and method for improving the information retrival to web pages | |
CN104572645B (en) | Interest point data association method and device | |
CN107145577A (en) | Address standardization method, device, storage medium and computer | |
CN101350013A (en) | Method and system for searching geographical information | |
CN102023984B (en) | Method and device for screening duplicated entity data | |
CN101299217B (en) | Method, apparatus and system for processing map information | |
CN102289467A (en) | Method and device for determining target site | |
CN106055650A (en) | Address standardization method and device | |
CN101350012A (en) | Method and system for matching address | |
CN106874384B (en) | Heterogeneous address standard conversion and matching method | |
CN101984422A (en) | Fault-tolerant text query method and equipment | |
CN104751232B (en) | Hotel's automatic matching method | |
CN102682046A (en) | Member searching and analyzing method in social network and searching system | |
US20150261786A1 (en) | Density-based dynamic geohash | |
CN102804180A (en) | Characterizing Unregistered Domain Names | |
CN107463711B (en) | Data tag matching method and device | |
CN102253972A (en) | Web crawler-based geographical name database maintenance method | |
CN106874287A (en) | A kind of processing method and processing device of point of interest POI geocodings | |
US8650024B1 (en) | Generating address term synonyms | |
CN104679801A (en) | Point of interest searching method and point of interest searching device | |
CN107766433A (en) | A kind of range query method and device based on Geo BTree |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
WD01 | Invention patent application deemed withdrawn after publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20161012 |