CN106021336A - A method for automatic administrative district division for mass address information - Google Patents

A method for automatic administrative district division for mass address information Download PDF

Info

Publication number
CN106021336A
CN106021336A CN201610299934.9A CN201610299934A CN106021336A CN 106021336 A CN106021336 A CN 106021336A CN 201610299934 A CN201610299934 A CN 201610299934A CN 106021336 A CN106021336 A CN 106021336A
Authority
CN
China
Prior art keywords
administrative
information
division
community
address information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610299934.9A
Other languages
Chinese (zh)
Inventor
钟昌贤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Sifang Zhongxin Technology Co Ltd
Original Assignee
Xiamen Sifang Zhongxin Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Sifang Zhongxin Technology Co Ltd filed Critical Xiamen Sifang Zhongxin Technology Co Ltd
Priority to CN201610299934.9A priority Critical patent/CN106021336A/en
Publication of CN106021336A publication Critical patent/CN106021336A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9537Spatial or temporal dependent retrieval, e.g. spatiotemporal queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/22Matching criteria, e.g. proximity measures

Abstract

The invention provides a method for automatic administrative district division for mass address information. The method comprises the steps of: 10, a preparation stage, 20, a start stage, and 30, a completion stage. The preparation stage includes the steps of obtaining the names of administrative districts and storing the names of the administrative districts, the tree form correlation and codes into a database. The start stage includes the steps of 21, acquiring mass original address information and optimizing original addresses, the optimization including screening and missing inspection; 22, invoking a map API to obtain the longitude and latitude information of each original address; 23, invoking the map API and employing a map search function; 24, acquiring matching results according to the tree form correlation, the matching results being the names of administrative district of all levels correlating with the addresses. The completion stage includes the steps of 31, storing matching-successful results in a database; 32, outputting a log for matching-failing results; 22, counting the number of matching and calculating the hit rate. The method supports multi-level simultaneous search and mass search only based on address information, can furthest increase the hit rate of result matching and is extremely practical and efficient.

Description

A kind of method that batch address information is carried out automatic administrative division division
Technical field
The present invention relates to a kind of method that batch address information is carried out automatic administrative division division.
Background technology
In today that the technology such as internet, applications technology, software development are flourish; usually can in large quantity the data of various samples be processed; and geographic information processing is one of which; during software design and development; based on business demand, usually can run into and obtain the needs of administrative information region, China's current administrative division belonging to address according to address of theenduser; the most provincial, region, at county level, township level, at village level, group level, wherein save, County, three grades of township are basic row administrative division.During the exploitation of various developments or internet site construction etc.; often there is a need to address is carried out the demand of administrative area division, such as governments at all levels website, urban sanitation construction, the administrative division classification of physical distribution delivery system, the geographic classification etc. of e-commerce website.
At present, existing method is searched by input address information one by one mainly by electronic chart, or by administrative area information such as the official website input province of Ministry of Civil Affairs of the People's Republic of China, cities, after consult a map and judged by naked eyes, thus obtain developer and want administrative division information, existing method to have the disadvantage in that
(1) do not support address disposably carries out city, district, street, the administrative area inquiry of four ranks in community simultaneously, need a point different filterconditions inquiry address periphery;
(2) being manually entered inefficiency, a people once can only mate one, it is impossible to accomplishes automatization's batch coupling.
The present inventor, through further investigation, proposes a kind of method that batch address information carries out automatic administrative division division.
Summary of the invention
The present invention solves the problems referred to above, provide a kind of method that batch address information is carried out automatic administrative division division, it is aimed at below provincial, comprise city, method that district, street/town, the administrative division of modal four the little ranks in community/village divide automatically, have only to address information and just can support multistage lookup, bulk lookup simultaneously as according to condition, promote the hit rate of result coupling to greatest extent, very useful and efficient.
For achieving the above object, the technical solution used in the present invention is:
A kind of method that batch address information carries out automatic administrative division division, comprises the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
Also including in described step 10: encode each administrative area, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
The examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
The key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
The resolution rules that described step 24 uses is: choose all matching results central point at map, and according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
After using such scheme, the invention has the beneficial effects as follows:
During the present invention solves all kinds of software development, Web Hosting etc., when obtaining administrative division information at different levels, the work efficiency caused because method is limited is low, hit rate is the highest, process the problems such as data volume is little, substantially increase administrative division information matches efficiency, shoot straight.
Accompanying drawing explanation
Fig. 1 is the general flow chart of the present invention.
Detailed description of the invention
In order to make the technical problem to be solved, technical scheme and beneficial effect clearer, clear, below in conjunction with drawings and Examples, the present invention is further elaborated.Should be appreciated that specific embodiment described herein, only in order to explain the present invention, is not intended to limit the present invention.
A kind of method that batch address information is carried out automatic administrative division division that the present invention discloses, it comprises the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
Also including in described step 10: encode each administrative area, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
The examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
The key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
The resolution rules that described step 24 uses is: choose all matching results central point at map, and according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
Being below an application example of the division methods according to the present invention, following table is to the original address division result coming from Xiamen City.
Described above illustrate and describes the preferred embodiments of the present invention, it is to be understood that the present invention is not limited to form disclosed herein, it is not to be taken as the eliminating to other embodiments, and can be used for other combinations various, amendment and environment, and can be modified by above-mentioned teaching or the technology of association area or knowledge in invention contemplated scope herein.And the change that those skilled in the art are carried out and change are without departing from the spirit and scope of the present invention, the most all should be in the protection domain of claims of the present invention.

Claims (5)

1. the method that batch address information is carried out automatic administrative division division, it is characterised in that comprise the following steps:
10. the preparatory stage: obtain each administrative area title, according to city > district > street/town > relationship between superior and subordinate in community/village sets up tree-like association, correlation rule be higher level be one-to-many to subordinate, each administrative area title, tree-like association and coding are stored in data base;
20. incipient stages, comprise the following steps:
21. obtain batch original address information, and each original address is optimized, and optimizes and includes examination and missing inspection;
22. invocation map API obtain the latitude and longitude information of each original address, if obtaining latitude and longitude information failure, then terminate;
23. invocation map API, use map search function, search for " latitude and longitude information+key word ", obtain the community/at village level administrative information region mating this latitude and longitude information+key word, if searching for unsuccessfully, and the next key word of switching;
24. intercept community/at village level administrative information region that the match is successful, parse the title in community/village, each administrative area title of the community parsed/village's title Yu database purchase is carried out Upward match step by step, obtaining matching result according to tree-like association, this matching result is all rank administrative areas title with the names associate in this community/village;
30. ending phase, comprise the following steps:
The result that the match is successful is stored in data base by 31.;
32. by the result output journal that it fails to match;
33. statistical match quantity also calculate hit rate.
A kind of method that batch address information is carried out automatic administrative division division, it is characterized in that: described step 10 also includes: each administrative area is encoded, giving one, an administrative area pid value, this pid value is the coding of the upper level in this administrative area.
A kind of method that batch address information carries out automatic administrative division division, it is characterised in that the examination of described step 21 with missing inspection address process is: the special string inside filtered addresses information, supplements complete city-level information.
A kind of method that batch address information is carried out automatic administrative division division, it is characterised in that: the key word of described step 23 can be " community ", " neighbourhood committee ", " village " or " farm ".
A kind of method that batch address information is carried out automatic administrative division division, it is characterized in that: the resolution rules that described step 24 uses is: choose all matching results central point at map, according to the distance ascending order arrangement of each match address with central point, the name of the coordinate points that chosen distance is nearest is referred to as analysis result.
CN201610299934.9A 2016-05-09 2016-05-09 A method for automatic administrative district division for mass address information Pending CN106021336A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610299934.9A CN106021336A (en) 2016-05-09 2016-05-09 A method for automatic administrative district division for mass address information

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610299934.9A CN106021336A (en) 2016-05-09 2016-05-09 A method for automatic administrative district division for mass address information

Publications (1)

Publication Number Publication Date
CN106021336A true CN106021336A (en) 2016-10-12

Family

ID=57099117

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610299934.9A Pending CN106021336A (en) 2016-05-09 2016-05-09 A method for automatic administrative district division for mass address information

Country Status (1)

Country Link
CN (1) CN106021336A (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106599303A (en) * 2016-12-29 2017-04-26 苏碧云 Address matching method and system
CN106649803A (en) * 2016-12-29 2017-05-10 华南师范大学 Address matching method and system
CN107832441A (en) * 2017-11-17 2018-03-23 北京锐安科技有限公司 A kind of method and device for parsing address
CN109426415A (en) * 2017-08-31 2019-03-05 北京国双科技有限公司 A kind of method and device generating cascade selector
CN110378634A (en) * 2018-07-09 2019-10-25 北京京东尚科信息技术有限公司 A kind of method and apparatus generating dispatching address
CN111639493A (en) * 2020-05-22 2020-09-08 上海微盟企业发展有限公司 Address information standardization method, device, equipment and readable storage medium
CN111949706A (en) * 2020-08-03 2020-11-17 北京吉威空间信息股份有限公司 Land big data distributed mining analysis-oriented storage method
CN112330281A (en) * 2020-11-05 2021-02-05 南京师范大学 Chinese administrative division association method for leather-following data
CN112434863A (en) * 2020-11-30 2021-03-02 上海富勒信息科技有限公司 Distribution scheduling method
CN113723654A (en) * 2020-12-31 2021-11-30 京东城市(北京)数字科技有限公司 Disaster relief material demand assessment method and device based on multi-source data and computer equipment
CN117271693A (en) * 2023-10-17 2023-12-22 中运科技股份有限公司 Automatic judging method for arrival attribution of traffic route based on big data analysis
CN117271693B (en) * 2023-10-17 2024-04-26 中运科技股份有限公司 Automatic judging method for arrival attribution of traffic route based on big data analysis

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN104281578A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Region marking method and device for data file
CN104537102A (en) * 2015-01-13 2015-04-22 蔡树彬 Positive geocoding service method and system for obtaining longitude and latitude
CN105512121A (en) * 2014-09-23 2016-04-20 北京汇通天下物联科技有限公司 Address query method based on keyword

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102169498A (en) * 2011-04-14 2011-08-31 中国测绘科学研究院 Address model constructing method and address matching method and system
CN104281578A (en) * 2013-07-02 2015-01-14 威盛电子股份有限公司 Region marking method and device for data file
CN105512121A (en) * 2014-09-23 2016-04-20 北京汇通天下物联科技有限公司 Address query method based on keyword
CN104537102A (en) * 2015-01-13 2015-04-22 蔡树彬 Positive geocoding service method and system for obtaining longitude and latitude

Cited By (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106599303A (en) * 2016-12-29 2017-04-26 苏碧云 Address matching method and system
CN106649803A (en) * 2016-12-29 2017-05-10 华南师范大学 Address matching method and system
CN109426415A (en) * 2017-08-31 2019-03-05 北京国双科技有限公司 A kind of method and device generating cascade selector
CN107832441A (en) * 2017-11-17 2018-03-23 北京锐安科技有限公司 A kind of method and device for parsing address
CN110378634A (en) * 2018-07-09 2019-10-25 北京京东尚科信息技术有限公司 A kind of method and apparatus generating dispatching address
CN111639493A (en) * 2020-05-22 2020-09-08 上海微盟企业发展有限公司 Address information standardization method, device, equipment and readable storage medium
CN111949706A (en) * 2020-08-03 2020-11-17 北京吉威空间信息股份有限公司 Land big data distributed mining analysis-oriented storage method
CN111949706B (en) * 2020-08-03 2023-11-14 北京吉威空间信息股份有限公司 Storage method for land big data distributed mining analysis
CN112330281A (en) * 2020-11-05 2021-02-05 南京师范大学 Chinese administrative division association method for leather-following data
CN112434863A (en) * 2020-11-30 2021-03-02 上海富勒信息科技有限公司 Distribution scheduling method
CN113723654A (en) * 2020-12-31 2021-11-30 京东城市(北京)数字科技有限公司 Disaster relief material demand assessment method and device based on multi-source data and computer equipment
CN117271693A (en) * 2023-10-17 2023-12-22 中运科技股份有限公司 Automatic judging method for arrival attribution of traffic route based on big data analysis
CN117271693B (en) * 2023-10-17 2024-04-26 中运科技股份有限公司 Automatic judging method for arrival attribution of traffic route based on big data analysis

Similar Documents

Publication Publication Date Title
CN106021336A (en) A method for automatic administrative district division for mass address information
CN109145169B (en) Address matching method based on statistical word segmentation
CN101313300B (en) Local search
CN108628811B (en) Address text matching method and device
CN101454748B (en) System and method for improving the information retrival to web pages
CN104572645B (en) Interest point data association method and device
CN107145577A (en) Address standardization method, device, storage medium and computer
CN101350013A (en) Method and system for searching geographical information
CN102023984B (en) Method and device for screening duplicated entity data
CN101299217B (en) Method, apparatus and system for processing map information
CN102289467A (en) Method and device for determining target site
CN106055650A (en) Address standardization method and device
CN101350012A (en) Method and system for matching address
CN106874384B (en) Heterogeneous address standard conversion and matching method
CN101984422A (en) Fault-tolerant text query method and equipment
CN104751232B (en) Hotel's automatic matching method
CN102682046A (en) Member searching and analyzing method in social network and searching system
US20150261786A1 (en) Density-based dynamic geohash
CN102804180A (en) Characterizing Unregistered Domain Names
CN107463711B (en) Data tag matching method and device
CN102253972A (en) Web crawler-based geographical name database maintenance method
CN106874287A (en) A kind of processing method and processing device of point of interest POI geocodings
US8650024B1 (en) Generating address term synonyms
CN104679801A (en) Point of interest searching method and point of interest searching device
CN107766433A (en) A kind of range query method and device based on Geo BTree

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20161012