CN107622061A - A kind of method, apparatus and system for determining address uniqueness - Google Patents

A kind of method, apparatus and system for determining address uniqueness Download PDF

Info

Publication number
CN107622061A
CN107622061A CN201610552332.XA CN201610552332A CN107622061A CN 107622061 A CN107622061 A CN 107622061A CN 201610552332 A CN201610552332 A CN 201610552332A CN 107622061 A CN107622061 A CN 107622061A
Authority
CN
China
Prior art keywords
address
poi
latitude
text address
text
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610552332.XA
Other languages
Chinese (zh)
Inventor
邓勇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Cainiao Smart Logistics Holding Ltd
Original Assignee
Alibaba Group Holding Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba Group Holding Ltd filed Critical Alibaba Group Holding Ltd
Priority to CN201610552332.XA priority Critical patent/CN107622061A/en
Publication of CN107622061A publication Critical patent/CN107622061A/en
Pending legal-status Critical Current

Links

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Navigation (AREA)

Abstract

The embodiment of the present application provides a kind of method, apparatus and system for determining address uniqueness.Methods described includes:Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;Clustering processing is carried out to the multiple latitude and longitude coordinates;Determine whether a plurality of Text Address comprising same link information or POI is unique according to the result of the clustering processing.The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly lifts the coverage rate of Address Recognition.

Description

A kind of method, apparatus and system for determining address uniqueness
Technical field
The application is related to technical field of information processing, more particularly to it is a kind of determine address uniqueness method, apparatus and System.
Background technology
With the fast development of ecommerce, shopping online is increasingly popularized, and consumer gets used to adopting on the net already Purchase commodity.Shopping online will send commodity in client's hand with charge free dependent on logistics, logistics company site carry out logistics send with charge free when, By sending rule to be matched with pulling shipping address, so that it is determined that sending region with charge free.Obviously, region is sent in above-mentioned existing determination with charge free Mode depends on the accuracy of address.However, when actually sending with charge free, the imperfect or wrong situation of address information is frequently encountered, So existing way can not just determine corresponding to send region with charge free.
Prior art also provides a kind of improvement project, i.e., first correction process is carried out to shipping address, then by after error correction Address sends rule to be matched with pulling, so that it is determined that sending scope with charge free.However, the prior art depends on the essence of address error correction algorithm Exactness, if the address after error correction is wrong, rule is sent to be matched with pulling wrong address, its result must It is wrong.
Therefore, how more accurately and efficiently to determine whether some address is unique in same city, turns into and needs badly The technical problem that those skilled in the art solve.
The content of the invention
In view of the above problems, it is proposed that the embodiment of the present application overcomes above mentioned problem or at least in part to provide one kind The method, apparatus and system of a kind of determination address uniqueness to solve the above problems.
This application discloses a kind of method for determining address uniqueness, including:
Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link or point of interest POI;
Clustering processing is carried out to the multiple latitude and longitude coordinates;
The a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing Whether it is unique.
Accordingly, this application discloses a kind of device for determining address uniqueness, including:
Longitude and latitude acquisition module, for based on a plurality of Text Address comprising same link information or point of interest POI Obtain multiple latitude and longitude coordinates;
Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Uniqueness determining module, for according to the result of the clustering processing determine it is a plurality of comprising same link information or Whether the Text Address of POI is unique.
Disclosed herein as well is a kind of address is determined including a kind of of the device as described above for determining address uniqueness only The system of one property.
In addition, disclosed herein as well is a kind of method for determining that shipping address is reachable, including:
Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;
Clustering processing is carried out to the multiple latitude and longitude coordinates;
The a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing Whether it is unique;
If a plurality of Text Address comprising same link information or POI is unique, by more provisions This address is recorded as the address that can be sent to.
Accordingly, this application discloses a kind of device for determining that shipping address is reachable, including:
Longitude and latitude acquisition module, for based on a plurality of Text Address comprising same link information or point of interest POI Obtain multiple latitude and longitude coordinates;
Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Uniqueness determining module, for being determined described a plurality of to include same link information according to the result of the clustering processing Or whether the Text Address of POI is unique;
Logging modle, if determining described a plurality of comprising same link information or POI to believe for the uniqueness determining module The Text Address of breath is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
The specific embodiment provided according to the application, this application discloses following technique effect:
The embodiment of the present application is passed through to being got based on a plurality of Text Address comprising same link information or POI Multiple corresponding latitude and longitude coordinates carry out clustering processings, further according to the clustering processing result judge it is described it is a plurality of include it is identical Whether unique the Text Address of road information or POI is.Therefore, the embodiment of the present application can in shipping address address Information is lack of standardization, even information errors situations, still can determine whether the address is unique in city.Therefore, originally Application embodiment can be more accurate, efficiently, neatly determine whether some address is unique, Jin Er great in same city Amplitude lifts the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result, Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly- The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save Time and human cost.
Certainly, any product for implementing the application it is not absolutely required to reach all the above advantage simultaneously.
Brief description of the drawings
, below will be to institute in embodiment in order to illustrate more clearly of the embodiment of the present application or technical scheme of the prior art The accompanying drawing needed to use is briefly described, it should be apparent that, drawings in the following description are only some implementations of the application Example, for those of ordinary skill in the art, on the premise of not paying creative work, can also be obtained according to these accompanying drawings Obtain other accompanying drawings.
Fig. 1 is a kind of step flow chart of the embodiment of the method for determination address uniqueness of the application;
Fig. 2 is the step flow chart of another embodiment of the method for determining address uniqueness of the application;
Fig. 3 is a kind of step flow chart of Text Address analysis method embodiment of the application;
Fig. 4 is a kind of step flow chart of longitude and latitude clustering processing embodiment of the method for the application;
Fig. 5 is a kind of structured flowchart of the device embodiment of determination address uniqueness of the application;
Fig. 6 is the structured flowchart of another device embodiment for determining address uniqueness of the application.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out clear, complete Site preparation describes, it is clear that described embodiment is only some embodiments of the present application, rather than whole embodiments.It is based on Embodiment in the application, the every other embodiment that those of ordinary skill in the art are obtained, belong to the application protection Scope.
It is below in conjunction with the accompanying drawings and specific real to enable the above-mentioned purpose of the application, feature and advantage more obvious understandable Mode is applied to be described in further detail the application.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art, The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result, Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly- The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save Time and human cost.
Embodiment one
Reference picture 1, a kind of step flow chart of the embodiment of the method for determination address uniqueness of the application is shown, it is described Method comprises the following steps:
Step 102, based on a plurality of comprising same link information or POI (Point of Interest, point of interest) information Text Address obtains multiple latitude and longitude coordinates;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse Breath.
Step 104, clustering processing is carried out to the multiple latitude and longitude coordinates;
Explanation is needed exist for, clustering algorithm can be utilized in the embodiment of the present application to the latitude and longitude coordinates information Carry out clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density- Based Spatial Clustering of Applications with Noise), DBSCAN is a kind of based on high density connection The clustering algorithm in logical region, its purpose seek to filter density regions, find dense sample point.That is, find low The high-density region of density area separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM Cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI A plurality of Text Address in any one address be unique in city.Therefore, the embodiment of the present invention passes through clustering processing Mode determine address uniqueness, can more accurately determine some address in same city whether be it is unique, from And it can significantly lift the coverage rate of Address Recognition.
Step 106, determined according to the result of the clustering processing it is described a plurality of comprising same link information or POI Whether Text Address is unique.
That is, when the result of the clustering processing is can be polymerized to a class, then it is to connect to illustrate the road or POI Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI It is unique in city;When the result of the clustering processing could not gather for a class, then it is not connect to illustrate the road or POI Continuous;Any one Text Address in a plurality of Text Address comprising same link information or POI can be determined It is not unique in city.
For example, " Beijing Chang An Street ", Beijing only has a Chang'an street, by clustering processing, includes " Beijing Chang An Street " Multiple addresses corresponding to latitude and longitude information can only be polymerized to a class, so i.e. can determine whether that " Beijing Chang An Street " is continuous.
For another example " little Ying East Roads, Beijing ", there is two in Beijing:One in Chaoyang District, another in Haidian District, Liang Tiaolu It is separated by the distant of dozens of kilometres.According to multiple latitude and longitude informations of multiple addresses comprising little Ying East Roads, Beijing, clustering processing is carried out Afterwards, two classes are converged to, that is, cluster result is not 1.Therefore, it is possible to judge that " little Ying East Roads, Beijing " is not just continuous.
Preferably, if the embodiment of the present application to be applied to the application scenarios for determining that shipping address is reachable, in the step Following steps are can further include after rapid 106:
If a plurality of Text Address comprising same link information or POI is unique, by more provisions This address is recorded as the address that can be sent to.
So, can be sent with charge free the address with direct basis when actually sending with charge free what is wrap up etc..Preferably, can also By all address aggregations for being recorded as being sent into an address list or address base, in practical application, be able to can be sent to Address sends rule to match with pulling, so that it is determined that sending region with charge free.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address Data set.Therefore, the embodiment of the present application is passed through to being obtained based on a plurality of Text Address comprising same link information or POI The multiple corresponding latitude and longitude coordinates arrived carry out clustering processing, and then determine a plurality of bag according to the result of the clustering processing Whether unique the Text Address of information containing same link or POI is.Therefore, the embodiment of the present application can be in shipping address Middle address information is lack of standardization, even information errors situations, still can determine whether the address is unique in city.With Prior art is compared, the embodiment of the present application can more accurate, efficiently, neatly determine some address in same city whether It is unique, and then significantly lifts the coverage rate of Address Recognition.
Embodiment two
Reference picture 2, show the step flow chart of another embodiment of the method for determining address uniqueness of the application, tool Body may include steps of:
Step 202, a plurality of Text Address is obtained from Text Address storehouse;
Normal conditions, the Text Address storehouse can be stored with the Text Address of extensive quantity.Described in the embodiment of the present application A plurality of Text Address directly can be obtained from the Text Address storehouse.
Step 204, judge road information in every Text Address or POI whether in same administrative area In, if it is judged that in same administrative area, then to perform step 206;Otherwise step 208 is performed;
Preferably, in another embodiment of the application, can also enter after obtaining Text Address from Text Address storehouse One step obtains city, administrative area, road or the POI of the address, based on these information, determine whether road information or Whether POI is in same administrative area.
Step 206, it is determined that the address comprising the road information or POI is unique.
Explanation is needed exist for, if it is determined that road or POI can be determined directly in same administrative area Address corresponding with the road information or POI is unique in city.
Step 208, if it is judged that not in same administrative area, then to be believed based on a plurality of same link that includes The Text Address of breath or POI obtains corresponding latitude and longitude coordinates;
Need exist for explanation, if it is determined that road information or POI not in same administrative area, this Inventive embodiments also need to by such as previous embodiment one provide clustering processing further determine that it is corresponding to the road or POI A plurality of Text Address whether be unique in city.
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse Breath.Participle and cutting word processing can also be first carried out for every Text Address, obtains city, the administration of every Text Address Area, road information or POI;Then the text with same link information or POI for belonging to same city is selected Address;Again based on a plurality of Text Address comprising same link or POI selected, obtained from address latitude and longitude information storehouse corresponding Latitude and longitude coordinates.
Step 210, clustering processing is carried out to the multiple latitude and longitude coordinates;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big Amplitude lifts the coverage rate of Address Recognition.
Step 212, determined according to the result of the clustering processing described a plurality of comprising same link information or POI Whether Text Address is unique.
Specifically, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI It is unique;When the result of the clustering processing could not gather for a class, then illustrate that the road or POI are discontinuous;Can To determine that any one Text Address in a plurality of Text Address comprising same link information or POI is not unique 's.
For example, Beijing Chang An Street, implements through multiple administrative areas such as Chaoyang District, Dongcheng District, Xicheng District, but by the present invention After the clustering processing that example proposes, cluster result 1, this explanation Chang'an street is continuous, therefore includes the address of " Beijing Chang An Street " It is unique in Beijing.
For another example in a large amount of Text Address comprising " Hangzhou one West Road of text ", might have:
Address A:Xihu District of Hangzhou City one West Road of text;
Address B:Hangzhou Yuhang District one West Road of text;
So, can see by substantial amounts of Text Address, " literary a West Road " in the administrative area of Hangzhou at least two all In the presence of.Therefore by way of judging road in the Text Address or POI whether in same administrative area, can not judge " a literary West Road " is a road or two road in Hangzhou.And by longitude and latitude clustering processing, it can be included all The address for having " Wenyi West Road, Hangzhou " is all converted into corresponding latitude and longitude coordinates point.Then these latitude and longitude coordinates point sets are entered The processing of row density clustering.If the result of clustering processing is 1, that is, is polymerized to a class, that just illustrates these longitudes and latitudes There is a closely located transitive relation in degree coordinate points, this just forms a line continuously to extend, so as to judge " Wen Yixi Road " is continuous, and it is all unique therefore, to include multiple addresses of " literary a West Road " in Hangzhou.
Therefore, can by the embodiment of the present invention no matter the administrative area described in address is " Xihu District " or " Yuhang District " Only to need to judge whether unique the address of the road comprising " a literary West Road " or POI is in Hangzhou, if it is determined that The address of road or POI comprising " a literary West Road " is unique in Hangzhou, then need not consider administrative area information, even if Administrative information region in address, which is filled in, wrong nor affects on result of determination.Therefore, accurate group can be reached in specific send with charge free The purpose sent.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art, The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result, Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly- The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save Time and human cost.
Embodiment three
Reference picture 3, show a kind of step flow chart of Text Address analysis method embodiment of the application, methods described Specifically comprise the following steps:
Step 302, a plurality of Text Address is obtained from Text Address storehouse.
Normal conditions, the Text Address storehouse can be stored with the Text Address of extensive quantity.Described in the embodiment of the present application A plurality of Text Address directly can be obtained from the Text Address storehouse.
Step 304, carry out participle for every Text Address and cutting word is handled, obtain the city of every Text Address City, administrative area, road information or POI;
Calculated it should be noted that address participle Cooley can be based in the embodiment of the present application with existing participle and cutting word Method is handled every Text Address, obtains city, administrative area, road or the POI of every Text Address.Mesh Before, this kind of participle and cutting word algorithm have a lot, and the embodiment of the present application using which kind of algorithm to not being limited specifically.
Step 306, the road included in a plurality of Text Address or POI are counted in each administrative area in same city Distribution proportion;
Preferably, in another embodiment of the application, the road can be counted by the distribution mode of probability calculation The distribution proportion situation of road or POI in each administrative area in a city.
Step 308, if the distribution proportion is not less than predetermined threshold value, the road in the Text Address is determined whether Whether information or POI are only distributed in same administrative area, if so, then judging to include the road information or POI Text Address be unique.
If the distribution proportion is less than predetermined threshold value, judge that the Text Address is not deposited in each administrative area ;In such a case, it is possible to the Text Address is directly filtered out, but only to the remaining text after filtration treatment Address is judged.
Preferably, the predetermined threshold value can be pre-set based on experience value, such as it is a ten thousandth that can set threshold value. For example, province-city-area's information corresponding to road name in every Text Address;Count different corresponding to every road name The probability in province-city-area;The small data of probability are screened out respectively according to predetermined threshold value in each administrative region, then judge the road Whether title can be distributed in different province-city-areas, if it is not, then determining that the address comprising the road is in the city Uniquely.
Whether the embodiment of the present application can first pass through judges road information in every Text Address or POI same In individual administrative area;In the case where judged result is in same administrative area, can directly determine to include the road information Or the address of POI is unique.And in the case of being only not in same administrative area in judged result, just based on a plurality of Corresponding latitude and longitude coordinates, and then the knot for passing through clustering processing are obtained comprising the Text Address of same link information or POI Fruit judges whether unique the address comprising the road information or POI is.The embodiment of the present application is by by Text Address Analysis is combined with longitude and latitude clustering processing to determine the uniqueness of address, the needs on product and engineering can be taken into account, big Realized under scale distribution formula parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly shorten Calculate the time.
Example IV
Reference picture 4, show a kind of step flow chart of longitude and latitude clustering processing embodiment of the method for the application, the side Method specifically comprises the following steps:
Step 402, carry out participle for every Text Address and cutting word is handled, obtain the city of every Text Address City, administrative area, road information or POI;
Step 404, the Text Address with same link information or POI for belonging to same city is selected;
Step 406, based on a plurality of Text Address comprising same link information or POI selected, from address longitude and latitude Spend information bank and obtain corresponding latitude and longitude coordinates;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse Breath.
Step 408, corresponding multiple longitude and latitude mesh coordinates are calculated respectively according to the multiple latitude and longitude coordinates;
Preferably, respectively by lng=lng*1000, lat=lat*1000 mode is according to the latitude and longitude coordinates meter Calculation obtains corresponding longitude and latitude mesh coordinate.Specifically, lng=lng*1000, lat=lat*1000 are to return longitude and latitude point Tie to above one about 100 meters * 100 meters of geographic grid, regard 100 meters * 100 meters corresponding scopes as a fundamental geological Grid.Wherein, a grid is identified using the central point of grid.In units of grid, by all longitude and latitude points in grid all Sum up in the point that on this grid, clustered in grid aspect.The purpose for the arrangement is that in order to compress longitude and latitude point data.
Step 410, clustering processing is carried out to the multiple longitude and latitude mesh coordinate;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI A plurality of Text Address in any one address be unique in city.Therefore, the embodiment of the present invention passes through clustering processing Mode determine address uniqueness, can more accurately determine some address in same city whether be it is unique, from And it can significantly lift the coverage rate of Address Recognition.
Preferably, clustering processing is carried out to the longitude and latitude mesh coordinate using default minimal point and radius number; Wherein, the minimal point used and radius number of clustering can be selected based on experience value.
Step 412, according to the result of the clustering processing, determine described a plurality of to include same link information or POI Text Address it is whether unique.
That is, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI It is unique in city;When the result of the clustering processing could not gather for a class, then it is not connect to illustrate the road or POI Continuous;Any one Text Address in a plurality of Text Address comprising same link information or POI can be determined It is not unique in city.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address Data set.Therefore, the embodiment of the present application can be applied is sending the existing logistics that matches of rule to be pulled to send based on shipping address with pulling In intelligence system, then pass through multiple phases to being got based on a plurality of Text Address comprising same link information or POI The latitude and longitude coordinates answered carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link Whether unique the Text Address of information or POI is.Therefore, the embodiment of the present application can in shipping address address information Lack of standardization, even information errors situations, it still can determine whether the address is unique.Compared with prior art, this Shen Please embodiment can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly The coverage rate of degree lifting Address Recognition.
Embodiment five
Reference picture 5, show a kind of structured flowchart of the device embodiment of determination address uniqueness of the application.It is described true Determining the device 500 of address uniqueness can specifically include:Longitude and latitude acquisition module 510, clustering processing module 520, uniqueness are true Cover half block 530;Wherein,
The longitude and latitude acquisition module 510, for based on a plurality of Text Address comprising same link information or POI Obtain multiple latitude and longitude informations;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse Breath.
Preferably, in another embodiment of the application, the longitude and latitude acquisition module 510 can also be further used for Corresponding longitude and latitude is obtained from address latitude and longitude information storehouse based on a plurality of Text Address comprising same link information or POI Coordinate.
The clustering processing module 520, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big Amplitude lifts the coverage rate of Address Recognition.
Uniqueness determining module 530, for being determined a plurality of to include same link information according to the result of the clustering processing Or whether the Text Address of POI is unique.
That is, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI It is unique;When the result of the clustering processing could not gather for a class, then illustrate that the road or POI are discontinuous;Can To determine that any one Text Address in a plurality of Text Address comprising same link information or POI is not unique 's.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address Data set.Therefore, the embodiment of the present application can be applied is sending the existing logistics that matches of rule to be pulled to send based on shipping address with pulling In intelligence system, then pass through multiple phases to being got based on a plurality of Text Address comprising same link information or POI The latitude and longitude coordinates answered carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link Whether unique the Text Address of information or POI is.Therefore, the embodiment of the present application can in shipping address address information Lack of standardization, even information errors situations, it still can determine whether the address is unique.Compared with prior art, this Shen Please embodiment can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly The coverage rate of degree lifting Address Recognition.
Embodiment six
Reference picture 6, show the structured flowchart of another device embodiment for determining address uniqueness of the application.It is described Determining the device 600 of address uniqueness can specifically include:Address acquisition module 610, administrative area judge module 620, longitude and latitude Acquisition module 630, clustering processing module 640, uniqueness determining module 650;Wherein,
The address acquisition module 610, for obtaining a plurality of Text Address from Text Address storehouse.
The administrative area judge module 620, for judging that road information in every Text Address or POI be It is no in same administrative area;
Preferably, in another embodiment of the application, can also enter after obtaining Text Address from Text Address storehouse One step obtains city, administrative area, road or the POI of the address, based on these information, determines whether road or POI Whether in same administrative area.Accordingly, the administrative area judge module 620 may further include:
First acquisition unit 621, for carrying out participle and cutting word processing for every Text Address, obtain described per provision City, administrative area, road information or the POI of this address;
Calculated it should be noted that address participle Cooley can be based in the embodiment of the present application with existing participle and cutting word Method is handled the Text Address, obtains city, administrative area, road or the POI of the Text Address.At present, this Class is segmented and cutting word algorithm has a lot, and the embodiment of the present application using which kind of algorithm to not being limited specifically.
Statistic unit 622, for counting point of the road information or POI in each administrative area in same city Cloth ratio;Preferably, in another embodiment of the application, the road can be counted by the distribution mode of probability calculation Or distribution proportion situations of the POI in each administrative area in a city.
Identifying unit 623, if being not less than predetermined threshold value for the distribution proportion, determine whether the Text Address In road information or POI whether be only distributed in same administrative area, if so, then judge comprising the road information or The Text Address of POI is unique.
Preferably, the predetermined threshold value can be pre-set based on experience value, such as it is a ten thousandth that can set threshold value. For example, province-city-area's information corresponding to road name in every Text Address;Count different corresponding to every road name The probability in province-city-area;The small data of probability are screened out respectively according to predetermined threshold value in each administrative region, then judge the road Whether title can be distributed in different province-city-areas, if it is not, then determining that the address comprising the road is in the city Uniquely.
The longitude and latitude acquisition module 630, if judging for the administrative area judge module 620 in the Text Address Road or POI be not in same administrative area, then based on a plurality of Text Address comprising same link information or POI Obtain corresponding latitude and longitude coordinates.Specifically, the longitude and latitude acquisition module 630 includes:
Second acquisition unit 631, for carrying out participle and cutting word processing for every Text Address, obtain described per provision City, administrative area, road information or the POI of this address;
Unit 632 is selected, the text with same link information or POI in same city is belonged to for selecting Location;
Latitude and longitude coordinates acquiring unit 633, for based on it is described select that unit selects a plurality of include same link or POI Text Address, obtain corresponding latitude and longitude coordinates from address latitude and longitude information storehouse.
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse Breath.
The clustering processing module 640, for carrying out clustering processing to the multiple latitude and longitude coordinates;Preferably, it is described Clustering processing module 640 can specifically include:
Computing unit 641, for by lng=lng*1000, lat=lat*1000 mode to be to the multiple warp respectively Latitude coordinate carries out that corresponding longitude and latitude mesh coordinate is calculated;
Clustering processing unit 642, for using default minimal point and radius number to the longitude and latitude mesh coordinate Information carries out clustering processing.Wherein, the minimal point used and radius number of clustering can be selected based on experience value.
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big Amplitude lifts the coverage rate of Address Recognition.
Uniqueness determining module 650, for being determined a plurality of to include same link information according to the result of the clustering processing Or whether the Text Address of POI is unique.Specifically, the uniqueness determining module 650 includes:
First determining unit 651, if the result for the clustering processing module 640 is that can be polymerized to a class, Then determine that a plurality of Text Address comprising same link information or POI is unique;
Second determining unit 652, if the result for the clustering processing module 640 is that can not be polymerized to one Class, it is determined that a plurality of Text Address comprising same link information or POI is not unique.
Preferably, if the embodiment of the present application to be applied to the application scenarios for determining that shipping address is reachable, described device It can further include:
Logging modle 660, if for the uniqueness determining module 650 determine it is described it is a plurality of comprising same link information or The Text Address of POI is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
So, can be sent with charge free the address with direct basis when actually sending with charge free what is wrap up etc..Preferably, can also By all address aggregations for being recorded as being sent into an address list or address base, in practical application, be able to can be sent to Address sends rule to match with pulling, so that it is determined that sending region with charge free.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art, The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result, Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly- The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save Time and human cost.
Embodiment seven
The embodiment of the present application also provides a kind of system for determining address uniqueness, and it has institute in above-described embodiment five, six All features of the device for the determination address uniqueness stated, therefore the system of the determination address uniqueness described in the embodiment of the present application has There are all beneficial effects in above-described embodiment five, six, the embodiment of the present application will not be repeated here.
For device embodiment, because it is substantially similar to embodiment of the method, so description is fairly simple, it is related Part illustrates referring to the part of embodiment of the method.
Each embodiment in this specification is described by the way of progressive, what each embodiment stressed be with The difference of other embodiment, between each embodiment identical similar part mutually referring to.
It should be understood by those skilled in the art that, the embodiment of the embodiment of the present application can be provided as method, apparatus or calculate Machine program product.Therefore, the embodiment of the present application can use complete hardware embodiment, complete software embodiment or combine software and The form of the embodiment of hardware aspect.Moreover, the embodiment of the present application can use one or more wherein include computer can With in the computer-usable storage medium (including but is not limited to magnetic disk storage, CD@ROM, optical memory etc.) of program code The form of the computer program product of implementation.
In a typical configuration, the computer equipment includes one or more processors (CPU), input/output Interface, network interface and internal memory.Internal memory may include the volatile memory in computer-readable medium, random access memory The form such as device (RAM) and/or Nonvolatile memory, such as read-only storage (ROM) or flash memory (flash RAM).Internal memory is to calculate The example of machine computer-readable recording medium.Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be with Realize that information stores by any method or technique.Information can be computer-readable instruction, data structure, the module of program or Other data.The example of the storage medium of computer includes, but are not limited to phase transition internal memory (PRAM), static RAM (SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only storage (ROM), Electrically Erasable Read Only Memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc are read-only Memory (CD ROM), digital versatile disc (DVD) or other optical storages, magnetic cassette tape, tape magnetic rigid disk storage or Other magnetic storage apparatus or any other non-transmission medium, the information that can be accessed by a computing device available for storage.According to Herein defines, and computer-readable medium does not include the computer readable media (transitory media) of non-standing, such as The data-signal and carrier wave of modulation.
The embodiment of the present application is with reference to according to the method for the embodiment of the present application, terminal device (system) and computer program The flow chart and/or block diagram of product describes.It should be understood that can be by computer program instructions implementation process figure and/or block diagram In each flow and/or square frame and the flow in flow chart and/or block diagram and/or the combination of square frame.These can be provided Computer program instructions are set to all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing terminals Standby processor is to produce a machine so that is held by the processor of computer or other programmable data processing terminal equipments Capable instruction is produced for realizing in one flow of flow chart or multiple flows and/or one square frame of block diagram or multiple square frames The device for the function of specifying.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing terminal equipments In the computer-readable memory to work in a specific way so that the instruction being stored in the computer-readable memory produces bag The manufacture of command device is included, the command device is realized in one flow of flow chart or multiple flows and/or one side of block diagram The function of being specified in frame or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing terminal equipments so that Series of operation steps is performed on computer or other programmable terminal equipments to produce computer implemented processing, so that The instruction performed on computer or other programmable terminal equipments is provided for realizing in one flow of flow chart or multiple flows And/or specified in one square frame of block diagram or multiple square frames function the step of.
Although having been described for the preferred embodiment of the embodiment of the present application, those skilled in the art once know base This creative concept, then other change and modification can be made to these embodiments.So appended claims are intended to be construed to Including preferred embodiment and fall into having altered and changing for the embodiment of the present application scope.
Finally, it is to be noted that, herein, such as first and second or the like relational terms be used merely to by One entity or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or operation Between any this actual relation or order be present.Moreover, term " comprising ", "comprising" or its any other variant meaning Covering including for nonexcludability, so that process, method, article or terminal device including a series of elements are not only wrapped Those key elements, but also the other element including being not expressly set out are included, or is also included for this process, method, article Or the key element that terminal device is intrinsic.In the absence of more restrictions, wanted by what sentence "including a ..." limited Element, it is not excluded that other identical element in the process including the key element, method, article or terminal device also be present.
Above to the method, apparatus and system of a kind of determination address uniqueness provided herein, detailed Jie has been carried out Continue, specific case used herein is set forth to the principle and embodiment of the application, and the explanation of above example is only It is to be used to help understand the present processes and its core concept;Meanwhile for those of ordinary skill in the art, according to this Shen Thought please, can there is change part in specific embodiments and applications, in summary, this specification content should not manage Solve as the limitation to the application.

Claims (17)

  1. A kind of 1. method for determining address uniqueness, it is characterised in that including:
    Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;
    Clustering processing is carried out to the multiple latitude and longitude coordinates;
    Whether a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing It is unique.
  2. 2. according to the method for claim 1, it is characterised in that the result according to the clustering processing determines described more Bar includes whether same link information or the Text Address of POI are unique steps, including:
    If the result of the clustering processing is can be polymerized to a class, it is determined that described a plurality of to include same link information or POI The Text Address of information is unique;
    If the result of the clustering processing is can not be polymerized to a class, it is determined that it is described it is a plurality of comprising same link information or The Text Address of POI is not unique.
  3. 3. according to the method for claim 1, it is characterised in that described to include same link information or point of interest based on a plurality of Before the Text Address of POI obtains the step of multiple latitude and longitude coordinates, in addition to:
    A plurality of Text Address is obtained from Text Address storehouse.
  4. 4. according to the method for claim 3, it is characterised in that obtain a plurality of Text Address in the storehouse from Text Address After step, in addition to:
    Judge road information in every Text Address or POI whether in same administrative area;
    If it is judged that for not in same administrative area, then a plurality of same link information or POI are included based on described Text Address obtain corresponding latitude and longitude coordinates;
    If it is judged that in same administrative area, it is determined that the address comprising the road information or POI is only One.
  5. 5. according to the method for claim 4, it is characterised in that the road information judged in every Text Address Or POI whether the step in same administrative area, including:
    Participle and cutting word processing are carried out for every Text Address, obtains the city, administrative area, road of every Text Address Information or POI;
    Count the distribution proportion of the road information or POI in each administrative area in same city;
    If the distribution proportion is not less than predetermined threshold value, road information or POI letters in the Text Address are determined whether Whether breath is only distributed in same administrative area, if so, then judging that the Text Address comprising the road information or POI is Uniquely.
  6. 6. according to the method for claim 1, it is characterised in that described to include same link information or point of interest based on a plurality of The Text Address of POI obtains the step of multiple latitude and longitude coordinates, including:
    Participle and cutting word processing are carried out for every Text Address, obtains the city, administrative area, road of every Text Address Information or POI;
    Select the Text Address with same link information or POI for belonging to same city;
    Based on a plurality of Text Address comprising same link or POI selected, corresponding warp is obtained from address latitude and longitude information storehouse Latitude coordinate.
  7. 7. according to the method for claim 1, it is characterised in that described that clustering processing is carried out to the multiple latitude and longitude coordinates The step of include:
    The multiple latitude and longitude coordinates are calculated by lng=lng*1000, lat=lat*1000 mode respectively Corresponding longitude and latitude mesh coordinate;
    Clustering processing is carried out to the longitude and latitude mesh coordinate information using default minimal point and radius number.
  8. A kind of 8. method for determining that shipping address is reachable, it is characterised in that including:
    Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;
    Clustering processing is carried out to the multiple latitude and longitude coordinates;
    Whether a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing It is unique;
    If a plurality of Text Address comprising same link information or POI is unique, by a plurality of text Location is recorded as the address that can be sent to.
  9. A kind of 9. device for determining address uniqueness, it is characterised in that including:
    Longitude and latitude acquisition module, for being obtained based on a plurality of Text Address comprising same link information or point of interest POI Multiple latitude and longitude coordinates;
    Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
    Uniqueness determining module, for determining a plurality of comprising same link information or POI to believe according to the result of the clustering processing Whether the Text Address of breath is unique.
  10. 10. device according to claim 9, it is characterised in that the uniqueness determining module includes:
    First determining unit, if the result for the clustering processing module is that can be polymerized to a class, it is determined that described The a plurality of Text Address comprising same link information or POI is unique;
    Second determining unit, if the result for the clustering processing module is that can not be polymerized to a class, it is determined that institute It is not unique to state a plurality of Text Address comprising same link information or POI.
  11. 11. device according to claim 9, it is characterised in that described device also includes:
    Address acquisition module, for obtaining a plurality of Text Address from Text Address storehouse.
  12. 12. device according to claim 11, it is characterised in that described device also includes:
    Administrative area judge module, for judging road information in every Text Address or POI whether same In administrative area;Accordingly
    The longitude and latitude acquisition module, if the judged result for the administrative area judge module is not in same administrative area In, then corresponding latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or POI;
    The uniqueness determining module, if the judged result for being additionally operable to the administrative area judge module is in same administrative area In, it is determined that the address comprising the road information or POI is unique.
  13. 13. device according to claim 12, it is characterised in that the administrative area judge module includes:
    First acquisition unit, for carrying out participle and cutting word processing for every Text Address, obtain every Text Address City, administrative area, road information or POI;
    Statistic unit, for counting the distribution proportion of the road information or POI in each administrative area in same city;
    Identifying unit, if being not less than predetermined threshold value for the distribution proportion, determine whether the road in the Text Address Whether road information or POI are only distributed in same administrative area, if so, then judging to believe comprising the road information or POI The Text Address of breath is unique.
  14. 14. device according to claim 9, it is characterised in that the longitude and latitude acquisition module includes:
    Second acquisition unit, for carrying out participle and cutting word processing for every Text Address, obtain every Text Address City, administrative area, road information or POI;
    Unit is selected, the Text Address with same link information or POI in same city is belonged to for selecting;
    Latitude and longitude coordinates acquiring unit, for based on a plurality of text comprising same link or POI selected unit and selected Address, corresponding latitude and longitude coordinates are obtained from address latitude and longitude information storehouse.
  15. 15. device according to claim 9, it is characterised in that the clustering processing module includes:
    Computing unit, for being sat respectively by lng=lng*1000, lat=lat*1000 mode to the multiple longitude and latitude Mark carries out that corresponding longitude and latitude mesh coordinate is calculated;
    Clustering processing unit, for being carried out using default minimal point and radius number to the longitude and latitude mesh coordinate information Clustering processing.
  16. A kind of 16. device for determining that shipping address is reachable, it is characterised in that including:
    Longitude and latitude acquisition module, for being obtained based on a plurality of Text Address comprising same link information or point of interest POI Multiple latitude and longitude coordinates;
    Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
    Uniqueness determining module, for according to the result of the clustering processing determine it is described it is a plurality of comprising same link information or Whether the Text Address of POI is unique;
    Logging modle, if being determined for the uniqueness determining module described a plurality of comprising same link information or POI Text Address is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
  17. 17. it is a kind of determine address uniqueness system, it is characterised in that including according to claim any one of 9-15 really Determine the device of address uniqueness.
CN201610552332.XA 2016-07-13 2016-07-13 A kind of method, apparatus and system for determining address uniqueness Pending CN107622061A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610552332.XA CN107622061A (en) 2016-07-13 2016-07-13 A kind of method, apparatus and system for determining address uniqueness

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610552332.XA CN107622061A (en) 2016-07-13 2016-07-13 A kind of method, apparatus and system for determining address uniqueness

Publications (1)

Publication Number Publication Date
CN107622061A true CN107622061A (en) 2018-01-23

Family

ID=61087314

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610552332.XA Pending CN107622061A (en) 2016-07-13 2016-07-13 A kind of method, apparatus and system for determining address uniqueness

Country Status (1)

Country Link
CN (1) CN107622061A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376761A (en) * 2018-09-12 2019-02-22 北京三快在线科技有限公司 The method for digging and device of a kind of address mark and its longitude and latitude
CN109635063A (en) * 2018-12-06 2019-04-16 拉扎斯网络科技(上海)有限公司 Information processing method, device, electronic equipment and the storage medium of address base
CN109992638A (en) * 2019-03-29 2019-07-09 北京三快在线科技有限公司 Generation method, device, electronic equipment and the storage medium of geographical location POI
CN111506675A (en) * 2019-01-11 2020-08-07 阿里巴巴集团控股有限公司 Method, apparatus, device and medium for determining points of interest
CN111723165A (en) * 2019-03-18 2020-09-29 阿里巴巴集团控股有限公司 Address interest point determining method, device and system
CN112016326A (en) * 2020-09-25 2020-12-01 北京百度网讯科技有限公司 Map area word recognition method and device, electronic equipment and storage medium
CN113076746A (en) * 2020-01-06 2021-07-06 阿里巴巴集团控股有限公司 Data processing method and system, storage medium and computing device
CN116541474A (en) * 2023-07-05 2023-08-04 平安银行股份有限公司 Object acquisition method, device, electronic equipment and storage medium
CN113076746B (en) * 2020-01-06 2024-05-31 阿里巴巴集团控股有限公司 Data processing method and system, storage medium and computing device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102867004A (en) * 2011-07-06 2013-01-09 高德软件有限公司 Method and system for matching addresses
CN103023997A (en) * 2012-11-29 2013-04-03 江苏鸿信系统集成有限公司 Massive geographical information name and address conversion method and device based on grid caching technology
CN103810194A (en) * 2012-11-11 2014-05-21 刘龙 Geographic coding method, position inquiring system and position inquiring method
CN103853725A (en) * 2012-11-29 2014-06-11 深圳先进技术研究院 Traffic track data noise reduction method and system
CN104484790A (en) * 2014-12-26 2015-04-01 清华大学深圳研究生院 Address match method and device of logistics business
CN105509743A (en) * 2015-11-24 2016-04-20 上海汽车集团股份有限公司 A positioning processing method, a business platform and a network system
CN105718465A (en) * 2014-12-02 2016-06-29 阿里巴巴集团控股有限公司 Geofence generation method and device

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102867004A (en) * 2011-07-06 2013-01-09 高德软件有限公司 Method and system for matching addresses
CN103810194A (en) * 2012-11-11 2014-05-21 刘龙 Geographic coding method, position inquiring system and position inquiring method
CN103023997A (en) * 2012-11-29 2013-04-03 江苏鸿信系统集成有限公司 Massive geographical information name and address conversion method and device based on grid caching technology
CN103853725A (en) * 2012-11-29 2014-06-11 深圳先进技术研究院 Traffic track data noise reduction method and system
CN105718465A (en) * 2014-12-02 2016-06-29 阿里巴巴集团控股有限公司 Geofence generation method and device
CN104484790A (en) * 2014-12-26 2015-04-01 清华大学深圳研究生院 Address match method and device of logistics business
CN105509743A (en) * 2015-11-24 2016-04-20 上海汽车集团股份有限公司 A positioning processing method, a business platform and a network system

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109376761A (en) * 2018-09-12 2019-02-22 北京三快在线科技有限公司 The method for digging and device of a kind of address mark and its longitude and latitude
CN109376761B (en) * 2018-09-12 2021-01-22 北京三快在线科技有限公司 Address identification and longitude and latitude mining method and device thereof
CN109635063A (en) * 2018-12-06 2019-04-16 拉扎斯网络科技(上海)有限公司 Information processing method, device, electronic equipment and the storage medium of address base
CN111506675A (en) * 2019-01-11 2020-08-07 阿里巴巴集团控股有限公司 Method, apparatus, device and medium for determining points of interest
CN111723165A (en) * 2019-03-18 2020-09-29 阿里巴巴集团控股有限公司 Address interest point determining method, device and system
CN109992638A (en) * 2019-03-29 2019-07-09 北京三快在线科技有限公司 Generation method, device, electronic equipment and the storage medium of geographical location POI
CN109992638B (en) * 2019-03-29 2020-11-20 北京三快在线科技有限公司 Method and device for generating geographical position POI, electronic equipment and storage medium
CN113076746A (en) * 2020-01-06 2021-07-06 阿里巴巴集团控股有限公司 Data processing method and system, storage medium and computing device
CN113076746B (en) * 2020-01-06 2024-05-31 阿里巴巴集团控股有限公司 Data processing method and system, storage medium and computing device
CN112016326A (en) * 2020-09-25 2020-12-01 北京百度网讯科技有限公司 Map area word recognition method and device, electronic equipment and storage medium
CN116541474A (en) * 2023-07-05 2023-08-04 平安银行股份有限公司 Object acquisition method, device, electronic equipment and storage medium
CN116541474B (en) * 2023-07-05 2024-02-02 平安银行股份有限公司 Object acquisition method, device, electronic equipment and storage medium

Similar Documents

Publication Publication Date Title
CN107622061A (en) A kind of method, apparatus and system for determining address uniqueness
JP6321681B2 (en) Method and apparatus for identifying website users
Zheng et al. Detecting collective anomalies from multiple spatio-temporal datasets across different domains
CN104050196B (en) A kind of interest point data redundant detecting method and device
CN106201886B (en) A kind of Proxy Method and device of the verifying of real time data task
CN106156965B (en) Logistics service scheduling method and equipment
CN103902622B (en) Mass moving target aggregation method and device
CN104112284B (en) The similarity detection method and equipment of a kind of picture
CN109117433B (en) Index tree object creation and index method and related device thereof
CN108197873A (en) Warehouse article goods sorting method, device, computer equipment and storage medium
CN107783734A (en) A kind of resource allocation methods, device and terminal based on super fusion storage system
CN110458598A (en) Scene adaptation method, device and electronic equipment
CN106033510A (en) Method and system for identifying user equipment
CN106897342A (en) A kind of data verification method and equipment
CN108008936A (en) A kind of data processing method, device and electronic equipment
CN108664583A (en) A kind of index tree method for building up and image search method
CN105898835B (en) Generate the method and apparatus of the access point attribute information of WAP
Yin et al. A deep learning approach for rooftop geocoding
CN107395680A (en) Shop group's information push and output intent and device, equipment
CN106708648B (en) A kind of the storage method of calibration and system of text data
US9390105B2 (en) System and methods for storing and analyzing geographically-referenced data
Duan et al. Comprehending and analyzing multiday trip-chaining patterns of freight vehicles using a multiscale method with prolonged trajectory data
CN108932525A (en) A kind of behavior prediction method and device
US10903684B2 (en) Method for operating a network having multiple node devices, and network
CN116703132A (en) Management method and device for dynamic scheduling of shared vehicles and computer equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20180418

Address after: Four story 847 mailbox of the capital mansion of Cayman Islands, Cayman Islands, Cayman

Applicant after: CAINIAO SMART LOGISTICS HOLDING Ltd.

Address before: Cayman Islands Grand Cayman capital building a four storey No. 847 mailbox

Applicant before: ALIBABA GROUP HOLDING Ltd.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20180123

RJ01 Rejection of invention patent application after publication