CN107622061A - A kind of method, apparatus and system for determining address uniqueness - Google Patents
A kind of method, apparatus and system for determining address uniqueness Download PDFInfo
- Publication number
- CN107622061A CN107622061A CN201610552332.XA CN201610552332A CN107622061A CN 107622061 A CN107622061 A CN 107622061A CN 201610552332 A CN201610552332 A CN 201610552332A CN 107622061 A CN107622061 A CN 107622061A
- Authority
- CN
- China
- Prior art keywords
- address
- poi
- latitude
- text address
- text
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Navigation (AREA)
Abstract
The embodiment of the present application provides a kind of method, apparatus and system for determining address uniqueness.Methods described includes:Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;Clustering processing is carried out to the multiple latitude and longitude coordinates;Determine whether a plurality of Text Address comprising same link information or POI is unique according to the result of the clustering processing.The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly lifts the coverage rate of Address Recognition.
Description
Technical field
The application is related to technical field of information processing, more particularly to it is a kind of determine address uniqueness method, apparatus and
System.
Background technology
With the fast development of ecommerce, shopping online is increasingly popularized, and consumer gets used to adopting on the net already
Purchase commodity.Shopping online will send commodity in client's hand with charge free dependent on logistics, logistics company site carry out logistics send with charge free when,
By sending rule to be matched with pulling shipping address, so that it is determined that sending region with charge free.Obviously, region is sent in above-mentioned existing determination with charge free
Mode depends on the accuracy of address.However, when actually sending with charge free, the imperfect or wrong situation of address information is frequently encountered,
So existing way can not just determine corresponding to send region with charge free.
Prior art also provides a kind of improvement project, i.e., first correction process is carried out to shipping address, then by after error correction
Address sends rule to be matched with pulling, so that it is determined that sending scope with charge free.However, the prior art depends on the essence of address error correction algorithm
Exactness, if the address after error correction is wrong, rule is sent to be matched with pulling wrong address, its result must
It is wrong.
Therefore, how more accurately and efficiently to determine whether some address is unique in same city, turns into and needs badly
The technical problem that those skilled in the art solve.
The content of the invention
In view of the above problems, it is proposed that the embodiment of the present application overcomes above mentioned problem or at least in part to provide one kind
The method, apparatus and system of a kind of determination address uniqueness to solve the above problems.
This application discloses a kind of method for determining address uniqueness, including:
Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link or point of interest POI;
Clustering processing is carried out to the multiple latitude and longitude coordinates;
The a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing
Whether it is unique.
Accordingly, this application discloses a kind of device for determining address uniqueness, including:
Longitude and latitude acquisition module, for based on a plurality of Text Address comprising same link information or point of interest POI
Obtain multiple latitude and longitude coordinates;
Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Uniqueness determining module, for according to the result of the clustering processing determine it is a plurality of comprising same link information or
Whether the Text Address of POI is unique.
Disclosed herein as well is a kind of address is determined including a kind of of the device as described above for determining address uniqueness only
The system of one property.
In addition, disclosed herein as well is a kind of method for determining that shipping address is reachable, including:
Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;
Clustering processing is carried out to the multiple latitude and longitude coordinates;
The a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing
Whether it is unique;
If a plurality of Text Address comprising same link information or POI is unique, by more provisions
This address is recorded as the address that can be sent to.
Accordingly, this application discloses a kind of device for determining that shipping address is reachable, including:
Longitude and latitude acquisition module, for based on a plurality of Text Address comprising same link information or point of interest POI
Obtain multiple latitude and longitude coordinates;
Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Uniqueness determining module, for being determined described a plurality of to include same link information according to the result of the clustering processing
Or whether the Text Address of POI is unique;
Logging modle, if determining described a plurality of comprising same link information or POI to believe for the uniqueness determining module
The Text Address of breath is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
The specific embodiment provided according to the application, this application discloses following technique effect:
The embodiment of the present application is passed through to being got based on a plurality of Text Address comprising same link information or POI
Multiple corresponding latitude and longitude coordinates carry out clustering processings, further according to the clustering processing result judge it is described it is a plurality of include it is identical
Whether unique the Text Address of road information or POI is.Therefore, the embodiment of the present application can in shipping address address
Information is lack of standardization, even information errors situations, still can determine whether the address is unique in city.Therefore, originally
Application embodiment can be more accurate, efficiently, neatly determine whether some address is unique, Jin Er great in same city
Amplitude lifts the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address
Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute
It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result,
Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly-
The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through
Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering
Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly
The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application
Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save
Time and human cost.
Certainly, any product for implementing the application it is not absolutely required to reach all the above advantage simultaneously.
Brief description of the drawings
, below will be to institute in embodiment in order to illustrate more clearly of the embodiment of the present application or technical scheme of the prior art
The accompanying drawing needed to use is briefly described, it should be apparent that, drawings in the following description are only some implementations of the application
Example, for those of ordinary skill in the art, on the premise of not paying creative work, can also be obtained according to these accompanying drawings
Obtain other accompanying drawings.
Fig. 1 is a kind of step flow chart of the embodiment of the method for determination address uniqueness of the application;
Fig. 2 is the step flow chart of another embodiment of the method for determining address uniqueness of the application;
Fig. 3 is a kind of step flow chart of Text Address analysis method embodiment of the application;
Fig. 4 is a kind of step flow chart of longitude and latitude clustering processing embodiment of the method for the application;
Fig. 5 is a kind of structured flowchart of the device embodiment of determination address uniqueness of the application;
Fig. 6 is the structured flowchart of another device embodiment for determining address uniqueness of the application.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present application, the technical scheme in the embodiment of the present application is carried out clear, complete
Site preparation describes, it is clear that described embodiment is only some embodiments of the present application, rather than whole embodiments.It is based on
Embodiment in the application, the every other embodiment that those of ordinary skill in the art are obtained, belong to the application protection
Scope.
It is below in conjunction with the accompanying drawings and specific real to enable the above-mentioned purpose of the application, feature and advantage more obvious understandable
Mode is applied to be described in further detail the application.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence
In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI
Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information
Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address
The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art,
The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then
Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address
Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute
It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result,
Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly-
The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through
Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering
Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly
The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application
Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save
Time and human cost.
Embodiment one
Reference picture 1, a kind of step flow chart of the embodiment of the method for determination address uniqueness of the application is shown, it is described
Method comprises the following steps:
Step 102, based on a plurality of comprising same link information or POI (Point of Interest, point of interest) information
Text Address obtains multiple latitude and longitude coordinates;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining
Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI
Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established
Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse
Breath.
Step 104, clustering processing is carried out to the multiple latitude and longitude coordinates;
Explanation is needed exist for, clustering algorithm can be utilized in the embodiment of the present application to the latitude and longitude coordinates information
Carry out clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-
Based Spatial Clustering of Applications with Noise), DBSCAN is a kind of based on high density connection
The clustering algorithm in logical region, its purpose seek to filter density regions, find dense sample point.That is, find low
The high-density region of density area separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM
Cluster, FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI
Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI
A plurality of Text Address in any one address be unique in city.Therefore, the embodiment of the present invention passes through clustering processing
Mode determine address uniqueness, can more accurately determine some address in same city whether be it is unique, from
And it can significantly lift the coverage rate of Address Recognition.
Step 106, determined according to the result of the clustering processing it is described a plurality of comprising same link information or POI
Whether Text Address is unique.
That is, when the result of the clustering processing is can be polymerized to a class, then it is to connect to illustrate the road or POI
Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI
It is unique in city;When the result of the clustering processing could not gather for a class, then it is not connect to illustrate the road or POI
Continuous;Any one Text Address in a plurality of Text Address comprising same link information or POI can be determined
It is not unique in city.
For example, " Beijing Chang An Street ", Beijing only has a Chang'an street, by clustering processing, includes " Beijing Chang An Street "
Multiple addresses corresponding to latitude and longitude information can only be polymerized to a class, so i.e. can determine whether that " Beijing Chang An Street " is continuous.
For another example " little Ying East Roads, Beijing ", there is two in Beijing:One in Chaoyang District, another in Haidian District, Liang Tiaolu
It is separated by the distant of dozens of kilometres.According to multiple latitude and longitude informations of multiple addresses comprising little Ying East Roads, Beijing, clustering processing is carried out
Afterwards, two classes are converged to, that is, cluster result is not 1.Therefore, it is possible to judge that " little Ying East Roads, Beijing " is not just continuous.
Preferably, if the embodiment of the present application to be applied to the application scenarios for determining that shipping address is reachable, in the step
Following steps are can further include after rapid 106:
If a plurality of Text Address comprising same link information or POI is unique, by more provisions
This address is recorded as the address that can be sent to.
So, can be sent with charge free the address with direct basis when actually sending with charge free what is wrap up etc..Preferably, can also
By all address aggregations for being recorded as being sent into an address list or address base, in practical application, be able to can be sent to
Address sends rule to match with pulling, so that it is determined that sending region with charge free.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address
Data set.Therefore, the embodiment of the present application is passed through to being obtained based on a plurality of Text Address comprising same link information or POI
The multiple corresponding latitude and longitude coordinates arrived carry out clustering processing, and then determine a plurality of bag according to the result of the clustering processing
Whether unique the Text Address of information containing same link or POI is.Therefore, the embodiment of the present application can be in shipping address
Middle address information is lack of standardization, even information errors situations, still can determine whether the address is unique in city.With
Prior art is compared, the embodiment of the present application can more accurate, efficiently, neatly determine some address in same city whether
It is unique, and then significantly lifts the coverage rate of Address Recognition.
Embodiment two
Reference picture 2, show the step flow chart of another embodiment of the method for determining address uniqueness of the application, tool
Body may include steps of:
Step 202, a plurality of Text Address is obtained from Text Address storehouse;
Normal conditions, the Text Address storehouse can be stored with the Text Address of extensive quantity.Described in the embodiment of the present application
A plurality of Text Address directly can be obtained from the Text Address storehouse.
Step 204, judge road information in every Text Address or POI whether in same administrative area
In, if it is judged that in same administrative area, then to perform step 206;Otherwise step 208 is performed;
Preferably, in another embodiment of the application, can also enter after obtaining Text Address from Text Address storehouse
One step obtains city, administrative area, road or the POI of the address, based on these information, determine whether road information or
Whether POI is in same administrative area.
Step 206, it is determined that the address comprising the road information or POI is unique.
Explanation is needed exist for, if it is determined that road or POI can be determined directly in same administrative area
Address corresponding with the road information or POI is unique in city.
Step 208, if it is judged that not in same administrative area, then to be believed based on a plurality of same link that includes
The Text Address of breath or POI obtains corresponding latitude and longitude coordinates;
Need exist for explanation, if it is determined that road information or POI not in same administrative area, this
Inventive embodiments also need to by such as previous embodiment one provide clustering processing further determine that it is corresponding to the road or POI
A plurality of Text Address whether be unique in city.
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining
Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI
Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established
Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse
Breath.Participle and cutting word processing can also be first carried out for every Text Address, obtains city, the administration of every Text Address
Area, road information or POI;Then the text with same link information or POI for belonging to same city is selected
Address;Again based on a plurality of Text Address comprising same link or POI selected, obtained from address latitude and longitude information storehouse corresponding
Latitude and longitude coordinates.
Step 210, clustering processing is carried out to the multiple latitude and longitude coordinates;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application
Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based
Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region
Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area
The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster,
FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI
Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI
A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing
Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big
Amplitude lifts the coverage rate of Address Recognition.
Step 212, determined according to the result of the clustering processing described a plurality of comprising same link information or POI
Whether Text Address is unique.
Specifically, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI
Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI
It is unique;When the result of the clustering processing could not gather for a class, then illustrate that the road or POI are discontinuous;Can
To determine that any one Text Address in a plurality of Text Address comprising same link information or POI is not unique
's.
For example, Beijing Chang An Street, implements through multiple administrative areas such as Chaoyang District, Dongcheng District, Xicheng District, but by the present invention
After the clustering processing that example proposes, cluster result 1, this explanation Chang'an street is continuous, therefore includes the address of " Beijing Chang An Street "
It is unique in Beijing.
For another example in a large amount of Text Address comprising " Hangzhou one West Road of text ", might have:
Address A:Xihu District of Hangzhou City one West Road of text;
Address B:Hangzhou Yuhang District one West Road of text;
So, can see by substantial amounts of Text Address, " literary a West Road " in the administrative area of Hangzhou at least two all
In the presence of.Therefore by way of judging road in the Text Address or POI whether in same administrative area, can not judge
" a literary West Road " is a road or two road in Hangzhou.And by longitude and latitude clustering processing, it can be included all
The address for having " Wenyi West Road, Hangzhou " is all converted into corresponding latitude and longitude coordinates point.Then these latitude and longitude coordinates point sets are entered
The processing of row density clustering.If the result of clustering processing is 1, that is, is polymerized to a class, that just illustrates these longitudes and latitudes
There is a closely located transitive relation in degree coordinate points, this just forms a line continuously to extend, so as to judge " Wen Yixi
Road " is continuous, and it is all unique therefore, to include multiple addresses of " literary a West Road " in Hangzhou.
Therefore, can by the embodiment of the present invention no matter the administrative area described in address is " Xihu District " or " Yuhang District "
Only to need to judge whether unique the address of the road comprising " a literary West Road " or POI is in Hangzhou, if it is determined that
The address of road or POI comprising " a literary West Road " is unique in Hangzhou, then need not consider administrative area information, even if
Administrative information region in address, which is filled in, wrong nor affects on result of determination.Therefore, accurate group can be reached in specific send with charge free
The purpose sent.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence
In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI
Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information
Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address
The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art,
The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then
Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address
Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute
It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result,
Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly-
The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through
Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering
Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly
The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application
Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save
Time and human cost.
Embodiment three
Reference picture 3, show a kind of step flow chart of Text Address analysis method embodiment of the application, methods described
Specifically comprise the following steps:
Step 302, a plurality of Text Address is obtained from Text Address storehouse.
Normal conditions, the Text Address storehouse can be stored with the Text Address of extensive quantity.Described in the embodiment of the present application
A plurality of Text Address directly can be obtained from the Text Address storehouse.
Step 304, carry out participle for every Text Address and cutting word is handled, obtain the city of every Text Address
City, administrative area, road information or POI;
Calculated it should be noted that address participle Cooley can be based in the embodiment of the present application with existing participle and cutting word
Method is handled every Text Address, obtains city, administrative area, road or the POI of every Text Address.Mesh
Before, this kind of participle and cutting word algorithm have a lot, and the embodiment of the present application using which kind of algorithm to not being limited specifically.
Step 306, the road included in a plurality of Text Address or POI are counted in each administrative area in same city
Distribution proportion;
Preferably, in another embodiment of the application, the road can be counted by the distribution mode of probability calculation
The distribution proportion situation of road or POI in each administrative area in a city.
Step 308, if the distribution proportion is not less than predetermined threshold value, the road in the Text Address is determined whether
Whether information or POI are only distributed in same administrative area, if so, then judging to include the road information or POI
Text Address be unique.
If the distribution proportion is less than predetermined threshold value, judge that the Text Address is not deposited in each administrative area
;In such a case, it is possible to the Text Address is directly filtered out, but only to the remaining text after filtration treatment
Address is judged.
Preferably, the predetermined threshold value can be pre-set based on experience value, such as it is a ten thousandth that can set threshold value.
For example, province-city-area's information corresponding to road name in every Text Address;Count different corresponding to every road name
The probability in province-city-area;The small data of probability are screened out respectively according to predetermined threshold value in each administrative region, then judge the road
Whether title can be distributed in different province-city-areas, if it is not, then determining that the address comprising the road is in the city
Uniquely.
Whether the embodiment of the present application can first pass through judges road information in every Text Address or POI same
In individual administrative area;In the case where judged result is in same administrative area, can directly determine to include the road information
Or the address of POI is unique.And in the case of being only not in same administrative area in judged result, just based on a plurality of
Corresponding latitude and longitude coordinates, and then the knot for passing through clustering processing are obtained comprising the Text Address of same link information or POI
Fruit judges whether unique the address comprising the road information or POI is.The embodiment of the present application is by by Text Address
Analysis is combined with longitude and latitude clustering processing to determine the uniqueness of address, the needs on product and engineering can be taken into account, big
Realized under scale distribution formula parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly shorten
Calculate the time.
Example IV
Reference picture 4, show a kind of step flow chart of longitude and latitude clustering processing embodiment of the method for the application, the side
Method specifically comprises the following steps:
Step 402, carry out participle for every Text Address and cutting word is handled, obtain the city of every Text Address
City, administrative area, road information or POI;
Step 404, the Text Address with same link information or POI for belonging to same city is selected;
Step 406, based on a plurality of Text Address comprising same link information or POI selected, from address longitude and latitude
Spend information bank and obtain corresponding latitude and longitude coordinates;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining
Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI
Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established
Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse
Breath.
Step 408, corresponding multiple longitude and latitude mesh coordinates are calculated respectively according to the multiple latitude and longitude coordinates;
Preferably, respectively by lng=lng*1000, lat=lat*1000 mode is according to the latitude and longitude coordinates meter
Calculation obtains corresponding longitude and latitude mesh coordinate.Specifically, lng=lng*1000, lat=lat*1000 are to return longitude and latitude point
Tie to above one about 100 meters * 100 meters of geographic grid, regard 100 meters * 100 meters corresponding scopes as a fundamental geological
Grid.Wherein, a grid is identified using the central point of grid.In units of grid, by all longitude and latitude points in grid all
Sum up in the point that on this grid, clustered in grid aspect.The purpose for the arrangement is that in order to compress longitude and latitude point data.
Step 410, clustering processing is carried out to the multiple longitude and latitude mesh coordinate;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application
Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based
Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region
Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area
The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster,
FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI
Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI
A plurality of Text Address in any one address be unique in city.Therefore, the embodiment of the present invention passes through clustering processing
Mode determine address uniqueness, can more accurately determine some address in same city whether be it is unique, from
And it can significantly lift the coverage rate of Address Recognition.
Preferably, clustering processing is carried out to the longitude and latitude mesh coordinate using default minimal point and radius number;
Wherein, the minimal point used and radius number of clustering can be selected based on experience value.
Step 412, according to the result of the clustering processing, determine described a plurality of to include same link information or POI
Text Address it is whether unique.
That is, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI
Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI
It is unique in city;When the result of the clustering processing could not gather for a class, then it is not connect to illustrate the road or POI
Continuous;Any one Text Address in a plurality of Text Address comprising same link information or POI can be determined
It is not unique in city.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address
Data set.Therefore, the embodiment of the present application can be applied is sending the existing logistics that matches of rule to be pulled to send based on shipping address with pulling
In intelligence system, then pass through multiple phases to being got based on a plurality of Text Address comprising same link information or POI
The latitude and longitude coordinates answered carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link
Whether unique the Text Address of information or POI is.Therefore, the embodiment of the present application can in shipping address address information
Lack of standardization, even information errors situations, it still can determine whether the address is unique.Compared with prior art, this Shen
Please embodiment can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly
The coverage rate of degree lifting Address Recognition.
Embodiment five
Reference picture 5, show a kind of structured flowchart of the device embodiment of determination address uniqueness of the application.It is described true
Determining the device 500 of address uniqueness can specifically include:Longitude and latitude acquisition module 510, clustering processing module 520, uniqueness are true
Cover half block 530;Wherein,
The longitude and latitude acquisition module 510, for based on a plurality of Text Address comprising same link information or POI
Obtain multiple latitude and longitude informations;
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining
Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI
Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established
Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse
Breath.
Preferably, in another embodiment of the application, the longitude and latitude acquisition module 510 can also be further used for
Corresponding longitude and latitude is obtained from address latitude and longitude information storehouse based on a plurality of Text Address comprising same link information or POI
Coordinate.
The clustering processing module 520, for carrying out clustering processing to the multiple latitude and longitude coordinates;
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application
Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based
Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region
Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area
The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster,
FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI
Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI
A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing
Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big
Amplitude lifts the coverage rate of Address Recognition.
Uniqueness determining module 530, for being determined a plurality of to include same link information according to the result of the clustering processing
Or whether the Text Address of POI is unique.
That is, when the result of the clustering processing can be gathered for a class, then it is to connect to illustrate the road or POI
Continuous, it can be determined that any one Text Address in a plurality of Text Address comprising same link information or POI
It is unique;When the result of the clustering processing could not gather for a class, then illustrate that the road or POI are discontinuous;Can
To determine that any one Text Address in a plurality of Text Address comprising same link information or POI is not unique
's.
Scheme provided in an embodiment of the present invention is the judgement based on extensive historical address, rather than for small-scale address
Data set.Therefore, the embodiment of the present application can be applied is sending the existing logistics that matches of rule to be pulled to send based on shipping address with pulling
In intelligence system, then pass through multiple phases to being got based on a plurality of Text Address comprising same link information or POI
The latitude and longitude coordinates answered carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link
Whether unique the Text Address of information or POI is.Therefore, the embodiment of the present application can in shipping address address information
Lack of standardization, even information errors situations, it still can determine whether the address is unique.Compared with prior art, this Shen
Please embodiment can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then significantly
The coverage rate of degree lifting Address Recognition.
Embodiment six
Reference picture 6, show the structured flowchart of another device embodiment for determining address uniqueness of the application.It is described
Determining the device 600 of address uniqueness can specifically include:Address acquisition module 610, administrative area judge module 620, longitude and latitude
Acquisition module 630, clustering processing module 640, uniqueness determining module 650;Wherein,
The address acquisition module 610, for obtaining a plurality of Text Address from Text Address storehouse.
The administrative area judge module 620, for judging that road information in every Text Address or POI be
It is no in same administrative area;
Preferably, in another embodiment of the application, can also enter after obtaining Text Address from Text Address storehouse
One step obtains city, administrative area, road or the POI of the address, based on these information, determines whether road or POI
Whether in same administrative area.Accordingly, the administrative area judge module 620 may further include:
First acquisition unit 621, for carrying out participle and cutting word processing for every Text Address, obtain described per provision
City, administrative area, road information or the POI of this address;
Calculated it should be noted that address participle Cooley can be based in the embodiment of the present application with existing participle and cutting word
Method is handled the Text Address, obtains city, administrative area, road or the POI of the Text Address.At present, this
Class is segmented and cutting word algorithm has a lot, and the embodiment of the present application using which kind of algorithm to not being limited specifically.
Statistic unit 622, for counting point of the road information or POI in each administrative area in same city
Cloth ratio;Preferably, in another embodiment of the application, the road can be counted by the distribution mode of probability calculation
Or distribution proportion situations of the POI in each administrative area in a city.
Identifying unit 623, if being not less than predetermined threshold value for the distribution proportion, determine whether the Text Address
In road information or POI whether be only distributed in same administrative area, if so, then judge comprising the road information or
The Text Address of POI is unique.
Preferably, the predetermined threshold value can be pre-set based on experience value, such as it is a ten thousandth that can set threshold value.
For example, province-city-area's information corresponding to road name in every Text Address;Count different corresponding to every road name
The probability in province-city-area;The small data of probability are screened out respectively according to predetermined threshold value in each administrative region, then judge the road
Whether title can be distributed in different province-city-areas, if it is not, then determining that the address comprising the road is in the city
Uniquely.
The longitude and latitude acquisition module 630, if judging for the administrative area judge module 620 in the Text Address
Road or POI be not in same administrative area, then based on a plurality of Text Address comprising same link information or POI
Obtain corresponding latitude and longitude coordinates.Specifically, the longitude and latitude acquisition module 630 includes:
Second acquisition unit 631, for carrying out participle and cutting word processing for every Text Address, obtain described per provision
City, administrative area, road information or the POI of this address;
Unit 632 is selected, the text with same link information or POI in same city is belonged to for selecting
Location;
Latitude and longitude coordinates acquiring unit 633, for based on it is described select that unit selects a plurality of include same link or POI
Text Address, obtain corresponding latitude and longitude coordinates from address latitude and longitude information storehouse.
Preferably, a plurality of Text Address can directly obtain from existing Text Address storehouse, be then based on obtaining
Every Text Address classified according to identical road or POI, based on a plurality of with same link information or POI
Text Address, obtain multiple latitude and longitude coordinates.In another embodiment of the application, Text Address storehouse can also be based on and established
Corresponding address latitude and longitude information storehouse, then address longitude and latitude is believed corresponding to acquisition directly from the address latitude and longitude information storehouse
Breath.
The clustering processing module 640, for carrying out clustering processing to the multiple latitude and longitude coordinates;Preferably, it is described
Clustering processing module 640 can specifically include:
Computing unit 641, for by lng=lng*1000, lat=lat*1000 mode to be to the multiple warp respectively
Latitude coordinate carries out that corresponding longitude and latitude mesh coordinate is calculated;
Clustering processing unit 642, for using default minimal point and radius number to the longitude and latitude mesh coordinate
Information carries out clustering processing.Wherein, the minimal point used and radius number of clustering can be selected based on experience value.
Explanation is needed exist for, the latitude and longitude information can be carried out using clustering algorithm in the embodiment of the present application
Clustering processing.Preferably, described clustering algorithm can use density-based algorithms (DBSCAN, Density-based
Spatial Clustering of Applications with Noise), DBSCAN is that one kind is based on high density UNICOM region
Clustering algorithm, its purpose seek to filter density regions, find dense sample point.That is, find by low density area
The high-density region of domain separation.At present, this kind of clustering algorithm has a lot, such as K-Means clusters, hierarchical clustering, SOM cluster,
FCM clusters etc., the embodiment of the present application using which kind of clustering algorithm to not being limited specifically.
The embodiment of the present application by clustering processing with may determine that a plurality of text comprising same link information or POI
Whether latitude and longitude information corresponding to location can gather for one kind, if, it is determined that contain the same link information or POI
A plurality of Text Address in any one address be unique.Therefore, the embodiment of the present invention is true by way of clustering processing
Determine address uniqueness, can more accurately determine whether some address is unique in same city, so as to big
Amplitude lifts the coverage rate of Address Recognition.
Uniqueness determining module 650, for being determined a plurality of to include same link information according to the result of the clustering processing
Or whether the Text Address of POI is unique.Specifically, the uniqueness determining module 650 includes:
First determining unit 651, if the result for the clustering processing module 640 is that can be polymerized to a class,
Then determine that a plurality of Text Address comprising same link information or POI is unique;
Second determining unit 652, if the result for the clustering processing module 640 is that can not be polymerized to one
Class, it is determined that a plurality of Text Address comprising same link information or POI is not unique.
Preferably, if the embodiment of the present application to be applied to the application scenarios for determining that shipping address is reachable, described device
It can further include:
Logging modle 660, if for the uniqueness determining module 650 determine it is described it is a plurality of comprising same link information or
The Text Address of POI is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
So, can be sent with charge free the address with direct basis when actually sending with charge free what is wrap up etc..Preferably, can also
By all address aggregations for being recorded as being sent into an address list or address base, in practical application, be able to can be sent to
Address sends rule to match with pulling, so that it is determined that sending region with charge free.
The embodiment of the present application, which can be applied to pull in the existing logistics for sending rule to match based on shipping address and pulling, sends intelligence
In system, then by multiple corresponding to being got based on a plurality of Text Address comprising same link information or POI
Latitude and longitude coordinates carry out clustering processing, and then are determined according to the result of the clustering processing described a plurality of to include same link information
Or whether unique the Text Address of POI is.Therefore, the embodiment of the present application address information can not advise in shipping address
The situation of model, even information errors, it still can determine whether the address is unique in city.Compared with prior art,
The embodiment of the present application can be more accurate, efficiently, neatly determine whether some address is unique in same city, and then
Significantly lift the coverage rate of Address Recognition.
Further, the embodiment of the present application can first pass through the road information or POI judged in every Text Address
Whether in same administrative area;In the case where judged result is in same administrative area, can directly determine to include institute
It is unique to state the address of road information or POI.And in the case of being only not in same administrative area in judged result,
Corresponding latitude and longitude coordinates are just obtained based on a plurality of Text Address comprising same link information or POI, and then by poly-
The result of class processing judges whether unique the address comprising the road information or POI is.The embodiment of the present application passes through
Text Address is analyzed and is combined with longitude and latitude clustering processing to determine the uniqueness of address, can be taken into account on product and engineering
Need, realized under large-scale distributed parallel computation environment, can maximumlly reduce the consumption of computing resource, so as to significantly
The shortening of degree calculates the time.
To sum up, can be more accurate, efficiently, neatly determine some address in same city by the embodiment of the present application
Whether it is inside unique, so as to significantly lift the coverage rate of Address Recognition;Resource consumption can be also reduced simultaneously, it is substantial amounts of to save
Time and human cost.
Embodiment seven
The embodiment of the present application also provides a kind of system for determining address uniqueness, and it has institute in above-described embodiment five, six
All features of the device for the determination address uniqueness stated, therefore the system of the determination address uniqueness described in the embodiment of the present application has
There are all beneficial effects in above-described embodiment five, six, the embodiment of the present application will not be repeated here.
For device embodiment, because it is substantially similar to embodiment of the method, so description is fairly simple, it is related
Part illustrates referring to the part of embodiment of the method.
Each embodiment in this specification is described by the way of progressive, what each embodiment stressed be with
The difference of other embodiment, between each embodiment identical similar part mutually referring to.
It should be understood by those skilled in the art that, the embodiment of the embodiment of the present application can be provided as method, apparatus or calculate
Machine program product.Therefore, the embodiment of the present application can use complete hardware embodiment, complete software embodiment or combine software and
The form of the embodiment of hardware aspect.Moreover, the embodiment of the present application can use one or more wherein include computer can
With in the computer-usable storage medium (including but is not limited to magnetic disk storage, CD@ROM, optical memory etc.) of program code
The form of the computer program product of implementation.
In a typical configuration, the computer equipment includes one or more processors (CPU), input/output
Interface, network interface and internal memory.Internal memory may include the volatile memory in computer-readable medium, random access memory
The form such as device (RAM) and/or Nonvolatile memory, such as read-only storage (ROM) or flash memory (flash RAM).Internal memory is to calculate
The example of machine computer-readable recording medium.Computer-readable medium includes permanent and non-permanent, removable and non-removable media can be with
Realize that information stores by any method or technique.Information can be computer-readable instruction, data structure, the module of program or
Other data.The example of the storage medium of computer includes, but are not limited to phase transition internal memory (PRAM), static RAM
(SRAM), dynamic random access memory (DRAM), other kinds of random access memory (RAM), read-only storage
(ROM), Electrically Erasable Read Only Memory (EEPROM), fast flash memory bank or other memory techniques, read-only optical disc are read-only
Memory (CD ROM), digital versatile disc (DVD) or other optical storages, magnetic cassette tape, tape magnetic rigid disk storage or
Other magnetic storage apparatus or any other non-transmission medium, the information that can be accessed by a computing device available for storage.According to
Herein defines, and computer-readable medium does not include the computer readable media (transitory media) of non-standing, such as
The data-signal and carrier wave of modulation.
The embodiment of the present application is with reference to according to the method for the embodiment of the present application, terminal device (system) and computer program
The flow chart and/or block diagram of product describes.It should be understood that can be by computer program instructions implementation process figure and/or block diagram
In each flow and/or square frame and the flow in flow chart and/or block diagram and/or the combination of square frame.These can be provided
Computer program instructions are set to all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing terminals
Standby processor is to produce a machine so that is held by the processor of computer or other programmable data processing terminal equipments
Capable instruction is produced for realizing in one flow of flow chart or multiple flows and/or one square frame of block diagram or multiple square frames
The device for the function of specifying.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing terminal equipments
In the computer-readable memory to work in a specific way so that the instruction being stored in the computer-readable memory produces bag
The manufacture of command device is included, the command device is realized in one flow of flow chart or multiple flows and/or one side of block diagram
The function of being specified in frame or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing terminal equipments so that
Series of operation steps is performed on computer or other programmable terminal equipments to produce computer implemented processing, so that
The instruction performed on computer or other programmable terminal equipments is provided for realizing in one flow of flow chart or multiple flows
And/or specified in one square frame of block diagram or multiple square frames function the step of.
Although having been described for the preferred embodiment of the embodiment of the present application, those skilled in the art once know base
This creative concept, then other change and modification can be made to these embodiments.So appended claims are intended to be construed to
Including preferred embodiment and fall into having altered and changing for the embodiment of the present application scope.
Finally, it is to be noted that, herein, such as first and second or the like relational terms be used merely to by
One entity or operation make a distinction with another entity or operation, and not necessarily require or imply these entities or operation
Between any this actual relation or order be present.Moreover, term " comprising ", "comprising" or its any other variant meaning
Covering including for nonexcludability, so that process, method, article or terminal device including a series of elements are not only wrapped
Those key elements, but also the other element including being not expressly set out are included, or is also included for this process, method, article
Or the key element that terminal device is intrinsic.In the absence of more restrictions, wanted by what sentence "including a ..." limited
Element, it is not excluded that other identical element in the process including the key element, method, article or terminal device also be present.
Above to the method, apparatus and system of a kind of determination address uniqueness provided herein, detailed Jie has been carried out
Continue, specific case used herein is set forth to the principle and embodiment of the application, and the explanation of above example is only
It is to be used to help understand the present processes and its core concept;Meanwhile for those of ordinary skill in the art, according to this Shen
Thought please, can there is change part in specific embodiments and applications, in summary, this specification content should not manage
Solve as the limitation to the application.
Claims (17)
- A kind of 1. method for determining address uniqueness, it is characterised in that including:Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;Clustering processing is carried out to the multiple latitude and longitude coordinates;Whether a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing It is unique.
- 2. according to the method for claim 1, it is characterised in that the result according to the clustering processing determines described more Bar includes whether same link information or the Text Address of POI are unique steps, including:If the result of the clustering processing is can be polymerized to a class, it is determined that described a plurality of to include same link information or POI The Text Address of information is unique;If the result of the clustering processing is can not be polymerized to a class, it is determined that it is described it is a plurality of comprising same link information or The Text Address of POI is not unique.
- 3. according to the method for claim 1, it is characterised in that described to include same link information or point of interest based on a plurality of Before the Text Address of POI obtains the step of multiple latitude and longitude coordinates, in addition to:A plurality of Text Address is obtained from Text Address storehouse.
- 4. according to the method for claim 3, it is characterised in that obtain a plurality of Text Address in the storehouse from Text Address After step, in addition to:Judge road information in every Text Address or POI whether in same administrative area;If it is judged that for not in same administrative area, then a plurality of same link information or POI are included based on described Text Address obtain corresponding latitude and longitude coordinates;If it is judged that in same administrative area, it is determined that the address comprising the road information or POI is only One.
- 5. according to the method for claim 4, it is characterised in that the road information judged in every Text Address Or POI whether the step in same administrative area, including:Participle and cutting word processing are carried out for every Text Address, obtains the city, administrative area, road of every Text Address Information or POI;Count the distribution proportion of the road information or POI in each administrative area in same city;If the distribution proportion is not less than predetermined threshold value, road information or POI letters in the Text Address are determined whether Whether breath is only distributed in same administrative area, if so, then judging that the Text Address comprising the road information or POI is Uniquely.
- 6. according to the method for claim 1, it is characterised in that described to include same link information or point of interest based on a plurality of The Text Address of POI obtains the step of multiple latitude and longitude coordinates, including:Participle and cutting word processing are carried out for every Text Address, obtains the city, administrative area, road of every Text Address Information or POI;Select the Text Address with same link information or POI for belonging to same city;Based on a plurality of Text Address comprising same link or POI selected, corresponding warp is obtained from address latitude and longitude information storehouse Latitude coordinate.
- 7. according to the method for claim 1, it is characterised in that described that clustering processing is carried out to the multiple latitude and longitude coordinates The step of include:The multiple latitude and longitude coordinates are calculated by lng=lng*1000, lat=lat*1000 mode respectively Corresponding longitude and latitude mesh coordinate;Clustering processing is carried out to the longitude and latitude mesh coordinate information using default minimal point and radius number.
- A kind of 8. method for determining that shipping address is reachable, it is characterised in that including:Multiple latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or point of interest POI;Clustering processing is carried out to the multiple latitude and longitude coordinates;Whether a plurality of Text Address comprising same link information or POI is determined according to the result of the clustering processing It is unique;If a plurality of Text Address comprising same link information or POI is unique, by a plurality of text Location is recorded as the address that can be sent to.
- A kind of 9. device for determining address uniqueness, it is characterised in that including:Longitude and latitude acquisition module, for being obtained based on a plurality of Text Address comprising same link information or point of interest POI Multiple latitude and longitude coordinates;Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;Uniqueness determining module, for determining a plurality of comprising same link information or POI to believe according to the result of the clustering processing Whether the Text Address of breath is unique.
- 10. device according to claim 9, it is characterised in that the uniqueness determining module includes:First determining unit, if the result for the clustering processing module is that can be polymerized to a class, it is determined that described The a plurality of Text Address comprising same link information or POI is unique;Second determining unit, if the result for the clustering processing module is that can not be polymerized to a class, it is determined that institute It is not unique to state a plurality of Text Address comprising same link information or POI.
- 11. device according to claim 9, it is characterised in that described device also includes:Address acquisition module, for obtaining a plurality of Text Address from Text Address storehouse.
- 12. device according to claim 11, it is characterised in that described device also includes:Administrative area judge module, for judging road information in every Text Address or POI whether same In administrative area;AccordinglyThe longitude and latitude acquisition module, if the judged result for the administrative area judge module is not in same administrative area In, then corresponding latitude and longitude coordinates are obtained based on a plurality of Text Address comprising same link information or POI;The uniqueness determining module, if the judged result for being additionally operable to the administrative area judge module is in same administrative area In, it is determined that the address comprising the road information or POI is unique.
- 13. device according to claim 12, it is characterised in that the administrative area judge module includes:First acquisition unit, for carrying out participle and cutting word processing for every Text Address, obtain every Text Address City, administrative area, road information or POI;Statistic unit, for counting the distribution proportion of the road information or POI in each administrative area in same city;Identifying unit, if being not less than predetermined threshold value for the distribution proportion, determine whether the road in the Text Address Whether road information or POI are only distributed in same administrative area, if so, then judging to believe comprising the road information or POI The Text Address of breath is unique.
- 14. device according to claim 9, it is characterised in that the longitude and latitude acquisition module includes:Second acquisition unit, for carrying out participle and cutting word processing for every Text Address, obtain every Text Address City, administrative area, road information or POI;Unit is selected, the Text Address with same link information or POI in same city is belonged to for selecting;Latitude and longitude coordinates acquiring unit, for based on a plurality of text comprising same link or POI selected unit and selected Address, corresponding latitude and longitude coordinates are obtained from address latitude and longitude information storehouse.
- 15. device according to claim 9, it is characterised in that the clustering processing module includes:Computing unit, for being sat respectively by lng=lng*1000, lat=lat*1000 mode to the multiple longitude and latitude Mark carries out that corresponding longitude and latitude mesh coordinate is calculated;Clustering processing unit, for being carried out using default minimal point and radius number to the longitude and latitude mesh coordinate information Clustering processing.
- A kind of 16. device for determining that shipping address is reachable, it is characterised in that including:Longitude and latitude acquisition module, for being obtained based on a plurality of Text Address comprising same link information or point of interest POI Multiple latitude and longitude coordinates;Clustering processing module, for carrying out clustering processing to the multiple latitude and longitude coordinates;Uniqueness determining module, for according to the result of the clustering processing determine it is described it is a plurality of comprising same link information or Whether the Text Address of POI is unique;Logging modle, if being determined for the uniqueness determining module described a plurality of comprising same link information or POI Text Address is unique, then a plurality of Text Address is recorded as to the address that can be sent to.
- 17. it is a kind of determine address uniqueness system, it is characterised in that including according to claim any one of 9-15 really Determine the device of address uniqueness.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610552332.XA CN107622061A (en) | 2016-07-13 | 2016-07-13 | A kind of method, apparatus and system for determining address uniqueness |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201610552332.XA CN107622061A (en) | 2016-07-13 | 2016-07-13 | A kind of method, apparatus and system for determining address uniqueness |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107622061A true CN107622061A (en) | 2018-01-23 |
Family
ID=61087314
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201610552332.XA Pending CN107622061A (en) | 2016-07-13 | 2016-07-13 | A kind of method, apparatus and system for determining address uniqueness |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107622061A (en) |
Cited By (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109376761A (en) * | 2018-09-12 | 2019-02-22 | 北京三快在线科技有限公司 | The method for digging and device of a kind of address mark and its longitude and latitude |
CN109635063A (en) * | 2018-12-06 | 2019-04-16 | 拉扎斯网络科技(上海)有限公司 | Information processing method, device, electronic equipment and the storage medium of address base |
CN109992638A (en) * | 2019-03-29 | 2019-07-09 | 北京三快在线科技有限公司 | Generation method, device, electronic equipment and the storage medium of geographical location POI |
CN111506675A (en) * | 2019-01-11 | 2020-08-07 | 阿里巴巴集团控股有限公司 | Method, apparatus, device and medium for determining points of interest |
CN111723165A (en) * | 2019-03-18 | 2020-09-29 | 阿里巴巴集团控股有限公司 | Address interest point determining method, device and system |
CN112016326A (en) * | 2020-09-25 | 2020-12-01 | 北京百度网讯科技有限公司 | Map area word recognition method and device, electronic equipment and storage medium |
CN113076746A (en) * | 2020-01-06 | 2021-07-06 | 阿里巴巴集团控股有限公司 | Data processing method and system, storage medium and computing device |
CN116541474A (en) * | 2023-07-05 | 2023-08-04 | 平安银行股份有限公司 | Object acquisition method, device, electronic equipment and storage medium |
CN113076746B (en) * | 2020-01-06 | 2024-05-31 | 阿里巴巴集团控股有限公司 | Data processing method and system, storage medium and computing device |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102867004A (en) * | 2011-07-06 | 2013-01-09 | 高德软件有限公司 | Method and system for matching addresses |
CN103023997A (en) * | 2012-11-29 | 2013-04-03 | 江苏鸿信系统集成有限公司 | Massive geographical information name and address conversion method and device based on grid caching technology |
CN103810194A (en) * | 2012-11-11 | 2014-05-21 | 刘龙 | Geographic coding method, position inquiring system and position inquiring method |
CN103853725A (en) * | 2012-11-29 | 2014-06-11 | 深圳先进技术研究院 | Traffic track data noise reduction method and system |
CN104484790A (en) * | 2014-12-26 | 2015-04-01 | 清华大学深圳研究生院 | Address match method and device of logistics business |
CN105509743A (en) * | 2015-11-24 | 2016-04-20 | 上海汽车集团股份有限公司 | A positioning processing method, a business platform and a network system |
CN105718465A (en) * | 2014-12-02 | 2016-06-29 | 阿里巴巴集团控股有限公司 | Geofence generation method and device |
-
2016
- 2016-07-13 CN CN201610552332.XA patent/CN107622061A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102867004A (en) * | 2011-07-06 | 2013-01-09 | 高德软件有限公司 | Method and system for matching addresses |
CN103810194A (en) * | 2012-11-11 | 2014-05-21 | 刘龙 | Geographic coding method, position inquiring system and position inquiring method |
CN103023997A (en) * | 2012-11-29 | 2013-04-03 | 江苏鸿信系统集成有限公司 | Massive geographical information name and address conversion method and device based on grid caching technology |
CN103853725A (en) * | 2012-11-29 | 2014-06-11 | 深圳先进技术研究院 | Traffic track data noise reduction method and system |
CN105718465A (en) * | 2014-12-02 | 2016-06-29 | 阿里巴巴集团控股有限公司 | Geofence generation method and device |
CN104484790A (en) * | 2014-12-26 | 2015-04-01 | 清华大学深圳研究生院 | Address match method and device of logistics business |
CN105509743A (en) * | 2015-11-24 | 2016-04-20 | 上海汽车集团股份有限公司 | A positioning processing method, a business platform and a network system |
Cited By (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109376761A (en) * | 2018-09-12 | 2019-02-22 | 北京三快在线科技有限公司 | The method for digging and device of a kind of address mark and its longitude and latitude |
CN109376761B (en) * | 2018-09-12 | 2021-01-22 | 北京三快在线科技有限公司 | Address identification and longitude and latitude mining method and device thereof |
CN109635063A (en) * | 2018-12-06 | 2019-04-16 | 拉扎斯网络科技(上海)有限公司 | Information processing method, device, electronic equipment and the storage medium of address base |
CN111506675A (en) * | 2019-01-11 | 2020-08-07 | 阿里巴巴集团控股有限公司 | Method, apparatus, device and medium for determining points of interest |
CN111723165A (en) * | 2019-03-18 | 2020-09-29 | 阿里巴巴集团控股有限公司 | Address interest point determining method, device and system |
CN109992638A (en) * | 2019-03-29 | 2019-07-09 | 北京三快在线科技有限公司 | Generation method, device, electronic equipment and the storage medium of geographical location POI |
CN109992638B (en) * | 2019-03-29 | 2020-11-20 | 北京三快在线科技有限公司 | Method and device for generating geographical position POI, electronic equipment and storage medium |
CN113076746A (en) * | 2020-01-06 | 2021-07-06 | 阿里巴巴集团控股有限公司 | Data processing method and system, storage medium and computing device |
CN113076746B (en) * | 2020-01-06 | 2024-05-31 | 阿里巴巴集团控股有限公司 | Data processing method and system, storage medium and computing device |
CN112016326A (en) * | 2020-09-25 | 2020-12-01 | 北京百度网讯科技有限公司 | Map area word recognition method and device, electronic equipment and storage medium |
CN116541474A (en) * | 2023-07-05 | 2023-08-04 | 平安银行股份有限公司 | Object acquisition method, device, electronic equipment and storage medium |
CN116541474B (en) * | 2023-07-05 | 2024-02-02 | 平安银行股份有限公司 | Object acquisition method, device, electronic equipment and storage medium |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107622061A (en) | A kind of method, apparatus and system for determining address uniqueness | |
JP6321681B2 (en) | Method and apparatus for identifying website users | |
Zheng et al. | Detecting collective anomalies from multiple spatio-temporal datasets across different domains | |
CN104050196B (en) | A kind of interest point data redundant detecting method and device | |
CN106201886B (en) | A kind of Proxy Method and device of the verifying of real time data task | |
CN106156965B (en) | Logistics service scheduling method and equipment | |
CN103902622B (en) | Mass moving target aggregation method and device | |
CN104112284B (en) | The similarity detection method and equipment of a kind of picture | |
CN109117433B (en) | Index tree object creation and index method and related device thereof | |
CN108197873A (en) | Warehouse article goods sorting method, device, computer equipment and storage medium | |
CN107783734A (en) | A kind of resource allocation methods, device and terminal based on super fusion storage system | |
CN110458598A (en) | Scene adaptation method, device and electronic equipment | |
CN106033510A (en) | Method and system for identifying user equipment | |
CN106897342A (en) | A kind of data verification method and equipment | |
CN108008936A (en) | A kind of data processing method, device and electronic equipment | |
CN108664583A (en) | A kind of index tree method for building up and image search method | |
CN105898835B (en) | Generate the method and apparatus of the access point attribute information of WAP | |
Yin et al. | A deep learning approach for rooftop geocoding | |
CN107395680A (en) | Shop group's information push and output intent and device, equipment | |
CN106708648B (en) | A kind of the storage method of calibration and system of text data | |
US9390105B2 (en) | System and methods for storing and analyzing geographically-referenced data | |
Duan et al. | Comprehending and analyzing multiday trip-chaining patterns of freight vehicles using a multiscale method with prolonged trajectory data | |
CN108932525A (en) | A kind of behavior prediction method and device | |
US10903684B2 (en) | Method for operating a network having multiple node devices, and network | |
CN116703132A (en) | Management method and device for dynamic scheduling of shared vehicles and computer equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20180418 Address after: Four story 847 mailbox of the capital mansion of Cayman Islands, Cayman Islands, Cayman Applicant after: CAINIAO SMART LOGISTICS HOLDING Ltd. Address before: Cayman Islands Grand Cayman capital building a four storey No. 847 mailbox Applicant before: ALIBABA GROUP HOLDING Ltd. |
|
TA01 | Transfer of patent application right | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20180123 |
|
RJ01 | Rejection of invention patent application after publication |