CN109635063A - Information processing method and device for address library, electronic equipment and storage medium - Google Patents

Information processing method and device for address library, electronic equipment and storage medium Download PDF

Info

Publication number
CN109635063A
CN109635063A CN201811488202.XA CN201811488202A CN109635063A CN 109635063 A CN109635063 A CN 109635063A CN 201811488202 A CN201811488202 A CN 201811488202A CN 109635063 A CN109635063 A CN 109635063A
Authority
CN
China
Prior art keywords
address
latitude
longitude
cluster
standard
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811488202.XA
Other languages
Chinese (zh)
Inventor
沈永新
张伟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Lazas Network Technology Shanghai Co Ltd
Original Assignee
Lazas Network Technology Shanghai Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Lazas Network Technology Shanghai Co Ltd filed Critical Lazas Network Technology Shanghai Co Ltd
Priority to CN201811488202.XA priority Critical patent/CN109635063A/en
Publication of CN109635063A publication Critical patent/CN109635063A/en
Pending legal-status Critical Current

Links

Landscapes

  • Mobile Radio Communication Systems (AREA)

Abstract

The embodiment of the invention relates to the technical field of information processing, and discloses an information processing method of an address library, which comprises the following steps: acquiring addresses and longitude and latitude of each address information in an address base; standardizing the address to obtain a standard address of each address information; clustering and integrating the longitude and latitude of the address information with the same standard address to obtain the standard longitude and latitude of the standard address; and storing standard address information corresponding to each address information in an address library, wherein the standard address information comprises a standard address and standard longitude and latitude. According to the invention, by acquiring each address information stored in the address base, a plurality of addresses are clustered and then processed into a standard address, so that redundant address data are reduced; in addition, clustering is carried out according to a plurality of longitude and latitude values, and then the clustering is processed into a standard longitude and latitude, so that the positioning accuracy of the address information is improved.

Description

Information processing method, device, electronic equipment and the storage medium of address base
Technical field
The present invention relates to technical field of information processing more particularly to a kind of information processing methods of address base, device, electronics Equipment and storage medium.
Background technique
With the development of internet, various electronic database of information are by more and more extensive application, in the take-away of catering field Industry, user search for businessman by electronic map and complete lower single operation.
Each POI (point of interest, Point of Interest) data in address base that electronic map uses include ground Location and corresponding longitude and latitude, longitude and latitude are usually to carry means of communication arrival specific location by staff to go to get ready, will be acquired To latitude and longitude information store into address base.Due in actual application, physical location locating for user and work people The position got ready before member may have bigger difference, will result in bigger position error.In addition, to same position, it is different The address full name that people names it may be different, therefore may store two POI numbers for the same position in address base According to, corresponding two different address full name, resulted in address base in this way there are data redundancy, user using electronic map into When row is searched, redundant data can be also shown.
Summary of the invention
A kind of information processing method for being designed to provide address base of embodiment of the present invention, device, electronic equipment and Storage medium is standardized address information by the way of cluster, eliminates redundant data, and obtains more quasi- True position.
In order to solve the above technical problems, embodiments of the present invention provide a kind of information processing method of address base, packet It includes: obtaining the address of each address information and longitude and latitude in address base;Address is standardized, the standard of each address information is obtained Address;The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of normal address Degree;The corresponding normal address information of each address information is stored in address base, normal address information includes normal address and standard Longitude and latitude.
Embodiments of the present invention additionally provide a kind of information processing unit of address base, comprising: address information obtains mould Block, for obtaining the address of each address information and longitude and latitude in address base;Address clustering processing module, for being marked to address Standardization obtains the normal address of each address information;Longitude and latitude clustering processing module, for will be provided with the address of identical standard address Information carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of normal address;Normal address information storage module, is used for Normal address information is stored in address base, normal address information includes normal address and standard longitude and latitude.
Embodiments of the present invention additionally provide a kind of electronic equipment, comprising: at least one processor;And at least one The memory of a processor communication connection;Wherein, memory is stored with the instruction that can be executed by least one processor, instructs quilt At least one processor is executed to realize: obtaining the address of each address information and longitude and latitude in address base;Standard is carried out to address Change, obtains the normal address of each address information;The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, Obtain the standard longitude and latitude of normal address;The corresponding normal address information of each address information, normal address are stored in address base Information includes normal address and standard longitude and latitude.
Embodiments of the present invention additionally provide a kind of non-volatile memory medium, for storing computer-readable program, Computer-readable program is used to execute the information processing method of address base as above for computer.
In terms of existing technologies, the main distinction and its effect are embodiment of the present invention: by obtaining address base Each address information of middle storage, it is a normal address that multiple addresses, which are carried out cluster post-processing, reduces the number of addresses of redundancy According to;It is a standard longitude and latitude according further to the cluster post-processing of multiple latitude and longitude values, improves the positional accuracy of address information.
In addition, being standardized to address, obtain the normal address of each address information, comprising: obtain the region of address at Point;The identical multiple addresses of regional part are clustered, normal address is obtained.Multiple addresses are distinguished according to regional part, really The multiple addresses for protecting cluster are the same address.
In addition, regional part includes province, city, area, street, Lou Hao, any number of combinations in number.Using multiple Regional part is identified, further ensures that multiple addresses of cluster are the same address.
In addition, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, normal address is obtained Standard longitude and latitude, comprising: according to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density and is greater than Or the several points cluster equal to the first dot density threshold value;The point cluster for meeting preset condition is chosen in several points cluster as target Point cluster;Standard longitude and latitude is calculated according to target point cluster.By the method for cluster, the primary focal zone domain of longitude and latitude point can be found, The standard longitude and latitude made is more accurate.
In addition, preset condition are as follows: the longitude and latitude points for including are most, and the longitude and latitude points for including account for multiple longitudes and latitudes The percentage always counted is greater than preset percentage.By the way that preset percentage is arranged, make the longitude and latitude for participating in calculating standard longitude and latitude It counts enough, further such that the standard longitude and latitude arrived is more accurate.
In addition, according to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtain dot density be greater than or After several points cluster equal to the first dot density threshold value, the point cluster for meeting preset condition is chosen in several points cluster as mesh Before punctuate cluster, further includes: judge that the longitude and latitude points in several points cluster, in maximum point cluster account for total points of multiple longitudes and latitudes Percentage whether be greater than or equal to preset percentage;If it is not, then according to the multiple of the address information for having identical standard address Longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to second point density threshold;Second point density threshold is less than First dot density threshold value.By repeatedly clustering, the longitude and latitude points that guarantee participates in calculating standard longitude and latitude reach preset quantity, into The standard longitude and latitude that one step ensure that is more accurate.
In addition, calculating standard longitude and latitude according to target point cluster, comprising: according to each longitude and latitude in target point cluster, calculate warp Spend average value and latitude average value;Longitude average value and latitude average value constitute standard longitude and latitude.By calculating multiple longitudes and latitudes Average value obtain standard longitude and latitude, improve the accuracy of standard longitude and latitude.
In addition, longitude and latitude is the longitude and latitude for getting position ready for dispensing transport power.Using the longitude and latitude historical data got ready, it is not necessarily to Staff specially shows up acquisition, saves manpower.
Detailed description of the invention
Fig. 1 is the information processing method flow chart for the address base that first embodiment provides according to the present invention;
Fig. 2 is the Address Standardization processing method flow chart in first embodiment according to the present invention;
Fig. 3 is the latitude and longitude standard processing method flow chart in first embodiment according to the present invention;
Fig. 4 is the latitude and longitude standard processing method flow chart in second embodiment according to the present invention;
Fig. 5 is the information processing unit schematic diagram for the address base that third embodiment provides according to the present invention;
Fig. 6 is the electronic equipment schematic diagram that the 4th embodiment provides according to the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.With Under the division of each embodiment be for convenience, any restriction should not to be constituted to specific implementation of the invention, it is each Embodiment can be combined with each other mutual reference under the premise of reconcilable.
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.With Under the division of each embodiment be for convenience, any restriction should not to be constituted to specific implementation of the invention, it is each Embodiment can be combined with each other mutual reference under the premise of reconcilable.
The first embodiment of the present invention is related to a kind of information processing method of address base, present embodiment can be applied Terminal side is such as applied in mobile phone, the terminal devices such as tablet computer, can also be applied in the server of network side.
Fig. 1 is the information processing method flow chart for the address base that first embodiment provides according to the present invention, this method packet It includes:
Step S101, the address of each address information and longitude and latitude in address base are obtained.
Specifically, each address information stored in address base includes address and longitude and latitude, address is by multiple regions Ingredient is constituted, and regional part is for example including any number of combinations in province, city, area, street, Lou Hao, number.Each ground The regional part of the corresponding one group of determination in location, and corresponding determining latitude and longitude information.The longitude and latitude of each address information in address base It is recorded after getting acquisition ready by special communication staff arrival address geographic location when including creation address information in degree source Enter, further includes getting record ready after for example reaching dispatching place after address information creates when executing dispatching task by dispatching personnel Historical data, the latitude and longitude information that dispatching personnel get ready usually has multiple, such as multiple dispatching personnel are same in different time It is got ready after one user's dispatching or a dispatching personnel is repeatedly to get ready after the same user dispenses, can all generate multiple beat The latitude and longitude information of point.
Step S102, address is standardized, obtains the normal address of each address information.As shown in Fig. 2, step S102 Include:
Step S1021, the regional part of address is obtained;
Address is made of regional part, available to arrive corresponding each region ingredient for each address.For example, Certain address is " Shuangqing Road, Haidian District, Beijing City 30 ", then extracts each region ingredient are as follows: Beijing, Haidian, Shuan Qinglu, 30 Number.
Step S1022, the identical multiple addresses of regional part are clustered, obtains normal address;
Different people may be different to the address text of same address statement, therefore occur as soon as a variety of statements to same address. In this way, there have been a plurality of address informations for same address in address base, data redundancy is generated.By by the number of redundancy According to being clustered and carry out calibration, so that it may greatly reduce data redundancy, can not only discharge the memory space of address base, The engine search efficiency of address base can be improved.
In present embodiment, the region of address is obtained specifically, searching in address base to the method for multiple addresses cluster The identical a plurality of address information of ingredient, corresponding multiple addresses are standardized, and merge into an address to get study plot is arrived Location, then the address in a plurality of address information, can be unified for a normal address.
For example, to the address in BeiJing ZhongGuanCun square shopping center, there are several types of statements:
1, ZhongGuancun Street, BeiJing City 15
2, Beijing-Haidian-Zhongguancun Street -15
3, Zongguancun Street, Haidian District, Beijing City (No. 15)
1st kind is stated, extracting regional part includes: Beijing, Zhongguancun Street, No. 15, according to the ground of the address Position is managed, is located at Haidian District, therefore completion, i.e. regional part are carried out to regional part further include: Haidian;For the 2nd kind of table It states, extracting regional part includes: Beijing, Haidian, Zhongguancun Street, No. 15;3rd kind is stated, regional part is extracted It include: Beijing, Haidian, Zhongguancun Street, No. 15.Therefore above 3 kinds of address statements include identical regional part, Ke Yijin Row cluster and standardization, specifically such as, the address after standardization are Zongguancun Street, Haidian District, Beijing City 15.
Step S103, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains study plot The standard longitude and latitude of location.
The each address information stored in address base, when corresponding latitude and longitude information can be creation address information, by Special communication staff reaches position according to address and gets typing after acquisition ready, after being also possible to address information creation Typing after getting acquisition after dispensing place ready is reached when executing dispatching task by dispatching personnel.In the actual conditions for taking out dispatching In, there may be certain difference or different dispatching personnel for the position that the physical location and communication staff of user is got ready The longitude and latitude of typing can also have differences or the longitude and latitude of same dispatching personnel not homogeneous typing can also have differences.And it is right Answer an address actually only one accurate longitude and latitude.Therefore, it is got pair according to the historical data that dispatching personnel get ready The multiple latitude and longitude informations for answering the same normal address carry out cluster integration, the available mark to multiple latitude and longitude information The corresponding standard longitude and latitude in quasi- address.The specific implementation of step S103 is as shown in figure 3, specifically include:
Step S1031, according to the multiple longitudes and latitudes for the address information for having identical standard address, it is big that cluster obtains dot density In the several points cluster of dot density threshold value.
In present embodiment, a dot density threshold value, for example, the first dot density threshold value are set, with will be provided with identical standard Dot density is greater than the first dot density threshold value according to calculating dot density separated by a distance by multiple longitudes and latitudes of the address information of location Multiple latitude and longitude coordinates points cluster into same cluster, so cluster obtain several points cluster.
In one example, such as according to the history of dispatching personnel 100 of the address that data are obtained about user A are got ready Latitude and longitude coordinates point sets the first dot density threshold value as 1/100m2, 100 longitudes and latitudes are sat according to the first dot density threshold value Mark is clustered, and 5 clusters, for example, a, b, c, d, e are obtained.
Step S1032, the point cluster for meeting preset condition is chosen in several points cluster as target point cluster;
In present embodiment, from several points cluster when selection target point cluster, setting a preset condition is, for example, target The longitude and latitude that point cluster includes counts (i.e. the number of latitude and longitude coordinates point) at most, and the longitude and latitude points for including account for multiple longitudes and latitudes The percentage always counted be greater than preset percentage.
In the same example, preset percentage is, for example, 30%.When selection target point cluster, several points cluster is found first In include the longitude and latitude point cluster of counting most, if longitude and latitude points simultaneously in the cluster account for always counting for multiple longitudes and latitudes Percentage is greater than preset percentage, it is determined that the cluster is target point cluster.It is poly- according to 100 latitude and longitude coordinates points to user A 5 clusters that class obtains obtain the number of latitude and longitude coordinates point in each cluster, calculate latitude and longitude coordinates point in each cluster Number accounts for the longitude and latitude in the percentage of latitude and longitude coordinates point total quantity, such as point cluster a, point cluster b, point cluster c, point cluster d, point cluster e Coordinate points number is respectively 40,20,25,5,10, and corresponding percentage is respectively 40%, 20%, 25%, 5%, 10%.Point cluster a It for maximum point cluster, percentage highest, and has been more than preset percentage 30%.Accordingly, it is determined that point cluster a is target point cluster.
Step S1033, standard longitude and latitude is calculated according to target point cluster.
In present embodiment, according to the coordinate value of longitude and latitude each in target point cluster, standard longitude and latitude is calculated.Specifically, meter Calculation obtains longitude average value and latitude average value, constitutes standard longitude and latitude by longitude average value and latitude average value.
In the same example, for example, target point cluster is point cluster a, includes 40 longitude and latitude points, obtain 40 longitudes and latitudes The longitude of point, is calculated longitude average value, and obtain the latitude value of 40 longitude and latitude points, and it is average that latitude is calculated Value, further obtains standard longitude and latitude.
Step S104, the corresponding normal address information of each address information is stored in address base, normal address information includes Normal address and standard longitude and latitude.
In present embodiment, the standard longitude and latitude that the obtained normal address step S102 and step S103 are obtained, addition Into normal address information, and by normal address information storage into address base.Meanwhile not by original storage in address base Normalised address information is deleted, to remove redundant data.
As above, after completing standardization to the address information in address base, when staff searches on the electronic map It is obtaining the result is that an address information when one destination address, and there is the positioning of accurate longitude and latitude.
The information processing method of the address base of present embodiment will be more by each address information stored in address base It is a normal address that a address knows method for distinguishing to carry out cluster post-processing using regional part, reduces the number of addresses of redundancy According to;The multiple latitude and longitude values cluster post-processing got ready according further to history is a standard longitude and latitude, improves address information Positional accuracy.
Second embodiment of the present invention is related to a kind of information processing method of address base, this method comprises:
Step S101, the address of each address information and longitude and latitude in address base are obtained.
Step S102, address is standardized, obtains the normal address of each address information.
Step S103, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains study plot The standard longitude and latitude of location.
The each address information stored in address base, when corresponding latitude and longitude information can be creation address information, by Special communication staff reaches position according to address and gets typing after acquisition ready, after being also possible to address information creation Typing after getting acquisition after dispensing place ready is reached when executing dispatching task by dispatching personnel.In the actual conditions for taking out dispatching In, there may be certain difference or different dispatching personnel for the position that the physical location and communication staff of user is got ready The longitude and latitude of typing can also have differences or the longitude and latitude of same dispatching personnel not homogeneous typing can also have differences.And it is right Answer an address actually only one accurate longitude and latitude.Therefore, it is got pair according to the historical data that dispatching personnel get ready The multiple latitude and longitude informations for answering the same normal address carry out cluster integration, the available mark to multiple latitude and longitude information The corresponding standard longitude and latitude in quasi- address.In the present embodiment, the specific implementation of step S103 is as shown in figure 4, specifically include:
Step S1031, according to the multiple longitudes and latitudes for the address information for having identical standard address, it is big that cluster obtains dot density In the several points cluster of dot density threshold value.
In present embodiment, a dot density threshold value, for example, the first dot density threshold value are set, with will be provided with identical standard Dot density is greater than the first dot density threshold value according to calculating dot density separated by a distance by multiple longitudes and latitudes of the address information of location Multiple latitude and longitude coordinates points cluster into same cluster, so cluster obtain several points cluster.
Step S1032, judge in several points cluster, the longitude and latitude points in maximum point cluster account for total points of multiple longitudes and latitudes Percentage whether be greater than or equal to the preset percentage.
In one example, a preset percentage is set, for example, 30%.Further, it is looked for from several points cluster To maximum point cluster, i.e., the point cluster for counting most comprising longitude and latitude, judge the longitude and latitude points in the maximum point cluster account for have it is identical Whether the percentage of multiple longitudes and latitudes of the address information of normal address always counted is greater than preset percentage.Herein, it puts in cluster The longitude and latitude points for including refer to the quantity for the latitude and longitude coordinates point for including in the cluster.
If the determination result is YES, then it represents cluster to complete, directly execution step S1033.
If judging result be it is no, need to cluster again, that is, return to step S1031.In step S1031, change The size of preset dot density threshold value, for example, by the first dot density threshold modifying be second point density threshold, had according to described Multiple longitudes and latitudes of the address information of identical standard address, if cluster obtains dot density more than or equal to second point density threshold Dry cluster, wherein second point density threshold is less than the first dot density threshold value.
As described above, by the size for changing preset dot density threshold value, by cluster process at least once, by multiple warps Latitude is clustered into several points cluster.
In one example, when such as preset percentage is, for example, 30%, 50 longitudes and latitudes of the address about user B are sat Punctuate sets the first dot density threshold value as 1/100m2, 50 latitude and longitude coordinates are gathered according to the first dot density threshold value Class obtains 4 clusters, for example, a1, b1, c1, d1.Wherein maximum point cluster is point cluster b1, includes 14 latitude and longitude coordinates points, accounts for The percentage of latitude and longitude coordinates point total quantity is 28%, is less than preset percentage 30%.Then modifying point density threshold is second point Density threshold 0.8/100m2, clustered again, cluster obtains 4 clusters, for example, a2, b2, c2, d2, obtains maximum point cluster It include 18 latitude and longitude coordinates points for a cluster b2, the percentage for accounting for latitude and longitude coordinates point total quantity is 36%, is greater than default percentage Than 30%, cluster is completed.
Step S1033, the point cluster for meeting preset condition is chosen in several points cluster as target point cluster;
In present embodiment, from several points cluster when selection target point cluster, setting a preset condition is, for example, target The longitude and latitude points that point cluster includes are most, and the longitude and latitude points for including account for the percentage of multiple longitudes and latitudes always counted greater than pre- If percentage.
In one example, preset percentage is, for example, 30%.When selection target point cluster, found in several points cluster first The point cluster that the longitude and latitude for including is counted most, if the longitude and latitude points simultaneously in the cluster account for hundred always to count of multiple longitudes and latitudes Divide than being greater than preset percentage.
Step S1034, standard longitude and latitude is calculated according to target point cluster.
In present embodiment, according to the coordinate value of longitude and latitude each in target point cluster, standard longitude and latitude is calculated.Specifically, meter Calculation obtains longitude average value and latitude average value, constitutes standard longitude and latitude by longitude average value and latitude average value.
In the same example, for example, target point cluster is point cluster b2, includes 18 latitude and longitude coordinates points, obtain this 18 The longitude of longitude and latitude point, is calculated longitude average value, and obtains the latitude value of 18 longitude and latitude points, and latitude is calculated Average value further obtains standard longitude and latitude.
Step S104, the corresponding normal address information of each address information is stored in address base, normal address information includes Normal address and standard longitude and latitude.
In present embodiment, the standard longitude and latitude that the obtained normal address step S102 and step S103 are obtained, addition Into normal address information, and by normal address information storage into address base.Meanwhile not by original storage in address base Normalised address information is deleted, to remove redundant data.
As above, after completing standardization to the address information in address base, when staff searches on the electronic map It is obtaining the result is that an address information when one destination address, and there is the positioning of accurate longitude and latitude.
The information processing method of the address base of present embodiment will be more by each address information stored in address base It is a normal address that a address knows method for distinguishing to carry out cluster post-processing using regional part, reduces the number of addresses of redundancy According to;It is a standard warp additionally by preset density threshold is changed according to multiple latitude and longitude values cluster post-processing that history is got ready Latitude improves the positional accuracy of address information.
Third embodiment of the present invention is related to a kind of information processing unit of address base, and Fig. 5 is third according to the present invention The information processing unit schematic diagram for the address base that embodiment provides, the device 500 include:
Address information obtains module 501, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module 502 obtains the normal address of each address information for being standardized to address.
Longitude and latitude clustering processing module 503 carries out the poly- of longitude and latitude for will be provided with the address information of identical standard address Class integration, obtains the standard longitude and latitude of normal address.
Normal address information storage module 504, for storing normal address information, normal address packet in address base Include normal address and standard longitude and latitude.
In one example, address clustering processing module 502 obtains the regional part of address;Regional part is identical more A address is clustered, and normal address is obtained.
In one example, longitude and latitude clustering processing module 503 is according to the more of the address information for having identical standard address A longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value;It is selected in several points cluster Take the point cluster for meeting preset condition as target point cluster;Standard longitude and latitude is calculated according to target point cluster.Wherein, preset condition are as follows: The longitude and latitude points for including are most, and the longitude and latitude points for including account for the percentage of multiple longitudes and latitudes always counted greater than default hundred Divide ratio.
In one example, in multiple longitudes and latitudes according to the address information for having identical standard address, cluster is obtained a little Density is greater than or equal to after the several points cluster of the first dot density threshold value, chooses in several points cluster and meets preset condition Before point cluster is as target point cluster, longitude and latitude clustering processing module 503 is also used to judge in several points cluster, in maximum point cluster Whether the percentage always counted that longitude and latitude points account for multiple longitudes and latitudes is greater than or equal to preset percentage;If it is not, then according to tool Multiple longitudes and latitudes of the address information of standby identical standard address, cluster obtain dot density more than or equal to second point density threshold Several points cluster;Second point density threshold is less than the first dot density threshold value.
In one example, it is flat to calculate longitude according to each longitude and latitude in target point cluster for longitude and latitude clustering processing module 503 Mean value and latitude average value constitute standard longitude and latitude by longitude average value and latitude average value.
Four embodiment of the invention is related to a kind of electronic equipment, and Fig. 6 is the electronic equipment provided according to the present embodiment Schematic diagram, the electronic equipment include: at least one processor 601;And it is deposited with what at least one processor 601 communicated to connect Reservoir 602;And respectively with processor 601 and memory 602 be communication connection communication component 603, communication component 603 Data are sended and received under the control of processor 601;Wherein, memory 602, which is stored with, to be held by least one processor 601 Capable instruction, instruction are executed by least one processor 601 to realize:
Obtain the address of each address information and longitude and latitude in address base;
Address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard warp of normal address Latitude;
Store the corresponding normal address information of each address information in address base, normal address information include normal address and Standard longitude and latitude.
The electronic equipment includes: one or more processors 601 and memory 602, with a processor 601 in Fig. 6 For.Processor 601, memory 602 can be connected by bus or other modes, in Fig. 6 for being connected by bus. Memory 602 is used as a kind of non-volatile computer readable storage medium storing program for executing, can be used for storing non-volatile software program, non-volatile Property computer executable program and module.Non-volatile software journey of the processor 601 by operation storage in the memory 602 Sequence, instruction and module realize the information in address above mentioned library thereby executing the various function application and data processing of equipment Processing method.
Memory 602 may include storing program area and storage data area, wherein storing program area can store operation system Application program required for system, at least one function;Storage data area can store normal address, standard longitude and latitude, history are got ready Longitude and latitude data etc..In addition, memory 602 may include high-speed random access memory, it can also include non-volatile deposit Reservoir, for example, at least a disk memory, flush memory device or other non-volatile solid state memory parts.In some implementations In mode, optional memory 602 includes the memory remotely located relative to processor 601, these remote memories can lead to Network connection is crossed to external equipment.The example of above-mentioned network includes but is not limited to internet, intranet, local area network, movement Communication network and combinations thereof.
One or more module stores in the memory 602, when being executed by one or more processor 601, holds The information processing method of address base in the above-mentioned any means embodiment of row.
The said goods can be performed the application embodiment provided by method, have the corresponding functional module of execution method and Beneficial effect, the not technical detail of detailed description in the present embodiment, reference can be made to method provided by the application embodiment.
5th embodiment of the invention is related to a kind of non-volatile memory medium, for storing computer-readable program, Computer-readable program is used to execute above-mentioned all or part of embodiment of the method for computer.
That is, it will be understood by those skilled in the art that implement the method for the above embodiments be can be with Relevant hardware is instructed to complete by program, which is stored in a storage medium, including some instructions are to make It obtains an equipment (can be single-chip microcontroller, chip etc.) or processor (processor) executes each embodiment method of the application All or part of the steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey The medium of sequence code.
It will be understood by those skilled in the art that the respective embodiments described above are to realize specific embodiments of the present invention, And in practical applications, can to it, various changes can be made in the form and details, without departing from the spirit and scope of the present invention.
The embodiment of the present application discloses a kind of information processing method of address base of A1., comprising:
Obtain the address of each address information and longitude and latitude in address base;
The address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the mark of the normal address Quasi- longitude and latitude;
The corresponding normal address information of each address information, the normal address packet are stored in the address base Include the normal address and the standard longitude and latitude.
The information processing method of A2, address base as described in a1, it is described that the address is standardized, obtain each address The normal address of information, comprising:
Obtain the regional part of the address;
The identical multiple addresses of the regional part are clustered, normal address is obtained.
The information processing method of A3, as described in A2 address base, the regional part include province, city, area, street, Lou Hao, Any number of combinations in number.
The information processing method of A4, address base as described in a1, the address information that will be provided with identical standard address into Pass through latitude cluster integration, obtain the standard longitude and latitude of the normal address, comprising:
According to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density and is greater than or equal to the The several points cluster of some density thresholds;
The point cluster for meeting preset condition is chosen in the several points cluster as target point cluster;
Standard longitude and latitude is calculated according to the target point cluster.
The information processing method of A5, address base as described in A4, the preset condition are as follows:
The longitude and latitude points for including are most, and the longitude and latitude points for including account for the percentage of the multiple longitude and latitude always counted Than being greater than preset percentage.
The information processing method of A6, address base as described in a5, the basis have the address information of identical standard address Multiple longitudes and latitudes, cluster obtain dot density more than or equal to the first dot density threshold value several points cluster after, if described Before choosing the point cluster for meeting preset condition as target point cluster in dry cluster, further includes:
Judge that the longitude and latitude points in the several points cluster, in maximum point cluster account for always counting for the multiple longitude and latitude Whether percentage is greater than or equal to the preset percentage;
If it is not, then having multiple longitudes and latitudes of the address information of identical standard address according to, cluster obtains dot density More than or equal to the several points cluster of second point density threshold;The second point density threshold is less than the first dot density threshold Value.
The information processing method of the described in any item address bases of A7, such as A4-A6, described calculated according to the target point cluster are marked Quasi- longitude and latitude, comprising:
According to each longitude and latitude in the target point cluster, longitude average value and latitude average value are calculated;
The longitude average value and the latitude average value constitute the standard longitude and latitude.
The information processing method of A8, address base as described in a1, the longitude and latitude are the warp for getting position ready for dispensing transport power Latitude.
The embodiment of the present application discloses a kind of information processing unit of address base of B1., comprising:
Address information obtains module, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module obtains the normal address of each address information for being standardized to the address;
Longitude and latitude clustering processing module carries out the cluster of longitude and latitude for will be provided with the address information of identical standard address Integration, obtains the standard longitude and latitude of the normal address;
Normal address information storage module, for storing normal address information, the normal address in the address base Information includes the normal address and the standard longitude and latitude.
The embodiment of the present application discloses C1. a kind of electronic equipment, comprising: at least one processor;And
The memory being connect at least one described processor communication;
Wherein, the memory is stored with the instruction that can be executed by least one described processor, and described instruction is described At least one processor is executed to realize: obtaining the address of each address information and longitude and latitude in address base;The address is carried out Standardization, obtains the normal address of each address information;The address information that will be provided with identical standard address carries out the cluster of longitude and latitude Integration, obtains the standard longitude and latitude of the normal address;The corresponding standard of each address information is stored in the address base Address information, the normal address information include the normal address and the standard longitude and latitude.
C2, the electronic equipment as described in C1, it is described that the address is standardized, obtain the study plot of each address information Location, comprising: obtain the regional part of the address;The identical multiple addresses of the regional part are clustered, are obtained Normal address.
C3, the electronic equipment as described in C2, the regional part include province, city, area, street, Lou Hao, appointing in number It anticipates multiple combinations.
C4, the electronic equipment as described in C1, the address information that will be provided with identical standard address carry out the poly- of longitude and latitude Class integration, obtains the standard longitude and latitude of the normal address, comprising: according to the multiple of the address information for having identical standard address Longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value;In the several points cluster The point cluster for meeting preset condition is chosen as target point cluster;Standard longitude and latitude is calculated according to the target point cluster.
C5, the electronic equipment as described in C4, the preset condition are as follows: the longitude and latitude points for including are most, and the warp for including The percentage always counted that latitude points account for the multiple longitude and latitude is greater than preset percentage.
C6, the electronic equipment as described in C5, the basis have multiple longitudes and latitudes of the address information of identical standard address, After cluster obtains dot density more than or equal to the several points cluster of the first dot density threshold value, chosen in the several points cluster Before meeting the point cluster of preset condition as target point cluster, further includes: judge the warp in the several points cluster, in maximum point cluster Whether the percentage always counted that latitude points account for the multiple longitude and latitude is greater than or equal to the preset percentage;If it is not, then According to multiple longitudes and latitudes of the address information for having identical standard address, cluster obtains dot density more than or equal to second point The several points cluster of density threshold;The second point density threshold is less than the first dot density threshold value.
The described in any item electronic equipments of C7, such as C4-C6, it is described that standard longitude and latitude, packet are calculated according to the target point cluster It includes: according to each longitude and latitude in the target point cluster, calculating longitude average value and latitude average value;The longitude average value and institute It states latitude average value and constitutes the standard longitude and latitude.
C8, the electronic equipment as described in C1, the longitude and latitude are the longitude and latitude for getting position ready for dispensing transport power.
The embodiment of the present application discloses a kind of non-volatile memory medium of D1., described for storing computer-readable program Computer-readable program is used to execute the information processing method of the address base as described in any one of A1 to A8 for computer.

Claims (10)

1. a kind of information processing method of address base characterized by comprising
Obtain the address of each address information and longitude and latitude in address base;
The address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard warp of the normal address Latitude;
The corresponding normal address information of each address information is stored in the address base, the normal address information includes institute State normal address and the standard longitude and latitude.
2. the information processing method of address base according to claim 1, which is characterized in that described to be marked to the address Standardization obtains the normal address of each address information, comprising:
Obtain the regional part of the address;
The identical multiple addresses of the regional part are clustered, normal address is obtained.
3. the information processing method of address base according to claim 2, which is characterized in that the regional part include save, City, area, street, Lou Hao, any number of combinations in number.
4. the information processing method of address base according to claim 1, which is characterized in that it is described with will be provided with identical standard The address information of location carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of the normal address, comprising:
According to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density more than or equal to first point The several points cluster of density threshold;
The point cluster for meeting preset condition is chosen in the several points cluster as target point cluster;
Standard longitude and latitude is calculated according to the target point cluster.
5. the information processing method of address base according to claim 4, which is characterized in that the preset condition are as follows:
Include longitude and latitude points at most, and include longitude and latitude points account for the multiple longitude and latitude the percentage always counted it is big In preset percentage.
6. the information processing method of address base according to claim 5, which is characterized in that the basis has identical standard Multiple longitudes and latitudes of the address information of address, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value Later, before the point cluster for meeting preset condition is chosen in the several points cluster as target point cluster, further includes:
Judge that the longitude and latitude points in the several points cluster, in maximum point cluster account for the percentage of the multiple longitude and latitude always counted Than whether being greater than or equal to the preset percentage;
If it is not, then having multiple longitudes and latitudes of the address information of identical standard address according to, cluster obtains dot density and is greater than Or the several points cluster equal to second point density threshold;The second point density threshold is less than the first dot density threshold value.
7. the information processing method of the address base according to any one of claim 4 to 6, which is characterized in that the basis The target point cluster calculates standard longitude and latitude, comprising:
According to each longitude and latitude in the target point cluster, longitude average value and latitude average value are calculated;
The longitude average value and the latitude average value constitute the standard longitude and latitude.
8. a kind of information processing unit of address base characterized by comprising
Address information obtains module, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module obtains the normal address of each address information for being standardized to the address;
Longitude and latitude clustering processing module, for will be provided with the address information of identical standard address, the cluster for carrying out longitude and latitude is integrated, Obtain the standard longitude and latitude of the normal address;
Normal address information storage module, for storing normal address information, the normal address information in the address base Including the normal address and the standard longitude and latitude.
9. a kind of electronic equipment characterized by comprising at least one processor;And
The memory being connect at least one described processor communication;
Wherein, the memory be stored with can by least one described processor execute instruction, described instruction by it is described at least One processor is executed to realize:
Obtain the address of each address information and longitude and latitude in address base;The address is standardized, each address information is obtained Normal address;It will be provided with the address information of identical standard address, carry out the cluster integration of longitude and latitude, obtain the normal address Standard longitude and latitude;Normal address information is stored in the address base, the normal address information includes the normal address With the standard longitude and latitude.
10. a kind of non-volatile memory medium, for storing computer-readable program, which is characterized in that described computer-readable Program is used to execute the information processing method of the address base as described in any one of claims 1 to 7 for computer.
CN201811488202.XA 2018-12-06 2018-12-06 Information processing method and device for address library, electronic equipment and storage medium Pending CN109635063A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811488202.XA CN109635063A (en) 2018-12-06 2018-12-06 Information processing method and device for address library, electronic equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811488202.XA CN109635063A (en) 2018-12-06 2018-12-06 Information processing method and device for address library, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
CN109635063A true CN109635063A (en) 2019-04-16

Family

ID=66071742

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811488202.XA Pending CN109635063A (en) 2018-12-06 2018-12-06 Information processing method and device for address library, electronic equipment and storage medium

Country Status (1)

Country Link
CN (1) CN109635063A (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110597943A (en) * 2019-09-16 2019-12-20 腾讯科技(深圳)有限公司 Interest point processing method and device based on artificial intelligence and electronic equipment
CN112016326A (en) * 2020-09-25 2020-12-01 北京百度网讯科技有限公司 Map area word recognition method and device, electronic equipment and storage medium
CN112487122A (en) * 2020-12-02 2021-03-12 电信科学技术第十研究所有限公司 Address normalization processing method and device
CN112801189A (en) * 2021-01-29 2021-05-14 上海寻梦信息技术有限公司 Method and device for detecting longitude and latitude abnormity, electronic equipment and storage medium
CN113537808A (en) * 2021-07-27 2021-10-22 石家庄开发区天远科技有限公司 Engineering machinery accessory library site selection method based on space-time big data

Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103123628A (en) * 2011-11-21 2013-05-29 腾讯科技(深圳)有限公司 Searching method and system for geographical location
CN103996293A (en) * 2014-06-09 2014-08-20 重庆大学 Real-time traffic status collecting and inquiring system and method based on rider collaboration
CN104050196A (en) * 2013-03-15 2014-09-17 阿里巴巴集团控股有限公司 Point of interest (POI) data redundancy detection method and device
CN104572955A (en) * 2014-12-29 2015-04-29 北京奇虎科技有限公司 System and method for determining POI name based on clustering
CN104935676A (en) * 2014-03-17 2015-09-23 阿里巴巴集团控股有限公司 Method and device for determining IP address fields and corresponding latitude and longitude
CN105808715A (en) * 2016-03-07 2016-07-27 武汉大学 Method for establishing map per location
WO2016127904A1 (en) * 2015-02-13 2016-08-18 阿里巴巴集团控股有限公司 Text address processing method and apparatus
CN106934015A (en) * 2017-03-10 2017-07-07 北京京东尚科信息技术有限公司 Address date treating method and apparatus
CN107133269A (en) * 2017-04-01 2017-09-05 中国人民解放军国防科学技术大学 Frequent location track generation method and device based on mobile target
CN107622061A (en) * 2016-07-13 2018-01-23 阿里巴巴集团控股有限公司 A kind of method, apparatus and system for determining address uniqueness
CN108763538A (en) * 2018-05-31 2018-11-06 北京嘀嘀无限科技发展有限公司 A kind of method and device in the geographical locations determining point of interest POI

Patent Citations (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103123628A (en) * 2011-11-21 2013-05-29 腾讯科技(深圳)有限公司 Searching method and system for geographical location
CN104050196A (en) * 2013-03-15 2014-09-17 阿里巴巴集团控股有限公司 Point of interest (POI) data redundancy detection method and device
CN104935676A (en) * 2014-03-17 2015-09-23 阿里巴巴集团控股有限公司 Method and device for determining IP address fields and corresponding latitude and longitude
CN103996293A (en) * 2014-06-09 2014-08-20 重庆大学 Real-time traffic status collecting and inquiring system and method based on rider collaboration
CN104572955A (en) * 2014-12-29 2015-04-29 北京奇虎科技有限公司 System and method for determining POI name based on clustering
WO2016127904A1 (en) * 2015-02-13 2016-08-18 阿里巴巴集团控股有限公司 Text address processing method and apparatus
CN105808715A (en) * 2016-03-07 2016-07-27 武汉大学 Method for establishing map per location
CN107622061A (en) * 2016-07-13 2018-01-23 阿里巴巴集团控股有限公司 A kind of method, apparatus and system for determining address uniqueness
CN106934015A (en) * 2017-03-10 2017-07-07 北京京东尚科信息技术有限公司 Address date treating method and apparatus
CN107133269A (en) * 2017-04-01 2017-09-05 中国人民解放军国防科学技术大学 Frequent location track generation method and device based on mobile target
CN108763538A (en) * 2018-05-31 2018-11-06 北京嘀嘀无限科技发展有限公司 A kind of method and device in the geographical locations determining point of interest POI

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
陈睿嘉 等: "基于网络爬虫的导航深度服务信息自动采集", 《测绘工程》 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110597943A (en) * 2019-09-16 2019-12-20 腾讯科技(深圳)有限公司 Interest point processing method and device based on artificial intelligence and electronic equipment
CN110597943B (en) * 2019-09-16 2022-04-01 腾讯科技(深圳)有限公司 Interest point processing method and device based on artificial intelligence and electronic equipment
CN112016326A (en) * 2020-09-25 2020-12-01 北京百度网讯科技有限公司 Map area word recognition method and device, electronic equipment and storage medium
CN112487122A (en) * 2020-12-02 2021-03-12 电信科学技术第十研究所有限公司 Address normalization processing method and device
CN112487122B (en) * 2020-12-02 2024-05-17 电信科学技术第十研究所有限公司 Address normalization processing method and device
CN112801189A (en) * 2021-01-29 2021-05-14 上海寻梦信息技术有限公司 Method and device for detecting longitude and latitude abnormity, electronic equipment and storage medium
CN113537808A (en) * 2021-07-27 2021-10-22 石家庄开发区天远科技有限公司 Engineering machinery accessory library site selection method based on space-time big data

Similar Documents

Publication Publication Date Title
CN109635063A (en) Information processing method and device for address library, electronic equipment and storage medium
CN110175216B (en) Coordinate error correction method and device and computer equipment
CN110020221B (en) Job distribution confirmation method, apparatus, server and computer readable storage medium
JP6689515B2 (en) Method and apparatus for identifying the type of user geographic location
CN109919437B (en) big data-based intelligent tourism target matching method and system
CN108122012B (en) Method, device and equipment for determining center point of stationary point and storage medium
CN109657163A (en) Destination address determining method and device, electronic equipment and storage medium
CN107124695A (en) The method and system of accessible location is marked based on associated person information
CN106210163B (en) IP address-based localization method and device
CN111639092B (en) Personnel flow analysis method and device, electronic equipment and storage medium
CN110046174B (en) population migration analysis method and system based on big data
CN108256718A (en) Declaration form service role distribution method, device, computer equipment and storage device
KR20140097805A (en) Coordinates (x, y) position value using a systematic block code generated and the address matching service using methods
CN107247791B (en) Parking lot map data generation method and device and machine-readable storage medium
CN107038620A (en) Based on user call a taxi preference information push and device
CN103617254A (en) Method, system and device for constructing geographic position coordinate information base
CN103198071B (en) Datagram table generating method and device thereof
CN114357097A (en) Map annotation construction method and device, computer equipment and storage medium
CN109857822A (en) Meta-model conversion method and management system based on chart database
CN105930313A (en) Method and device for processing notification message
CN111177589A (en) Address information query method, device, equipment and storage medium
CN106469205A (en) A kind of method and apparatus of the geographical location information determining user
Khoussainova et al. Probabilistic rfid data management
CN102184226B (en) Method for constructing real-time database and data searching method
CN109815278A (en) A kind of method for exhibiting data and its equipment, storage medium, electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20190416