CN109635063A - Information processing method and device for address library, electronic equipment and storage medium - Google Patents
Information processing method and device for address library, electronic equipment and storage medium Download PDFInfo
- Publication number
- CN109635063A CN109635063A CN201811488202.XA CN201811488202A CN109635063A CN 109635063 A CN109635063 A CN 109635063A CN 201811488202 A CN201811488202 A CN 201811488202A CN 109635063 A CN109635063 A CN 109635063A
- Authority
- CN
- China
- Prior art keywords
- address
- latitude
- longitude
- cluster
- standard
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 230000010365 information processing Effects 0.000 title claims abstract description 39
- 238000003672 processing method Methods 0.000 title claims abstract description 35
- 230000015654 memory Effects 0.000 claims description 27
- 230000010354 integration Effects 0.000 claims description 18
- 238000004891 communication Methods 0.000 claims description 13
- 238000012545 processing Methods 0.000 claims description 13
- 238000000034 method Methods 0.000 description 14
- 238000012805 post-processing Methods 0.000 description 6
- 238000010586 diagram Methods 0.000 description 4
- 239000004615 ingredient Substances 0.000 description 4
- 230000008859 change Effects 0.000 description 2
- 230000006870 function Effects 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 239000000284 extract Substances 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000007787 solid Substances 0.000 description 1
Landscapes
- Mobile Radio Communication Systems (AREA)
Abstract
The embodiment of the invention relates to the technical field of information processing, and discloses an information processing method of an address library, which comprises the following steps: acquiring addresses and longitude and latitude of each address information in an address base; standardizing the address to obtain a standard address of each address information; clustering and integrating the longitude and latitude of the address information with the same standard address to obtain the standard longitude and latitude of the standard address; and storing standard address information corresponding to each address information in an address library, wherein the standard address information comprises a standard address and standard longitude and latitude. According to the invention, by acquiring each address information stored in the address base, a plurality of addresses are clustered and then processed into a standard address, so that redundant address data are reduced; in addition, clustering is carried out according to a plurality of longitude and latitude values, and then the clustering is processed into a standard longitude and latitude, so that the positioning accuracy of the address information is improved.
Description
Technical field
The present invention relates to technical field of information processing more particularly to a kind of information processing methods of address base, device, electronics
Equipment and storage medium.
Background technique
With the development of internet, various electronic database of information are by more and more extensive application, in the take-away of catering field
Industry, user search for businessman by electronic map and complete lower single operation.
Each POI (point of interest, Point of Interest) data in address base that electronic map uses include ground
Location and corresponding longitude and latitude, longitude and latitude are usually to carry means of communication arrival specific location by staff to go to get ready, will be acquired
To latitude and longitude information store into address base.Due in actual application, physical location locating for user and work people
The position got ready before member may have bigger difference, will result in bigger position error.In addition, to same position, it is different
The address full name that people names it may be different, therefore may store two POI numbers for the same position in address base
According to, corresponding two different address full name, resulted in address base in this way there are data redundancy, user using electronic map into
When row is searched, redundant data can be also shown.
Summary of the invention
A kind of information processing method for being designed to provide address base of embodiment of the present invention, device, electronic equipment and
Storage medium is standardized address information by the way of cluster, eliminates redundant data, and obtains more quasi-
True position.
In order to solve the above technical problems, embodiments of the present invention provide a kind of information processing method of address base, packet
It includes: obtaining the address of each address information and longitude and latitude in address base;Address is standardized, the standard of each address information is obtained
Address;The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of normal address
Degree;The corresponding normal address information of each address information is stored in address base, normal address information includes normal address and standard
Longitude and latitude.
Embodiments of the present invention additionally provide a kind of information processing unit of address base, comprising: address information obtains mould
Block, for obtaining the address of each address information and longitude and latitude in address base;Address clustering processing module, for being marked to address
Standardization obtains the normal address of each address information;Longitude and latitude clustering processing module, for will be provided with the address of identical standard address
Information carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of normal address;Normal address information storage module, is used for
Normal address information is stored in address base, normal address information includes normal address and standard longitude and latitude.
Embodiments of the present invention additionally provide a kind of electronic equipment, comprising: at least one processor;And at least one
The memory of a processor communication connection;Wherein, memory is stored with the instruction that can be executed by least one processor, instructs quilt
At least one processor is executed to realize: obtaining the address of each address information and longitude and latitude in address base;Standard is carried out to address
Change, obtains the normal address of each address information;The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude,
Obtain the standard longitude and latitude of normal address;The corresponding normal address information of each address information, normal address are stored in address base
Information includes normal address and standard longitude and latitude.
Embodiments of the present invention additionally provide a kind of non-volatile memory medium, for storing computer-readable program,
Computer-readable program is used to execute the information processing method of address base as above for computer.
In terms of existing technologies, the main distinction and its effect are embodiment of the present invention: by obtaining address base
Each address information of middle storage, it is a normal address that multiple addresses, which are carried out cluster post-processing, reduces the number of addresses of redundancy
According to;It is a standard longitude and latitude according further to the cluster post-processing of multiple latitude and longitude values, improves the positional accuracy of address information.
In addition, being standardized to address, obtain the normal address of each address information, comprising: obtain the region of address at
Point;The identical multiple addresses of regional part are clustered, normal address is obtained.Multiple addresses are distinguished according to regional part, really
The multiple addresses for protecting cluster are the same address.
In addition, regional part includes province, city, area, street, Lou Hao, any number of combinations in number.Using multiple
Regional part is identified, further ensures that multiple addresses of cluster are the same address.
In addition, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, normal address is obtained
Standard longitude and latitude, comprising: according to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density and is greater than
Or the several points cluster equal to the first dot density threshold value;The point cluster for meeting preset condition is chosen in several points cluster as target
Point cluster;Standard longitude and latitude is calculated according to target point cluster.By the method for cluster, the primary focal zone domain of longitude and latitude point can be found,
The standard longitude and latitude made is more accurate.
In addition, preset condition are as follows: the longitude and latitude points for including are most, and the longitude and latitude points for including account for multiple longitudes and latitudes
The percentage always counted is greater than preset percentage.By the way that preset percentage is arranged, make the longitude and latitude for participating in calculating standard longitude and latitude
It counts enough, further such that the standard longitude and latitude arrived is more accurate.
In addition, according to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtain dot density be greater than or
After several points cluster equal to the first dot density threshold value, the point cluster for meeting preset condition is chosen in several points cluster as mesh
Before punctuate cluster, further includes: judge that the longitude and latitude points in several points cluster, in maximum point cluster account for total points of multiple longitudes and latitudes
Percentage whether be greater than or equal to preset percentage;If it is not, then according to the multiple of the address information for having identical standard address
Longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to second point density threshold;Second point density threshold is less than
First dot density threshold value.By repeatedly clustering, the longitude and latitude points that guarantee participates in calculating standard longitude and latitude reach preset quantity, into
The standard longitude and latitude that one step ensure that is more accurate.
In addition, calculating standard longitude and latitude according to target point cluster, comprising: according to each longitude and latitude in target point cluster, calculate warp
Spend average value and latitude average value;Longitude average value and latitude average value constitute standard longitude and latitude.By calculating multiple longitudes and latitudes
Average value obtain standard longitude and latitude, improve the accuracy of standard longitude and latitude.
In addition, longitude and latitude is the longitude and latitude for getting position ready for dispensing transport power.Using the longitude and latitude historical data got ready, it is not necessarily to
Staff specially shows up acquisition, saves manpower.
Detailed description of the invention
Fig. 1 is the information processing method flow chart for the address base that first embodiment provides according to the present invention;
Fig. 2 is the Address Standardization processing method flow chart in first embodiment according to the present invention;
Fig. 3 is the latitude and longitude standard processing method flow chart in first embodiment according to the present invention;
Fig. 4 is the latitude and longitude standard processing method flow chart in second embodiment according to the present invention;
Fig. 5 is the information processing unit schematic diagram for the address base that third embodiment provides according to the present invention;
Fig. 6 is the electronic equipment schematic diagram that the 4th embodiment provides according to the present invention.
Specific embodiment
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention
Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention
In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details
And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.With
Under the division of each embodiment be for convenience, any restriction should not to be constituted to specific implementation of the invention, it is each
Embodiment can be combined with each other mutual reference under the premise of reconcilable.
In order to make the object, technical scheme and advantages of the embodiment of the invention clearer, below in conjunction with attached drawing to the present invention
Each embodiment be explained in detail.However, it will be understood by those skilled in the art that in each embodiment party of the present invention
In formula, in order to make the reader understand this application better, many technical details are proposed.But even if without these technical details
And various changes and modifications based on the following respective embodiments, the application technical solution claimed also may be implemented.With
Under the division of each embodiment be for convenience, any restriction should not to be constituted to specific implementation of the invention, it is each
Embodiment can be combined with each other mutual reference under the premise of reconcilable.
The first embodiment of the present invention is related to a kind of information processing method of address base, present embodiment can be applied
Terminal side is such as applied in mobile phone, the terminal devices such as tablet computer, can also be applied in the server of network side.
Fig. 1 is the information processing method flow chart for the address base that first embodiment provides according to the present invention, this method packet
It includes:
Step S101, the address of each address information and longitude and latitude in address base are obtained.
Specifically, each address information stored in address base includes address and longitude and latitude, address is by multiple regions
Ingredient is constituted, and regional part is for example including any number of combinations in province, city, area, street, Lou Hao, number.Each ground
The regional part of the corresponding one group of determination in location, and corresponding determining latitude and longitude information.The longitude and latitude of each address information in address base
It is recorded after getting acquisition ready by special communication staff arrival address geographic location when including creation address information in degree source
Enter, further includes getting record ready after for example reaching dispatching place after address information creates when executing dispatching task by dispatching personnel
Historical data, the latitude and longitude information that dispatching personnel get ready usually has multiple, such as multiple dispatching personnel are same in different time
It is got ready after one user's dispatching or a dispatching personnel is repeatedly to get ready after the same user dispenses, can all generate multiple beat
The latitude and longitude information of point.
Step S102, address is standardized, obtains the normal address of each address information.As shown in Fig. 2, step S102
Include:
Step S1021, the regional part of address is obtained;
Address is made of regional part, available to arrive corresponding each region ingredient for each address.For example,
Certain address is " Shuangqing Road, Haidian District, Beijing City 30 ", then extracts each region ingredient are as follows: Beijing, Haidian, Shuan Qinglu, 30
Number.
Step S1022, the identical multiple addresses of regional part are clustered, obtains normal address;
Different people may be different to the address text of same address statement, therefore occur as soon as a variety of statements to same address.
In this way, there have been a plurality of address informations for same address in address base, data redundancy is generated.By by the number of redundancy
According to being clustered and carry out calibration, so that it may greatly reduce data redundancy, can not only discharge the memory space of address base,
The engine search efficiency of address base can be improved.
In present embodiment, the region of address is obtained specifically, searching in address base to the method for multiple addresses cluster
The identical a plurality of address information of ingredient, corresponding multiple addresses are standardized, and merge into an address to get study plot is arrived
Location, then the address in a plurality of address information, can be unified for a normal address.
For example, to the address in BeiJing ZhongGuanCun square shopping center, there are several types of statements:
1, ZhongGuancun Street, BeiJing City 15
2, Beijing-Haidian-Zhongguancun Street -15
3, Zongguancun Street, Haidian District, Beijing City (No. 15)
1st kind is stated, extracting regional part includes: Beijing, Zhongguancun Street, No. 15, according to the ground of the address
Position is managed, is located at Haidian District, therefore completion, i.e. regional part are carried out to regional part further include: Haidian;For the 2nd kind of table
It states, extracting regional part includes: Beijing, Haidian, Zhongguancun Street, No. 15;3rd kind is stated, regional part is extracted
It include: Beijing, Haidian, Zhongguancun Street, No. 15.Therefore above 3 kinds of address statements include identical regional part, Ke Yijin
Row cluster and standardization, specifically such as, the address after standardization are Zongguancun Street, Haidian District, Beijing City 15.
Step S103, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains study plot
The standard longitude and latitude of location.
The each address information stored in address base, when corresponding latitude and longitude information can be creation address information, by
Special communication staff reaches position according to address and gets typing after acquisition ready, after being also possible to address information creation
Typing after getting acquisition after dispensing place ready is reached when executing dispatching task by dispatching personnel.In the actual conditions for taking out dispatching
In, there may be certain difference or different dispatching personnel for the position that the physical location and communication staff of user is got ready
The longitude and latitude of typing can also have differences or the longitude and latitude of same dispatching personnel not homogeneous typing can also have differences.And it is right
Answer an address actually only one accurate longitude and latitude.Therefore, it is got pair according to the historical data that dispatching personnel get ready
The multiple latitude and longitude informations for answering the same normal address carry out cluster integration, the available mark to multiple latitude and longitude information
The corresponding standard longitude and latitude in quasi- address.The specific implementation of step S103 is as shown in figure 3, specifically include:
Step S1031, according to the multiple longitudes and latitudes for the address information for having identical standard address, it is big that cluster obtains dot density
In the several points cluster of dot density threshold value.
In present embodiment, a dot density threshold value, for example, the first dot density threshold value are set, with will be provided with identical standard
Dot density is greater than the first dot density threshold value according to calculating dot density separated by a distance by multiple longitudes and latitudes of the address information of location
Multiple latitude and longitude coordinates points cluster into same cluster, so cluster obtain several points cluster.
In one example, such as according to the history of dispatching personnel 100 of the address that data are obtained about user A are got ready
Latitude and longitude coordinates point sets the first dot density threshold value as 1/100m2, 100 longitudes and latitudes are sat according to the first dot density threshold value
Mark is clustered, and 5 clusters, for example, a, b, c, d, e are obtained.
Step S1032, the point cluster for meeting preset condition is chosen in several points cluster as target point cluster;
In present embodiment, from several points cluster when selection target point cluster, setting a preset condition is, for example, target
The longitude and latitude that point cluster includes counts (i.e. the number of latitude and longitude coordinates point) at most, and the longitude and latitude points for including account for multiple longitudes and latitudes
The percentage always counted be greater than preset percentage.
In the same example, preset percentage is, for example, 30%.When selection target point cluster, several points cluster is found first
In include the longitude and latitude point cluster of counting most, if longitude and latitude points simultaneously in the cluster account for always counting for multiple longitudes and latitudes
Percentage is greater than preset percentage, it is determined that the cluster is target point cluster.It is poly- according to 100 latitude and longitude coordinates points to user A
5 clusters that class obtains obtain the number of latitude and longitude coordinates point in each cluster, calculate latitude and longitude coordinates point in each cluster
Number accounts for the longitude and latitude in the percentage of latitude and longitude coordinates point total quantity, such as point cluster a, point cluster b, point cluster c, point cluster d, point cluster e
Coordinate points number is respectively 40,20,25,5,10, and corresponding percentage is respectively 40%, 20%, 25%, 5%, 10%.Point cluster a
It for maximum point cluster, percentage highest, and has been more than preset percentage 30%.Accordingly, it is determined that point cluster a is target point cluster.
Step S1033, standard longitude and latitude is calculated according to target point cluster.
In present embodiment, according to the coordinate value of longitude and latitude each in target point cluster, standard longitude and latitude is calculated.Specifically, meter
Calculation obtains longitude average value and latitude average value, constitutes standard longitude and latitude by longitude average value and latitude average value.
In the same example, for example, target point cluster is point cluster a, includes 40 longitude and latitude points, obtain 40 longitudes and latitudes
The longitude of point, is calculated longitude average value, and obtain the latitude value of 40 longitude and latitude points, and it is average that latitude is calculated
Value, further obtains standard longitude and latitude.
Step S104, the corresponding normal address information of each address information is stored in address base, normal address information includes
Normal address and standard longitude and latitude.
In present embodiment, the standard longitude and latitude that the obtained normal address step S102 and step S103 are obtained, addition
Into normal address information, and by normal address information storage into address base.Meanwhile not by original storage in address base
Normalised address information is deleted, to remove redundant data.
As above, after completing standardization to the address information in address base, when staff searches on the electronic map
It is obtaining the result is that an address information when one destination address, and there is the positioning of accurate longitude and latitude.
The information processing method of the address base of present embodiment will be more by each address information stored in address base
It is a normal address that a address knows method for distinguishing to carry out cluster post-processing using regional part, reduces the number of addresses of redundancy
According to;The multiple latitude and longitude values cluster post-processing got ready according further to history is a standard longitude and latitude, improves address information
Positional accuracy.
Second embodiment of the present invention is related to a kind of information processing method of address base, this method comprises:
Step S101, the address of each address information and longitude and latitude in address base are obtained.
Step S102, address is standardized, obtains the normal address of each address information.
Step S103, the address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains study plot
The standard longitude and latitude of location.
The each address information stored in address base, when corresponding latitude and longitude information can be creation address information, by
Special communication staff reaches position according to address and gets typing after acquisition ready, after being also possible to address information creation
Typing after getting acquisition after dispensing place ready is reached when executing dispatching task by dispatching personnel.In the actual conditions for taking out dispatching
In, there may be certain difference or different dispatching personnel for the position that the physical location and communication staff of user is got ready
The longitude and latitude of typing can also have differences or the longitude and latitude of same dispatching personnel not homogeneous typing can also have differences.And it is right
Answer an address actually only one accurate longitude and latitude.Therefore, it is got pair according to the historical data that dispatching personnel get ready
The multiple latitude and longitude informations for answering the same normal address carry out cluster integration, the available mark to multiple latitude and longitude information
The corresponding standard longitude and latitude in quasi- address.In the present embodiment, the specific implementation of step S103 is as shown in figure 4, specifically include:
Step S1031, according to the multiple longitudes and latitudes for the address information for having identical standard address, it is big that cluster obtains dot density
In the several points cluster of dot density threshold value.
In present embodiment, a dot density threshold value, for example, the first dot density threshold value are set, with will be provided with identical standard
Dot density is greater than the first dot density threshold value according to calculating dot density separated by a distance by multiple longitudes and latitudes of the address information of location
Multiple latitude and longitude coordinates points cluster into same cluster, so cluster obtain several points cluster.
Step S1032, judge in several points cluster, the longitude and latitude points in maximum point cluster account for total points of multiple longitudes and latitudes
Percentage whether be greater than or equal to the preset percentage.
In one example, a preset percentage is set, for example, 30%.Further, it is looked for from several points cluster
To maximum point cluster, i.e., the point cluster for counting most comprising longitude and latitude, judge the longitude and latitude points in the maximum point cluster account for have it is identical
Whether the percentage of multiple longitudes and latitudes of the address information of normal address always counted is greater than preset percentage.Herein, it puts in cluster
The longitude and latitude points for including refer to the quantity for the latitude and longitude coordinates point for including in the cluster.
If the determination result is YES, then it represents cluster to complete, directly execution step S1033.
If judging result be it is no, need to cluster again, that is, return to step S1031.In step S1031, change
The size of preset dot density threshold value, for example, by the first dot density threshold modifying be second point density threshold, had according to described
Multiple longitudes and latitudes of the address information of identical standard address, if cluster obtains dot density more than or equal to second point density threshold
Dry cluster, wherein second point density threshold is less than the first dot density threshold value.
As described above, by the size for changing preset dot density threshold value, by cluster process at least once, by multiple warps
Latitude is clustered into several points cluster.
In one example, when such as preset percentage is, for example, 30%, 50 longitudes and latitudes of the address about user B are sat
Punctuate sets the first dot density threshold value as 1/100m2, 50 latitude and longitude coordinates are gathered according to the first dot density threshold value
Class obtains 4 clusters, for example, a1, b1, c1, d1.Wherein maximum point cluster is point cluster b1, includes 14 latitude and longitude coordinates points, accounts for
The percentage of latitude and longitude coordinates point total quantity is 28%, is less than preset percentage 30%.Then modifying point density threshold is second point
Density threshold 0.8/100m2, clustered again, cluster obtains 4 clusters, for example, a2, b2, c2, d2, obtains maximum point cluster
It include 18 latitude and longitude coordinates points for a cluster b2, the percentage for accounting for latitude and longitude coordinates point total quantity is 36%, is greater than default percentage
Than 30%, cluster is completed.
Step S1033, the point cluster for meeting preset condition is chosen in several points cluster as target point cluster;
In present embodiment, from several points cluster when selection target point cluster, setting a preset condition is, for example, target
The longitude and latitude points that point cluster includes are most, and the longitude and latitude points for including account for the percentage of multiple longitudes and latitudes always counted greater than pre-
If percentage.
In one example, preset percentage is, for example, 30%.When selection target point cluster, found in several points cluster first
The point cluster that the longitude and latitude for including is counted most, if the longitude and latitude points simultaneously in the cluster account for hundred always to count of multiple longitudes and latitudes
Divide than being greater than preset percentage.
Step S1034, standard longitude and latitude is calculated according to target point cluster.
In present embodiment, according to the coordinate value of longitude and latitude each in target point cluster, standard longitude and latitude is calculated.Specifically, meter
Calculation obtains longitude average value and latitude average value, constitutes standard longitude and latitude by longitude average value and latitude average value.
In the same example, for example, target point cluster is point cluster b2, includes 18 latitude and longitude coordinates points, obtain this 18
The longitude of longitude and latitude point, is calculated longitude average value, and obtains the latitude value of 18 longitude and latitude points, and latitude is calculated
Average value further obtains standard longitude and latitude.
Step S104, the corresponding normal address information of each address information is stored in address base, normal address information includes
Normal address and standard longitude and latitude.
In present embodiment, the standard longitude and latitude that the obtained normal address step S102 and step S103 are obtained, addition
Into normal address information, and by normal address information storage into address base.Meanwhile not by original storage in address base
Normalised address information is deleted, to remove redundant data.
As above, after completing standardization to the address information in address base, when staff searches on the electronic map
It is obtaining the result is that an address information when one destination address, and there is the positioning of accurate longitude and latitude.
The information processing method of the address base of present embodiment will be more by each address information stored in address base
It is a normal address that a address knows method for distinguishing to carry out cluster post-processing using regional part, reduces the number of addresses of redundancy
According to;It is a standard warp additionally by preset density threshold is changed according to multiple latitude and longitude values cluster post-processing that history is got ready
Latitude improves the positional accuracy of address information.
Third embodiment of the present invention is related to a kind of information processing unit of address base, and Fig. 5 is third according to the present invention
The information processing unit schematic diagram for the address base that embodiment provides, the device 500 include:
Address information obtains module 501, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module 502 obtains the normal address of each address information for being standardized to address.
Longitude and latitude clustering processing module 503 carries out the poly- of longitude and latitude for will be provided with the address information of identical standard address
Class integration, obtains the standard longitude and latitude of normal address.
Normal address information storage module 504, for storing normal address information, normal address packet in address base
Include normal address and standard longitude and latitude.
In one example, address clustering processing module 502 obtains the regional part of address;Regional part is identical more
A address is clustered, and normal address is obtained.
In one example, longitude and latitude clustering processing module 503 is according to the more of the address information for having identical standard address
A longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value;It is selected in several points cluster
Take the point cluster for meeting preset condition as target point cluster;Standard longitude and latitude is calculated according to target point cluster.Wherein, preset condition are as follows:
The longitude and latitude points for including are most, and the longitude and latitude points for including account for the percentage of multiple longitudes and latitudes always counted greater than default hundred
Divide ratio.
In one example, in multiple longitudes and latitudes according to the address information for having identical standard address, cluster is obtained a little
Density is greater than or equal to after the several points cluster of the first dot density threshold value, chooses in several points cluster and meets preset condition
Before point cluster is as target point cluster, longitude and latitude clustering processing module 503 is also used to judge in several points cluster, in maximum point cluster
Whether the percentage always counted that longitude and latitude points account for multiple longitudes and latitudes is greater than or equal to preset percentage;If it is not, then according to tool
Multiple longitudes and latitudes of the address information of standby identical standard address, cluster obtain dot density more than or equal to second point density threshold
Several points cluster;Second point density threshold is less than the first dot density threshold value.
In one example, it is flat to calculate longitude according to each longitude and latitude in target point cluster for longitude and latitude clustering processing module 503
Mean value and latitude average value constitute standard longitude and latitude by longitude average value and latitude average value.
Four embodiment of the invention is related to a kind of electronic equipment, and Fig. 6 is the electronic equipment provided according to the present embodiment
Schematic diagram, the electronic equipment include: at least one processor 601;And it is deposited with what at least one processor 601 communicated to connect
Reservoir 602;And respectively with processor 601 and memory 602 be communication connection communication component 603, communication component 603
Data are sended and received under the control of processor 601;Wherein, memory 602, which is stored with, to be held by least one processor 601
Capable instruction, instruction are executed by least one processor 601 to realize:
Obtain the address of each address information and longitude and latitude in address base;
Address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard warp of normal address
Latitude;
Store the corresponding normal address information of each address information in address base, normal address information include normal address and
Standard longitude and latitude.
The electronic equipment includes: one or more processors 601 and memory 602, with a processor 601 in Fig. 6
For.Processor 601, memory 602 can be connected by bus or other modes, in Fig. 6 for being connected by bus.
Memory 602 is used as a kind of non-volatile computer readable storage medium storing program for executing, can be used for storing non-volatile software program, non-volatile
Property computer executable program and module.Non-volatile software journey of the processor 601 by operation storage in the memory 602
Sequence, instruction and module realize the information in address above mentioned library thereby executing the various function application and data processing of equipment
Processing method.
Memory 602 may include storing program area and storage data area, wherein storing program area can store operation system
Application program required for system, at least one function;Storage data area can store normal address, standard longitude and latitude, history are got ready
Longitude and latitude data etc..In addition, memory 602 may include high-speed random access memory, it can also include non-volatile deposit
Reservoir, for example, at least a disk memory, flush memory device or other non-volatile solid state memory parts.In some implementations
In mode, optional memory 602 includes the memory remotely located relative to processor 601, these remote memories can lead to
Network connection is crossed to external equipment.The example of above-mentioned network includes but is not limited to internet, intranet, local area network, movement
Communication network and combinations thereof.
One or more module stores in the memory 602, when being executed by one or more processor 601, holds
The information processing method of address base in the above-mentioned any means embodiment of row.
The said goods can be performed the application embodiment provided by method, have the corresponding functional module of execution method and
Beneficial effect, the not technical detail of detailed description in the present embodiment, reference can be made to method provided by the application embodiment.
5th embodiment of the invention is related to a kind of non-volatile memory medium, for storing computer-readable program,
Computer-readable program is used to execute above-mentioned all or part of embodiment of the method for computer.
That is, it will be understood by those skilled in the art that implement the method for the above embodiments be can be with
Relevant hardware is instructed to complete by program, which is stored in a storage medium, including some instructions are to make
It obtains an equipment (can be single-chip microcontroller, chip etc.) or processor (processor) executes each embodiment method of the application
All or part of the steps.And storage medium above-mentioned includes: USB flash disk, mobile hard disk, read-only memory (ROM, Read-Only
Memory), random access memory (RAM, Random Access Memory), magnetic or disk etc. are various can store journey
The medium of sequence code.
It will be understood by those skilled in the art that the respective embodiments described above are to realize specific embodiments of the present invention,
And in practical applications, can to it, various changes can be made in the form and details, without departing from the spirit and scope of the present invention.
The embodiment of the present application discloses a kind of information processing method of address base of A1., comprising:
Obtain the address of each address information and longitude and latitude in address base;
The address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the mark of the normal address
Quasi- longitude and latitude;
The corresponding normal address information of each address information, the normal address packet are stored in the address base
Include the normal address and the standard longitude and latitude.
The information processing method of A2, address base as described in a1, it is described that the address is standardized, obtain each address
The normal address of information, comprising:
Obtain the regional part of the address;
The identical multiple addresses of the regional part are clustered, normal address is obtained.
The information processing method of A3, as described in A2 address base, the regional part include province, city, area, street, Lou Hao,
Any number of combinations in number.
The information processing method of A4, address base as described in a1, the address information that will be provided with identical standard address into
Pass through latitude cluster integration, obtain the standard longitude and latitude of the normal address, comprising:
According to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density and is greater than or equal to the
The several points cluster of some density thresholds;
The point cluster for meeting preset condition is chosen in the several points cluster as target point cluster;
Standard longitude and latitude is calculated according to the target point cluster.
The information processing method of A5, address base as described in A4, the preset condition are as follows:
The longitude and latitude points for including are most, and the longitude and latitude points for including account for the percentage of the multiple longitude and latitude always counted
Than being greater than preset percentage.
The information processing method of A6, address base as described in a5, the basis have the address information of identical standard address
Multiple longitudes and latitudes, cluster obtain dot density more than or equal to the first dot density threshold value several points cluster after, if described
Before choosing the point cluster for meeting preset condition as target point cluster in dry cluster, further includes:
Judge that the longitude and latitude points in the several points cluster, in maximum point cluster account for always counting for the multiple longitude and latitude
Whether percentage is greater than or equal to the preset percentage;
If it is not, then having multiple longitudes and latitudes of the address information of identical standard address according to, cluster obtains dot density
More than or equal to the several points cluster of second point density threshold;The second point density threshold is less than the first dot density threshold
Value.
The information processing method of the described in any item address bases of A7, such as A4-A6, described calculated according to the target point cluster are marked
Quasi- longitude and latitude, comprising:
According to each longitude and latitude in the target point cluster, longitude average value and latitude average value are calculated;
The longitude average value and the latitude average value constitute the standard longitude and latitude.
The information processing method of A8, address base as described in a1, the longitude and latitude are the warp for getting position ready for dispensing transport power
Latitude.
The embodiment of the present application discloses a kind of information processing unit of address base of B1., comprising:
Address information obtains module, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module obtains the normal address of each address information for being standardized to the address;
Longitude and latitude clustering processing module carries out the cluster of longitude and latitude for will be provided with the address information of identical standard address
Integration, obtains the standard longitude and latitude of the normal address;
Normal address information storage module, for storing normal address information, the normal address in the address base
Information includes the normal address and the standard longitude and latitude.
The embodiment of the present application discloses C1. a kind of electronic equipment, comprising: at least one processor;And
The memory being connect at least one described processor communication;
Wherein, the memory is stored with the instruction that can be executed by least one described processor, and described instruction is described
At least one processor is executed to realize: obtaining the address of each address information and longitude and latitude in address base;The address is carried out
Standardization, obtains the normal address of each address information;The address information that will be provided with identical standard address carries out the cluster of longitude and latitude
Integration, obtains the standard longitude and latitude of the normal address;The corresponding standard of each address information is stored in the address base
Address information, the normal address information include the normal address and the standard longitude and latitude.
C2, the electronic equipment as described in C1, it is described that the address is standardized, obtain the study plot of each address information
Location, comprising: obtain the regional part of the address;The identical multiple addresses of the regional part are clustered, are obtained
Normal address.
C3, the electronic equipment as described in C2, the regional part include province, city, area, street, Lou Hao, appointing in number
It anticipates multiple combinations.
C4, the electronic equipment as described in C1, the address information that will be provided with identical standard address carry out the poly- of longitude and latitude
Class integration, obtains the standard longitude and latitude of the normal address, comprising: according to the multiple of the address information for having identical standard address
Longitude and latitude, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value;In the several points cluster
The point cluster for meeting preset condition is chosen as target point cluster;Standard longitude and latitude is calculated according to the target point cluster.
C5, the electronic equipment as described in C4, the preset condition are as follows: the longitude and latitude points for including are most, and the warp for including
The percentage always counted that latitude points account for the multiple longitude and latitude is greater than preset percentage.
C6, the electronic equipment as described in C5, the basis have multiple longitudes and latitudes of the address information of identical standard address,
After cluster obtains dot density more than or equal to the several points cluster of the first dot density threshold value, chosen in the several points cluster
Before meeting the point cluster of preset condition as target point cluster, further includes: judge the warp in the several points cluster, in maximum point cluster
Whether the percentage always counted that latitude points account for the multiple longitude and latitude is greater than or equal to the preset percentage;If it is not, then
According to multiple longitudes and latitudes of the address information for having identical standard address, cluster obtains dot density more than or equal to second point
The several points cluster of density threshold;The second point density threshold is less than the first dot density threshold value.
The described in any item electronic equipments of C7, such as C4-C6, it is described that standard longitude and latitude, packet are calculated according to the target point cluster
It includes: according to each longitude and latitude in the target point cluster, calculating longitude average value and latitude average value;The longitude average value and institute
It states latitude average value and constitutes the standard longitude and latitude.
C8, the electronic equipment as described in C1, the longitude and latitude are the longitude and latitude for getting position ready for dispensing transport power.
The embodiment of the present application discloses a kind of non-volatile memory medium of D1., described for storing computer-readable program
Computer-readable program is used to execute the information processing method of the address base as described in any one of A1 to A8 for computer.
Claims (10)
1. a kind of information processing method of address base characterized by comprising
Obtain the address of each address information and longitude and latitude in address base;
The address is standardized, the normal address of each address information is obtained;
The address information that will be provided with identical standard address carries out the cluster integration of longitude and latitude, obtains the standard warp of the normal address
Latitude;
The corresponding normal address information of each address information is stored in the address base, the normal address information includes institute
State normal address and the standard longitude and latitude.
2. the information processing method of address base according to claim 1, which is characterized in that described to be marked to the address
Standardization obtains the normal address of each address information, comprising:
Obtain the regional part of the address;
The identical multiple addresses of the regional part are clustered, normal address is obtained.
3. the information processing method of address base according to claim 2, which is characterized in that the regional part include save,
City, area, street, Lou Hao, any number of combinations in number.
4. the information processing method of address base according to claim 1, which is characterized in that it is described with will be provided with identical standard
The address information of location carries out the cluster integration of longitude and latitude, obtains the standard longitude and latitude of the normal address, comprising:
According to the multiple longitudes and latitudes for the address information for having identical standard address, cluster obtains dot density more than or equal to first point
The several points cluster of density threshold;
The point cluster for meeting preset condition is chosen in the several points cluster as target point cluster;
Standard longitude and latitude is calculated according to the target point cluster.
5. the information processing method of address base according to claim 4, which is characterized in that the preset condition are as follows:
Include longitude and latitude points at most, and include longitude and latitude points account for the multiple longitude and latitude the percentage always counted it is big
In preset percentage.
6. the information processing method of address base according to claim 5, which is characterized in that the basis has identical standard
Multiple longitudes and latitudes of the address information of address, cluster obtain the several points cluster that dot density is greater than or equal to the first dot density threshold value
Later, before the point cluster for meeting preset condition is chosen in the several points cluster as target point cluster, further includes:
Judge that the longitude and latitude points in the several points cluster, in maximum point cluster account for the percentage of the multiple longitude and latitude always counted
Than whether being greater than or equal to the preset percentage;
If it is not, then having multiple longitudes and latitudes of the address information of identical standard address according to, cluster obtains dot density and is greater than
Or the several points cluster equal to second point density threshold;The second point density threshold is less than the first dot density threshold value.
7. the information processing method of the address base according to any one of claim 4 to 6, which is characterized in that the basis
The target point cluster calculates standard longitude and latitude, comprising:
According to each longitude and latitude in the target point cluster, longitude average value and latitude average value are calculated;
The longitude average value and the latitude average value constitute the standard longitude and latitude.
8. a kind of information processing unit of address base characterized by comprising
Address information obtains module, for obtaining the address of each address information and longitude and latitude in address base;
Address clustering processing module obtains the normal address of each address information for being standardized to the address;
Longitude and latitude clustering processing module, for will be provided with the address information of identical standard address, the cluster for carrying out longitude and latitude is integrated,
Obtain the standard longitude and latitude of the normal address;
Normal address information storage module, for storing normal address information, the normal address information in the address base
Including the normal address and the standard longitude and latitude.
9. a kind of electronic equipment characterized by comprising at least one processor;And
The memory being connect at least one described processor communication;
Wherein, the memory be stored with can by least one described processor execute instruction, described instruction by it is described at least
One processor is executed to realize:
Obtain the address of each address information and longitude and latitude in address base;The address is standardized, each address information is obtained
Normal address;It will be provided with the address information of identical standard address, carry out the cluster integration of longitude and latitude, obtain the normal address
Standard longitude and latitude;Normal address information is stored in the address base, the normal address information includes the normal address
With the standard longitude and latitude.
10. a kind of non-volatile memory medium, for storing computer-readable program, which is characterized in that described computer-readable
Program is used to execute the information processing method of the address base as described in any one of claims 1 to 7 for computer.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811488202.XA CN109635063A (en) | 2018-12-06 | 2018-12-06 | Information processing method and device for address library, electronic equipment and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811488202.XA CN109635063A (en) | 2018-12-06 | 2018-12-06 | Information processing method and device for address library, electronic equipment and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109635063A true CN109635063A (en) | 2019-04-16 |
Family
ID=66071742
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811488202.XA Pending CN109635063A (en) | 2018-12-06 | 2018-12-06 | Information processing method and device for address library, electronic equipment and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109635063A (en) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110597943A (en) * | 2019-09-16 | 2019-12-20 | 腾讯科技(深圳)有限公司 | Interest point processing method and device based on artificial intelligence and electronic equipment |
CN112016326A (en) * | 2020-09-25 | 2020-12-01 | 北京百度网讯科技有限公司 | Map area word recognition method and device, electronic equipment and storage medium |
CN112487122A (en) * | 2020-12-02 | 2021-03-12 | 电信科学技术第十研究所有限公司 | Address normalization processing method and device |
CN112801189A (en) * | 2021-01-29 | 2021-05-14 | 上海寻梦信息技术有限公司 | Method and device for detecting longitude and latitude abnormity, electronic equipment and storage medium |
CN113537808A (en) * | 2021-07-27 | 2021-10-22 | 石家庄开发区天远科技有限公司 | Engineering machinery accessory library site selection method based on space-time big data |
Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103123628A (en) * | 2011-11-21 | 2013-05-29 | 腾讯科技(深圳)有限公司 | Searching method and system for geographical location |
CN103996293A (en) * | 2014-06-09 | 2014-08-20 | 重庆大学 | Real-time traffic status collecting and inquiring system and method based on rider collaboration |
CN104050196A (en) * | 2013-03-15 | 2014-09-17 | 阿里巴巴集团控股有限公司 | Point of interest (POI) data redundancy detection method and device |
CN104572955A (en) * | 2014-12-29 | 2015-04-29 | 北京奇虎科技有限公司 | System and method for determining POI name based on clustering |
CN104935676A (en) * | 2014-03-17 | 2015-09-23 | 阿里巴巴集团控股有限公司 | Method and device for determining IP address fields and corresponding latitude and longitude |
CN105808715A (en) * | 2016-03-07 | 2016-07-27 | 武汉大学 | Method for establishing map per location |
WO2016127904A1 (en) * | 2015-02-13 | 2016-08-18 | 阿里巴巴集团控股有限公司 | Text address processing method and apparatus |
CN106934015A (en) * | 2017-03-10 | 2017-07-07 | 北京京东尚科信息技术有限公司 | Address date treating method and apparatus |
CN107133269A (en) * | 2017-04-01 | 2017-09-05 | 中国人民解放军国防科学技术大学 | Frequent location track generation method and device based on mobile target |
CN107622061A (en) * | 2016-07-13 | 2018-01-23 | 阿里巴巴集团控股有限公司 | A kind of method, apparatus and system for determining address uniqueness |
CN108763538A (en) * | 2018-05-31 | 2018-11-06 | 北京嘀嘀无限科技发展有限公司 | A kind of method and device in the geographical locations determining point of interest POI |
-
2018
- 2018-12-06 CN CN201811488202.XA patent/CN109635063A/en active Pending
Patent Citations (11)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN103123628A (en) * | 2011-11-21 | 2013-05-29 | 腾讯科技(深圳)有限公司 | Searching method and system for geographical location |
CN104050196A (en) * | 2013-03-15 | 2014-09-17 | 阿里巴巴集团控股有限公司 | Point of interest (POI) data redundancy detection method and device |
CN104935676A (en) * | 2014-03-17 | 2015-09-23 | 阿里巴巴集团控股有限公司 | Method and device for determining IP address fields and corresponding latitude and longitude |
CN103996293A (en) * | 2014-06-09 | 2014-08-20 | 重庆大学 | Real-time traffic status collecting and inquiring system and method based on rider collaboration |
CN104572955A (en) * | 2014-12-29 | 2015-04-29 | 北京奇虎科技有限公司 | System and method for determining POI name based on clustering |
WO2016127904A1 (en) * | 2015-02-13 | 2016-08-18 | 阿里巴巴集团控股有限公司 | Text address processing method and apparatus |
CN105808715A (en) * | 2016-03-07 | 2016-07-27 | 武汉大学 | Method for establishing map per location |
CN107622061A (en) * | 2016-07-13 | 2018-01-23 | 阿里巴巴集团控股有限公司 | A kind of method, apparatus and system for determining address uniqueness |
CN106934015A (en) * | 2017-03-10 | 2017-07-07 | 北京京东尚科信息技术有限公司 | Address date treating method and apparatus |
CN107133269A (en) * | 2017-04-01 | 2017-09-05 | 中国人民解放军国防科学技术大学 | Frequent location track generation method and device based on mobile target |
CN108763538A (en) * | 2018-05-31 | 2018-11-06 | 北京嘀嘀无限科技发展有限公司 | A kind of method and device in the geographical locations determining point of interest POI |
Non-Patent Citations (1)
Title |
---|
陈睿嘉 等: "基于网络爬虫的导航深度服务信息自动采集", 《测绘工程》 * |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110597943A (en) * | 2019-09-16 | 2019-12-20 | 腾讯科技(深圳)有限公司 | Interest point processing method and device based on artificial intelligence and electronic equipment |
CN110597943B (en) * | 2019-09-16 | 2022-04-01 | 腾讯科技(深圳)有限公司 | Interest point processing method and device based on artificial intelligence and electronic equipment |
CN112016326A (en) * | 2020-09-25 | 2020-12-01 | 北京百度网讯科技有限公司 | Map area word recognition method and device, electronic equipment and storage medium |
CN112487122A (en) * | 2020-12-02 | 2021-03-12 | 电信科学技术第十研究所有限公司 | Address normalization processing method and device |
CN112487122B (en) * | 2020-12-02 | 2024-05-17 | 电信科学技术第十研究所有限公司 | Address normalization processing method and device |
CN112801189A (en) * | 2021-01-29 | 2021-05-14 | 上海寻梦信息技术有限公司 | Method and device for detecting longitude and latitude abnormity, electronic equipment and storage medium |
CN113537808A (en) * | 2021-07-27 | 2021-10-22 | 石家庄开发区天远科技有限公司 | Engineering machinery accessory library site selection method based on space-time big data |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109635063A (en) | Information processing method and device for address library, electronic equipment and storage medium | |
CN110175216B (en) | Coordinate error correction method and device and computer equipment | |
CN110020221B (en) | Job distribution confirmation method, apparatus, server and computer readable storage medium | |
JP6689515B2 (en) | Method and apparatus for identifying the type of user geographic location | |
CN109919437B (en) | big data-based intelligent tourism target matching method and system | |
CN108122012B (en) | Method, device and equipment for determining center point of stationary point and storage medium | |
CN109657163A (en) | Destination address determining method and device, electronic equipment and storage medium | |
CN107124695A (en) | The method and system of accessible location is marked based on associated person information | |
CN106210163B (en) | IP address-based localization method and device | |
CN111639092B (en) | Personnel flow analysis method and device, electronic equipment and storage medium | |
CN110046174B (en) | population migration analysis method and system based on big data | |
CN108256718A (en) | Declaration form service role distribution method, device, computer equipment and storage device | |
KR20140097805A (en) | Coordinates (x, y) position value using a systematic block code generated and the address matching service using methods | |
CN107247791B (en) | Parking lot map data generation method and device and machine-readable storage medium | |
CN107038620A (en) | Based on user call a taxi preference information push and device | |
CN103617254A (en) | Method, system and device for constructing geographic position coordinate information base | |
CN103198071B (en) | Datagram table generating method and device thereof | |
CN114357097A (en) | Map annotation construction method and device, computer equipment and storage medium | |
CN109857822A (en) | Meta-model conversion method and management system based on chart database | |
CN105930313A (en) | Method and device for processing notification message | |
CN111177589A (en) | Address information query method, device, equipment and storage medium | |
CN106469205A (en) | A kind of method and apparatus of the geographical location information determining user | |
Khoussainova et al. | Probabilistic rfid data management | |
CN102184226B (en) | Method for constructing real-time database and data searching method | |
CN109815278A (en) | A kind of method for exhibiting data and its equipment, storage medium, electronic equipment |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20190416 |