WO2019128355A1 - Procédé et dispositif permettant de déterminer un emplacement géographique précis - Google Patents

Procédé et dispositif permettant de déterminer un emplacement géographique précis Download PDF

Info

Publication number
WO2019128355A1
WO2019128355A1 PCT/CN2018/108635 CN2018108635W WO2019128355A1 WO 2019128355 A1 WO2019128355 A1 WO 2019128355A1 CN 2018108635 W CN2018108635 W CN 2018108635W WO 2019128355 A1 WO2019128355 A1 WO 2019128355A1
Authority
WO
WIPO (PCT)
Prior art keywords
geographic location
optimal
clustering
location
geographic
Prior art date
Application number
PCT/CN2018/108635
Other languages
English (en)
Chinese (zh)
Inventor
肖明科
Original Assignee
北京京东尚科信息技术有限公司
北京京东世纪贸易有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京京东尚科信息技术有限公司, 北京京东世纪贸易有限公司 filed Critical 北京京东尚科信息技术有限公司
Publication of WO2019128355A1 publication Critical patent/WO2019128355A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/28Databases characterised by their database models, e.g. relational or object models
    • G06F16/284Relational databases
    • G06F16/285Clustering or classification
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L2101/00Indexing scheme associated with group H04L61/00
    • H04L2101/60Types of network addresses
    • H04L2101/668Internet protocol [IP] address subnets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L2101/00Indexing scheme associated with group H04L61/00
    • H04L2101/60Types of network addresses
    • H04L2101/69Types of network addresses using geographic information, e.g. room number

Definitions

  • the present invention relates to the field of Internet technologies, and in particular, to a method and apparatus for determining an accurate geographic location.
  • IP positioning technology in short, is a technology that determines the geographic location of a device by its IP address. IP positioning has an extremely wide range of applications, including targeted advertising, social networking, network security, performance optimization, and more.
  • terminal devices including GPS information modules, such as mobile phones, can easily obtain the user's street-level geographic location through data reporting. However, if it is a terminal such as a desktop computer or a notebook that does not contain GPS hardware devices, it is impossible to obtain the user's geographic location through technologies such as GPS. In this case, high-precision IP positioning technology is required. The traditional IP positioning can only be located at the municipal level, and the accuracy of the district-level data is also debatable.
  • the traditional IP positioning algorithm estimates the position based on the linear relationship between the delay and the geographical distance, and reduces the error through the topology.
  • BGP Border Gateway Protocol
  • ASN Automatic System Number
  • the embodiments of the present invention provide a method and apparatus for determining a precise geographic location, which improves positioning accuracy, and the present invention does not require a large number of monitoring points to be laid, thereby reducing the cost while improving positioning accuracy.
  • a method for determining an accurate geographic location includes: obtaining an IP and a plurality of geographic locations associated with the IP; using a clustering algorithm, Geographical clustering is performed to obtain a geographical location clustering result of the IP; and based on the geographical location clustering result, an optimal algorithm is used to determine an optimal geographic location corresponding to the IP; according to the optimal geographic location and pre- An artificial neural network model is set to determine the precise geographic location of the IP.
  • the clustering algorithm is a k-means algorithm
  • the optimization algorithm is a weighted least squares method
  • the step of clustering the plurality of geographic locations to obtain the geographical location clustering result of the IP by using a clustering algorithm comprises: selecting two geographical locations from multiple geographic locations associated with the IP a first initial centroid and a second initial centroid; calculating a first spherical distance between each of the plurality of geographic locations and the first initial centroid and a second spherical distance from the second initial centroid And clustering the plurality of geographical locations associated with the IP to obtain a high density cluster, and using the high density cluster as the geographic location cluster of the IP according to the first spherical distance and the second spherical distance result.
  • a first spherical distance between each geographic location and the first initial centroid and a second spherical distance from the second initial centroid are calculated according to equation (1) below:
  • R is the radius of the long axis of the earth
  • S is the spherical distance between the geographic location A and the geographic location B
  • ⁇ 1 is the latitude of the geographic location A
  • ⁇ 1 is the longitude of the geographic location A
  • ⁇ 2 is the latitude of the geographic location B
  • ⁇ 2 is Longitude of location B.
  • determining, according to the geographical location clustering result, an optimal algorithm for determining an optimal geographic location corresponding to each IP includes: for each geographic location in the high density cluster, according to each geographic location and a high density cluster centroid The spherical distance determines the weight of each of the geographic locations; according to the weights, the optimal geographic location corresponding to each IP is determined by a weighted least squares method.
  • the weight of each of the geographic locations is determined according to the following formula (2):
  • ⁇ i represents the weight of the i-th geographic location
  • d i represents the spherical distance between the i-th geographic location and the high-density cluster centroid
  • n is an integer greater than or equal to 1
  • determining the precise geographic location of the IP according to the optimal geographic location and the preset artificial neural network model includes: inputting the optimal geographic location into the preset artificial neural network model, and obtaining an output. As a result; if the output result is a preset target result, the optimal geographic location is the precise geographic location of the IP.
  • the input layer of the preset artificial neural network model has 3 neuron nodes
  • the hidden layer has 5 neuron nodes
  • the output layer has 1 neuron node
  • an apparatus for determining a precise geographic location including: an obtaining module, configured to acquire an IP and multiple geographic locations associated with the IP; a clustering module, The clustering algorithm is used to cluster the plurality of geographic locations to obtain a geographical location clustering result of the IP; an optimal geographic location determining module is configured to use an optimization algorithm based on the geographical location clustering result Determining an optimal geographic location corresponding to the IP; an accurate geographic location determining module, configured to determine an accurate geographic location of the IP according to the optimal geographic location and a preset artificial neural network model.
  • the clustering algorithm is a k-means algorithm
  • the optimization algorithm is a weighted least squares method
  • the clustering module is further configured to: select two geographic locations from the plurality of geographic locations associated with the IP as the first initial centroid and the second initial centroid; calculate each of the multiple geographic locations a first spherical distance between the geographic location and the first initial centroid and a second spherical distance from the second initial centroid; the IP association based on the first spherical distance and the second spherical distance A plurality of geographical locations are clustered to obtain a high density cluster, and the high density cluster is used as a geographical location clustering result of the IP.
  • the clustering module calculates a first spherical distance between each geographic location and the first initial centroid and a second spherical distance from the second initial centroid according to the following formula (1):
  • R is the radius of the long axis of the earth
  • S is the spherical distance between the geographic location A and the geographic location B
  • ⁇ 1 is the latitude of the geographic location A
  • ⁇ 1 is the longitude of the geographic location A
  • ⁇ 2 is the latitude of the geographic location B
  • ⁇ 2 is Longitude of location B.
  • the optimal geographic location determining module is further configured to: determine, for each geographic location in the high density cluster, a weight of each geographic location according to a spherical distance of each geographic location and a high density cluster centroid According to the weight, the optimal geographic location corresponding to each IP is determined by a weighted least squares method.
  • the weight of each of the geographic locations is determined according to the following formula (2):
  • ⁇ i represents the weight of the i-th geographic location
  • d i represents the spherical distance between the i-th geographic location and the high-density cluster centroid
  • n is an integer greater than or equal to 1
  • the precise geographic location determining module is further configured to: input the optimal geographic location into the preset artificial neural network model, and obtain an output result; if the output result is a preset target result, The optimal geographic location is the precise geographic location of the IP.
  • the input layer of the preset artificial neural network model has 3 neuron nodes
  • the hidden layer has 5 neuron nodes
  • the output layer has 1 neuron node
  • an electronic device includes: one or more processors; and storage means for storing one or more programs when the one or more programs are Executed by the one or more processors, such that the one or more processors implement the method of determining an accurate geographic location as described in an embodiment of the present invention.
  • a computer readable medium storing a computer program, the program being executed by a processor to implement a determined precise geographic location as described in an embodiment of the present invention Methods.
  • the clustering algorithm is used to cluster the plurality of geographic locations to obtain a geographical location clustering result for each IP; clustering results based on the geographic location Determining an optimal geographic location corresponding to the IP by using an optimization algorithm; determining a technical method of the precise geographic location of the IP according to the optimal geographic location and a preset artificial neural network model, thereby improving positioning accuracy, and There is no need to lay a large number of monitoring points, which reduces costs.
  • FIG. 1 is a schematic diagram of a main flow of a method of determining an accurate geographic location according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a main flow of a method of determining an accurate geographic location according to another embodiment of the present invention
  • FIG. 3 is a schematic diagram of main modules of an apparatus for determining a precise geographic location, in accordance with an embodiment of the present invention
  • FIG. 4 is an exemplary system architecture diagram to which an embodiment of the present invention may be applied;
  • Figure 5 is a block diagram showing the structure of a computer system suitable for implementing a terminal device or server in accordance with an embodiment of the present invention.
  • FIG. 1 is a schematic diagram of a main flow chart of a method for determining an accurate geographic location of an IP-geographic data set in accordance with an embodiment of the present invention. As shown in Figure 1, the method includes:
  • Step S101 Obtain an IP and multiple geographical locations associated with the IP
  • Step S102 Clustering the plurality of geographical locations by using a clustering algorithm to obtain a geographical location clustering result of the IP;
  • Step S103 Determine, according to the geographical location clustering result, an optimal geographic location corresponding to the IP by using an optimization algorithm
  • Step S104 Determine an accurate geographic location of the IP according to the optimal geographic location and a preset artificial neural network model.
  • the IP in this embodiment and the plurality of geographic locations associated with the IP may be obtained through a public geographic information database. It can also be obtained by receiving the IP reported by the data collection source and multiple geographical locations associated with the IP, for example, receiving an IP address reported by a reporting device (for example, a smart phone, a tablet, etc.) having a GPS information module, and the IP address. The geographic location associated with the address.
  • a reporting device for example, a smart phone, a tablet, etc.
  • any terminal device such as a mobile phone or a tablet computer can be used as a data collection source in the present embodiment. Therefore, the embodiment of the present invention does not need to lay a large number of monitoring points and reduces The cost.
  • the device identifier for example, a MAC address
  • the time stamp when the data is reported may be acquired, thereby
  • the device identification, time stamp, IP, and geographic location of the IP constitute a valid data, such as IP-MAC-GPS-TIMESTAMP, where GPS is the reported latitude and longitude information, and TIMESTAMP is the timestamp when the data is reported.
  • the above geographical location may be expressed as satellite positioning information such as latitude and longitude information, altitude information, or may be expressed as location information such as cities, streets, merchants, and office buildings.
  • the geographic location is preferably latitude and longitude information.
  • the above IP is essentially a 32-bit unsigned int data ranging from 0 to 2 32.
  • the IP address in the form of a string is generally used, which is the usual 192.168.0.1.
  • the form in fact, converts every 8 binary bits into a corresponding decimal integer, abbreviated as a numeric IP.
  • 192.168.0.1 and 3232252721 are equivalent.
  • the IP is a numerical IP for ease of use.
  • a plurality of geographical locations of the IP are clustered by using a clustering algorithm to exclude a geographical location with a large error, thereby obtaining a relatively accurate geographic location corresponding to the IP, thereby improving positioning accuracy.
  • the clustering algorithm may be a k-means clustering algorithm.
  • the device identifier may be used as a dimension and clustered by a timestamp, that is, the data reported by the same reporting device in a certain period of time is aggregated. class.
  • the above k-means algorithm is a typical distance-based clustering algorithm.
  • the distance is used as the evaluation index of similarity, that is, the closer the distance between two objects is, the greater the similarity is.
  • the core of the algorithm is to solve the problem by optimizing the distance from the data point to the centroid as a function of the optimization target, and using the function to take the extreme value to iterate continuously, so the compact and independent cluster is the final goal.
  • the step of clustering a plurality of geographical locations associated with the IP to obtain a geographical location clustering result of the IP by using a k-means clustering algorithm includes the following steps:
  • Step S201 Select two geographic locations from the plurality of geographic locations associated with the IP as the first initial centroid and the second initial centroid;
  • Step S202 calculating a first spherical distance between each of the plurality of geographical locations and the first initial centroid and a second spherical distance between the second initial centroid;
  • Step S203 Cluster the geographical locations associated with the IP according to the first spherical distance and the second spherical distance to obtain a high density cluster and a low density cluster, and use the high density cluster as the IP Geographic location clustering results.
  • step S201 the latitude and longitude data collected for a period of time for the same reporting device (ie, the same IP) is hashed near the real geographical location of the IP, and such points are dense, but due to external factors Influence, a few points have a large deviation from the real position, and the density is sparse. Therefore, the embodiment of the present invention defines clusters as high-density regions separated by low-density regions. When the initial centroid is selected, two types are selected on the density-based clusters.
  • two latitude and longitude may be randomly selected as the first initial centroid and the second initial centroid, or the average of all the latitude and longitude may be selected as the first initial centroid, and the latitude and longitude with the largest deviation from the average is taken as the second initial centroid.
  • step S202 since the latitude and longitude is the coordinates of the ellipsoid, the Euclidean distance cannot be simply used as a compact index for measuring the cluster, and the embodiment of the present invention uses the spherical distance as a compact index for measuring the cluster.
  • the spherical distance between two geographic locations can be calculated by the following formula:
  • R is the radius of the long axis of the earth
  • S is the spherical distance between the geographic location A and the geographic location B
  • ⁇ 1 is the latitude of the geographic location A
  • ⁇ 1 is the longitude of the geographic location A
  • ⁇ 2 is the latitude of the geographic location B
  • ⁇ 2 is Longitude of location B.
  • the geographical position close to the first initial centroid is a cluster
  • the geographical position close to the second initial centroid is another cluster. Then, recalculate the centroid of each cluster and repeat the iteration until the final centroid is constant or the change is small.
  • the high-density cluster is selected as the geographical clustering result of the IP, and the low-density cluster is discarded as the error cluster to avoid data pollution.
  • an optimization algorithm is needed to determine the optimal geographical position corresponding to each IP.
  • an optimization algorithm can be used to obtain an optimal solution for a high-density cluster of the same IP.
  • the optimization algorithm may be a weighted least squares method.
  • the weighted least squares method described above is a mathematical optimization technique that finds the best function match of the data by minimizing the sum of the squares of the errors.
  • the weighted least squares method has a wide range of applications in the field of engineering technology.
  • the weighted least squares method can be used to easily obtain unknown parameters and minimize the sum of squared errors between these obtained data and actual data.
  • the process of determining the optimal geographic location corresponding to the IP by using a weighted least squares method based on the geographic location clustering result may include the following steps:
  • ⁇ i represents the weight of the i-th latitude and longitude
  • d i represents the distance between the i-th latitude and longitude and the centroid
  • n is an integer greater than or equal to 1.
  • the weighted least squares method is used to determine the optimal geographic location corresponding to each IP. In this process, it is necessary to establish a nonlinear curve fitting function for the latitude and longitude of the same IP to minimize the variance.
  • the specific formula is as follows: (3):
  • (x i , y i ) represents the ith geographic location
  • (x i , y i ) is the plane coordinate after the latitude and longitude is converted to the geodetic coordinates by the Gauss projection by the ith geographic location.
  • a nonlinear regression model is established for the latitude and longitude data of the same IP: among them For the center coordinates, r is the radius. Find the optimal geographic location corresponding to the IP Make it satisfy The smallest.
  • step S103 after the k-means algorithm and the weighted least squares method described above, it can be determined that the data reported by a sampling device has been correctly processed, but in the actual process, the reported IP and latitude and longitude data are present due to factors such as a simulator. There may be large deviations, and this part of the data can be considered as abnormal data. Therefore, in the present embodiment, an artificial neural network model can be utilized to filter the optimal geographic location calculated by the same IP, thereby eliminating abnormal data. Specifically, after determining the optimal geographic location of the IP, an artificial neural network model is introduced to perform a simple 'classification' on the optimal geographic location, that is, all the optimal geographic locations are divided into two categories, normal and abnormal. class.
  • the method further comprises: determining an accurate geographic location of the IP according to the optimal geographic location and a preset artificial neural network model.
  • the optimal geographic location is an exact geographic location of the IP.
  • the method further comprises: training the artificial neural network model, that is, adjusting the weight of each neural node through the training data, so that the expected output of the normal optimal geographic location is obtained. For 1, the expected output of the abnormally optimal geographic location is zero.
  • a plurality of IP data associated with the correct geographical location are selected as normal data (for example, greater than 20,000 data), and artificial abnormal data is added to the same IP, and the artificial neural network model hidden layer weight training is performed by using the normal data and the artificial abnormal data.
  • the final function is guaranteed to converge, and the hidden layer weight parameter is used as the initialization parameter.
  • the input layer of the preset artificial neural network model has three neuron nodes corresponding to IP (numerical IP), longitude and latitude; the hidden layer has five neuron nodes, and the number of nodes It is determined by the developer through the training data convergence time and method; the output layer has one neuron node, and the output result is used to determine whether the latitude and longitude is abnormal data, the output result is 1 indicating that the latitude and longitude is normal data, and the output result is 0 indicating the latitude and longitude. For abnormal data.
  • IP number of IP
  • the hidden layer has five neuron nodes, and the number of nodes It is determined by the developer through the training data convergence time and method
  • the output layer has one neuron node, and the output result is used to determine whether the latitude and longitude is abnormal data, the output result is 1 indicating that the latitude and longitude is normal data, and the output result is 0 indicating the latitude and longitude. For abnormal data.
  • the above-mentioned preset target result may be 1, and if the output result is 1, the optimal geographical position is the precise geographical position of the IP.
  • the obtained IP and the precise geographic location of the IP may be saved.
  • the Artificial Neural Network is: abstracting the human brain neural network from the perspective of information processing, establishing a simple model, and forming different networks according to different connection modes.
  • a neural network is an operational model consisting of a large number of nodes (or neurons) connected to each other. Each node represents a specific output function called an activation function.
  • the connection between every two nodes represents a weighting value for passing the connection signal, called weight, which is equivalent to the memory of the artificial neural network.
  • the output of the network varies depending on the connection method of the network, the weight value and the excitation function.
  • the network itself is usually an approximation of an algorithm or function in nature, or it may be an expression of a logic strategy.
  • the method for determining the precise geographical location of the embodiment of the invention improves the positioning accuracy, and does not need to lay a large number of monitoring points, thereby reducing the cost.
  • reduce redundant data reduce GPS positioning errors caused by weather, signals, surrounding environment and other factors; then use weighted least squares method for different users (MAC) but the same IP geographical location
  • MAC weighted least squares method for different users
  • the method of the embodiment of the present invention can also obtain the variance according to the formula (3), and provide a quantitative indicator for the accuracy of the IP positioning.
  • the apparatus 300 includes: an obtaining module 301, configured to acquire an IP and multiple geographic locations associated with the IP; and a clustering module 302, configured to use the clustering algorithm to perform the multiple geographic locations Performing clustering to obtain a geographical location clustering result of the IP; an optimal geographic location determining module 303, configured to determine, according to the geographic location clustering result, an optimal geographic location corresponding to the IP by using an optimization algorithm;
  • the location determining module 304 is configured to determine an accurate geographic location of the IP according to the optimal geographic location and a preset artificial neural network model.
  • the clustering algorithm is a k-means algorithm
  • the optimization algorithm is a weighted least squares method
  • the clustering module 302 is further configured to: select two geographic locations from the plurality of geographic locations associated with the IP as the first initial centroid and the second initial centroid; calculate each of the multiple geographic locations a first spherical distance between the geographic location and the first initial centroid and a second spherical distance from the second initial centroid; the IP according to the first spherical distance and the second spherical distance The associated plurality of geographic locations are clustered to obtain a high density cluster, and the high density cluster is used as a geographical location clustering result of the IP.
  • the clustering module 302 calculates a first spherical distance between each geographic location and the first initial centroid and a second spherical distance from the second initial centroid according to the following formula (1):
  • R is the radius of the long axis of the earth
  • S is the spherical distance between the geographic location A and the geographic location B
  • ⁇ 1 is the latitude of the geographic location A
  • ⁇ 1 is the longitude of the geographic location A
  • ⁇ 2 is the latitude of the geographic location B
  • ⁇ 2 is Longitude of location B.
  • the optimal geographic location determining module 303 is further configured to: determine, for each geographic location in the high density cluster, the geographic distance of each geographic location and the high density cluster centroid, determine each geographic location Weights; based on the weights, the weighted least squares method is used to determine the optimal geographic location corresponding to each IP.
  • the weight of each of the geographic locations is determined according to the following formula (2):
  • ⁇ i represents the weight of the i-th geographic location
  • d i represents the spherical distance between the i-th geographic location and the high-density cluster centroid
  • n is an integer greater than or equal to 1
  • the precise geographic location determining module 304 is further configured to: input the optimal geographic location into the preset artificial neural network model, and obtain an output result; if the output result is a preset target result, The optimal geographic location is then the precise geographic location of the IP.
  • the input layer of the preset artificial neural network model has 3 neuron nodes
  • the hidden layer has 5 neuron nodes
  • the output layer has 1 neuron node
  • the device for determining the precise geographical position of the embodiment of the invention improves the positioning accuracy, and does not need to lay a large number of monitoring points, thereby reducing the cost.
  • reduce redundant data reduce GPS positioning errors caused by weather, signals, surrounding environment and other factors; then use weighted least squares method for different users (MAC) but the same IP geographical location
  • MAC weighted least squares method for different users
  • FIG. 4 illustrates an exemplary system architecture 400 of an IP-geographic data set construction method or IP-geographic data set construction apparatus to which embodiments of the present invention may be applied.
  • system architecture 400 can include terminal devices 401, 402, 403, network 404, and server 405.
  • Network 404 is used to provide a medium for communication links between terminal devices 401, 402, 403 and server 405.
  • Network 404 can include a variety of connection types, such as wired, wireless communication links, fiber optic cables, and the like.
  • the user can interact with the server 405 via the network 404 using the terminal devices 401, 402, 403 to receive or send messages and the like.
  • the terminal devices 401, 402, 403 may be various electronic devices having a display screen and supporting web browsing, including but not limited to smart phones, tablet computers, laptop portable computers, desktop computers, and the like.
  • the server 405 may be a server that provides various services, such as a background management server that provides support to a shopping site browsed by the user using the terminal devices 401, 402, and 403.
  • the background management server may analyze and process data such as the received product information query request, and feed back the processing result (for example, target push information and product information) to the terminal device.
  • the method for determining the precise geographic location is generally performed by the server 405. Accordingly, the IP positioning device is generally disposed in the server 405.
  • terminal devices, networks, and servers in FIG. 4 is merely illustrative. Depending on the implementation needs, there can be any number of terminal devices, networks, and servers.
  • FIG. 5 there is shown a block diagram of a computer system 500 suitable for use in implementing a terminal device in accordance with an embodiment of the present invention.
  • the terminal device shown in FIG. 5 is merely an example, and should not impose any limitation on the function and scope of use of the embodiments of the present invention.
  • computer system 500 includes a central processing unit (CPU) 501 that can be loaded into a program in random access memory (RAM) 503 according to a program stored in read only memory (ROM) 502 or from storage portion 508. And perform various appropriate actions and processes.
  • RAM random access memory
  • ROM read only memory
  • RAM 503 various programs and data required for the operation of the system 500 are also stored.
  • the CPU 501, the ROM 502, and the RAM 503 are connected to each other through a bus 504.
  • An input/output (I/O) interface 505 is also coupled to bus 504.
  • the following components are connected to the I/O interface 505: an input portion 506 including a keyboard, a mouse, etc.; an output portion 507 including, for example, a cathode ray tube (CRT), a liquid crystal display (LCD), and the like, and a storage portion 508 including a hard disk or the like. And a communication portion 509 including a network interface card such as a LAN card, a modem, or the like. The communication section 509 performs communication processing via a network such as the Internet.
  • Driver 510 is also coupled to I/O interface 505 as needed.
  • a removable medium 511 such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory or the like is mounted on the drive 510 as needed so that a computer program read therefrom is installed into the storage portion 508 as needed.
  • embodiments of the present disclosure include a computer program product comprising a computer program embodied on a computer readable medium, the computer program comprising program code for executing the method illustrated in the flowchart.
  • the computer program can be downloaded and installed from the network via the communication portion 509, and/or installed from the removable medium 511.
  • CPU central processing unit
  • the computer readable medium shown in the present invention may be a computer readable signal medium or a computer readable storage medium or any combination of the two.
  • the computer readable storage medium can be, for example, but not limited to, an electronic, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of computer readable storage media may include, but are not limited to, electrical connections having one or more wires, portable computer disks, hard disks, random access memory (RAM), read only memory (ROM), erasable Programmable read only memory (EPROM or flash memory), optical fiber, portable compact disk read only memory (CD-ROM), optical storage device, magnetic storage device, or any suitable combination of the foregoing.
  • a computer readable storage medium may be any tangible medium that can contain or store a program, which can be used by or in connection with an instruction execution system, apparatus or device.
  • a computer readable signal medium may include a data signal that is propagated in the baseband or as part of a carrier, in which computer readable program code is carried. Such propagated data signals can take a variety of forms including, but not limited to, electromagnetic signals, optical signals, or any suitable combination of the foregoing.
  • the computer readable signal medium can also be any computer readable medium other than a computer readable storage medium, which can transmit, propagate, or transport a program for use by or in connection with the instruction execution system, apparatus, or device.
  • Program code embodied on a computer readable medium can be transmitted by any appropriate medium, including but not limited to wireless, wire, optical cable, RF, etc., or any suitable combination of the foregoing.
  • each block of the flowchart or block diagrams can represent a module, a program segment, or a portion of code that includes one or more Executable instructions.
  • the functions noted in the blocks may also occur in a different order than that illustrated in the drawings. For example, two successively represented blocks may in fact be executed substantially in parallel, and they may sometimes be executed in the reverse order, depending upon the functionality involved.
  • each block of the block diagrams or flowcharts, and combinations of blocks in the block diagrams or flowcharts can be implemented by a dedicated hardware-based system that performs the specified function or operation, or can be used A combination of dedicated hardware and computer instructions is implemented.
  • the modules involved in the embodiments of the present invention may be implemented by software or by hardware.
  • the described modules may also be disposed in a processor, for example, as a processor including a transmitting module, an obtaining module, a determining module, and a first processing module.
  • the name of these modules does not constitute a limitation on the unit itself in some cases.
  • the sending module may also be described as a module that sends a picture acquisition request to the connected server.
  • the present invention also provides a computer readable medium, which may be included in the apparatus described in the above embodiments, or may be separately present and not incorporated in the apparatus.
  • the computer readable medium carries one or more programs, and when the one or more programs are executed by the device, the device includes: obtaining an IP and a plurality of geographic locations associated with the IP; using a clustering algorithm And clustering the plurality of geographic locations to obtain a geographical location clustering result of the IP; determining, according to the geographic location clustering result, an optimal geographic location corresponding to the IP by using an optimization algorithm; An excellent geographic location and a preset artificial neural network model to determine the precise geographic location of the IP.
  • the technical solution of the embodiment of the present invention uses a clustering algorithm to cluster the plurality of geographical locations to obtain a geographical location clustering result of each IP; and based on the geographical location clustering result, determine the The optimal geographic location corresponding to the IP; the technical means for determining the precise geographical location of the IP according to the optimal geographic location and the preset artificial neural network model, so the positioning accuracy is improved, and a large number of monitoring points are not required to be laid. Reduced costs.

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Position Fixing By Use Of Radio Waves (AREA)

Abstract

La présente invention se rapporte au domaine technique de l'Internet, et concerne ainsi un procédé et un dispositif permettant de déterminer un emplacement géographique précis. Un mode de réalisation préféré du procédé consiste à : acquérir un IP et une pluralité d'emplacements géographiques associés à l'IP ; regrouper la pluralité d'emplacements géographiques au moyen d'un algorithme de regroupement, de façon à obtenir un résultat de regroupement d'emplacements géographiques de l'IP ; déterminer, selon le résultat de regroupement d'emplacements géographiques, un emplacement géographique optimal correspondant à l'IP au moyen d'un algorithme d'optimisation ; et déterminer un emplacement géographique précis de l'IP selon l'emplacement géographique optimal et d'un modèle de réseau neuronal artificiel prédéfini. Selon ledit mode de réalisation préféré, l'emplacement géographique précis de l'IP peut être déterminé, ce qui permet d'améliorer la précision de positionnement sans avoir besoin d'installer un grand nombre de points de surveillance, de sorte que le coût soit réduit tandis que la précision de positionnement est augmentée.
PCT/CN2018/108635 2017-12-29 2018-09-29 Procédé et dispositif permettant de déterminer un emplacement géographique précis WO2019128355A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711481337.9 2017-12-29
CN201711481337.9A CN109995884B (zh) 2017-12-29 2017-12-29 确定精确地理位置的方法和装置

Publications (1)

Publication Number Publication Date
WO2019128355A1 true WO2019128355A1 (fr) 2019-07-04

Family

ID=67062986

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/108635 WO2019128355A1 (fr) 2017-12-29 2018-09-29 Procédé et dispositif permettant de déterminer un emplacement géographique précis

Country Status (2)

Country Link
CN (1) CN109995884B (fr)
WO (1) WO2019128355A1 (fr)

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109995884A (zh) * 2017-12-29 2019-07-09 北京京东尚科信息技术有限公司 确定精确地理位置的方法和装置
CN111080198A (zh) * 2019-11-29 2020-04-28 浙江大搜车软件技术有限公司 车辆物流路径生成的方法、装置、计算机设备及存储介质
CN111159493A (zh) * 2019-12-25 2020-05-15 乐山师范学院 一种基于特征权重的网络数据相似度计算方法与系统
CN111383051A (zh) * 2020-03-02 2020-07-07 杭州比智科技有限公司 实体对象的选址方法、装置、计算设备及计算机存储介质
CN111524176A (zh) * 2020-04-16 2020-08-11 深圳市沃特沃德股份有限公司 视距测量定位的方法、装置和计算机设备
CN111898624A (zh) * 2020-01-21 2020-11-06 北京畅行信息技术有限公司 定位信息的处理方法、装置、设备及存储介质
CN112801189A (zh) * 2021-01-29 2021-05-14 上海寻梦信息技术有限公司 经纬度异常检测方法、装置、电子设备和存储介质
CN113067913A (zh) * 2021-03-19 2021-07-02 北京达佳互联信息技术有限公司 定位方法、装置、服务器、介质及产品
CN113865604A (zh) * 2021-08-31 2021-12-31 北京三快在线科技有限公司 一种位置数据的生成方法和装置
US20220264250A1 (en) * 2019-11-04 2022-08-18 Beijing Digital Union Web Science And Technology Company Limited Ip positioning method and unit, computer storage medium and computing device
CN115242868A (zh) * 2022-07-13 2022-10-25 郑州埃文计算机科技有限公司 一种基于图神经网络的街道级ip地址定位方法

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110798543B (zh) * 2019-11-04 2020-11-10 北京数字联盟网络科技有限公司 Ip定位方法及装置、计算机存储介质、计算设备
CN111327721B (zh) * 2020-02-28 2023-01-10 加和(北京)信息科技有限公司 Ip地址定位方法及装置、存储介质及电子装置
CN112769702B (zh) * 2021-01-06 2023-07-21 郑州埃文计算机科技有限公司 一种基于路由器别名和基准点地理特征的路由器定位方法

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101267374A (zh) * 2008-04-18 2008-09-17 清华大学 基于神经网络和无线局域网基础架构的2.5d定位方法
US7543045B1 (en) * 2008-05-28 2009-06-02 International Business Machines Corporation System and method for estimating the geographical location and proximity of network devices and their directly connected neighbors
CN105718465A (zh) * 2014-12-02 2016-06-29 阿里巴巴集团控股有限公司 地理围栏生成方法及装置
CN105933294A (zh) * 2016-04-12 2016-09-07 晶赞广告(上海)有限公司 网络用户定位方法、装置及终端
CN106469205A (zh) * 2016-08-31 2017-03-01 百度在线网络技术(北京)有限公司 一种确定用户的地理位置信息的方法与装置

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE60036138T2 (de) * 2000-12-12 2008-05-21 Consejo Superior de Investigaciónes Científicas Nichtlineares datenabbildungs- und dimensionalitätsreduktionssystem
CN101814063A (zh) * 2010-05-24 2010-08-25 天津大学 基于距离权重的全局k-均值聚类算法
CN102932738A (zh) * 2012-10-31 2013-02-13 北京交通大学 一种改进的基于分簇神经网络的室内指纹定位方法
CN103561463B (zh) * 2013-10-24 2016-06-29 电子科技大学 一种基于样本聚类的rbf神经网络室内定位方法
CN104168341B (zh) * 2014-08-15 2018-01-19 北京百度网讯科技有限公司 Ip地址的定位方法和cdn调度方法以及装置
CN106534392B (zh) * 2015-09-10 2019-12-06 阿里巴巴集团控股有限公司 一种定位信息采集方法、定位方法及装置
CN106525678A (zh) * 2016-12-03 2017-03-22 安徽新华学院 一种基于地理位置的pm2.5浓度值的预测方法及装置
CN107247786A (zh) * 2017-06-15 2017-10-13 北京小度信息科技有限公司 用于确定相似用户的方法、装置和服务器
CN109995884B (zh) * 2017-12-29 2021-01-26 北京京东尚科信息技术有限公司 确定精确地理位置的方法和装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101267374A (zh) * 2008-04-18 2008-09-17 清华大学 基于神经网络和无线局域网基础架构的2.5d定位方法
US7543045B1 (en) * 2008-05-28 2009-06-02 International Business Machines Corporation System and method for estimating the geographical location and proximity of network devices and their directly connected neighbors
CN105718465A (zh) * 2014-12-02 2016-06-29 阿里巴巴集团控股有限公司 地理围栏生成方法及装置
CN105933294A (zh) * 2016-04-12 2016-09-07 晶赞广告(上海)有限公司 网络用户定位方法、装置及终端
CN106469205A (zh) * 2016-08-31 2017-03-01 百度在线网络技术(北京)有限公司 一种确定用户的地理位置信息的方法与装置

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109995884A (zh) * 2017-12-29 2019-07-09 北京京东尚科信息技术有限公司 确定精确地理位置的方法和装置
US20220264250A1 (en) * 2019-11-04 2022-08-18 Beijing Digital Union Web Science And Technology Company Limited Ip positioning method and unit, computer storage medium and computing device
CN111080198A (zh) * 2019-11-29 2020-04-28 浙江大搜车软件技术有限公司 车辆物流路径生成的方法、装置、计算机设备及存储介质
CN111080198B (zh) * 2019-11-29 2023-06-09 浙江大搜车软件技术有限公司 车辆物流路径生成的方法、装置、计算机设备及存储介质
CN111159493A (zh) * 2019-12-25 2020-05-15 乐山师范学院 一种基于特征权重的网络数据相似度计算方法与系统
CN111159493B (zh) * 2019-12-25 2023-07-18 乐山师范学院 一种基于特征权重的网络数据相似度计算方法与系统
CN111898624B (zh) * 2020-01-21 2024-04-02 北京畅行信息技术有限公司 定位信息的处理方法、装置、设备及存储介质
CN111898624A (zh) * 2020-01-21 2020-11-06 北京畅行信息技术有限公司 定位信息的处理方法、装置、设备及存储介质
CN111383051A (zh) * 2020-03-02 2020-07-07 杭州比智科技有限公司 实体对象的选址方法、装置、计算设备及计算机存储介质
CN111524176A (zh) * 2020-04-16 2020-08-11 深圳市沃特沃德股份有限公司 视距测量定位的方法、装置和计算机设备
CN112801189A (zh) * 2021-01-29 2021-05-14 上海寻梦信息技术有限公司 经纬度异常检测方法、装置、电子设备和存储介质
CN113067913B (zh) * 2021-03-19 2022-12-09 北京达佳互联信息技术有限公司 定位方法、装置、服务器、介质及产品
CN113067913A (zh) * 2021-03-19 2021-07-02 北京达佳互联信息技术有限公司 定位方法、装置、服务器、介质及产品
CN113865604A (zh) * 2021-08-31 2021-12-31 北京三快在线科技有限公司 一种位置数据的生成方法和装置
CN115242868A (zh) * 2022-07-13 2022-10-25 郑州埃文计算机科技有限公司 一种基于图神经网络的街道级ip地址定位方法

Also Published As

Publication number Publication date
CN109995884A (zh) 2019-07-09
CN109995884B (zh) 2021-01-26

Similar Documents

Publication Publication Date Title
WO2019128355A1 (fr) Procédé et dispositif permettant de déterminer un emplacement géographique précis
Meng et al. Truth discovery on crowd sensing of correlated entities
CN104995870B (zh) 多目标服务器布局确定方法和装置
US8825080B1 (en) Predicting geographic population density
US9495099B2 (en) Space-time-node engine signal structure
US10366113B2 (en) Method and system for generating a geocode trie and facilitating reverse geocode lookups
US10382556B2 (en) Iterative learning for reliable sensor sourcing systems
US10972862B2 (en) Visitor insights based on hyper-locating places-of-interest
CN110300084B (zh) 基于ip地址的画像方法和装置,电子设备,可读介质
Sallah et al. Mathematical models for predicting human mobility in the context of infectious disease spread: introducing the impedance model
CN114422267B (zh) 流量检测方法、装置、设备及介质
WO2022121801A1 (fr) Procédé et appareil de traitement d'informations, et dispositif électronique
CN109034232B (zh) 城市规划条件核实成果报告的自动化输出系统及控制方法
CN113128773B (zh) 地址预测模型的训练方法、地址预测方法及装置
CN113420067B (zh) 目标地点的位置可信度评估方法和装置
US20200053514A1 (en) Collaborative geo-positioning of electronic devices
WO2022017082A1 (fr) Procédé et appareil permettant de détecter des ordres de transaction faux
WO2023226448A1 (fr) Procédé et appareil de génération d'informations de points d'intérêt de logistique, et dispositif et support lisible par ordinateur
US11526800B2 (en) Determining value of corpora for machine learning using coresets
Zu et al. A delay deviation tolerance IP geolocation method with error estimation
Tai et al. TrustGeo: Uncertainty-Aware Dynamic Graph Learning for Trustworthy IP Geolocation
CN112269925A (zh) 一种获取地理位置点信息的方法和装置
CN113779370B (zh) 一种地址检索方法和装置
CN111339446A (zh) 一种兴趣点挖掘方法、装置、电子设备和存储介质
CN114978794B (zh) 网络接入方法、装置、存储介质以及电子设备

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18893734

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 28/09/2020)

122 Ep: pct application non-entry in european phase

Ref document number: 18893734

Country of ref document: EP

Kind code of ref document: A1