CN110661899A - Method and device for determining physical address of IP address - Google Patents

Method and device for determining physical address of IP address Download PDF

Info

Publication number
CN110661899A
CN110661899A CN201811636714.6A CN201811636714A CN110661899A CN 110661899 A CN110661899 A CN 110661899A CN 201811636714 A CN201811636714 A CN 201811636714A CN 110661899 A CN110661899 A CN 110661899A
Authority
CN
China
Prior art keywords
address
cluster
positioning information
preset
clusters
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811636714.6A
Other languages
Chinese (zh)
Inventor
朱航
张绍瑞
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Qihoo Technology Co Ltd
Original Assignee
Beijing Qihoo Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Qihoo Technology Co Ltd filed Critical Beijing Qihoo Technology Co Ltd
Priority to CN201811636714.6A priority Critical patent/CN110661899A/en
Publication of CN110661899A publication Critical patent/CN110661899A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L2101/00Indexing scheme associated with group H04L61/00
    • H04L2101/60Types of network addresses
    • H04L2101/618Details of network addresses
    • H04L2101/622Layer-2 addresses, e.g. medium access control [MAC] addresses
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/50Address allocation
    • H04L61/5007Internet protocol [IP] addresses

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The invention discloses a method and a device for determining a physical address of an IP address, electronic equipment and a computer readable storage medium. The method comprises the following steps: acquiring a plurality of positioning information of each IP address; inputting a plurality of positioning information of each acquired IP address into a specified model; determining one or more clusters which are in accordance with preset conditions and correspond to each IP address according to the designated model; the cluster comprises one or more positioning information; and calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition. Therefore, according to the technical scheme, even if the physical address of the IP address is changed, the physical address of the IP address can be determined through a plurality of positioning information of the IP address, and the requirements of users are met.

Description

Method and device for determining physical address of IP address
Technical Field
The invention relates to the technical field of computers, in particular to a method and a device for determining a physical address of an IP address, electronic equipment and a computer readable storage medium.
Background
The physical address of the IP address is added in the aspects of commercialization, product design, network security and the like, so that the commercial change, the product experience and the like are greatly improved. Therefore, the problem of physical address of IP address has been a major concern in the industry.
However, the physical address of the IP address is not fixed, and it is important to determine the physical address of the IP address accurately.
Disclosure of Invention
In view of the above problems, the present invention has been made to provide a method, an apparatus, an electronic device, and a computer-readable storage medium for physical address determination of an IP address that overcome or at least partially solve the above problems.
According to an aspect of the present invention, there is provided a method for determining a physical address of an IP address, wherein the method includes:
acquiring a plurality of positioning information of each IP address;
inputting a plurality of positioning information of each acquired IP address into a specified model;
determining one or more clusters which are in accordance with preset conditions and correspond to each IP address according to the specified model; the cluster comprises one or more positioning information;
and calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition.
Optionally, the determining, according to the specified model, one or more clusters corresponding to each IP address that meet a preset condition includes:
for a single IP address, the IP address is,
dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model;
judging whether clusters meeting preset conditions exist or not according to one or more pieces of positioning information in each cluster;
if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition;
if all the clusters do not meet the preset condition, adjusting the first model parameter, dividing a plurality of positioning information of the IP address into one or more clusters according to the adjusted first model parameter, and judging whether the clusters meeting the preset condition exist or not according to one or more positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; and if all the clusters do not meet the preset condition, adjusting the first model parameter again until the clusters meeting the preset condition exist.
Optionally, the determining, according to one or more pieces of positioning information in each cluster, whether a cluster meeting a preset condition exists includes:
calculating a radius value and a ratio value corresponding to each cluster according to one or more pieces of positioning information in each cluster;
judging whether the radius value of each cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value of each cluster is larger than or equal to a preset occupation ratio threshold value or not;
if yes, determining that the cluster meets a preset condition;
and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, determining that the cluster does not meet the preset condition.
Alternatively,
before the dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model, the method further includes:
using a plurality of positioning information of the acquired IP address as a cluster, and calculating a radius value and a ratio value corresponding to the cluster;
judging whether the radius value corresponding to the cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value corresponding to the cluster is larger than or equal to a preset occupation ratio threshold value or not;
if yes, directly outputting the cluster; the cluster comprises all positioning information of the acquired IP address;
and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, executing a step of dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model.
Optionally, the obtained positioning information includes time information;
after the inputting of the obtained positioning information of the IP addresses into the specified model, the method further comprises:
and sequencing the acquired positioning information of each IP address according to a time sequence, and filtering the positioning information corresponding to the illegal time points.
Optionally, after the inputting the obtained several positioning information of each IP address into the specified model, the method further includes:
and screening stable positioning information from the plurality of positioning information of the acquired IP addresses through the second model parameter of the specified model.
Optionally, the calculating a physical address of each IP address according to the determined one or more positioning information in the clusters meeting the preset condition includes:
and calculating the central position and the radius value corresponding to each cluster according to the determined positioning information in one or more clusters meeting the preset conditions, and taking the central position and the radius value corresponding to each cluster as the physical address information of the corresponding IP address.
Optionally, the specified model is a gaussian mixture model.
According to another aspect of the present invention, there is provided an apparatus for determining a physical address of an IP address, wherein the apparatus includes:
the acquisition unit is suitable for acquiring a plurality of positioning information of each IP address;
the input unit is suitable for inputting the acquired positioning information of each IP address into a specified model;
the determining unit is suitable for determining one or more clusters which are corresponding to each IP address and meet preset conditions according to the specified model; the cluster comprises one or more positioning information;
and the calculation unit is suitable for calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition.
Alternatively,
the determining unit is suitable for dividing a plurality of positioning information of an IP address into one or more clusters through the first model parameter of the specified model for the IP address; judging whether clusters meeting preset conditions exist or not according to one or more pieces of positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; if all the clusters do not meet the preset condition, adjusting the first model parameter, dividing a plurality of positioning information of the IP address into one or more clusters according to the adjusted first model parameter, and judging whether the clusters meeting the preset condition exist or not according to one or more positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; and if all the clusters do not meet the preset condition, adjusting the first model parameter again until the clusters meeting the preset condition exist.
Alternatively,
the determining unit is suitable for calculating a radius value and a ratio value corresponding to each cluster according to one or more pieces of positioning information in each cluster; judging whether the radius value of each cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value of each cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, determining that the cluster meets a preset condition; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, determining that the cluster does not meet the preset condition.
Alternatively,
the determining unit is adapted to, before the first model parameter of the specified model divides the positioning information of the IP address into one or more clusters, use the acquired positioning information of the IP address as a cluster, and calculate a radius value and an occupation ratio value corresponding to the cluster; judging whether the radius value corresponding to the cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value corresponding to the cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, directly outputting the cluster; the cluster comprises all positioning information of the acquired IP address; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, executing a step of dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model.
Optionally, the obtained positioning information includes time information;
the input unit is suitable for sorting the acquired positioning information of the IP addresses according to a time sequence after the acquired positioning information of the IP addresses is input into the designated model, and filtering the positioning information corresponding to the illegal time points.
Alternatively,
and the input unit is suitable for screening stable positioning information from the plurality of positioning information of each acquired IP address through the second model parameter of the specified model after the plurality of positioning information of each acquired IP address are input into the specified model.
Alternatively,
and the calculating unit is suitable for calculating the central position and the radius value corresponding to each cluster according to the determined positioning information in one or more clusters meeting the preset conditions, and taking the central position and the radius value corresponding to each cluster as the physical address information of the corresponding IP address.
Optionally, the specified model is a gaussian mixture model.
According to still another aspect of the present invention, there is provided an electronic apparatus, wherein the electronic apparatus includes:
a processor; and the number of the first and second groups,
a memory arranged to store computer executable instructions that, when executed, cause the processor to perform a method according to the foregoing.
According to yet another aspect of the present invention, there is provided a computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the aforementioned method.
According to the technical scheme of the invention, a plurality of positioning information of each IP address is obtained; inputting a plurality of positioning information of each acquired IP address into a specified model; determining one or more clusters which are in accordance with preset conditions and correspond to each IP address according to the designated model; the cluster comprises one or more positioning information; and calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition. Therefore, according to the technical scheme, even if the physical address of the IP address is changed, the physical address of the IP address can be determined through a plurality of positioning information of the IP address, and the requirements of users are met.
The foregoing description is only an overview of the technical solutions of the present invention, and the embodiments of the present invention are described below in order to make the technical means of the present invention more clearly understood and to make the above and other objects, features, and advantages of the present invention more clearly understandable.
Drawings
Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiments. The drawings are only for purposes of illustrating the preferred embodiments and are not to be construed as limiting the invention. Also, like reference numerals are used to refer to like parts throughout the drawings. In the drawings:
fig. 1 shows a flow chart of a method for determining a physical address of an IP address according to an embodiment of the invention;
fig. 2 is a schematic structural diagram of a physical address determination apparatus for an IP address according to an embodiment of the present invention;
FIG. 3 shows a schematic structural diagram of an electronic device according to one embodiment of the invention;
fig. 4 shows a schematic structural diagram of a computer-readable storage medium according to an embodiment of the present invention.
Detailed Description
Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.
Fig. 1 is a flowchart illustrating a method for determining a physical address of an IP address according to an embodiment of the present invention. As shown in fig. 1, the method includes:
step S110, a plurality of positioning information of each IP address is obtained.
Preferably, the location information of each IP address here can be obtained by a location log. Because the physical address of the IP address is not fixed, a plurality of positioning information is generated, where the positioning information includes location information, such as GPS data.
Since the number of positioning information of an IP address per day is in the order of billions or more, in order to reduce the amount of calculation and ensure efficiency, several pieces of positioning information herein may be defined by a preset value, for example, 1000, according to the needs.
Step S120, inputting a plurality of positioning information of the acquired IP addresses into a designated model.
Preferably, the specified model herein comprises a Gaussian mixture (GMM) model.
The gaussian model is a model formed based on a gaussian probability density function (normal distribution curve) by accurately quantizing an object using the gaussian probability density function (normal distribution curve) and decomposing one object into a plurality of objects.
Step S130, determining one or more clusters which are corresponding to each IP address and meet preset conditions according to the specified model; the cluster includes one or more positioning information.
It should be noted that, here, the condition that the cluster meets the preset condition means that the cluster meets the preset condition. That is, one or more pieces of positioning information constitute a cluster, and the cluster meets a preset condition.
Step S140, calculating the physical address of each IP address according to the determined one or more positioning information in the clusters meeting the preset condition.
Therefore, according to the embodiment, the physical address of the IP address is determined through the distribution of the historical positioning data of the IP address, and even if the physical address of the IP address is changed, the physical address of the IP address can be determined through a plurality of pieces of positioning information of the IP address, so that the requirements of users are met.
In an embodiment of the present invention, the determining, according to the specified model, one or more clusters corresponding to each IP address and meeting the preset condition in step S130 in the method shown in fig. 1 includes:
for an IP address, dividing a plurality of positioning information of the IP address into one or more clusters by specifying a first model parameter of a model; judging whether clusters meeting preset conditions exist or not according to one or more pieces of positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; if all the clusters do not meet the preset condition, adjusting a first model parameter, dividing a plurality of positioning information of the IP address into one or more clusters according to the adjusted first model parameter, and judging whether the clusters meeting the preset condition exist or not according to one or more positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; and if all the clusters do not meet the preset condition, adjusting the first model parameter again until the clusters meeting the preset condition exist.
In this embodiment, each cluster includes one or more positioning information, that is, a location information representing that the IP address is present in the area represented by the cluster, and a preset condition is set to determine which clusters (areas) are reliable, so that the physical address of the IP address can be determined by the cluster.
Specifically, the cluster is divided by specifying a first model parameter of the model, and if the divided cluster does not meet the condition, which indicates that the setting of the first model parameter is not accurate, the first model parameter can be adjusted, that is, the standard for dividing the cluster is readjusted until there is a cluster meeting the preset condition.
For example, according to a value a corresponding to a first model parameter, an IP address is divided into 10 clusters, if it is determined that none of the 10 clusters meets a preset condition, it is determined that the value of the first model parameter is inaccurate, then the value corresponding to the first model parameter is adjusted from a to b, according to a value b corresponding to the first model parameter, an IP address is divided into 5 clusters again, it is determined whether the 5 clusters meet the preset condition, if one cluster meets the condition, it is determined that the cluster is reliable, and the IP address physical address can be determined.
It should be noted that, for each IP address, the above-mentioned scheme may be adopted to determine one or more corresponding clusters that meet the preset condition.
In practical applications, when a first IP address is executed, an initial value of a first model parameter may be set, and then, after determining a cluster of the IP address that meets a preset condition, a final value of the first model parameter is determined, and then, the final value of the first model parameter is executed only as the initial value of the first model parameter of a second IP address, and so on.
Specifically, the above determining whether there is a cluster satisfying a preset condition according to one or more positioning information in each cluster includes: calculating a radius value and a ratio value corresponding to each cluster according to one or more pieces of positioning information in each cluster; judging whether the radius value of each cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value of each cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, determining that the cluster meets a preset condition; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, determining that the cluster does not meet the preset condition.
In the process of determining one or more clusters meeting the preset condition, the first model parameter needs to be adjusted, and it can be understood that the designated model is a basic model, and the designated model is continuously trained by using the positioning information of the IP address so as to train the IP address with the highest quality. When an IP address appears in the positioning data, its corresponding location information always appears in a certain area range, and it can be determined that the physical address of the IP address is there. The range can be expressed in terms of a Center position (Center) and a Radius (Radius). However, most of an IP address is present in not only one area but also a plurality of areas, and these areas are independent of each other, and it is necessary to measure which area is the most reliable with a confidence (pro). Therefore, in order to determine the physical address of an IP address and find an IP address with the highest quality, it is necessary to find an area with the smallest Radius (Radius) and the largest possible area (pro) as the information of the IP address. Therefore, two parameters, namely the maximum radius value (preset radius threshold value) and the minimum duty ratio value (preset duty ratio threshold value), are set as the threshold values for controlling training.
In this embodiment, each area is represented by a cluster, and it can be determined that the cluster meets the preset condition only if the radius and the ratio of the cluster both satisfy the condition. And if the radius of the cluster is smaller than or equal to the preset radius threshold value and the ratio value is larger than or equal to the preset ratio threshold value, directly outputting the cluster. If the radius of the cluster is greater than a preset radius threshold; or the occupation ratio value is smaller than a preset occupation ratio threshold value; or, if the radius of the cluster is larger than the preset radius threshold and the ratio value is smaller than the preset ratio threshold, it is determined that the cluster is not eligible, and the first model parameter needs to be adjusted.
For example, if the radius of a cluster is greater than a preset radius threshold, and the occupancy is greater than a preset occupancy threshold, which indicates that the radius is not satisfactory, the first model parameter is adjusted, and the cluster range is narrowed when the cluster is divided; if the radius of one cluster is smaller than the preset radius threshold value, the occupation ratio is smaller than the preset occupation ratio threshold value, and the occupation ratio is not in accordance with the requirement, the first model parameter is adjusted, and the trend that the occupation ratio of the cluster is increased is adjusted.
In an embodiment of the present invention, before the dividing the plurality of positioning information of the IP address into one or more clusters by specifying the first model parameter of the model, the method shown in fig. 1 further includes: using a plurality of positioning information of the acquired IP address as a cluster, and calculating a radius value and a ratio value corresponding to the cluster; judging whether the radius value corresponding to the cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value corresponding to the cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, directly outputting the cluster; the cluster comprises all positioning information of the acquired IP address; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, executing a step of dividing a plurality of positioning information of the IP address into one or more clusters through a first model parameter of the designated model.
Considering that if the change of the physical address of an IP address is small, the acquired positioning information of the IP address is in a small range, and the range meets the preset condition, before dividing the positioning information of the IP address, the whole judgment is performed, that is, all the acquired positioning information of the IP address is taken as a cluster, the radius value and the ratio value corresponding to the cluster are calculated, whether the cluster meets the condition is judged, and if yes, the cluster containing all the positioning information is directly output; if not, dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model.
The acquired positioning information of each IP address is not directly usable, and needs to be preprocessed. See in particular the examples described below.
In one embodiment of the present invention, the positioning information obtained in the method shown in fig. 1 includes time information.
After inputting the acquired positioning information of each IP address into the designated model in step S120, the method shown in fig. 1 further includes: and sequencing the acquired positioning information of each IP address according to a time sequence, and filtering the positioning information corresponding to the illegal time points.
In an embodiment of the present invention, after inputting the obtained several positioning information of each IP address into the specified model in step S120 of the method shown in fig. 1, the method shown in fig. 1 further includes: and screening stable positioning information from the acquired positioning information of each IP address through a second model parameter of the designated model.
In this embodiment, the filtering of the positioning information may be implemented by setting a designated model parameter (second model parameter), for example, setting parameters such as covariance jump and time ratio, and filtering the positioning information with the closest variance jump being smooth.
In an embodiment of the present invention, the calculating the physical address of each IP address according to the determined location information in one or more clusters meeting the preset condition in step S140 of the method shown in fig. 1 includes: and calculating the central position corresponding to each cluster according to the determined positioning information in one or more clusters meeting the preset conditions, and taking the central position corresponding to each cluster as the physical address of the corresponding IP address.
In this embodiment, a central position of the cluster and a radius value of an area corresponding to the cluster can be determined by using the location information of each positioning information, and then the central position and the radius value are used as parameters describing physical address information of an IP address.
Preferably, the above-mentioned specified model is a mixture gaussian model.
Fig. 2 is a schematic structural diagram of an IP address physical address determination apparatus according to an embodiment of the present invention. As shown in fig. 2, the IP address physical address determination apparatus 200 includes:
the obtaining unit 210 is adapted to obtain a plurality of positioning information of each IP address.
Preferably, the location information of each IP address here can be obtained by a location log. Because the physical address of the IP address is not fixed, a plurality of positioning information is generated, where the positioning information includes location information, such as GPS data.
Since the number of positioning information of an IP address per day is in the order of billions or more, in order to reduce the amount of calculation and ensure efficiency, several pieces of positioning information herein may be defined by a preset value, for example, 1000, according to the needs.
An input unit 220 adapted to input the acquired positioning information of each IP address into the specified model.
Preferably, the specified model herein comprises a Gaussian mixture (GMM) model.
The gaussian model is a model formed based on a gaussian probability density function (normal distribution curve) by accurately quantizing an object using the gaussian probability density function (normal distribution curve) and decomposing one object into a plurality of objects.
A determining unit 230, adapted to determine, according to the specified model, one or more clusters that meet preset conditions and correspond to each IP address; the cluster includes one or more positioning information.
It should be noted that, here, the condition that the cluster meets the preset condition means that the cluster meets the preset condition. That is, one or more pieces of positioning information constitute a cluster, and the cluster meets a preset condition.
And a calculating unit 240 adapted to calculate a physical address of each IP address according to the determined one or more positioning information in the clusters meeting the preset condition.
Therefore, according to the embodiment, the physical address of the IP address is determined through the distribution of the historical positioning data of the IP address, and even if the physical address of the IP address is changed, the physical address of the IP address can be determined through a plurality of pieces of positioning information of the IP address, so that the requirements of users are met.
In an embodiment of the present invention, the determining unit 230 shown in fig. 2 is adapted to, for an IP address, divide a number of positioning information of the IP address into one or more clusters by specifying a first model parameter of a model; judging whether clusters meeting preset conditions exist or not according to one or more pieces of positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; if all the clusters do not meet the preset condition, adjusting a first model parameter, dividing a plurality of positioning information of the IP address into one or more clusters according to the adjusted first model parameter, and judging whether the clusters meeting the preset condition exist or not according to one or more positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; and if all the clusters do not meet the preset condition, adjusting the first model parameter again until the clusters meeting the preset condition exist.
In this embodiment, each cluster includes one or more positioning information, that is, a location information representing that the IP address is present in the area represented by the cluster, and a preset condition is set to determine which clusters (areas) are reliable, so that the physical address of the IP address can be determined by the cluster.
Specifically, the cluster is divided by specifying a first model parameter of the model, and if the divided cluster does not meet the condition, which indicates that the setting of the first model parameter is not accurate, the first model parameter can be adjusted, that is, the standard for dividing the cluster is readjusted until there is a cluster meeting the preset condition.
For example, according to a value a corresponding to a first model parameter, an IP address is divided into 10 clusters, if it is determined that none of the 10 clusters meets a preset condition, it is determined that the value of the first model parameter is inaccurate, then the value corresponding to the first model parameter is adjusted from a to b, according to a value b corresponding to the first model parameter, an IP address is divided into 5 clusters again, it is determined whether the 5 clusters meet the preset condition, if one cluster meets the condition, it is determined that the cluster is reliable, and the IP address physical address can be determined.
It should be noted that, for each IP address, the above-mentioned scheme may be adopted to determine one or more corresponding clusters that meet the preset condition.
In practical applications, when a first IP address is executed, an initial value of a first model parameter may be set, and then, after determining a cluster of the IP address that meets a preset condition, a final value of the first model parameter is determined, and then, the final value of the first model parameter is executed only as the initial value of the first model parameter of a second IP address, and so on.
Specifically, the determining unit 230 is adapted to calculate a radius value and a ratio value corresponding to each cluster according to one or more positioning information in each cluster; judging whether the radius value of each cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value of each cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, determining that the cluster meets a preset condition; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, determining that the cluster does not meet the preset condition.
In the process of determining one or more clusters meeting the preset condition, the first model parameter needs to be adjusted, and it can be understood that the designated model is a basic model, and the designated model is continuously trained by using the positioning information of the IP address so as to train the IP address with the highest quality. When an IP address appears in the positioning data, its corresponding location information always appears in a certain area range, and it can be determined that the physical address of the IP address is there. The range can be expressed in terms of a Center position (Center) and a Radius (Radius). However, most of an IP address is present in not only one area but also a plurality of areas, and these areas are independent of each other, and it is necessary to measure which area is the most reliable with a confidence (pro). Therefore, in order to determine the physical address of a UIP address and find an IP address with the highest quality possible, it is necessary to find an area with the Radius (Radius) as small as possible and occupying the area (pro) as large as possible as information of the IP address. Therefore, two parameters, namely the maximum radius value (preset radius threshold value) and the minimum duty ratio value (preset duty ratio threshold value), are set as the threshold values for controlling training.
In this embodiment, each area is represented by a cluster, and it can be determined that the cluster meets the preset condition only if the radius and the ratio of the cluster both satisfy the condition. And if the radius of the cluster is smaller than or equal to the preset radius threshold value and the ratio value is larger than or equal to the preset ratio threshold value, directly outputting the cluster. If the radius of the cluster is greater than a preset radius threshold; or the occupation ratio value is smaller than a preset occupation ratio threshold value; or, if the radius of the cluster is larger than the preset radius threshold and the ratio value is smaller than the preset ratio threshold, it is determined that the cluster is not eligible, and the first model parameter needs to be adjusted.
For example, if the radius of a cluster is greater than a preset radius threshold, and the occupancy is greater than a preset occupancy threshold, which indicates that the radius is not satisfactory, the first model parameter is adjusted, and the cluster range is narrowed when the cluster is divided; if the radius of one cluster is smaller than the preset radius threshold value, the occupation ratio is smaller than the preset occupation ratio threshold value, and the occupation ratio is not in accordance with the requirement, the first model parameter is adjusted, and the trend that the occupation ratio of the cluster is increased is adjusted.
In an embodiment of the present invention, the determining unit 230 shown in fig. 2 is adapted to, before the positioning information of the IP address is divided into one or more clusters by specifying the first model parameter of the model, take the acquired positioning information of the IP address as a cluster, and calculate a radius value and an occupancy value corresponding to the cluster; judging whether the radius value corresponding to the cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value corresponding to the cluster is larger than or equal to a preset occupation ratio threshold value or not; if yes, directly outputting the cluster; the cluster comprises all positioning information of the acquired IP address; and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, executing a step of dividing a plurality of positioning information of the IP address into one or more clusters through a first model parameter of the designated model.
Considering that if the change of the physical address of an IP address is small, the acquired positioning information of the IP address is in a small range, and the range meets the preset condition, before dividing the positioning information of the IP address, the whole judgment is performed, that is, all the acquired positioning information of the IP address is taken as a cluster, the radius value and the ratio value corresponding to the cluster are calculated, whether the cluster meets the condition is judged, and if yes, the cluster containing all the positioning information is directly output; if not, dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model.
The acquired positioning information of each IP address is not directly usable, and needs to be preprocessed. See in particular the examples described below.
In one embodiment of the present invention, the positioning information acquired by the acquiring unit 210 shown in fig. 2 includes time information.
The input unit 220 is adapted to sort the acquired positioning information of each IP address according to a time sequence after inputting the acquired positioning information of each IP address into the designated model, and filter the positioning information corresponding to the illegal time point.
In an embodiment of the present invention, the input unit 220 shown in fig. 2 is adapted to, after inputting the obtained several positioning information of each IP address into the specified model, screen smooth positioning information from the obtained several positioning information of each IP address through the second model parameters of the specified model.
In this embodiment, the filtering of the positioning information may be implemented by setting a designated model parameter (second model parameter), for example, setting parameters such as covariance jump and time ratio, and filtering the positioning information with the closest variance jump being smooth.
In an embodiment of the present invention, the calculating unit 240 shown in fig. 2 is adapted to calculate a central position corresponding to each cluster according to the determined positioning information in one or more clusters meeting the preset condition, and use the central position corresponding to each cluster as a physical address of a corresponding IP address.
In this embodiment, a central position of the cluster and a radius value of an area corresponding to the cluster can be determined by using the location information of each positioning information, and then the central position and the radius value are used as parameters describing physical address information of an IP address.
Preferably, the above-mentioned specified model is a Gaussian mixture (GMM) model.
In summary, according to the technical solution of the present invention, a plurality of positioning information of each IP address is obtained; inputting a plurality of positioning information of each acquired IP address into a specified model; determining one or more clusters which are in accordance with preset conditions and correspond to each IP address according to the designated model; the cluster comprises one or more positioning information; and calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition. Therefore, according to the technical scheme, even if the physical address of the IP address is changed, the physical address of the IP address can be determined through a plurality of positioning information of the IP address, and the requirements of users are met.
It should be noted that:
the algorithms and displays presented herein are not inherently related to any particular computer, virtual machine, or other apparatus. Various general purpose devices may be used with the teachings herein. The required structure for constructing such a device will be apparent from the description above. Moreover, the present invention is not directed to any particular programming language. It is appreciated that a variety of programming languages may be used to implement the teachings of the present invention as described herein, and any descriptions of specific languages are provided above to disclose the best mode of the invention.
In the description provided herein, numerous specific details are set forth. It is understood, however, that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure an understanding of this description.
Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, various features of the invention are sometimes grouped together in a single embodiment, figure, or description thereof for the purpose of streamlining the disclosure and aiding in the understanding of one or more of the various inventive aspects. However, the disclosed method should not be interpreted as reflecting an intention that: that the invention as claimed requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the detailed description are hereby expressly incorporated into this detailed description, with each claim standing on its own as a separate embodiment of this invention.
Those skilled in the art will appreciate that the modules in the device in an embodiment may be adaptively changed and disposed in one or more devices different from the embodiment. The modules or units or components of the embodiments may be combined into one module or unit or component, and furthermore they may be divided into a plurality of sub-modules or sub-units or sub-components. All of the features disclosed in this specification (including any accompanying claims, abstract and drawings), and all of the processes or elements of any method or apparatus so disclosed, may be combined in any combination, except combinations where at least some of such features and/or processes or elements are mutually exclusive. Each feature disclosed in this specification (including any accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
Furthermore, those skilled in the art will appreciate that while some embodiments described herein include some features included in other embodiments, rather than other features, combinations of features of different embodiments are meant to be within the scope of the invention and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
The various component embodiments of the invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. It will be appreciated by those skilled in the art that a microprocessor or Digital Signal Processor (DSP) may be used in practice to implement some or all of the functions of some or all of the components of the physical address determination apparatus, the electronic device, and the computer-readable storage medium of IP addresses in accordance with embodiments of the present invention. The present invention may also be embodied as apparatus or device programs (e.g., computer programs and computer program products) for performing a portion or all of the methods described herein. Such programs implementing the present invention may be stored on computer-readable media or may be in the form of one or more signals. Such a signal may be downloaded from an internet website or provided on a carrier signal or in any other form.
For example, fig. 3 shows a schematic structural diagram of an electronic device according to an embodiment of the invention. The electronic device 300 conventionally comprises a processor 310 and a memory 320 arranged to store computer-executable instructions (program code). The memory 320 may be an electronic memory such as a flash memory, an EEPROM (electrically erasable programmable read only memory), an EPROM, a hard disk, or a ROM. Memory 320 has storage space 330 for storing program code 340 for performing the method steps shown in fig. 1 and in any of the embodiments. For example, the storage space 330 for the program code may comprise respective program codes 340 for implementing respective steps in the above method. The program code can be read from or written to one or more computer program products. These computer program products comprise a program code carrier such as a hard disk, a Compact Disc (CD), a memory card or a floppy disk. Such a computer program product is generally a computer-readable storage medium 400 such as described in fig. 4. The computer-readable storage medium 400 may have memory segments, memory spaces, etc. arranged similarly to the memory 320 in the electronic device of fig. 3. The program code may be compressed, for example, in a suitable form. In general, the memory unit stores a program code 410 for performing the steps of the method according to the invention, i.e. a program code readable by a processor such as 310, which program code, when executed by an electronic device, causes the electronic device to perform the individual steps of the method described above.
It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.

Claims (10)

1. A method for determining a physical address of an IP address, wherein the method comprises:
acquiring a plurality of positioning information of each IP address;
inputting a plurality of positioning information of each acquired IP address into a specified model;
determining one or more clusters which are in accordance with preset conditions and correspond to each IP address according to the specified model; the cluster comprises one or more positioning information;
and calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition.
2. The method of claim 1, wherein the determining, according to the specified model, one or more clusters corresponding to each IP address that meet a preset condition comprises:
for a single IP address, the IP address is,
dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model;
judging whether clusters meeting preset conditions exist or not according to one or more pieces of positioning information in each cluster;
if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition;
if all the clusters do not meet the preset condition, adjusting the first model parameter, dividing a plurality of positioning information of the IP address into one or more clusters according to the adjusted first model parameter, and judging whether the clusters meeting the preset condition exist or not according to one or more positioning information in each cluster; if the cluster meeting the preset condition exists, outputting the cluster meeting the preset condition; and if all the clusters do not meet the preset condition, adjusting the first model parameter again until the clusters meeting the preset condition exist.
3. The method according to claim 1 or 2, wherein the determining whether there is a cluster satisfying a preset condition according to one or more positioning information in each cluster comprises:
calculating a radius value and a ratio value corresponding to each cluster according to one or more pieces of positioning information in each cluster;
judging whether the radius value of each cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value of each cluster is larger than or equal to a preset occupation ratio threshold value or not;
if yes, determining that the cluster meets a preset condition;
and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, determining that the cluster does not meet the preset condition.
4. The method of any one of claims 1-3,
before the dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model, the method further includes:
using a plurality of positioning information of the acquired IP address as a cluster, and calculating a radius value and a ratio value corresponding to the cluster;
judging whether the radius value corresponding to the cluster is smaller than or equal to a preset radius threshold value or not, and judging whether the occupation ratio value corresponding to the cluster is larger than or equal to a preset occupation ratio threshold value or not;
if yes, directly outputting the cluster; the cluster comprises all positioning information of the acquired IP address;
and if the radius value is larger than the preset radius threshold value and/or the occupation ratio value is smaller than the preset occupation ratio threshold value, executing a step of dividing a plurality of positioning information of the IP address into one or more clusters through the first model parameter of the specified model.
5. The method according to any of claims 1-4, wherein the acquired positioning information comprises time information;
after the inputting of the obtained positioning information of the IP addresses into the specified model, the method further comprises:
and sequencing the acquired positioning information of each IP address according to a time sequence, and filtering the positioning information corresponding to the illegal time points.
6. The method according to any of claims 1-5, wherein after entering a number of positioning information for each acquired IP address into a specified model, the method further comprises:
and screening stable positioning information from the plurality of positioning information of the acquired IP addresses through the second model parameter of the specified model.
7. The method according to any one of claims 1 to 6, wherein the calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition comprises:
and calculating the central position and the radius value corresponding to each cluster according to the determined positioning information in one or more clusters meeting the preset conditions, and taking the central position and the radius value corresponding to each cluster as the physical address information of the corresponding IP address.
8. An apparatus for determining a physical address of an IP address, wherein the apparatus comprises:
the acquisition unit is suitable for acquiring a plurality of positioning information of each IP address;
the input unit is suitable for inputting the acquired positioning information of each IP address into a specified model;
the determining unit is suitable for determining one or more clusters which are corresponding to each IP address and meet preset conditions according to the specified model; the cluster comprises one or more positioning information;
and the calculation unit is suitable for calculating the physical address of each IP address according to the determined positioning information in one or more clusters meeting the preset condition.
9. An electronic device, wherein the electronic device comprises:
a processor; and the number of the first and second groups,
a memory arranged to store computer executable instructions that, when executed, cause the processor to perform the method of any one of claims 1 to 7.
10. A computer readable storage medium, wherein the computer readable storage medium stores one or more programs which, when executed by a processor, implement the method of any of claims 1-7.
CN201811636714.6A 2018-12-29 2018-12-29 Method and device for determining physical address of IP address Pending CN110661899A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811636714.6A CN110661899A (en) 2018-12-29 2018-12-29 Method and device for determining physical address of IP address

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811636714.6A CN110661899A (en) 2018-12-29 2018-12-29 Method and device for determining physical address of IP address

Publications (1)

Publication Number Publication Date
CN110661899A true CN110661899A (en) 2020-01-07

Family

ID=69028612

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811636714.6A Pending CN110661899A (en) 2018-12-29 2018-12-29 Method and device for determining physical address of IP address

Country Status (1)

Country Link
CN (1) CN110661899A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111327721A (en) * 2020-02-28 2020-06-23 加和(北京)信息科技有限公司 IP address positioning method and device, storage medium and electronic device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110055212A1 (en) * 2009-09-01 2011-03-03 Cheng-Fa Tsai Density-based data clustering method
CN105933294A (en) * 2016-04-12 2016-09-07 晶赞广告(上海)有限公司 Network user positioning method, device and terminal
CN106131238A (en) * 2016-06-30 2016-11-16 百度在线网络技术(北京)有限公司 The sorting technique of IP address and device
CN107317891A (en) * 2017-05-10 2017-11-03 郑州埃文计算机科技有限公司 A kind of geographic position locating method being distributed towards dynamic IP multizone

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110055212A1 (en) * 2009-09-01 2011-03-03 Cheng-Fa Tsai Density-based data clustering method
CN105933294A (en) * 2016-04-12 2016-09-07 晶赞广告(上海)有限公司 Network user positioning method, device and terminal
CN106131238A (en) * 2016-06-30 2016-11-16 百度在线网络技术(北京)有限公司 The sorting technique of IP address and device
CN107317891A (en) * 2017-05-10 2017-11-03 郑州埃文计算机科技有限公司 A kind of geographic position locating method being distributed towards dynamic IP multizone

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111327721A (en) * 2020-02-28 2020-06-23 加和(北京)信息科技有限公司 IP address positioning method and device, storage medium and electronic device
CN111327721B (en) * 2020-02-28 2023-01-10 加和(北京)信息科技有限公司 IP address positioning method and device, storage medium and electronic device

Similar Documents

Publication Publication Date Title
CN107766888A (en) Data processing method and device
CN112200081A (en) Abnormal behavior identification method and device, electronic equipment and storage medium
CN106651213B (en) Service order processing method and device
CN109859113B (en) Model generation method, image enhancement method, device and computer-readable storage medium
CN109145981B (en) Deep learning automatic model training method and equipment
CN110827086A (en) Product marketing prediction method and device, computer equipment and readable storage medium
CN109214446A (en) Potentiality good performance personnel kind identification method, system, terminal and computer readable storage medium
CN111737473A (en) Text classification method, device and equipment
CN109582741A (en) Characteristic treating method and apparatus
CN109214671B (en) Personnel grouping method, device, electronic device and computer readable storage medium
CN114219306A (en) Method, apparatus, medium, and program product for creating a weld quality detection model
CN110717394A (en) Training method and device of face recognition model, electronic equipment and storage medium
CN110661899A (en) Method and device for determining physical address of IP address
CN111143148B (en) Model parameter determining method, device and storage medium
US11544568B2 (en) Method for optimizing a data model and device using the same
CN112445978A (en) Electronic book pushing method, electronic equipment and storage medium
CN113674023A (en) Rights upgrading method, device, equipment and storage medium based on member level
CN109118274B (en) Task point denoising and classifying method and system, terminal device and storage medium
CN104951551A (en) Data classifying method and system
CN110674104B (en) Feature combination screening method, device, computer equipment and storage medium
CN111967616A (en) Automatic time series regression method and device
CN111121803A (en) Method and device for acquiring common stop points of road
CN115238194A (en) Book recommendation method, computing device and computer storage medium
CN115099356A (en) Industrial unbalanced data classification method and device, electronic equipment and storage medium
CN114782780A (en) Data set construction method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20200107