CN112040024B - Data processing method, device, equipment and storage medium - Google Patents

Data processing method, device, equipment and storage medium Download PDF

Info

Publication number
CN112040024B
CN112040024B CN202010888438.3A CN202010888438A CN112040024B CN 112040024 B CN112040024 B CN 112040024B CN 202010888438 A CN202010888438 A CN 202010888438A CN 112040024 B CN112040024 B CN 112040024B
Authority
CN
China
Prior art keywords
address
target
lbs
information
lbs information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010888438.3A
Other languages
Chinese (zh)
Other versions
CN112040024A (en
Inventor
李栋梁
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Minglue Zhaohui Technology Co Ltd
Original Assignee
Beijing Minglue Zhaohui Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Minglue Zhaohui Technology Co Ltd filed Critical Beijing Minglue Zhaohui Technology Co Ltd
Priority to CN202010888438.3A priority Critical patent/CN112040024B/en
Publication of CN112040024A publication Critical patent/CN112040024A/en
Application granted granted Critical
Publication of CN112040024B publication Critical patent/CN112040024B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L2101/00Indexing scheme associated with group H04L61/00
    • H04L2101/60Types of network addresses
    • H04L2101/69Types of network addresses using geographic information, e.g. room number
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/50Network services
    • H04L67/52Network services specially adapted for the location of the user terminal

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Telephonic Communication Services (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)

Abstract

The application provides a data processing method, a device, equipment and a storage medium, wherein the method comprises the following steps: acquiring an Internet Protocol (IP) address and location-based service (LBS) information included in advertisement log information generated in a current preset time period; counting LBS information in the advertisement log information with the same IP address, and determining the minimum administrative area to which each IP address belongs; the advertisement monitoring table comprising the corresponding relation between each IP address and the minimum administrative area is established, and the geographic position of the IP address is judged through a plurality of LBS information in the advertisement monitoring log, so that the accuracy of the geographic position of the IP address is improved.

Description

Data processing method, device, equipment and storage medium
Technical Field
The application relates to the technical field of third-party advertisement monitoring, in particular to a data processing method, a device, equipment and a storage medium.
Background
With the development of advertising industry, more and more advertisements appear in the field of view of the public, advertisers need to monitor the advertisements which are put out in order to put out more accurate advertisements according to regions, the advertisement monitoring data comprise IP addresses (Internet Protocol ), the IP addresses are IP addresses corresponding to clients playing the advertisements, after the IP addresses are monitored, the geographic positions of the IP addresses need to be judged, the geographic positions of the IP addresses are judged, the prior art purchases an IP address library, then uses the IP addresses to find out the position information corresponding to the IP addresses in the IP address library, and uses the position information as the geographic positions of the IP addresses, but because a network server adopts a method for dynamically distributing the IP addresses, the IP addresses can change frequently, for example, the corresponding IP addresses are different when the same user is surfing the internet each time; the same device is connected with different WiFi (Wireless Fidelity, wireless Internet surfing) or switching 4G (the 4th Generation mobile communication technology, fourth generation mobile phone mobile communication standard) networks, and the IP address is also changed; the IP address library cannot be updated along with the variation of the IP address, so that the geographic position corresponding to the IP address actually on the internet of the user is different from the position information corresponding to the IP address in the IP address library, and therefore, the method for judging the geographic position of the IP address by the position information corresponding to the IP address in the IP address library has larger error, and the accuracy of the geographic position of the IP address obtained from the IP address library is lower.
Disclosure of Invention
In view of this, embodiments of the present application provide a data processing method, apparatus, device, and storage medium, so as to improve accuracy of geographic location of an IP address.
In a first aspect, an embodiment of the present application provides a data processing method, including:
acquiring an Internet Protocol (IP) address and location-based service (LBS) information included in advertisement log information generated in a current preset time period;
counting LBS information in the advertisement log information with the same IP address, and determining the minimum administrative area to which each IP address belongs;
and establishing an advertisement monitoring table containing the corresponding relation between each IP address and the minimum administrative area.
According to the data processing method provided by the embodiment of the application, the geographic position of the IP address is judged by adopting a plurality of LBS (Location Based Service) information in the advertisement monitoring log, compared with the IP address which is randomly distributed, the LBS information of the user client side is based on the actual geographic position of the user to acquire the LBS longitude and latitude information of the user, the positioning precision is higher, and the corresponding relation between the LBS longitude and latitude information of the user client side and the LBS geographic position cannot be changed, so that compared with the method for judging the geographic position of the IP address through the position information corresponding to the IP address in the IP address library in the prior art, the geographic position of the IP address is judged through a plurality of LBS information corresponding to the same IP address, and the accuracy of the geographic position of the IP address is improved.
With reference to the first aspect, an embodiment of the present application provides a first possible implementation manner of the first aspect, wherein the counting LBS information in the advertisement log information with the same IP address to determine a minimum administrative area to which each IP address belongs includes:
counting LBS information in the advertisement log information with the same IP address to obtain the total frequency of LBS information corresponding to each IP address;
screening the total frequency of LBS information corresponding to each IP address to obtain a target IP address, wherein the total frequency of LBS information corresponding to the target IP address is larger than a preset frequency;
and for each target IP address, determining the minimum administrative region to which the target IP address belongs according to the frequency of each LBS information included in the target IP address.
Further, according to the data processing method provided by the embodiment of the application, all LBS information corresponding to the same IP address is counted through big data, the total number of all LBS information corresponding to the same IP address is calculated, the IP addresses, of which the total number of the LBS information exceeds the threshold value, in the advertisement monitoring log are screened out through the set threshold value, the number of LBS corresponding to the screened out IP addresses is more, so that the accuracy of the LBS information is higher, and the accuracy of the LBS information in the advertisement monitoring statistical data is improved through the counting method.
With reference to the first possible implementation manner of the first aspect, the embodiment of the present application provides a second possible implementation manner of the first aspect, wherein the determining, according to the frequency of LBS information included in the target IP address, the minimum administrative area to which the target IP address belongs includes:
when the number of the LBS information included in the target IP address is 1, taking an administrative area of a position corresponding to the LBS information included in the target IP address as the minimum administrative area;
when the number of LBS information included in the target IP address is larger than 1, determining the minimum administrative region to which the target IP address belongs according to the ratio of the frequency of each LBS information included in the target IP address to the total frequency of the LBS information corresponding to the target IP address.
Further, according to the data processing method provided by the embodiment of the present application, different data processing modes are performed for different LBS information amounts, and when the LBS information amounts to be greater than 1, complex processing needs to be performed on the IP address corresponding to the LBS information, that is: the duty ratio of each LBS information corresponding to the IP address is calculated and screened, and the method improves the efficiency of data processing.
With reference to the second possible implementation manner of the first aspect, the embodiment of the present application provides a third possible implementation manner of the first aspect, wherein the determining, according to a ratio of a frequency of each LBS information included in the target IP address to a total frequency of LBS information corresponding to the target IP address, the minimum administrative region to which the target IP address belongs includes:
Judging whether each piece of LBS information included in the target IP address includes target LBS information, wherein the target LBS information is LBS information with the ratio being greater than or equal to a preset ratio;
if the target LBS information is included, taking an administrative area of a position corresponding to the target LBS address as the minimum administrative area;
and if the target LBS information is not included, taking the upper level administrative area of the position corresponding to each LBS information included in the target IP address as the minimum administrative area.
Further, according to the data processing method provided by the embodiment of the application, the frequency duty ratio of each piece of LBS information corresponding to the same IP address is firstly screened, whether the screened result contains LBS information with the duty ratio larger than the preset ratio is judged, and different modes are selected according to different screened results to obtain the minimum administrative region corresponding to the IP address.
With reference to the third possible implementation manner of the first aspect, the embodiment of the present application provides a fourth possible implementation manner of the first aspect, and the data processing method further includes: and marking the corresponding relation of the upper level administrative region serving as the minimum administrative region in the advertisement monitoring table.
Further, in the data processing method provided by the embodiment of the present application, by marking the correspondence of the last level administrative region as the minimum administrative region in the advertisement monitoring table, the marked IP address can be focused when the LBS information is determined next time or the subsequent statistics is performed, so that the next determination and the subsequent statistics are facilitated.
With reference to the first aspect, an embodiment of the present application provides a fifth possible implementation manner of the first aspect, where the creating an advertisement monitoring table including a correspondence between each of the IP addresses and the minimum administrative area includes:
adding the IP address and administrative region included in the target IP database into the advertisement monitoring table in pairs;
and replacing the corresponding administrative region in the advertisement monitoring table by using the determined minimum administrative region.
Further, according to the data processing method provided by the embodiment of the application, the IP addresses and the corresponding administrative regions in the IP address library are added in pairs in the advertisement monitoring table, the administrative region corresponding to the target IP address is replaced by the minimum administrative region corresponding to the IP address, the original IP database is reserved, and the original IP database can be referred to when the advertisement monitoring table is manufactured or the advertisement detection table is modified each time, so that the availability of the original IP address library is improved.
With reference to the first aspect, an embodiment of the present application provides a sixth possible implementation manner of the first aspect, where the data processing method further includes: the advertisement monitoring table is marked with an identification representing a target usage scenario.
Further, according to the data processing method provided by the embodiment of the application, different advertisement monitoring tables are conveniently distinguished and identified by marking the advertisement monitoring tables with the identification of the target use scene.
In a second aspect, an embodiment of the present application further provides a data processing apparatus, including:
the acquisition module is used for acquiring the Internet Protocol (IP) address and the location-based service (LBS) information contained in the advertisement log information generated in the current preset time period;
the statistics module is used for carrying out statistics on LBS information in the advertisement log information with the same IP address and determining the minimum administrative area to which each IP address belongs;
the creation module is used for creating an advertisement monitoring table containing the corresponding relation between each IP address and the minimum administrative area.
With reference to the second aspect, an embodiment of the present application provides a first possible implementation manner of the second aspect, where the configuration of the statistics module when used for counting LBS information in the advertisement log information with the same IP address, to determine a minimum administrative area to which each IP address belongs includes:
The statistics unit is used for counting LBS information in the advertisement log information with the same IP address to obtain the total frequency of LBS information corresponding to each IP address;
the screening unit is used for screening the total frequency of the LBS information corresponding to each IP address to obtain a target IP address, wherein the total frequency of the LBS information corresponding to the target IP address is larger than a preset frequency;
and the determining unit is used for determining the minimum administrative region to which the target IP address belongs according to the frequency of each LBS information included in the target IP address for each target IP address.
With reference to the first possible implementation manner of the second aspect, the embodiment of the present application provides a second possible implementation manner of the second aspect, where the determining unit is configured to determine, according to a frequency of LBS information included in the target IP address, the minimum administrative area to which the target IP address belongs, where the determining unit includes:
a first determining unit configured to, when the number of LBS information included in the target IP address is 1, take an administrative area of a location corresponding to the LBS information included in the target IP address as the minimum administrative area;
and the second determining unit is used for determining the minimum administrative region to which the target IP address belongs according to the ratio of the frequency of each LBS information included by the target IP address to the total frequency of the LBS information corresponding to the target IP address when the number of the LBS information included by the target IP address is greater than 1.
With reference to the second possible implementation manner of the second aspect, the embodiment of the present application provides a third possible implementation manner of the second aspect, where the configuration of the second determining unit when configured to determine, according to a ratio of a frequency of each LBS information included in the target IP address to a total frequency of LBS information corresponding to the target IP address, the minimum administrative region to which the target IP address belongs includes:
judging whether each piece of LBS information included in the target IP address includes target LBS information, wherein the target LBS information is LBS information with the ratio being greater than or equal to a preset ratio;
if the target LBS information is included, taking an administrative area of a position corresponding to the target LBS address as the minimum administrative area;
and if the target LBS information is not included, taking the upper level administrative area of the position corresponding to each LBS information included in the target IP address as the minimum administrative area.
With reference to the third possible implementation manner of the second aspect, the embodiment of the present application provides a fourth possible implementation manner of the second aspect, and the data processing device is further configured to flag a correspondence relationship of a previous level administrative area as the minimum administrative area in the advertisement monitoring table.
With reference to the second aspect, an embodiment of the present application provides a fifth possible implementation manner of the second aspect, where the creating module is configured to, when used to create an advertisement monitoring table including a correspondence between each of the IP addresses and the minimum administrative area, include:
an adding unit for adding the IP address and administrative area included in the target IP database into the advertisement monitoring table in pairs;
and the replacing unit is used for replacing the corresponding administrative region in the advertisement monitoring table by using the determined minimum administrative region.
With reference to the second aspect, embodiments of the present application provide a sixth possible implementation manner of the second aspect, the data processing apparatus further includes:
and the marking module is used for marking the advertisement monitoring table by using the identification for representing the target use scene.
In a third aspect, an embodiment of the present application further provides a computer device, including a memory, a processor, and a computer program stored in the memory and executable on the processor, where the processor executes the computer program to implement the steps of the data processing method described in the first aspect and the six possible implementation manners of the first aspect.
In a fourth aspect, the embodiments of the present application further provide a computer readable storage medium, where a computer program is stored, where the computer program is executed by a processor to perform the steps of the data processing method described in the first aspect and the six possible implementation manners of the first aspect.
In order to make the above objects, features and advantages of the present application more comprehensible, preferred embodiments accompanied with figures are described in detail below.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present application, the drawings that are needed in the embodiments will be briefly described below, it being understood that the following drawings only illustrate some embodiments of the present application and therefore should not be considered limiting the scope, and that other related drawings may be obtained according to these drawings without inventive effort for a person skilled in the art.
FIG. 1 is a flowchart of a data processing method according to an embodiment of the present application;
FIG. 2 is a flowchart of another data processing method according to an embodiment of the present application;
fig. 3 is a schematic structural diagram of data statistics calculation according to an embodiment of the present application;
FIG. 4 is a flowchart of another data processing method according to an embodiment of the present application;
FIG. 5 is a flowchart of another data processing method according to an embodiment of the present application;
FIG. 6 is a flowchart of another data processing method according to an embodiment of the present application;
FIG. 7 is a schematic diagram of a data processing apparatus according to an embodiment of the present application;
FIG. 8 is a schematic diagram of another data processing apparatus according to an embodiment of the present disclosure;
FIG. 9 is a schematic diagram of another data processing apparatus according to an embodiment of the present disclosure;
FIG. 10 is a schematic diagram of another data processing apparatus according to an embodiment of the present disclosure;
FIG. 11 is a schematic diagram of another data processing apparatus according to an embodiment of the present disclosure;
fig. 12 is a schematic structural diagram of a computer device according to an embodiment of the present application.
Detailed Description
For the purposes of making the objects, technical solutions and advantages of the embodiments of the present application more clear, the technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application, and it is apparent that the described embodiments are only some embodiments of the present application, but not all embodiments. The components of the embodiments of the present application, which are generally described and illustrated in the figures herein, may be arranged and designed in a wide variety of different configurations. Thus, the following detailed description of the embodiments of the present application, as provided in the accompanying drawings, is not intended to limit the scope of the application, as claimed, but is merely representative of selected embodiments of the application. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present application without making any inventive effort, are intended to be within the scope of the present application.
In the advertisement world, in order to carry out more accurate advertisement delivery according to regions, an advertiser needs to monitor delivered advertisements, monitoring data comprise the IP addresses of users, in order to facilitate understanding, advertisement delivery monitoring is taken as an example for explanation, the advertiser delivers advertisements on an advertisement delivery platform, when a certain user browses advertisements at a user client, the advertisement platform sends a playing request to a monitoring party, meanwhile, user data are sent to the monitoring party, wherein the monitoring party comprises the IP addresses, LBS information and other information, the advertisement monitoring party accumulates the user data and forms advertisement monitoring logs, statistics is carried out on the user data in the advertisement monitoring logs, and the geographic position of the IP addresses is judged; regarding the determination of the geographic position of the IP address, generally, the IP address in the advertisement monitoring log is used to find the position information corresponding to the IP address in the IP address library (a mapping list from one IP address to one geographic position), and the position information is used as the geographic position of the IP address, but the variation of the IP address is frequent, for example, the same user client is on the internet each time, the network server can randomly allocate one IP address to the user client, the same device is connected with different WiFi or switched 4G networks, and the IP address also changes; however, the IP address library is a static library, and cannot be updated immediately when the IP address changes, so that a large error exists between the corresponding relationship of the IP address in the IP address library and the real mapping relationship; for the above-mentioned errors, the errors are generally reduced by shortening the time interval of updating the IP address library, for example, from updating once a month to updating once a week or updating once a day, but the IP address library cannot be updated every day in a real use scenario, and is generally updated monthly; shortening the time interval for updating the IP address library may reduce errors, but such errors may not be completely eliminated, so that the geographical location of the IP address obtained from the IP address library is inaccurate.
Based on this, the embodiment of the application provides a data processing method, device, equipment and storage medium, compared with the randomly allocated IP address, the LBS information of the user client obtains the LBS longitude and latitude information of the user based on the actual geographic position of the user, the positioning accuracy is higher, and the correspondence between the LBS longitude and latitude information of the user client and the geographic position of the LBS is not changed, so that the geographic position of the IP address is determined through a large amount of LBS information, and the accuracy of the geographic position of the IP address is improved.
Fig. 1 is a flowchart of a data processing method according to an embodiment of the present application, as shown in fig. 1, where the data processing method includes the following steps:
step S101: and acquiring the Internet Protocol (IP) address and the location-based service (LBS) information included in the advertisement log information generated in the current preset time period.
Specifically, the advertisement monitoring party determines the update time of the IP address library as the first day, monitors advertisement data, acquires an IP address through an external SDK (Software Development Kit ), acquires LBS longitude and latitude information corresponding to the IP address through a GPS (Global Positioning System ) chip of the client, wherein the IP data includes a digital tag allocated to an internet protocol device used by a user to surf the internet, the LBS data includes an advertisement monitoring log of a terminal device acquired through a radio communication network (e.g., LTE (Long Term Evolution, high-speed wireless communication standard) network, 5G (5 th generation mobile networks, fifth-generation mobile communication technology) network, GSM (Global System for Mobile Communications, global system for mobile communication) network, CDMA (Code Division Multiple Access ) network, etc.) or an external positioning mode (e.g., GPS, etc.), counts the above information and forms an advertisement monitoring log including the IP address, LBS information, and other information when the user browses advertisements, and stores the advertisement monitoring log in a local hard disk for reading when the program is running; the advertisement monitoring party reads the advertisement monitoring log through a self-defined time period, and when the advertisement monitoring log is read, the advertisement monitoring log generated within a preset time period from the current time is obtained, and the advertisement monitoring log comprises paired IP addresses and LBS information, such as: one IP address corresponds to one LBS information.
For example, for a user, when a user browses information or watches video on a mobile phone APP (Application) or a web page, the user sometimes jumps out of an advertisement and automatically plays the advertisement, and when the advertisement is played, a user client sends an advertisement playing request to a server of an advertisement monitoring party, and meanwhile, the obtained user data comprises information such as an IP address, LBS information, playing time, advertisement item number and the like to the advertisement monitoring party; for an advertisement monitoring party, setting the updating time of an IP address library as a starting point, starting to monitor a third-party advertisement, and accumulating user data to form an advertisement monitoring log, wherein one IP address corresponds to one piece of user data; when determining the geographic position of the IP address, acquiring an advertisement monitoring log within a preset time period from the current time, wherein the advertisement monitoring log comprises the IP address and corresponding LBS information, and the time period can be a time period from the update time of the IP address library to the current time.
Step S102: and counting LBS information in the advertisement log information with the same IP address, and determining the minimum administrative area to which each IP address belongs.
Specifically, the advertisement monitoring party purchases an IP address library and an LBS location mapping library of a third party, and stores the information in a program file, and the big data technology distributes LBS longitude and latitude information in an advertisement monitoring log to different servers for conversion processing, namely: determining LBS location information in the LBS location mapping library of the third party through the LBS longitude and latitude information, merging all LBS location information corresponding to the same IP address with the obtained LBS location information one by one, and determining a minimum administrative area to which each IP address belongs according to all LBS location information corresponding to the IP address, where the minimum administrative area is the most accurate geographic location corresponding to the IP address, for example, one LBS location information is a beijing city sea lake area in two LBS information corresponding to a certain IP address, and the other LBS location information is beijing city in china, and the minimum administrative area to which the IP address belongs is beijing in china.
For example, the data storage format in the advertisement log monitoring information is: the data storage format in the LBS location mapping library of the IP address, LBS longitude and latitude information and other information is as follows: LBS longitude and latitude information, LBS position information, namely, the corresponding LBS position information is found in a third-party LBS position mapping library through the LBS longitude and latitude information in advertisement monitoring log information, then the IP address and the LBS position information are in one-to-one correspondence, all the LBS position information corresponding to the same IP address are combined to form a table of a plurality of different LBS position information corresponding to one IP address, a plurality of LBS position information corresponding to the IP address is calculated in a statistics mode, the LBS position information with the highest accuracy is determined through comparison analysis, and the LBS position information is determined to be the smallest administrative area to which the IP address belongs.
Step S103: and establishing an advertisement monitoring table containing the corresponding relation between each IP address and the minimum administrative region.
Specifically, an advertisement monitoring table is established, wherein the advertisement monitoring table comprises IP addresses and minimum administrative areas corresponding to the IP addresses, the IP addresses and the minimum administrative areas are in one-to-one correspondence, and the specific corresponding formats are as follows: IP address, minimum administrative area, such as: 111.200.229.2 in Beijing, china.
In a possible implementation manner, fig. 2 is a flowchart of another data processing method provided in this application, and as shown in fig. 2, when calculating LBS information in the advertisement log information having the same IP address, determining a minimum administrative area to which each of the IP addresses belongs, the method includes the following steps:
step S201: and counting the LBS information in the advertisement log information with the same IP address to obtain the total frequency of the LBS information corresponding to each IP address.
Specifically, all the same IP addresses are combined into one IP address, all LBS location information corresponding to the IP address is counted, the total frequency (total number) of all LBS location information corresponding to the same IP address is calculated, then the frequency of each different LBS location information corresponding to the same IP address is calculated, and finally the different LBS location information corresponding to the same IP address, the frequency thereof and the total number of all LBS information are combined, namely: the above LBS information includes LBS location information, frequencies of different LBS location information, and a total number of all LBS location information.
For example, fig. 3 is a schematic structural diagram of data statistics calculation provided in this embodiment of the present application, as shown in fig. 3, in an advertisement monitoring log, an IP address corresponds to one LBS longitude and latitude information, corresponding LBS location information is found in a third party LBS location mapping library through the LBS longitude and latitude information, a table (a) of one IP address corresponds to one LBS location information is formed after statistics, then the same IP address and corresponding LBS location information thereof are combined into a row to form a table (b) of one IP address corresponds to a plurality of LBS location information, the total number of all LBS location information corresponding to the same IP address and the frequency of each different LBS location information are calculated respectively, and the finally formed table (c) includes one IP address, a plurality of different LBS location information corresponding to the IP address, the total number of times corresponding to each LBS location information and all LBS location information.
Step S202: and screening the total frequency of the LBS information corresponding to each IP address to obtain a target IP address, wherein the total frequency of the LBS information corresponding to the target IP address is larger than a preset frequency.
Specifically, a proportion parameter is set according to a service scene and a statistical play amount, a system obtains a screening threshold value according to the proportion parameter and the statistical play amount, the total frequency of all LBS information corresponding to each IP address is compared with the screening threshold value, the IP address with the total frequency of the LBS information larger than the screening threshold value is determined as a target IP address, and all the target IP addresses and the LBS information corresponding to the target IP address are counted.
For example, in the anti-cheating service scene, the statistical play amount is 100, the proportion parameter is set to be 60% according to the anti-cheating service scene and the statistical play amount 100, the system multiplies the statistical play amount 100 by 60% of the proportion parameter to obtain a screening threshold value of 60, and if the total frequency of LBS information corresponding to the IP address is 80, the IP address is the target IP address; if the total frequency of LBS information corresponding to the IP address is 60, the IP address is not the target IP address; if the total frequency of LBS information corresponding to the IP address is 55, the IP address is not the target IP address.
Step S203: for each target IP address, determining the minimum administrative region to which the target IP address belongs according to the frequency of each LBS information included in the target IP address.
Specifically, the frequency of each piece of LBS information corresponding to each target IP address is processed and analyzed, the frequency of LBS information which is most in line with the conditions is found according to the analysis result, and the LBS location information corresponding to the frequency is used as the minimum administrative area to which the target IP address belongs.
In a possible implementation manner, fig. 4 is a flowchart of another data processing method provided in this application, and as shown in fig. 4, when determining the above-mentioned minimum administrative area to which the target IP address belongs according to the frequency of each LBS information included in the target IP address, the method includes the following steps:
step S301: and when the number of the LBS information included in the target IP address is 1, taking an administrative area of the position corresponding to the LBS information included in the target IP address as the minimum administrative area.
Step S302: when the number of LBS information included in the target IP address is greater than 1, determining the minimum administrative region to which the target IP address belongs according to the ratio of the frequency of each LBS information included in the target IP address to the total frequency of the LBS information corresponding to the target IP address.
Specifically, the number of LBS information included in the target IP address is counted, that is: counting that the IP address corresponds to a plurality of different LBS information; if the target IP address corresponds to only one LBS message, taking the LBS position message in the LBS message as the minimum administrative area of the target IP address; if the target IP address corresponds to two or more LBS information, a ratio of a total frequency of LBS information corresponding to the target IP address to a frequency of each LBS information needs to be calculated, the most accurate LBS information is determined according to the ratio, and the LBS information is used as a minimum administrative area to which the target IP address belongs.
In a possible implementation manner, fig. 5 is a flowchart of another data processing method provided in this application, and as shown in fig. 5, when determining the minimum administrative area to which the target IP address belongs according to a ratio of a frequency of each LBS information included in the target IP address to a total frequency of LBS information corresponding to the target IP address, the method includes the following steps:
step S401: and judging whether each piece of LBS information included in the target IP address includes target LBS information, wherein the target LBS information is LBS information with the ratio being greater than or equal to a preset ratio.
Step S402: and if the target LBS information is included, taking an administrative area of a position corresponding to the target LBS address as the minimum administrative area.
Step S403: and if the target LBS information is not included, taking the upper level administrative area of the position corresponding to each LBS information included in the target IP address as the minimum administrative area.
Specifically, a preset ratio is set according to a service scene and a statistical play amount, the preset ratio needs to be more than 50%, then the ratio of the frequency of each piece of LBS information corresponding to a target IP address to the total frequency of the LBS information corresponding to the target IP address is calculated, the ratio is respectively compared with the preset ratio, if the LBS information with the ratio being more than or equal to the preset ratio exists, the LBS information is the target LBS information, and the LBS position information in the target LBS information is the minimum administrative area corresponding to the target IP address; if no LBS information with the ratio greater than or equal to the preset ratio exists, the LBS information corresponding to the target IP address does not include the target LBS information, and it is necessary to query a last level administrative area of each LBS location information corresponding to the target IP address in a regional division table of the civil administration part, and use the last level administrative area as a minimum administrative area corresponding to the target IP address.
For example, if the preset ratio is set to 80%, the target IP address corresponds to 5 times of the beijing city of china and 1 time of china, and there are 6 pieces of total LBS information, the duty ratio corresponding to each piece of LBS information is calculated, so that the duty ratio of the LBS information is 83% of the beijing city of china and the duty ratio of the LBS information is 17% of the beijing city of china, wherein the duty ratio of the LBS information is greater than 80% of the beijing city of china, so that the LBS information is target LBS information, and each piece of LBS information included in the target IP address includes target LBS information, then the minimum administrative area corresponding to the target IP address is beijing city of china; if the two pieces of LBS information corresponding to the target IP address are 4 times in beijing city in china and 1 time in china, 5 pieces of LBS information are total, and the duty ratio corresponding to each piece of LBS information is calculated, so that the duty ratio of LBS information is 80% in beijing city in china and the duty ratio of LBS information is 20% in china, wherein the duty ratio of LBS information is equal to 80% in beijing city in china, so that the LBS information is target LBS information, and each piece of LBS information included in the target IP address includes target LBS information, and the minimum administrative area corresponding to the target IP address is beijing city in china; if the three pieces of LBS information corresponding to the target IP address are 2 times in china, 2 times in shanghai city in china and 1 time in beijing city in china, a total of 5 pieces of LBS information are obtained, the duty ratio corresponding to each piece of LBS information is calculated, the duty ratios of the LBS information are 40% in china and shanghai city in china, the duty ratio of the LBS information is 20% in beijing city in china, and the duty ratio of all pieces of LBS information is less than 80%, so that the target LBS information is not included in each piece of LBS information included in the target IP address, the region division table of the civil administration part is searched for the upper level administrative region china of china, shanghai city in china and beijing city in china, and china is taken as the minimum administrative region corresponding to the target IP address.
In a possible embodiment, in performing steps S401 to S403, the method further includes marking a correspondence relationship of the upper level administrative area as the minimum administrative area in the advertisement monitoring table.
Specifically, the position corresponding to the IP address of the upper level administrative region as the minimum administrative region is found in the advertisement monitoring table, and the position of the minimum administrative region corresponding to the IP address is marked, and the marked position information varies.
In a possible implementation manner, fig. 6 is a flowchart of another data processing method provided in this application, and as shown in fig. 6, when an advertisement monitoring table including a correspondence relationship between each of the IP addresses and the minimum administrative area is established, the method includes the following steps:
step S601: and adding the IP address and administrative region included in the target IP database into the advertisement monitoring table in pairs.
Step S602: and replacing the corresponding administrative region in the advertisement monitoring table by using the determined minimum administrative region.
Specifically, a blank advertisement monitoring table is established, all IP addresses in an IP database and corresponding administrative areas (geographic position information corresponding to the IP addresses) are added into the advertisement monitoring table in a one-to-one correspondence manner, then the administrative area corresponding to the target IP address is found in the advertisement monitoring table according to the target IP address, and the determined minimum administrative area corresponding to the target IP address is used for replacing the original administrative area.
In a possible embodiment, the data processing method further comprises marking the advertisement monitoring table with an identification for representing a target usage scenario when performing steps S101-S103.
Specifically, the screening threshold values and the preset ratio values corresponding to different service scenes and the statistical play amount are different, so that different advertisement monitoring tables can be generated, the system marks the advertisement monitoring tables according to the use scenes of the advertisement monitoring tables, marks the use scenes, and facilitates distinguishing different advertisement monitoring tables.
For example, in the anti-cheating service scenario, a proportion parameter and a preset ratio are set manually according to the anti-cheating service requirement and the statistical play quantity, a system calculates a screening threshold according to the proportion parameter, the LBS information corresponding to the IP address is screened by combining the preset ratio, then the screening result is processed, an advertisement monitoring table is generated, and the advertisement monitoring table is marked with the identification of the anti-cheating service.
Fig. 7 is a schematic structural diagram of a data processing apparatus according to an embodiment of the present application, as shown in fig. 7, where the data processing apparatus includes:
an obtaining module 701, configured to obtain an internet protocol IP address and location-based service LBS information included in the advertisement log information generated in the current preset time period.
The statistics module 702 is configured to perform statistics on LBS information in the advertisement log information having the same IP address, and determine a minimum administrative area to which each IP address belongs.
A creating module 703, configured to create an advertisement monitoring table including a correspondence between each of the IP addresses and the minimum administrative area.
In a possible implementation manner, fig. 8 is a schematic structural diagram of another data processing apparatus provided in this application, and as shown in fig. 8, the configuration of the statistics module 702 in the foregoing is used for performing statistics on LBS information in the advertisement log information having the same IP address, and determining a minimum administrative area to which each IP address belongs, where the minimum administrative area includes:
and a statistics unit 704, configured to perform statistics on LBS information in the advertisement log information with the same IP address, so as to obtain a total frequency of LBS information corresponding to each IP address.
And a screening unit 705, configured to screen the total frequency of LBS information corresponding to each IP address to obtain a target IP address, where the total frequency of LBS information corresponding to the target IP address is greater than a preset frequency.
A determining unit 706, configured to determine, for each target IP address, the minimum administrative area to which the target IP address belongs according to the frequency of LBS information included in the target IP address.
In a possible implementation manner, fig. 9 is a schematic structural diagram of another data processing apparatus provided in this embodiment of the present application, where, as shown in fig. 9, the configuration of the determining unit 706, when determining, according to the frequency of LBS information included in the target IP address, the minimum administrative area to which the target IP address belongs, includes:
a first determining unit 707 configured to, when the number of LBS information included in the target IP address is 1, take an administrative area of a location corresponding to the LBS information included in the target IP address as the minimum administrative area.
A second determining unit 708, configured to determine, when the number of LBS information included in the target IP address is greater than 1, the minimum administrative area to which the target IP address belongs according to a ratio of a frequency of each LBS information included in the target IP address to a total frequency of LBS information corresponding to the target IP address.
In a possible embodiment, the configuration of the second determining unit 708, when determining the minimum administrative area to which the target IP address belongs according to the ratio of the frequency of each LBS information included in the target IP address to the total frequency of LBS information corresponding to the target IP address, includes:
and judging whether each piece of LBS information included in the target IP address includes target LBS information, wherein the target LBS information is LBS information with the ratio being greater than or equal to a preset ratio.
And if the target LBS information is included, taking an administrative area of the position corresponding to the target LBS address as the minimum administrative area.
And if the target LBS information is not included, taking the upper level administrative area of the position corresponding to each LBS information included in the target IP address as the minimum administrative area.
In a possible embodiment, the data processing apparatus is further configured to flag a correspondence relationship of the upper level administrative area as the minimum administrative area in the advertisement monitoring table.
In a possible implementation manner, fig. 10 is a schematic structural diagram of another data processing apparatus provided in this application, and as shown in fig. 10, the configuration of the creating module 703 includes, when used for creating an advertisement monitoring table including a correspondence between each of the IP addresses and the minimum administrative area:
an adding unit 709 for adding the IP address and administrative area included in the target IP database to the advertisement monitoring table in pairs.
And a replacing unit 710, configured to replace a corresponding administrative area in the advertisement monitoring table by using the determined minimum administrative area.
In a possible implementation manner, fig. 11 is a schematic structural diagram of another data processing apparatus provided in this application, where, as shown in fig. 11, the data processing apparatus further includes:
A marking module 711 for marking the advertisement monitoring table with an identification for representing a target usage scenario.
Based on the analysis, compared with the method in the related art that the corresponding position information is found from the IP address library through the IP address in the advertisement log information and is used as the position information of the IP address, the data processing method provided by the embodiment of the application judges the position information of the IP address by means of a large amount of LBS information in the advertisement monitoring log information, and improves the accuracy of the geographic position of the IP address.
Fig. 12 is a schematic structural diagram of a computer device provided in an embodiment of the present application, corresponding to the data processing method in fig. 1, and in the embodiment of the present application, a computer device 800 is provided, as shown in fig. 12, and includes a memory 801, a processor 802, and a computer program stored in the memory 801 and capable of running on the processor 802, where the processor 802 implements the steps of the data processing method when executing the computer program.
Specifically, the memory 801 and the processor 802 can be general-purpose memories and processors, which are not limited herein, and when the processor 802 runs a computer program stored in the memory 801, the data processing method can be executed, so as to solve the problem of errors caused by the update interval of the IP address library in the prior art.
Corresponding to the data processing method in fig. 1, the embodiment of the present application further provides a computer readable storage medium, on which a computer program is stored, which computer program, when being executed by a processor, performs the steps of the data processing method described above.
Specifically, the storage medium can be a general storage medium, such as a mobile disk, a hard disk, etc., and when the computer program on the storage medium is executed, the above data processing method can be executed, so that the problem of errors caused by the update interval of the IP address library in the prior art is solved.
The data processing apparatus provided in the embodiments of the present application may be specific hardware on a device or software or firmware installed on a device. The device provided in the embodiments of the present application has the same implementation principle and technical effects as those of the foregoing method embodiments, and for a brief description, reference may be made to corresponding matters in the foregoing method embodiments where the device embodiment section is not mentioned. It will be clear to those skilled in the art that, for convenience and brevity, the specific operation of the system, apparatus and unit described above may refer to the corresponding process in the above method embodiment, which is not described in detail herein.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other manners. The above-described apparatus embodiments are merely illustrative, and for example, the above-described division of units is merely a logical function division, and there may be other manners of division in actual implementation, and for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be through some communication interface, device or unit indirect coupling or communication connection, which may be in electrical, mechanical or other form.
The units described above as separate components may or may not be physically separate, and components shown as units may or may not be physical units, may be located in one place, or may be distributed over a plurality of network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments provided in the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit.
The above functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be embodied in essence or a part contributing to the prior art or a part of the technical solution, or in the form of a software product stored in a storage medium, including several instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the above-described method of the various embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a magnetic disk, or an optical disk, or other various media capable of storing program codes.
It should be noted that: like reference numerals and letters in the following figures denote like items, and thus once an item is defined in one figure, no further definition or explanation of it is required in the following figures, and furthermore, the terms "first," "second," "third," etc. are used merely to distinguish one description from another and are not to be construed as indicating or implying relative importance.
Finally, it should be noted that: the foregoing examples are merely specific embodiments of the present application, and are not intended to limit the scope of the present application, but the present application is not limited thereto, and those skilled in the art will appreciate that while the foregoing examples are described in detail, the present application is not limited thereto. Any person skilled in the art may modify or easily conceive of the technical solution described in the foregoing embodiments, or make equivalent substitutions for some of the technical features within the technical scope of the disclosure of the present application; such modifications, changes or substitutions do not depart from the spirit and scope of the corresponding technical solutions. Are intended to be encompassed within the scope of this application. Therefore, the protection scope of the present application shall be subject to the protection scope of the claims.

Claims (9)

1. A method of data processing, comprising:
acquiring an Internet Protocol (IP) address and location-based service (LBS) information included in advertisement log information generated in a current preset time period;
counting LBS information in the advertisement log information with the same IP address, and determining the minimum administrative area to which each IP address belongs;
Establishing an advertisement monitoring table containing the corresponding relation between each IP address and the minimum administrative area;
the step of counting LBS information in the advertisement log information having the same IP address, and determining a minimum administrative area to which each IP address belongs, includes:
counting LBS information in the advertisement log information with the same IP address to obtain the total frequency of LBS information corresponding to each IP address;
screening the total frequency of LBS information corresponding to each IP address to obtain a target IP address, wherein the total frequency of LBS information corresponding to the target IP address is larger than a preset frequency;
and for each target IP address, determining the minimum administrative region to which the target IP address belongs according to the frequency of each LBS information included in the target IP address.
2. The data processing method of claim 1, wherein said determining the minimum administrative area to which the target IP address belongs according to the frequency of LBS information included in the target IP address comprises:
when the number of the LBS information included in the target IP address is 1, taking an administrative area of a position corresponding to the LBS information included in the target IP address as the minimum administrative area;
When the number of LBS information included in the target IP address is larger than 1, determining the minimum administrative region to which the target IP address belongs according to the ratio of the frequency of each LBS information included in the target IP address to the total frequency of the LBS information corresponding to the target IP address.
3. The data processing method as claimed in claim 2, wherein said determining the minimum administrative area to which the target IP address belongs according to a ratio of a frequency of each LBS information included in the target IP address to a total frequency of LBS information corresponding to the target IP address comprises:
judging whether each piece of LBS information included in the target IP address includes target LBS information, wherein the target LBS information is LBS information with the ratio being greater than or equal to a preset ratio;
if the target LBS information is included, taking an administrative area of a position corresponding to the target LBS information as the minimum administrative area;
and if the target LBS information is not included, taking the upper level administrative area of the position corresponding to each LBS information included in the target IP address as the minimum administrative area.
4. A data processing method as claimed in claim 3, wherein the method further comprises:
And marking the corresponding relation of the upper level administrative region serving as the minimum administrative region in the advertisement monitoring table.
5. The data processing method as set forth in claim 1, wherein said creating an advertisement monitoring table containing correspondence between each of said IP addresses and said minimum administrative area, comprises:
adding the IP address and administrative region included in the target IP database into the advertisement monitoring table in pairs;
and replacing the corresponding administrative region in the advertisement monitoring table by using the determined minimum administrative region.
6. The data processing method of claim 1, wherein the method further comprises:
the advertisement monitoring table is marked with an identification representing a target usage scenario.
7. A data processing apparatus, comprising:
the acquisition module is used for acquiring the Internet Protocol (IP) address and the location-based service (LBS) information contained in the advertisement log information generated in the current preset time period;
the statistics module is used for carrying out statistics on LBS information in the advertisement log information with the same IP address and determining the minimum administrative area to which each IP address belongs;
The creation module is used for creating an advertisement monitoring table containing the corresponding relation between each IP address and the minimum administrative area;
the step of counting LBS information in the advertisement log information having the same IP address, and determining a minimum administrative area to which each IP address belongs, includes:
counting LBS information in the advertisement log information with the same IP address to obtain the total frequency of LBS information corresponding to each IP address;
screening the total frequency of LBS information corresponding to each IP address to obtain a target IP address, wherein the total frequency of LBS information corresponding to the target IP address is larger than a preset frequency;
and for each target IP address, determining the minimum administrative region to which the target IP address belongs according to the frequency of each LBS information included in the target IP address.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor implements the steps of the data processing method according to any of the preceding claims 1-6 when the computer program is executed.
9. A computer-readable storage medium, on which a computer program is stored, characterized in that the computer program, when being executed by a processor, performs the steps of the data processing method of any of the preceding claims 1-6.
CN202010888438.3A 2020-08-28 2020-08-28 Data processing method, device, equipment and storage medium Active CN112040024B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010888438.3A CN112040024B (en) 2020-08-28 2020-08-28 Data processing method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010888438.3A CN112040024B (en) 2020-08-28 2020-08-28 Data processing method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN112040024A CN112040024A (en) 2020-12-04
CN112040024B true CN112040024B (en) 2023-05-09

Family

ID=73586825

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010888438.3A Active CN112040024B (en) 2020-08-28 2020-08-28 Data processing method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN112040024B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113434553A (en) * 2021-06-30 2021-09-24 青岛海尔科技有限公司 Method, device, storage medium and computer equipment for acquiring information

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049870A (en) * 2012-12-19 2013-04-17 东莞市东信网络技术有限公司 Mobile advertisement putting system and method
WO2019061656A1 (en) * 2017-09-30 2019-04-04 平安科技(深圳)有限公司 Electronic apparatus, service place recommendation method based on lbs data, and storage medium

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9026145B1 (en) * 2012-03-23 2015-05-05 Google Inc. Systems and methods for mapping IP-addresses to geolocations
US20170180313A1 (en) * 2012-04-05 2017-06-22 Blis Media Limited Associating Geolocation Data With IP Addresses
CN106534392B (en) * 2015-09-10 2019-12-06 阿里巴巴集团控股有限公司 Positioning information acquisition method, positioning method and device
CN106651442B (en) * 2016-12-16 2020-10-27 阿里巴巴(中国)有限公司 Advertisement putting control method and device
CN107169805A (en) * 2017-06-23 2017-09-15 上海斐讯数据通信技术有限公司 A kind of advertisement placement method, apparatus and system
CN108900566B (en) * 2018-05-23 2020-07-10 中国科学院信息工程研究所 Method and device for determining position of IP (Internet protocol) equipment in network
CN110650146A (en) * 2019-09-26 2020-01-03 秒针信息技术有限公司 Anti-cheating method and device and electronic equipment
CN111343301B (en) * 2020-04-21 2022-08-16 北京字节跳动网络技术有限公司 Positioning method, positioning device, electronic equipment and storage medium

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103049870A (en) * 2012-12-19 2013-04-17 东莞市东信网络技术有限公司 Mobile advertisement putting system and method
WO2019061656A1 (en) * 2017-09-30 2019-04-04 平安科技(深圳)有限公司 Electronic apparatus, service place recommendation method based on lbs data, and storage medium

Also Published As

Publication number Publication date
CN112040024A (en) 2020-12-04

Similar Documents

Publication Publication Date Title
US11556946B2 (en) Methods and apparatus for associating media devices with a demographic composition of a geographic area
US9674665B2 (en) System and method for automated location-based widgets
US11997332B2 (en) Methods and apparatus to associate audience members with over-the-top device media impressions
US9715554B2 (en) Methods and apparatus to identify usage of quick response codes
CN108011987B (en) IP address positioning method and device, electronic equipment and storage medium
EP2216747A2 (en) Method and apparatus to associate demographic and geographic information with influential consumer relationships
CN108596690B (en) Advertisement processing method and device
WO2018121619A1 (en) Multimedia data processing method and device for service, server, and storage medium
CN107454126B (en) Message pushing method, server and terminal
US20120084430A1 (en) Methods and apparatus to measure mobile broadband market share
CN108574715A (en) Information recommendation method, apparatus and system
US20120150490A1 (en) Management server, communication system and statistical processing method
CN111541986B (en) Positioning method, positioning device, storage medium and processor
CN110659945A (en) Method and device for counting advertisement exposure times and computer equipment
CN112040024B (en) Data processing method, device, equipment and storage medium
CN110650146A (en) Anti-cheating method and device and electronic equipment
JP2011191911A (en) Advertisement distribution device, advertisement distribution system, advertisement distribution method and program
CN105657652A (en) Providing streaming geolocation infomation
CN110990244B (en) Target equipment identification determining method and device, electronic equipment and readable storage medium
US10959041B1 (en) Traffic analysis of mobile phones partitioned by geohash
CN103618639A (en) Method, device and system for monitoring media data
KR20150080144A (en) Apparatus and method for providing advertisement stream, and method for viewing advertisement stream
WO2023045434A1 (en) Access detection method, system, and apparatus
CN108170795B (en) Information pushing method, device and equipment
CN109769202B (en) Method and device for positioning flow data, storage medium and server

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant