CN106933905A - The monitoring method and device of web page access data - Google Patents

The monitoring method and device of web page access data Download PDF

Info

Publication number
CN106933905A
CN106933905A CN201511032354.5A CN201511032354A CN106933905A CN 106933905 A CN106933905 A CN 106933905A CN 201511032354 A CN201511032354 A CN 201511032354A CN 106933905 A CN106933905 A CN 106933905A
Authority
CN
China
Prior art keywords
pixel
data
access data
access
judging
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201511032354.5A
Other languages
Chinese (zh)
Other versions
CN106933905B (en
Inventor
祁国晟
徐烨
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Gridsum Technology Co Ltd
Original Assignee
Beijing Gridsum Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Gridsum Technology Co Ltd filed Critical Beijing Gridsum Technology Co Ltd
Priority to CN201511032354.5A priority Critical patent/CN106933905B/en
Publication of CN106933905A publication Critical patent/CN106933905A/en
Application granted granted Critical
Publication of CN106933905B publication Critical patent/CN106933905B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/958Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Abstract

This application discloses the monitoring method and device of a kind of web page access data.Wherein, the method includes:Obtain the corresponding access data of each pixel on target web;Data will be accessed to be ranked up, each pixel point-rendering pixel curve corresponding to the access data after sequence;Whether the corresponding distribution situation for accessing data of each pixel meets default distribution in judging pixel point curve;In the case where judging that distribution situation meets default distribution, the corresponding access data of target pixel points are determined from the corresponding access data of each pixel, wherein, target pixel points are according to default screening conditions, the pixel determined from each pixel;According to corresponding access data and the predetermined threshold value of target pixel points, the corresponding monitoring result for accessing data of each pixel is determined.Present application addresses cannot determine in the prior art access the whether real technical problem of data.

Description

The monitoring method and device of web page access data
Technical field
The application is related to computer realm, in particular to the monitoring method and device of a kind of web page access data.
Background technology
As the popularization and development of internet, the user for understanding information by internet and being traded are more and more, enter Obtained from Internet user access data it is also increasingly huge therewith.More product providers start with internet This platform is publicized, is concluded the business and maintenance items, and this is resulted in accessing data processing and the demand for presenting hurricane all the way Rise, in the prior art, data providing is mostly that displayed web page is accessed by way of scheming (for example, thermodynamic chart), table The situation of change of data.
Thermodynamic chart is a kind of highly effective and intuitively web page access data display methods, and it can preset webpage Time interval in access data be shown, and combine various dimensions dissect function, be applied to Consumer's Experience optimization (referred to as:UEO optimizes), visitor's behavioural analysis, the aspect such as the judgement of webpage general performance.
By the above, the emphasis of data providing is only that the access behavior for representing visitor colony at this stage (that is, demonstrating access data), so party in request (that is, product provider) can only be allowed to see on visitor colony The static state of access behavior (that is, accessing data) represents, and lacks the judge to the authenticity of above-mentioned access data, and then Also cannot just know the access behavior represented in thermodynamic chart whether be visitor true access behavior.
For above-mentioned problem, effective solution is not yet proposed at present.
The content of the invention
The embodiment of the present application provides the monitoring method and device of a kind of web page access data, at least to solve prior art In cannot determine access the whether real technical problem of data.
According to the one side of the embodiment of the present application, there is provided a kind of monitoring method of web page access data, including:Obtain Take the corresponding access data of each pixel on target web;The access data are ranked up, to the visit after sequence Ask data corresponding described each pixel point-rendering pixel curve;Judge each pixel described in the pixel point curve Whether the corresponding distribution situation for accessing data of point meets default distribution;Judging that it is described pre- that the distribution situation meets If in the case of distribution, the corresponding access of target pixel points is determined from the corresponding access data of described each pixel Data, wherein, the target pixel points are according to default screening conditions, the picture determined from described each pixel Vegetarian refreshments;According to corresponding access data and the predetermined threshold value of the target pixel points, it is determined that described each pixel is corresponding Access the monitoring result of data.
Further, according to corresponding access data and the predetermined threshold value of the target pixel points, it is determined that described each pixel The corresponding monitoring result for accessing data of point includes:Calculate the corresponding access data for accessing data of the target pixel points Summation;Judge whether the access data summation reaches the predetermined threshold value;Judge it is described access data summation reach In the case of the predetermined threshold value, determine that the monitoring result accesses data for true, wherein, the true access It is effectively to access data that data are used to characterize the corresponding data that access of described each pixel;Judging the access In the case that data summation is not up to the predetermined threshold value, the monitoring result is determined for untrue access data, wherein, It is invalid access data that the untrue access data are used to characterize the corresponding data that access of described each pixel.
Further, according to corresponding access data and the predetermined threshold value of the target pixel points, it is determined that described each pixel The corresponding monitoring result for accessing data of point includes:Calculate the corresponding access data for accessing data of the target pixel points Summation;Judge that the access data summation accounts for whether total ratio for accessing data reaches the predetermined threshold value, wherein, institute It is the corresponding access data sum of described each pixel to state total data that access;Judging that it is described pre- that the ratio reaches If in the case of threshold value, determining that the monitoring result accesses data for true, wherein, the true access data are used for Characterize the corresponding data that access of described each pixel and access data for effective;Judging the ratio not up to institute In the case of stating predetermined threshold value, the monitoring result is determined for untrue access data, wherein, the untrue access It is invalid access data that data are used to characterize the corresponding data that access of described each pixel.
Further, the access data are ranked up, described each pixel corresponding to the access data after sequence Point-rendering pixel point curve includes:Described each pixel is ranked up from high to low according to the access data;Base Each pixel point curve described in pixel point-rendering described in after being sorted from high to low according to access data;Judge the picture Whether the corresponding distribution situation for accessing data of each pixel described in vegetarian refreshments curve meets default distribution includes:Judge Whether the corresponding distribution situation for accessing data of each pixel described in the pixel point curve meets long-tail distribution.
Further, in the case where judging that the distribution situation does not meet the default distribution, the monitoring is determined Result is untrue access data, wherein, the untrue access data are corresponding for characterizing described each pixel It is invalid access data to access data.
Further, obtaining the corresponding data that access of each pixel on target web includes:Obtained from database full The corresponding access data of described each pixel on the pre-conditioned target web of foot, wherein, it is described pre-conditioned Comprise at least:Preset time period.
Further, described each pixel it is corresponding access data include it is following any one:Click volume, session amount and Mouse stay time.
According to the another aspect of the embodiment of the present application, a kind of monitoring device of web page access data is additionally provided, including: Acquiring unit, for obtaining the corresponding access data of each pixel on target web;Drawing unit, for by described in Access data to be ranked up, described each pixel point-rendering pixel curve corresponding to the access data after sequence;Sentence Disconnected unit, for whether judging the corresponding distribution situation for accessing data of each pixel described in the pixel point curve Meet default distribution;First determining unit, for judging that the distribution situation meets the situation of the default distribution Under, the corresponding access data of target pixel points are determined from the corresponding access data of described each pixel, wherein, The target pixel points are according to default screening conditions, the pixel determined from described each pixel;Second is true Order unit, for each pixel pair according to the corresponding access data of the target pixel points and predetermined threshold value determination The monitoring result of the access data answered.
Further, second determining unit includes:First computing module, for calculating the target pixel points pair The access data summation of the access data answered;First judge module, for judging that the access data summation accounts for total access Whether the ratio of data reaches the predetermined threshold value, wherein, total access data are that described each pixel is corresponding Access data sum;First determining module, in the case where judging that the ratio reaches the predetermined threshold value, Determine that the monitoring result accesses data for true, wherein, the true access data are used to characterize described each pixel The corresponding access data of point access data for effective;Second determining module, for judging that the ratio is not up to In the case of the predetermined threshold value, the monitoring result is determined for untrue access data, wherein, the untrue visit Ask that data are invalid access data for characterizing the corresponding data that access of described each pixel.
Further, the drawing unit includes:Order module, for according to it is described access data from high to low to institute Each pixel is stated to be ranked up;Drafting module, for based on described each after data sort from high to low according to accessing Pixel point curve described in individual pixel point-rendering;The judging unit includes:Second judge module, for judging the picture Whether the corresponding distribution situation for accessing data of each pixel described in vegetarian refreshments curve meets long-tail distribution.
Further, second determining unit includes:Second computing module, for calculating the target pixel points pair The access data summation of the access data answered;3rd judge module, for judging whether the access data summation reaches The predetermined threshold value;3rd determining module, for judge it is described access data summation reach the predetermined threshold value In the case of, determine that the monitoring result accesses data for true, wherein, the true access data are described for characterizing The corresponding data that access of each pixel access data for effective;4th determining module, for judging the visit In the case of asking that data summation is not up to the predetermined threshold value, the monitoring result is determined for untrue access data, its In, it is invalid access data that the untrue access data are used to characterize the corresponding data that access of described each pixel.
Further, described device also includes:5th determining module, for judging that the distribution situation do not meet In the case of the default distribution, the monitoring result is determined for untrue access data, wherein, the untrue visit Ask that data are invalid access data for characterizing the corresponding data that access of described each pixel.
Further, acquiring unit includes:Acquisition module, for from database obtain meet it is pre-conditioned described in The corresponding access data of described each pixel on target web, wherein, it is described pre-conditioned to comprise at least:When default Between section.
Further, described each pixel it is corresponding access data include it is following any one:Click volume, session amount and Mouse stay time.
In the embodiment of the present application, using the corresponding access data of each pixel on acquisition target web;By the visit Ask that data are ranked up, described each pixel point-rendering pixel curve corresponding to the access data after sequence;Judge Whether the corresponding distribution situation for accessing data of each pixel described in the pixel point curve meets default distribution; In the case of judging that the distribution situation meets the default distribution, from the corresponding access data of described each pixel In determine the corresponding access data of target pixel points, wherein, the target pixel points be according to default screening conditions, The pixel determined from described each pixel;According to corresponding access data and the default threshold of the target pixel points Value determines the mode of the corresponding monitoring result for accessing data of described each pixel, characterizes pixel and visits by drawing Ask the pixel point curve of the corresponding relation between data, and judge to access in pixel point curve data distribution situation whether Normally (namely, if meet the default distributions such as long-tail distribution), and then judging the distribution situation of above-mentioned access data In the case of normal, by the size for comparing the corresponding access data of target pixel points and predetermined threshold value in each pixel Can just obtain each pixel it is corresponding access data whether be truly access data monitoring result, reached quantization and There is the whether real purpose of access data of the monitoring webpage of foundation, it is achieved thereby that whether monitoring web page access data is true Real technique effect, and then solve and cannot determine in the prior art the access whether real technical problem of data.
Brief description of the drawings
Accompanying drawing described herein is used for providing further understanding of the present application, constitutes the part of the application, this Shen Schematic description and description please does not constitute the improper restriction to the application for explaining the application.In accompanying drawing In:
Fig. 1 is a kind of flow chart of the monitoring method of the web page access data according to the embodiment of the present application;
Fig. 2 is a kind of schematic diagram of the pixel point curve according to the embodiment of the present application;And
Fig. 3 is a kind of schematic diagram of the monitoring device of the web page access data according to the embodiment of the present application.
Specific embodiment
In order that those skilled in the art more fully understand application scheme, below in conjunction with the embodiment of the present application Accompanying drawing, is clearly and completely described to the technical scheme in the embodiment of the present application, it is clear that described embodiment The only embodiment of the application part, rather than whole embodiments.Based on the embodiment in the application, ability The every other embodiment that domain those of ordinary skill is obtained under the premise of creative work is not made, should all belong to The scope of the application protection.
It should be noted that term " first ", " in the description and claims of this application and above-mentioned accompanying drawing Two " it is etc. for distinguishing similar object, without for describing specific order or precedence.It should be appreciated that this The data that sample is used can be exchanged in the appropriate case, so as to embodiments herein described herein can with except Here the order beyond those for illustrating or describing is implemented.Additionally, term " comprising " and " having " and they Any deformation, it is intended that covering is non-exclusive to be included, for example, containing process, the side of series of steps or unit Method, system, product or equipment are not necessarily limited to those steps clearly listed or unit, but may include unclear List or for these processes, method, product or other intrinsic steps of equipment or unit.
According to the embodiment of the present application, there is provided a kind of embodiment of the method for the monitoring method of web page access data is, it is necessary to say It is bright, can be in the such as one group computer system of computer executable instructions the step of the flow of accompanying drawing is illustrated Middle execution, and, although logical order is shown in flow charts, but in some cases, can be being different from Order herein performs shown or described step.
Fig. 1 is a kind of flow chart of the monitoring method of the web page access data according to the embodiment of the present application, as shown in figure 1, The method comprises the following steps S102 to step S110:
Step S102, obtains the corresponding access data of each pixel on target web, wherein, target web is to wait to supervise Survey the webpage for accessing data.
Specifically, target web can be any webpage for accessing data to be monitored, and it can be any treating also to be equivalent to It is determined that accessing the whether real webpage of data.
Wherein, each pixel it is corresponding access that data can be in the amount of being click on, session amount and mouse stay time appoint Meaning is a kind of, can specifically determine according to demand.
Step S104, will access data and is ranked up, each pixel point-rendering picture corresponding to the access data after sequence Vegetarian refreshments curve, wherein, pixel point curve is used to characterize between each pixel and the corresponding access data of each pixel Corresponding relation.
Specifically, both can be to be sorted from high to low, or to visiting to accessing data to access data to be ranked up Ask that data sort from low to high, the sortord for accessing data is not defined in the present embodiment.
Step S106, whether the corresponding distribution situation for accessing data of each pixel meets pre- in judging pixel point curve If distribution.
Step S108, in the case where judging that distribution situation meets default distribution, from the corresponding access of each pixel The corresponding access data of target pixel points are determined in data, wherein, target pixel points are the default screening conditions of basis, The pixel determined from each pixel.
Specifically, default screening conditions can be set according to demand, for example, default screening conditions are in each pixel The pixel that data come preceding N is accessed, wherein, the pixel that access data come preceding N refers to by each pixel pair The access data answered come the pixel of preceding N when sorting from high to low.The value of N can be set according to demand, for example: 20%.If N is 20%, the target pixel points in above-mentioned steps S108 access data in being each pixel Preceding 20% pixel is come, if so each pixel has 100, target pixel points are 20;If Each pixel has 200, then target pixel points are 40.
Step S110, according to corresponding access data and the predetermined threshold value of target pixel points, determines that each pixel is corresponding Access the monitoring result of data.
Specifically, monitoring result has two kinds, and one kind is truly to access data, and another kind is untrue access data.Such as Fruit monitoring result is truly to access data, illustrates that the most of access behavior represented by the access data of webpage is all true Visitor's behavior, then above-mentioned access data are mostly to access webpage by real user to produce;If instead monitoring result is Untrue access data, illustrate that the most of access behavior represented by the access data of webpage is not true visitor's behavior, Then above-mentioned access data are not mostly to access webpage by real user to produce.
Predetermined threshold value can be set according to demand, both can be percents, or numeric form, can be with It is decimal form.
Further, since each pixel is the pixel on target web, so the corresponding access data of each pixel Monitoring result be each pixel where webpage (that is, target web) access data monitoring result, It is exactly the monitoring result of the access data of above-mentioned webpage.
In the embodiment of the present application, the pixel that the corresponding relation between pixel and access data is characterized by drawing is bent Line, and judge whether the distribution situation that data are accessed in pixel point curve normal (namely, if meet long-tail distribution etc. Default distribution), and then in the case of the distribution situation for judging above-mentioned access data is normal, by comparing each pixel Corresponding data and the size of predetermined threshold value of accessing of target pixel points can just obtain the corresponding access number of each pixel in point According to whether be truly access data monitoring result, reached quantify and have foundation monitoring webpage access data whether Real purpose, it is achieved thereby that the monitoring whether real technique effect of web page access data, and then solve existing skill Cannot determine to access the whether real technical problem of data in art.
It should be noted that for the access data of each target web, can be by performing step S102 to step Whether S110 obtains the real monitoring result of access data of the target web.
It is alternatively possible to by two ways realize according to target pixel points it is corresponding access data and predetermined threshold value, really Determine the corresponding monitoring result for accessing data of each pixel, above two mode is specific as follows:
Mode one:It is specific as follows including step S1101 to step S1107:
Step S1101, calculates the corresponding access data summation for accessing data of target pixel points.
Specifically, the pixel (that is, target pixel points) determined from each pixel is usually multiple, on State step S1101 and namely calculate the corresponding access data sum of whole target pixel points, obtain accessing data summation.
Step S1103, judges to access whether data summation reaches predetermined threshold value.
Specifically, in the embodiment of the present application, predetermined threshold value is numeric form, for example, could be arranged to each pixel It is corresponding to access the 80% of data sum.
Step S1105, in the case where judging that accessing data summation reaches predetermined threshold value, determines that monitoring result is true It is real to access data, wherein, true access data are accessed for characterizing the corresponding data that access of each pixel for effective Data.
Specifically, above-mentioned steps S1105 is namely judging to access in the case that data summation reaches predetermined threshold value, Determine that the corresponding data that access of each pixel access data for effective, then illustrate that above-mentioned access data are mostly by true Real user accesses what webpage was produced.
Step S1107, in the case where judging to access data summation not up to predetermined threshold value, determines that monitoring result is Untrue access data, wherein, it is invalid that untrue access data are used to characterize the corresponding data that access of each pixel Access data.
Specifically, above-mentioned steps S1107 is namely in the case where judging to access data summations not up to predetermined threshold value, Determine each pixel it is corresponding access data be invalid access data, then illustrate above-mentioned access data be not mostly by Real user accesses what webpage was produced.
Mode two:It is specific as follows including step S1109 to step S11015:
Step S1109, calculates the corresponding access data summation for accessing data of target pixel points, and the step is with above-mentioned step Rapid S1101, is not repeated.
Step S11011, judges that accessing data summation accounts for whether total ratio for accessing data reaches predetermined threshold value, wherein, Total data that access are the corresponding access data sum of each pixel.
Specifically, in the embodiment of the present application, predetermined threshold value is percents or decimal form, for example, can set It is set to 80% or 0.8.
Step S11013, in the case where judging that ratio reaches predetermined threshold value, determines that monitoring result accesses number for true According to, wherein, it is effective access data that true access data are used to characterize the corresponding data that access of each pixel.
Specifically, above-mentioned steps S11013 determines each namely in the case where judging that ratio reaches predetermined threshold value The corresponding data that access of pixel access data for effective, then illustrate that above-mentioned access data are visited by real user Ask what webpage was produced.
Step S11015, in the case where ratio not up to predetermined threshold value is judged, determines that monitoring result is untrue visit Data are asked, wherein, it is invalid access number that untrue access data are used to characterize the corresponding data that access of each pixel According to.
Specifically, above-mentioned steps S11015 is namely in the case where ratio not up to predetermined threshold value is judged, it is determined that respectively The corresponding data that access of individual pixel are invalid access data, then illustrate that above-mentioned access data are not mostly by truly using Family accesses what webpage was produced.
It should be noted that any one in can according to demand selecting above two mode, determines each pixel The corresponding monitoring result for accessing data, that is, the monitoring result for determining the access data of webpage.
If the value of predetermined threshold value is that each pixel is corresponding accesses 80%, 0.8 or the 80% of data sum, Exactly judge target pixel points it is corresponding access data sum with each pixel it is corresponding access data sum compared with whether Meet " Pareto Law ".
Alternatively, in the embodiment of the present application, data will be accessed to be ranked up, it is corresponding to the access data after sequence Each pixel point-rendering pixel curve includes:Each pixel is ranked up from high to low according to data are accessed;Base Each pixel point-rendering pixel curve after being sorted from high to low according to access data.Judge each in pixel point curve Whether the corresponding distribution situation for accessing data of individual pixel meets default distribution includes:Judge in pixel point curve each Whether the corresponding distribution situation for accessing data of pixel meets long-tail distribution.
When it is click volume to access data, to the corresponding click volume of each pixel in certain webpage according to arranging from high to low Sequence, the schematic diagram of the pixel point curve drawn out based on the pixel after being sorted from high to low according to click volume may refer to Fig. 2.It should be noted that in Fig. 2, x-axis represents pixel, y-axis represents click volume, wherein, each pixel It is arranged in x-axis from high to low according to click volume.
Alternatively, in the embodiment of the present application, in the case where judging that distribution situation does not meet default distribution, it is determined that Monitoring result is untrue access data, wherein, untrue access data are used to characterize the corresponding access of each pixel Data are invalid access data.
Alternatively, in the embodiment of the present application, obtaining the corresponding data that access of each pixel on target web includes: Obtained from database and meet the corresponding access data of each pixel on pre-conditioned target web, wherein, preset Condition is comprised at least:Preset time period.
It is, from database extract target web under some screening conditions, the corresponding access number of each pixel According to, wherein, some screening conditions are pre-conditioned in above-described embodiment.
Specifically, database can be that any can get that each pixel on target pages is corresponding to access data Database, for example, thermodynamic chart database.
Alternatively, it is pre-conditioned to include default sources, default source class in addition to preset time period Type etc..Above-mentioned preset time period, default sources and default source type can be set according to demand.
For example, when it is pre-conditioned only include preset time period when, preset time period be 1 day to 2015 October in 2015 In on October 31, in, target web is the webpage of certain commodity A, and referred to as webpage A is then obtained from database On October 1st, 2015 between 31 days October in 2015, the corresponding access data of each pixel on webpage A.
For example, when pre-conditioned comprising preset time period and during default sources, preset time period is 2015 10 In on October 31st, 1 day 1 moon, it is Sina weibo to preset sources, and target web is the net of certain commodity A Page, referred to as webpage A is then obtained on October 1st, 2015 between 31 days October in 2015 from database The corresponding access data of each pixel on webpage A are accessed by Sina weibo.
In the embodiment of the present application, data can accordingly be accessed according to customer requirement retrieval, and then to above-mentioned access number According to whether being truly monitored, the effect for improving user satisfaction has been reached.
As a example by accessing data for click volume, it is described as follows:
During the same webpage of a large amount of guest access, the degree of concern performance with other affairs is consistent, there is general character and individual character Feature:The point of interest of i.e. most of visitors is similar, and the scope clicked on webpage also can be similar;But different visitors Personality is incomplete same, always has differences, and is not excluded for indivedual new visitors and is unfamiliar with webpage the overdue of generation hitting Situation, has the click in a small amount of other regions certainly.The click volume difference of general character and individual character has much, then depending on net The layout and own characteristic of page.Dissected under paths with other with a period of time, each picture of certain page (that is, webpage) The click volume of vegetarian refreshments is research object, there are following two features:
(1) click volume of each pixel is presented long-tail distribution on webpage, there is main body and long-tail two parts pixel;
(2) most of click volume is produced by a small amount of pixel, meets Pareto Law.
Specifically, long-tail depicts " small, and the only a few events of most events well respectively Scale is quite big ".
So click volume etc. browses data (that is, accessing data) while symbol in the web page access behavior of a large amount of true visitors Close Pareto Law and long-tail distribution, if it is, meet more than 2 points, from the entirety of the corresponding click volume of each pixel From the point of view of distribution, most access behavior meets true visitor's behavior.
With the access behavior of true visitor it is have one due to the access behavior of non-genuine visitor by the above Determine difference, the scheme combination various dimensions that the embodiment of the present application is provided dissect function, using each pixel pair on webpage The access data answered, judge whether the access data of above-mentioned webpage true, that is, judge in the visitor of the webpage whether Most of is non-genuine visitor, so as to obtain the monitoring result of the webpage.
According to the embodiment of the present application, a kind of monitoring device of web page access data is additionally provided, the web page access data Monitoring device is used to perform the monitoring method of the web page access data that the embodiment of the present application the above is provided, right below The monitoring device of the web page access data that the embodiment of the present application is provided does specific introduction:
Fig. 3 is a kind of schematic diagram of the monitoring device of the web page access data according to the embodiment of the present application, as shown in figure 3, The monitoring device mainly includes acquiring unit 31, drawing unit 33, judging unit 35, the first determining unit 37 and the Two determining units 39, wherein:
Acquiring unit 31, for obtaining the corresponding access data of each pixel on target web.
Specifically, target web can be any webpage for accessing data to be monitored, and it can be any treating also to be equivalent to It is determined that accessing the whether real webpage of data.
Wherein, each pixel it is corresponding access that data can be in the amount of being click on, session amount and mouse stay time appoint Meaning is a kind of, can specifically determine according to demand.
Drawing unit 33, is ranked up, to corresponding each pixel of access data after sequence for will access data Draw pixel point curve.
Specifically, both can be to be sorted from high to low, or to visiting to accessing data to access data to be ranked up Ask that data sort from low to high, the sortord for accessing data is not defined in the present embodiment.
Judging unit 35, for judging in pixel point curve whether is the corresponding distribution situation for accessing data of each pixel Meet default distribution.
First determining unit 37, in the case where judging that distribution situation meets default distribution, from each pixel The corresponding access data of target pixel points are determined in corresponding access data, wherein, target pixel points are according to default Screening conditions, the pixel determined from each pixel.
Specifically, default screening conditions can be set according to demand, for example, default screening conditions are in each pixel The pixel that data come preceding N is accessed, wherein, the pixel that access data come preceding N refers to by each pixel pair The access data answered come the pixel of preceding N when sorting from high to low.The value of N can be set according to demand, for example: 20%.If N is 20%, the target pixel points in above-mentioned steps S108 access data in being each pixel Preceding 20% pixel is come, if so each pixel has 100, target pixel points are 20;If Each pixel has 200, then target pixel points are 40.
Second determining unit 39, for according to corresponding access data and the predetermined threshold value of target pixel points, determining each picture The corresponding monitoring result for accessing data of vegetarian refreshments.
Specifically, monitoring result has two kinds, and one kind is truly to access data, and another kind is untrue access data.Such as Fruit monitoring result is truly to access data, illustrates that the most of access behavior represented by the access data of webpage is all true Visitor's behavior, then above-mentioned access data are mostly to access webpage by real user to produce;If instead monitoring result is Untrue access data, illustrate that the most of access behavior represented by the access data of webpage is not true visitor's behavior, Then above-mentioned access data are not mostly to access webpage by real user to produce.
Predetermined threshold value can be set according to demand, both can be percents, or numeric form, can be with It is decimal form.
Further, since each pixel is the pixel on target web, so the corresponding access data of each pixel Monitoring result be each pixel where webpage (that is, target web) access data monitoring result, It is exactly the monitoring result of the access data of above-mentioned webpage.
In the embodiment of the present application, the pixel that the corresponding relation between pixel and access data is characterized by drawing is bent Line, and judge whether the distribution situation that data are accessed in pixel point curve normal (namely, if meet long-tail distribution etc. Default distribution), and then in the case of the distribution situation for judging above-mentioned access data is normal, by comparing each pixel Corresponding data and the size of predetermined threshold value of accessing of target pixel points can just obtain the corresponding access number of each pixel in point According to whether be truly access data monitoring result, reached quantify and have foundation monitoring webpage access data whether Real purpose, it is achieved thereby that the monitoring whether real technique effect of web page access data, and then solve existing skill Cannot determine to access the whether real technical problem of data in art.
It should be noted that for the access data of each target web, can be by calling acquiring unit, drawing Whether the access data that unit, judging unit, the first determining unit and the second determining unit obtain the target web are true Monitoring result.
It is alternatively possible to the access data sum and predetermined threshold value according to target pixel points are realized by two ways, really The monitoring result of the access data of fixed each pixel, above two mode is specific as follows:
Mode one:Second determining unit includes:First computing module, the first judge module, the first determining module and Two determining modules, wherein:
First computing module, for calculating the corresponding access data summation for accessing data of target pixel points.
Specifically, the pixel (that is, target pixel points) determined from each pixel is usually multiple, on Whole target pixel points are corresponding to access data sum namely for calculating to state the first computing module, obtains accessing number According to summation.
First judge module, for judging that accessing data summation accounts for whether total ratio for accessing data reaches predetermined threshold value, Wherein, total data that access are the corresponding access data sum of each pixel.
Specifically, in the embodiment of the present application, predetermined threshold value is numeric form, for example, could be arranged to each pixel It is corresponding to access the 80% of data sum.
First determining module, in the case where judging that ratio reaches predetermined threshold value, determining that monitoring result is true Data are accessed, wherein, it is effective access number that true access data are used to characterize the corresponding data that access of each pixel According to.
Specifically, above-mentioned first determining module is namely for judging that accessing data summation reaches the feelings of predetermined threshold value Under condition, determine that the corresponding data that access of each pixel access data for effective, then illustrate above-mentioned access data mostly It is to access webpage by real user to produce.
Second determining module, in the case where ratio not up to predetermined threshold value is judged, determining monitoring result for not It is true to access data, wherein, it is invalid that untrue access data are used to characterize the corresponding data that access of each pixel Access data.
Specifically, above-mentioned second determining module is namely for judging to access data summations not up to predetermined threshold value In the case of, determine that the corresponding data that access of each pixel are invalid access data, then illustrate that above-mentioned access data are big It is not to access webpage by real user to produce.
Mode one:Second determining unit includes:Second computing module, the 3rd judge module, the 3rd determining module and Four determining modules, wherein:
Second computing module, for calculating the corresponding access data summation for accessing data of target pixel points.
3rd judge module, for judging to access whether data summation reaches predetermined threshold value.
Specifically, in the embodiment of the present application, predetermined threshold value is percents or decimal form, for example, can set It is set to 80% or 0.8.
3rd determining module, in the case where judging that accessing data summation reaches predetermined threshold value, it is determined that monitoring knot Fruit accesses data for true, wherein, it is effective for truly access data being used to characterize the corresponding data that access of each pixel Access data.
Specifically, above-mentioned 3rd determining module is namely in the case where judging that ratio reaches predetermined threshold value, really The corresponding data that access of fixed each pixel access data for effective, then illustrate that above-mentioned access data are mostly by true User accesses what webpage was produced.
4th determining module, in the case where judging to access data summation not up to predetermined threshold value, it is determined that monitoring Result is untrue access data, wherein, untrue access data are used to characterize the corresponding access data of each pixel It is invalid access data.
Specifically, above-mentioned 4th determining module is namely in the case where ratio not up to predetermined threshold value is judged, Determine each pixel it is corresponding access data be invalid access data, then illustrate above-mentioned access data be not mostly by Real user accesses what webpage was produced.
It should be noted that any one in can according to demand selecting above two mode, determines each pixel The corresponding monitoring result for accessing data, that is, the monitoring result for determining the access data of webpage.
Alternatively, in the embodiment of the present application, drawing unit includes:Order module, for according to access data by height Each pixel is ranked up to low;Drafting module, for based on each after data sort from high to low according to accessing Individual pixel point-rendering pixel curve;Judging unit includes:Second judge module, it is each in pixel point curve for judging Whether the corresponding distribution situation for accessing data of individual pixel meets long-tail distribution.
Alternatively, in the embodiment of the present application, device also includes:5th determining module, for judging to be distributed feelings In the case that condition does not meet default distribution, monitoring result is determined for untrue access data, wherein, untrue access number It is invalid access data according to for characterizing the corresponding data that access of each pixel.
Alternatively, in the embodiment of the present application, acquiring unit includes:Acquisition module, for obtaining full from database The corresponding access data of each pixel on the pre-conditioned target web of foot, wherein, it is pre-conditioned to comprise at least:In advance If the time period.
Above-mentioned acquisition module namely for from database extract target web under some screening conditions, each pixel The corresponding access data of point, wherein, some screening conditions are pre-conditioned in above-described embodiment.
Alternatively, it is pre-conditioned to include default sources, default source class in addition to preset time period Type etc..Above-mentioned preset time period, default sources and default source type can be set according to demand.
In the embodiment of the present application, data can accordingly be accessed according to customer requirement retrieval, and then to above-mentioned access number According to whether being truly monitored, the effect for improving user satisfaction has been reached.
The monitoring device of the web page access data include processor and memory, above-mentioned acquiring unit, drawing unit, Judging unit, the first determining unit and second determining unit etc. are stored in memory, by processing as program unit Device performs storage said procedure unit in memory.
Kernel is included in processor, is gone in memory to transfer corresponding program unit by kernel.Kernel can set one Or more, determine whether the access data of webpage are true by adjusting kernel parameter.
Memory potentially includes the volatile memory in computer-readable medium, random access memory (RAM) and/ Or the form, such as read-only storage (ROM) or flash memory (flash RAM) such as Nonvolatile memory, memory includes at least one Individual storage chip.
Present invention also provides a kind of embodiment of computer program product, when being performed on data processing equipment, fit In the program code for performing initialization there are as below methods step:Obtain the corresponding access number of each pixel on target web According to;The access data are ranked up, described each pixel point-rendering pixel corresponding to the access data after sequence Point curve;Judge whether the corresponding distribution situation for accessing data of each pixel described in the pixel point curve meets Default distribution;In the case where judging that the distribution situation meets the default distribution, from described each pixel pair The corresponding access data of target pixel points are determined in the access data answered, wherein, the target pixel points are according to pre- If screening conditions, the pixel determined from described each pixel;According to the corresponding access of the target pixel points Data and predetermined threshold value, it is determined that the corresponding monitoring result for accessing data of described each pixel.
In above-described embodiment of the application, the description to each embodiment all emphasizes particularly on different fields, and does not have in certain embodiment The part of detailed description, may refer to the associated description of other embodiment.
In several embodiments provided herein, it should be understood that disclosed technology contents, can be by other Mode realize.Wherein, device embodiment described above is only schematical, such as division of described unit, Can be a kind of division of logic function, there can be other dividing mode when actually realizing, for example multiple units or component Can combine or be desirably integrated into another system, or some features can be ignored, or do not perform.It is another, institute Display or the coupling each other for discussing or direct-coupling or communication connection can be by some interfaces, unit or mould The INDIRECT COUPLING of block or communication connection, can be electrical or other forms.
The unit that is illustrated as separating component can be or may not be it is physically separate, it is aobvious as unit The part for showing can be or may not be physical location, you can with positioned at a place, or can also be distributed to On multiple units.Some or all of unit therein can be according to the actual needs selected to realize this embodiment scheme Purpose.
In addition, during each functional unit in the application each embodiment can be integrated in a processing unit, it is also possible to It is that unit is individually physically present, it is also possible to which two or more units are integrated in a unit.It is above-mentioned integrated Unit can both be realized in the form of hardware, it would however also be possible to employ the form of SFU software functional unit is realized.
If the integrated unit is to realize in the form of SFU software functional unit and as independent production marketing or when using, Can store in a computer read/write memory medium.Based on such understanding, the technical scheme essence of the application On all or part of the part that is contributed to prior art in other words or the technical scheme can be with software product Form is embodied, and the computer software product is stored in a storage medium, including some instructions are used to so that one Platform computer equipment (can be personal computer, server or network equipment etc.) performs each embodiment institute of the application State all or part of step of method.And foregoing storage medium includes:USB flash disk, read-only storage (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disc or CD Etc. it is various can be with the medium of store program codes.
The above is only the preferred embodiment of the application, it is noted that for the ordinary skill people of the art For member, on the premise of the application principle is not departed from, some improvements and modifications can also be made, these improve and moisten Decorations also should be regarded as the protection domain of the application.

Claims (10)

1. a kind of monitoring method of web page access data, it is characterised in that including:
Obtain the corresponding access data of each pixel on target web;
The access data are ranked up, described each pixel point-rendering corresponding to the access data after sequence Pixel point curve;
Judge whether the corresponding distribution situation for accessing data of each pixel described in the pixel point curve meets Default distribution;
In the case where judging that the distribution situation meets the default distribution, from described each pixel correspondence Access data in determine the corresponding access data of target pixel points, wherein, according to the target pixel points Default screening conditions, the pixel determined from described each pixel;
According to corresponding access data and the predetermined threshold value of the target pixel points, it is determined that described each pixel correspondence Access data monitoring result.
2. method according to claim 1, it is characterised in that according to the corresponding access data of the target pixel points And predetermined threshold value, it is determined that the corresponding monitoring result for accessing data of described each pixel includes:
Calculate the corresponding access data summation for accessing data of the target pixel points;
Judge whether the access data summation reaches the predetermined threshold value;
In the case where judging that the access data summation reaches the predetermined threshold value, the monitoring result is determined Data are accessed for true, wherein, the true access data are used to characterize the corresponding access of described each pixel Data access data for effective;
In the case where judging that the access data summation is not up to the predetermined threshold value, the monitoring knot is determined Fruit is untrue access data, wherein, the untrue access data are used to characterize each pixel correspondence Access data be invalid access data.
3. method according to claim 1, it is characterised in that according to the corresponding access data of the target pixel points And predetermined threshold value, it is determined that the corresponding monitoring result for accessing data of described each pixel includes:
Calculate the corresponding access data summation for accessing data of the target pixel points;
Judge that the access data summation accounts for whether total ratio for accessing data reaches the predetermined threshold value, wherein, Total access data are that described each pixel is corresponding accesses data sum;
In the case where judging that the ratio reaches the predetermined threshold value, determine that the monitoring result is true visit Data are asked, wherein, the true access data are used to characterize the corresponding data that access of described each pixel to have The access data of effect;
In the case where judging that the ratio is not up to the predetermined threshold value, determine that the monitoring result is untrue It is real to access data, wherein, the untrue access data are used to characterize the corresponding access number of described each pixel According to being invalid access data.
4. method according to claim 1, it is characterised in that be ranked up the access data, after sequence Corresponding described each pixel point-rendering pixel curve of access data include:
Described each pixel is ranked up from high to low according to the access data;
Based on according to access data sort from high to low after described in each pixel point curve described in pixel point-rendering;
Judge whether the corresponding distribution situation for accessing data of each pixel described in the pixel point curve meets Default distribution includes:Judge the corresponding distribution feelings for accessing data of each pixel described in the pixel point curve Whether condition meets long-tail distribution.
5. method according to claim 1, it is characterised in that judging that it is described pre- that the distribution situation does not meet If in the case of distribution, determining the monitoring result for untrue access data, wherein, the untrue access It is invalid access data that data are used to characterize the corresponding data that access of described each pixel.
6. method according to claim 1, it is characterised in that obtain the corresponding visit of each pixel on target web Ask that data include:
Obtained from database and meet the corresponding access of described each pixel on the pre-conditioned target web Data, wherein, it is described pre-conditioned to comprise at least:Preset time period.
7. method according to claim 1, it is characterised in that the corresponding data that access of described each pixel include Below any one:Click volume, session amount and mouse stay time.
8. a kind of monitoring device of web page access data, it is characterised in that including:
Acquiring unit, for obtaining the corresponding access data of each pixel on target web;
Drawing unit, it is corresponding to the access data after sequence described for the access data to be ranked up Each pixel point-rendering pixel curve;
Judging unit, dividing for data is accessed for judging that each pixel described in the pixel point curve is corresponding Whether cloth situation meets default distribution;
First determining unit, in the case where judging that the distribution situation meets the default distribution, from Described each pixel is corresponding to be accessed and determine the corresponding access data of target pixel points in data, wherein, institute It is according to default screening conditions, the pixel determined from described each pixel to state target pixel points;
Second determining unit, for according to the target pixel points it is corresponding access data and predetermined threshold value, it is determined that The corresponding monitoring result for accessing data of described each pixel.
9. device according to claim 8, it is characterised in that second determining unit includes:
First computing module, for calculating the corresponding access data summation for accessing data of the target pixel points;
First judge module, for judging that the access data summation accounts for whether total ratio for accessing data reaches institute Predetermined threshold value is stated, wherein, total access data are that described each pixel is corresponding accesses data sum;
First determining module, in the case where judging that the ratio reaches the predetermined threshold value, determining institute State monitoring result and access data for true, wherein, the true access data are used to characterize described each pixel Corresponding access data access data for effective;
Second determining module, in the case where judging that the ratio is not up to the predetermined threshold value, it is determined that The monitoring result is untrue access data, wherein, the untrue access data be used to characterizing it is described each The corresponding data that access of pixel are invalid access data.
10. device according to claim 8, it is characterised in that
The drawing unit includes:Order module, for according to it is described access data from high to low to it is described each Pixel is ranked up;Drafting module, for based on each pixel after being sorted from high to low according to access data Pixel point curve described in point-rendering;
The judging unit includes:Second judge module, for judging each pixel in the pixel point curve Whether the corresponding distribution situation for accessing data meets long-tail distribution.
CN201511032354.5A 2015-12-31 2015-12-31 Method and device for monitoring webpage access data Active CN106933905B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201511032354.5A CN106933905B (en) 2015-12-31 2015-12-31 Method and device for monitoring webpage access data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201511032354.5A CN106933905B (en) 2015-12-31 2015-12-31 Method and device for monitoring webpage access data

Publications (2)

Publication Number Publication Date
CN106933905A true CN106933905A (en) 2017-07-07
CN106933905B CN106933905B (en) 2019-12-24

Family

ID=59444716

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201511032354.5A Active CN106933905B (en) 2015-12-31 2015-12-31 Method and device for monitoring webpage access data

Country Status (1)

Country Link
CN (1) CN106933905B (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109729054A (en) * 2017-10-31 2019-05-07 阿里巴巴集团控股有限公司 Access data monitoring method and relevant device
CN109976985A (en) * 2017-12-27 2019-07-05 北京国双科技有限公司 A kind of method for drafting and device of thermodynamic chart
CN111104559A (en) * 2018-10-29 2020-05-05 百度在线网络技术(北京)有限公司 Method and device for dividing distribution form of user data
US20220391528A1 (en) * 2020-02-20 2022-12-08 Beijing Bytedance Network Technology Co., Ltd. Online document display method and apparatus, device and medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101799830A (en) * 2010-03-25 2010-08-11 北京国双科技有限公司 Flow data processing method capable of realizing multi-dimensional free analysis
CN102075352A (en) * 2010-12-17 2011-05-25 北京邮电大学 Method and device for predicting network user behavior
CN102279786A (en) * 2011-08-25 2011-12-14 百度在线网络技术(北京)有限公司 Method and device for monitoring effective access amount of application program
CN103559277A (en) * 2013-11-06 2014-02-05 北京国双科技有限公司 Data processing method and device for webpage page click quantity statistics
CN103593415A (en) * 2013-10-29 2014-02-19 北京国双科技有限公司 Method and device for detecting cheating on visitor volumes of web pages
CN104580447A (en) * 2014-12-29 2015-04-29 中国科学院计算机网络信息中心 Spatio-temporal data service scheduling method based on access heat
CN105072089A (en) * 2015-07-10 2015-11-18 中国科学院信息工程研究所 WEB malicious scanning behavior abnormity detection method and system
CN105119764A (en) * 2015-09-29 2015-12-02 百度在线网络技术(北京)有限公司 Method and device for monitoring flow

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101799830A (en) * 2010-03-25 2010-08-11 北京国双科技有限公司 Flow data processing method capable of realizing multi-dimensional free analysis
CN102075352A (en) * 2010-12-17 2011-05-25 北京邮电大学 Method and device for predicting network user behavior
CN102279786A (en) * 2011-08-25 2011-12-14 百度在线网络技术(北京)有限公司 Method and device for monitoring effective access amount of application program
CN103593415A (en) * 2013-10-29 2014-02-19 北京国双科技有限公司 Method and device for detecting cheating on visitor volumes of web pages
CN103559277A (en) * 2013-11-06 2014-02-05 北京国双科技有限公司 Data processing method and device for webpage page click quantity statistics
CN104580447A (en) * 2014-12-29 2015-04-29 中国科学院计算机网络信息中心 Spatio-temporal data service scheduling method based on access heat
CN105072089A (en) * 2015-07-10 2015-11-18 中国科学院信息工程研究所 WEB malicious scanning behavior abnormity detection method and system
CN105119764A (en) * 2015-09-29 2015-12-02 百度在线网络技术(北京)有限公司 Method and device for monitoring flow

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
王进 等: ""基于大偏差统计模型的Http-Flood DDoS 检测机制及性能分析"", 《软件学报》 *

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109729054A (en) * 2017-10-31 2019-05-07 阿里巴巴集团控股有限公司 Access data monitoring method and relevant device
CN109729054B (en) * 2017-10-31 2021-08-13 阿里巴巴集团控股有限公司 Access data monitoring method and related equipment
CN109976985A (en) * 2017-12-27 2019-07-05 北京国双科技有限公司 A kind of method for drafting and device of thermodynamic chart
CN111104559A (en) * 2018-10-29 2020-05-05 百度在线网络技术(北京)有限公司 Method and device for dividing distribution form of user data
US20220391528A1 (en) * 2020-02-20 2022-12-08 Beijing Bytedance Network Technology Co., Ltd. Online document display method and apparatus, device and medium

Also Published As

Publication number Publication date
CN106933905B (en) 2019-12-24

Similar Documents

Publication Publication Date Title
CN107818344B (en) Method and system for classifying and predicting user behaviors
CN102929939B (en) The offer method and device of customized information
CN107730389A (en) Electronic installation, insurance products recommend method and computer-readable recording medium
US11275748B2 (en) Influence score of a social media domain
CN101493832A (en) Website content combine recommendation system and method
CN106933905A (en) The monitoring method and device of web page access data
CN106528777A (en) Cross-screen user identification normalizing method and system
CN111400586A (en) Group display method, terminal, server, system and storage medium
CN106708841A (en) Website access path aggregation method and apparatus
CN105786965A (en) URL-based user behavior analysis method and device
CN111782953A (en) Recommendation method, device, equipment and storage medium
CN111738785A (en) Product selection method, system and storage medium
CN112232933A (en) House source information recommendation method, device, equipment and readable storage medium
CN111861605A (en) Business object recommendation method
Shamsuzzoha et al. The role of human capital on the performance of manufacturing firms in Bangladesh
CN103049497A (en) Method and device for website navigation
Chiappini Do overseas investments create or replace trade? New insights from a macro-sectoral study on Japan
JP2009289172A (en) Conduct history analysis system and its method
CN105389714B (en) Method for identifying user characteristics from behavior data
CN106372158A (en) Method and device for processing user behavior data
CN108959289B (en) Website category acquisition method and device
WO2015149550A1 (en) Method and apparatus for determining grades of links within website
CN104636470A (en) Method and device for recommending business information
Miniukovich et al. Visual diversity and user interface quality
CN107436940A (en) The method of web front-end Dynamic Display data based on user profile behavioural analysis

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing

Applicant after: Beijing Guoshuang Technology Co.,Ltd.

Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing

Applicant before: Beijing Guoshuang Technology Co.,Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant