CN106933905A - The monitoring method and device of web page access data - Google Patents
- Publication number
- CN106933905A, CN201511032354.5A
- Authority
- CN
- China
- Prior art keywords
- pixel
- data
- access data
- access
- judging
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Classifications
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/90—Details of database functions independent of the retrieved data types
- G06F16/95—Retrieval from the web
- G06F16/958—Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking
Abstract
This application discloses a method and device for monitoring web page access data. The method includes: obtaining the access data corresponding to each pixel on a target web page; sorting the access data and drawing a pixel curve from the pixels corresponding to the sorted access data; judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution; when the distribution is judged to conform to the preset distribution, determining the access data corresponding to target pixels from the access data corresponding to each pixel, where the target pixels are pixels selected from all the pixels according to a preset screening condition; and determining a monitoring result for the access data corresponding to each pixel according to the access data corresponding to the target pixels and a preset threshold. The application solves the technical problem in the prior art that it cannot be determined whether access data is genuine.
Description
Technical field
The application relates to the computer field, and in particular to a method and device for monitoring web page access data.
Background
With the popularization and development of the Internet, more and more users obtain information and conduct transactions online, and the volume of access data collected from Internet users has grown accordingly. More product providers have begun to use the Internet as a platform to promote, sell, and maintain their products, which has driven a sharp rise in demand for processing and presenting access data. In the prior art, data providers mostly present changes in web page access data by means of charts (for example, heat maps) and tables.
A heat map is an effective and intuitive way of displaying web page access data. It can display a web page's access data within a preset time interval and, combined with multi-dimensional analysis functions, is applied to user experience optimization (UEO), visitor behavior analysis, assessment of overall web page performance, and similar tasks.
As can be seen from the above, data providers currently focus only on presenting the access behavior of the visitor population (that is, displaying the access data). The demand side (that is, the product provider) therefore sees only a static presentation of visitor access behavior (that is, the access data), with no assessment of the authenticity of that access data, and so cannot know whether the access behavior shown in the heat map is the genuine access behavior of visitors.
No effective solution to the above problem has yet been proposed.
Summary of the invention
The embodiments of the present application provide a method and device for monitoring web page access data, to at least solve the technical problem in the prior art that it cannot be determined whether access data is genuine.
According to one aspect of the embodiments of the present application, a method for monitoring web page access data is provided, including: obtaining the access data corresponding to each pixel on a target web page; sorting the access data and drawing a pixel curve from the pixels corresponding to the sorted access data; judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution; when the distribution is judged to conform to the preset distribution, determining the access data corresponding to target pixels from the access data corresponding to each pixel, where the target pixels are pixels selected from all the pixels according to a preset screening condition; and determining, according to the access data corresponding to the target pixels and a preset threshold, a monitoring result for the access data corresponding to each pixel.
Further, determining the monitoring result for the access data corresponding to each pixel according to the access data corresponding to the target pixels and the preset threshold includes: calculating the sum of the access data corresponding to the target pixels; judging whether that sum reaches the preset threshold; when the sum is judged to reach the preset threshold, determining that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data; and when the sum is judged not to reach the preset threshold, determining that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, determining the monitoring result for the access data corresponding to each pixel according to the access data corresponding to the target pixels and the preset threshold includes: calculating the sum of the access data corresponding to the target pixels; judging whether the ratio of that sum to the total access data reaches the preset threshold, where the total access data is the sum of the access data corresponding to all the pixels; when the ratio is judged to reach the preset threshold, determining that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data; and when the ratio is judged not to reach the preset threshold, determining that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, sorting the access data and drawing a pixel curve from the pixels corresponding to the sorted access data includes: sorting the pixels from high to low according to their access data; and drawing the pixel curve based on the pixels sorted from high to low by access data. Judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution includes: judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a long-tail distribution.
Further, when the distribution is judged not to conform to the preset distribution, the monitoring result is determined to be untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, obtaining the access data corresponding to each pixel on the target web page includes: obtaining from a database the access data corresponding to each pixel on the target web page that satisfies a preset condition, where the preset condition includes at least a preset time period.
Further, the access data corresponding to each pixel includes any one of the following: click volume, session count, and mouse dwell time.
According to another aspect of the embodiments of the present application, a device for monitoring web page access data is also provided, including: an acquiring unit, configured to obtain the access data corresponding to each pixel on a target web page; a drawing unit, configured to sort the access data and draw a pixel curve from the pixels corresponding to the sorted access data; a judging unit, configured to judge whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution; a first determining unit, configured to determine, when the distribution is judged to conform to the preset distribution, the access data corresponding to target pixels from the access data corresponding to each pixel, where the target pixels are pixels selected from all the pixels according to a preset screening condition; and a second determining unit, configured to determine, according to the access data corresponding to the target pixels and a preset threshold, a monitoring result for the access data corresponding to each pixel.
Further, the second determining unit includes: a first computing module, configured to calculate the sum of the access data corresponding to the target pixels; a first judging module, configured to judge whether the ratio of that sum to the total access data reaches the preset threshold, where the total access data is the sum of the access data corresponding to all the pixels; a first determining module, configured to determine, when the ratio is judged to reach the preset threshold, that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data; and a second determining module, configured to determine, when the ratio is judged not to reach the preset threshold, that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, the drawing unit includes: a sorting module, configured to sort the pixels from high to low according to their access data; and a drawing module, configured to draw the pixel curve based on the pixels sorted from high to low by access data. The judging unit includes: a second judging module, configured to judge whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a long-tail distribution.
Further, the second determining unit includes: a second computing module, configured to calculate the sum of the access data corresponding to the target pixels; a third judging module, configured to judge whether that sum reaches the preset threshold; a third determining module, configured to determine, when the sum is judged to reach the preset threshold, that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data; and a fourth determining module, configured to determine, when the sum is judged not to reach the preset threshold, that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, the device also includes: a fifth determining module, configured to determine, when the distribution is judged not to conform to the preset distribution, that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Further, the acquiring unit includes: an acquiring module, configured to obtain from a database the access data corresponding to each pixel on the target web page that satisfies a preset condition, where the preset condition includes at least a preset time period.
Further, the access data corresponding to each pixel includes any one of the following: click volume, session count, and mouse dwell time.
In the embodiments of the present application, the access data corresponding to each pixel on the target web page is obtained; the access data is sorted and a pixel curve is drawn from the pixels corresponding to the sorted access data; whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution is judged; when the distribution is judged to conform to the preset distribution, the access data corresponding to target pixels is determined from the access data corresponding to each pixel, where the target pixels are pixels selected from all the pixels according to a preset screening condition; and the monitoring result for the access data corresponding to each pixel is determined according to the access data corresponding to the target pixels and a preset threshold. In other words, a pixel curve characterizing the correspondence between pixels and access data is drawn, and whether the distribution of the access data in the pixel curve is normal (that is, whether it conforms to a preset distribution such as a long-tail distribution) is judged. When the distribution is judged to be normal, comparing the access data corresponding to the target pixels with the preset threshold yields a monitoring result indicating whether the access data corresponding to each pixel is true access data. This achieves the purpose of monitoring, in a quantified and well-founded way, whether a web page's access data is genuine, and thereby solves the technical problem in the prior art that it cannot be determined whether access data is genuine.
Brief description of the drawings
The drawings described here provide a further understanding of the application and form a part of it. The illustrative embodiments of the application and their descriptions are used to explain the application and do not unduly limit it. In the drawings:
Fig. 1 is a flowchart of a method for monitoring web page access data according to an embodiment of the present application;
Fig. 2 is a schematic diagram of a pixel curve according to an embodiment of the present application; and
Fig. 3 is a schematic diagram of a device for monitoring web page access data according to an embodiment of the present application.
Detailed description
To help those skilled in the art better understand the solution of the present application, the technical solutions in the embodiments of the present application are described clearly and completely below with reference to the drawings of those embodiments. The described embodiments are only some of the embodiments of the application, not all of them. All other embodiments obtained by a person of ordinary skill in the art on the basis of the embodiments in this application, without creative effort, fall within the scope of protection of this application.
It should be noted that the terms "first", "second", and so on in the description and claims of this application and in the above drawings are used to distinguish similar objects, not to describe a specific order or sequence. It should be understood that data used in this way may be interchanged where appropriate, so that the embodiments described here can be implemented in orders other than those illustrated or described. In addition, the terms "include" and "have" and any variants of them are intended to cover non-exclusive inclusion; for example, a process, method, system, product, or device that contains a series of steps or units is not necessarily limited to the steps or units expressly listed, but may include other steps or units that are not expressly listed or that are inherent to the process, method, product, or device.
According to an embodiment of the present application, a method embodiment of a method for monitoring web page access data is provided. It should be noted that the steps shown in the flowchart of the drawings may be executed in a computer system, such as one running a set of computer-executable instructions, and that although a logical order is shown in the flowchart, in some cases the steps shown or described may be performed in a different order.
Fig. 1 is a flowchart of a method for monitoring web page access data according to an embodiment of the present application. As shown in Fig. 1, the method includes the following steps S102 to S110:
Step S102: obtain the access data corresponding to each pixel on a target web page, where the target web page is the web page whose access data is to be monitored.
Specifically, the target web page can be any web page whose access data is to be monitored, which is equivalent to any web page for which it is to be determined whether the access data is genuine.
The access data corresponding to each pixel can be any one of click volume, session count, and mouse dwell time, chosen according to requirements.
Step S104: sort the access data and draw a pixel curve from the pixels corresponding to the sorted access data, where the pixel curve characterizes the correspondence between each pixel and the access data corresponding to that pixel.
Specifically, the access data may be sorted either from high to low or from low to high; this embodiment does not limit the sort order of the access data.
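The sorting in step S104 can be sketched as follows. This is a minimal illustration, not the applicant's implementation: representing a pixel as an `(x, y)` tuple and the name `build_pixel_curve` are assumptions made here. The position of each entry in the returned list plays the role of the x-coordinate of the pixel curve, and the access value is the y-coordinate.

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int]  # (x, y) page coordinate; a hypothetical representation


def build_pixel_curve(
    access: Dict[Pixel, float], descending: bool = True
) -> List[Tuple[Pixel, float]]:
    """Return (pixel, access value) pairs ordered by access value.

    Either sort direction is allowed, as in step S104. Ties are broken by
    pixel coordinate so that the ordering is deterministic.
    """
    if descending:
        return sorted(access.items(), key=lambda item: (-item[1], item[0]))
    return sorted(access.items(), key=lambda item: (item[1], item[0]))


clicks = {(10, 20): 5, (11, 20): 120, (300, 40): 1, (12, 21): 40}
curve = build_pixel_curve(clicks)
print([value for _, value in curve])  # [120, 40, 5, 1]
```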
Step S106: judge whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution.
Step S108: when the distribution is judged to conform to the preset distribution, determine the access data corresponding to target pixels from the access data corresponding to each pixel, where the target pixels are pixels selected from all the pixels according to a preset screening condition.
Specifically, the preset screening condition can be set according to requirements. For example, the preset screening condition may select the pixels whose access data ranks in the top N, meaning the top N pixels when the pixels are sorted from high to low by their access data. The value of N can be set according to requirements, for example 20%. If N is 20%, the target pixels in step S108 are the pixels whose access data ranks in the top 20% of all pixels: with 100 pixels there are 20 target pixels, and with 200 pixels there are 40.
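The top-N screening condition described above can be sketched as follows. The function name `select_target_pixels` and the rounding rule for pixel counts that do not divide evenly are assumptions; the text only gives the exact cases of 100 and 200 pixels.

```python
from typing import Dict, Hashable, List


def select_target_pixels(
    access: Dict[Hashable, float], top_fraction: float = 0.2
) -> List[Hashable]:
    """Return the pixels whose access data ranks in the top `top_fraction`."""
    if not 0 < top_fraction <= 1:
        raise ValueError("top_fraction must be in (0, 1]")
    # Rank pixels from high to low by their access data.
    ranked = sorted(access, key=lambda pixel: access[pixel], reverse=True)
    # Assumption: round down; the small epsilon guards against float error.
    count = int(len(ranked) * top_fraction + 1e-9)
    return ranked[:count]


hundred = {i: 1000 - i for i in range(100)}
two_hundred = {i: 1000 - i for i in range(200)}
print(len(select_target_pixels(hundred)))      # 20
print(len(select_target_pixels(two_hundred)))  # 40
```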
Step S110: determine the monitoring result for the access data corresponding to each pixel according to the access data corresponding to the target pixels and a preset threshold.
Specifically, there are two possible monitoring results: true access data and untrue access data. If the monitoring result is true access data, most of the access behavior represented by the web page's access data is genuine visitor behavior, and the access data was mostly produced by real users visiting the web page. Conversely, if the monitoring result is untrue access data, most of the access behavior represented by the web page's access data is not genuine visitor behavior, and the access data was mostly not produced by real users visiting the web page.
The preset threshold can be set according to requirements and can take the form of a percentage, a numeric value, or a decimal.
In addition, because the pixels are pixels on the target web page, the monitoring result for the access data corresponding to each pixel is the monitoring result for the access data of the web page on which those pixels are located (that is, the target web page).
In the embodiments of the present application, a pixel curve characterizing the correspondence between pixels and access data is drawn, and whether the distribution of the access data in the pixel curve is normal (that is, whether it conforms to a preset distribution such as a long-tail distribution) is judged. When the distribution is judged to be normal, comparing the access data corresponding to the target pixels with the preset threshold yields a monitoring result indicating whether the access data corresponding to each pixel is true access data. This achieves the purpose of monitoring, in a quantified and well-founded way, whether a web page's access data is genuine, and thereby solves the technical problem in the prior art that it cannot be determined whether access data is genuine.
It should be noted that, for the access data of each target web page, a monitoring result indicating whether that web page's access data is genuine can be obtained by performing steps S102 to S110.
Optionally, determining the monitoring result for the access data corresponding to each pixel according to the access data corresponding to the target pixels and the preset threshold can be implemented in either of two ways, as follows:
Mode one includes steps S1101 to S1107, as follows:
Step S1101: calculate the sum of the access data corresponding to the target pixels.
Specifically, the pixels selected from all the pixels (that is, the target pixels) are usually multiple, and step S1101 adds up the access data corresponding to all the target pixels to obtain the access data sum.
Step S1103: judge whether the access data sum reaches the preset threshold.
Specifically, in this embodiment the preset threshold is a numeric value; for example, it can be set to 80% of the sum of the access data corresponding to all the pixels.
Step S1105: when the access data sum is judged to reach the preset threshold, determine that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data.
Specifically, in step S1105, when the access data sum is judged to reach the preset threshold, the access data corresponding to each pixel is determined to be valid access data, which means the access data was mostly produced by real users visiting the web page.
Step S1107: when the access data sum is judged not to reach the preset threshold, determine that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Specifically, in step S1107, when the access data sum is judged not to reach the preset threshold, the access data corresponding to each pixel is determined to be invalid access data, which means the access data was mostly not produced by real users visiting the web page.
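Mode one can be sketched as a single comparison. The names `judge_by_sum`, `TRUE_ACCESS`, and `UNTRUE_ACCESS` are illustrative assumptions, and "reaches the preset threshold" is read here as "greater than or equal to", which the text does not state explicitly.

```python
from typing import Iterable

TRUE_ACCESS = "true access data"
UNTRUE_ACCESS = "untrue access data"


def judge_by_sum(target_values: Iterable[float], threshold: float) -> str:
    """Compare the access data sum of the target pixels with a numeric threshold."""
    target_sum = sum(target_values)   # step S1101
    if target_sum >= threshold:       # step S1103 (">=" is an assumption)
        return TRUE_ACCESS            # step S1105
    return UNTRUE_ACCESS              # step S1107


# Example: all pixels together have 1000 clicks, so the threshold is 80% of 1000.
threshold = 0.8 * 1000
print(judge_by_sum([500, 350], threshold))  # true access data
print(judge_by_sum([300, 200], threshold))  # untrue access data
```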
Mode two includes steps S1109 to S11015, as follows:
Step S1109: calculate the sum of the access data corresponding to the target pixels. This step is the same as step S1101 above and is not described again.
Step S11011: judge whether the ratio of the access data sum to the total access data reaches the preset threshold, where the total access data is the sum of the access data corresponding to all the pixels.
Specifically, in this embodiment the preset threshold is a percentage or a decimal; for example, it can be set to 80% or 0.8.
Step S11013: when the ratio is judged to reach the preset threshold, determine that the monitoring result is true access data, where true access data indicates that the access data corresponding to each pixel is valid access data.
Specifically, in step S11013, when the ratio is judged to reach the preset threshold, the access data corresponding to each pixel is determined to be valid access data, which means the access data was mostly produced by real users visiting the web page.
Step S11015: when the ratio is judged not to reach the preset threshold, determine that the monitoring result is untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
Specifically, in step S11015, when the ratio is judged not to reach the preset threshold, the access data corresponding to each pixel is determined to be invalid access data, which means the access data was mostly not produced by real users visiting the web page.
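Mode two differs from mode one only in that the sum is divided by the total before the comparison. The sketch below also accepts the threshold as a percentage string or a decimal, since the text allows both forms. The helper names are assumptions, as is raising an error when the total is zero (the text does not cover that case).

```python
from typing import Iterable, Union

TRUE_ACCESS = "true access data"
UNTRUE_ACCESS = "untrue access data"


def parse_threshold(threshold: Union[str, float]) -> float:
    """Accept '80%' (percentage form) or 0.8 (decimal form)."""
    if isinstance(threshold, str) and threshold.strip().endswith("%"):
        return float(threshold.strip().rstrip("%")) / 100.0
    return float(threshold)


def judge_by_ratio(
    target_values: Iterable[float],
    all_values: Iterable[float],
    threshold: Union[str, float] = 0.8,
) -> str:
    total = sum(all_values)
    if total <= 0:
        # Assumption: with no access data at all, no ratio can be formed.
        raise ValueError("total access data must be positive")
    ratio = sum(target_values) / total          # steps S1109 and S11011
    if ratio >= parse_threshold(threshold):     # ">=" is an assumption
        return TRUE_ACCESS                      # step S11013
    return UNTRUE_ACCESS                        # step S11015


print(judge_by_ratio([500, 350], [500, 350, 100, 50], "80%"))  # true access data
print(judge_by_ratio([30, 20], [30, 20, 25, 25], 0.8))         # untrue access data
```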
It should be noted that either of the above two modes can be selected according to requirements to determine the monitoring result for the access data corresponding to each pixel, that is, the monitoring result for the access data of the web page.
If the preset threshold is set to 80% of the sum of the access data corresponding to all the pixels (mode one), or to 0.8 or 80% (mode two), the judgment amounts to checking whether the sum of the access data corresponding to the target pixels, compared with the sum of the access data corresponding to all the pixels, conforms to the Pareto principle.
Optionally, in the embodiments of the present application, sorting the access data and drawing a pixel curve from the pixels corresponding to the sorted access data includes: sorting the pixels from high to low according to their access data; and drawing the pixel curve based on the pixels sorted from high to low by access data. Judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution includes: judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a long-tail distribution.
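The text does not say how the long-tail judgment is made. Purely as an illustration, the sketch below uses a simple stand-in heuristic: a curve is treated as long-tailed when a large share of pixels lies below the mean access value, as happens when a few pixels carry very high values. Both the heuristic and the cutoff `min_share_below_mean` are assumptions made here, not the patent's test; a real implementation might instead fit a power-law or similar model.

```python
from statistics import mean
from typing import Sequence


def looks_long_tailed(
    values: Sequence[float], min_share_below_mean: float = 0.7
) -> bool:
    """Hypothetical heuristic: most pixels fall below the mean access value."""
    values = list(values)
    if len(values) < 2:
        return False
    average = mean(values)
    below = sum(1 for value in values if value < average)
    return below / len(values) >= min_share_below_mean


print(looks_long_tailed([1000, 400, 100, 20, 10, 5, 3, 2, 1, 1]))  # True
print(looks_long_tailed([10] * 10))                                # False
print(looks_long_tailed([10, 9, 8, 7, 6, 5, 4, 3, 2, 1]))          # False
```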
When the access data is click volume, the click volumes corresponding to the pixels of a web page are sorted from high to low, and a schematic diagram of the pixel curve drawn from the pixels so sorted is shown in Fig. 2. In Fig. 2, the x-axis represents pixels and the y-axis represents click volume, with the pixels arranged along the x-axis from high to low by click volume.
Optionally, in the embodiments of the present application, when the distribution is judged not to conform to the preset distribution, the monitoring result is determined to be untrue access data, where untrue access data indicates that the access data corresponding to each pixel is invalid access data.
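Putting steps S104 to S110 together with the branch just described, the overall flow can be sketched as below. The distribution test is passed in as a function because the text leaves its details open; the names, the default values of 20% and 0.8, and the treatment of an all-zero total as untrue access data are assumptions for illustration.

```python
from typing import Callable, Dict, Hashable, List

TRUE_ACCESS = "true access data"
UNTRUE_ACCESS = "untrue access data"


def monitor_access_data(
    access: Dict[Hashable, float],
    conforms_to_preset_distribution: Callable[[List[float]], bool],
    top_fraction: float = 0.2,
    ratio_threshold: float = 0.8,
) -> str:
    # Step S104: sort the access data from high to low (the pixel curve).
    curve = sorted(access.values(), reverse=True)
    # Step S106: a non-conforming distribution is reported as untrue access data.
    if not conforms_to_preset_distribution(curve):
        return UNTRUE_ACCESS
    # Step S108: target pixels are the top `top_fraction` of the curve.
    count = int(len(curve) * top_fraction + 1e-9)
    target_sum = sum(curve[:count])
    # Step S110 (mode two): compare the ratio with the preset threshold.
    total = sum(curve)
    if total > 0 and target_sum / total >= ratio_threshold:
        return TRUE_ACCESS
    return UNTRUE_ACCESS  # assumption: an all-zero total also lands here


# 20 heavily clicked pixels and 80 rarely clicked ones: 2000 / 2400 of all clicks.
concentrated = {i: (100 if i < 20 else 5) for i in range(100)}
uniform = {i: 10 for i in range(100)}
always = lambda curve: True  # stand-in for a real distribution test
print(monitor_access_data(concentrated, always))  # true access data
print(monitor_access_data(uniform, always))       # untrue access data
```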
Optionally, in the embodiments of the present application, obtaining the access data corresponding to each pixel on the target web page includes: obtaining from a database the access data corresponding to each pixel on the target web page that satisfies a preset condition, where the preset condition includes at least a preset time period.
That is, the access data corresponding to each pixel of the target web page under certain screening conditions is extracted from the database, where those screening conditions are the preset condition in the above embodiment.
Specifically, the database can be any database from which the access data corresponding to each pixel on the target web page can be obtained, for example a heat map database.
Optionally, in addition to the preset time period, the preset condition may also include a preset access source, a preset source type, and so on. The preset time period, preset access source, and preset source type can all be set according to requirements.
For example, when the preset condition includes only a preset time period, the preset time period is October 1, 2015 to October 31, 2015, and the target web page is the page of a commodity A (referred to as web page A), the access data corresponding to each pixel on web page A between October 1, 2015 and October 31, 2015 is obtained from the database.
For example, when the preset condition includes a preset time period and a preset access source, the preset time period is October 1, 2015 to October 31, 2015, the preset access source is Sina Weibo, and the target web page is the page of a commodity A (referred to as web page A), the access data corresponding to each pixel on web page A for visits arriving through Sina Weibo between October 1, 2015 and October 31, 2015 is obtained from the database.
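The two retrieval examples above can be sketched with Python's built-in `sqlite3` module. The table `pixel_access`, its columns, and the source label `sina_weibo` are hypothetical; the text does not describe the schema of the heat map database.

```python
import sqlite3
from typing import Dict, Optional, Tuple


def fetch_pixel_clicks(
    conn: sqlite3.Connection,
    page: str,
    start_date: str,
    end_date: str,
    source: Optional[str] = None,
) -> Dict[Tuple[int, int], int]:
    """Sum clicks per pixel for a page within a period, optionally for one source."""
    # Dates are ISO strings (YYYY-MM-DD), so string comparison matches date order.
    sql = (
        "SELECT x, y, SUM(clicks) FROM pixel_access "
        "WHERE page = ? AND visit_date BETWEEN ? AND ?"
    )
    params = [page, start_date, end_date]
    if source is not None:
        sql += " AND source = ?"
        params.append(source)
    sql += " GROUP BY x, y"
    return {(x, y): total for x, y, total in conn.execute(sql, params)}


conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE pixel_access "
    "(page TEXT, x INTEGER, y INTEGER, clicks INTEGER, visit_date TEXT, source TEXT)"
)
conn.executemany(
    "INSERT INTO pixel_access VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("A", 10, 20, 7, "2015-10-03", "sina_weibo"),
        ("A", 10, 20, 5, "2015-10-15", "search"),
        ("A", 30, 40, 2, "2015-10-20", "sina_weibo"),
        ("A", 10, 20, 9, "2015-11-02", "sina_weibo"),  # outside the period
    ],
)
print(fetch_pixel_clicks(conn, "A", "2015-10-01", "2015-10-31"))
print(fetch_pixel_clicks(conn, "A", "2015-10-01", "2015-10-31", source="sina_weibo"))
```

The first call corresponds to the time-period-only condition; the second adds the access source as a further filter.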
In the embodiments of the present application, the corresponding access data can be obtained according to user requirements and then monitored for authenticity, which improves user satisfaction.
Taking click volume as the access data, the scheme is explained as follows:
During the same webpage of a large amount of guest access, the degree of concern performance with other affairs is consistent, there is general character and individual character
Feature:The point of interest of i.e. most of visitors is similar, and the scope clicked on webpage also can be similar;But different visitors
Personality is incomplete same, always has differences, and is not excluded for indivedual new visitors and is unfamiliar with webpage the overdue of generation hitting
Situation, has the click in a small amount of other regions certainly.The click volume difference of general character and individual character has much, then depending on net
The layout and own characteristic of page.Dissected under paths with other with a period of time, each picture of certain page (that is, webpage)
The click volume of vegetarian refreshments is research object, there are following two features:
(1) click volume of each pixel is presented long-tail distribution on webpage, there is main body and long-tail two parts pixel;
(2) most of click volume is produced by a small amount of pixel, meets Pareto Law.
Specifically, long-tail depicts " small, and the only a few events of most events well respectively
Scale is quite big ".
So click volume etc. browses data (that is, accessing data) while symbol in the web page access behavior of a large amount of true visitors
Close Pareto Law and long-tail distribution, if it is, meet more than 2 points, from the entirety of the corresponding click volume of each pixel
From the point of view of distribution, most access behavior meets true visitor's behavior.
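Feature (2) can be checked numerically. The following Python sketch computes the share of all clicks contributed by the top fraction of pixels; the function name `top_share`, the 20% default, and the sample click counts are hypothetical and chosen only for illustration:

```python
from typing import Sequence


def top_share(clicks: Sequence[int], top_fraction: float = 0.2) -> float:
    """Share of all clicks produced by the top `top_fraction` of pixels."""
    ordered = sorted(clicks, reverse=True)
    total = sum(ordered)
    if total == 0:
        return 0.0
    # At least one pixel is always counted as the "head".
    k = max(1, int(len(ordered) * top_fraction))
    return sum(ordered[:k]) / total


# Ten pixels; the two busiest ones receive 800 of the 1000 clicks.
clicks = [500, 300, 40, 30, 30, 25, 25, 20, 15, 15]
share = top_share(clicks)
```

With this sample the top 20% of pixels account for 80% of the clicks, which is the kind of concentration the Pareto principle describes.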
As described above, the access behavior of non-real visitors differs to some extent from that of
real visitors. The scheme provided by the embodiments of the present application combines a
multi-dimensional segmentation function and uses the access data corresponding to each pixel on a
webpage to judge whether the access data of that webpage is true, that is, whether most of the
visitors to the webpage are non-real visitors, thereby obtaining the monitoring result for the
webpage.
According to an embodiment of the present application, a device for monitoring webpage access data
is also provided. The device is used to perform the monitoring method for webpage access data
described above, and is introduced in detail below:
Fig. 3 is a schematic diagram of a device for monitoring webpage access data according to an
embodiment of the present application. As shown in Fig. 3, the monitoring device mainly includes an
acquiring unit 31, a drawing unit 33, a judging unit 35, a first determining unit 37, and a second
determining unit 39, wherein:
The acquiring unit 31 is configured to obtain the access data corresponding to each pixel on the
target webpage.
Specifically, the target webpage can be any webpage whose access data is to be monitored, or
equivalently any webpage for which it is to be determined whether the access data is true.
The access data corresponding to each pixel can be any one of click volume, number of sessions, and
mouse dwell time, chosen as required.
The drawing unit 33 is configured to sort the access data and to draw a pixel curve from the pixels
corresponding to the sorted access data.
Specifically, the access data may be sorted from high to low or from low to high; this embodiment
does not limit the sort order.
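As a rough illustration of the sorting step, the sketch below orders the pixels by their access data and returns the ordered pairs that a plotting routine could draw as the pixel curve. The function name `pixel_curve` and the sample data are assumptions, and the actual drawing is left out:

```python
from typing import Dict, List, Tuple

Pixel = Tuple[int, int]


def pixel_curve(access_data: Dict[Pixel, int],
                descending: bool = True) -> List[Tuple[Pixel, int]]:
    """Return (pixel, value) pairs ordered by value.

    Either sort direction is allowed, matching the embodiment.
    """
    return sorted(access_data.items(), key=lambda kv: kv[1], reverse=descending)


access_data = {(10, 20): 120, (30, 40): 7, (55, 80): 450, (5, 5): 1}
curve = pixel_curve(access_data)
ranks = list(range(1, len(curve) + 1))   # x axis: rank of each pixel
values = [value for _, value in curve]   # y axis: access data at that rank
```

Plotting `values` against `ranks` gives the curve whose shape the judging unit examines.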
The judging unit 35 is configured to judge whether the distribution of the access data
corresponding to each pixel in the pixel curve conforms to a preset distribution.
The first determining unit 37 is configured to, when the distribution is judged to conform to the
preset distribution, determine the access data corresponding to the target pixels from the access
data corresponding to each pixel, where the target pixels are the pixels selected from all pixels
according to a preset screening condition.
Specifically, the preset screening condition can be set as required. For example, the preset
screening condition selects the pixels whose access data ranks in the top N among all pixels, that
is, the top N pixels when the pixels are sorted by their access data from high to low. The value of
N can be set as required, for example 20%. If N is 20%, the target pixels in step S108 above are
the pixels whose access data ranks in the top 20% of all pixels: if there are 100 pixels, there are
20 target pixels; if there are 200 pixels, there are 40 target pixels.
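The top-N screening condition can be sketched as follows. The function name `select_target_pixels`, the integer-percent parameter, and the rounding down of fractional counts are assumptions; the text only gives the 100-pixel and 200-pixel cases:

```python
from typing import Dict, Tuple

Pixel = Tuple[int, int]


def select_target_pixels(access_data: Dict[Pixel, int],
                         top_percent: int = 20) -> Dict[Pixel, int]:
    """Keep the pixels whose access data ranks in the top `top_percent` percent."""
    ordered = sorted(access_data.items(), key=lambda kv: kv[1], reverse=True)
    # Integer arithmetic avoids floating-point surprises; fractions round down.
    count = len(ordered) * top_percent // 100
    return dict(ordered[:count])


# Synthetic pages whose pixel values fall steadily from 1000.
page_100 = {(i, 0): 1000 - i for i in range(100)}
page_200 = {(i, 0): 1000 - i for i in range(200)}
targets_100 = select_target_pixels(page_100)   # 20 target pixels
targets_200 = select_target_pixels(page_200)   # 40 target pixels
```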
The second determining unit 39 is configured to determine the monitoring result of the access data
corresponding to each pixel according to the access data corresponding to the target pixels and a
preset threshold.
Specifically, there are two possible monitoring results: true access data and untrue access data.
If the monitoring result is true access data, most of the access behavior represented by the
webpage's access data is real visitor behavior, so most of the access data was produced by real
users visiting the webpage. Conversely, if the monitoring result is untrue access data, most of the
access behavior represented by the webpage's access data is not real visitor behavior, so most of
the access data was not produced by real users visiting the webpage.
The preset threshold can be set as required and may take the form of a percentage, a number, or a
decimal.
In addition, because the pixels are the pixels on the target webpage, the monitoring result of the
access data corresponding to each pixel is the monitoring result of the access data of the webpage
on which the pixels are located (that is, the target webpage).
In the embodiments of the present application, a pixel curve characterizing the correspondence
between pixels and access data is drawn, and it is judged whether the distribution of the access
data in the pixel curve is normal (that is, whether it conforms to a preset distribution such as
the long-tail distribution). When the distribution is judged to be normal, comparing the access
data corresponding to the target pixels with the preset threshold yields the monitoring result of
whether the access data corresponding to each pixel is true access data. This achieves the purpose
of monitoring, in a quantified and well-founded way, whether the access data of a webpage is true,
thereby achieving the technical effect of monitoring the authenticity of webpage access data and
solving the technical problem in the prior art that it cannot be determined whether access data is
true.
It should be noted that, for the access data of each target webpage, the monitoring result of
whether that access data is true can be obtained by invoking the acquiring unit, the drawing unit,
the judging unit, the first determining unit, and the second determining unit.
Optionally, the monitoring result of the access data of each pixel can be determined from the sum
of the access data of the target pixels and the preset threshold in either of two modes, as follows:
Mode one: the second determining unit includes a first computing module, a first judging module, a
first determining module, and a second determining module, wherein:
The first computing module is configured to calculate the access data sum of the access data
corresponding to the target pixels.
Specifically, there are usually multiple pixels selected from all pixels (that is, target pixels);
the first computing module calculates the sum of the access data corresponding to all of the target
pixels to obtain the access data sum.
The first judging module is configured to judge whether the ratio of the access data sum to the
total access data reaches the preset threshold, where the total access data is the sum of the
access data corresponding to each pixel.
Specifically, in this embodiment the preset threshold is in numeric form and may, for example, be
set to 80% of the sum of the access data corresponding to each pixel.
The first determining module is configured to determine, when the ratio is judged to reach the
preset threshold, that the monitoring result is true access data, where true access data
characterizes that the access data corresponding to each pixel is valid access data.
Specifically, when the access data sum is judged to reach the preset threshold, the first
determining module determines that the access data corresponding to each pixel is valid access
data, which indicates that most of the access data was produced by real users visiting the webpage.
The second determining module is configured to determine, when the ratio is judged not to reach the
preset threshold, that the monitoring result is untrue access data, where untrue access data
characterizes that the access data corresponding to each pixel is invalid access data.
Specifically, when the access data sum is judged not to reach the preset threshold, the second
determining module determines that the access data corresponding to each pixel is invalid access
data, which indicates that most of the access data was not produced by real users visiting the
webpage.
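The ratio-based judgment of mode one can be sketched as below. This is a minimal illustration under stated assumptions: the function name `monitor_by_ratio` and the result labels are hypothetical, "reaches" is read as greater than or equal to, the threshold is taken as the fraction 0.8, and an all-zero page is treated as untrue access data:

```python
from typing import Sequence


def monitor_by_ratio(target_values: Sequence[float],
                     all_values: Sequence[float],
                     threshold: float = 0.8) -> str:
    """Compare the target pixels' share of the total access data with a threshold."""
    total = sum(all_values)          # total access data over all pixels
    if total == 0:
        return "untrue access data"  # assumption: no data cannot be confirmed as true
    ratio = sum(target_values) / total
    return "true access data" if ratio >= threshold else "untrue access data"


all_values = [500, 300, 40, 30, 30, 25, 25, 20, 15, 15]
concentrated = monitor_by_ratio(all_values[:2], all_values)   # share 0.8
flat = monitor_by_ratio([10, 10], [10] * 10)                  # share 0.2
```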
Mode two: the second determining unit includes a second computing module, a third judging module, a
third determining module, and a fourth determining module, wherein:
The second computing module is configured to calculate the access data sum of the access data
corresponding to the target pixels.
The third judging module is configured to judge whether the access data sum reaches the preset
threshold.
Specifically, in this embodiment the preset threshold is in percentage or decimal form and may, for
example, be set to 80% or 0.8.
The third determining module is configured to determine, when the access data sum is judged to
reach the preset threshold, that the monitoring result is true access data, where true access data
characterizes that the access data corresponding to each pixel is valid access data.
Specifically, when the ratio is judged to reach the preset threshold, the third determining module
determines that the access data corresponding to each pixel is valid access data, which indicates
that most of the access data was produced by real users visiting the webpage.
The fourth determining module is configured to determine, when the access data sum is judged not to
reach the preset threshold, that the monitoring result is untrue access data, where untrue access
data characterizes that the access data corresponding to each pixel is invalid access data.
Specifically, when the ratio is judged not to reach the preset threshold, the fourth determining
module determines that the access data corresponding to each pixel is invalid access data, which
indicates that most of the access data was not produced by real users visiting the webpage.
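For comparison, a sum-based judgment might look like the sketch below. It assumes the threshold is expressed in the same unit as the access data (here, clicks) and is derived as 80% of the page total; the function name `monitor_by_sum`, the result labels, and that derivation are illustrative assumptions:

```python
from typing import Sequence


def monitor_by_sum(target_values: Sequence[int], threshold: int) -> str:
    """Compare the target pixels' access data sum directly with a threshold."""
    reached = sum(target_values) >= threshold
    return "true access data" if reached else "untrue access data"


all_values = [500, 300, 40, 30, 30, 25, 25, 20, 15, 15]
# Hypothetical absolute threshold: 80% of the 1000 clicks on the page.
threshold = sum(all_values) * 80 // 100
result = monitor_by_sum(all_values[:2], threshold)      # 800 clicks in the head
weak_head = monitor_by_sum([300, 40], threshold)        # only 340 clicks
```

When the absolute threshold is derived from the page total in this way, the comparison gives the same outcome as checking the target pixels' share against 0.8.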
It should be noted that either of the two modes above can be selected as required to determine the
monitoring result of the access data corresponding to each pixel, that is, the monitoring result of
the access data of the webpage.
Optionally, in the embodiments of the present application, the drawing unit includes: a sorting
module, configured to sort the pixels by access data from high to low; and a drafting module,
configured to draw the pixel curve from the pixels sorted by access data from high to low. The
judging unit includes: a second judging module, configured to judge whether the distribution of the
access data corresponding to each pixel in the pixel curve conforms to the long-tail distribution.
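The application does not state how conformity to the long-tail distribution is tested. As one possible stand-in, the sketch below treats a curve as long-tailed when most pixels fall below the mean value; both the heuristic and the 0.7 cutoff are assumptions made for illustration, not the claimed criterion:

```python
from typing import Sequence


def looks_long_tailed(values: Sequence[float], tail_fraction: float = 0.7) -> bool:
    """Heuristic stand-in for the long-tail check.

    Returns True when at least `tail_fraction` of the pixels have a value
    strictly below the mean, i.e. a small head dominates a long tail.
    """
    if not values:
        return False
    # Sorting mirrors the sorting module; the check itself ignores order.
    ordered = sorted(values, reverse=True)
    mean = sum(ordered) / len(ordered)
    below_mean = sum(1 for v in ordered if v < mean)
    return below_mean / len(ordered) >= tail_fraction


skewed = [500, 300, 40, 30, 30, 25, 25, 20, 15, 15]    # 8 of 10 below the mean of 100
linear = [100, 90, 80, 70, 60, 50, 40, 30, 20, 10]     # 5 of 10 below the mean of 55
```

A stricter implementation could fit a power law to the sorted values instead; the simple mean-based test is used here only because it is easy to verify by hand.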
Optionally, in the embodiments of the present application, the device further includes a fifth
determining module, configured to determine, when the distribution is judged not to conform to the
preset distribution, that the monitoring result is untrue access data, where untrue access data
characterizes that the access data corresponding to each pixel is invalid access data.
Optionally, in the embodiments of the present application, the acquiring unit includes an acquiring
module, configured to obtain from a database the access data corresponding to each pixel on the
target webpage that satisfies a preset condition, where the preset condition includes at least a
preset time period.
The acquiring module extracts from the database the access data corresponding to each pixel of the
target webpage under certain screening conditions, these screening conditions being the preset
condition in the above embodiment.
Optionally, in addition to the preset time period, the preset condition may also include a preset
source, a preset source type, and so on. The preset time period, preset source, and preset source
type can all be set as required.
In the embodiments of the present application, the corresponding access data can be obtained
according to the user's needs and then monitored for authenticity, which improves user satisfaction.
The device for monitoring webpage access data includes a processor and a memory. The acquiring
unit, drawing unit, judging unit, first determining unit, second determining unit, and so on are
stored in the memory as program units, and the processor executes the program units stored in the
memory.
The processor contains a kernel, which retrieves the corresponding program unit from the memory.
One or more kernels may be provided, and whether the access data of a webpage is true is determined
by adjusting kernel parameters.
The memory may include non-persistent memory in a computer-readable medium, random access memory
(RAM), and/or non-volatile memory such as read-only memory (ROM) or flash memory (flash RAM); the
memory includes at least one memory chip.
The present application also provides an embodiment of a computer program product which, when
executed on a data processing device, is adapted to execute program code that initializes the
following method steps: obtaining the access data corresponding to each pixel on a target webpage;
sorting the access data, and drawing a pixel curve from the pixels corresponding to the sorted
access data; judging whether the distribution of the access data corresponding to each pixel in the
pixel curve conforms to a preset distribution; when the distribution is judged to conform to the
preset distribution, determining the access data corresponding to target pixels from the access
data corresponding to each pixel, where the target pixels are the pixels selected from the pixels
according to a preset screening condition; and determining the monitoring result of the access data
corresponding to each pixel according to the access data corresponding to the target pixels and a
preset threshold.
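Taken together, the method steps can be sketched as a single function. This is an illustrative outline under stated assumptions, not the applicant's implementation: the names `monitor_page` and `mostly_below_mean`, the mean-based long-tail check, the top-20% screening condition, the 0.8 ratio threshold, and the choice to report a failed distribution check as untrue access data (as in the optional fifth determining module) are all assumptions:

```python
from typing import Callable, Dict, List, Tuple

Pixel = Tuple[int, int]
TRUE_DATA = "true access data"
UNTRUE_DATA = "untrue access data"


def mostly_below_mean(curve: List[float]) -> bool:
    """Hypothetical long-tail check: at least 70% of pixels lie below the mean."""
    mean = sum(curve) / len(curve)
    return sum(1 for v in curve if v < mean) / len(curve) >= 0.7


def monitor_page(access_data: Dict[Pixel, float],
                 conforms: Callable[[List[float]], bool] = mostly_below_mean,
                 top_percent: int = 20,
                 threshold: float = 0.8) -> str:
    # Step 1 (obtaining the data) is assumed done: access_data maps pixel -> value.
    # Step 2: sort high to low; the sorted values stand in for the pixel curve.
    curve = sorted(access_data.values(), reverse=True)
    total = sum(curve)
    # Step 3: distribution check; an empty or failing curve is reported as untrue.
    if total == 0 or not conforms(curve):
        return UNTRUE_DATA
    # Step 4: target pixels are the top `top_percent` percent of the curve.
    target = curve[: len(curve) * top_percent // 100]
    # Step 5: compare the target pixels' share of the total with the threshold.
    return TRUE_DATA if sum(target) / total >= threshold else UNTRUE_DATA


def page(values: List[float]) -> Dict[Pixel, float]:
    """Build a toy page with one pixel per value (helper for the examples)."""
    return {(i, 0): v for i, v in enumerate(values)}


concentrated = monitor_page(page([500, 300, 40, 30, 30, 25, 25, 20, 15, 15]))
uniform = monitor_page(page([10] * 10))
weak_head = monitor_page(page([200, 150] + [60] * 8))
```

In these examples `concentrated` passes both checks, `uniform` fails the distribution check, and `weak_head` passes the distribution check but its top pixels hold well under 80% of the total.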
In the above embodiments of the present application, the description of each embodiment has its own
emphasis. For parts not described in detail in one embodiment, refer to the related descriptions of
other embodiments.
In the several embodiments provided in the present application, it should be understood that the
disclosed technical content may be implemented in other ways. The device embodiments described
above are merely illustrative. For example, the division into units may be a division by logical
function, and other divisions are possible in actual implementation: multiple units or components
may be combined or integrated into another system, or some features may be omitted or not executed.
In addition, the mutual coupling, direct coupling, or communication connection shown or discussed
may be indirect coupling or communication connection through some interfaces, units, or modules,
and may be electrical or in other forms.
The units described as separate components may or may not be physically separate, and the
components shown as units may or may not be physical units; that is, they may be located in one
place or distributed over multiple units. Some or all of the units may be selected according to
actual needs to achieve the purpose of the solution of this embodiment.
In addition, the functional units in the embodiments of the present application may be integrated
into one processing unit, each unit may exist physically on its own, or two or more units may be
integrated into one unit. The integrated unit may be implemented in the form of hardware or in the
form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as
an independent product, it may be stored in a computer-readable storage medium. Based on this
understanding, the technical solution of the present application, in essence, or the part that
contributes to the prior art, or all or part of the technical solution, may be embodied in the form
of a software product. The computer software product is stored in a storage medium and includes
several instructions for causing a computer device (which may be a personal computer, a server, a
network device, or the like) to perform all or part of the steps of the methods described in the
embodiments of the present application. The aforementioned storage medium includes any medium that
can store program code, such as a USB flash drive, read-only memory (ROM, Read-Only Memory), random
access memory (RAM, Random Access Memory), a removable hard disk, a magnetic disk, or an optical
disc.
The above are only preferred embodiments of the present application. It should be noted that those
of ordinary skill in the art may make several improvements and refinements without departing from
the principles of the present application, and these improvements and refinements shall also fall
within the protection scope of the present application.
Claims (10)
1. A method for monitoring webpage access data, characterized by comprising:
obtaining access data corresponding to each pixel on a target webpage;
sorting the access data, and drawing a pixel curve from the pixels corresponding to the sorted access data;
judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution;
when the distribution is judged to conform to the preset distribution, determining access data corresponding to target pixels from the access data corresponding to each pixel, wherein the target pixels are pixels selected from the pixels according to a preset screening condition;
determining a monitoring result of the access data corresponding to each pixel according to the access data corresponding to the target pixels and a preset threshold.
2. The method according to claim 1, characterized in that determining the monitoring result of the access data corresponding to each pixel according to the access data corresponding to the target pixels and the preset threshold comprises:
calculating an access data sum of the access data corresponding to the target pixels;
judging whether the access data sum reaches the preset threshold;
when the access data sum is judged to reach the preset threshold, determining that the monitoring result is true access data, wherein the true access data characterizes that the access data corresponding to each pixel is valid access data;
when the access data sum is judged not to reach the preset threshold, determining that the monitoring result is untrue access data, wherein the untrue access data characterizes that the access data corresponding to each pixel is invalid access data.
3. The method according to claim 1, characterized in that determining the monitoring result of the access data corresponding to each pixel according to the access data corresponding to the target pixels and the preset threshold comprises:
calculating an access data sum of the access data corresponding to the target pixels;
judging whether the ratio of the access data sum to total access data reaches the preset threshold, wherein the total access data is the sum of the access data corresponding to each pixel;
when the ratio is judged to reach the preset threshold, determining that the monitoring result is true access data, wherein the true access data characterizes that the access data corresponding to each pixel is valid access data;
when the ratio is judged not to reach the preset threshold, determining that the monitoring result is untrue access data, wherein the untrue access data characterizes that the access data corresponding to each pixel is invalid access data.
4. The method according to claim 1, characterized in that sorting the access data and drawing a pixel curve from the pixels corresponding to the sorted access data comprises:
sorting the pixels by the access data from high to low;
drawing the pixel curve from the pixels sorted by access data from high to low;
and judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution comprises: judging whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a long-tail distribution.
5. The method according to claim 1, characterized in that, when the distribution is judged not to conform to the preset distribution, the monitoring result is determined to be untrue access data, wherein the untrue access data characterizes that the access data corresponding to each pixel is invalid access data.
6. The method according to claim 1, characterized in that obtaining the access data corresponding to each pixel on the target webpage comprises:
obtaining from a database the access data corresponding to each pixel on the target webpage that satisfies a preset condition, wherein the preset condition includes at least a preset time period.
7. The method according to claim 1, characterized in that the access data corresponding to each pixel includes any one of the following: click volume, number of sessions, and mouse dwell time.
8. A device for monitoring webpage access data, characterized by comprising:
an acquiring unit, configured to obtain access data corresponding to each pixel on a target webpage;
a drawing unit, configured to sort the access data and to draw a pixel curve from the pixels corresponding to the sorted access data;
a judging unit, configured to judge whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a preset distribution;
a first determining unit, configured to determine, when the distribution is judged to conform to the preset distribution, access data corresponding to target pixels from the access data corresponding to each pixel, wherein the target pixels are pixels selected from the pixels according to a preset screening condition;
a second determining unit, configured to determine a monitoring result of the access data corresponding to each pixel according to the access data corresponding to the target pixels and a preset threshold.
9. The device according to claim 8, characterized in that the second determining unit comprises:
a first computing module, configured to calculate an access data sum of the access data corresponding to the target pixels;
a first judging module, configured to judge whether the ratio of the access data sum to total access data reaches the preset threshold, wherein the total access data is the sum of the access data corresponding to each pixel;
a first determining module, configured to determine, when the ratio is judged to reach the preset threshold, that the monitoring result is true access data, wherein the true access data characterizes that the access data corresponding to each pixel is valid access data;
a second determining module, configured to determine, when the ratio is judged not to reach the preset threshold, that the monitoring result is untrue access data, wherein the untrue access data characterizes that the access data corresponding to each pixel is invalid access data.
10. The device according to claim 8, characterized in that:
the drawing unit comprises: a sorting module, configured to sort the pixels by the access data from high to low; and a drafting module, configured to draw the pixel curve from the pixels sorted by access data from high to low;
the judging unit comprises: a second judging module, configured to judge whether the distribution of the access data corresponding to each pixel in the pixel curve conforms to a long-tail distribution.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511032354.5A CN106933905B (en) | 2015-12-31 | 2015-12-31 | Method and device for monitoring webpage access data |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201511032354.5A CN106933905B (en) | 2015-12-31 | 2015-12-31 | Method and device for monitoring webpage access data |
Publications (2)
Publication Number | Publication Date |
---|---|
CN106933905A true CN106933905A (en) | 2017-07-07 |
CN106933905B CN106933905B (en) | 2019-12-24 |
Family
ID=59444716
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201511032354.5A Active CN106933905B (en) | 2015-12-31 | 2015-12-31 | Method and device for monitoring webpage access data |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN106933905B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109729054A (en) * | 2017-10-31 | 2019-05-07 | 阿里巴巴集团控股有限公司 | Access data monitoring method and relevant device |
CN109976985A (en) * | 2017-12-27 | 2019-07-05 | 北京国双科技有限公司 | A kind of method for drafting and device of thermodynamic chart |
CN111104559A (en) * | 2018-10-29 | 2020-05-05 | 百度在线网络技术(北京)有限公司 | Method and device for dividing distribution form of user data |
US20220391528A1 (en) * | 2020-02-20 | 2022-12-08 | Beijing Bytedance Network Technology Co., Ltd. | Online document display method and apparatus, device and medium |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101799830A (en) * | 2010-03-25 | 2010-08-11 | 北京国双科技有限公司 | Flow data processing method capable of realizing multi-dimensional free analysis |
CN102075352A (en) * | 2010-12-17 | 2011-05-25 | 北京邮电大学 | Method and device for predicting network user behavior |
CN102279786A (en) * | 2011-08-25 | 2011-12-14 | 百度在线网络技术(北京)有限公司 | Method and device for monitoring effective access amount of application program |
CN103559277A (en) * | 2013-11-06 | 2014-02-05 | 北京国双科技有限公司 | Data processing method and device for webpage page click quantity statistics |
CN103593415A (en) * | 2013-10-29 | 2014-02-19 | 北京国双科技有限公司 | Method and device for detecting cheating on visitor volumes of web pages |
CN104580447A (en) * | 2014-12-29 | 2015-04-29 | 中国科学院计算机网络信息中心 | Spatio-temporal data service scheduling method based on access heat |
CN105072089A (en) * | 2015-07-10 | 2015-11-18 | 中国科学院信息工程研究所 | WEB malicious scanning behavior abnormity detection method and system |
CN105119764A (en) * | 2015-09-29 | 2015-12-02 | 百度在线网络技术(北京)有限公司 | Method and device for monitoring flow |
Non-Patent Citations (1)
Title |
---|
王进 等: ""基于大偏差统计模型的Http-Flood DDoS 检测机制及性能分析"", 《软件学报》 * |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109729054A (en) * | 2017-10-31 | 2019-05-07 | 阿里巴巴集团控股有限公司 | Access data monitoring method and relevant device |
CN109729054B (en) * | 2017-10-31 | 2021-08-13 | 阿里巴巴集团控股有限公司 | Access data monitoring method and related equipment |
CN109976985A (en) * | 2017-12-27 | 2019-07-05 | 北京国双科技有限公司 | A kind of method for drafting and device of thermodynamic chart |
CN111104559A (en) * | 2018-10-29 | 2020-05-05 | 百度在线网络技术(北京)有限公司 | Method and device for dividing distribution form of user data |
US20220391528A1 (en) * | 2020-02-20 | 2022-12-08 | Beijing Bytedance Network Technology Co., Ltd. | Online document display method and apparatus, device and medium |
Also Published As
Publication number | Publication date |
---|---|
CN106933905B (en) | 2019-12-24 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107818344B (en) | Method and system for classifying and predicting user behaviors | |
CN102929939B (en) | The offer method and device of customized information | |
CN107730389A (en) | Electronic installation, insurance products recommend method and computer-readable recording medium | |
US11275748B2 (en) | Influence score of a social media domain | |
CN101493832A (en) | Website content combine recommendation system and method | |
CN106933905A (en) | The monitoring method and device of web page access data | |
CN106528777A (en) | Cross-screen user identification normalizing method and system | |
CN111400586A (en) | Group display method, terminal, server, system and storage medium | |
CN106708841A (en) | Website access path aggregation method and apparatus | |
CN105786965A (en) | URL-based user behavior analysis method and device | |
CN111782953A (en) | Recommendation method, device, equipment and storage medium | |
CN111738785A (en) | Product selection method, system and storage medium | |
CN112232933A (en) | House source information recommendation method, device, equipment and readable storage medium | |
CN111861605A (en) | Business object recommendation method | |
Shamsuzzoha et al. | The role of human capital on the performance of manufacturing firms in Bangladesh | |
CN103049497A (en) | Method and device for website navigation | |
Chiappini | Do overseas investments create or replace trade? New insights from a macro-sectoral study on Japan | |
JP2009289172A (en) | Conduct history analysis system and its method | |
CN105389714B (en) | Method for identifying user characteristics from behavior data | |
CN106372158A (en) | Method and device for processing user behavior data | |
CN108959289B (en) | Website category acquisition method and device | |
WO2015149550A1 (en) | Method and apparatus for determining grades of links within website | |
CN104636470A (en) | Method and device for recommending business information | |
Miniukovich et al. | Visual diversity and user interface quality | |
CN107436940A (en) | Method for dynamically displaying data on a web front end based on user profile behavior analysis |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
CB02 | Change of applicant information | Address after: 100083 No. 401, 4th Floor, Haitai Building, 229 North Fourth Ring Road, Haidian District, Beijing; Applicant after: Beijing Guoshuang Technology Co., Ltd.; Address before: 100086 Cuigong Hotel, 76 Zhichun Road, Shuangyushu District, Haidian District, Beijing; Applicant before: Beijing Guoshuang Technology Co., Ltd. |
GR01 | Patent grant | ||