CN111461545A - Method and device for determining machine access data - Google Patents

Method and device for determining machine access data Download PDF

Info

Publication number
CN111461545A
CN111461545A CN202010246446.8A CN202010246446A CN111461545A CN 111461545 A CN111461545 A CN 111461545A CN 202010246446 A CN202010246446 A CN 202010246446A CN 111461545 A CN111461545 A CN 111461545A
Authority
CN
China
Prior art keywords
access
determining
score
data
interactable
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010246446.8A
Other languages
Chinese (zh)
Other versions
CN111461545B (en
Inventor
陈铬亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shenyan Intelligent Technology Co ltd
Original Assignee
Beijing Shenyan Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shenyan Intelligent Technology Co ltd filed Critical Beijing Shenyan Intelligent Technology Co ltd
Priority to CN202010246446.8A priority Critical patent/CN111461545B/en
Publication of CN111461545A publication Critical patent/CN111461545A/en
Application granted granted Critical
Publication of CN111461545B publication Critical patent/CN111461545B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0242Determining effectiveness of advertisements
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The invention discloses a method and a device for determining machine access data. Wherein, the method comprises the following steps: counting access data for accessing the webpage in the current access, wherein the access data comprise a plurality of interactive elements of the accessed webpage; determining the score of the access corresponding to the access data; determining the score of the access address corresponding to the access according to the score of the access; and determining the access data of the access address as machine access data under the condition that the score of the access address is lower than the preset score. The invention solves the technical problem of large error of data statistical result caused by the false data of the machine accessing the webpage in the related technology.

Description

Method and device for determining machine access data
Technical Field
The invention relates to the field of data processing, in particular to a method and a device for determining machine access data.
Background
In the advertisement placement effectiveness statistical process, statistics and analysis of access data of advertisements are generally required. However, due to the diversity of market conditions, machine access data exists for the delivered advertisements, and the machine access data is counted as normal user access in the counting process, so that errors occur in the data processing and result analysis in the later period. Resulting in large errors between the results of the data statistics and the real situation.
In view of the above problems, no effective solution has been proposed.
Disclosure of Invention
The embodiment of the invention provides a method and a device for determining machine access data, which are used for at least solving the technical problem of large data statistical result error caused by false data of a machine access webpage in the related technology.
According to an aspect of an embodiment of the present invention, there is provided a method for determining machine access data, including: counting access data for accessing the webpage in the current access, wherein the access data comprise a plurality of interactive elements of the accessed webpage; determining the score of the current visit corresponding to the visit data; determining the score of the access address corresponding to the access according to the score of the access; and under the condition that the score of the access address is lower than a preset score, determining that the access data of the access address is machine access data.
Optionally, determining the score of the current visit corresponding to the visit data includes: determining weights for a plurality of interactable elements of the web page; and determining the score of the access according to the weights of the interactive elements.
Optionally, determining the score of the current visit according to the weights of the interactive elements includes: determining importance scores corresponding to the interactive elements according to the weights of the interactive elements and the operation times required for accessing the interactive elements; and taking the sum of the importance scores of the interactive elements accessed by the access as the score of the access.
Optionally, before determining the importance scores corresponding to the plurality of interactable elements according to the weights of the plurality of interactable elements and the number of operations required for accessing the interactable elements, the method includes: for each interactive element of the webpage, determining the operation times of the user for accessing the interactive element for multiple times respectively by simulating multiple accesses; and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required by the interactive elements.
Optionally, determining importance scores corresponding to the plurality of interactable elements according to the weights of the plurality of interactable elements and the number of operations required for accessing the interactable elements includes: determining whether the display area of the interactive elements on the webpage is smaller than a preset area; under the condition that the display area of the interactive element is not smaller than the preset area, determining the importance score of the interactive element according to the value of the weight of the interactive element and the required operation times; and under the condition that the display area of the interactive element is smaller than the preset area, determining the importance value of the interactive element according to the value of the weight of the interactive element, the required operation times and the magnification, wherein the magnification is the ratio of the preset area to the display area of the interactive element.
Optionally, the determining the importance score of the interactable element by using the value of the weight of the interactable element and the required operation times of the interactable element comprises: the importance score of the interactable element is equal to the product of the value of the weight and the square of the number of required operations; determining an importance score for the interactable element based on the value of the weight of the interactable element, the number of operations required with the interactable element, and the magnification factor comprises: the importance score of the interactable element is equal to the product of the value of the weight, the square of the number of required operations, and the magnification factor.
Optionally, determining the score of the access address corresponding to the access according to the score of the access includes: determining the time attenuation coefficient of the current visit according to the visit time and the current time of the current visit; and determining the score of the access address corresponding to the access according to the score of the access and the corresponding time attenuation coefficient.
Optionally, when the score of the access address is lower than a preset score, determining that the access data of the access address is the machine access data includes: determining whether the access accesses preset key interactive elements or not under the condition that the score of the access address of the webpage is lower than a preset score; and under the condition that the current access accesses the key interactive element, determining the access data of the access address as machine access data.
According to another aspect of the embodiments of the present invention, there is also provided an apparatus for determining machine access data, including: the statistical module is used for counting access data for accessing the webpage in the current access, wherein the access data comprise a plurality of interactive elements of the accessed webpage; the first determining module is used for determining the score of the current visit corresponding to the visit data; the second determining module is used for determining the score of the access address corresponding to the access according to the score of the access; and the third determining module is used for determining that the access data of the access address is machine access data under the condition that the score of the access address is lower than a preset score.
According to another aspect of the embodiments of the present invention, there is also provided a storage medium, where the storage medium includes a stored program, and when the program runs, a device in which the storage medium is located is controlled to execute any one of the above methods.
According to another aspect of the embodiments of the present invention, there is also provided a processor, configured to execute a program, where the program executes to perform the method described in any one of the above.
In the embodiment of the invention, access data for accessing the webpage in the current access is counted, wherein the access data comprises a plurality of interactive elements of the accessed webpage; determining the score of the access corresponding to the access data; determining the score of the access address corresponding to the access according to the score of the access; under the condition that the score of the access address is lower than the preset score, the access data of the access address is determined to be machine access data, the access is scored through a plurality of interactive elements of the webpage accessed at this time, so that the access address accessed at this time is scored, whether the access address is the machine access address or not is determined, the purpose of determining the machine access data is achieved, the technical effect of reducing data statistics errors caused by the machine access data is achieved, and the technical problem that the data statistics results are large in errors due to the fact that false data of the webpage accessed by the machine exist in the related art is solved.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, illustrate embodiment(s) of the invention and together with the description serve to explain the invention without limiting the invention. In the drawings:
FIG. 1 is a flow diagram of a method for determining machine access data according to an embodiment of the present invention;
fig. 2 is a schematic diagram of a device for determining machine access data according to an embodiment of the present invention.
Detailed Description
In order to make the technical solutions of the present invention better understood, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that the terms "first," "second," and the like in the description and claims of the present invention and in the drawings described above are used for distinguishing between similar elements and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used is interchangeable under appropriate circumstances such that the embodiments of the invention described herein are capable of operation in sequences other than those illustrated or described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed, but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
In accordance with an embodiment of the present invention, there is provided a method embodiment of a method for machine-access-data determination, it being noted that the steps illustrated in the flowchart of the figures may be performed in a computer system such as a set of computer-executable instructions and that, although a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order different than presented herein.
Fig. 1 is a flowchart of a method for detecting an electronic seal according to an embodiment of the present invention, as shown in fig. 1, the method includes the steps of:
step S102, counting access data for accessing the webpage in the current access, wherein the access data comprises a plurality of interactive elements of the accessed webpage;
step S104, determining the score of the current access corresponding to the access data;
step S106, determining the score of the access address corresponding to the access according to the score of the access;
and step S108, determining the access data of the access address as machine access data under the condition that the score of the access address is lower than the preset score.
Through the steps, access data for accessing the webpage in the current access are counted, wherein the access data comprise a plurality of interactive elements of the accessed webpage; determining the score of the access corresponding to the access data; determining the score of the access address corresponding to the access according to the score of the access; under the condition that the score of the access address is lower than the preset score, the access data of the access address is determined to be machine access data, the access is scored through a plurality of interactive elements of the webpage accessed at this time, so that the access address accessed at this time is scored, whether the access address is the machine access address or not is determined, the purpose of determining the machine access data is achieved, the technical effect of reducing data statistics errors caused by the machine access data is achieved, and the technical problem that the data statistics results are large in errors due to the fact that false data of the webpage accessed by the machine exist in the related art is solved.
The web page may include a plurality of interactive elements, and the interactive elements may be elements that are displayed on the web page and can be operated by a user, such as option boxes, buttons, links, and the like. The interactive elements can be displayed on pages of different depths of the webpage, the access can access a plurality of interactive elements, and whether the interactive elements accessed at this time have interactive elements with higher probability of being accessed by machine access data can be determined through the weight of the interactive elements in the webpage, so that whether the data accessed at this time are machine access data is determined.
The score of the access of this time corresponding to the access data can be determined through the weight of the interactive elements in the webpage, and in addition, whether the access behavior determined by accessing the interactive elements in the access of this time and the operation times is the access behavior of accessing the data by the machine can be determined through combining the operation times of the interactive elements.
The score of the access address of the current access is determined according to the score of the current access, and the machine access data is usually accessed through the virtual access address of the machine, so that the access data access method only determining single access effectively deletes the machine access data, and the repeated identification of the access data of the same virtual address of the machine is caused to cause low efficiency. Therefore, in this embodiment, the score of the corresponding access address is determined according to the score of the current access, so as to determine whether the access address is a machine virtual address corresponding to the machine access data.
And determining the access address as a machine virtual address of the machine access data under the condition that the score of the access address is lower than a preset score. The preset score may be an average value of scores of machine virtual addresses determined by the machine access data for a plurality of times, and may be used for scoring the machine access data determined in the historical access data and determining a score of the machine virtual address in the same scoring manner.
Optionally, determining the score of the current access corresponding to the access data includes: determining weights of a plurality of interactable elements of a web page; and determining the score of the access according to the weights of the interactive elements.
The webpage comprises a plurality of interactive elements, different interactive elements have different weights in machine access data, some interactive elements appear in the access process of a machine virtual address at high frequency, and some interactive elements appear in the access process of the machine virtual address infrequently, so that different weights exist when the scoring of the current access and the machine access data is judged according to different interactive elements, and the current access is scored according to the weights of the different interactive elements.
The weights of the interactive elements can be determined through machine access data determined in multiple times of historical access data, the interactive elements accessed by the machine access data at each time are respectively determined, and the weights of the interactive elements are determined according to the access times of the interactive elements. It should be noted that the weights of the interactive elements may be adjusted according to the newly determined machine access data, so that the weights of the interactive elements are more accurate, and the accuracy of determining the machine access data is improved.
Optionally, determining the score of the current visit according to the weights of the interactive elements includes: determining importance scores corresponding to the interactive elements according to the weights of the interactive elements and the operation times required for accessing the interactive elements; and taking the sum of the importance scores of the interactive elements accessed by the access as the score of the access.
As an optional implementation manner, when determining the importance scores of the interactable elements, the importance scores corresponding to the interactable elements may also be determined according to the weights of the interactable elements and the number of accesses required for accessing the interactable elements.
The operation times can be obtained by simulating multiple accesses to each interactive element of the webpage, and the operation times corresponding to the interactive elements accessed by the user for multiple times are determined; and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required by the interactive elements accessed.
Specifically, in this embodiment, according to the weights of the multiple interactive elements and the number of operations required to access the interactive elements, importance scores corresponding to the multiple interactive elements are determined, and it is possible to determine whether the display area of the interactive elements on the web page is smaller than a preset area; under the condition that the display area of the interactive elements is not smaller than the preset area, determining the importance scores of the interactive elements according to the weight values of the interactive elements and the required operation times; and under the condition that the display area of the interactive element is smaller than the preset area, determining the importance value of the interactive element according to the value of the weight of the interactive element, the required operation times and the magnification, wherein the magnification is the ratio of the preset area to the display area of the interactive element.
Because the display area of some interactive elements on the webpage is small, the interactive elements with small area are not easy to be found and used, for example, a return key of the mobile terminal is very small and inconspicuous, and therefore, the interactive elements can return to the previous interface through other gestures or modes most of the time. When the importance score of the interactive element with the smaller display area is determined, a magnification factor is provided for the interactive element, so that the interactive element is in the same level with the areas of other interactive elements.
In this embodiment, the magnification is a ratio of a preset area to a display area of the interactive element.
Optionally, determining the importance score of the interactable element by using the value of the weight of the interactable element and the required operation times of the interactable element comprises: the importance score of an interactable element is equal to the product of the value of the weight and the square of the number of required operations; determining the importance score of the interactable element based on the value of the weight of the interactable element, the number of operations required with the interactable element, and the magnification factor comprises: the importance score of an interactable element is equal to the product of the value of the weight, the square of the number of required operations, and the magnification factor.
Optionally, determining the score of the access address corresponding to the access according to the score of the access includes: determining a time attenuation coefficient of the current visit according to the visit time and the current time of the current visit; and determining the score of the access address corresponding to the access according to the score of the access and the corresponding time attenuation coefficient.
The access may be the latest access, or after a period of time after the access, when the machine access data is determined according to the access, because of the time, the influence of the data of the access on the current time is weakened, so that the influence attenuation degree of the access and the current time is described by a attenuation coefficient, and the longer the specific time is, the larger the influence attenuation degree is, the shorter the time is, and the smaller the influence attenuation degree is.
Optionally, in a case that the score of the access address is lower than the preset score, determining that the access data of the access address is the machine access data includes: determining whether the access accesses a preset key interactive element or not under the condition that the score of the access address of the webpage is lower than a preset score; and under the condition that the current access accesses the key interactive elements, determining the access data of the access address as machine access data.
In order to further improve the accuracy of determining the machine access data, under the condition that the score of the access address is lower than a preset score, whether the current access accesses a preset key interactive element is determined, wherein the key interactive element can be a plurality of interactive elements accessed by the machine access data at high frequency, so that whether the access address accessed at the current time is a machine virtual access address is determined, and the access data of the machine virtual access address are determined to be machine access data.
It should be noted that this embodiment also provides an alternative implementation, which is described in detail below.
1. The specific scheme of the embodiment is as follows:
(1) counting the operation times (one click/mouse wheel is one operation at the pc end, and one finger sliding is one operation at the mobile end) required for interacting with the contact (such as browsing pages/videos, clicking to view details and downloading files) from the beginning of entering the home page of the official website. The statistical method is that the user behavior is simulated for a plurality of times, and the average number is obtained;
(2) setting different importance weights for each contact according to business understanding;
(3) additionally, for contacts with an area less than a preset threshold, setting the area importance magnification of the contact as (preset area threshold/contact area);
(4) noting the importance of each contact as the square of the number of operations importance weight as the area importance magnification;
(5) for each visit id, the score of the visit (two consecutive operations within a certain time limit are regarded as the same visit) is recorded as sigma (the sum of the importance of all the contacts reached by the visit), wherein
Figure BDA0002434086740000071
Figure BDA0002434086740000072
(6) The score of each id is the maximum value of (the score of a certain visit of the id is a time attenuation coefficient), wherein the time attenuation coefficient is an index of the time from the present moment of the visit;
(7) based on a preset scoring threshold, access to certain specific contacts (e.g., a funded page) below the threshold is automatically made to the machine.
Fig. 2 is a schematic diagram of a device for determining machine access data according to an embodiment of the present invention, and as shown in fig. 2, according to another aspect of the embodiment of the present invention, there is also provided a device for determining machine access data, including: a statistics module 22, a first determination module 24, a second determination module 26, and a third determination module 28, which are described in detail below.
The statistical module 22 is configured to perform statistics on access data of the web page accessed in the current access, where the access data includes a plurality of interactable elements of the accessed web page; a first determining module 24, connected to the counting module 22, for determining a score of the current visit corresponding to the visit data; a second determining module 26, connected to the first determining module 24, for determining the score of the access address corresponding to the current access according to the score of the current access; and a third determining module 28, connected to the second determining module 26, for determining the access data of the access address as the machine access data if the score of the access address is lower than the preset score.
By the device, the statistical module 22 is adopted to count the access data of the webpage accessed in the current access, wherein the access data comprises a plurality of interactive elements of the accessed webpage; the first determining module 24 determines the score of the current access corresponding to the access data; the second determining module 26 determines the score of the access address corresponding to the access according to the score of the access; the third determining module 28 determines the access data of the access address as the machine access data when the score of the access address is lower than the preset score, scores the access through a plurality of interactive elements of the web page accessed this time, and thereby scores the access address accessed this time to determine whether the access address is the machine access address, thereby achieving the purpose of determining the machine access data, and further achieving the technical effect of reducing data statistics errors caused by the machine access data, and further solving the technical problem that the data statistics errors are large due to the existence of false data of the machine access web page in the related art.
According to another aspect of the embodiments of the present invention, there is also provided a storage medium including a stored program, wherein when the program runs, a device in which the storage medium is located is controlled to execute the method of any one of the above.
According to another aspect of the embodiments of the present invention, there is also provided a processor, configured to execute a program, where the program executes to perform the method of any one of the above.
The above-mentioned serial numbers of the embodiments of the present invention are merely for description and do not represent the merits of the embodiments.
In the above embodiments of the present invention, the descriptions of the respective embodiments have respective emphasis, and for parts that are not described in detail in a certain embodiment, reference may be made to related descriptions of other embodiments.
In the embodiments provided in the present application, it should be understood that the disclosed technology can be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units may be a logical division, and in actual implementation, there may be another division, for example, multiple units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection through some interfaces, units or modules, and may be in an electrical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments of the present invention may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit. The integrated unit can be realized in a form of hardware, and can also be realized in a form of a software functional unit.
The integrated unit, if implemented in the form of a software functional unit and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, a server, or a network device) to execute all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a removable hard disk, a magnetic disk, or an optical disk, which can store program codes.
The foregoing is only a preferred embodiment of the present invention, and it should be noted that, for those skilled in the art, various modifications and decorations can be made without departing from the principle of the present invention, and these modifications and decorations should also be regarded as the protection scope of the present invention.

Claims (11)

1. A method for determining machine access data, comprising:
counting access data for accessing the webpage in the current access, wherein the access data comprise a plurality of interactive elements of the accessed webpage;
determining the score of the current visit corresponding to the visit data;
determining the score of the access address corresponding to the access according to the score of the access;
and under the condition that the score of the access address is lower than a preset score, determining that the access data of the access address is machine access data.
2. The method of claim 1, wherein determining the score for the visit corresponding to the visit data comprises:
determining weights for a plurality of interactable elements of the web page;
and determining the score of the access according to the weights of the interactive elements.
3. The method of claim 2, wherein determining the score for the current visit based on the weights for the plurality of interactable elements comprises:
determining importance scores corresponding to the interactive elements according to the weights of the interactive elements and the operation times required for accessing the interactive elements;
and taking the sum of the importance scores of the interactive elements accessed by the access as the score of the access.
4. The method of claim 3, wherein determining the importance scores corresponding to the plurality of interactable elements based on the weights of the plurality of interactable elements and the number of operations required to access the interactable elements comprises:
for each interactive element of the webpage, determining the operation times of the user for accessing the interactive element for multiple times respectively by simulating multiple accesses;
and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required by the interactive elements.
5. The method of claim 3, wherein determining the importance scores corresponding to the plurality of interactable elements based on the weights of the plurality of interactable elements and the number of operations required to access the interactable elements comprises:
determining whether the display area of the interactive elements on the webpage is smaller than a preset area;
under the condition that the display area of the interactive element is not smaller than the preset area, determining the importance score of the interactive element according to the value of the weight of the interactive element and the required operation times;
and under the condition that the display area of the interactive element is smaller than the preset area, determining the importance value of the interactive element according to the value of the weight of the interactive element, the required operation times and the magnification, wherein the magnification is the ratio of the preset area to the display area of the interactive element.
6. The method of claim 5, wherein determining the importance score of the interactable element based on the value of the weight of the interactable element and the number of operations required with the interactable element comprises:
the importance score of the interactable element is equal to the product of the value of the weight and the square of the number of required operations;
determining an importance score for the interactable element based on the value of the weight of the interactable element, the number of operations required with the interactable element, and the magnification factor comprises:
the importance score of the interactable element is equal to the product of the value of the weight, the square of the number of required operations, and the magnification factor.
7. The method of claim 1, wherein determining the score of the access address corresponding to the current access according to the score of the current access comprises:
determining the time attenuation coefficient of the current visit according to the visit time and the current time of the current visit;
and determining the score of the access address corresponding to the access according to the score of the access and the corresponding time attenuation coefficient.
8. The method of claim 1, wherein determining that the access data for the access address is machine access data if the access address scores below a preset score comprises:
determining whether the access accesses preset key interactive elements or not under the condition that the score of the access address of the webpage is lower than a preset score;
and under the condition that the current access accesses the key interactive element, determining the access data of the access address as machine access data.
9. An apparatus for determining machine access data, comprising:
the statistical module is used for counting access data for accessing the webpage in the current access, wherein the access data comprise a plurality of interactive elements of the accessed webpage;
the first determining module is used for determining the score of the current visit corresponding to the visit data;
the second determining module is used for determining the score of the access address corresponding to the access according to the score of the access;
and the third determining module is used for determining that the access data of the access address is machine access data under the condition that the score of the access address is lower than a preset score.
10. A storage medium, comprising a stored program, wherein the program, when executed, controls an apparatus in which the storage medium is located to perform the method of any one of claims 1 to 8.
11. A processor, characterized in that the processor is configured to run a program, wherein the program when running performs the method of any of claims 1 to 8.
CN202010246446.8A 2020-03-31 2020-03-31 Method and device for determining machine access data Active CN111461545B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010246446.8A CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010246446.8A CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Publications (2)

Publication Number Publication Date
CN111461545A true CN111461545A (en) 2020-07-28
CN111461545B CN111461545B (en) 2023-11-10

Family

ID=71682435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010246446.8A Active CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Country Status (1)

Country Link
CN (1) CN111461545B (en)

Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6356899B1 (en) * 1998-08-29 2002-03-12 International Business Machines Corporation Method for interactively creating an information database including preferred information elements, such as preferred-authority, world wide web pages
US20100076910A1 (en) * 2008-09-25 2010-03-25 Microsoft Corporation Calculating web page importance based on web behavior model
US20140090009A1 (en) * 2012-09-27 2014-03-27 Hong Li Secure data container for web applications
CN103853839A (en) * 2014-03-18 2014-06-11 北京博雅立方科技有限公司 Method and device for evaluating advertisement page malicious click suspected degree
US20160019310A1 (en) * 2013-06-27 2016-01-21 Tencent Technology (Shenzhen) Co., Ltd. Method and apparatus for rendering statistics on web page visits by a browser
CN105491054A (en) * 2015-12-22 2016-04-13 网易(杭州)网络有限公司 Method and apparatus for determining malicious access, and method and apparatus for intercepting malicious access
CN105808639A (en) * 2016-02-24 2016-07-27 平安科技(深圳)有限公司 Network access behavior recognizing method and device
CN106506451A (en) * 2016-09-30 2017-03-15 百度在线网络技术(北京)有限公司 The processing method and processing device of malicious access
US20170195331A1 (en) * 2015-12-31 2017-07-06 General Electric Company Identity management and device enrollment in a cloud service
US20170331855A1 (en) * 2016-05-13 2017-11-16 International Business Machines Corporation Detection and warning of imposter web sites
US20180124073A1 (en) * 2016-10-31 2018-05-03 Microsoft Technology Licensing, Llc Network attack detection
CN107995152A (en) * 2016-10-27 2018-05-04 腾讯科技(深圳)有限公司 A kind of malicious access detection method, device and detection service device
CN108334273A (en) * 2018-02-09 2018-07-27 网易(杭州)网络有限公司 Method for information display and device, storage medium, processor, terminal
CN109711123A (en) * 2018-11-21 2019-05-03 武汉极意网络科技有限公司 Behavioral value method and device based on simulation browser detection
US20190173905A1 (en) * 2016-08-08 2019-06-06 Alibaba Group Holding Limited Method and apparatus for identifying fake traffic
CN110401660A (en) * 2019-07-26 2019-11-01 秒针信息技术有限公司 Recognition methods, device, processing equipment and the storage medium of false flow
CN110442230A (en) * 2018-05-04 2019-11-12 脸谱科技有限责任公司 Prevent the user interface in reality environment from blocking
CN110609937A (en) * 2019-08-15 2019-12-24 平安科技(深圳)有限公司 Crawler identification method and device
WO2020019484A1 (en) * 2018-07-27 2020-01-30 平安科技(深圳)有限公司 Simulator recognition method, recognition device, and computer readable medium
CN110889745A (en) * 2019-11-22 2020-03-17 无线生活(北京)信息技术有限公司 Method and device for intelligently identifying robbery behavior

Patent Citations (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6356899B1 (en) * 1998-08-29 2002-03-12 International Business Machines Corporation Method for interactively creating an information database including preferred information elements, such as preferred-authority, world wide web pages
US20100076910A1 (en) * 2008-09-25 2010-03-25 Microsoft Corporation Calculating web page importance based on web behavior model
US20140090009A1 (en) * 2012-09-27 2014-03-27 Hong Li Secure data container for web applications
US20160019310A1 (en) * 2013-06-27 2016-01-21 Tencent Technology (Shenzhen) Co., Ltd. Method and apparatus for rendering statistics on web page visits by a browser
CN103853839A (en) * 2014-03-18 2014-06-11 北京博雅立方科技有限公司 Method and device for evaluating advertisement page malicious click suspected degree
CN105491054A (en) * 2015-12-22 2016-04-13 网易(杭州)网络有限公司 Method and apparatus for determining malicious access, and method and apparatus for intercepting malicious access
US20170195331A1 (en) * 2015-12-31 2017-07-06 General Electric Company Identity management and device enrollment in a cloud service
CN105808639A (en) * 2016-02-24 2016-07-27 平安科技(深圳)有限公司 Network access behavior recognizing method and device
US20170331855A1 (en) * 2016-05-13 2017-11-16 International Business Machines Corporation Detection and warning of imposter web sites
US20190173905A1 (en) * 2016-08-08 2019-06-06 Alibaba Group Holding Limited Method and apparatus for identifying fake traffic
CN106506451A (en) * 2016-09-30 2017-03-15 百度在线网络技术(北京)有限公司 The processing method and processing device of malicious access
CN107995152A (en) * 2016-10-27 2018-05-04 腾讯科技(深圳)有限公司 A kind of malicious access detection method, device and detection service device
US20180124073A1 (en) * 2016-10-31 2018-05-03 Microsoft Technology Licensing, Llc Network attack detection
CN108334273A (en) * 2018-02-09 2018-07-27 网易(杭州)网络有限公司 Method for information display and device, storage medium, processor, terminal
CN110442230A (en) * 2018-05-04 2019-11-12 脸谱科技有限责任公司 Prevent the user interface in reality environment from blocking
WO2020019484A1 (en) * 2018-07-27 2020-01-30 平安科技(深圳)有限公司 Simulator recognition method, recognition device, and computer readable medium
CN109711123A (en) * 2018-11-21 2019-05-03 武汉极意网络科技有限公司 Behavioral value method and device based on simulation browser detection
CN110401660A (en) * 2019-07-26 2019-11-01 秒针信息技术有限公司 Recognition methods, device, processing equipment and the storage medium of false flow
CN110609937A (en) * 2019-08-15 2019-12-24 平安科技(深圳)有限公司 Crawler identification method and device
CN110889745A (en) * 2019-11-22 2020-03-17 无线生活(北京)信息技术有限公司 Method and device for intelligently identifying robbery behavior

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
GAYATHRI SHIVARAJ ET AL.: "Using Hidden Markov Model to detect rogue access points", SECURITY AND COMMUNICATION NETWORKS, vol. 3 *
丘海澜;文翰;肖南峰;: "基于访问日志的网页内容监控挖掘系统", 计算机工程, no. 04 *
李雯;: "电子商务平台中流量统计模块的设计研究", 硅谷, no. 20 *
王建;张仰森;陈若愚;蒋玉茹;尤建清;: "网络用户角色辨识及其恶意访问行为的发现方法", 计算机科学, no. 10 *

Also Published As

Publication number Publication date
CN111461545B (en) 2023-11-10

Similar Documents

Publication Publication Date Title
CN109145934B (en) User behavior data processing method, medium, equipment and device based on log
CN106897284B (en) Recommendation method and device for electronic books
WO2019056721A1 (en) Information pushing method, electronic device and computer storage medium
CN104899220B (en) Application program recommendation method and system
CN110381151B (en) Abnormal equipment detection method and device
CN102693229B (en) Software analysis method, recommend method, analytical equipment and recommendation apparatus
CN107786545A (en) A kind of attack detection method and terminal device
CN108876464B (en) Cheating behavior detection method and device, service equipment and storage medium
CN101685521A (en) Method for showing advertisements in webpage and system
CN113412607B (en) Content pushing method and device, mobile terminal and storage medium
CN106874165A (en) Page detection method and device
CN103761228A (en) Ranking threshold determination method and ranking threshold determination system for application program
CN106168968A (en) A kind of Website classification method and device
WO2015014260A1 (en) Data processing method and server therefor
CN106933905B (en) Method and device for monitoring webpage access data
CN107135199B (en) Method and device for detecting webpage backdoor
CN111461545B (en) Method and device for determining machine access data
CN107679883A (en) The method and system of advertisement generation
CN114187037A (en) Information pushing method and device and nonvolatile storage medium
CN115470399A (en) ID (identity) communication method, device, equipment and storage medium based on big data
CN113315670A (en) Network flow analysis method, device and storage medium
CN105653645B (en) Network information attention degree evaluation method and device
CN113139102A (en) Data processing method, data processing device, nonvolatile storage medium and processor
CN106874299A (en) Page detection method and device
CN111563769B (en) Data processing method, device, nonvolatile storage medium and processor

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant