CN111461545B - Method and device for determining machine access data - Google Patents

Method and device for determining machine access data Download PDF

Info

Publication number
CN111461545B
CN111461545B CN202010246446.8A CN202010246446A CN111461545B CN 111461545 B CN111461545 B CN 111461545B CN 202010246446 A CN202010246446 A CN 202010246446A CN 111461545 B CN111461545 B CN 111461545B
Authority
CN
China
Prior art keywords
access
determining
score
interactable
current
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010246446.8A
Other languages
Chinese (zh)
Other versions
CN111461545A (en
Inventor
陈铬亮
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Shenyan Intelligent Technology Co ltd
Original Assignee
Beijing Shenyan Intelligent Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Shenyan Intelligent Technology Co ltd filed Critical Beijing Shenyan Intelligent Technology Co ltd
Priority to CN202010246446.8A priority Critical patent/CN111461545B/en
Publication of CN111461545A publication Critical patent/CN111461545A/en
Application granted granted Critical
Publication of CN111461545B publication Critical patent/CN111461545B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/06Resources, workflows, human or project management; Enterprise or organisation planning; Enterprise or organisation modelling
    • G06Q10/063Operations research, analysis or management
    • G06Q10/0639Performance analysis of employees; Performance analysis of enterprise or organisation operations
    • G06Q10/06393Score-carding, benchmarking or key performance indicator [KPI] analysis
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2458Special types of queries, e.g. statistical queries, fuzzy queries or distributed queries
    • G06F16/2462Approximate or statistical queries
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0242Determining effectiveness of advertisements
    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Abstract

The application discloses a method and a device for determining machine access data. Wherein the method comprises the following steps: counting access data for accessing the webpage in the current access, wherein the access data comprises a plurality of interactable elements of the accessed webpage; determining the score of the current visit corresponding to the visit data; determining the score of the access address corresponding to the current access according to the score of the current access; and under the condition that the score of the access address is lower than a preset score, determining the access data of the access address as machine access data. The application solves the technical problem of large error of data statistics results caused by false data of machine access web pages in the related technology.

Description

Method and device for determining machine access data
Technical Field
The present application relates to the field of data processing, and in particular, to a method and apparatus for determining machine access data.
Background
In the advertisement placement effectiveness statistics process, statistics and analysis of advertisement access data are generally required. However, due to the diversity of market conditions, machine access data exist for advertisements to be put in, and the machine access data are counted as normal user access in the counting process, so that errors occur in later data processing and result analysis. Resulting in a large error between the result of the data statistics and the real situation.
In view of the above problems, no effective solution has been proposed at present.
Disclosure of Invention
The embodiment of the application provides a method and a device for determining machine access data, which are used for at least solving the technical problem that the error of a data statistics result is large because false data of a machine access webpage exist in the related technology.
According to an aspect of an embodiment of the present application, there is provided a method for determining machine access data, including: counting access data for accessing a webpage in the current access, wherein the access data comprises a plurality of interactable elements of the accessed webpage; determining the score of the current visit corresponding to the visit data; determining the score of the access address corresponding to the current access according to the score of the current access; and under the condition that the score of the access address is lower than a preset score, determining the access data of the access address as machine access data.
Optionally, determining the score of the current access corresponding to the access data includes: determining weights of a plurality of interactable elements of the web page; and determining the scores of the current visit according to the weights of the interactable elements.
Optionally, determining the score of the current access according to the weights of the plurality of interactable elements includes: determining importance scores corresponding to the interactable elements according to the weights of the interactable elements and the operation times required for accessing the interactable elements; and taking the sum of importance scores of the interactive elements accessed by the current access as the score of the current access.
Optionally, before determining importance scores corresponding to the interactable elements according to weights of the interactable elements and the operation times required for accessing the interactable elements, the method includes: determining the operation times of users for accessing each interactive element of the webpage for multiple times by simulating multiple accesses; and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required for accessing the interactive elements.
Optionally, determining importance scores corresponding to the interactable elements according to weights of the interactable elements and the operation times required for accessing the interactable elements comprises: determining whether the display area of the interactable element on the webpage is smaller than a preset area; under the condition that the display area of the interactable element is not smaller than the preset area, determining an importance score of the interactable element by the value of the weight of the interactable element and the required operation times; and under the condition that the display area of the interactable element is smaller than the preset area, determining the importance score of the interactable element according to the value of the weight of the interactable element, the required operation times and the amplification factor, wherein the amplification factor is the ratio of the preset area to the display area of the interactable element.
Optionally, determining the importance score of the interactable element by combining the value of the weight of the interactable element with the number of operations required for the interactable element comprises: the importance score of the interactable element is equal to the product of the value of the weight and the square of the number of operations required; determining an importance score for the interactable element based upon the value of the weight for the interactable element, the number of operations required with the interactable element, and the magnification factor, comprises: the importance score of the interactable element is equal to the product of the value of the weight, the square of the number of operations required, and the magnification.
Optionally, determining the score of the access address corresponding to the current access according to the score of the current access includes: determining a time attenuation coefficient of the current access according to the access time and the current time of the current access; and determining the score of the access address corresponding to the current access through the score of the current access and the corresponding time attenuation coefficient.
Optionally, determining that the access data of the access address is machine access data if the score of the access address is lower than a preset score includes: determining whether the access accesses a preset key interactable element or not under the condition that the score of the access address is lower than a preset score; and under the condition that the key interactable element is accessed by the current access, determining the access data of the access address to be machine access data.
According to another aspect of the embodiment of the present application, there is also provided a device for determining machine access data, including: the system comprises a statistics module, a processing module and a processing module, wherein the statistics module is used for counting access data for accessing a webpage in the current access, and the access data comprises a plurality of interactable elements of the accessed webpage; the first determining module is used for determining the score of the current visit corresponding to the visit data; the second determining module is used for determining the score of the access address corresponding to the current access according to the score of the current access; and the third determining module is used for determining that the access data of the access address is machine access data under the condition that the score of the access address is lower than a preset score.
According to another aspect of the embodiments of the present application, there is further provided a storage medium including a stored program, where the program, when executed, controls a device in which the storage medium is located to perform any one of the methods described above.
According to another aspect of the embodiment of the present application, there is also provided a processor, where the processor is configured to execute a program, where the program executes any one of the methods described above.
In the embodiment of the application, the access data for carrying out the access to the webpage in the current access is counted, wherein the access data comprises a plurality of interactable elements of the accessed webpage; determining the score of the current visit corresponding to the visit data; determining the score of the access address corresponding to the current access according to the score of the current access; under the condition that the score of the access address is lower than the preset score, determining that the access data of the access address is the machine access data, scoring the access through a plurality of interactable elements of the webpage accessed at this time, and accordingly scoring the access address accessed at this time to determine whether the access address is the machine access address, thereby achieving the purpose of determining the machine access data, achieving the technical effect of reducing data statistics errors caused by the machine access data, and further solving the technical problem that false data of the webpage accessed by a machine exist in related technologies, and the data statistics result errors are large.
Drawings
The accompanying drawings, which are included to provide a further understanding of the application and are incorporated in and constitute a part of this specification, illustrate embodiments of the application and together with the description serve to explain the application and do not constitute a limitation on the application. In the drawings:
FIG. 1 is a flow chart of a method of determining machine access data according to an embodiment of the present application;
fig. 2 is a schematic diagram of a device for determining machine access data according to an embodiment of the present application.
Detailed Description
In order that those skilled in the art will better understand the present application, a technical solution in the embodiments of the present application will be clearly and completely described below with reference to the accompanying drawings in which it is apparent that the described embodiments are only some embodiments of the present application, not all embodiments. All other embodiments, which can be made by those skilled in the art based on the embodiments of the present application without making any inventive effort, shall fall within the scope of the present application.
It should be noted that the terms "first," "second," and the like in the description and the claims of the present application and the above figures are used for distinguishing between similar objects and not necessarily for describing a particular sequential or chronological order. It is to be understood that the data so used may be interchanged where appropriate such that the embodiments of the application described herein may be implemented in sequences other than those illustrated or otherwise described herein. Furthermore, the terms "comprises," "comprising," and "having," and any variations thereof, are intended to cover a non-exclusive inclusion, such that a process, method, system, article, or apparatus that comprises a list of steps or elements is not necessarily limited to those steps or elements expressly listed but may include other steps or elements not expressly listed or inherent to such process, method, article, or apparatus.
According to an embodiment of the present application, there is provided a method embodiment of a method for determining machine access data, it should be noted that the steps illustrated in the flowchart of the drawings may be performed in a computer system such as a set of computer executable instructions, and that although a logical order is illustrated in the flowchart, in some cases the steps illustrated or described may be performed in an order other than that illustrated herein.
Fig. 1 is a flowchart of a method for detecting an electronic seal according to an embodiment of the present application, as shown in fig. 1, the method includes the following steps:
step S102, counting access data for accessing the webpage in the current access, wherein the access data comprises a plurality of interactable elements of the accessed webpage;
step S104, determining the score of the current visit corresponding to the visit data;
step S106, determining the score of the access address corresponding to the current access according to the score of the current access;
step S108, determining the access data of the access address as machine access data under the condition that the score of the access address is lower than a preset score.
Through the steps, the access data for carrying out the access to the webpage in the current access is counted, wherein the access data comprises a plurality of interactable elements of the accessed webpage; determining the score of the current visit corresponding to the visit data; determining the score of the access address corresponding to the current access according to the score of the current access; under the condition that the score of the access address is lower than the preset score, determining that the access data of the access address is the machine access data, scoring the access through a plurality of interactable elements of the webpage accessed at this time, and accordingly scoring the access address accessed at this time to determine whether the access address is the machine access address, thereby achieving the purpose of determining the machine access data, achieving the technical effect of reducing data statistics errors caused by the machine access data, and further solving the technical problem that false data of the webpage accessed by a machine exist in related technologies, and the data statistics result errors are large.
The web page may include a plurality of interactable elements, which may be elements displayed on the web page that a user may perform operations, such as boxes, buttons, links, and the like. The interactive elements can be displayed on pages with different depths of the webpage, the access can access a plurality of interactive elements, and whether the interactive elements accessed at the time have the interactive elements with larger probability for machine access data access can be determined through the weights of the interactive elements in the webpage, so that whether the data accessed at the time are machine access data is determined.
The score of the current access corresponding to the determined access data can be determined by the weight of the interactable element in the webpage, and in addition, the operation times of the interactable element can be combined to determine whether the access behavior determined by accessing a plurality of interactable elements and the operation times in the current access is the access behavior of the machine access data.
The score of the access address of the current access is determined according to the score of the current access, and the machine access data is usually accessed through the virtual access address of the machine, so that only the access data access method for determining single access effectively deletes the machine access data, and the repeated identification of the access data of the same virtual address of the machine can cause inefficiency. In this embodiment, the score of the access address is determined according to the score of the access, so as to determine whether the access address is a machine virtual address corresponding to the machine access data.
And under the condition that the score of the access address is lower than a preset score, determining the access address as a machine virtual address of the machine access data. The preset score may score the accesses of the machine access data determined in the historical access data, and determine the score of the machine virtual address according to the same scoring mode, where the preset score may be an average value of scores of machine virtual addresses determined by multiple machine access data.
Optionally, determining the score of the current access corresponding to the access data includes: determining weights of a plurality of interactable elements of the web page; and determining the score of the current visit according to the weights of the interactive elements.
The web page comprises a plurality of interactable elements, the weights of different interactable elements in machine access data are different, some interactable elements can appear at high frequency in the process of accessing a machine virtual address, and some interactable elements can appear infrequently in the process of accessing the machine virtual address, so that the different interactable elements have different weights when judging the scoring of the current access and the machine access data, and the scoring is carried out on the current access according to the weights of the different interactable elements.
The weights of the interactable elements can be determined by machine access data determined in the historical access data for a plurality of times, the interactable elements accessed by the machine access data each time are respectively determined, and the weights of the interactable elements are determined according to the access times of the interactable elements. It should be noted that, the weight of the interactable element may be adjusted according to the newly determined machine access data, so that the weight of the interactable element is more accurate, thereby improving the accuracy of determining the machine access data.
Optionally, determining the score of the current access according to the weights of the plurality of interactable elements includes: determining importance scores corresponding to the interactable elements according to the weights of the interactable elements and the operation times required for accessing the interactable elements; and taking the sum of importance scores of the interactive elements accessed by the current visit as the score of the current visit.
In an alternative implementation manner, when determining the importance scores of the interactable elements, the importance scores corresponding to the interactable elements may be determined according to the weights of the interactable elements and the number of accesses required to access the interactable elements.
The operation times can be determined by simulating multiple accesses to each interactive element of the webpage, and the operation times respectively corresponding to the interactive elements accessed by the user for multiple times; and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required for accessing the interactive elements.
Specifically, in this embodiment, according to weights of a plurality of interactable elements and the number of operations required to access the interactable elements, importance scores corresponding to the interactable elements may be determined by determining whether a display area of the interactable elements on a web page is smaller than a preset area; under the condition that the display area of the interactable element is not smaller than the preset area, determining the importance score of the interactable element by the value of the weight of the interactable element and the required operation times; and under the condition that the display area of the interactable element is smaller than the preset area, determining the importance score of the interactable element according to the value of the weight of the interactable element, the required operation times and the amplification factor, wherein the amplification factor is the ratio of the preset area to the display area of the interactable element.
Because some interactable elements have smaller display areas on the web page, considering that the interactable elements with small areas are not easy to be found and used, for example, the return key of the mobile terminal is very small and unobtrusive, so that most of the time can return to the previous interface through other gestures or modes. And when the importance scores of the interactable elements with smaller display areas are determined, providing a magnification factor for the interactable elements so that the interactable elements are in the same level with the areas of other interactable elements.
In this embodiment, the magnification is a ratio of a preset area to a display area of the interactable element.
Optionally, determining the importance score of the interactable element from the value of the weight of the interactable element and the number of operations required with the interactable element comprises: the importance score of the interactable element is equal to the product of the value of the weight and the square of the number of operations required; determining the importance score of the interactable element based upon the value of the weight of the interactable element, the number of operations required with the interactable element, and the magnification factor comprises: the importance score of an interactable element is equal to the product of the value of the weight, the square of the number of operations required, and the magnification.
Optionally, determining the score of the access address corresponding to the current access according to the score of the current access includes: determining a time attenuation coefficient of the current access according to the access time and the current time of the current access; and determining the score of the access address corresponding to the current access through the score of the current access and the corresponding time attenuation coefficient.
The above access may be the last access, or after a period of time after the access, when determining that the machine accesses data according to the access, the influence of the data of the access on the current time is weakened due to the time, so that the influence attenuation degree of the access and the current time is described by an attenuation coefficient, the longer the specific time, the larger the influence attenuation degree, the shorter the time, and the influence attenuation degree is smaller.
Optionally, in the case that the score of the access address is lower than the preset score, determining that the access data of the access address is machine access data includes: determining whether the access accesses the preset key interactable elements or not under the condition that the score of the access address is lower than a preset score; and under the condition that the key interactable elements are accessed in the current access, determining the access data of the access address as machine access data.
In order to further improve accuracy of determining machine access data, under the condition that the score of the access address is lower than a preset score, determining whether the access is performed on preset key interactable elements, wherein the key interactable elements can be a plurality of interactable elements accessed by the machine access data at high frequency, so as to determine whether the access address of the access is a machine virtual access address, and determine that the access data of the machine virtual access address are all machine access data.
It should be noted that this embodiment also provides an alternative implementation, and this implementation is described in detail below.
1. The specific scheme of the embodiment is as follows:
(1) For each page and interactable element (hereinafter collectively referred to as a contact) of the official network, counting the operation times (pc end, one click/mouse wheel is one operation, and one finger sliding is one operation for the mobile end) required for interaction with the contact (such as browsing pages/videos, clicking to view details and downloading files) from entering the home page of the official network. The statistical mode is that the user behavior is simulated for a plurality of times, and an average is taken;
(2) Setting different importance weights for each contact according to service understanding;
(3) Additionally, contacts with areas smaller than a preset threshold are provided with area importance magnification of (preset area threshold/contact area);
(4) Record the importance of each contact = square of number of operations × importance weight × area importance magnification;
(5) For each visit id, note its score = σ (sum of the importance of all contacts touched by this visit) for this visit (defining that two consecutive operations are within a certain time limit, considered to be the same visit), where
(6) A maximum value of score = (the score of a certain visit of the id is time decay coefficient), wherein the time decay coefficient is an index of the visit time to the present time;
(7) According to a preset scoring threshold, a machine is automatically accessed when certain specific contacts (such as a funding page) are accessed below the threshold.
Fig. 2 is a schematic diagram of a device for determining machine access data according to an embodiment of the present application, and as shown in fig. 2, according to another aspect of an embodiment of the present application, there is further provided a device for determining machine access data, including: the statistics module 22, the first determination module 24, the second determination module 26, and the third determination module 28 are described in detail below.
A statistics module 22, configured to count access data for accessing the web page in the current access, where the access data includes a plurality of interactable elements of the accessed web page; the first determining module 24 is connected to the statistics module 22, and is configured to determine a score of the current access corresponding to the access data; the second determining module 26 is connected to the first determining module 24, and is configured to determine a score of the access address corresponding to the current access according to the score of the current access; and a third determining module 28, connected to the second determining module 26, for determining that the access data of the access address is machine access data if the score of the access address is lower than the preset score.
By the device, the statistics module 22 is adopted to count the access data of the web page in the current access, wherein the access data comprises a plurality of interactable elements of the accessed web page; the first determining module 24 determines a score of the current access corresponding to the access data; the second determining module 26 determines the score of the access address corresponding to the current access according to the score of the current access; the third determining module 28 determines that the access data of the access address is the machine access data when the score of the access address is lower than the preset score, and scores the access through a plurality of interactable elements of the web page accessed at this time, so as to score the access address accessed at this time to determine whether the access address is the machine access address, thereby achieving the purpose of determining the machine access data, achieving the technical effect of reducing the data statistics error caused by the machine access data, and further solving the technical problem that the error of the data statistics result is large because of the false data of the web page accessed by the machine in the related art.
According to another aspect of the embodiments of the present application, there is also provided a storage medium, including a stored program, where the program, when executed, controls a device in which the storage medium is located to perform any one of the methods described above.
According to another aspect of the embodiment of the present application, there is also provided a processor, configured to execute a program, where the program executes the method of any one of the above.
The foregoing embodiment numbers of the present application are merely for the purpose of description, and do not represent the advantages or disadvantages of the embodiments.
In the foregoing embodiments of the present application, the descriptions of the embodiments are emphasized, and for a portion of this disclosure that is not described in detail in this embodiment, reference is made to the related descriptions of other embodiments.
In the several embodiments provided in the present application, it should be understood that the disclosed technology may be implemented in other manners. The above-described embodiments of the apparatus are merely exemplary, and the division of the units, for example, may be a logic function division, and may be implemented in another manner, for example, a plurality of units or components may be combined or may be integrated into another system, or some features may be omitted, or not performed. Alternatively, the coupling or direct coupling or communication connection shown or discussed with each other may be through some interfaces, units or modules, or may be in electrical or other forms.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution of this embodiment.
In addition, each functional unit in the embodiments of the present application may be integrated in one processing unit, or each unit may exist alone physically, or two or more units may be integrated in one unit. The integrated units may be implemented in hardware or in software functional units.
The integrated units, if implemented in the form of software functional units and sold or used as stand-alone products, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present application may be embodied essentially or in part or all of the technical solution or in part in the form of a software product stored in a storage medium, including instructions for causing a computer device (which may be a personal computer, a server, or a network device, etc.) to perform all or part of the steps of the method according to the embodiments of the present application. And the aforementioned storage medium includes: a U-disk, a Read-Only Memory (ROM), a random access Memory (RAM, random Access Memory), a removable hard disk, a magnetic disk, or an optical disk, or other various media capable of storing program codes.
The foregoing is merely a preferred embodiment of the present application and it should be noted that modifications and adaptations to those skilled in the art may be made without departing from the principles of the present application, which are intended to be comprehended within the scope of the present application.

Claims (10)

1. A method of determining machine access data, comprising:
counting access data for accessing a webpage in the current access, wherein the access data comprises a plurality of interactable elements of the accessed webpage;
determining the score of the current visit corresponding to the visit data;
determining the score of the access address corresponding to the current access according to the score of the current access;
under the condition that the score of the access address is lower than a preset score, determining the access data of the access address as machine access data;
wherein determining the score of the current access corresponding to the access data includes: determining whether the display area of the interactable element on the webpage is smaller than a preset area;
under the condition that the display area of the interactable element is not smaller than the preset area, determining the importance score of the interactable element by combining the value of the weight of the interactable element with the required operation times;
and under the condition that the display area of the interactable element is smaller than the preset area, determining an importance score of the interactable element according to the value of the weight of the interactable element, the required operation times and the amplification factor, and determining the score of the current visit based on the importance score, wherein the amplification factor is the ratio of the preset area to the display area of the interactable element.
2. The method of claim 1, wherein determining the score of the current visit to which the visit data corresponds comprises:
determining weights of a plurality of interactable elements of the web page;
and determining the scores of the current visit according to the weights of the interactable elements.
3. The method of claim 2, wherein determining the score for the current visit based on the weights of a plurality of the interactable elements comprises:
determining importance scores corresponding to the interactable elements according to the weights of the interactable elements and the operation times required for accessing the interactable elements;
and taking the sum of importance scores of the interactive elements accessed by the current access as the score of the current access.
4. A method according to claim 3, wherein before determining importance scores corresponding to a plurality of said interactable elements based upon weights of said interactable elements and a number of operations required to access said interactable elements, comprising:
determining the operation times of users for accessing each interactive element of the webpage for multiple times by simulating multiple accesses;
and averaging the operation times corresponding to the interactive elements accessed by the user for multiple times, and determining the operation times required for accessing the interactive elements.
5. The method of claim 1, wherein determining an importance score for the interactable element by combining a value of the weight of the interactable element with a number of operations required for the interactable element comprises:
the importance score of the interactable element is equal to the product of the value of the weight and the square of the number of operations required;
determining an importance score for the interactable element based upon the value of the weight for the interactable element, the number of operations required with the interactable element, and the magnification factor, comprises:
the importance score of the interactable element is equal to the product of the value of the weight, the square of the number of operations required, and the magnification.
6. The method of claim 1, wherein determining the score of the access address corresponding to the current access based on the score of the current access comprises:
determining a time attenuation coefficient of the current access according to the access time and the current time of the current access;
and determining the score of the access address corresponding to the current access through the score of the current access and the corresponding time attenuation coefficient.
7. The method of claim 1, wherein determining that the access data of the access address is machine access data if the score of the access address is below a preset score comprises:
determining whether the access accesses a preset key interactable element or not under the condition that the score of the access address is lower than a preset score;
and under the condition that the key interactable element is accessed by the current access, determining the access data of the access address to be machine access data.
8. A device for determining machine access data, comprising:
the system comprises a statistics module, a processing module and a processing module, wherein the statistics module is used for counting access data for accessing a webpage in the current access, and the access data comprises a plurality of interactable elements of the accessed webpage;
the first determining module is used for determining the score of the current visit corresponding to the visit data;
the second determining module is used for determining the score of the access address corresponding to the current access according to the score of the current access;
the third determining module is used for determining that the access data of the access address is machine access data under the condition that the score of the access address is lower than a preset score;
the first determining module is further configured to determine whether a display area of the interactable element on the webpage is smaller than a preset area; under the condition that the display area of the interactable element is not smaller than the preset area, determining the importance score of the interactable element by combining the value of the weight of the interactable element with the required operation times; and under the condition that the display area of the interactable element is smaller than the preset area, determining an importance score of the interactable element according to the value of the weight of the interactable element, the required operation times and the amplification factor, and determining the score of the current visit based on the importance score, wherein the amplification factor is the ratio of the preset area to the display area of the interactable element.
9. A storage medium comprising a stored program, wherein the program, when run, controls a device in which the storage medium is located to perform the method of any one of claims 1 to 7.
10. A processor for running a program, wherein the program when run performs the method of any one of claims 1 to 7.
CN202010246446.8A 2020-03-31 2020-03-31 Method and device for determining machine access data Active CN111461545B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010246446.8A CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010246446.8A CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Publications (2)

Publication Number Publication Date
CN111461545A CN111461545A (en) 2020-07-28
CN111461545B true CN111461545B (en) 2023-11-10

Family

ID=71682435

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010246446.8A Active CN111461545B (en) 2020-03-31 2020-03-31 Method and device for determining machine access data

Country Status (1)

Country Link
CN (1) CN111461545B (en)

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6356899B1 (en) * 1998-08-29 2002-03-12 International Business Machines Corporation Method for interactively creating an information database including preferred information elements, such as preferred-authority, world wide web pages
CN103853839A (en) * 2014-03-18 2014-06-11 北京博雅立方科技有限公司 Method and device for evaluating advertisement page malicious click suspected degree
CN105491054A (en) * 2015-12-22 2016-04-13 网易(杭州)网络有限公司 Method and apparatus for determining malicious access, and method and apparatus for intercepting malicious access
CN105808639A (en) * 2016-02-24 2016-07-27 平安科技(深圳)有限公司 Network access behavior recognizing method and device
CN106506451A (en) * 2016-09-30 2017-03-15 百度在线网络技术(北京)有限公司 The processing method and processing device of malicious access
CN107995152A (en) * 2016-10-27 2018-05-04 腾讯科技(深圳)有限公司 A kind of malicious access detection method, device and detection service device
CN108334273A (en) * 2018-02-09 2018-07-27 网易(杭州)网络有限公司 Method for information display and device, storage medium, processor, terminal
CN109711123A (en) * 2018-11-21 2019-05-03 武汉极意网络科技有限公司 Behavioral value method and device based on simulation browser detection
CN110401660A (en) * 2019-07-26 2019-11-01 秒针信息技术有限公司 Recognition methods, device, processing equipment and the storage medium of false flow
CN110442230A (en) * 2018-05-04 2019-11-12 脸谱科技有限责任公司 Prevent the user interface in reality environment from blocking
CN110609937A (en) * 2019-08-15 2019-12-24 平安科技(深圳)有限公司 Crawler identification method and device
WO2020019484A1 (en) * 2018-07-27 2020-01-30 平安科技(深圳)有限公司 Simulator recognition method, recognition device, and computer readable medium
CN110889745A (en) * 2019-11-22 2020-03-17 无线生活(北京)信息技术有限公司 Method and device for intelligently identifying robbery behavior

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8103599B2 (en) * 2008-09-25 2012-01-24 Microsoft Corporation Calculating web page importance based on web behavior model
US9245144B2 (en) * 2012-09-27 2016-01-26 Intel Corporation Secure data container for web applications
CN104252348B (en) * 2013-06-27 2018-07-20 腾讯科技(深圳)有限公司 A kind of web page access statistical method and device based on browser
US10156842B2 (en) * 2015-12-31 2018-12-18 General Electric Company Device enrollment in a cloud service using an authenticated application
US10505979B2 (en) * 2016-05-13 2019-12-10 International Business Machines Corporation Detection and warning of imposter web sites
CN107707509B (en) * 2016-08-08 2020-09-29 阿里巴巴集团控股有限公司 Method, device and system for identifying and assisting in identifying false traffic
US10581915B2 (en) * 2016-10-31 2020-03-03 Microsoft Technology Licensing, Llc Network attack detection

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6356899B1 (en) * 1998-08-29 2002-03-12 International Business Machines Corporation Method for interactively creating an information database including preferred information elements, such as preferred-authority, world wide web pages
CN103853839A (en) * 2014-03-18 2014-06-11 北京博雅立方科技有限公司 Method and device for evaluating advertisement page malicious click suspected degree
CN105491054A (en) * 2015-12-22 2016-04-13 网易(杭州)网络有限公司 Method and apparatus for determining malicious access, and method and apparatus for intercepting malicious access
CN105808639A (en) * 2016-02-24 2016-07-27 平安科技(深圳)有限公司 Network access behavior recognizing method and device
CN106506451A (en) * 2016-09-30 2017-03-15 百度在线网络技术(北京)有限公司 The processing method and processing device of malicious access
CN107995152A (en) * 2016-10-27 2018-05-04 腾讯科技(深圳)有限公司 A kind of malicious access detection method, device and detection service device
CN108334273A (en) * 2018-02-09 2018-07-27 网易(杭州)网络有限公司 Method for information display and device, storage medium, processor, terminal
CN110442230A (en) * 2018-05-04 2019-11-12 脸谱科技有限责任公司 Prevent the user interface in reality environment from blocking
WO2020019484A1 (en) * 2018-07-27 2020-01-30 平安科技(深圳)有限公司 Simulator recognition method, recognition device, and computer readable medium
CN109711123A (en) * 2018-11-21 2019-05-03 武汉极意网络科技有限公司 Behavioral value method and device based on simulation browser detection
CN110401660A (en) * 2019-07-26 2019-11-01 秒针信息技术有限公司 Recognition methods, device, processing equipment and the storage medium of false flow
CN110609937A (en) * 2019-08-15 2019-12-24 平安科技(深圳)有限公司 Crawler identification method and device
CN110889745A (en) * 2019-11-22 2020-03-17 无线生活(北京)信息技术有限公司 Method and device for intelligently identifying robbery behavior

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
Using Hidden Markov Model to detect rogue access points;Gayathri Shivaraj et al.;SECURITY AND COMMUNICATION NETWORKS;第3卷;全文 *
基于访问日志的网页内容监控挖掘系统;丘海澜;文翰;肖南峰;;计算机工程(04);全文 *
电子商务平台中流量统计模块的设计研究;李雯;;硅谷(20);全文 *
网络用户角色辨识及其恶意访问行为的发现方法;王建;张仰森;陈若愚;蒋玉茹;尤建清;;计算机科学(10);全文 *

Also Published As

Publication number Publication date
CN111461545A (en) 2020-07-28

Similar Documents

Publication Publication Date Title
CN109145934B (en) User behavior data processing method, medium, equipment and device based on log
CN104836781B (en) Distinguish the method and device for accessing user identity
WO2019056721A1 (en) Information pushing method, electronic device and computer storage medium
CN103905532B (en) The recognition methods of microblogging marketing account and system
CN107786545A (en) A kind of attack detection method and terminal device
CN108876464B (en) Cheating behavior detection method and device, service equipment and storage medium
CN108112038B (en) Method and device for controlling access flow
CN109831454B (en) False traffic identification method and device
CN108829769B (en) Suspicious group discovery method and device
CN113412607B (en) Content pushing method and device, mobile terminal and storage medium
Ikram et al. Measuring, characterizing, and detecting Facebook like farms
CN113779481B (en) Method, device, equipment and storage medium for identifying fraud websites
CN106874165A (en) Page detection method and device
CN106168968A (en) A kind of Website classification method and device
CN106910135A (en) User recommends method and device
CN106933905B (en) Method and device for monitoring webpage access data
CN111461545B (en) Method and device for determining machine access data
CN106257449A (en) A kind of information determines method and apparatus
CN110618797B (en) Method and device for generating character trotting horse lamp and terminal equipment
CN116932549A (en) Intelligent model-based platform data storage method, system, medium and equipment
CN113315670B (en) Network flow analysis method, device and storage medium
CN105761107A (en) Method for acquiring target new users in internet products and device thereof
CN105653645B (en) Network information attention degree evaluation method and device
CN114187037A (en) Information pushing method and device and nonvolatile storage medium
CN106874299A (en) Page detection method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant