CN111310044A - Method, device, equipment and storage medium for extracting page element information - Google Patents

Method, device, equipment and storage medium for extracting page element information Download PDF

Info

Publication number
CN111310044A
CN111310044A CN202010093390.7A CN202010093390A CN111310044A CN 111310044 A CN111310044 A CN 111310044A CN 202010093390 A CN202010093390 A CN 202010093390A CN 111310044 A CN111310044 A CN 111310044A
Authority
CN
China
Prior art keywords
element information
page
candidate
extracting
triggered
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010093390.7A
Other languages
Chinese (zh)
Other versions
CN111310044B (en
Inventor
王晓元
陈鑫
周振宇
叶峻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010093390.7A priority Critical patent/CN111310044B/en
Publication of CN111310044A publication Critical patent/CN111310044A/en
Application granted granted Critical
Publication of CN111310044B publication Critical patent/CN111310044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application discloses a method, a device, equipment and a storage medium for extracting page element information, and relates to the technical field of data mining. The specific implementation scheme is as follows: acquiring an element information set triggered by a user interaction event on a target page; and extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element. The method and the device extract the element information which is oriented to the service and can clearly reflect the user behavior semantics so as to improve the application value of the element information to product and operation optimization.

Description

Method, device, equipment and storage medium for extracting page element information
Technical Field
The application relates to computer technology, in particular to the technical field of data mining.
Background
The embedded point is a common data analysis method for website analysis, and some program codes are added in a website or an application program to collect browsing, access data and application use conditions of a user in the website or the application program and analyze user interaction behaviors, so that subsequent optimization of products and operation is facilitated.
In the prior art, the user interaction behavior is generally analyzed by collecting element information triggered by the user interaction behavior. With the gradually enriched services of websites or application programs, more and more element information needs to be collected, and the user behaviors reflected by the element information are disordered, so that the application value of product and operation optimization is lower and lower.
Disclosure of Invention
The embodiment of the application provides a method, a device, equipment and a storage medium for extracting page element information, so as to extract element information which has a useful value for product and operation optimization.
In a first aspect, an embodiment of the present application provides a method for extracting page element information, including:
acquiring an element information set triggered by a user interaction event on a target page;
and extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element.
In the embodiment of the application, all user behavior data are acquired by collecting an element information set triggered by a user interaction event on a target page; target element information is extracted from the element information set according to attributes of the elements, which are associated with the page service, and the number index of triggered times of the elements, so that the element information which is oriented to the service and can clearly reflect the user behavior semantics is extracted, and the application value of the element information to product and operation optimization is improved.
Optionally, the attribute of the element associated with the page service includes: at least one of a click type event bound by the element, a button type of the element, a link function of the element, an element content length smaller than a set value, a landing page of the element and a promotion page of the element;
the number of times the element is triggered index includes: conversion and/or click-through of the element.
In an optional implementation manner in the above application, the service attribute of the element can be clearly embodied by a click event bound to the element, a button type of the element, a link function of the element, a content length of the element smaller than a set value, a landing page of the element, and a promotion page of the element, so that target element information extracted based on the attributes is element information for a page service; the conversion rate and/or click rate of the elements reflect the focusing behavior of the user on the elements, reflect definite behavior semantics and are beneficial to accurately extracting target element information.
Optionally, extracting target element information from the element information set according to an attribute of the element associated with the page service and a number index of times that the element is triggered, where the extracting includes:
extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service;
and extracting target element information from the candidate element information set according to the number index of triggered elements.
In an optional implementation manner in the above application, the attribute may be directly obtained from the element information set, and the number index needs to be obtained through calculation, the element information is preferentially extracted according to the attribute, and then secondary extraction is performed according to the number index, so that the target element information can be quickly extracted, and the calculation amount can be reduced.
Optionally, the extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service includes:
extracting element information triggered by a click event from the element information set;
and extracting at least one of the element information of the button type, the element information of the link type, the element information of which the content length is smaller than a set value, the element information positioned on the landing page and the element information positioned on the promotion page from the element information triggered by the clicked event to form a candidate element information set.
In an alternative embodiment of the above application, a click-type event may reflect the focus of the user on the element better than other interaction events; the element information is preferentially selected according to the types of the click events, and compared with other attributes, the available element information can be selected in the largest range; and then, candidate elements are extracted according to the buttons, the links, the content length and the located pages of the elements, so that the extraction efficiency of the element information is improved.
Optionally, extracting target element information from the candidate element information set according to the number of times that the element is triggered includes:
calculating the conversion rate and/or click rate of each candidate element;
and determining the candidate element information with the conversion rate within a first preset range and/or the candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
In an optional implementation manner in the above application, the conversion rate and the click rate can significantly reflect the focusing behavior of the user on the element, and have definite behavior semantics; the numerical values of the conversion rate and the click rate can reflect the focusing degree of the user on the elements and also reflect the strength of behavior semantics. Based on the method, the conversion rate and/or click rate in the preset range can be selected according to the business needs, and the target element information with different semantic strengths is extracted.
Optionally, calculating the conversion rate, and/or click rate of each candidate element includes:
acquiring a plurality of sessions generated in the interaction process of a user and a target page, wherein each session comprises element information triggered by a click event in a preset time period;
calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of the sessions; and/or calculating the click rate of each candidate element according to the number of the conversations containing the information of each candidate element and the display times of the information of each candidate element on the target page.
In an optional implementation manner in the above application, the conversion rate and the click rate are calculated by taking the session as the granularity and by whether the user clicks the candidate element within a preset time period, an error caused by repeatedly clicking the candidate element by the user within a short time is avoided, and the accuracy of the conversion rate and the click rate is improved.
Optionally, before determining, from the candidate element information set, the element information of which the conversion rate is within a first preset range and/or the element information of which the click rate is within a second preset range as the target element information, the method further includes: and deleting the element information of which the conversion rate exceeds a first preset threshold value and/or the element information of which the click rate exceeds a second preset threshold value in the candidate element information set.
In an optional implementation manner in the above application, the higher index of times is generally an element touched conventionally, and is oriented to basic services of a page, such as a login button, a save button, and the like; the information of the elements hardly reflects the personalized behavior semantics of the user; by deleting the candidate element information with high index of times, the element information reflecting the personalized behavior semantics of the user can be intensively extracted.
Optionally, after extracting target element information from the element information set according to the attribute of the element associated with the page service and the number index of times the element is triggered, the method further includes:
and embedding points in the page to be embedded according to the target element information.
In an optional implementation manner in the above application, according to the target element information, the page to be buried is buried to accurately extract the element information which is service-oriented and can clearly reflect the user behavior semantics on the page to be buried, and the point burying method is suitable for a primary user or a user of a non-professional analyst, has a low technical threshold, starts from an element information set, fills a gap between the element information set and the user-set buried point, and produces a buried point recommendation result with service significance.
In a second aspect, an embodiment of the present application further provides an apparatus for extracting page element information, including:
the acquisition module is used for acquiring an element information set triggered by a user interaction event on a target page;
and the extracting module is used for extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element.
In a third aspect, an embodiment of the present application further provides an electronic device, including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor, and the instructions are executed by the at least one processor to enable the at least one processor to execute a method for extracting page element information as provided in an embodiment of the first aspect.
In a fourth aspect, embodiments of the present application further provide a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the method for extracting page element information provided in the first aspect.
Other effects of the above-described alternative will be described below with reference to specific embodiments.
Drawings
The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:
fig. 1 is a flowchart of a method for extracting page element information in a first embodiment of the present application;
fig. 2 is a flowchart of a method for extracting page element information in the second embodiment of the present application;
fig. 3 is a structural diagram of an apparatus for extracting page element information in a third embodiment of the present application;
fig. 4 is a block diagram of an electronic device for implementing the method for extracting page element information according to the embodiment of the present application.
Detailed Description
The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments of the application for the understanding of the same, which are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Example one
Fig. 1 is a flowchart of a method for extracting page element information in an embodiment of the present application, where the embodiment of the present application is applied to a case where element information of application value for product and operation optimization is extracted from page element information triggered by a user interaction behavior.
The method for extracting page element information shown in fig. 1 includes:
s101, collecting an element information set triggered by a user interaction event on a target page.
The target page can be a page of a certain website or a page of a certain application program, and the number of the target pages is at least one. In the embodiment, embedding points, such as full embedding points or code embedding points, on the target page is performed in advance. And binding the set user interaction event to the element on the target page through the embedded point, and triggering the bound element information when the user interaction event is monitored. The element information includes, but is not limited to, element content (or called element name), element type, element location, and page where the element is located.
Optionally, in the interaction process between the user and the target page, acquiring element information triggered by a user interaction event in real time through a buried point to form an element information set; or, selecting part or all of the element information from the element information collected by the buried points to form an element information set.
S102, extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element.
The elements on a page have a variety of attributes, such as element content, element type, and page in which the element is located. The page service refers to functions, entries, services, and the like provided by a target page. Some element attributes are associated with page services, and some element attributes are not associated with page services. Whether the element attribute is associated with the page service or not can be determined by whether the page service is called or not after the element is triggered, namely whether the function, the entrance or the service provided by the page is called or not.
Optionally, the attribute of the element associated with the page service includes: the method comprises the following steps of at least one of a click type event bound by an element, a button type of the element, a link function of the element, a content length of the element smaller than a set value, a landing page of the element and a promotion page of the element. Wherein, the click-like events include but are not limited to touch, single click, double click, and the like. The page responds to the click operation of the user and calls the page service; moreover, compared with a sliding event, the clicking event can reflect the focusing action of the user on the element. Similarly, the page responds to the triggering operation of the user on the element of the button type, calls the corresponding page service, and responds to the triggering operation of the user on the element of the link function, and jumps to the corresponding page. For element content lengths less than the set value, if the element content length is less than the set value, i.e., the element content is a short text, the elements of the short text can identify the service or function of the page, e.g., a tag for a page title, music, video, game, etc. Landing pages are web pages that are displayed to a potential user when the user clicks on an advertisement or searches with a search engine. Typically, the page will display expanded content related to the clicked advertisement or search result link, and the page should be search engine optimized for a certain keyword (or phrase). Therefore, the landing page comprehensively shows the provided functions, entrances, services and the like, and is closely related to the page business; moreover, the landing page can directly interact with the user relative to the subsequent page, and the focusing degree of the user on the elements can be reflected. At present, registration in updating, order placement in promotion and the like are often achieved through promotion channels, elements on a promotion page are triggered by interaction events, promotion content is displayed, or a merchant page is jumped to.
The index of the number of times an element is triggered includes an element conversion rate and/or an element click rate.
The frequency index reflects the focusing degree of the element by the user, and the higher the focusing degree of the element by the user is, the more definite the reflected behavior semantics of the user are. The user behavior semantics in this embodiment are different from the semantics in the image or the text, but are service-oriented user behavior semantics, for example, a user clicks a reading button on a page, and uses a reading service provided by the page, which can reflect the behavior semantics of the user to a certain extent, but is not clear enough, and cannot exclude the possibility of misoperation or random operation; and in combination, the index of the triggered times of the reading button is in a preset range, and the clear reading behavior semantics of the user are reflected.
The target element information extracted in this embodiment includes, but is not limited to: element content, element type, and page on which the element is located (which may span multiple uniform resource locators).
In the embodiment of the application, all user behavior data are acquired by collecting an element information set triggered by a user interaction event on a target page; target element information is extracted from the element information set according to attributes of the elements, which are associated with the page service, and the number index of triggered times of the elements, so that the service-oriented element information which can clearly reflect user behavior semantics is extracted, and the application value of the element information to product and operation optimization is improved.
Furthermore, the service attributes of the elements can be clearly embodied by the clicking type event bound by the elements, the button types of the elements, the link functions of the elements, the content length of the elements smaller than a set value, the landing page of the elements and the promotion page of the elements, so that the target element information extracted based on the attributes is the element information facing the page service; the conversion rate and/or click rate of the elements reflect the focusing behavior of the user on the elements, reflect definite behavior semantics and are beneficial to accurately extracting target element information.
Example two
Fig. 2 is a flowchart of a method for extracting page element information in the second embodiment of the present application, and the second embodiment of the present application performs optimization and improvement on the basis of the technical solutions of the foregoing embodiments.
Further, the operation of extracting target element information from the element information set according to the attribute of the element associated with the page service and the number index of triggered times of the element is refined into the operation of extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service; and extracting target element information' from the candidate element information set according to the number index of triggered times of the elements so as to quickly extract the target element information and reduce the computation amount.
The method for extracting page element information shown in fig. 2 includes:
s201, collecting an element information set triggered by a user interaction event on a target page.
S202, extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service.
The element information set stores each attribute of the element, and based on the attribute, the attribute comparison can be directly carried out in the element information set, so that a candidate element information set with the attribute associated with the page service is extracted.
Optionally, first, extracting element information triggered by a click event from an element information set; and then, extracting at least one of the element information of the button type, the element information of the link type, the element information of which the content length is smaller than a set value, the element information positioned on the landing page and the element information positioned on the promotion page from the element information triggered by the clicked event to form a candidate element information set.
Specifically, after the element information triggered by the click type event is extracted, the element attribute triggered by the click type event is extracted, including the element type (class), whether the element is a link (href), the length of the element content (content), and the page where the element is located. Then, the element information of the button (button) type, the element information of which the element content length is smaller than the set value, the element information of the link type, the element information located in the Landing Page (Landing Page), and the element information located in the promotion Page are extracted to form a candidate element information set. For convenience of description and distinction, the element information in the candidate element information set is referred to as candidate element information.
In the embodiment, the click event can reflect the focusing of the user on the element better than other interaction events; the element information is preferentially selected according to the types of the click events, and compared with other attributes, the available element information can be selected in the largest range; and then, candidate elements are extracted according to the buttons, the links, the content length and the located pages of the elements, so that the extraction efficiency of the element information is improved.
And S203, extracting target element information from the candidate element information set according to the triggered times indexes of the elements.
The present operation includes the following three alternative embodiments.
A first alternative embodiment: and calculating the conversion rate of each candidate element, and determining the candidate element information with the conversion rate within a first preset range from the candidate element information set as target element information.
Second alternative embodiment: and calculating the click rate of each candidate element, and determining the candidate element information with the click rate in a second preset range from the candidate element information set as the target element information.
A third alternative embodiment: and calculating the conversion rate and the click rate of each candidate element, and determining candidate element information with the conversion rate within a first preset range and candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
In the embodiment, the conversion rate and the click rate can obviously reflect the focusing behavior of the user on the elements, and have definite behavior semantics; the numerical values of the conversion rate and the click rate can reflect the focusing degree of the user on the elements and also reflect the strength of behavior semantics. Based on the method, a first preset range and/or a second preset range can be determined according to business needs, and the conversion rate in the first preset range and/or the click rate in the second preset range are/is selected, so that target element information with different semantic strengths is extracted.
In the embodiment, the click rate and the conversion rate of the candidate elements are calculated by taking the session as granularity.
Taking a session as an example of granularity, acquiring a plurality of sessions generated in the interaction process of a user and a target page, wherein each session comprises element information triggered by a click event in a preset time period; the preset time period may be 30 minutes, 40 minutes, etc. In calculating the conversion, for each candidate element, the element is scored by calculating the element conversion or click rate. For each session (session), if a click occurs to an element in the session, no matter how many clicks, the element in the session is considered to be converted into 1, otherwise, the element is 0. And counting the number of sessions with the conversion rate of 1 of each candidate element information, and dividing the number by the total number of sessions to obtain the conversion rate of each candidate element information.
When the click rate is calculated, the display frequency of each candidate element information on the target page can be determined according to the jumping frequency or the refreshing frequency of the target page. And dividing the number of the conversations containing each candidate element information by the display times of the corresponding candidate element information on the target page to obtain the click rate of each candidate element information.
Optionally, before determining the target element information, deleting the element information of which the conversion rate exceeds a first preset threshold and/or the element information of which the click rate exceeds a second preset threshold from the candidate element information set. Specifically, the first preset threshold and the second preset threshold may be set independently, and after the conversion rate and/or the click rate are calculated for each candidate element information, the element information with a higher conversion rate and/or click rate needs to be eliminated. For example, the conversion rate is usually higher than 5%, and these elements are often elements that are conventionally touched, and are page-oriented basic services, such as a login button, a save button, and the like; the information of the elements hardly reflects the personalized behavior semantics of the user; by deleting the candidate element information with high index of times, the element information reflecting the personalized behavior semantics of the user can be intensively extracted.
In this embodiment, the attribute may be directly obtained from the element information set, and the number index needs to be obtained through calculation, and the element information is preferentially extracted according to the attribute, and then secondary extraction is performed according to the number index, so that the target element information can be quickly extracted, and the calculation amount can be reduced. Furthermore, with the conversation as the granularity, the conversion rate and the click rate are calculated according to whether the user clicks the candidate element in a preset time period, so that the error caused by repeatedly clicking the candidate element by the user in a short time is avoided, and the accuracy of the conversion rate and the click rate is improved.
In the foregoing embodiments, after extracting target element information from an element information set according to an attribute of an element associated with a page service and a number of times that the element is triggered, the method further includes: and embedding points in the page to be embedded according to the target element information. Specifically, the code embedding or the visual embedding is adopted to embed the target element information on the page to be embedded, so that the element information which is oriented to the service and can clearly reflect the user behavior semantics can be acquired through embedding.
According to the method, the point burying is carried out on the page to be buried point according to the target element information, so that the element information which is on the page to be buried point, faces to the service and can clearly reflect the user behavior semantics can be accurately extracted, the point burying method is suitable for primary users or users of non-professional analysts, the technical threshold is low, the gap between the element information set and the buried point set by the user is filled from the element information set, and the buried point recommendation result with the service significance is output.
EXAMPLE III
Fig. 3 is a structural diagram of an apparatus for extracting page element information in a third embodiment of the present application, where the third embodiment of the present application is applicable to a case where element information of application value for product and operation optimization is extracted from page element information triggered by a user interaction behavior, and the apparatus is implemented by software and/or hardware and is specifically configured in an electronic device with a certain data operation capability.
An apparatus 300 for extracting page element information as shown in fig. 3 comprises: an acquisition module 301 and an extraction module 302; wherein,
the collection module 301 is configured to collect an element information set triggered by a user interaction event on a target page.
The extracting module 302 is configured to extract target element information from the element information set according to an attribute of the element associated with the page service and a number index of times that the element is triggered.
In the embodiment of the application, all user behavior data are acquired by collecting an element information set triggered by a user interaction event on a target page; target element information is extracted from the element information set according to attributes of the elements, which are associated with the page service, and the number index of triggered times of the elements, so that the service-oriented element information which can clearly reflect user behavior semantics is extracted, and the application value of the element information to product and operation optimization is improved.
Further, the attributes of the element associated with the page service include: at least one of a click type event bound by the element, a button type of the element, a link function of the element, an element content length smaller than a set value, a landing page of the element and a promotion page of the element;
the number of times that an element is triggered index includes: conversion and/or click-through of the element.
Further, the extraction module 302 includes a candidate element information set extraction unit and a target element information extraction unit. The candidate element information set extraction unit is used for extracting a candidate element information set from the element information set according to the attribute of the element, which is associated with the page service; the target element information extraction unit is used for extracting target element information from the candidate element information set according to the number index of triggering the element.
Further, the candidate element information set extraction unit is specifically configured to extract element information triggered by a click event from the element information set; and extracting at least one of the element information of the button type, the element information of the link type, the element information of which the content length is smaller than a set value, the element information positioned on the landing page and the element information positioned on the promotion page from the element information triggered by the clicked event to form a candidate element information set.
Further, the target element information extraction unit is specifically configured to calculate a conversion rate and/or a click rate of each candidate element; and determining candidate element information with the conversion rate within a first preset range and/or candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
Further, when calculating the conversion rate and/or click rate of each candidate element, the target element information extraction unit is specifically configured to: acquiring a plurality of sessions generated in the interaction process of a user and a target page, wherein each session comprises element information triggered by a click event in a preset time period; calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of the multiple sessions; and/or calculating the click rate of each candidate element according to the number of the conversations containing the information of each candidate element and the display times of the information of each candidate element on the target page.
The device further comprises a deleting unit, which is used for deleting the element information of which the conversion rate exceeds a first preset threshold value and/or the element information of which the click rate exceeds a second preset threshold value in the candidate element information set before determining the element information of which the conversion rate is within a first preset range and/or the element information of which the click rate is within a second preset range as the target element information from the candidate element information set.
Further, the device also comprises a point burying module, wherein the point burying module is used for burying points in the page of the point to be buried according to the target element information.
The device for extracting the page element information can execute the method for extracting the page element information provided by any embodiment of the application, and has the corresponding functional modules and beneficial effects of executing the method for extracting the page element information.
Example four
According to an embodiment of the present application, an electronic device and a readable storage medium are also provided.
Fig. 4 is a block diagram of an electronic device implementing the method for extracting page element information according to the embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the present application that are described and/or claimed herein.
As shown in fig. 4, the electronic apparatus includes: one or more processors 401, memory 402, and interfaces for connecting the various components, including high-speed interfaces and low-speed interfaces. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions for execution within the electronic device, including instructions stored in or on the memory to display graphical information of a GUI on an external input/output apparatus (such as a display device coupled to the interface). In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories and multiple memories, as desired. Also, multiple electronic devices may be connected, with each device providing portions of the necessary operations (e.g., as a server array, a group of blade servers, or a multi-processor system). In fig. 4, one processor 401 is taken as an example.
Memory 402 is a non-transitory computer readable storage medium as provided herein. The memory stores instructions executable by at least one processor to cause the at least one processor to execute the method for extracting page element information provided by the application. The non-transitory computer-readable storage medium of the present application stores computer instructions for causing a computer to execute the extraction method of page element information provided by the present application.
Memory 402, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules corresponding to the method of extracting page element information in the embodiments of the present application (e.g., including acquisition module 301 and extraction module 302 shown in fig. 3). The processor 401 executes various functional applications of the server and data processing, i.e., a method of extracting page element information in the above-described method embodiments, by executing non-transitory software programs, instructions, and modules stored in the memory 402.
The memory 402 may include a storage program area and a storage data area, wherein the storage program area may store an operating system, an application program required for at least one function; the storage data area may store data created by use of an electronic device implementing the extraction method of the page element information, and the like. Further, the memory 402 may include high speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid state storage device. In some embodiments, the memory 402 may optionally include a memory remotely located from the processor 401, and these remote memories may be connected over a network to an electronic device that performs the extraction method of the page element information. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device performing the method of extracting page element information may further include: an input device 403 and an output device 404. The processor 401, the memory 402, the input device 403 and the output device 404 may be connected by a bus or other means, and fig. 4 illustrates an example of a connection by a bus.
The input device 403 may receive input numeric or character information and generate key signal inputs related to user settings and function control of an electronic apparatus performing the extraction method of the page element information, such as a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointing stick, one or more mouse buttons, a track ball, a joystick, or the like. The output devices 404 may include a display device, auxiliary lighting devices (e.g., LEDs), and haptic feedback devices (e.g., vibrating motors), among others. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device can be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, application specific ASICs (application specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, receiving data and instructions from, and transmitting data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software applications, or code) include machine instructions for a programmable processor, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), Wide Area Networks (WANs), the internet, and blockchain networks.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
According to the technical scheme of the embodiment of the application, all user behavior data are acquired by collecting an element information set triggered by a user interaction event on a target page; target element information is extracted from the element information set according to attributes of the elements, which are associated with the page service, and the number index of triggered times of the elements, so that the element information which is oriented to the service and can clearly reflect the user behavior semantics is extracted, and the application value of the element information to product and operation optimization is improved.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present application may be executed in parallel, sequentially, or in different orders, and the present invention is not limited thereto as long as the desired results of the technical solutions disclosed in the present application can be achieved.
The above-described embodiments should not be construed as limiting the scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (11)

1. A method for extracting page element information is characterized by comprising the following steps:
acquiring an element information set triggered by a user interaction event on a target page;
and extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element.
2. The method of claim 1,
the attributes of the elements associated with the page service include: at least one of a click type event bound by the element, a button type of the element, a link function of the element, an element content length smaller than a set value, a landing page of the element and a promotion page of the element;
the number of times the element is triggered index includes: conversion and/or click-through of the element.
3. The method according to claim 1 or 2, wherein the extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered comprises:
extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service;
and extracting target element information from the candidate element information set according to the number index of triggered elements.
4. The method of claim 3, wherein the extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service comprises:
extracting element information triggered by a click event from the element information set;
and extracting at least one of the element information of the button type, the element information of the link type, the element information of which the content length is smaller than a set value, the element information positioned on the landing page and the element information positioned on the promotion page from the element information triggered by the clicked event to form a candidate element information set.
5. The method according to claim 3, wherein the extracting target element information from the candidate element information set according to the index of the number of times that the element is triggered comprises:
calculating the conversion rate and/or click rate of each candidate element;
and determining the candidate element information with the conversion rate within a first preset range and/or the candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
6. The method of claim 5, wherein calculating the conversion rate and/or click-through rate for each candidate element comprises:
acquiring a plurality of sessions generated in the interaction process of the user and a target page, wherein each session comprises element information triggered by a click event in a preset time period;
calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of the sessions; and/or calculating the click rate of each candidate element according to the number of the conversations containing the information of each candidate element and the display times of the information of each candidate element on the target page.
7. The method according to claim 5, before determining, from the candidate element information set, element information with the conversion rate within a first preset range and/or element information with the click rate within a second preset range as target element information, further comprising:
and deleting the element information of which the conversion rate exceeds a first preset threshold value and/or the element information of which the click rate exceeds a second preset threshold value in the candidate element information set.
8. The method according to any one of claims 1 to 7, further comprising, after extracting target element information from the element information set according to an attribute of an element associated with a page service and an index of a number of times the element is triggered:
and embedding points in the page to be embedded according to the target element information.
9. An apparatus for extracting information of a page element, comprising:
the acquisition module is used for acquiring an element information set triggered by a user interaction event on a target page;
and the extracting module is used for extracting target element information from the element information set according to the attribute of the element, which is associated with the page service, and the number index of triggered times of the element.
10. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a method of extracting page element information as claimed in any one of claims 1 to 8.
11. A non-transitory computer readable storage medium storing computer instructions for causing a computer to execute a method of extracting page element information according to any one of claims 1 to 8.
CN202010093390.7A 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium Active CN111310044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010093390.7A CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010093390.7A CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN111310044A true CN111310044A (en) 2020-06-19
CN111310044B CN111310044B (en) 2023-09-26

Family

ID=71161722

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010093390.7A Active CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111310044B (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112596837A (en) * 2020-12-24 2021-04-02 瑞声新能源发展(常州)有限公司科教城分公司 Method and device for replacing mobile terminal theme and readable storage medium
CN114626364A (en) * 2022-01-26 2022-06-14 科大讯飞股份有限公司 Content extraction method and related device, electronic equipment and storage medium
CN115017201A (en) * 2022-08-09 2022-09-06 中企云链(北京)金融信息服务有限公司 FLINK processing engine-based user behavior analysis method and system

Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105447139A (en) * 2015-11-20 2016-03-30 广州华多网络科技有限公司 Data acquisition statistical method, and system, terminal and service equipment thereof
US20170013072A1 (en) * 2015-07-09 2017-01-12 Guangzhou Ucweb Computer Technology Co., Ltd. Webpage pre-reading method, apparatus and smart terminal device
US20170236073A1 (en) * 2016-02-12 2017-08-17 Linkedln Corporation Machine learned candidate selection on inverted indices
US20170315676A1 (en) * 2016-04-28 2017-11-02 Linkedln Corporation Dynamic content insertion
CN108038053A (en) * 2017-11-29 2018-05-15 上海恺英网络科技有限公司 A kind of dynamic configuration buries method and apparatus a little
CN108334525A (en) * 2017-01-20 2018-07-27 阿里巴巴集团控股有限公司 A kind of method for exhibiting data and device
CN108459845A (en) * 2018-03-14 2018-08-28 北京思特奇信息技术股份有限公司 A kind of surveillance tag attribute buries point methods and device
CN109145230A (en) * 2017-06-15 2019-01-04 百度在线网络技术(北京)有限公司 Information output method and device
CN109522191A (en) * 2018-10-16 2019-03-26 深圳壹账通智能科技有限公司 A kind of method and device of the attribute information of acquisition interbehavior instruction
CN109740089A (en) * 2018-11-30 2019-05-10 东软集团股份有限公司 Collecting method, device, system, readable storage medium storing program for executing and electronic equipment
CN109767259A (en) * 2018-12-15 2019-05-17 深圳壹账通智能科技有限公司 Based on operation event promotion method, apparatus, equipment and the medium for burying point data
CN110378732A (en) * 2019-07-18 2019-10-25 腾讯科技(深圳)有限公司 Information display method, information correlation method, device, equipment and storage medium
CN110674022A (en) * 2019-09-27 2020-01-10 北京三快在线科技有限公司 Behavior data acquisition method and device and storage medium

Patent Citations (13)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20170013072A1 (en) * 2015-07-09 2017-01-12 Guangzhou Ucweb Computer Technology Co., Ltd. Webpage pre-reading method, apparatus and smart terminal device
CN105447139A (en) * 2015-11-20 2016-03-30 广州华多网络科技有限公司 Data acquisition statistical method, and system, terminal and service equipment thereof
US20170236073A1 (en) * 2016-02-12 2017-08-17 Linkedln Corporation Machine learned candidate selection on inverted indices
US20170315676A1 (en) * 2016-04-28 2017-11-02 Linkedln Corporation Dynamic content insertion
CN108334525A (en) * 2017-01-20 2018-07-27 阿里巴巴集团控股有限公司 A kind of method for exhibiting data and device
CN109145230A (en) * 2017-06-15 2019-01-04 百度在线网络技术(北京)有限公司 Information output method and device
CN108038053A (en) * 2017-11-29 2018-05-15 上海恺英网络科技有限公司 A kind of dynamic configuration buries method and apparatus a little
CN108459845A (en) * 2018-03-14 2018-08-28 北京思特奇信息技术股份有限公司 A kind of surveillance tag attribute buries point methods and device
CN109522191A (en) * 2018-10-16 2019-03-26 深圳壹账通智能科技有限公司 A kind of method and device of the attribute information of acquisition interbehavior instruction
CN109740089A (en) * 2018-11-30 2019-05-10 东软集团股份有限公司 Collecting method, device, system, readable storage medium storing program for executing and electronic equipment
CN109767259A (en) * 2018-12-15 2019-05-17 深圳壹账通智能科技有限公司 Based on operation event promotion method, apparatus, equipment and the medium for burying point data
CN110378732A (en) * 2019-07-18 2019-10-25 腾讯科技(深圳)有限公司 Information display method, information correlation method, device, equipment and storage medium
CN110674022A (en) * 2019-09-27 2020-01-10 北京三快在线科技有限公司 Behavior data acquisition method and device and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
余邹蓓蕾;: "互联网产品运营设计分析研究", 工业设计研究, no. 00 *

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112596837A (en) * 2020-12-24 2021-04-02 瑞声新能源发展(常州)有限公司科教城分公司 Method and device for replacing mobile terminal theme and readable storage medium
CN112596837B (en) * 2020-12-24 2024-05-17 瑞声新能源发展(常州)有限公司科教城分公司 Mobile terminal theme changing method and device and readable storage medium
CN114626364A (en) * 2022-01-26 2022-06-14 科大讯飞股份有限公司 Content extraction method and related device, electronic equipment and storage medium
CN115017201A (en) * 2022-08-09 2022-09-06 中企云链(北京)金融信息服务有限公司 FLINK processing engine-based user behavior analysis method and system

Also Published As

Publication number Publication date
CN111310044B (en) 2023-09-26

Similar Documents

Publication Publication Date Title
US11836662B2 (en) Systems and methods for identifying processes for robotic automation and building models therefor
CN102609474B (en) A kind of visit information supplying method and system
CN111475750B (en) Page preloading control method, device, system, equipment and storage medium
CN111310044B (en) Page element information extraction method, device, equipment and storage medium
JP6956119B2 (en) Systems and methods for providing contextual information
CN111460289B (en) News information pushing method and device
US11003667B1 (en) Contextual information for a displayed resource
CN111582477A (en) Training method and device of neural network model
CN112084150B (en) Model training and data retrieval method, device, equipment and storage medium
CN111078878A (en) Text processing method, device and equipment and computer readable storage medium
US11468236B2 (en) Method and apparatus for performing word segmentation on text, device, and medium
CN113159807A (en) Landing page processing method, device, equipment and medium
CN112380847A (en) Interest point processing method and device, electronic equipment and storage medium
JP2024507902A (en) Information retrieval methods, devices, electronic devices and storage media
CN112052397A (en) User feature generation method and device, electronic equipment and storage medium
CN111666417A (en) Method and device for generating synonyms, electronic equipment and readable storage medium
CN112000863A (en) User behavior data analysis method, device, equipment and medium
CN113516491A (en) Promotion information display method and device, electronic equipment and storage medium
CN111290912A (en) Single-page application performance monitoring method and device and electronic equipment
US10467300B1 (en) Topical resource recommendations for a displayed resource
CN112308602A (en) Method and apparatus for generating a model
US10250705B2 (en) Interaction trajectory retrieval
CN112148979B (en) Event-associated user identification method, device, electronic equipment and storage medium
CN112101012B (en) Interactive domain determining method and device, electronic equipment and storage medium
CN112446728B (en) Advertisement recall method, device, equipment and storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant