CN111310044B - Page element information extraction method, device, equipment and storage medium - Google Patents

Page element information extraction method, device, equipment and storage medium Download PDF

Info

Publication number
CN111310044B
CN111310044B CN202010093390.7A CN202010093390A CN111310044B CN 111310044 B CN111310044 B CN 111310044B CN 202010093390 A CN202010093390 A CN 202010093390A CN 111310044 B CN111310044 B CN 111310044B
Authority
CN
China
Prior art keywords
element information
page
candidate
triggered
extracting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202010093390.7A
Other languages
Chinese (zh)
Other versions
CN111310044A (en
Inventor
王晓元
陈鑫
周振宇
叶峻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd filed Critical Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010093390.7A priority Critical patent/CN111310044B/en
Publication of CN111310044A publication Critical patent/CN111310044A/en
Application granted granted Critical
Publication of CN111310044B publication Critical patent/CN111310044B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/957Browsing optimisation, e.g. caching or content distillation

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The application discloses a method, a device, equipment and a storage medium for extracting page element information, and relates to the technical field of data mining. The specific implementation scheme is as follows: collecting element information sets triggered by user interaction events on a target page; and extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered. The embodiment of the application extracts the element information which is oriented to the service and can definitely embody the user behavior semantics so as to improve the application value of the element information to the product and operation optimization.

Description

Page element information extraction method, device, equipment and storage medium
Technical Field
The application relates to the computer technology, in particular to the technical field of data mining.
Background
The embedded point is a common data analysis method for website analysis, and is used for collecting browsing and access data and application use conditions of a user in a website or an application program by adding some program codes into the website or the application program and analyzing user interaction behaviors so as to help products and operations to carry out subsequent optimization.
In the prior art, the user interaction behavior is generally analyzed by collecting element information triggered by the user interaction behavior. Along with the gradual enrichment of the business of websites or application programs, more and more element information needs to be acquired, the user behaviors reflected by the element information are disordered, and the application value for optimizing products and operation is lower and lower.
Disclosure of Invention
The embodiment of the application provides a method, a device, equipment and a storage medium for extracting page element information, so as to extract element information with utilization value for product and operation optimization.
In a first aspect, an embodiment of the present application provides a method for extracting page element information, including:
collecting element information sets triggered by user interaction events on a target page;
and extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered.
In the embodiment of the application, all user behavior data are acquired by acquiring the element information set triggered by the user interaction event on the target page; according to the attribute of the element associated with the page service and the number index of triggered elements, the target element information is extracted from the element information set, so that the service-oriented element information which can clearly embody the user behavior semantics is extracted, and the application value of the element information to the product and operation optimization is improved.
Optionally, the attribute of the element associated with the page service includes: at least one of click event of element binding, button type of element, link function of element, element content length smaller than set value, element in floor page and element in promotion page;
the number of times the element is triggered index includes: conversion rate and/or click through rate of the element.
In an optional implementation manner in the above application, the click event of the element binding, the button type of the element, the link function of the element, the content length of the element smaller than the set value, the service attribute of the element can be clearly embodied by the element being located on the landing page and the element being located on the promotion page, so that the target element information extracted based on the attribute is the element information facing the page service; the conversion rate and/or click rate of the elements embody the focusing behavior of the user on the elements, reflect the explicit behavior semantics, and are favorable for accurately extracting the target element information.
Optionally, extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered, including:
extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service;
and extracting target element information from the candidate element information set according to the number index of triggered elements.
In an optional implementation manner in the above application, the attribute may be directly obtained from the element information set, and the frequency index needs to be obtained by operation, so that the element information is preferentially extracted according to the attribute, and then the secondary extraction is performed according to the frequency index, so that the target element information can be rapidly extracted and the operation amount is reduced.
Optionally, the extracting the candidate element information set from the element information set according to the attribute of the element associated with the page service includes:
extracting element information triggered by a clicked event from the element information set;
and extracting at least one of button type element information, link type element information, element information with content length smaller than a set value, element information positioned on a landing page and element information positioned on a popularization page from element information triggered by the clicked event to form a candidate element information set.
In an optional implementation manner in the above application, the click event can embody focusing of the element by the user more than other interaction events; the element information is preferentially selected according to the click event type, and compared with other attributes, the available element information can be selected in a maximum range; and then extracting candidate elements according to buttons, links, content lengths and pages of the elements, thereby being beneficial to improving the extraction efficiency of element information.
Optionally, extracting target element information from the candidate element information set according to the number index of triggered elements includes:
calculating the conversion rate and/or click rate of each candidate element;
and determining the candidate element information with the conversion rate within a first preset range and/or the candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
In an optional implementation manner in the above application, the conversion rate and the click rate can obviously reflect the focusing behavior of the user on the element, and have clear behavior semantics; the numerical values of the conversion rate and the click rate can reflect the focusing degree of the user on the elements and also reflect the strength of behavior semantics. Based on the method, the conversion rate and/or the click rate within a preset range can be selected according to service requirements, and target element information with different semantic intensities can be extracted.
Optionally, calculating the conversion rate, and/or click-through rate, of each candidate element includes:
acquiring a plurality of sessions generated in the interaction process of a user and a target page, wherein each session comprises element information triggered by a click event in a preset period;
calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of the plurality of sessions; and/or calculating the click rate of each candidate element according to the number of sessions containing each candidate element information and the display times of each candidate element information on the target page.
In an optional implementation manner in the above application, with the conversation as granularity, the conversion rate and the click rate are calculated by whether the user clicks the candidate element or not in a preset period, so that errors caused by repeatedly clicking the candidate element in a short time by the user are avoided, and the accuracy of the conversion rate and the click rate is improved.
Optionally, before determining the element information with the conversion rate within the first preset range and/or the element information with the click rate within the second preset range as the target element information from the candidate element information set, the method further includes: and deleting the element information of which the conversion rate exceeds a first preset threshold value and/or the element information of which the click rate exceeds a second preset threshold value from the candidate element information set.
In an optional implementation manner in the above application, the elements with higher frequency indexes are generally conventionally touched elements, and are oriented to basic services of pages, such as login buttons, save buttons and the like; these element information hardly reflects the personalized behavior semantics of the user; by deleting the candidate element information with the high number of times index, the element information reflecting the personalized behavior semantics of the user can be intensively extracted.
Optionally, after extracting the target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered, the method further includes:
and burying points on the page to be buried according to the target element information.
According to the method, the embedded point is carried out on the to-be-embedded point page according to the target element information, so that the element information which is oriented to the service and can clearly reflect the user behavior semantics on the to-be-embedded point page is accurately extracted, the embedded point method is suitable for primary users or users of non-professional analysts, the technical threshold is low, the gap between the element information set and the embedded point set by the user is filled from the element information set, and the embedded point recommendation result with service significance is produced.
In a second aspect, an embodiment of the present application further provides an apparatus for extracting page element information, including:
the acquisition module is used for acquiring an element information set triggered by the user interaction event on the target page;
and the extraction module is used for extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered.
In a third aspect, an embodiment of the present application further provides an electronic device, including:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a method of extracting page element information as provided by embodiments of the first aspect.
In a fourth aspect, embodiments of the present application also provide a non-transitory computer-readable storage medium storing computer instructions for causing the computer to perform a method for extracting page element information as provided by the embodiments of the first aspect.
Other effects of the above alternative will be described below in connection with specific embodiments.
Drawings
The drawings are included to provide a better understanding of the present application and are not to be construed as limiting the application. Wherein:
FIG. 1 is a flow chart of a method for extracting page element information in a first embodiment of the application;
FIG. 2 is a flowchart of a method for extracting page element information in a second embodiment of the present application;
fig. 3 is a block diagram of an apparatus for extracting page element information in the third embodiment of the present application;
fig. 4 is a block diagram of an electronic device for implementing a method for extracting page element information according to an embodiment of the present application.
Detailed Description
Exemplary embodiments of the present application will now be described with reference to the accompanying drawings, in which various details of the embodiments of the present application are included to facilitate understanding, and are to be considered merely exemplary. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Example 1
Fig. 1 is a flowchart of a method for extracting page element information in a first embodiment of the present application, where the embodiment of the present application is applicable to a case of extracting element information of application values for product and operation optimization from page element information triggered by user interaction, where the method is performed by an apparatus for extracting page element information, where the apparatus is implemented by software and/or hardware, and is specifically configured in an electronic device having a certain data computing capability.
The method for extracting the page element information shown in fig. 1 comprises the following steps:
s101, acquiring an element information set triggered by a user interaction event on a target page.
The target pages may be pages of a website or pages of an application, and the number of target pages is at least one. In this embodiment, the embedded point, for example, the full embedded point or the code embedded point is performed on the target page in advance. And binding the set user interaction event to the element on the target page through the embedded point, and triggering the bound element information when the user interaction event is monitored. The element information includes, but is not limited to, element content (or referred to as element name), element type, element location, page where the element is located, and the like.
Optionally, in the interaction process of the user and the target page, element information triggered by the user interaction event is collected in real time through the buried point to form an element information set; or selecting part or all of the element information from the element information acquired through the buried points to form an element information set.
S102, extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered.
Elements on a page have a variety of attributes, such as element content, element type, and page on which the element is located. Page services refer to functions, portals, services, etc. provided by a target page. Some element attributes are associated with the page traffic and some element attributes are not associated with the page traffic. Whether the element attribute is associated with the page service or not can be determined by whether the page service is invoked after the element is triggered, namely whether the function, the entry or the service provided by the page is invoked.
Optionally, the attributes of the element associated with the page service include: the element binding click event, the button type of the element, the link function of the element, the element content length smaller than a set value, and at least one of the element being located on a floor page and the element being located on a promotion page. Wherein click-type events include, but are not limited to, touches, single clicks, double clicks, and the like. The page responds to the clicking operation of the user and calls the page service; moreover, compared with the sliding event, the clicking event can embody the focusing action of the user on the element. Similarly, the page calls a corresponding page service in response to a triggering operation of the user on the button type element, and jumps to the corresponding page in response to a triggering operation of the user on the link function element. For element content lengths less than the set point, if the element content length is less than the set point, i.e., the element content is short text, the element of the short text can identify a service or function of the page, such as a tag of a page title, music, video, game, etc. A landing page is a web page that is displayed to a user when a potential user clicks on an advertisement or searches using a search engine. Typically, the page will display expanded content related to the clicked advertisement or search result link, and the page should be search engine optimized for a certain keyword (or phrase). Thus, the landing page comprehensively displays the provided functions, portals, services and the like, and is closely related to the page business; moreover, the floor page can directly face the interaction of the user relative to the subsequent page, and the focusing degree of the user on the elements can be reflected. At present, registration in pull-up, ordering in promotion and the like are often achieved through a promotion channel, elements on a promotion page are triggered by interaction events, and promotion content is displayed or jumped to a merchant page.
The index of the number of times an element is triggered includes an element conversion rate and/or an element click rate.
The number index reflects the focusing degree of the user on the element, and the higher the focusing degree of the user on the element is, the more definite the reflected user behavior semantics are. The user behavior semantics in the embodiment are different from the semantics in the image or the text, but are service-oriented, for example, the user clicks a reading button on the page, and the reading service provided by the page is used, so that the behavior semantics of the user can be reflected to a certain extent, but the user behavior semantics are not clear enough, and the possibility of misoperation or random operation cannot be eliminated; and then combining, the number index of the triggered reading buttons is in a preset range, and the definite reading behavior semantics of the user are reflected.
The target element information extracted in this embodiment includes, but is not limited to: element content, element type, and page on which the element is located (which may span multiple uniform resource locators).
In the embodiment of the application, all user behavior data are acquired by acquiring the element information set triggered by the user interaction event on the target page; according to the attribute of the element associated with the page service and the number index of triggered elements, the target element information is extracted from the element information set, so that the service-oriented element information which can clearly embody the user behavior semantics is extracted, and the application value of the element information to the product and operation optimization is improved.
Further, the click event, the button type, the link function, the content length of the element smaller than the set value, the landing page and the promotion page of the element binding can clearly embody the service attribute of the element, so that the target element information extracted based on the attribute is the element information facing the page service; the conversion rate and/or click rate of the elements embody the focusing behavior of the user on the elements, reflect the explicit behavior semantics, and are favorable for accurately extracting the target element information.
Example two
Fig. 2 is a flowchart of a method for extracting page element information in the second embodiment of the present application, where the embodiment of the present application is optimized and improved based on the technical solutions of the foregoing embodiments.
Further, extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered, and refining the target element information into candidate element information set from the element information set according to the attribute of the element associated with the page service; and extracting target element information from the candidate element information set according to the number index of triggered elements so as to quickly extract the target element information and reduce the operand.
The method for extracting the page element information shown in fig. 2 comprises the following steps:
s201, acquiring an element information set triggered by a user interaction event on a target page.
S202, extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service.
The element information set stores all the attributes of the elements, and based on the attributes, attribute comparison can be directly carried out in the element information set, so that candidate element information sets with the attributes associated with the page business are extracted.
Optionally, firstly extracting element information triggered by a click event from an element information set; then, at least one of button type element information, link type element information, element information with content length smaller than a set value, element information on a landing page and element information on a promotion page is extracted from element information triggered by the click event to form a candidate element information set.
Specifically, after extracting the element information triggered by the click event, the element attribute triggered by the click event is extracted, including an element type (class), whether the element is a link (href), an element content (content) length and a page where the element is located. Then, element information of a button (button) type, element information of which element content length is smaller than a set value, element information of a link type, element information on a Landing Page (Landing Page), and element information on a promotion Page are extracted to constitute a candidate element information set. For convenience of description and distinction, element information in the candidate element information set is referred to as candidate element information.
In this embodiment, compared with other interaction events, the click event can embody focusing of the element by the user; the element information is preferentially selected according to the click event type, and compared with other attributes, the available element information can be selected in a maximum range; and then extracting candidate elements according to buttons, links, content lengths and pages of the elements, thereby being beneficial to improving the extraction efficiency of element information.
S203, extracting target element information from the candidate element information set according to the number index of triggered elements.
This operation includes the following three alternative embodiments.
First alternative embodiment: and calculating the conversion rate of each candidate element, and determining candidate element information with the conversion rate within a first preset range from the candidate element information set as target element information.
Second alternative embodiment: and calculating the click rate of each candidate element, and determining candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
Third alternative embodiment: and calculating the conversion rate and the click rate of each candidate element, and determining candidate element information with the conversion rate in a first preset range and candidate element information with the click rate in a second preset range from the candidate element information set as target element information.
In the embodiment, the conversion rate and the click rate can obviously reflect the focusing behavior of the user on the element, and have definite behavior semantics; the numerical values of the conversion rate and the click rate can reflect the focusing degree of the user on the elements and also reflect the strength of behavior semantics. Based on the above, the first preset range and/or the second preset range can be determined according to the service requirement, and the conversion rate in the first preset range and/or the click rate in the second preset range can be selected, so that the target element information with different semantic intensities can be extracted.
In the embodiment, the click rate and the conversion rate of the candidate elements are calculated by taking the session as granularity.
Taking a conversation as an example, acquiring a plurality of conversations generated in the interaction process of a user and a target page, wherein each conversation comprises element information triggered by a click event in a preset period; the preset time period may be 30 minutes, 40 minutes, etc. In calculating the conversion rate, the element is scored by calculating the element conversion rate or click rate for each candidate element. For each session (session), an element is considered to be turned to 1 if a click on the element occurs within the session, regardless of the number of clicks, and to be 0 otherwise. Counting the number of sessions with the conversion rate of 1 of each candidate element information, and dividing the number by the total number of sessions to obtain the conversion rate of each candidate element information.
When the click rate is calculated, the display times of each candidate element information on the target page can be determined according to the jump times or the refreshing times of the target page. Dividing the conversation number containing each candidate element information by the display times of the corresponding candidate element information on the target page to obtain the click rate of each candidate element information.
Optionally, before determining the target element information, deleting element information with a conversion rate exceeding a first preset threshold value and/or element information with a click rate exceeding a second preset threshold value in the candidate element information set. Specifically, the first preset threshold and the second preset threshold may be set independently, and after the conversion rate and/or the click rate are calculated for each candidate element information, the element information with higher conversion rate and/or click rate needs to be removed. For example, conversions above 5% are often high, and these elements are often normally reached elements, which are oriented towards the underlying business of the page, such as login buttons, save buttons, etc.; these element information hardly reflects the personalized behavior semantics of the user; by deleting the candidate element information with the high number of times index, the element information reflecting the personalized behavior semantics of the user can be intensively extracted.
In this embodiment, the attribute may be directly obtained from the element information set, and the frequency index needs to be calculated, so that the element information is preferentially extracted according to the attribute, and then the secondary extraction is performed according to the frequency index, so that the target element information can be rapidly extracted and the calculation amount is reduced. Furthermore, with the conversation as granularity, the conversion rate and the click rate are calculated by whether the user clicks the candidate element or not in a preset period of time, so that errors caused by repeatedly clicking the candidate element in a short time by the user are avoided, and the accuracy of the conversion rate and the click rate is improved.
In the above embodiments, after extracting the target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered, the method further includes: and burying the point to be buried page according to the target element information. Specifically, the code embedded points or the visual embedded points are adopted to embed the target element information on the page to be embedded points, so that the element information which faces to the service and can clearly embody the user behavior semantics can be acquired through the embedded points.
According to the embodiment, the point to be buried is performed on the point to be buried page according to the target element information, so that the element information which faces to the service and can clearly reflect the user behavior semantics on the point to be buried page is accurately extracted, the point burying method is suitable for primary users or users of non-professional analysts, the technical threshold is low, the gap between the element information set and the point to be buried set by the users is filled from the element information set, and the point burying recommended result with the service meaning is produced.
Example III
Fig. 3 is a block diagram of a device for extracting page element information in a third embodiment of the present application, where the embodiment of the present application is applicable to a case of extracting element information of application value for product and operation optimization from page element information triggered by user interaction, and the device is implemented by software and/or hardware and is specifically configured in an electronic device having a certain data computing capability.
The apparatus 300 for extracting page element information shown in fig. 3 includes: an acquisition module 301 and an extraction module 302; wherein, the liquid crystal display device comprises a liquid crystal display device,
the collection module 301 is configured to collect an element information set triggered by a user interaction event on a target page.
The extracting module 302 is configured to extract the target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered.
In the embodiment of the application, all user behavior data are acquired by acquiring the element information set triggered by the user interaction event on the target page; according to the attribute of the element associated with the page service and the number index of triggered elements, the target element information is extracted from the element information set, so that the service-oriented element information which can clearly embody the user behavior semantics is extracted, and the application value of the element information to the product and operation optimization is improved.
Further, the attribute of the element associated with the page service includes: at least one of click event of element binding, button type of element, link function of element, element content length smaller than set value, element in floor page and element in promotion page;
a count index for an element to be triggered, comprising: conversion rate and/or click through rate of the element.
Further, the extraction module 302 includes a candidate element information set extraction unit and a target element information extraction unit. The candidate element information set extraction unit is used for extracting a candidate element information set from the element information set according to the attribute of the element associated with the page business; the target element information extraction unit is used for extracting target element information from the candidate element information set according to the number index of triggered elements.
Further, the candidate element information set extracting unit is specifically configured to extract element information triggered by a click event from the element information set; and extracting at least one of button type element information, link type element information, element information with content length smaller than a set value, element information positioned on a landing page and element information positioned on a popularization page from element information triggered by the clicked event to form a candidate element information set.
Further, the target element information extraction unit is specifically configured to calculate a conversion rate and/or a click rate of each candidate element; and determining candidate element information with the conversion rate in a first preset range and/or candidate element information with the click rate in a second preset range from the candidate element information set as target element information.
Further, the target element information extraction unit is specifically configured to, when calculating the conversion rate and/or click rate of each candidate element: acquiring a plurality of sessions generated in the interaction process of a user and a target page, wherein each session comprises element information triggered by a click event in a preset period; calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of a plurality of sessions; and/or calculating the click rate of each candidate element according to the number of sessions containing each candidate element information and the display times of each candidate element information on the target page.
Further, the device further comprises a deleting unit, which is used for deleting the element information with the conversion rate in the first preset range and/or the element information with the click rate in the second preset range from the candidate element information set before determining the element information with the conversion rate in the first preset range as the target element information, wherein the element information with the conversion rate exceeding the first preset threshold and/or the element information with the click rate exceeding the second preset threshold in the candidate element information set.
Further, the device also comprises a point burying module, wherein the point burying module is used for burying points on the page to be buried according to the target element information.
The device for extracting the page element information can execute the method for extracting the page element information provided by any embodiment of the application, and has the corresponding functional modules and beneficial effects of executing the method for extracting the page element information.
Example IV
According to an embodiment of the present application, the present application also provides an electronic device and a readable storage medium.
As shown in fig. 4, a block diagram of an electronic device implementing the method for extracting page element information according to an embodiment of the present application is shown. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital processing, cellular telephones, smartphones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be exemplary only, and are not meant to limit implementations of the applications described and/or claimed herein.
As shown in fig. 4, the electronic device includes: one or more processors 401, memory 402, and interfaces for connecting the components, including a high-speed interface and a low-speed interface. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions executing within the electronic device, including instructions stored in or on memory to display graphical information of the GUI on an external input/output device, such as a display device coupled to the interface. In other embodiments, multiple processors and/or multiple buses may be used, if desired, along with multiple memories and multiple memories. Also, multiple electronic devices may be connected, each providing a portion of the necessary operations (e.g., as a server array, a set of blade servers, or a multiprocessor system). One processor 401 is illustrated in fig. 4.
Memory 402 is a non-transitory computer readable storage medium provided by the present application. The memory stores instructions executable by at least one processor to cause the at least one processor to perform the method for extracting page element information provided by the application. The non-transitory computer-readable storage medium of the present application stores computer instructions for causing a computer to execute the method of extracting page element information provided by the present application.
The memory 402 is used as a non-transitory computer readable storage medium for storing non-transitory software programs, non-transitory computer executable programs, and modules, such as program instructions/modules (e.g., including the acquisition module 301 and the extraction module 302 shown in fig. 3) corresponding to a method for extracting page element information in an embodiment of the present application. The processor 401 executes various functional applications of the server and data processing, i.e., a method of implementing extraction of page element information in the above-described method embodiment, by running non-transitory software programs, instructions, and modules stored in the memory 402.
Memory 402 may include a storage program area that may store an operating system, at least one application program required for functionality, and a storage data area; the storage data area may store data created by the use of the electronic device implementing the extraction method of page element information, and the like. In addition, memory 402 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, memory 402 may optionally include memory remotely located with respect to processor 401, which may be connected via a network to an electronic device performing the method of extracting page element information. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
The electronic device that performs the page element information extraction method may further include: an input device 403 and an output device 404. The processor 401, memory 402, input device 403, and output device 404 may be connected by a bus or otherwise, for example in fig. 4.
The input device 403 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device performing the method of extracting page element information, such as input devices for a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointer stick, one or more mouse buttons, a track ball, a joystick, etc. The output device 404 may include a display apparatus, auxiliary lighting devices (e.g., LEDs), and haptic feedback devices (e.g., vibration motors), among others. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device may be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, application specific ASIC (application specific integrated circuit), computer hardware, firmware, software, and/or combinations thereof. These various embodiments may include: implemented in one or more computer programs, the one or more computer programs may be executed and/or interpreted on a programmable system including at least one programmable processor, which may be a special purpose or general-purpose programmable processor, that may receive data and instructions from, and transmit data and instructions to, a storage system, at least one input device, and at least one output device.
These computing programs (also referred to as programs, software applications, or code) include machine instructions for a programmable processor, and may be implemented in a high-level procedural and/or object-oriented programming language, and/or in assembly/machine language. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and pointing device (e.g., a mouse or trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user may be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic input, speech input, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a background component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such background, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: local Area Networks (LANs), wide Area Networks (WANs), the internet, and blockchain networks.
The computer system may include a client and a server. The client and server are typically remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
According to the technical scheme of the embodiment of the application, all user behavior data are acquired by acquiring the element information set triggered by the user interaction event on the target page; according to the attribute of the element associated with the page service and the number index of triggered elements, the target element information is extracted from the element information set, so that the service-oriented element information which can clearly embody the user behavior semantics is extracted, and the application value of the element information to the product and operation optimization is improved.
It should be appreciated that various forms of the flows shown above may be used to reorder, add, or delete steps. For example, the steps described in the present application may be performed in parallel, sequentially, or in a different order, provided that the desired results of the disclosed embodiments are achieved, and are not limited herein.
The above embodiments do not limit the scope of the present application. It will be apparent to those skilled in the art that various modifications, combinations, sub-combinations and alternatives are possible, depending on design requirements and other factors. Any modifications, equivalent substitutions and improvements made within the spirit and principles of the present application should be included in the scope of the present application.

Claims (11)

1. The method for extracting the page element information is characterized by comprising the following steps of:
acquiring an element information set triggered by a user interaction event on a target page through a fully buried point in the target page;
extracting target element information from the element information set according to the attribute of the element associated with the page service and the number index of triggered elements; whether the element attribute is associated with the page service or not is determined by whether the page service is called after the element is triggered.
2. The method of claim 1, wherein the step of determining the position of the substrate comprises,
the attribute of the element associated with the page service comprises: at least one of click event of element binding, button type of element, link function of element, element content length smaller than set value, element in floor page and element in promotion page;
the number of times the element is triggered index includes: conversion rate and/or click through rate of the element.
3. The method according to claim 1 or 2, wherein the extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered comprises:
extracting a candidate element information set from the element information set according to the attribute of the element associated with the page service;
and extracting target element information from the candidate element information set according to the number index of triggered elements.
4. A method according to claim 3, wherein said extracting a candidate element information set from said element information set according to an attribute of an element associated with a page service comprises:
extracting element information triggered by a clicked event from the element information set;
and extracting at least one of button type element information, link type element information, element information with content length smaller than a set value, element information positioned on a landing page and element information positioned on a popularization page from element information triggered by the clicked event to form a candidate element information set.
5. A method according to claim 3, wherein extracting target element information from the candidate element information set according to the index of the number of times an element is triggered comprises:
calculating the conversion rate and/or click rate of each candidate element;
and determining the candidate element information with the conversion rate within a first preset range and/or the candidate element information with the click rate within a second preset range from the candidate element information set as target element information.
6. The method of claim 5, wherein the calculating the conversion and/or click through rate for each candidate element comprises:
acquiring a plurality of sessions generated in the interaction process of the user and the target page, wherein each session comprises element information triggered by a click event in a preset period;
calculating the conversion rate of each candidate element according to the number of sessions containing the information of each candidate element and the total number of the plurality of sessions; and/or calculating the click rate of each candidate element according to the number of sessions containing each candidate element information and the display times of each candidate element information on the target page.
7. The method according to claim 5, wherein before determining, from the candidate element information set, element information of which the conversion rate is within a first preset range and/or element information of which the click rate is within a second preset range as target element information, further comprising:
and deleting the element information of which the conversion rate exceeds a first preset threshold value and/or the element information of which the click rate exceeds a second preset threshold value from the candidate element information set.
8. The method according to claim 1, further comprising, after extracting target element information from the element information set according to the attribute of the element associated with the page service and the index of the number of times the element is triggered:
and burying points on the page to be buried according to the target element information.
9. An extracting device for page element information, comprising:
the acquisition module is used for acquiring an element information set triggered by a user interaction event on the target page through the fully buried point in the target page;
the extraction module is used for extracting target element information from the element information set according to the attribute of the element associated with the page service and the number index of triggered elements; whether the element attribute is associated with the page service or not is determined by whether the page service is called after the element is triggered.
10. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein, the liquid crystal display device comprises a liquid crystal display device,
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a method of extracting page element information as claimed in any one of claims 1 to 8.
11. A non-transitory computer-readable storage medium storing computer instructions for causing a computer to perform a method of extracting page element information according to any one of claims 1 to 8.
CN202010093390.7A 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium Active CN111310044B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010093390.7A CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010093390.7A CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN111310044A CN111310044A (en) 2020-06-19
CN111310044B true CN111310044B (en) 2023-09-26

Family

ID=71161722

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010093390.7A Active CN111310044B (en) 2020-02-14 2020-02-14 Page element information extraction method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111310044B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112596837B (en) * 2020-12-24 2024-05-17 瑞声新能源发展(常州)有限公司科教城分公司 Mobile terminal theme changing method and device and readable storage medium
CN115017201A (en) * 2022-08-09 2022-09-06 中企云链(北京)金融信息服务有限公司 FLINK processing engine-based user behavior analysis method and system

Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105447139A (en) * 2015-11-20 2016-03-30 广州华多网络科技有限公司 Data acquisition statistical method, and system, terminal and service equipment thereof
CN108038053A (en) * 2017-11-29 2018-05-15 上海恺英网络科技有限公司 A kind of dynamic configuration buries method and apparatus a little
CN108334525A (en) * 2017-01-20 2018-07-27 阿里巴巴集团控股有限公司 A kind of method for exhibiting data and device
CN108459845A (en) * 2018-03-14 2018-08-28 北京思特奇信息技术股份有限公司 A kind of surveillance tag attribute buries point methods and device
CN109145230A (en) * 2017-06-15 2019-01-04 百度在线网络技术(北京)有限公司 Information output method and device
CN109522191A (en) * 2018-10-16 2019-03-26 深圳壹账通智能科技有限公司 A kind of method and device of the attribute information of acquisition interbehavior instruction
CN109740089A (en) * 2018-11-30 2019-05-10 东软集团股份有限公司 Collecting method, device, system, readable storage medium storing program for executing and electronic equipment
CN109767259A (en) * 2018-12-15 2019-05-17 深圳壹账通智能科技有限公司 Based on operation event promotion method, apparatus, equipment and the medium for burying point data
CN110378732A (en) * 2019-07-18 2019-10-25 腾讯科技(深圳)有限公司 Information display method, information correlation method, device, equipment and storage medium
CN110674022A (en) * 2019-09-27 2020-01-10 北京三快在线科技有限公司 Behavior data acquisition method and device and storage medium

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106339398B (en) * 2015-07-09 2019-10-18 广州市动景计算机科技有限公司 A kind of pre-reading method of Webpage, device and intelligent terminal
US20170236073A1 (en) * 2016-02-12 2017-08-17 Linkedln Corporation Machine learned candidate selection on inverted indices
US20170315676A1 (en) * 2016-04-28 2017-11-02 Linkedln Corporation Dynamic content insertion

Patent Citations (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105447139A (en) * 2015-11-20 2016-03-30 广州华多网络科技有限公司 Data acquisition statistical method, and system, terminal and service equipment thereof
CN108334525A (en) * 2017-01-20 2018-07-27 阿里巴巴集团控股有限公司 A kind of method for exhibiting data and device
CN109145230A (en) * 2017-06-15 2019-01-04 百度在线网络技术(北京)有限公司 Information output method and device
CN108038053A (en) * 2017-11-29 2018-05-15 上海恺英网络科技有限公司 A kind of dynamic configuration buries method and apparatus a little
CN108459845A (en) * 2018-03-14 2018-08-28 北京思特奇信息技术股份有限公司 A kind of surveillance tag attribute buries point methods and device
CN109522191A (en) * 2018-10-16 2019-03-26 深圳壹账通智能科技有限公司 A kind of method and device of the attribute information of acquisition interbehavior instruction
CN109740089A (en) * 2018-11-30 2019-05-10 东软集团股份有限公司 Collecting method, device, system, readable storage medium storing program for executing and electronic equipment
CN109767259A (en) * 2018-12-15 2019-05-17 深圳壹账通智能科技有限公司 Based on operation event promotion method, apparatus, equipment and the medium for burying point data
CN110378732A (en) * 2019-07-18 2019-10-25 腾讯科技(深圳)有限公司 Information display method, information correlation method, device, equipment and storage medium
CN110674022A (en) * 2019-09-27 2020-01-10 北京三快在线科技有限公司 Behavior data acquisition method and device and storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
互联网产品运营设计分析研究;余邹蓓蕾;;工业设计研究(第00期);全文 *

Also Published As

Publication number Publication date
CN111310044A (en) 2020-06-19

Similar Documents

Publication Publication Date Title
CN111475750B (en) Page preloading control method, device, system, equipment and storage medium
US10726095B1 (en) Network content layout using an intermediary system
JP6956119B2 (en) Systems and methods for providing contextual information
CN111460289B (en) News information pushing method and device
CN104331474A (en) Page processing method and device
CN111310044B (en) Page element information extraction method, device, equipment and storage medium
CN110727668A (en) Data cleaning method and device
US11574013B1 (en) Query recommendations for a displayed resource
US11003667B1 (en) Contextual information for a displayed resource
JP2024507902A (en) Information retrieval methods, devices, electronic devices and storage media
EP3482308A1 (en) Contextual information for a displayed resource that includes an image
US10467300B1 (en) Topical resource recommendations for a displayed resource
CN111881339B (en) Method and device for pushing and notifying resource information, electronic equipment and storage medium
CN111666417A (en) Method and device for generating synonyms, electronic equipment and readable storage medium
CN112148979B (en) Event-associated user identification method, device, electronic equipment and storage medium
CN111931524B (en) Method, apparatus, device and storage medium for outputting information
CN112101012B (en) Interactive domain determining method and device, electronic equipment and storage medium
CN111581366B (en) User intention determining method, device, electronic equipment and readable storage medium
CN111723318B (en) Page data processing method, device, equipment and storage medium
CN112446728B (en) Advertisement recall method, device, equipment and storage medium
CN111080369B (en) Advertisement information display method, device, equipment and readable storage medium
CN111611476A (en) Method and device for displaying special topic page
CN111552879B (en) Data processing method and device
US20210216597A1 (en) Method and apparatus for processing mini app material, electronic device and medium
CN111552877A (en) Data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant