CN111723318A - Page data processing method, device, equipment and storage medium - Google Patents

Publication number
CN111723318A
Authority
CN
China
Prior art keywords
block
page
determining
path
optimized
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202010519083.0A
Other languages
Chinese (zh)
Other versions
CN111723318B (en)
Inventor
王晓元
陈承泽
徐瑜
叶峻
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Baidu Netcom Science and Technology Co Ltd
Original Assignee
Beijing Baidu Netcom Science and Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Baidu Netcom Science and Technology Co Ltd
Priority to CN202010519083.0A
Publication of CN111723318A
Application granted
Publication of CN111723318B
Legal status: Active
Anticipated expiration

Classifications

    • G: PHYSICS
    • G06: COMPUTING; CALCULATING OR COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90: Details of database functions independent of the retrieved data types
    • G06F16/95: Retrieval from the web
    • G06F16/957: Browsing optimisation, e.g. caching or content distillation
    • G06F16/9574: Browsing optimisation, e.g. caching or content distillation of access to content, e.g. by caching
    • G06F16/9577: Optimising the visualization of content, e.g. distillation of HTML documents
    • G06F16/958: Organisation or management of web site content, e.g. publishing, maintaining pages or automatic linking

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Information Transfer Between Computers (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The application discloses a page data processing method, a page data processing device, page data processing equipment and a storage medium, and relates to the technical field of webpage processing. The specific implementation scheme is as follows: determining blocks in a page according to a Document Object Model (DOM) tree of the page; determining a block path of the page; wherein the block path is a path through at least one of the blocks; and determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page. The application provides a new way of determining content to be optimized in a page.

Description

Page data processing method, device, equipment and storage medium
Technical Field
The present application relates to the field of internet technologies, and in particular, to a method, an apparatus, a device, and a storage medium for processing page data.
Background
A landing page (LP) is the first page a user reaches on a website within a session. In a search scenario, a search result page is provided based on the user's search keywords; when the user wants to view details, the user clicks a search result and enters a landing page. The landing page has a key impact on the user's subsequent behavior.
Disclosure of Invention
The disclosure provides a method, an apparatus, a device and a storage medium for page data processing.
According to an aspect of the present disclosure, there is provided a page data processing method, including:
determining blocks in a page according to a Document Object Model (DOM) tree of the page;
determining a block path of the page; wherein the block path is a path through at least one of the blocks;
and determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
According to another aspect of the present disclosure, there is provided a page data processing apparatus including:
the block determining module is used for determining a block in a page according to a Document Object Model (DOM) tree of the page;
the block path module is used for determining a block path of the page; wherein the block path is a path through at least one of the blocks;
and the block selection module is used for determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
According to a third aspect, there is provided an electronic device comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform a page data processing method as in any one of the embodiments of the present application.
According to a fourth aspect, there is provided a non-transitory computer-readable storage medium storing computer instructions for causing a computer to execute the page data processing method according to any one of the embodiments of the present application.
The technology according to the present application provides a new way of determining content to be optimized in a page.
It should be understood that the statements in this section do not necessarily identify key or critical features of the embodiments of the present disclosure, nor do they limit the scope of the present disclosure. Other features of the present disclosure will become apparent from the following description.
Drawings
The drawings are included to provide a better understanding of the present solution and are not intended to limit the present application. Wherein:
FIG. 1a is a schematic flowchart of a page data processing method according to an embodiment of the present application;
FIG. 1b is a DOM diagram of a page provided in accordance with an embodiment of the present application;
FIG. 1c is a schematic diagram of a page block structure provided in accordance with an embodiment of the present application;
FIG. 2 is a schematic flowchart of a page data processing method according to an embodiment of the present application;
FIG. 3 is a flowchart illustrating a page data processing method according to an embodiment of the present application;
FIG. 4 is a schematic structural diagram of a page data processing apparatus according to an embodiment of the present application;
FIG. 5 is a block diagram of an electronic device for implementing a page data processing method according to an embodiment of the present application.
Detailed Description
The following description of the exemplary embodiments of the present application, taken in conjunction with the accompanying drawings, includes various details of the embodiments of the application for the understanding of the same, which are to be considered exemplary only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made without departing from the scope and spirit of the present application. Also, descriptions of well-known functions and constructions are omitted in the following description for clarity and conciseness.
Fig. 1a is a schematic flowchart of a page data processing method according to an embodiment of the present application. This embodiment is applicable to optimizing web pages, in particular landing pages. The page data processing method disclosed in this embodiment may be executed by an electronic device, specifically by a page data processing apparatus, which may be implemented in software and/or hardware and configured in the electronic device. Referring to fig. 1a, the page data processing method provided in this embodiment includes:
S110, determining blocks in the page according to a Document Object Model (DOM) tree of the page.
The page refers to a page to be optimized, such as a landing page. The DOM (Document Object Model) is a platform- and language-independent application programming interface that represents a document as a tree. A block is an interactive unit in the page; each interactive unit is one block, and the page may include block types such as forms, communication (e.g., online consultation or phone calls), application downloads, two-dimensional codes, pictures, text, or videos.
Specifically, the block type included in the page may be determined according to each node tag in the DOM tree of the page, and the page may be divided according to the block type to obtain at least two blocks.
In an alternative embodiment, S110 includes: determining a sub-DOM tree of a block according to the association between candidate block types and node tags and the nodes in the DOM tree of the page; and determining the blocks in the page according to the sub-DOM trees of the blocks.
In the DOM tree, all descendants of the maximum parent node of a block's sub-DOM tree belong to that sub-DOM tree; the maximum parent node of a block's sub-DOM tree does not overlap the maximum parent node of any other block's sub-DOM tree; and the sub-DOM tree does not include nodes of other blocks. Specifically, the candidate block types and the association between candidate block types and node tags are provided in advance. For example, the node tag associated with a form is form (form element) or table (table element), the node tag associated with an image is img (image element), the node tag associated with text is text (text element), and the node tag associated with a video is video (video element). The DOM tree of the page is parsed to find a node tag associated with a candidate block type; starting from that node, the DOM tree is traversed upward to find the maximum parent node that contains the element of that block type and does not overlap the parent node of any other block type, which yields the sub-DOM tree of the block. Fig. 1b is a schematic diagram of the DOM of a page provided in an embodiment of the present application. Referring to fig. 1b, the DOM tree includes an img (image element) node associated with an image and a table (table element) node associated with a form. The sub-DOM tree of the image block contains only the img node, while the sub-DOM tree of the form block contains three nodes: div (container element), table, and h2 (heading element). Determining block boundaries according to the node structure in the DOM tree of the page improves the accuracy of the blocks.
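The upward search for the maximum parent node can be illustrated with a minimal Python sketch. It is not the patented implementation: it assumes a well-formed page that can be parsed by xml.etree.ElementTree, assumes that block-type nodes are not nested inside one another, and uses the hypothetical names TAG_TO_BLOCK_TYPE and find_blocks. The tag mapping follows the examples given above.

```python
import xml.etree.ElementTree as ET

# Association between node tags and candidate block types (from the examples above).
TAG_TO_BLOCK_TYPE = {
    "form": "form", "table": "form",
    "img": "image", "text": "text", "video": "video",
}

def find_blocks(root):
    """Return (block_type, top_node) pairs, one per block-type node.

    Assumption: block-type nodes are not nested inside each other.
    """
    parent = {child: node for node in root.iter() for child in node}

    def seed_count(node):
        # Number of block-type nodes in the subtree rooted at `node`.
        return sum(1 for n in node.iter() if n.tag in TAG_TO_BLOCK_TYPE)

    blocks = []
    for seed in (n for n in root.iter() if n.tag in TAG_TO_BLOCK_TYPE):
        top = seed
        # Climb while the parent's subtree contains no other block-type node,
        # so the sub-DOM tree never overlaps another block.
        while top in parent and seed_count(parent[top]) == seed_count(top):
            top = parent[top]
        blocks.append((TAG_TO_BLOCK_TYPE[seed.tag], top))
    return blocks

# A page shaped like the example of fig. 1b: an image next to a titled form.
page = ET.fromstring("<body><img/><div><h2>Title</h2><table/></div></body>")
for block_type, node in find_blocks(page):
    print(block_type, [n.tag for n in node.iter()])
```

On this sample page the image block consists of the img node alone, and the form block consists of the div, h2 and table nodes, matching the description of fig. 1b.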
By dividing the page into a plurality of blocks, the page can be optimized based on the block granularity, and particularly, the block to be optimized of the page can be determined subsequently by combining the user behavior data and the block structure characteristics of the page.
S120, determining a block path of the page; wherein the block path is a path through at least one of the blocks.
Specifically, a block path passing through at least one block can be obtained from the blocks in the page. A single block may lie on several block paths, for example a block path passing through only that block and block paths passing through that block and other blocks. A block path passing through only one block means that the user exits the page after touching that block; a block path passing through at least two blocks means that the user exits the page after touching at least two blocks in succession. For example, a block path passing through block A and block B means that block A and block B are touched in sequence and the user exits the page at block B. Fig. 1c is a schematic diagram of a page block structure according to an embodiment of the present application. Referring to fig. 1c, the page includes image 1, form 1, image 2, image 3, image 4, image 5 and form 2 in sequence, and the block paths starting from image 1 may include image 1; image 1 to form 1; image 1 to image 2 to image 3 to image 4; and so on.
S130, determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
Specifically, by mapping the user behavior data onto the blocks or block paths of the page, the behavior characteristics of users on the blocks or block paths, such as user-loss characteristics or conversion characteristics, can be obtained, and the block to be optimized is selected according to these characteristics. Existing approaches determine the content to be optimized from either the user behavior data or the page structure alone: for example, building a heat map of the controls in the page from user behavior data during the page usage stage and analyzing the heat map, or performing static structural design of page layout, style and control selection during the page design stage. By contrast, determining the block to be optimized by combining the user behavior data with the block paths of the page can improve the accuracy of page optimization.
The technical solution of this embodiment provides a new page optimization technique that optimizes a page at block granularity, which can improve the efficiency of page optimization; determining the block to be optimized by combining the user behavior data with the block structure of the page can improve the accuracy of the block to be optimized.
Fig. 2 is a schematic flowchart of a page data processing method according to an embodiment of the present application. The present embodiment is an alternative proposed on the basis of the above-described embodiments. Referring to fig. 2, the page data processing method provided in this embodiment includes:
S210, determining blocks in the page according to a Document Object Model (DOM) tree of the page.
S220, determining a block path of the page; wherein the block path is a path through at least one of the blocks.
S230, determining the exit rate of the block path in the page according to the user behavior data of the page.
The exit rate of a block path represents the user loss along that block path. A low exit rate indicates that the blocks on the block path are of good quality; conversely, a block path with a high exit rate may contain a block to be optimized.
Specifically, the user behavior data of the page may be mapped to the block path to obtain the user behavior characteristics on the block path, for example, the number of times of user loss on the block path or the total number of times of user passing the block path, and the exit rate of the block path is obtained according to the user behavior characteristics on the block path.
In an alternative embodiment, S230 includes: determining the number of exits from the block path and the total number of times the block path is passed through according to the user behavior data of the page; and determining the exit rate of the block path according to the number of exits from the block path and the total number of times the block path is passed through. Specifically, the ratio of the number of exits from the block path to the total number of times the block path is passed through may be used as the exit rate of the block path. It should be noted that multiple consecutive touches on the same block in the user behavior data are aggregated into a single touch. The exit rate of the block path serves as a feature that combines the user behavior data with the page block structure, so that the subsequently determined block to be optimized is not merely related to each of them separately but results from their close interaction, which improves the accuracy of the block to be optimized.
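The exit-rate ratio can be sketched in Python as follows. This is an illustration under stated assumptions, not the patented implementation: each session is assumed to be an ordered list of touched block names; a session is assumed to "pass through" a block path when the path is a prefix of the session's touch sequence, and to "exit from" it when the sequence equals the path. The function names and the sample sessions are hypothetical.

```python
from itertools import groupby

def to_block_path(touches):
    """Aggregate consecutive touches on the same block into a single touch."""
    return tuple(block for block, _ in groupby(touches))

def exit_rate(sessions, path):
    """Exits from `path` divided by the total number of passes through `path`."""
    path = tuple(path)
    paths = [to_block_path(s) for s in sessions]
    # Assumption: passing through means the path is a prefix of the session.
    passed = sum(1 for p in paths if p[:len(path)] == path)
    # Assumption: exiting means the session ends at the last block of the path.
    exited = sum(1 for p in paths if p == path)
    return exited / passed if passed else 0.0

# Hypothetical touch logs for four sessions on one page.
sessions = [
    ["image1", "image1", "form1"],
    ["image1", "form1", "image2"],
    ["image1"],
    ["image1", "form1"],
]
print(exit_rate(sessions, ["image1", "form1"]))
print(exit_rate(sessions, ["image1"]))
```

With these sample sessions, three sessions pass through image1 to form1 and two of them exit there, while all four pass through image1 and one exits there.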
S240, determining a block to be optimized in the page according to the exit rate of the block path.
Specifically, the block to be optimized may be selected from the block paths having the exit rate higher than the threshold.
In an alternative embodiment, S240 includes: comparing the exit rates of block paths in at least two pages with the same block structure, and determining a block path to be optimized according to a comparison result; and selecting the block to be optimized from the block path to be optimized.
By comparing the exit rates of block paths in at least two pages with the same block structure, the present application avoids the influence of differences between block structures on the exit rate of a block path, which improves the accuracy of the block to be optimized. It should be noted that if pages with the same block structure are sparse, locally similar block structures may also be regarded as the same block structure; that is, the same block structure may include similar block structures. For example, a first block structure (image 1, form 1, image 2, image 3, form 2) and a second block structure (image 1, form 1, image 2, image 3) share the similar local structure image 1, form 1, image 2. The exit rates of the block paths of the two pages can be compared on this similar local structure to reduce the influence of the block structure on the exit rate of the block path.
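One way such a comparison might look is sketched below in Python. The source does not specify the comparison rule, so the margin-based rule, the function name higher_exit_paths, the page labels and all numeric exit rates are assumptions made for illustration only.

```python
def higher_exit_paths(rates_a, rates_b, shared, margin=0.1):
    """Flag block paths whose exit rate on one page exceeds the other's.

    rates_a, rates_b: dicts mapping a block path (tuple of block names)
    to its exit rate on page A and page B respectively.
    shared: blocks of the shared (or locally similar) block structure.
    margin: hypothetical minimum difference treated as significant.
    """
    shared_blocks = set(shared)
    flagged = []
    for path in sorted(rates_a.keys() & rates_b.keys()):
        if not set(path) <= shared_blocks:
            continue  # only compare paths inside the shared structure
        diff = rates_a[path] - rates_b[path]
        if diff > margin:
            flagged.append(("page_a", path))
        elif diff < -margin:
            flagged.append(("page_b", path))
    return flagged

# Illustrative exit rates for two pages sharing image1, form1, image2.
shared = ("image1", "form1", "image2")
rates_a = {("image1",): 0.20, ("image1", "form1"): 0.50,
           ("image1", "form1", "image2", "image3"): 0.90}
rates_b = {("image1",): 0.25, ("image1", "form1"): 0.30,
           ("image1", "form1", "image2", "image3"): 0.10}
print(higher_exit_paths(rates_a, rates_b, shared))
```

Here only the path image1 to form1 on page A is flagged: the path through image3 lies outside the shared structure and is skipped, and the difference on image1 alone is below the margin.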
In an alternative embodiment, S240 further includes: determining the exit rates of at least two controls in the block to be optimized; and selecting a control to be optimized from the at least two controls according to the exit rates of the at least two controls.
The block to be optimized may include a plurality of controls; for example, a communication block may include "detailed introduction", "go to official website" and "telephone consultation" controls. Analyzing the block to be optimized at the control level enables fine-grained optimization of the page and thus improves the accuracy of page optimization.
Specifically, for each control in the block to be optimized, the number of exits from the control and the total number of times the control is passed through can be determined according to the user behavior data, and the ratio of the number of exits from the control to the total number of times the control is passed through is taken as the exit rate of the control. The exit rates of different controls are then compared, and the control with the higher exit rate is taken as the control to be optimized.
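The control-level selection can be sketched in a few lines of Python. The function name control_to_optimize and the counts are hypothetical; only the ratio and the "higher exit rate" rule come from the description above.

```python
def control_to_optimize(counts):
    """Pick the control with the highest exit rate.

    counts: dict mapping control name -> (exits, total passes).
    Controls that were never passed through are ignored.
    """
    rates = {name: exits / total
             for name, (exits, total) in counts.items() if total}
    if not rates:
        return None, rates
    return max(rates, key=rates.get), rates

# Made-up counts for two controls of a communication block.
counts = {
    "detailed introduction": (10, 100),
    "telephone consultation": (30, 60),
}
worst, rates = control_to_optimize(counts)
print(worst, rates)
```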
According to the technical scheme of the embodiment of the application, the exit rate of the block path in the page is determined according to the user behavior data, and different pages with the same block structure are compared, so that the accuracy of the block to be optimized of the page is improved.
Fig. 3 is a schematic flowchart of a page data processing method according to an embodiment of the present application. The present embodiment is an alternative proposed on the basis of the above-described embodiments. Referring to fig. 3, the page data processing method provided in this embodiment includes:
S310, determining blocks in the page according to a Document Object Model (DOM) tree of the page.
S320, determining a block path of the page; wherein the block path is a path through at least one of the blocks.
In an alternative embodiment, S320 includes: determining the frequency of passing through the candidate paths in the page according to the user behavior data; and selecting a block path of the page from the candidate paths according to the frequency.
A candidate path is a path obtained by combining different blocks in the page's block structure. The frequency of each candidate path is determined from the user behavior data, and a preset number of candidate paths with the highest frequencies are taken as the block paths of the page to be analyzed, which can improve the efficiency and accuracy of determining the block to be optimized.
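A minimal Python sketch of this frequency-based selection is given below. It assumes that the frequency of a candidate path is the number of sessions whose block path equals it; the function name select_block_paths, the parameter top_n and the sample data are hypothetical.

```python
from collections import Counter

def select_block_paths(session_paths, top_n):
    """Keep the top_n most frequently observed candidate paths."""
    frequency = Counter(tuple(path) for path in session_paths)
    return [path for path, _ in frequency.most_common(top_n)]

# Hypothetical block paths observed in six sessions.
session_paths = [
    ("image1",),
    ("image1", "form1"),
    ("image1", "form1"),
    ("image1", "image2"),
    ("image1",),
    ("image1", "form1"),
]
print(select_block_paths(session_paths, top_n=2))
```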
In an alternative embodiment, S320 includes: and taking the candidate path passing through the conversion class block as the block path of the page.
A conversion-class block is a block associated with conversion on the page; for example, it may be a form block, a consultation block, an application download block, or the like. Taking the candidate paths that pass through a conversion-class block as the block paths to be analyzed can improve the efficiency and accuracy of determining the block to be optimized.
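This filter can be sketched in Python as follows. The set CONVERSION_BLOCK_TYPES, the function name and the mapping from block names to block types are hypothetical choices made for illustration.

```python
# Hypothetical set of block types treated as conversion-class blocks.
CONVERSION_BLOCK_TYPES = {"form", "consultation", "app_download"}

def paths_through_conversion(candidate_paths, block_types):
    """Keep candidate paths that pass through at least one conversion-class block.

    block_types: dict mapping block name -> block type.
    """
    return [
        path for path in candidate_paths
        if any(block_types[block] in CONVERSION_BLOCK_TYPES for block in path)
    ]

block_types = {"image1": "image", "image2": "image", "form1": "form"}
candidates = [("image1",), ("image1", "form1"), ("image1", "image2")]
print(paths_through_conversion(candidates, block_types))
```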
S330, determining the conversion contribution value of the block in the page according to the user behavior data of the page.
The conversion contribution value of a block represents the contribution of the block to conversion behavior on the page. The higher the conversion contribution value, the better the quality of the block; conversely, a block with a low conversion contribution value may be a block to be optimized.
Specifically, the user behavior data of the page may be mapped onto the blocks to obtain the user behavior characteristics of each block, for example, the frequency of reaching the block and the frequency of reaching the block path.
In an alternative embodiment, S330 includes: determining, for any block, the number of conversion block paths passing through the block and the number of non-conversion block paths passing through the block; and obtaining the conversion contribution value of the block according to the number of conversion block paths and the number of non-conversion block paths. A conversion block path is a block path on which a conversion behavior occurs; a path that includes a conversion-class block may be taken as a conversion block path, and the conversion-class block may be a communication block, an application download block, or the like. A non-conversion block path is a block path on which no conversion occurs. Specifically, the ratio of the number of conversion block paths to the number of non-conversion block paths of a block may be used as the conversion contribution value of the block. The conversion contribution value of the block serves as a feature that combines the user behavior data with the page block structure, so that the subsequently determined block to be optimized is not merely related to each of them separately but results from their close interaction, which improves the accuracy of the block to be optimized.
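The ratio described above can be sketched in Python. This is an illustration only: each block path is assumed to arrive already labeled with whether conversion occurred, the handling of a zero denominator is an assumption the source does not address, and the function name and sample data are hypothetical.

```python
def conversion_contribution(block_paths, block):
    """Ratio of conversion to non-conversion block paths passing through `block`.

    block_paths: list of (path, converted) pairs, where `path` is a tuple of
    block names and `converted` says whether conversion occurred on it.
    """
    converting = sum(1 for path, converted in block_paths
                     if block in path and converted)
    non_converting = sum(1 for path, converted in block_paths
                         if block in path and not converted)
    if non_converting == 0:
        # Assumption: the source does not define this case.
        return float("inf") if converting else 0.0
    return converting / non_converting

# Hypothetical labeled block paths.
block_paths = [
    (("image1", "form1"), True),
    (("image1",), False),
    (("image1", "image2"), False),
    (("image2", "form1"), True),
]
for block in ("image1", "image2"):
    print(block, conversion_contribution(block_paths, block))
```

In this sample, image1 lies on one conversion path and two non-conversion paths, and image2 lies on one of each, so image1 has the lower contribution value and would be the stronger candidate for optimization.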
S340, comparing the conversion contribution values of different blocks in the page, and determining a block to be optimized in the page according to the comparison result.
Specifically, a block with a high conversion contribution value may be reused and taken as a reference, and a block with a low conversion contribution value may be modified or replaced. Conversion contribution at block granularity generalizes better than conversion contribution at the conventional control granularity.
It should be noted that, in the embodiment of the present application, the block to be optimized may be selected by jointly applying the exit rate of the block path and the conversion contribution value of the block, so that both the user-loss information and the conversion information of the page are taken into account when optimizing the page, which further improves page quality.
The technical solution of the embodiment of the present application provides a new page optimization technique based on block granularity. By combining user behavior data with the structural characteristics of the page blocks, it addresses the key factors influencing user loss and conversion on a page, and improves the efficiency and accuracy of page optimization as well as the quality of the page.
Fig. 4 is a schematic structural diagram of a page data processing apparatus according to an embodiment of the present application. Referring to fig. 4, an embodiment of the present application discloses a page data processing apparatus 400, where the apparatus 400 includes:
the block determining module 401 is configured to determine a block in a page according to a Document Object Model (DOM) tree of the page;
a block path module 402, configured to determine a block path of the page; wherein the block path is a path through at least one of the blocks;
the block selection module 403 is configured to determine a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
Further, the block selection module 403 includes:
the exit rate unit is used for determining the exit rate of the block path in the page according to the user behavior data of the page;
and the first block selection unit is used for determining a block to be optimized in the page according to the exit rate of the block path.
Further, the exit rate unit specifically includes:
the number determining subunit is used for determining the number of times of exiting from the block path and the total number of times of passing through the block path according to the user behavior data of the page;
an exit rate subunit, configured to determine the exit rate of the block path according to the number of exits from the block path and the total number of times of passing through the block path.
Further, the first block selection unit specifically includes:
the block comparison subunit is used for comparing the exit rates of the block paths in at least two pages with the same block structure and determining the block path to be optimized according to the comparison result;
and the block selection subunit is used for selecting the block to be optimized from the block path to be optimized.
Further, the block selection module 403 further includes a control selection unit, where the control selection unit is specifically configured to:
determining the exit rates of at least two controls in the block to be optimized;
and selecting a control to be optimized from the at least two controls according to the exit rates of the at least two controls.
Further, the block selection module 403 includes:
the conversion contribution unit is used for determining a conversion contribution value of a block in the page according to the user behavior data of the page;
and the second block selection unit is used for comparing the conversion contribution values of different blocks in the page and determining a block to be optimized in the page according to the comparison result.
Further, the conversion contribution unit specifically includes:
a quantity subunit, configured to determine the number of conversion block paths passing through any block and the number of non-conversion block paths passing through the block;
and the conversion contribution subunit is used for obtaining a conversion contribution value of the block according to the number of conversion block paths and the number of non-conversion block paths.
Further, the block determining module 401 includes:
the sub-tree unit is used for determining a sub-DOM tree of the block according to the association between candidate block types and node tags and the nodes in the Document Object Model (DOM) tree of the page;
and the block determining unit is used for determining the block in the page according to the sub DOM tree of the block.
Further, the block path module 402 is specifically configured to:
determining the frequency of passing through the candidate paths in the page according to the user behavior data;
and selecting a block path of the page from the candidate paths according to the frequency.
Further, the block path module 402 is specifically configured to:
and taking the candidate path passing through the conversion class block as the block path of the page.
The technical solution of the embodiment of the present application provides a new page optimization technique based on block granularity. By combining user behavior data with the structural characteristics of the page blocks, it addresses the key factors influencing user loss and conversion on a page, and improves the efficiency and accuracy of page optimization as well as the quality of the page.
According to an embodiment of the present application, an electronic device and a readable storage medium are also provided.
Fig. 5 is a block diagram of an electronic device for the page data processing method according to an embodiment of the present application. Electronic devices are intended to represent various forms of digital computers, such as laptops, desktops, workstations, personal digital assistants, servers, blade servers, mainframes, and other appropriate computers. The electronic device may also represent various forms of mobile devices, such as personal digital assistants, cellular phones, smart phones, wearable devices, and other similar computing devices. The components shown herein, their connections and relationships, and their functions, are meant to be examples only, and are not meant to limit implementations of the present application that are described and/or claimed herein.
As shown in fig. 5, the electronic device includes: one or more processors 501, memory 502, and interfaces for connecting the various components, including high-speed interfaces and low-speed interfaces. The various components are interconnected using different buses and may be mounted on a common motherboard or in other manners as desired. The processor may process instructions for execution within the electronic device, including instructions stored in or on the memory to display graphical information of a GUI on an external input/output apparatus (such as a display device coupled to the interface). In other embodiments, multiple processors and/or multiple buses may be used, along with multiple memories and multiple types of memory, as desired. Also, multiple electronic devices may be connected, with each device providing portions of the necessary operations (e.g., as a server array, a group of blade servers, or a multi-processor system). In fig. 5, one processor 501 is taken as an example.
Memory 502 is a non-transitory computer readable storage medium as provided herein. The memory stores instructions executable by at least one processor to cause the at least one processor to perform the method for processing page data provided by the present application. The non-transitory computer readable storage medium of the present application stores computer instructions for causing a computer to perform the method of page data processing provided herein.
The memory 502, which is a non-transitory computer readable storage medium, may be used to store non-transitory software programs, non-transitory computer executable programs, and modules, such as the program instructions/modules corresponding to the page data processing method in the embodiments of the present application (e.g., the block determining module 401, the block path module 402, and the block selection module 403 shown in fig. 4). By running the non-transitory software programs, instructions, and modules stored in the memory 502, the processor 501 executes the various functional applications and data processing of the server, that is, it implements the page data processing method of the above method embodiments.
The memory 502 may include a program storage area and a data storage area, wherein the program storage area may store an operating system and an application program required for at least one function; the data storage area may store data created through use of the electronic device for page data processing, and the like. Further, the memory 502 may include high-speed random access memory, and may also include non-transitory memory, such as at least one magnetic disk storage device, flash memory device, or other non-transitory solid-state storage device. In some embodiments, the memory 502 may optionally include memory located remotely from the processor 501, and such remote memory may be connected to the electronic device for page data processing over a network. Examples of such networks include, but are not limited to, the internet, intranets, local area networks, mobile communication networks, and combinations thereof.
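As an illustration only, the first two modules named above (block determination and block path determination) could be sketched in Python as follows. This is a minimal sketch under stated assumptions, not the implementation of the application: the tag-to-block-type mapping `CANDIDATE_BLOCK_TAGS`, the class `BlockFinder`, and the function `select_block_paths` are hypothetical names, and treating a single tag as the root of a block is a simplification of the sub-DOM-tree determination described in the claims.

```python
from collections import Counter
from html.parser import HTMLParser

# Hypothetical mapping from node tag to candidate block type (an assumption).
CANDIDATE_BLOCK_TAGS = {"header": "banner", "section": "content", "form": "conversion"}


class BlockFinder(HTMLParser):
    """Records (block_type, tag_path) for each element whose tag is a candidate block root."""

    def __init__(self):
        super().__init__()
        self.stack = []   # tags of the currently open elements
        self.blocks = []  # (block type, "/"-joined tag path of the block root)

    def handle_starttag(self, tag, attrs):
        self.stack.append(tag)
        if tag in CANDIDATE_BLOCK_TAGS:
            self.blocks.append((CANDIDATE_BLOCK_TAGS[tag], "/".join(self.stack)))

    def handle_endtag(self, tag):
        # Pop back to the matching open tag; this tolerates unclosed children.
        if tag in self.stack:
            while self.stack.pop() != tag:
                pass


def select_block_paths(sessions, top_n=1):
    """Return the top_n most frequently traversed candidate paths (as tuples of block types)."""
    counts = Counter(tuple(session) for session in sessions)
    return [path for path, _ in counts.most_common(top_n)]


page = (
    "<html><body><header><h1>Title</h1></header>"
    "<section><p>Details</p></section>"
    "<form><button>Buy</button></form></body></html>"
)
finder = BlockFinder()
finder.feed(page)

# Each session lists the blocks one user passed through, in order (made-up data).
sessions = [
    ["banner", "content", "conversion"],
    ["banner", "content"],
    ["banner", "content", "conversion"],
]
frequent_paths = select_block_paths(sessions)
```

Here the frequency-based selection corresponds to choosing block paths from candidate paths by how often users traverse them; a real system would derive the sessions from logged user behavior data rather than a hand-written list.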
The electronic device of the page data processing method may further include: an input device 503 and an output device 504. The processor 501, the memory 502, the input device 503 and the output device 504 may be connected by a bus or other means, and fig. 5 illustrates the connection by a bus as an example.
The input device 503 may receive input numeric or character information and generate key signal inputs related to user settings and function control of the electronic device for page data processing; examples include a touch screen, a keypad, a mouse, a track pad, a touch pad, a pointing stick, one or more mouse buttons, a track ball, and a joystick. The output device 504 may include a display device, auxiliary lighting devices (e.g., LEDs), and haptic feedback devices (e.g., vibrating motors), among others. The display device may include, but is not limited to, a Liquid Crystal Display (LCD), a Light Emitting Diode (LED) display, and a plasma display. In some implementations, the display device can be a touch screen.
Various implementations of the systems and techniques described here can be realized in digital electronic circuitry, integrated circuitry, ASICs (application-specific integrated circuits), computer hardware, firmware, software, and/or combinations thereof. These various implementations may include implementation in one or more computer programs that are executable and/or interpretable on a programmable system including at least one programmable processor, which may be special or general purpose, coupled to receive data and instructions from, and to transmit data and instructions to, a storage system, at least one input device, and at least one output device.
These computer programs (also known as programs, software applications, or code) include machine instructions for a programmable processor, and may be implemented using high-level procedural and/or object-oriented programming languages, and/or assembly/machine languages. As used herein, the terms "machine-readable medium" and "computer-readable medium" refer to any computer program product, apparatus, and/or device (e.g., magnetic discs, optical disks, memory, Programmable Logic Devices (PLDs)) used to provide machine instructions and/or data to a programmable processor, including a machine-readable medium that receives machine instructions as a machine-readable signal. The term "machine-readable signal" refers to any signal used to provide machine instructions and/or data to a programmable processor.
To provide for interaction with a user, the systems and techniques described here can be implemented on a computer having: a display device (e.g., a CRT (cathode ray tube) or LCD (liquid crystal display) monitor) for displaying information to a user; and a keyboard and a pointing device (e.g., a mouse or a trackball) by which a user can provide input to the computer. Other kinds of devices may also be used to provide for interaction with a user; for example, feedback provided to the user can be any form of sensory feedback (e.g., visual feedback, auditory feedback, or tactile feedback); and input from the user may be received in any form, including acoustic, speech, or tactile input.
The systems and techniques described here can be implemented in a computing system that includes a back-end component (e.g., as a data server), or that includes a middleware component (e.g., an application server), or that includes a front-end component (e.g., a user computer having a graphical user interface or a web browser through which a user can interact with an implementation of the systems and techniques described here), or any combination of such back-end, middleware, or front-end components. The components of the system can be interconnected by any form or medium of digital data communication (e.g., a communication network). Examples of communication networks include: Local Area Networks (LANs), Wide Area Networks (WANs), and the Internet.
The computer system may include clients and servers. A client and server are generally remote from each other and typically interact through a communication network. The relationship of client and server arises by virtue of computer programs running on the respective computers and having a client-server relationship to each other.
The technical solution of the embodiments of the present application provides a new page optimization technique at block granularity. By combining user behavior data with the structural features of page blocks, it identifies the key factors that influence user loss and conversion on a page, which improves the efficiency and accuracy of page optimization and the quality of the page.
It should be understood that various forms of the flows shown above may be used, with steps reordered, added, or deleted. For example, the steps described in the present application may be executed in parallel, sequentially, or in different orders, and the present invention is not limited thereto as long as the desired results of the technical solutions disclosed in the present application can be achieved.
The above-described embodiments should not be construed as limiting the scope of the present application. It should be understood by those skilled in the art that various modifications, combinations, sub-combinations and substitutions may be made in accordance with design requirements and other factors. Any modification, equivalent replacement, and improvement made within the spirit and principle of the present application shall be included in the protection scope of the present application.

Claims (19)

1. A page data processing method comprises the following steps:
determining blocks in a page according to a Document Object Model (DOM) tree of the page;
determining a block path of the page; wherein the block path is a path through at least one of the blocks;
and determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
2. The method of claim 1, wherein the determining a block to be optimized in the page according to the user behavior data of the page and a block path of the page comprises:
determining the exit rate of a block path in the page according to the user behavior data of the page;
and determining a block to be optimized in the page according to the exit rate of the block path.
3. The method of claim 2, wherein the determining an exit rate of a block path in the page from the user behavior data for the page comprises:
determining the number of exits from the block path and the total number of times of passing through the block path according to the user behavior data of the page;
and determining the exit rate of the block path according to the number of exits from the block path and the total number of times of passing through the block path.
4. The method of claim 2, wherein the determining a block to be optimized in the page according to the exit rate of the block path comprises:
comparing the exit rates of block paths in at least two pages with the same block structure, and determining a block path to be optimized according to a comparison result;
and selecting the block to be optimized from the block path to be optimized.
5. The method of any of claims 2-4, after determining the block to be optimized in the page, further comprising:
determining the exit rates of at least two controls in the block to be optimized;
and selecting a control to be optimized from the at least two controls according to the exit rates of the at least two controls.
6. The method of claim 1, wherein the determining a block to be optimized in the page according to the user behavior data of the page and a block path of the page comprises:
determining a conversion contribution value of a block in the page according to the user behavior data of the page;
and comparing the conversion contribution values of different blocks in the page, and determining a block to be optimized in the page according to the comparison result.
7. The method of claim 6, wherein the determining a conversion contribution value of a block in the page according to the user behavior data of the page comprises:
determining the number of conversion block paths passing through any block and the number of non-conversion block paths passing through the block;
and obtaining the conversion contribution value of the block according to the conversion block path number and the non-conversion block path number.
8. The method of claim 1, wherein the determining a block in a page from a Document Object Model (DOM) tree of the page comprises:
determining a sub DOM tree of the block according to the candidate block type and the association relationship between the node tag and the node in the document object model DOM tree of the page;
and determining the blocks in the page according to the sub DOM tree of the blocks.
9. The method of claim 1, wherein the determining a block path of the page comprises:
determining the frequency of passing through the candidate paths in the page according to the user behavior data;
and selecting a block path of the page from the candidate paths according to the frequency.
10. The method of claim 1, wherein the determining a block path of the page comprises:
and taking the candidate path passing through the conversion class block as the block path of the page.
11. A page data processing apparatus comprising:
the block determining module is used for determining a block in a page according to a Document Object Model (DOM) tree of the page;
the block path module is used for determining a block path of the page; wherein the block path is a path through at least one of the blocks;
and the block selection module is used for determining a block to be optimized in the page according to the user behavior data of the page and the block path of the page.
12. The apparatus of claim 11, wherein the block selection module comprises:
the exit rate unit is used for determining the exit rate of the block path in the page according to the user behavior data of the page;
and the first block selection unit is used for determining a block to be optimized in the page according to the exit rate of the block path.
13. The apparatus of claim 12, wherein the exit rate unit specifically comprises:
the number determining subunit is used for determining the number of exits from the block path and the total number of times of passing through the block path according to the user behavior data of the page;
and the exit rate subunit is used for determining the exit rate of the block path according to the number of exits from the block path and the total number of times of passing through the block path.
14. The apparatus of claim 12, wherein the first block selection unit comprises:
the block comparison subunit is used for comparing the exit rates of the block paths in at least two pages with the same block structure and determining the block path to be optimized according to the comparison result;
and the block selection subunit is used for selecting the block to be optimized from the block path to be optimized.
15. The apparatus of claim 11, wherein the block selection module comprises:
the conversion contribution unit is used for determining a conversion contribution value of a block in the page according to the user behavior data of the page;
and the second block selection unit is used for comparing the conversion contribution values of different blocks in the page and determining a block to be optimized in the page according to the comparison result.
16. The apparatus of claim 15, wherein the conversion contribution unit specifically comprises:
the quantity subunit is used for determining the number of conversion block paths passing through any block and the number of non-conversion block paths passing through the block;
and the conversion contribution subunit is used for obtaining a conversion contribution value of the block according to the conversion block path number and the non-conversion block path number.
17. The apparatus of claim 11, wherein the block determining module comprises:
the sub-tree unit is used for determining a sub-DOM tree of the block according to the candidate block type and the association relationship between the node tag and the node in the Document Object Model (DOM) tree of the page;
and the block determining unit is used for determining the block in the page according to the sub DOM tree of the block.
18. An electronic device, comprising:
at least one processor; and
a memory communicatively coupled to the at least one processor; wherein
the memory stores instructions executable by the at least one processor to enable the at least one processor to perform the method of any one of claims 1-10.
19. A non-transitory computer readable storage medium having stored thereon computer instructions for causing a computer to perform the method of any one of claims 1-10.
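For readers who want a concrete picture of the quantities recited in claims 3 and 7, the following Python sketch computes an exit rate for a block path and a conversion contribution value for a block. It is a hedged illustration, not the claimed method: the claims only say that these values are determined "according to" the respective counts, so the plain ratios used here are an assumption, as are the `Visit` record, the function names, and the rule that a visit "exits from" a block path when it ends there without converting.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple


@dataclass(frozen=True)
class Visit:
    """One user visit to a page (hypothetical record layout)."""
    path: Tuple[str, ...]  # blocks the user passed through, in order
    converted: bool        # whether the visit ended in a conversion


def exit_rate(visits: Sequence[Visit], block_path: Tuple[str, ...]) -> float:
    """Assumed formula: exits from block_path / total traversals of block_path."""
    n = len(block_path)
    passed = [v for v in visits if v.path[:n] == block_path]
    if not passed:
        return 0.0
    # A visit counts as an exit if it stopped at the end of block_path unconverted.
    exits = sum(1 for v in passed if len(v.path) == n and not v.converted)
    return exits / len(passed)


def conversion_contribution(visits: Sequence[Visit], block: str) -> float:
    """Assumed formula: converting paths through block / all paths through block."""
    converting = sum(1 for v in visits if block in v.path and v.converted)
    non_converting = sum(1 for v in visits if block in v.path and not v.converted)
    total = converting + non_converting
    return converting / total if total else 0.0


# Made-up behavior data for a page with three blocks.
visits = [
    Visit(("banner", "content", "form"), True),
    Visit(("banner", "content"), False),
    Visit(("banner", "content", "form"), False),
    Visit(("banner",), False),
]
rate = exit_rate(visits, ("banner", "content"))        # 1 exit out of 3 traversals
contribution = conversion_contribution(visits, "form")  # 1 converting path out of 2
```

The comparison step that picks the block to be optimized is left out on purpose, because the claims do not state which direction of the comparison (for example, highest exit rate or lowest contribution value) triggers optimization.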
CN202010519083.0A 2020-06-09 2020-06-09 Page data processing method, device, equipment and storage medium Active CN111723318B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010519083.0A CN111723318B (en) 2020-06-09 2020-06-09 Page data processing method, device, equipment and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010519083.0A CN111723318B (en) 2020-06-09 2020-06-09 Page data processing method, device, equipment and storage medium

Publications (2)

Publication Number Publication Date
CN111723318A (en) 2020-09-29
CN111723318B (en) 2023-09-01

Family

ID=72566327

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010519083.0A Active CN111723318B (en) 2020-06-09 2020-06-09 Page data processing method, device, equipment and storage medium

Country Status (1)

Country Link
CN (1) CN111723318B (en)

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102184189A (en) * 2011-04-18 2011-09-14 北京理工大学 Webpage core block determining method based on DOM (Document Object Model) node text density
US20130227391A1 (en) * 2012-02-29 2013-08-29 Pantech Co., Ltd. Method and apparatus for displaying webpage
CN103970770A (en) * 2013-01-29 2014-08-06 腾讯科技(深圳)有限公司 Optimization processing method and device for web page and client
CN105677827A (en) * 2016-01-04 2016-06-15 百度在线网络技术(北京)有限公司 Method and device for obtaining form
CN107846426A (en) * 2016-09-18 2018-03-27 腾讯科技(深圳)有限公司 The tracking and device of user trajectory in page access
CN109242164A (en) * 2018-08-22 2019-01-18 中国平安人寿保险股份有限公司 Optimize method and device, the computer storage medium, electronic equipment in product path
CN110781372A (en) * 2019-10-28 2020-02-11 珠海格力电器股份有限公司 Method and device for optimizing website, computer equipment and storage medium
CN111078522A (en) * 2019-12-18 2020-04-28 中国建设银行股份有限公司 Method and device for identifying to-be-optimized area of page, electronic equipment and storage medium

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
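李维旺 (Li Weiwang): "基于网页重构的网络用户体验优化研究与实现" [Research and Implementation of Web User Experience Optimization Based on Web Page Reconstruction], pages 25 - 42 *
栗辉;唐萌;陈豪 (Li Hui; Tang Meng; Chen Hao): "基于用户行为分析的网站结构优化研究综述" [A Survey of Website Structure Optimization Based on User Behavior Analysis], 计算机科学 [Computer Science], no. 1, pages 395 - 397 *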

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113050936A (en) * 2021-04-20 2021-06-29 中国工商银行股份有限公司 Front-end data processing method and device based on integrated development environment
CN113050936B (en) * 2021-04-20 2024-02-27 中国工商银行股份有限公司 Front-end data processing method and device based on integrated development environment

Also Published As

Publication number Publication date
CN111723318B (en) 2023-09-01

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant