WO2023197910A1 - User behavior prediction method and related device thereof - Google Patents

User behavior prediction method and related device thereof Download PDF

Info

Publication number
WO2023197910A1
WO2023197910A1 PCT/CN2023/086192 CN2023086192W WO2023197910A1 WO 2023197910 A1 WO2023197910 A1 WO 2023197910A1 CN 2023086192 W CN2023086192 W CN 2023086192W WO 2023197910 A1 WO2023197910 A1 WO 2023197910A1
Authority
WO
WIPO (PCT)
Prior art keywords
item
feature
user
target
model
Prior art date
Application number
PCT/CN2023/086192
Other languages
French (fr)
Chinese (zh)
Inventor
刘卫文
唐睿明
张瑞
傅凌玥
林江浩
张伟楠
俞勇
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Publication of WO2023197910A1 publication Critical patent/WO2023197910A1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/044Recurrent networks, e.g. Hopfield networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • This application relates to the technical field of artificial intelligence (AI), and in particular to a user behavior prediction method and related equipment.
  • AI artificial intelligence
  • the arrangement of items on a certain page is often presented to the user in the form of multiple lists, that is, the page usually contains multiple lists, and each list contains multiple items.
  • the neural network model of AI technology can be used to determine the probability of the item being clicked by the user.
  • the neural network model provided by related technologies predicts the probability of a certain item being clicked by a user, it usually only considers the impact of the remaining items in the list where the item is located on the item. It can be seen that the factors considered by the relevant technology are relatively single, resulting in the probability that the item is clicked by the user finally obtained by the model, which is often less accurate. Therefore, it cannot accurately recommend items of interest to the user in the future.
  • the embodiments of this application provide a user behavior prediction method and related equipment, which can make the probability of items being clicked by the user obtained by the neural network model have a higher accuracy, which is conducive to subsequent accurate recommendation of items of interest to the user. project.
  • the first aspect of the embodiments of this application provides a user behavior prediction method, which method includes:
  • the first feature of the first item and the second feature of the second item can be obtained first, and the first The first feature of the item and the second feature of the second item are input into the target model.
  • the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item, that is, the positional relationship between the first item and the second item exists in the following two situations: (1) The first item and the second item may be items in the same list, the second item is located before the first item and the second item is adjacent to the first item. (2) The first item and the second item can be items in different lists. The list where the second item is located is located before the list where the first item is located. The second item can be adjacent to the first item or not. One item is adjacent.
  • the first feature of the first item and the second feature of the second item can be processed by the target model to obtain the first feature of the first item.
  • Second characteristic It is worth noting that the first characteristic of the first item can be the attribute information of the first item itself, then the second characteristic of the first item Characteristics are information obtained by fusion based on the attribute information of the first item (that is, the first feature of the first item). Since the acquisition process of the second feature of the second item is the same as the acquisition process of the second feature of the first item, The second feature of the second item is also information obtained by fusion based on the attribute information of the second item (the first feature of the second item).
  • the second feature of the first item can be processed through the target model to obtain the probability that the first item is clicked by the user.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and The second item is in a different list or in the same list on the target page, and the second item is before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item.
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item.
  • the second item can not only be the list where the first item is located,
  • the items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the method further includes: obtaining the first characteristic of the third item, the first item and the third item are located in different lists or the same list on the target page, and the third item is related to the first item. Neighbor; based on the first feature of the first item and the first feature of the third item, obtaining the third feature of the first item; based on the second feature of the first item, obtaining the probability that the first item is clicked by the user includes: based on the The second feature of the item and the third feature of the first item are used to obtain the probability that the first item is clicked by the user.
  • the first feature of the third item can also be input to the target model, where the first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item. , that is, the positional relationship between the first item and the third item exists in the following two situations: (1) The first item and the third item can be items in the same list, and the third item and the first item are adjacent. (2) The first item and the third item can be items in different lists, and the third item and the first item are adjacent. After the first feature of the third item is input into the target model, the first feature of the first item and the first feature of the third item can be processed by the target model, thereby obtaining the third feature of the first item.
  • the target model can calculate the second feature of the first item and the third feature of the first item, thereby obtaining the probability that the first item is clicked by the user.
  • the second characteristic of the first item can represent the impact of the second item on the first item, that is, when the user uses sequential browsing behavior and skipping behavior to browse to the first item, the user browses during these behaviors.
  • the third feature of the first project can represent the impact of the third project on the first project. That is, when the user uses the contrast behavior to browse to the first project, the user is in the process of performing this behavior.
  • the target model when predicting user behavior, not only introduces conventional sequential browsing behavior, but also introduces browsing behaviors such as jump behavior and comparison behavior, that is, That is to say, the target model will consider the impact of the items browsed by the user on the first item when the user uses these complex and diverse browsing behaviors to browse the first item, which can further improve the final result of the target model.
  • the accuracy of the probability that the first item is clicked by the user is not only introduces conventional sequential browsing behavior, but also introduces browsing behaviors such as jump behavior and comparison behavior, that is, That is to say, the target model will consider the impact of the items browsed by the user on the first item when the user uses these complex and diverse browsing behaviors to browse the first item, which can further improve the final result of the target model.
  • the accuracy of the probability that the first item is clicked by the user is a browsing behaviors such as jump behavior and comparison behavior
  • obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item.
  • the fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the first item
  • the fourth feature of the object and the fifth feature of the first item are subjected to a first fusion process to obtain the second feature of the first item.
  • the target model can map the first feature of the first item on the latent space to obtain the first feature of the first item.
  • the target model can also process the second feature of the second item based on the self-attention mechanism to obtain the fifth feature of the first item.
  • the target model can use the recurrent neural unit to process the fourth feature of the first item and the fifth feature of the first item, thereby accurately obtaining the first item's fourth feature. Second characteristic.
  • the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the target page, and the second item being The probability of the user clicking is mapped to obtain the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item; the sixth feature of the first item, the seventh feature of the first item, and The eighth feature of the first item is subjected to the second fusion process to obtain the fourth feature of the first item.
  • the user's request for the target page and the probability that the second item is clicked by the user can also be input to the target model.
  • the target model can respectively obtain the third feature of the first item.
  • the first feature, the user's request for the target page and the probability of the second item being clicked by the user are mapped on the latent space, and accordingly the sixth feature of the first item, the seventh feature of the first item and the eighth feature of the first item are obtained , and then splice the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item to obtain the fourth feature of the first item.
  • the target model analyzes the first item, it not only takes into account the influence of the attribute information of the first item itself, but also considers the influence of external factors such as the user's request for the target page and the probability of the second item being clicked by the user.
  • the impact produced by the target model further improves the accuracy of the probability of the first item being clicked by the user.
  • obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item.
  • the tenth feature performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
  • the target model can respectively map the first feature of the first item and the first feature of the third item on the latent space, and obtain the first item accordingly.
  • the sixth characteristic of the first item and the ninth characteristic of the first item Then, the target model can calculate the sixth feature of the first item and the ninth feature of the first item through the comparison function, and then perform a weighted sum based on the calculation results to obtain the tenth feature of the first item.
  • the target model can perform an exclusive OR operation on the sixth feature of the first item and the tenth feature of the first item to accurately obtain the third feature of the first item.
  • performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item, and obtaining the third feature of the first item includes: mapping the user's request for the target page , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item.
  • the target model when obtaining the third feature of the first item, can also map the user's request for the target page on the latent space to obtain the seventh feature of the first item. Then, the target model can map the third feature of the first item.
  • the sixth characteristic of one item, the seventh characteristic of the first item and the tenth characteristic of the first item are subjected to an exclusive OR operation to obtain the third characteristic of the first item. It can be seen that when the target model analyzes the first item, it not only takes into account the influence of the attribute information of the first item itself, but also considers the influence of external factors such as the user's request for the target page, thereby further improving the target model The final accuracy of the probability that the first item is clicked by the user.
  • the second characteristic of the second item is the preset value.
  • the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • the second aspect of the embodiment of the present application provides a method for constructing a directed acyclic graph.
  • the method includes: obtaining the eye movement data of the user browsing the target page; based on the eye movement data, determining the user's browsing behavior for multiple items. Each item is located in multiple lists on the target page; based on browsing behavior, multiple items are connected to obtain a directed acyclic graph.
  • the user's browsing behavior for multiple items in the target page can be determined based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (for example, sequential browsing behavior and skipping behavior ), often determines the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order between different lists), thereby connecting multiple items on the target page according to these browsing to obtain the target
  • the directed acyclic graph of the page can be used in subsequent predictions of user behavior on the target page. Since the directed acyclic graph involves users’ complex and diverse browsing behaviors, it is helpful to improve users’ understanding of the target page. Accuracy of behavioral predictions.
  • connecting multiple items to obtain a directed acyclic graph includes: connecting items in the same list that the user browsed in the first order, and The items in different lists browsed by the user in the second order are connected in the second order to obtain a directed acyclic graph.
  • the user's browsing behavior includes two major types of browsing behavior.
  • the first type of browsing behavior refers to the user browsing items in the same list, including the first type of sequential browsing behavior. Therefore, the user's browsing order in the same list can be called the first order, and the first order includes the first type of sequential browsing behavior. , the top-to-bottom and left-to-right order in which users browse all items in the same list.
  • the second type of browsing behavior refers to the user browsing items between different lists, including the second type of sequential browsing behavior and comparison behavior. Therefore, the user's browsing order between different lists can be called the second order, and the second order includes the second type.
  • the sequential browsing behavior the order in which the user browses several adjacent items in two adjacent lists
  • the comparison behavior the jump order in which the user browses two items in two non-adjacent lists. . Then, all items in the target page can be connected according to the first order and the second order, thereby obtaining a directed acyclic graph for the target page.
  • obtaining the eye movement data of the user browsing the target page : collecting the eye movement data of the user browsing the target page through an eye tracker.
  • the third aspect of the embodiment of the present application provides a model training method.
  • the method includes: obtaining the first feature of the first item and the second feature of the second item through the model to be trained, and the first item and the second item are located in the to-be-trained model. Process different lists on the page or the same list, and the second item is located before the first item; obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item through the model to be trained, Among them, the first feature of the first item is the attribute information of the first item, the second feature of the first item is the information obtained by fusion based on the attribute information of the first item, and the second feature of the second item is the information based on the second item.
  • the target loss is used to indicate the difference between the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user; based on the target loss, update the Train the parameters of the model until the model training conditions are met and the target model is obtained.
  • the target model obtained by the above method has the ability to predict user behavior on the page.
  • the first feature of the first item and the first feature of the second item can be input to the target model.
  • Second feature wherein the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the method further includes: obtaining the first feature of the third item through the model to be trained, the first item and the third item are located in different lists or the same list of the page to be processed, and the third item adjacent to the first item; through the model to be trained based on the first feature of the first item and the first feature of the third item, the third feature of the first item is obtained; through the model to be trained based on the second feature of the first item, Obtaining the probability that the first item is clicked by the user includes: using the to-be-trained model to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item.
  • obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item.
  • the fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are processed
  • the first fusion process is to obtain the second feature of the first item.
  • the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the page to be processed, and the second item being processed.
  • the probability of the user clicking is mapped to obtain the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item; the sixth feature of the first item, the seventh feature of the first item, and The eighth feature of the first item is subjected to the second fusion process to obtain the fourth feature of the first item.
  • obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item.
  • the tenth feature performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
  • performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item includes: mapping the user's request for the page to be processed , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item.
  • the second characteristic of the second item is a preset value.
  • the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • the fourth aspect of the embodiment of the present application provides a user behavior prediction device.
  • the device includes: a first acquisition module, configured to acquire the first feature of the first item and the second feature of the second item through the target model.
  • the first project and second project is located in a different list or the same list on the target page, and the second item is located before the first item;
  • the second acquisition module is used to obtain the second item based on the first feature of the first item and the second feature of the second item through the target model.
  • the second characteristic of an item wherein the first characteristic of the first item is the attribute information of the first item, the second characteristic of the first item is the information obtained by fusion based on the attribute information of the first item, and the second characteristic of the second item is
  • the second feature is the information obtained by fusion based on the attribute information of the second item (i.e., the first feature of the second item); the third acquisition module is used to obtain the first item based on the second feature of the first item through the target model. The probability of a user clicking.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and The second item is in a different list or in the same list on the target page, and the second item is before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item.
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item.
  • the second item can not only be the list where the first item is located,
  • the items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the device further includes: a fourth acquisition module, configured to acquire the first feature of the third item through the target model, where the first item and the third item are located in different lists or the same list of the target page. , and the third item is adjacent to the first item; the fifth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model; the third The acquisition module is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the target model.
  • a fourth acquisition module configured to acquire the first feature of the third item through the target model, where the first item and the third item are located in different lists or the same list of the target page. , and the third item is adjacent to the first item
  • the fifth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model
  • the third The acquisition module is configured to obtain the probability that the first item
  • the second acquisition module is configured to: map the first feature of the first item through the target model to obtain the fourth feature of the first item; map the third feature of the second item through the target model.
  • the two features are processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are first fused through the target model to obtain the first item's fifth feature.
  • the second acquisition module is configured to: use the target model to map the first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user, to obtain the third The sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item; through the target model, the sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item.
  • the second fusion process is performed to obtain the fourth feature of the first item.
  • the fifth acquisition module is used to map the first feature of the first item and the first feature of the third item through the target model to obtain the sixth feature of the first item and the first feature of the third item.
  • the ninth feature of the first item; the third fusion process is performed on the sixth feature of the first item and the ninth feature of the first item through the target model to obtain the tenth feature of the first item; the third feature of the first item is obtained through the target model.
  • the six features and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
  • the fifth acquisition module is used to: map the user's request for the target page through the target model to obtain the seventh feature of the first item; map the sixth feature of the first item through the target model Features, the seventh feature of the first item, and the tenth feature of the first item are subjected to a fourth fusion process to obtain the third feature of the first item.
  • the second characteristic of the second item is the preset value.
  • the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • the fifth aspect of the embodiment of the present application provides a device for constructing a directed acyclic graph.
  • the device includes: an acquisition module, used to obtain the eye movement data of the user browsing the target page; and a determination module, used to determine based on the eye movement data.
  • the user's browsing behavior for multiple items, multiple items are located in multiple lists on the target page; the connection module is used to connect multiple items based on the browsing behavior to obtain a directed acyclic graph.
  • the above device can determine the user's browsing behavior for multiple items in the target page based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (such as sequential browsing behavior and skipping behavior) often determine The order in which the user browses the items (for example, the order in which the user browses in the same list and the order in which the user browses between different lists) is used to connect multiple items of the target page according to these views, and a directed and undirected view of the target page is obtained. Ring graph, this directed acyclic graph can be used in the subsequent prediction of user behavior on the target page. Since this directed acyclic graph involves users' complex and diverse browsing behaviors, it is helpful to improve the accuracy of user behavior prediction on the target page. .
  • connection module is used to connect items in the same list that the user browses in the first order, and connect items in different lists that the user browses in the second order. Connect in the second order to obtain a directed acyclic graph.
  • the acquisition module is used to collect eye movement data of the user browsing the target page through an eye tracker.
  • the sixth aspect of the embodiment of the present application provides a schematic structural diagram of a model training device.
  • the device includes: a first acquisition module, configured to acquire the first feature of the first item and the second feature of the second item through the model to be trained.
  • Features, the first item and the second item are located in different lists or the same list on the page to be processed, and the second item is located before the first item;
  • the second acquisition module is used to use the model to be trained based on the first feature of the first item and the second feature of the second item, to obtain the second feature of the first item, where the first feature of the first item is the attribute information of the first item, and the second feature of the first item is the attribute information based on the first item
  • the information obtained by fusion, the second feature of the second item is the information obtained by fusion based on the attribute information of the second item (ie, the first feature of the second item);
  • the third acquisition module is used to use the model to be trained based on the first feature of the second item.
  • the second feature of an item is to obtain the probability that the first item is clicked by the user; the fourth acquisition module is used to obtain the target loss based on the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user. Used to indicate the difference between the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user; the update module is used to update the parameters of the model to be trained based on the target loss until the model training conditions are met and the target is obtained Model.
  • the target model trained by the above device has the ability to predict user behavior on the page.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists, so the factors considered by the target model are relatively comprehensive and can fit the user's needs in the target page. Based on the actual situation when browsing to the first item, the probability that the first item is clicked by the user finally obtained by the target model has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the device includes: a fifth acquisition module, configured to acquire the first feature of the third item through the model to be trained, and the first item and the third item are located in different lists or the same list of the page to be processed. , and the third item is adjacent to the first item; the sixth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the model to be trained; The third acquisition module is used to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the model to be trained.
  • the second acquisition module is used to: perform mapping processing on the first feature of the first item to obtain the fourth feature of the first item; perform self-attention-based processing on the second feature of the second item.
  • the force mechanism is processed to obtain the fifth characteristic of the first item; the fourth characteristic of the first item and the fifth characteristic of the first item are first fused to obtain the second characteristic of the first item.
  • the second acquisition module is used to map the first feature of the first item, the user's request for the page to be processed, and the probability that the second item is clicked by the user, and obtain the first item's first characteristic.
  • the sixth acquisition module is configured to map the first feature of the first item and the first feature of the third item to obtain the sixth feature of the first item and the first feature of the first item.
  • the ninth characteristic perform a third fusion process on the sixth characteristic of the first item and the ninth characteristic of the first item to obtain the tenth characteristic of the first item; perform the sixth characteristic of the first item and the tenth characteristic of the first item.
  • the sixth acquisition module is used to: map the user's request for the page to be processed to obtain the seventh feature of the first item; map the sixth feature of the first item, the first item's The seventh feature and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
  • the second characteristic of the second item is a preset value.
  • the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • the seventh aspect of the embodiment of the present application provides a user behavior prediction device, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code.
  • the user behavior prediction device executes as follows: The method described in the first aspect or any possible implementation manner of the first aspect.
  • the eighth aspect of the embodiment of the present application provides a device for constructing a directed acyclic graph, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code.
  • the directed acyclic graph is The ring graph construction device is as described in the second aspect or any possible implementation manner of the second aspect.
  • a ninth aspect of the embodiment of the present application provides a model training device, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code.
  • the model training device executes the third step The method described in any possible implementation manner of the aspect or the third aspect.
  • a tenth aspect of the embodiments of the present application provides a circuit system.
  • the circuit system includes a processing circuit configured to perform any of the possible implementations of the first aspect, the second aspect, Any possible implementation manner in the second aspect or the method described in any possible implementation manner in the third aspect or the third aspect.
  • An eleventh aspect of the embodiments of the present application provides a chip system.
  • the chip system includes a processor for calling a computer program or computer instructions stored in a memory, so that the processor executes the first aspect as described in the first aspect. Any one of the possible implementations in the second aspect, any one of the possible implementations of the second aspect, or the third aspect, the method described in any one of the possible implementations of the third aspect.
  • the processor is coupled to the memory through an interface.
  • the chip system further includes a memory, and computer programs or computer instructions are stored in the memory.
  • a twelfth aspect of the embodiments of the present application provides a computer storage medium.
  • the computer storage medium stores a computer program.
  • the program When the program is executed by a computer, the computer implements any one of the first aspect and the first aspect.
  • a thirteenth aspect of the embodiments of the present application provides a computer program product.
  • the computer program product stores instructions. When executed by a computer, the instructions make it possible for the computer to implement any one of the first aspect and the first aspect.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model.
  • the two items are in different lists or in the same list on the target page, and the second item is before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • Figure 1 is a structural schematic diagram of the main framework of artificial intelligence
  • Figure 2a is a schematic structural diagram of the user behavior prediction system provided by the embodiment of the present application.
  • Figure 2b is another structural schematic diagram of the user behavior prediction system provided by the embodiment of the present application.
  • Figure 2c is a schematic diagram of related equipment for data sequence processing provided by the embodiment of the present application.
  • Figure 3 is a schematic diagram of the architecture of the system 100 provided by the embodiment of the present application.
  • Figure 4 is a schematic flow chart of a directed acyclic graph construction method provided by an embodiment of the present application.
  • Figure 5 is a schematic diagram of the target page provided by the embodiment of the present application.
  • Figure 6 is a schematic diagram of an eye tracker provided by an embodiment of the present application.
  • Figure 7 is a schematic diagram of a directed acyclic graph provided by an embodiment of the present application.
  • Figure 8 is a schematic flow chart of the user behavior prediction method provided by the embodiment of the present application.
  • Figure 9 is a schematic structural diagram of the target model provided by the embodiment of the present application.
  • Figure 10 is a schematic flow chart of the model training method provided by the embodiment of the present application.
  • Figure 11 is a schematic structural diagram of a user behavior prediction device provided by an embodiment of the present application.
  • Figure 12 is a schematic structural diagram of a directed acyclic graph construction device provided by an embodiment of the present application.
  • Figure 13 is a schematic structural diagram of the model training device provided by the embodiment of the present application.
  • Figure 14 is a schematic structural diagram of an execution device provided by an embodiment of the present application.
  • Figure 15 is a schematic structural diagram of the training equipment provided by the embodiment of the present application.
  • Figure 16 is a schematic structural diagram of a chip provided by an embodiment of the present application.
  • the embodiments of this application provide a user behavior prediction method and related equipment, which can make the probability of items being clicked by the user obtained by the neural network model have a higher accuracy, which is conducive to subsequent accurate recommendation of items of interest to the user. project.
  • the arrangement of items on a certain page is often presented to the user in the form of multiple lists, that is, the page usually contains multiple lists, and each list contains multiple items.
  • the neural network model in the field of AI technology can be used to determine the probability of the item being clicked by the user. For example, on a page of an app mall, multiple horizontal lists and multiple vertical arrangements are displayed. These multiple horizontal lists and multiple vertical arrangements are staggered. Multiple applications in the horizontal list are arranged in a row, and in the vertical arrangement, Multiple applications are arranged in a row, so that the page can display the introduction information of various applications to the user in the form of a staggered list. In order to predict the user's clicking behavior on this page, the neural network model can be used to analyze each application one by one, thereby obtaining the probability that the user clicks on each application on the page.
  • the neural network model provided by related technologies predicts the probability of a certain item being clicked by a user, it usually only considers the impact of the remaining items in the list where the item is located on the item. It can be seen that the factors considered by the relevant technology are relatively single, resulting in the probability that the item is clicked by the user finally obtained by the model, which is often less accurate. Therefore, it cannot accurately recommend items of interest to the user in the future.
  • the user's browsing behavior on this page is often complicated. For example, when the user is browsing the items in the current list, he directly jumps to the items in another list (the other list and the current list are two non-adjacent lists). to browse.
  • Related technologies often fail to take into account the impact of multiple complex browsing behaviors, and will also reduce the accuracy of the probability of an item being clicked by the user finally obtained by the model.
  • models of related technologies analyze a certain project, they often only focus on the relevant information of the project itself.
  • the relevant information of the application includes the developer, type, size, etc. of the application.
  • the analysis does not take into account the impact of external factors such as users, which will also degrade the model. The accuracy of the final probability of the item being clicked by the user.
  • AI technology is a technical discipline that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence. AI technology obtains the best results by perceiving the environment, acquiring knowledge and using knowledge.
  • artificial intelligence technology is a branch of computer science that attempts to understand the nature of intelligence and produce a new intelligent machine that can respond in a similar way to human intelligence.
  • Using artificial intelligence for data processing is a common application method of artificial intelligence.
  • Figure 1 is a structural schematic diagram of the main framework of artificial intelligence.
  • the following is from the “intelligent information chain” (horizontal axis) and “IT value chain” (vertical axis)
  • the above artificial intelligence theme framework is elaborated on in two dimensions.
  • the "intelligent information chain” reflects a series of processes from data acquisition to processing. For example, it can be the general process of intelligent information perception, intelligent information representation and formation, intelligent reasoning, intelligent decision-making, intelligent execution and output. In this process, the data has gone through the condensation process of "data-information-knowledge-wisdom".
  • the "IT value chain” reflects the value that artificial intelligence brings to the information technology industry, from the underlying infrastructure of human intelligence and information (providing and processing technology implementation) to the systematic industrial ecological process.
  • Infrastructure provides computing power support for artificial intelligence systems, enables communication with the external world, and supports it through basic platforms.
  • computing power is provided by smart chips (hardware acceleration chips such as CPU, NPU, GPU, ASIC, FPGA, etc.);
  • the basic platform includes distributed computing framework and network and other related platform guarantees and support, which can include cloud storage and Computing, interconnection networks, etc.
  • sensors communicate with the outside world to obtain data, which are provided to smart chips in the distributed computing system provided by the basic platform for calculation.
  • Data from the upper layer of the infrastructure is used to represent data sources in the field of artificial intelligence.
  • the data involves graphics, images, voice, and text, as well as IoT data of traditional devices, including business data of existing systems and sensory data such as force, displacement, liquid level, temperature, and humidity.
  • Data processing usually includes data training, machine learning, deep learning, search, reasoning, decision-making and other methods.
  • machine learning and deep learning can perform symbolic and formal intelligent information modeling, extraction, preprocessing, training, etc. on data.
  • Reasoning refers to the process of simulating human intelligent reasoning in computers or intelligent systems, using formal information to perform machine thinking and problem solving based on reasoning control strategies. Typical functions are search and matching.
  • Decision-making refers to the process of decision-making after intelligent information is reasoned, and usually provides functions such as classification, sorting, and prediction.
  • some general capabilities can be formed based on the results of further data processing, such as algorithms or a general system, such as translation, text analysis, computer vision processing, speech recognition, and image processing. identification, etc.
  • Intelligent products and industry applications refer to the products and applications of artificial intelligence systems in various fields. They are the encapsulation of overall artificial intelligence solutions, productizing intelligent information decision-making and realizing practical applications. Its application fields mainly include: intelligent terminals, intelligent transportation, Smart healthcare, autonomous driving, smart cities, etc.
  • FIG 2a is a schematic structural diagram of a user behavior prediction system provided by an embodiment of the present application.
  • the user behavior prediction system includes user equipment and data processing equipment.
  • user equipment includes smart terminals such as mobile phones, personal computers, or information processing centers.
  • the user device is the initiator of user behavior prediction for the page. As the initiator of the user behavior prediction request, the user usually initiates the request through the user device.
  • the above-mentioned data processing equipment may be a cloud server, a network server, an application server, a management server, and other equipment or servers with data processing functions.
  • the data processing device receives the user behavior prediction request from the smart terminal for the page through the interactive interface, and then performs page processing through machine learning, deep learning, search, reasoning, decision-making and other methods through the memory for storing data and the processor for data processing.
  • the memory in the data processing device can be a general term, including local storage and a database that stores historical data.
  • the database can be on the data processing device or on other network servers.
  • the user device can receive instructions from the user. For example, the user device can obtain a page input/selected by the user, and then initiate a request to the data processing device, so that the data processing device can respond to the information obtained by the user device.
  • This page executes the user behavior prediction application to obtain the processing results for this page.
  • the user device can obtain a page input by the user, and then initiate a user behavior prediction request for the page to the data processing device, so that the data processing device processes the characteristics of each item in the page, thereby obtaining the processing result of the page. That is, the probability that each item on the page is clicked by the user.
  • the data processing device can execute the directed acyclic graph construction method and the user behavior prediction method of the embodiment of the present application.
  • Figure 2b is another schematic structural diagram of a user behavior prediction system provided by an embodiment of the present application.
  • the user equipment itself can execute the user behavior prediction application.
  • the user equipment can directly obtain input from the user and directly obtain input from the user equipment.
  • the hardware itself is used for processing. The specific process is similar to Figure 2a. Please refer to the above description and will not be repeated here.
  • the user device can receive instructions from the user. For example, the user device can obtain a page selected by the user on the user device, and then the user device itself can target the characteristics of each item in the page. Perform processing to obtain the processing result of the page, that is, the probability of each item on the page being clicked by the user.
  • the user equipment itself can execute the directed acyclic graph construction method and user behavior prediction method in the embodiment of the present application.
  • Figure 2c is a schematic diagram of related equipment for user behavior prediction processing provided by the embodiment of the present application.
  • the user equipment in Figure 2a and Figure 2b can be the local device 301 or the local device 302 in Figure 2c
  • the data processing device in Figure 2a can be the execution device 210 in Figure 2c
  • the data storage system 250 can To store the data to be processed by the execution device 210, the data storage system 250 can be integrated on the execution device 210, or can be set up on the cloud or other network servers.
  • the processors in Figure 2a and Figure 2b can perform data training/machine learning/deep learning through neural network models or other models (for example, models based on support vector machines), and use the data to ultimately train or learn the model to execute on the page User behavior prediction application to obtain corresponding processing results.
  • neural network models or other models for example, models based on support vector machines
  • Figure 3 is a schematic diagram of the architecture of the system 100 provided by the embodiment of the present application.
  • the execution device 110 is configured with an input/output (I/O) interface 112 for data interaction with external devices.
  • the user Data can be input to the I/O interface 112 through the client device 140.
  • the input data may include: various to-be-scheduled tasks, callable resources, and other parameters.
  • the execution device 110 When the execution device 110 preprocesses the input data, or when the calculation module 111 of the execution device 110 performs calculation and other related processing (such as implementing the function of the neural network in this application), the execution device 110 can call the data storage system 150
  • the data, codes, etc. in the system can be used for corresponding processing, and the data, instructions, etc. obtained by corresponding processing can also be stored in the data storage system 150 .
  • the I/O interface 112 returns the processing results to the client device 140, thereby providing them to the user.
  • the training device 120 can generate corresponding target models/rules based on different training data for different goals or different tasks, and the corresponding target models/rules can be used to achieve the above goals or complete the above tasks. , thereby providing users with the desired results.
  • the training data may be stored in the database 130 and come from training samples collected by the data collection device 160 .
  • the user can manually enter the input data, and the manual input can be operated through the interface provided by the I/O interface 112 .
  • the client device 140 can automatically send input data to the I/O interface 112. If requiring the client device 140 to automatically send input data requires the user's authorization, the user can set corresponding permissions in the client device 140.
  • the user can view the results output by the execution device 110 on the client device 140, and the specific presentation form may be display, sound, action, etc.
  • the client device 140 can also be used as a data collection end to collect the input data of the input I/O interface 112 and the output results of the output I/O interface 112 as new sample data, and store them in the database 130 .
  • the I/O interface 112 directly uses the input data input to the I/O interface 112 and the output result of the output I/O interface 112 as a new sample as shown in the figure.
  • the data is stored in database 130.
  • Figure 3 is only a schematic diagram of a system architecture provided by an embodiment of the present application.
  • the positional relationship between the devices, devices, modules, etc. shown in the figure does not constitute any limitation.
  • the data The storage system 150 is an external memory relative to the execution device 110. In other cases, the data storage system 150 can also be placed in the execution device 110.
  • the neural network can be trained according to the training device 120.
  • An embodiment of the present application also provides a chip, which includes a neural network processor NPU.
  • the chip can be disposed in the execution device 110 as shown in FIG. 3 to complete the calculation work of the calculation module 111.
  • the chip can also be installed in the training device 120 as shown in Figure 3 to complete the training work of the training device 120 and output the target model/rules.
  • Neural network processor NPU is mounted on the main central processing unit (CPU) (host CPU) as a co-processor, and the main CPU allocates tasks.
  • the core part of the NPU is the arithmetic circuit.
  • the controller controls the arithmetic circuit to extract the data in the memory (weight memory or input memory) and perform operations.
  • the computing circuit includes multiple processing units (PE).
  • the arithmetic circuit is a two-dimensional systolic array.
  • the arithmetic circuit may also be a one-dimensional systolic array or other electronic circuit capable of performing mathematical operations such as multiplication and addition.
  • the arithmetic circuit is a general-purpose matrix processor.
  • the arithmetic circuit fetches the corresponding data of matrix B from the weight memory and caches it on each PE in the arithmetic circuit.
  • the operation circuit takes matrix A data and matrix B from the input memory to perform matrix operations, and the partial result or final result of the obtained matrix is stored in the accumulator (accumulator).
  • the vector calculation unit can further process the output of the arithmetic circuit, such as vector multiplication, vector addition, exponential operation, etc. Numerical operations, size comparison, etc.
  • the vector computing unit can be used for network calculations in non-convolutional/non-FC layers in neural networks, such as pooling, batch normalization, local response normalization, etc.
  • the vector computation unit can store the processed output vectors into a unified buffer.
  • the vector calculation unit may apply a nonlinear function to the output of the arithmetic circuit, such as a vector of accumulated values, to generate activation values.
  • the vector computation unit generates normalized values, merged values, or both.
  • the processed output vector can be used as an activation input to an arithmetic circuit, such as for use in a subsequent layer in a neural network.
  • Unified memory is used to store input data and output data.
  • the weight data directly transfers the input data in the external memory to the input memory and/or the unified memory through the storage unit access controller (direct memory access controller, DMAC), stores the weight data in the external memory into the weight memory, and transfers the weight data to the unified memory.
  • DMAC direct memory access controller
  • the bus interface unit (BIU) is used to realize the interaction between the main CPU, DMAC and instruction memory through the bus.
  • the instruction fetch buffer connected to the controller is used to store instructions used by the controller
  • the controller is used to call instructions cached in the memory to control the working process of the computing accelerator.
  • the unified memory, input memory, weight memory and instruction memory are all on-chip memories, and the external memory is the memory outside the NPU.
  • the external memory can be double data rate synchronous dynamic random access memory (double data). rate synchronous dynamic random access memory (DDR SDRAM), high bandwidth memory (high bandwidth memory (HBM)) or other readable and writable memory.
  • DDR SDRAM double data rate synchronous dynamic random access memory
  • HBM high bandwidth memory
  • the neural network can be composed of neural units.
  • the neural unit can refer to an arithmetic unit that takes xs and intercept 1 as input.
  • the output of the arithmetic unit can be:
  • s 1, 2,...n, n is a natural number greater than 1
  • Ws is the weight of xs
  • b is the bias of the neural unit.
  • f is the activation function of the neural unit, which is used to introduce nonlinear characteristics into the neural network to convert the input signal in the neural unit into an output signal. The output signal of this activation function can be used as the input of the next convolutional layer.
  • the activation function can be a sigmoid function.
  • a neural network is a network formed by connecting many of the above-mentioned single neural units together, that is, the output of one neural unit can be the input of another neural unit.
  • the input of each neural unit can be connected to the local receptive field of the previous layer to extract the features of the local receptive field.
  • the local receptive field can be an area composed of several neural units.
  • W is a weight vector, and each value in the vector represents the weight value of a neuron in the neural network of this layer.
  • This vector W determines the spatial transformation from the input space to the output space described above, that is, the weight W of each layer controls how to transform the space.
  • the purpose of training a neural network is to finally obtain the weight matrix of all layers of the trained neural network (a weight matrix formed by the vector W of many layers). Therefore, the training process of neural network is essentially to learn how to control spatial transformation, and more specifically, to learn the weight matrix.
  • weight vector (of course, there is usually an initialization process before the first update, that is, pre-configuring parameters for each layer in the neural network). For example, if the predicted value of the network is high, adjust the weight vector to make it predict lower Some, constant adjustments are made until the neural network can predict the truly desired target value. Therefore, it is necessary to define in advance "how to compare the difference between the predicted value and the target value". This is the loss function (loss function) or objective function (objective function), which is used to measure the difference between the predicted value and the target value. Important equations. Among them, taking the loss function as an example, the higher the output value (loss) of the loss function, the greater the difference. Then the training of the neural network becomes a process of reducing this loss as much as possible.
  • the neural network can use the error back propagation (BP) algorithm to modify the size of the parameters in the initial neural network model during the training process, so that the reconstruction error loss of the neural network model becomes smaller and smaller. Specifically, forward propagation of the input signal until the output will produce an error loss, and the parameters in the initial neural network model are updated by backpropagating the error loss information, so that the error loss converges.
  • the backpropagation algorithm is a backpropagation movement dominated by error loss, aiming to obtain the optimal parameters of the neural network model, such as the weight matrix.
  • the model training method provided by the embodiment of this application involves the processing of data sequences, and can be specifically applied to data training, machine learning, deep learning and other methods.
  • training data for example, the first item of the page to be processed in this application is (features, etc.) to perform symbolic and formalized intelligent information modeling, extraction, preprocessing, training, etc., and finally obtain a trained neural network (such as the target model in this application); and, provided by the embodiments of this application
  • the user behavior prediction method can use the above-mentioned trained neural network to input input data (for example, the first feature of the first item of the target page in this application, etc.) into the trained neural network to obtain output data.
  • model training method and the user behavior prediction method provided in the embodiments of this application are inventions based on the same concept, and can also be understood as two parts of a system, or two stages of an overall process: such as Model training phase and model application phase.
  • FIG. 4 is a schematic flow chart of a directed acyclic graph construction method provided by an embodiment of the present application. As shown in Figure 4, the method includes:
  • the target page when it is necessary to predict user behavior on the target page, the target page can be obtained first.
  • the target page contains multiple lists.
  • the list contains multiple items, and the multiple items are arranged according to a certain Order Arrange (for example, if multiple items are arranged in a row, the list is a horizontal list, if multiple items are arranged in a column, the list is a vertical list).
  • Figure 5 Figure 5 is a schematic diagram of a target page provided by an embodiment of the present application).
  • the target page is a display page of an application mall. This page can display information about multiple applications to the user, so that the user can Browse and download the applications you need on this page.
  • This page contains vertical list B 1 , vertical list B 3 , vertical list B 5 , horizontal list B 2 and horizontal list B 4 .
  • These three vertical lists and two horizontal lists are staggered, that is, according to the vertical list B 1 and the horizontal list B 2, vertical list B 3 , horizontal list B 4 , and vertical list B 5 are arranged in this order.
  • horizontal lists B 2 and B 4 contain 5 applications
  • vertical lists B 1 and B 3 contain 3 applications. In this way, the page can display information about 19 applications to users.
  • the eye movement data of users browsing the target page can be obtained through the following methods:
  • a user-oriented eye tracker can be deployed near the user equipment used to display the target page, and the eye tracker can be connected to the user equipment. Electrical connection.
  • auxiliary tools can also be deployed. This auxiliary tool is used to stabilize the user's head so that the eye tracker can accurately track the user's line of sight.
  • the distance between the user and the screen of the user equipment for example, the distance can be set to 30-40cm, etc.
  • the tilt angle of the user equipment for example, the tilt angle can be set to 65°-70°, etc.
  • the eye tracker can track and record the user's gaze position and gaze movement on the target page, generate eye movement data of the user browsing the target page, and send it to the user device . In this way, the user device successfully obtains the eye movement data of the user browsing the target page.
  • the user equipment here can be the user equipment in the system shown in Figure 2a or Figure 2b. Then, after obtaining the eye movement data of the user browsing the target page, the user equipment can analyze and construct the eye movement data on its own. For the directed acyclic graph of the target page, the eye movement data can also be sent to the data processing device in the system shown in Figure 2a or Figure 2b, so that the data processing equipment can analyze the eye movement data and build an effective target page. Towards acyclic graph, we will not go into details later.
  • the eye movement data can be analyzed to obtain the user's browsing behavior of multiple items in the target page.
  • the following analysis can be performed based on the eye movement data:
  • item A is before item B and item A is adjacent to item B
  • item A and item B can be two items located in two adjacent lists, that is, t A ⁇ t B , for example, item A is the last item of the first list on the target page, item B is the first item of the second list on the target page, etc.
  • item A is the first item of the first list on the target page
  • item B is the first list on the target page. the 2nd item and so on)
  • item B is before item C and item B is adjacent to item C.
  • the first type of sequential browsing behavior refers to any list on the target page. If the list is a horizontal list, all items in the list will be browsed in order from left to right. If the list is vertical list, browse all items in the list in order from top to bottom. Still using the example shown in Figure 5, for list B 2 , the first type of sequential browsing behavior is: in the order of i 2,1 ⁇ i 2,2 ⁇ i 2,3 ⁇ i 2,4 ⁇ i 2,5 , Let's browse the five items i 2,1 , i 2,2 , i 2,3 , i 2,4 and i 2,5 in list B 2 . For list B 1 , the first type of sequential browsing behavior is: browsing i 1,1 , i 1,2 and i 1 in list B 1 in the order of i 1,1 ⁇ i 1,2 ⁇ i 1,3 3 these 3 items.
  • the second type of sequential browsing behavior means that for two adjacent lists on the target page, the adjacent items in the two lists are browsed in the order of front and back.
  • the former list is a vertical list and the latter list is a horizontal list
  • the adjacent items in the two lists include the last item in the vertical list and all the items in the horizontal list, that is, horizontal All items of a list can be considered items adjacent to the last item of the vertical list.
  • the former list is a horizontal list and the latter list is a vertical list
  • the adjacent items in the two lists include the first item of the vertical list and all the items of the horizontal list, that is, all the items of the horizontal list. Both can be considered as items adjacent to the first item in the vertical list.
  • the second type of sequential browsing behavior is: i 1,3 ⁇ i 2,1 , i 1,3 ⁇ i 2,2 , i 1,3 ⁇ i 2,3 , i 1,3 ⁇ i 2,4 , i 1,3 ⁇ i 2,5 in the order to browse i 1,3 , i 2,1 , i 2 in lists B 1 and B 2 , 2 , i 2,3 , i 2,4 and i 2,5 are six adjacent items.
  • the second type of sequential browsing behavior is: i 2,1 ⁇ i 3,1 , i 2,2 ⁇ i 3,1 , i 2,3 ⁇ i 3,1 , i 2, 4 ⁇ i 3,1 , i 2,5 ⁇ i 3,1 in the order to browse i 2,1 , i 2,2 , i 2,3 , i 2,4 , i 2 in lists B 2 and B 3 ,5 and i 3,1 are six adjacent items.
  • the browsing method with a list skip length of 2 accounts for the largest proportion, indicating that in addition to sequential browsing behavior, users also often send behaviors of skipping an entire list and browsing the next list directly. This browsing behavior can This is called block skip. It is worth noting that if the target page is a page with multiple horizontal lists (also called horizontal blocks) and multiple vertical lists (also called vertical blocks) staggered (i.e. F-type page), the skipping behavior with a list skip length of 2 includes two major types of behavior.
  • the first type of skipping behavior refers to jumping from a horizontal list to a horizontal list
  • the second type of skipping behavior refers to jumping from a vertical list to a vertical list, where , almost all skipping behaviors are from vertical lists to vertical lists, accounting for 94.5%, while jumping from horizontal lists to horizontal lists only account for 5.5%, indicating that users are more inclined to jump from vertical lists to vertical lists. .
  • the user's skipping behavior can be summarized as: in two non-adjacent lists (these two lists are separated by one list), the user jumps from the last item of the previous list to the next one. Browsing continues with the first item in the list, and the browsing sequence between these two items can be called a jump sequence. Still using the example shown in Figure 5, when the user browses i 1,3 , he directly skips B 2 and browses i 3,1 .
  • the user's browsing behavior for multiple items in the target page can be determined, including sequential browsing behavior, skipping behavior, comparison behavior, etc.
  • multiple items in the target page can be connected to obtain a directed acyclic graph for the target page.
  • these browsing behaviors include two major categories of browsing behaviors.
  • the first type of browsing behavior refers to the user browsing items in the same list, including the aforementioned first type of sequential browsing behavior. Therefore, the user's browsing order in the same list can be called the first order, and the first order includes the first type of sequential browsing.
  • Behavior the order in which the user browses all items in the same list, from top to bottom and from left to right.
  • the second type of browsing behavior refers to the user browsing items between different lists, including the aforementioned second type of sequential browsing behavior and comparison behavior.
  • the user's browsing order between different lists can be called the second order, and the second order includes
  • the second type of sequential browsing behavior the user browses several adjacent items in two adjacent lists in the order in which they browse, and in contrast behavior, the user browses two items in two non-adjacent lists according to the jump. Turn order.
  • the directed acyclic graph for the target page can be obtained in the following way:
  • Figure 7 is a schematic diagram of a directed acyclic graph provided by the embodiment of the present application, and Figure 7 is drawn based on Figure 5
  • list B 1 it can be calculated according to i 1
  • the order 1 ⁇ i 1,2 ⁇ i 1,3 is used to connect the three items i 1,1 , i 1,2 and i 1,3 in list B 1 .
  • list B 2 i 2,1 , i 2,2 , in list B 2 can be connected in the order of i 2,1 ⁇ i 2,2 ⁇ i 2,3 ⁇ i 2,4 ⁇ i 2,5
  • There are five items: i 2,3 , i 2,4 and i 2,5 There are five items: i 2,3 , i 2,4 and i 2,5 .
  • lists B 3 , B 4 and B 5 which will not be repeated here. In this way, the internal connections of these five lists in the target page are completed.
  • i 1,3 can be connected to i 3,1 in the order of i 1,3 ⁇ i 3,1 , and the same is true for lists B 3 and B 5 , this No further details will be given. In this way, a directed acyclic graph for the target page can be obtained.
  • the user's interest in the target page can be determined based on the eye movement data generated when the user browses the target page.
  • the browsing behavior of multiple items in the list then these browsing behaviors (for example, sequential browsing behavior and skipping behavior) often determine the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order in different items). Browsing order between lists), thereby connecting multiple items of the target page according to these browses, and obtaining a directed acyclic graph for the target page.
  • This directed acyclic graph can be used in subsequent user behavior predictions for the target page. Since this directed acyclic graph involves users' complex and diverse browsing behaviors, it is helpful to improve the accuracy of predicting user behavior on the target page.
  • Figure 8 is a schematic flow chart of a user behavior prediction method provided by an embodiment of the present application. As shown in Figure 8, the method includes:
  • the first characteristic of the first item and the second characteristic of the second item through the target model.
  • the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item.
  • the first characteristic of the project refers to the attribute information of the project itself.
  • the first characteristic of the application may include the developer of the application, the size of the application , the type of the application, the icon of the application, etc.
  • the first feature of the product may include the price of the product, the type of the product, the color of the product etc.
  • the target page contains multiple items, multiple rounds of operations can be performed on the target page.
  • One round operates on one item in the target page (that is, steps 801 to 801 are performed once in each round).
  • Step 805 that is, steps 801 to 805 will be executed for each item. Since one round of operation can obtain the probability of an item being clicked by the user, after completing all rounds, the probability of all items in the target page being clicked can be obtained. The probability of a user clicking. Based on this, this embodiment makes a schematic introduction using one of the items in the target page, and calls this item the first item.
  • the first feature of the first item and the second feature of the second item can be input to the target model (trained neural network model).
  • the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item, that is, the positional relationship between the first item and the second item exists in the following two situations: (1) The first item and the second item may be items in the same list, the second item is located before the first item and the second item is adjacent to the first item. (2) The first item and the second item can be items in different lists. The list where the second item is located is located before the list where the first item is located. The second item can be adjacent to the first item or not. One item is adjacent.
  • any one of these items can be regarded as a node in the directed acyclic graph, and there is a one-way connection between the nodes in the directed acyclic graph. connection relationship.
  • the directed acyclic graph has parent nodes and child nodes, and the connection direction between the parent node and the child node is from the parent node to the child node.
  • the aforementioned first item can be regarded as a child node in the directed acyclic graph for the target page, and the second item is all the parent nodes of the child node. Still as shown in Figure 7, when the first When the item is i 1,2 , the second item is i 1,1 .
  • the second item is i 1,3 .
  • the second item is i 2,1 and i 1,3 .
  • the second item is i 2,1 , i 2,2 , i 2,3 , i 2,4 , i 2,5 , i 1,3 and so on.
  • the acquisition process of the second feature of the second item may refer to the subsequent acquisition process of the second feature of the first item, and will not be described again here.
  • the first item is the first item in the target page (for example, i 1,1 in the example shown in Figure 7), then There is no second item.
  • the second characteristic of the second item can be understood as a preset value (the size of the preset value can be set according to actual needs, and there is no limit here). Therefore, the first item can also be The first characteristic of the project and this preset value are input into the target model.
  • the first feature of the first item and the second feature of the second item can be processed by the target model to obtain the first feature of the first item.
  • Second characteristic Second characteristic.
  • the second feature of the first item can be obtained in the following way:
  • the user's request for the target page can include keywords entered by the user on the target page to search for certain items, etc., second
  • the probability that an item is clicked by the user can be understood as the probability that the item targeted in the previous round (that is, the item before the first item) is clicked by the user.
  • the user's request for the target page and the probability that the second item is clicked by the user can also be input into the target model.
  • the target model maps the first feature of the first item, the user's request for the target page, and the probability of the second item being clicked by the user on the latent space (i.e., the aforementioned mapping process), and accordingly obtains the first item's
  • the sixth characteristic, the seventh characteristic of the first item and the eighth characteristic of the first item are then spliced together with the sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item (that is, the aforementioned second fusion process) to obtain the fourth feature of the first item.
  • the target model can also process the second feature of the second item based on the self-attention mechanism to obtain the fifth feature of the first item.
  • Figure 9 is a schematic structural diagram of the target model provided by the embodiment of the present application
  • the first item is i t,j
  • the set of second items is P t,j
  • the first feature of the first item is I
  • the second feature of the k-th second item is h k
  • the request is Q
  • the probability of the second item being clicked by the user is C.
  • the target The model can map the first feature I of the first item on the latent space to obtain the sixth feature V I of the first item, and map the user's request Q for the target page on the latent space to obtain the seventh feature of the first item.
  • V Q map the probability C of the second item clicked by the user on the latent space, and obtain the eighth feature V C of the first item.
  • the target model can splice the sixth feature V I of the first item, the seventh feature V Q of the first item, and the eighth feature V C of the first item to obtain the fourth feature x t,j of the first item. .
  • the target model can also use the self-attention mechanism to calculate the second features h 1 ,..., h n of the second item, and obtain the fifth feature e t,j of the first item.
  • the calculation based on the self-attention mechanism is as shown in the following formula:
  • the target model can use recurrent neural units (GRUcell) processes the fourth feature of the first item and the fifth feature of the first item (ie, the aforementioned first fusion process) to obtain the second feature of the first item.
  • GRUcell recurrent neural units
  • the fourth feature x t,j of the first item and the fifth feature e t,j of the first item are obtained.
  • the target model can input these two features into the recurrent neural unit for processing.
  • the second characteristic of the first item can represent the impact of the second item on the first item (it can also be understood as the relationship between the second item and the first item), that is, the user's sequential browsing behavior and When skipping behaviors and browsing to the first item, the impact of the items browsed by the user on the first item while performing these behaviors.
  • the user's request for the target page and the probability of the second item being clicked by the user may not be input to the target model, so that the target model directly obtains the first feature of the first item.
  • the fourth feature of the first feature is obtained.
  • the first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item.
  • the first feature of the third item can also be input to the target model, wherein the first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item, that is, the third item is adjacent to the first item.
  • the first item and the third item can be items in the same list, and the third item and the first item are adjacent.
  • the first item and the third item can be items in different lists, and the third item and the first item are adjacent.
  • the third item is i 1,1 .
  • the third item is i 1,2 and i 2,1 .
  • the second item is i 1,3 , i 2,2 , i 3,1 and so on.
  • the first feature of the first item and the first feature of the third item can be processed by the target model, thereby obtaining the third feature of the first item.
  • the third feature of the first item can be obtained in the following way:
  • the target model maps the first feature of the first item, the user's request for the target page, and the first feature of the third item on the latent space respectively (i.e., the aforementioned mapping process), and accordingly obtains the first feature of the first item.
  • the set of third items be N t,j
  • the f-th third item in the set is The first characteristic of the project is If .
  • the target model After inputting the first feature I 1 ,..., I m of the third item into the target model, the target model can map the first feature I of the first item on the latent space to obtain the sixth feature V I of the first item. , map the user's request Q for the target page on the latent space, and obtain the seventh feature V Q of the first item, and map the first feature I 1 ,..., I m of the third item on the latent space, and obtain The ninth characteristic V 1 , ..., V m of the first item.
  • the target model can perform the comparison function on the sixth feature of the first item and the ninth feature of the first item. Calculate, and then perform weighted summation based on the calculation results (the calculation of the comparison function and the weighted summation calculation are the aforementioned third fusion process) to obtain the tenth feature of the first item.
  • comparison function g can be one of the following three functions: inner product function neural network function kernel function
  • the target model can perform an exclusive OR operation (i.e., the aforementioned fourth fusion process) on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item to obtain the first item
  • an exclusive OR operation i.e., the aforementioned fourth fusion process
  • the sixth feature V I of the first item, the seventh feature V Q of the first item, and the tenth feature of the first item can be
  • the features d t,j are subjected to an exclusive OR operation to obtain the third feature cp t,j of the first item.
  • the third characteristic of the first item can represent the impact of the third item on the first item (it can also be understood as the relationship between the third item and the first item), that is, the user uses the comparison behavior to browse to The first item refers to the impact of the items browsed by the user on the first item during the behavior.
  • the target model may only perform the third feature on the sixth feature of the first item and the tenth feature of the first item. Four fusion processes are performed to obtain the third feature of the first item.
  • the target model can calculate the second feature of the first item and the third feature of the first item, thereby obtaining the probability that the first item is clicked by the user. .
  • the two features can be calculated to obtain the first item that is used by the user. Probability of click C t,j .
  • the calculation process is as shown in the following formula:
  • the same operations as those performed on the first item can also be performed. Therefore, the probability of all items in the target page being clicked by the user can be obtained, thereby completing the analysis of the target page. User behavior prediction.
  • steps 803 and 804 may not be performed, so that the target model directly calculates the second feature of the first item to obtain the probability that the first item is clicked by the user.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model.
  • the two items are in different lists or in the same list on the target page, and the second item is before the first item.
  • the target model can obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item, and then based on the second feature of the first item Feature, obtain the probability that the first item is clicked by the user.
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the target model provided by the embodiments of this application not only introduces conventional sequential browsing behaviors, but also introduces browsing behaviors such as jump behaviors and comparison behaviors. In other words, the target model will consider the user's use of these complex and diverse browsing behaviors.
  • the impact of the item browsed by the user on the first item during the behavior can further improve the accuracy of the probability of the first item being clicked by the user finally obtained by the target model.
  • the target model provided by the embodiment of the present application not only takes into account the influence of the attribute information of the first item itself, but also considers the user's request for the target page and the second item clicked by the user.
  • the influence of external factors such as probability can further improve the accuracy of the probability of the first item being clicked by the user finally obtained by the target model.
  • Figure 10 is a schematic flow chart of the model training method provided by the embodiment of the present application. As shown in Figure 10, the method includes:
  • a batch of training data can be obtained first.
  • the batch of training data includes pages to be processed.
  • the pages to be processed include multiple lists, and each list contains at least one item. It is worth noting that in the page to be processed, the true probability of any item being clicked by the user is known.
  • first item, the second item, the first feature of the first item, and the second feature of the second item of the page to be processed reference may be made to the third item of the target page in step 801 in the embodiment shown in FIG. 8
  • the relevant descriptions of the first item, the second item, the first feature of the first item, and the second feature of the second item will not be described again here.
  • the first feature of the first item is the attribute information of the first item
  • the second feature of the first item is the information obtained by fusion based on the attribute information of the first item
  • the second feature of the second item is the information obtained based on the fusion of the attribute information of the first item.
  • Information obtained by fusing the attribute information of the second item that is, the first feature of the second item.
  • the first feature of the first item and the second feature of the second item can be processed by the model to be trained, thereby obtaining the first Secondary characteristics of the project.
  • obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item.
  • the fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are processed
  • the first fusion process is to obtain the second feature of the first item.
  • the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the page to be processed, and the second item being processed.
  • step 1002 reference may be made to the relevant description of step 802 in the embodiment shown in FIG. 8 , which will not be described again here.
  • the first item and the third item are located in different lists or the same list on the page to be processed, and the third item is adjacent to the first item.
  • the first feature of the third item can also be input to the target model. It should be noted that, regarding the third item of the page to be processed and the first feature of the third item, reference can be made to the embodiment shown in Figure 8 The relevant description of the third item of the target page and the first feature of the third item in step 803 will not be described again here.
  • the first feature of the third item After the first feature of the third item is input into the model to be trained, the first feature of the first item and the first feature of the third item can be processed by the model to be trained, thereby obtaining the third feature of the first item.
  • obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item.
  • the tenth feature performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
  • performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item includes: mapping the user's request for the page to be processed , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item.
  • step 1004 reference may be made to the relevant description of step 804 in the embodiment shown in FIG. 8 and will not be described again here.
  • the second feature of the first item and the third feature of the first item can be processed by the model to be trained, thereby obtaining that the first item was clicked by the user.
  • the probability can also be called the predicted probability that the first item is clicked by the user).
  • step 1005 reference may be made to the relevant description of step 805 in the embodiment shown in FIG. 8 , which will not be described again here.
  • the target loss is used to indicate the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user. difference between.
  • the predicted probability of the first item being clicked by the user After obtaining the predicted probability of the first item being clicked by the user, since the real probability of the first item being clicked by the user is known, the predicted probability of the first item being clicked by the user and the predicted probability of the first item being clicked by the user can be calculated through the preset target loss function The true probability of clicking is calculated to obtain the target loss.
  • the target loss is used to indicate the predicted probability of the first item being clicked by the user and the predicted probability of the first item being clicked. The difference between the true probability of an item being clicked by the user.
  • the parameters of the model to be trained can be updated based on the target loss, and the next batch of training data can be obtained, and the next batch of training data can be used to continue training the model to be trained after the updated parameters (i.e., re-execute steps 1001 to 1001). 1007), until the model training conditions are met (for example, the target loss reaches convergence, etc.), the target model in the embodiment shown in Figure 8 can be obtained.
  • the target model trained in the embodiment of this application has the ability to predict user behavior on the page.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • Figure 11 is a schematic structural diagram of a user behavior prediction device provided by an embodiment of the present application. As shown in Figure 11, the device includes:
  • the first acquisition module 1101 is used to acquire the first characteristics of the first item and the second characteristics of the second item through the target model.
  • the first item and the second item are located in different lists or the same list of the target page, and the second item Located before the first item;
  • the second acquisition module 1102 is configured to acquire the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item through the target model, where the first characteristic of the first item is the first item. Attribute information of Information obtained by fusion;
  • the third acquisition module 1103 is configured to acquire the probability that the first item is clicked by the user based on the second feature of the first item through the target model.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model.
  • the two items are in different lists or in the same list on the target page, and the second item is before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the device further includes: a fourth acquisition module, configured to acquire the first feature of the third item through the target model, where the first item and the third item are located in different lists or the same list of the target page. , and the third item is adjacent to the first item; the fifth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model; the third The acquisition module 1103 is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the target model.
  • the second acquisition module 1102 is configured to: map the first feature of the first item through the target model to obtain the fourth feature of the first item; map the first feature of the second item through the target model.
  • the second feature is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are first fused through the target model to obtain the first item the second characteristic.
  • the second acquisition module 1102 is configured to perform mapping processing on the first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user through the target model, to obtain
  • the sixth characteristic of the first item, the seventh characteristic of the first item, and the eighth characteristic of the first item; through the target model, the sixth characteristic of the first item, the seventh characteristic of the first item, and the eighth characteristic of the first item are The features undergo a second fusion process to obtain the fourth feature of the first item.
  • the fifth acquisition module is used to map the first feature of the first item and the first feature of the third item through the target model to obtain the sixth feature of the first item and the first feature of the third item.
  • the ninth feature of the first item; the third fusion process is performed on the sixth feature of the first item and the ninth feature of the first item through the target model to obtain the tenth feature of the first item; the third feature of the first item is obtained through the target model.
  • the six features and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
  • the fifth acquisition module is used to: map the user's request for the target page through the target model to obtain the seventh feature of the first item; map the sixth feature of the first item through the target model Features, the seventh feature of the first item, and the tenth feature of the first item are subjected to a fourth fusion process to obtain the third feature of the first item.
  • the second characteristic of the second item is a preset value.
  • the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • Figure 12 is a schematic structural diagram of a directed acyclic graph construction device provided by an embodiment of the present application. As shown in Figure 12, the device includes:
  • the acquisition module 1201 is used to obtain the eye movement data of the user browsing the target page;
  • the determination module 1202 is used to determine the user's browsing behavior for multiple items based on eye movement data, and the multiple items are located in multiple lists on the target page;
  • the connection module 1203 is used to connect multiple items based on browsing behavior to obtain a directed acyclic graph.
  • the user's browsing behavior for multiple items in the target page can be determined based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (for example, sequential browsing behavior and skipping behavior) , often determines the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order between different lists), thereby connecting multiple items of the target page according to these browsing, and obtaining the target page
  • a directed acyclic graph which can be used in the subsequent prediction of user behavior on the target page. Since the directed acyclic graph involves users' complex and diverse browsing behaviors, it is conducive to improving user behavior on the target page. Prediction accuracy.
  • connection module 1203 is used to connect items in the same list that the user browses in the first order, and connect items in different lists that the user browses in the second order. , connect in the second order to obtain a directed acyclic graph.
  • the acquisition module 1201 is configured to collect eye movement data of the user browsing the target page through an eye tracker.
  • Figure 13 is a schematic structural diagram of a model training device provided by an embodiment of the present application. As shown in Figure 13, the device includes:
  • the first acquisition module 1301 is used to acquire the first feature of the first item and the second feature of the second item through the model to be trained.
  • the first item and the second item are located in different lists or the same list of the page to be processed, and the first item The second item precedes the first item;
  • the second acquisition module 1302 is configured to acquire the second feature of the first item based on the first feature of the first item and the second feature of the second item through the model to be trained, where the first feature of the first item is the first Attribute information of the item, the second feature of the first item is the information obtained by fusion based on the attribute information of the first item, and the second feature of the second item is based on the attribute information of the second item (i.e., the first feature of the second item ) information obtained by fusion;
  • the third acquisition module 1303 is used to obtain the probability that the first item is clicked by the user based on the second feature of the first item through the model to be trained;
  • the fourth acquisition module 1304 is used to obtain the target loss based on the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user.
  • the target loss is used to indicate the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user. The difference between the true probability of a user clicking;
  • the update module 1305 is used to update the parameters of the model to be trained based on the target loss until the model training conditions are met and the target model is obtained.
  • the target model trained in the embodiment of this application has the ability to predict user behavior on the page.
  • the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item.
  • the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. .
  • the target model when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model
  • the probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
  • the device includes: a fifth acquisition module, configured to acquire the first feature of the third item through the model to be trained, and the first item and the third item are located in different lists or the same list of the page to be processed. , and the third item is adjacent to the first item; the sixth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the model to be trained; The third acquisition module 1303 is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the model to be trained.
  • the second acquisition module 1302 is configured to: perform mapping processing on the first feature of the first item to obtain the fourth feature of the first item; perform self-based mapping on the second feature of the second item. Attention mechanism processing, got to the fifth feature of the first item; perform a first fusion process on the fourth feature of the first item and the fifth feature of the first item to obtain the second feature of the first item.
  • the second acquisition module 1302 is configured to perform mapping processing on the first feature of the first item, the user's request for the page to be processed, and the probability that the second item is clicked by the user, to obtain the first item.
  • the sixth acquisition module is configured to map the first feature of the first item and the first feature of the third item to obtain the sixth feature of the first item and the first feature of the first item.
  • the ninth characteristic perform a third fusion process on the sixth characteristic of the first item and the ninth characteristic of the first item to obtain the tenth characteristic of the first item; perform the sixth characteristic of the first item and the tenth characteristic of the first item.
  • the sixth acquisition module is used to: map the user's request for the page to be processed to obtain the seventh feature of the first item; map the sixth feature of the first item, the first item's The seventh feature and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
  • the second characteristic of the second item is a preset value.
  • the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
  • FIG. 14 is a schematic structural diagram of the execution device provided by the embodiment of the present application.
  • the execution device 1400 can be embodied as a mobile phone, a tablet, a laptop, a smart wearable device, a server, etc., and is not limited here.
  • the user behavior prediction device described in the corresponding embodiment of FIG. 11 and the directed acyclic graph construction device described in the corresponding embodiment of FIG. 12 may be deployed on the execution device 1400 to implement the user behavior prediction device described in the corresponding embodiment of FIG. 4
  • the function of constructing an acyclic graph and the function of predicting user behavior in the corresponding embodiment of FIG. 8 The function of constructing an acyclic graph and the function of predicting user behavior in the corresponding embodiment of FIG. 8 .
  • the execution device 1400 includes: a receiver 1401, a transmitter 1402, a processor 1403 and a memory 1404 (the number of processors 1403 in the execution device 1400 can be one or more, one processor is taken as an example in Figure 14) , wherein the processor 1403 may include an application processor 14031 and a communication processor 14032.
  • the receiver 1401, the transmitter 1402, the processor 1403, and the memory 1404 may be connected by a bus or other means.
  • Memory 1404 may include read-only memory and random access memory and provides instructions and data to processor 1403 .
  • a portion of memory 1404 may also include non-volatile random access memory (NVRAM).
  • NVRAM non-volatile random access memory
  • the memory 1404 stores processor and operating instructions, executable modules or data structures, or a subset thereof, or an extended set thereof, where the operating instructions may include various operating instructions for implementing various operations.
  • the processor 1403 controls the execution of operations of the device.
  • various components of the execution device are coupled together through a bus system.
  • the bus system may also include a power bus, a control bus, a status signal bus, etc.
  • various buses are called bus systems in the figure.
  • the methods disclosed in the above embodiments of the present application can be applied to the processor 1403 or implemented by the processor 1403.
  • the processor 1403 may be an integrated circuit chip with signal processing capabilities. During the implementation process, each step of the above method The steps may be completed by instructions in the form of hardware integrated logic circuits or software in the processor 1403 .
  • the above-mentioned processor 1403 can be a general-purpose processor, a digital signal processor (DSP), a microprocessor or a microcontroller, and can further include an application specific integrated circuit (ASIC), a field programmable Gate array (field-programmable gate array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.
  • DSP digital signal processor
  • ASIC application specific integrated circuit
  • FPGA field-programmable gate array
  • the processor 1403 can implement or execute each method, step and logical block diagram disclosed in the embodiment of this application.
  • a general-purpose processor may be a microprocessor or the processor may be any conventional processor, etc.
  • the steps of the method disclosed in conjunction with the embodiments of the present application can be directly implemented by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor.
  • the software module can be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other mature storage media in this field.
  • the storage medium is located in the memory 1404.
  • the processor 1403 reads the information in the memory 1404 and completes the steps of the above method in combination with its hardware.
  • the receiver 1401 may be configured to receive input numeric or character information and generate signal inputs related to performing relevant settings and functional controls of the device.
  • the transmitter 1402 can be used to output numeric or character information through the first interface; the transmitter 1402 can also be used to send instructions to the disk group through the first interface to modify the data in the disk group; the transmitter 1402 can also include a display device such as a display screen .
  • the processor 1403 is used to predict user behavior for the target page through the target model in the corresponding embodiment of FIG. 8 .
  • FIG. 15 is a schematic structural diagram of the training device provided by the embodiment of the present application.
  • the training device 1500 is implemented by one or more servers.
  • the training device 1500 can vary greatly due to different configurations or performance, and can include one or more central processing units (CPU) 1514 (eg, one or more processors) and memory 1532, one or more storage media 1530 (eg, one or more mass storage devices) storing applications 1542 or data 1544.
  • the memory 1532 and the storage medium 1530 may be short-term storage or persistent storage.
  • the program stored in the storage medium 1530 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations in the training device.
  • the central processor 1514 may be configured to communicate with the storage medium 1530 and execute a series of instruction operations in the storage medium 1530 on the training device 1500 .
  • the training device 1500 may also include one or more power supplies 1526, one or more wired or wireless network interfaces 1550, one or more input and output interfaces 1558; or, one or more operating systems 1541, such as Windows ServerTM, Mac OS XTM , UnixTM, LinuxTM, FreeBSDTM and so on.
  • operating systems 1541 such as Windows ServerTM, Mac OS XTM , UnixTM, LinuxTM, FreeBSDTM and so on.
  • the training device can execute the model training method in the corresponding embodiment of Figure 10.
  • Embodiments of the present application also relate to a computer storage medium.
  • the computer-readable storage medium stores a program for performing signal processing.
  • the program When the program is run on a computer, it causes the computer to perform the steps performed by the aforementioned execution device, or, The computer is caused to perform the steps performed by the aforementioned training device.
  • Embodiments of the present application also relate to a computer program product that stores instructions that, when executed by a computer, cause the computer to perform the steps performed by the foregoing execution device, or cause the computer to perform the steps performed by the foregoing training device. A step of.
  • the execution device, training device or terminal device provided by the embodiment of the present application may specifically be a chip.
  • the chip includes: a processing unit and a communication unit.
  • the processing unit may be, for example, a processor.
  • the communication unit may be, for example, an input/output interface. Pins or circuits, etc.
  • the processing unit can execute the computer execution instructions stored in the storage unit, so that the chip in the execution device executes the data processing method described in the above embodiment, or so that the chip in the training device executes the data processing method described in the above embodiment.
  • the storage unit is a storage unit within the chip, such as a register, cache, etc.
  • the storage unit may also be a storage unit located outside the chip in the wireless access device, such as Read-only memory (ROM) or other types of static storage devices that can store static information and instructions, random access memory (random access memory, RAM), etc.
  • ROM Read-only memory
  • RAM random access memory
  • Figure 16 is a schematic structural diagram of a chip provided by an embodiment of the present application.
  • the chip can be represented as a neural network processor NPU 1600.
  • the NPU 1600 serves as a co-processor and is mounted to the host CPU (Host CPU). ), tasks are allocated by the Host CPU.
  • the core part of the NPU is the arithmetic circuit 1603.
  • the arithmetic circuit 1603 is controlled by the controller 1604 to extract the matrix data in the memory and perform multiplication operations.
  • the computing circuit 1603 includes multiple processing units (Process Engine, PE).
  • arithmetic circuit 1603 is a two-dimensional systolic array.
  • the arithmetic circuit 1603 may also be a one-dimensional systolic array or other electronic circuit capable of performing mathematical operations such as multiplication and addition.
  • arithmetic circuit 1603 is a general-purpose matrix processor.
  • the arithmetic circuit obtains the corresponding data of matrix B from the weight memory 1602 and caches it on each PE in the arithmetic circuit.
  • the operation circuit takes matrix A data and matrix B from the input memory 1601 to perform matrix operations, and the partial result or final result of the matrix is stored in an accumulator (accumulator) 1608 .
  • the unified memory 1606 is used to store input data and output data.
  • the weight data directly passes through the storage unit access controller (Direct Memory Access Controller, DMAC) 1605, and the DMAC is transferred to the weight memory 1602.
  • Input data is also transferred to unified memory 1606 via DMAC.
  • DMAC Direct Memory Access Controller
  • BIU is the Bus Interface Unit, that is, the bus interface unit 1613, which is used for the interaction between the AXI bus and the DMAC and the Instruction Fetch Buffer (IFB) 1609.
  • IFB Instruction Fetch Buffer
  • the bus interface unit 1613 (Bus Interface Unit, BIU for short) is used to fetch the memory 1609 to obtain instructions from the external memory, and is also used for the storage unit access controller 1605 to obtain the original data of the input matrix A or the weight matrix B from the external memory.
  • BIU Bus Interface Unit
  • DMAC is mainly used to transfer the input data in the external memory DDR to the unified memory 1606 or the weight data to the weight memory 1602 or the input data to the input memory 1601 .
  • the vector calculation unit 1607 includes multiple arithmetic processing units, and if necessary, further processes the output of the arithmetic circuit 1603, such as vector multiplication, vector addition, exponential operation, logarithmic operation, size comparison, etc. It is mainly used for non-convolutional/fully connected layer network calculations in neural networks, such as Batch Normalization, pixel-level summation, upsampling of predicted label planes, etc.
  • vector calculation unit 1607 can store the processed output vectors to unified memory 1606 .
  • the vector calculation unit 1607 can apply a linear function; or a nonlinear function to the output of the operation circuit 1603, such as linear interpolation on the prediction label plane extracted by the convolution layer, or a vector of accumulated values, to generate an activation value.
  • vector calculation unit 1607 generates normalized values, pixel-wise summed values, or both.
  • the processed output vector can be used as an activation input to the arithmetic circuit 1603, such as for use in a subsequent layer in a neural network.
  • the instruction fetch buffer 1609 connected to the controller 1604 is used to store instructions used by the controller 1604;
  • the unified memory 1606, the input memory 1601, the weight memory 1602 and the fetch memory 1609 are all On-Chip memories. External memory is private to the NPU hardware architecture.
  • the processor mentioned in any of the above places can be a general central processing unit, a microprocessor, an ASIC, or one or more integrated circuits used to control the execution of the above programs.
  • the device embodiments described above are only illustrative.
  • the units described as separate components may or may not be physically separated, and the components shown as units may or may not be physically separate.
  • the physical unit can be located in one place, or it can be distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • the connection relationship between modules indicates that there are communication connections between them, which can be specifically implemented as one or more communication buses or signal lines.
  • the present application can be implemented by software plus necessary general hardware. Of course, it can also be implemented by dedicated hardware including dedicated integrated circuits, dedicated CPUs, dedicated memories, Special components, etc. to achieve. In general, all functions performed by computer programs can be easily implemented with corresponding hardware. Moreover, the specific hardware structures used to implement the same function can also be diverse, such as analog circuits, digital circuits or special-purpose circuits. circuit etc. However, for this application, software program implementation is a better implementation in most cases. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or that contributes to the existing technology.
  • the computer software product is stored in a readable storage medium, such as a computer floppy disk. , U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk, etc., including several instructions to cause a computer device (which can be a personal computer, training device, or network device, etc.) to execute the steps described in various embodiments of this application. method.
  • a computer device which can be a personal computer, training device, or network device, etc.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored in or transmitted from one computer-readable storage medium to another, for example, the computer instructions may be transferred from a website, computer, training device, or data
  • the center transmits to another website site, computer, training equipment or data center through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means.
  • wired such as coaxial cable, optical fiber, digital subscriber line (DSL)
  • wireless such as infrared, wireless, microwave, etc.
  • the computer-readable storage medium may be any available medium that a computer can store, or a data storage device such as a training device or a data center integrated with one or more available media.
  • the available media may be magnetic media (eg, floppy disk, hard disk, magnetic tape), optical media (eg, DVD), or semiconductor media (eg, solid state disk (Solid State Disk, SSD)), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Databases & Information Systems (AREA)
  • Evolutionary Computation (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Biomedical Technology (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biophysics (AREA)
  • Software Systems (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Disclosed in the present application are a user behavior prediction method and a related device thereof, by means of which method the probability that a project obtained by a neural network model is clicked on by a user has a higher degree of accuracy, thereby facilitating subsequent accurate recommendation of projects of interest to the user. The method of the present application comprises: acquiring a first feature of a first project and a second feature of a second project, wherein the first project and the second project are located in different lists or the same list of a target page, and the second project is located before the first project; acquiring a second feature of the first project on the basis of the first feature of the first project and the second feature of the second project; and on the basis of the second feature of the first project, acquiring the probability that the first project is clicked on by a user.

Description

一种用户行为预测方法及其相关设备A user behavior prediction method and related equipment
本申请要求于2022年4月12日提交中国专利局、申请号为202210379948.7、发明名称为“一种用户行为预测方法及其相关设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims priority to the Chinese patent application filed with the China Patent Office on April 12, 2022, with application number 202210379948.7 and the invention title "A user behavior prediction method and related equipment", the entire content of which is incorporated by reference. in this application.
技术领域Technical field
本申请涉及人工智能(artificial intelligence,AI)技术领域,尤其涉及一种用户行为预测方法及其相关设备。This application relates to the technical field of artificial intelligence (AI), and in particular to a user behavior prediction method and related equipment.
背景技术Background technique
随着计算机技术的快速发展,为了满足用户的上网需求,开发商越来越倾向于在页面上展现用户感兴趣的内容。基于此,针对于某个页面,往往需要预测用户会点击该页面上所展示的哪个或哪些项目,即预测用户针对该页面的行为,进而修改该页面上所需呈现的项目,以为用户推荐其感兴趣的项目。With the rapid development of computer technology, in order to meet users' Internet needs, developers are increasingly inclined to display content that users are interested in on their pages. Based on this, for a certain page, it is often necessary to predict which item or items displayed on the page the user will click on, that is, predict the user's behavior on the page, and then modify the items to be displayed on the page to recommend them to the user. Projects of interest.
一般地,项目在某个页面中的排布方式往往是以多个列表的形式呈现给用户观看的,即该页面通常包含多个列表,且每个列表中包含多个项目。在预测用户针对该页面的行为时,对于该页面中的任意一个项目,可利用AI技术的神经网络模型来与该项目被用户点击的概率。Generally, the arrangement of items on a certain page is often presented to the user in the form of multiple lists, that is, the page usually contains multiple lists, and each list contains multiple items. When predicting the user's behavior on this page, for any item on the page, the neural network model of AI technology can be used to determine the probability of the item being clicked by the user.
然而,相关技术提供的神经网络模型在预测某个项目被用户点击的概率时,通常仅考虑该项目所在列表中的其余项目,对该项目所产生的影响。可见,相关技术所考虑的因素较为单一,导致模型最终得到的该项目被用户点击的概率,准确度往往较低,故后续无法精准为用户推荐其感兴趣的项目。However, when the neural network model provided by related technologies predicts the probability of a certain item being clicked by a user, it usually only considers the impact of the remaining items in the list where the item is located on the item. It can be seen that the factors considered by the relevant technology are relatively single, resulting in the probability that the item is clicked by the user finally obtained by the model, which is often less accurate. Therefore, it cannot accurately recommend items of interest to the user in the future.
发明内容Contents of the invention
本申请实施例提供了一种用户行为预测方法及其相关设备,可以令神经网络模型所得到的项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The embodiments of this application provide a user behavior prediction method and related equipment, which can make the probability of items being clicked by the user obtained by the neural network model have a higher accuracy, which is conducive to subsequent accurate recommendation of items of interest to the user. project.
本申请实施例的第一方面提供了一种用户行为预测方法,该方法包括:The first aspect of the embodiments of this application provides a user behavior prediction method, which method includes:
当需要对目标页面进行用户行为预测时,即需要获取目标页面中第一项目被用户点击的概率时,可先获取第一项目的第一特征以及第二项目的第二特征,并将第一项目的第一特征以及第二项目的第二特征输入目标模型。其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前,即第一项目和第二项目的位置关系存在以下两种情况:(1)第一项目和第二项目可以为同一列表中的项目,第二项目位于第一项目之前且第二项目和第一项目相邻。(2)第一项目和第二项目可以为不同列表中的项目,第二项目所在的列表位于第一项目所在的列表之前,第二项目既可以和第一项目相邻,也可以不和第一项目相邻。When it is necessary to predict user behavior on the target page, that is, when it is necessary to obtain the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be obtained first, and the first The first feature of the item and the second feature of the second item are input into the target model. Among them, the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item, that is, the positional relationship between the first item and the second item exists in the following two situations: (1) The first item and the second item may be items in the same list, the second item is located before the first item and the second item is adjacent to the first item. (2) The first item and the second item can be items in different lists. The list where the second item is located is located before the list where the first item is located. The second item can be adjacent to the first item or not. One item is adjacent.
将第一项目的第一特征以及第二项目的第二特征输入目标模型后,可通过目标模型对第一项目的第一特征以及第二项目的第二特征进行处理,从而得到第一项目的第二特征。值得注意的是,第一项目的第一特征可以为第一项目自身的属性信息,那么,第一项目的第二特 征为基于第一项目的属性信息(也就是第一项目的第一特征)进行融合得到的信息,由于第二项目的第二特征的获取过程如同第一项目的第二特征的获取过程,故第二项目的第二特征也为基于第二项目的属性信息(第二项目的第一特征)进行融合得到的信息。After inputting the first feature of the first item and the second feature of the second item into the target model, the first feature of the first item and the second feature of the second item can be processed by the target model to obtain the first feature of the first item. Second characteristic. It is worth noting that the first characteristic of the first item can be the attribute information of the first item itself, then the second characteristic of the first item Characteristics are information obtained by fusion based on the attribute information of the first item (that is, the first feature of the first item). Since the acquisition process of the second feature of the second item is the same as the acquisition process of the second feature of the first item, The second feature of the second item is also information obtained by fusion based on the attribute information of the second item (the first feature of the second item).
最后,可通过目标模型对第一项目的第二特征进行处理,从而得到第一项目被用户点击的概率。Finally, the second feature of the first item can be processed through the target model to obtain the probability that the first item is clicked by the user.
从上述方法可以看出:当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。It can be seen from the above method that when it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and The second item is in a different list or in the same list on the target page, and the second item is before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该方法还包括:获取第三项目的第一特征,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻;基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;基于第一项目的第二特征,获取第一项目被用户点击的概率包括:基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。前述实现方式中,还可将第三项目的第一特征输入至目标模型,其中,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻,即第一项目和第三项目的位置关系存在以下两种情况:(1)第一项目和第三项目可以为同一列表中的项目,第三项目和第一项目相邻。(2)第一项目和第三项目可以为不同列表中的项目,第三项目和第一项目相邻。将第三项目的第一特征输入目标模型后,可通过目标模型对第一项目的第一特征以及第三项目的第一特征进行处理,从而得到第一项目的第三特征。得到第一项目的第二特征以及第一项目的第三特征后,目标模型可对第一项目的第二特征以及第一项目的第三特征进行计算,从而得到第一项目被用户点击的概率。由于第一项目的第二特征可表征第二项目对第一项目所产生的影响,即用户使用顺序浏览行为以及跳过行为浏览到第一项目时,用户在进行这些行为的过程中所浏览的项目对第一项目所产生的影响,第一项目的第三特征可表征第三项目对第一项目所产生的影响,即用户使用对比行为浏览到第一项目时,用户在进行该行为的过程中所浏览的项目对第一项目所产生的影响,可见,目标模型在进行用户行为预测时,不仅引入了常规的顺序浏览行为,还引入到跳转行为和对比行为等等浏览行为,也就是说,目标模型会考虑用户使用这些复杂多样的浏览行为浏览至第一项目时,用户在进行这些行为的过程中所浏览的项目对第一项目所产生的影响,可进一步提高目标模型最终得到的第一项目被用户点击的概率的准确度。In a possible implementation, the method further includes: obtaining the first characteristic of the third item, the first item and the third item are located in different lists or the same list on the target page, and the third item is related to the first item. Neighbor; based on the first feature of the first item and the first feature of the third item, obtaining the third feature of the first item; based on the second feature of the first item, obtaining the probability that the first item is clicked by the user includes: based on the The second feature of the item and the third feature of the first item are used to obtain the probability that the first item is clicked by the user. In the aforementioned implementation, the first feature of the third item can also be input to the target model, where the first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item. , that is, the positional relationship between the first item and the third item exists in the following two situations: (1) The first item and the third item can be items in the same list, and the third item and the first item are adjacent. (2) The first item and the third item can be items in different lists, and the third item and the first item are adjacent. After the first feature of the third item is input into the target model, the first feature of the first item and the first feature of the third item can be processed by the target model, thereby obtaining the third feature of the first item. After obtaining the second feature of the first item and the third feature of the first item, the target model can calculate the second feature of the first item and the third feature of the first item, thereby obtaining the probability that the first item is clicked by the user. . Since the second characteristic of the first item can represent the impact of the second item on the first item, that is, when the user uses sequential browsing behavior and skipping behavior to browse to the first item, the user browses during these behaviors. The impact of the project on the first project. The third feature of the first project can represent the impact of the third project on the first project. That is, when the user uses the contrast behavior to browse to the first project, the user is in the process of performing this behavior. It can be seen that when predicting user behavior, the target model not only introduces conventional sequential browsing behavior, but also introduces browsing behaviors such as jump behavior and comparison behavior, that is, That is to say, the target model will consider the impact of the items browsed by the user on the first item when the user uses these complex and diverse browsing behaviors to browse the first item, which can further improve the final result of the target model. The accuracy of the probability that the first item is clicked by the user.
在一种可能的实现方式中,基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征包括:对第一项目的第一特征进行映射处理,得到第一项目的第四特征;对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;对第一项 目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。前述实现方式中,将第一项目的第一特征和第二项目的第二特征输入目标模型后,目标模型可将第一项目的第一特征在隐空间上进行映射,得到第一项目的第四特征,与此同时,目标模型还可对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征。得到第一项目的第四特征以及第一项目的第五特征,目标模型可利用循环神经单元对第一项目的第四特征以及第一项目的第五特征进行处理,从而准确得到第一项目的第二特征。In a possible implementation, based on the first feature of the first item and the second feature of the second item, obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item. The fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the first item The fourth feature of the object and the fifth feature of the first item are subjected to a first fusion process to obtain the second feature of the first item. In the aforementioned implementation manner, after the first feature of the first item and the second feature of the second item are input into the target model, the target model can map the first feature of the first item on the latent space to obtain the first feature of the first item. Four features. At the same time, the target model can also process the second feature of the second item based on the self-attention mechanism to obtain the fifth feature of the first item. After obtaining the fourth feature of the first item and the fifth feature of the first item, the target model can use the recurrent neural unit to process the fourth feature of the first item and the fifth feature of the first item, thereby accurately obtaining the first item's fourth feature. Second characteristic.
在一种可能的实现方式中,对第一项目的第一特征进行映射处理,得到第一项目的第四特征:对第一项目的第一特征、用户对目标页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。前述实现方式中,在获取第一项目的第四特征之前,还可向目标模型输入用户对目标页面的请求以及第二项目被用户点击的概率,那么,目标模型可分别将第一项目的第一特征、用户对目标页面的请求以及第二项目被用户点击的概率在隐空间上进行映射,相应得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征,再对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行拼接,得到第一项目的第四特征。由此可见,目标模型在对第一项目进行分析时,不仅考虑到第一项目自身的属性信息的影响,还考虑到用户对目标页面的请求以及第二项目被用户点击的概率等外界因素所产生的影响,从而进一步提高目标模型最终得到的第一项目被用户点击的概率的准确度。In a possible implementation, the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the target page, and the second item being The probability of the user clicking is mapped to obtain the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item; the sixth feature of the first item, the seventh feature of the first item, and The eighth feature of the first item is subjected to the second fusion process to obtain the fourth feature of the first item. In the foregoing implementation, before obtaining the fourth feature of the first item, the user's request for the target page and the probability that the second item is clicked by the user can also be input to the target model. Then, the target model can respectively obtain the third feature of the first item. The first feature, the user's request for the target page and the probability of the second item being clicked by the user are mapped on the latent space, and accordingly the sixth feature of the first item, the seventh feature of the first item and the eighth feature of the first item are obtained , and then splice the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item to obtain the fourth feature of the first item. It can be seen that when the target model analyzes the first item, it not only takes into account the influence of the attribute information of the first item itself, but also considers the influence of external factors such as the user's request for the target page and the probability of the second item being clicked by the user. The impact produced by the target model further improves the accuracy of the probability of the first item being clicked by the user.
在一种可能的实现方式中,基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征包括:对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。前述实现方式中,将第三项目的第一特征输入目标模型后,目标模型可分别将第一项目的第一特征以及第三项目的第一特征在隐空间上进行映射,相应得到第一项目的第六特征以及第一项目的第九特征。然后,目标模型可通过对比函数对第一项目的第六特征和第一项目的第九特征进行计算,再基于计算结果进行加权求和,得到第一项目的第十特征。最后,目标模型可对第一项目的第六特征以及第一项目的第十特征进行同或运算,准确得到第一项目的第三特征。In a possible implementation, based on the first feature of the first item and the first feature of the third item, obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item. The tenth feature; performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item. In the foregoing implementation, after the first feature of the third item is input into the target model, the target model can respectively map the first feature of the first item and the first feature of the third item on the latent space, and obtain the first item accordingly. The sixth characteristic of the first item and the ninth characteristic of the first item. Then, the target model can calculate the sixth feature of the first item and the ninth feature of the first item through the comparison function, and then perform a weighted sum based on the calculation results to obtain the tenth feature of the first item. Finally, the target model can perform an exclusive OR operation on the sixth feature of the first item and the tenth feature of the first item to accurately obtain the third feature of the first item.
在一种可能的实现方式中,第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征包括:对用户对目标页面的请求进行映射处理,得到第一项目的第七特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。前述实现方式中,在获取第一项目的第三特征时,目标模型还可将用户对目标页面的请求在隐空间上进行映射,得到第一项目的第七特征,那么,目标模型可对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行同或运算,得到第一项目的第三特征。由此可见,目标模型在对第一项目进行分析时,不仅考虑到第一项目自身的属性信息的影响,还考虑到用户对目标页面的请求等外界因素所产生的影响,从而进一步提高目标模型最终得到的第一项目被用户点击的概率的准确度。In a possible implementation, performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item, and obtaining the third feature of the first item includes: mapping the user's request for the target page , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item. In the aforementioned implementation, when obtaining the third feature of the first item, the target model can also map the user's request for the target page on the latent space to obtain the seventh feature of the first item. Then, the target model can map the third feature of the first item. The sixth characteristic of one item, the seventh characteristic of the first item and the tenth characteristic of the first item are subjected to an exclusive OR operation to obtain the third characteristic of the first item. It can be seen that when the target model analyzes the first item, it not only takes into account the influence of the attribute information of the first item itself, but also considers the influence of external factors such as the user's request for the target page, thereby further improving the target model The final accuracy of the probability that the first item is clicked by the user.
在一种可能的实现方式中,若第一项目为目标页面中的首个项目,则第二项目的第二特 征为预置值。In one possible implementation, if the first item is the first item in the target page, then the second characteristic of the second item is the preset value.
在一种可能的实现方式中,目标页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
本申请实施例的第二方面提供了一种有向无环图构建方法,该方法包括:获取用户浏览目标页面的眼动数据;基于眼动数据,确定用户对多个项目的浏览行为,多个项目位于目标页面的多个列表中;基于浏览行为,对多个项目进行连接,得到有向无环图。The second aspect of the embodiment of the present application provides a method for constructing a directed acyclic graph. The method includes: obtaining the eye movement data of the user browsing the target page; based on the eye movement data, determining the user's browsing behavior for multiple items. Each item is located in multiple lists on the target page; based on browsing behavior, multiple items are connected to obtain a directed acyclic graph.
从上述方法可以看出:可基于用户浏览目标页面时所产生的眼动数据,来确定用户对目标页面中多个项目的浏览行为,那么,这些浏览行为(例如,顺序浏览行为以及跳过行为),往往决定了用户对项目的浏览顺序(例如,用户在同一列表中的浏览顺序以及用户在不同列表之间的浏览顺序),从而按照这些浏览来连接目标页面的多个项目,得到针对目标页面的有向无环图,该有向无环图可用于后续对目标页面的用户行为预测中,由于该有向无环图涉及了用户复杂多样的浏览行为,有利于提高对目标页面的用户行为预测的准确度。It can be seen from the above method that the user's browsing behavior for multiple items in the target page can be determined based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (for example, sequential browsing behavior and skipping behavior ), often determines the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order between different lists), thereby connecting multiple items on the target page according to these browsing to obtain the target The directed acyclic graph of the page can be used in subsequent predictions of user behavior on the target page. Since the directed acyclic graph involves users’ complex and diverse browsing behaviors, it is helpful to improve users’ understanding of the target page. Accuracy of behavioral predictions.
在一种可能的实现方式中,基于浏览行为,对多个项目进行连接,得到有向无环图包括:将用户以第一顺序浏览的同一列表中的项目,按第一顺序进行连接,并将用户以第二顺序浏览的不同列表中的项目,按第二顺序进行连接,得到有向无环图。前述实现方式中,用户的浏览行为包含两大类浏览行为。第一类浏览行为指用户浏览同一列表中的项目,包含第一类顺序浏览行为,故可将用户在同一列表中的浏览顺序称为第一顺序,第一顺序包含第一类顺序浏览行为中,用户浏览同一列表中所有项目所按照的从上到下的顺序以及从左到右的顺序。第二类浏览行为指用户浏览不同列表之间的项目,包含第二类顺序浏览行为以及对比行为,故可将用户在不同列表之间的浏览顺序称为第二顺序,第二顺序包含第二类顺序浏览行为中,用户浏览相邻两个列表中相邻的若干个项目所按照的前后顺序,以及对比行为中,用户浏览非相邻两个列表中的两个项目所按照的跳转顺序。那么,可按照第一顺序以及第二顺序,将目标页面中的所有项目连接起来,从而得到针对目标页面的有向无环图。In one possible implementation, based on browsing behavior, connecting multiple items to obtain a directed acyclic graph includes: connecting items in the same list that the user browsed in the first order, and The items in different lists browsed by the user in the second order are connected in the second order to obtain a directed acyclic graph. In the aforementioned implementation method, the user's browsing behavior includes two major types of browsing behavior. The first type of browsing behavior refers to the user browsing items in the same list, including the first type of sequential browsing behavior. Therefore, the user's browsing order in the same list can be called the first order, and the first order includes the first type of sequential browsing behavior. , the top-to-bottom and left-to-right order in which users browse all items in the same list. The second type of browsing behavior refers to the user browsing items between different lists, including the second type of sequential browsing behavior and comparison behavior. Therefore, the user's browsing order between different lists can be called the second order, and the second order includes the second type. In the sequential browsing behavior, the order in which the user browses several adjacent items in two adjacent lists, and in the comparison behavior, the jump order in which the user browses two items in two non-adjacent lists. . Then, all items in the target page can be connected according to the first order and the second order, thereby obtaining a directed acyclic graph for the target page.
在一种可能的实现方式中,获取用户浏览目标页面的眼动数据:通过眼动仪采集用户浏览目标页面的眼动数据。In one possible implementation, obtaining the eye movement data of the user browsing the target page: collecting the eye movement data of the user browsing the target page through an eye tracker.
本申请实施例的第三方面提供了一种模型训练方法,该方法包括:通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于待处理页面的不同列表或同一列表中,且第二项目位于第一项目之前;通过待训练模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,其中,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息;通过待训练模型基于第一项目的第二特征,获取第一项目被用户点击的概率;基于第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,目标损失用于指示第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异;基于目标损失,更新待训练模型的参数,直至满足模型训练条件,得到目标模型。The third aspect of the embodiment of the present application provides a model training method. The method includes: obtaining the first feature of the first item and the second feature of the second item through the model to be trained, and the first item and the second item are located in the to-be-trained model. Process different lists on the page or the same list, and the second item is located before the first item; obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item through the model to be trained, Among them, the first feature of the first item is the attribute information of the first item, the second feature of the first item is the information obtained by fusion based on the attribute information of the first item, and the second feature of the second item is the information based on the second item. Information obtained by fusing the attribute information of the second item (i.e., the first feature of the second item); using the model to be trained based on the second feature of the first item, the probability that the first item is clicked by the user is obtained; based on the probability that the first item is clicked by the user probability and the real probability that the first item is clicked by the user, and obtains the target loss. The target loss is used to indicate the difference between the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user; based on the target loss, update the Train the parameters of the model until the model training conditions are met and the target model is obtained.
上述方法所得到的目标模型,具备对页面进行用户行为预测的能力。当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的 第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The target model obtained by the above method has the ability to predict user behavior on the page. When it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the first feature of the second item can be input to the target model. Second feature, wherein the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该方法还包括:通过待训练模型获取第三项目的第一特征,第一项目和第三项目位于待处理页面的不同列表或同一列表中,且第三项目与第一项目相邻;通过待训练模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;通过待训练模型基于第一项目的第二特征,获取第一项目被用户点击的概率包括:通过待训练模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。In a possible implementation, the method further includes: obtaining the first feature of the third item through the model to be trained, the first item and the third item are located in different lists or the same list of the page to be processed, and the third item adjacent to the first item; through the model to be trained based on the first feature of the first item and the first feature of the third item, the third feature of the first item is obtained; through the model to be trained based on the second feature of the first item, Obtaining the probability that the first item is clicked by the user includes: using the to-be-trained model to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item.
在一种可能的实现方式中,基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征包括:对第一项目的第一特征进行映射处理,得到第一项目的第四特征;对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, based on the first feature of the first item and the second feature of the second item, obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item. The fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are processed The first fusion process is to obtain the second feature of the first item.
在一种可能的实现方式中,对第一项目的第一特征进行映射处理,得到第一项目的第四特征:对第一项目的第一特征、用户对待处理页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。In a possible implementation, the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the page to be processed, and the second item being processed. The probability of the user clicking is mapped to obtain the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item; the sixth feature of the first item, the seventh feature of the first item, and The eighth feature of the first item is subjected to the second fusion process to obtain the fourth feature of the first item.
在一种可能的实现方式中,基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征包括:对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, based on the first feature of the first item and the first feature of the third item, obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item. The tenth feature; performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
在一种可能的实现方式中,第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征包括:对用户对待处理页面的请求进行映射处理,得到第一项目的第七特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item includes: mapping the user's request for the page to be processed , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item.
在一种可能的实现方式中,若第一项目为待处理页面中的首个项目,则第二项目的第二特征为预置值。In a possible implementation, if the first item is the first item in the page to be processed, the second characteristic of the second item is a preset value.
在一种可能的实现方式中,待处理页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
本申请实施例的第四方面提供了一种用户行为预测装置,该装置包括:第一获取模块,用于通过目标模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目 位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前;第二获取模块,用于通过目标模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,其中,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息;第三获取模块,用于通过目标模型基于第一项目的第二特征,获取第一项目被用户点击的概率。The fourth aspect of the embodiment of the present application provides a user behavior prediction device. The device includes: a first acquisition module, configured to acquire the first feature of the first item and the second feature of the second item through the target model. The first project and second project is located in a different list or the same list on the target page, and the second item is located before the first item; the second acquisition module is used to obtain the second item based on the first feature of the first item and the second feature of the second item through the target model. The second characteristic of an item, wherein the first characteristic of the first item is the attribute information of the first item, the second characteristic of the first item is the information obtained by fusion based on the attribute information of the first item, and the second characteristic of the second item is The second feature is the information obtained by fusion based on the attribute information of the second item (i.e., the first feature of the second item); the third acquisition module is used to obtain the first item based on the second feature of the first item through the target model. The probability of a user clicking.
从上述装置可以看出:当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。It can be seen from the above device that when it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and The second item is in a different list or in the same list on the target page, and the second item is before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该装置还包括:第四获取模块,用于通过目标模型获取第三项目的第一特征,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻;第五获取模块,用于通过目标模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;第三获取模块,用于通过目标模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。In a possible implementation, the device further includes: a fourth acquisition module, configured to acquire the first feature of the third item through the target model, where the first item and the third item are located in different lists or the same list of the target page. , and the third item is adjacent to the first item; the fifth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model; the third The acquisition module is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the target model.
在一种可能的实现方式中,第二获取模块,用于:通过目标模型对第一项目的第一特征进行映射处理,得到第一项目的第四特征;通过目标模型对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;通过目标模型对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, the second acquisition module is configured to: map the first feature of the first item through the target model to obtain the fourth feature of the first item; map the third feature of the second item through the target model. The two features are processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are first fused through the target model to obtain the first item's fifth feature. Second characteristic.
在一种可能的实现方式中,第二获取模块,用于:通过目标模型对第一项目的第一特征、用户对目标页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;通过目标模型对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。In a possible implementation, the second acquisition module is configured to: use the target model to map the first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user, to obtain the third The sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item; through the target model, the sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item The second fusion process is performed to obtain the fourth feature of the first item.
在一种可能的实现方式中,第五获取模块,用于:通过目标模型对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;通过目标模型对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;通过目标模型对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation manner, the fifth acquisition module is used to map the first feature of the first item and the first feature of the third item through the target model to obtain the sixth feature of the first item and the first feature of the third item. The ninth feature of the first item; the third fusion process is performed on the sixth feature of the first item and the ninth feature of the first item through the target model to obtain the tenth feature of the first item; the third feature of the first item is obtained through the target model The six features and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,第五获取模块,用于:通过目标模型对用户对目标页面的请求进行映射处理,得到第一项目的第七特征;通过目标模型对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the fifth acquisition module is used to: map the user's request for the target page through the target model to obtain the seventh feature of the first item; map the sixth feature of the first item through the target model Features, the seventh feature of the first item, and the tenth feature of the first item are subjected to a fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,若第一项目为目标页面中的首个项目,则第二项目的第二特 征为预置值。In one possible implementation, if the first item is the first item in the target page, then the second characteristic of the second item is the preset value.
在一种可能的实现方式中,目标页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
本申请实施例的第五方面提供了一种有向无环图构建装置,该装置包括:获取模块,用于获取用户浏览目标页面的眼动数据;确定模块,用于基于眼动数据,确定用户对多个项目的浏览行为,多个项目位于目标页面的多个列表中;连接模块,用于基于浏览行为,对多个项目进行连接,得到有向无环图。The fifth aspect of the embodiment of the present application provides a device for constructing a directed acyclic graph. The device includes: an acquisition module, used to obtain the eye movement data of the user browsing the target page; and a determination module, used to determine based on the eye movement data. The user's browsing behavior for multiple items, multiple items are located in multiple lists on the target page; the connection module is used to connect multiple items based on the browsing behavior to obtain a directed acyclic graph.
上述装置可基于用户浏览目标页面时所产生的眼动数据,来确定用户对目标页面中多个项目的浏览行为,那么,这些浏览行为(例如,顺序浏览行为以及跳过行为),往往决定了用户对项目的浏览顺序(例如,用户在同一列表中的浏览顺序以及用户在不同列表之间的浏览顺序),从而按照这些浏览来连接目标页面的多个项目,得到针对目标页面的有向无环图,该有向无环图可用于后续对目标页面的用户行为预测中,由于该有向无环图涉及了用户复杂多样的浏览行为,有利于提高对目标页面的用户行为预测的准确度。The above device can determine the user's browsing behavior for multiple items in the target page based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (such as sequential browsing behavior and skipping behavior) often determine The order in which the user browses the items (for example, the order in which the user browses in the same list and the order in which the user browses between different lists) is used to connect multiple items of the target page according to these views, and a directed and undirected view of the target page is obtained. Ring graph, this directed acyclic graph can be used in the subsequent prediction of user behavior on the target page. Since this directed acyclic graph involves users' complex and diverse browsing behaviors, it is helpful to improve the accuracy of user behavior prediction on the target page. .
在一种可能的实现方式中,连接模块,用于将用户以第一顺序浏览的同一列表中的项目,按第一顺序进行连接,并将用户以第二顺序浏览的不同列表中的项目,按第二顺序进行连接,得到有向无环图。In a possible implementation, the connection module is used to connect items in the same list that the user browses in the first order, and connect items in different lists that the user browses in the second order. Connect in the second order to obtain a directed acyclic graph.
在一种可能的实现方式中,获取模块,用于通过眼动仪采集用户浏览目标页面的眼动数据。In a possible implementation, the acquisition module is used to collect eye movement data of the user browsing the target page through an eye tracker.
本申请实施例的第六方面提供了一种模型训练装置的一个结构示意图,该装置包括:第一获取模块,用于通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于待处理页面的不同列表或同一列表中,且第二项目位于第一项目之前;第二获取模块,用于通过待训练模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,其中,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息;第三获取模块,用于通过待训练模型基于第一项目的第二特征,获取第一项目被用户点击的概率;第四获取模块,用于基于第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,目标损失用于指示第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异;更新模块,用于基于目标损失,更新待训练模型的参数,直至满足模型训练条件,得到目标模型。The sixth aspect of the embodiment of the present application provides a schematic structural diagram of a model training device. The device includes: a first acquisition module, configured to acquire the first feature of the first item and the second feature of the second item through the model to be trained. Features, the first item and the second item are located in different lists or the same list on the page to be processed, and the second item is located before the first item; the second acquisition module is used to use the model to be trained based on the first feature of the first item and the second feature of the second item, to obtain the second feature of the first item, where the first feature of the first item is the attribute information of the first item, and the second feature of the first item is the attribute information based on the first item The information obtained by fusion, the second feature of the second item is the information obtained by fusion based on the attribute information of the second item (ie, the first feature of the second item); the third acquisition module is used to use the model to be trained based on the first feature of the second item. The second feature of an item is to obtain the probability that the first item is clicked by the user; the fourth acquisition module is used to obtain the target loss based on the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user. Used to indicate the difference between the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user; the update module is used to update the parameters of the model to be trained based on the target loss until the model training conditions are met and the target is obtained Model.
上述装置训练得到的目标模型,具备对页面进行用户行为预测的能力。当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中 浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The target model trained by the above device has the ability to predict user behavior on the page. When it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists, so the factors considered by the target model are relatively comprehensive and can fit the user's needs in the target page. Based on the actual situation when browsing to the first item, the probability that the first item is clicked by the user finally obtained by the target model has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该装置包括:第五获取模块,用于通过待训练模型获取第三项目的第一特征,第一项目和第三项目位于待处理页面的不同列表或同一列表中,且第三项目与第一项目相邻;第六获取模块,用于通过待训练模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;第三获取模块,用于通过待训练模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。In a possible implementation, the device includes: a fifth acquisition module, configured to acquire the first feature of the third item through the model to be trained, and the first item and the third item are located in different lists or the same list of the page to be processed. , and the third item is adjacent to the first item; the sixth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the model to be trained; The third acquisition module is used to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the model to be trained.
在一种可能的实现方式中,第二获取模块,用于:对第一项目的第一特征进行映射处理,得到第一项目的第四特征;对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, the second acquisition module is used to: perform mapping processing on the first feature of the first item to obtain the fourth feature of the first item; perform self-attention-based processing on the second feature of the second item. The force mechanism is processed to obtain the fifth characteristic of the first item; the fourth characteristic of the first item and the fifth characteristic of the first item are first fused to obtain the second characteristic of the first item.
在一种可能的实现方式中,第二获取模块,用于:对第一项目的第一特征、用户对待处理页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。In a possible implementation, the second acquisition module is used to map the first feature of the first item, the user's request for the page to be processed, and the probability that the second item is clicked by the user, and obtain the first item's first characteristic. The sixth feature, the seventh feature of the first item, and the eighth feature of the first item; performing a second fusion process on the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item, Get the fourth characteristic of the first item.
在一种可能的实现方式中,第六获取模块,用于:对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the sixth acquisition module is configured to map the first feature of the first item and the first feature of the third item to obtain the sixth feature of the first item and the first feature of the first item. The ninth characteristic; perform a third fusion process on the sixth characteristic of the first item and the ninth characteristic of the first item to obtain the tenth characteristic of the first item; perform the sixth characteristic of the first item and the tenth characteristic of the first item The features are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,第六获取模块,用于:对用户对待处理页面的请求进行映射处理,得到第一项目的第七特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the sixth acquisition module is used to: map the user's request for the page to be processed to obtain the seventh feature of the first item; map the sixth feature of the first item, the first item's The seventh feature and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,若第一项目为待处理页面中的首个项目,则第二项目的第二特征为预置值。In a possible implementation, if the first item is the first item in the page to be processed, the second characteristic of the second item is a preset value.
在一种可能的实现方式中,待处理页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
本申请实施例的第七方面提供了一种用户行为预测装置,该装置包括存储器和处理器;存储器存储有代码,处理器被配置为执行代码,当代码被执行时,用户行为预测装置执行如第一方面或第一方面中任意一种可能的实现方式所述的方法。The seventh aspect of the embodiment of the present application provides a user behavior prediction device, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code. When the code is executed, the user behavior prediction device executes as follows: The method described in the first aspect or any possible implementation manner of the first aspect.
本申请实施例的第八方面提供了一种有向无环图构建装置,该装置包括存储器和处理器;存储器存储有代码,处理器被配置为执行代码,当代码被执行时,有向无环图构建装置如第二方面或第二方面中任意一种可能的实现方式所述的方法。The eighth aspect of the embodiment of the present application provides a device for constructing a directed acyclic graph, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code. When the code is executed, the directed acyclic graph is The ring graph construction device is as described in the second aspect or any possible implementation manner of the second aspect.
本申请实施例的第九方面提供了一种模型训练装置,该装置包括存储器和处理器;存储器存储有代码,处理器被配置为执行代码,当代码被执行时,模型训练装置执行如第三方面或第三方面中任意一种可能的实现方式所述的方法。A ninth aspect of the embodiment of the present application provides a model training device, which includes a memory and a processor; the memory stores code, and the processor is configured to execute the code. When the code is executed, the model training device executes the third step The method described in any possible implementation manner of the aspect or the third aspect.
本申请实施例的第十方面提供了一种电路系统,该电路系统包括处理电路,该处理电路配置为执行如第一方面、第一方面中的任意一种可能的实现方式、第二方面、第二方面中的任意一种可能的实现方式或第三方面、第三方面中的任意一种可能的实现方式所述的方法。 A tenth aspect of the embodiments of the present application provides a circuit system. The circuit system includes a processing circuit configured to perform any of the possible implementations of the first aspect, the second aspect, Any possible implementation manner in the second aspect or the method described in any possible implementation manner in the third aspect or the third aspect.
本申请实施例的第十一方面提供了一种芯片系统,该芯片系统包括处理器,用于调用存储器中存储的计算机程序或计算机指令,以使得该处理器执行如第一方面、第一方面中的任意一种可能的实现方式、第二方面、第二方面中的任意一种可能的实现方式或第三方面、第三方面中的任意一种可能的实现方式所述的方法。An eleventh aspect of the embodiments of the present application provides a chip system. The chip system includes a processor for calling a computer program or computer instructions stored in a memory, so that the processor executes the first aspect as described in the first aspect. Any one of the possible implementations in the second aspect, any one of the possible implementations of the second aspect, or the third aspect, the method described in any one of the possible implementations of the third aspect.
在一种可能的实现方式中,该处理器通过接口与存储器耦合。In one possible implementation, the processor is coupled to the memory through an interface.
在一种可能的实现方式中,该芯片系统还包括存储器,该存储器中存储有计算机程序或计算机指令。In a possible implementation, the chip system further includes a memory, and computer programs or computer instructions are stored in the memory.
本申请实施例的第十二方面提供了一种计算机存储介质,该计算机存储介质存储有计算机程序,该程序在由计算机执行时,使得计算机实施如第一方面、第一方面中的任意一种可能的实现方式、第二方面、第二方面中的任意一种可能的实现方式或第三方面、第三方面中的任意一种可能的实现方式所述的方法。A twelfth aspect of the embodiments of the present application provides a computer storage medium. The computer storage medium stores a computer program. When the program is executed by a computer, the computer implements any one of the first aspect and the first aspect. Possible implementations, the second aspect, any one possible implementation of the second aspect, or the third aspect, the method described in any one of the possible implementations of the third aspect.
本申请实施例的第十三方面提供了一种计算机程序产品,该计算机程序产品存储有指令,该指令在由计算机执行时,使得计算机实施如第一方面、第一方面中的任意一种可能的实现方式、第二方面、第二方面中的任意一种可能的实现方式或第三方面、第三方面中的任意一种可能的实现方式所述的方法。A thirteenth aspect of the embodiments of the present application provides a computer program product. The computer program product stores instructions. When executed by a computer, the instructions make it possible for the computer to implement any one of the first aspect and the first aspect. The method described in the implementation, the second aspect, any one possible implementation of the second aspect, or the third aspect, any one possible implementation of the third aspect.
本申请实施例中,当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。In the embodiment of the present application, when it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model. The two items are in different lists or in the same list on the target page, and the second item is before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
附图说明Description of the drawings
图1为人工智能主体框架的一种结构示意图;Figure 1 is a structural schematic diagram of the main framework of artificial intelligence;
图2a为本申请实施例提供的用户行为预测系统的一个结构示意图;Figure 2a is a schematic structural diagram of the user behavior prediction system provided by the embodiment of the present application;
图2b为本申请实施例提供的用户行为预测系统的另一结构示意图;Figure 2b is another structural schematic diagram of the user behavior prediction system provided by the embodiment of the present application;
图2c为本申请实施例提供的数据序列处理的相关设备的一个示意图;Figure 2c is a schematic diagram of related equipment for data sequence processing provided by the embodiment of the present application;
图3为本申请实施例提供的系统100架构的一个示意图;Figure 3 is a schematic diagram of the architecture of the system 100 provided by the embodiment of the present application;
图4为本申请实施例提供的有向无环图构建方法的一个流程示意图;Figure 4 is a schematic flow chart of a directed acyclic graph construction method provided by an embodiment of the present application;
图5为本申请实施例提供的目标页面的一个示意图;Figure 5 is a schematic diagram of the target page provided by the embodiment of the present application;
图6为本申请实施例提供的眼动仪的一个示意图;Figure 6 is a schematic diagram of an eye tracker provided by an embodiment of the present application;
图7为本申请实施例提供的有向无环图的一个示意图;Figure 7 is a schematic diagram of a directed acyclic graph provided by an embodiment of the present application;
图8为本申请实施例提供的用户行为预测方法的一个流程示意图;Figure 8 is a schematic flow chart of the user behavior prediction method provided by the embodiment of the present application;
图9为本申请实施例提供的目标模型的一个结构示意图; Figure 9 is a schematic structural diagram of the target model provided by the embodiment of the present application;
图10为本申请实施例提供的模型训练方法的一个流程示意图;Figure 10 is a schematic flow chart of the model training method provided by the embodiment of the present application;
图11为本申请实施例提供的用户行为预测装置的一个结构示意图;Figure 11 is a schematic structural diagram of a user behavior prediction device provided by an embodiment of the present application;
图12为本申请实施例提供的有向无环图构建装置的一个结构示意图;Figure 12 is a schematic structural diagram of a directed acyclic graph construction device provided by an embodiment of the present application;
图13为本申请实施例提供的模型训练装置的一个结构示意图;Figure 13 is a schematic structural diagram of the model training device provided by the embodiment of the present application;
图14为本申请实施例提供的执行设备的一个结构示意图;Figure 14 is a schematic structural diagram of an execution device provided by an embodiment of the present application;
图15为本申请实施例提供的训练设备的一个结构示意图;Figure 15 is a schematic structural diagram of the training equipment provided by the embodiment of the present application;
图16为本申请实施例提供的芯片的一个结构示意图。Figure 16 is a schematic structural diagram of a chip provided by an embodiment of the present application.
具体实施方式Detailed ways
本申请实施例提供了一种用户行为预测方法及其相关设备,可以令神经网络模型所得到的项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The embodiments of this application provide a user behavior prediction method and related equipment, which can make the probability of items being clicked by the user obtained by the neural network model have a higher accuracy, which is conducive to subsequent accurate recommendation of items of interest to the user. project.
本申请的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的术语在适当情况下可以互换,这仅仅是描述本申请的实施例中对相同属性的对象在描述时所采用的区分方式。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,以便包含一系列单元的过程、方法、系统、产品或设备不必限于那些单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它单元。The terms "first", "second", etc. in the description and claims of this application and the above-mentioned drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It should be understood that the terms so used are interchangeable under appropriate circumstances, and are merely a way of distinguishing objects with the same attributes in describing the embodiments of the present application. Furthermore, the terms "include" and "having" and any variations thereof, are intended to cover non-exclusive inclusions, such that a process, method, system, product or apparatus comprising a series of elements need not be limited to those elements, but may include not explicitly other elements specifically listed or inherent to such processes, methods, products or equipment.
随着计算机技术的快速发展,为了满足用户的上网需求,开发商越来越倾向于在页面上展现用户感兴趣的内容。基于此,针对于某个页面,往往需要预测用户会点击该页面上所展示的哪个或哪些项目,即预测用户针对该页面的行为,进而修改该页面上所需呈现的项目,以为用户推荐其感兴趣的项目。With the rapid development of computer technology, in order to meet users' Internet needs, developers are increasingly inclined to display content that users are interested in on their pages. Based on this, for a certain page, it is often necessary to predict which item or items displayed on the page the user will click on, that is, predict the user's behavior on the page, and then modify the items to be displayed on the page to recommend them to the user. Projects of interest.
一般地,项目在某个页面中的排布方式往往是以多个列表的形式呈现给用户观看的,即该页面通常包含多个列表,且每个列表中包含多个项目,在预测用户针对该页面的行为时,对于该页面中的任意一个项目,可利用AI技术领域中的神经网络模型来与该项目被用户点击的概率。例如,在某个应用商城的页面中,展示了多个横向列表和多个纵向排列,这多个横向列表和多个纵向排列交错排列,横向列表中的多个应用排成一行,纵向排列中的多个应用排成一列,如此一来,该页面则可以交错列表的形式,为用户展示各种应用的介绍信息。为了预测用户在针对该页面的点击行为,可以利用神经网络模型来逐个分析各个应用,从而得到用户点击该页面中各个应用的概率。Generally, the arrangement of items on a certain page is often presented to the user in the form of multiple lists, that is, the page usually contains multiple lists, and each list contains multiple items. When predicting the user's target When the page behaves, for any item on the page, the neural network model in the field of AI technology can be used to determine the probability of the item being clicked by the user. For example, on a page of an app mall, multiple horizontal lists and multiple vertical arrangements are displayed. These multiple horizontal lists and multiple vertical arrangements are staggered. Multiple applications in the horizontal list are arranged in a row, and in the vertical arrangement, Multiple applications are arranged in a row, so that the page can display the introduction information of various applications to the user in the form of a staggered list. In order to predict the user's clicking behavior on this page, the neural network model can be used to analyze each application one by one, thereby obtaining the probability that the user clicks on each application on the page.
然而,相关技术提供的神经网络模型在预测某个项目被用户点击的概率时,通常仅考虑该项目所在列表中的其余项目,对该项目所产生的影响。可见,相关技术所考虑的因素较为单一,导致模型最终得到的该项目被用户点击的概率,准确度往往较低,故后续无法精准为用户推荐其感兴趣的项目。However, when the neural network model provided by related technologies predicts the probability of a certain item being clicked by a user, it usually only considers the impact of the remaining items in the list where the item is located on the item. It can be seen that the factors considered by the relevant technology are relatively single, resulting in the probability that the item is clicked by the user finally obtained by the model, which is often less accurate. Therefore, it cannot accurately recommend items of interest to the user in the future.
进一步地,用户在该页面上的浏览行为往往较为复杂,例如,用户在浏览当前列表的项目时,直接跳转另一个列表(另一个列表与当前列表为非相邻的两个列表)的项目进行浏览。相关技术往往无法考虑到多种复杂的浏览行为所产生的影响,也会降低模型最终得到的项目被用户点击的概率的准确度。Furthermore, the user's browsing behavior on this page is often complicated. For example, when the user is browsing the items in the current list, he directly jumps to the items in another list (the other list and the current list are two non-adjacent lists). to browse. Related technologies often fail to take into account the impact of multiple complex browsing behaviors, and will also reduce the accuracy of the probability of an item being clicked by the user finally obtained by the model.
更进一步地,相关技术的模型在分析某个项目时,往往仅针对该项目自身的相关信息 (例如,设某个项目为某个应用,该应用的相关信息则包含该应用的开发商、类型、大小等等)进行分析,并未考虑用户等外界因素所产生的影响,也会降低模型最终得到的项目被用户点击的概率的准确度。Furthermore, when models of related technologies analyze a certain project, they often only focus on the relevant information of the project itself. (For example, assuming a project is an application, the relevant information of the application includes the developer, type, size, etc. of the application.) The analysis does not take into account the impact of external factors such as users, which will also degrade the model. The accuracy of the final probability of the item being clicked by the user.
为了解决该问题,本申请实施例提供了一种用户行为预测方法,该方法可结合人工智能(artificial intelligence,AI)技术实现。AI技术是利用数字计算机或者数字计算机控制的机器模拟、延伸和扩展人的智能的技术学科,AI技术通过感知环境、获取知识并使用知识获得最佳结果。换句话说,人工智能技术是计算机科学的一个分支,它企图了解智能的实质,并生产出一种新的能以人类智能相似的方式做出反应的智能机器。利用人工智能进行数据处理是人工智能常见的一个应用方式。In order to solve this problem, embodiments of the present application provide a user behavior prediction method, which can be implemented in conjunction with artificial intelligence (artificial intelligence, AI) technology. AI technology is a technical discipline that uses digital computers or machines controlled by digital computers to simulate, extend and expand human intelligence. AI technology obtains the best results by perceiving the environment, acquiring knowledge and using knowledge. In other words, artificial intelligence technology is a branch of computer science that attempts to understand the nature of intelligence and produce a new intelligent machine that can respond in a similar way to human intelligence. Using artificial intelligence for data processing is a common application method of artificial intelligence.
首先对人工智能系统总体工作流程进行描述,请参见图1,图1为人工智能主体框架的一种结构示意图,下面从“智能信息链”(水平轴)和“IT价值链”(垂直轴)两个维度对上述人工智能主题框架进行阐述。其中,“智能信息链”反映从数据的获取到处理的一列过程。举例来说,可以是智能信息感知、智能信息表示与形成、智能推理、智能决策、智能执行与输出的一般过程。在这个过程中,数据经历了“数据—信息—知识—智慧”的凝练过程。“IT价值链”从人智能的底层基础设施、信息(提供和处理技术实现)到系统的产业生态过程,反映人工智能为信息技术产业带来的价值。First, the overall workflow of the artificial intelligence system is described. Please refer to Figure 1. Figure 1 is a structural schematic diagram of the main framework of artificial intelligence. The following is from the "intelligent information chain" (horizontal axis) and "IT value chain" (vertical axis) The above artificial intelligence theme framework is elaborated on in two dimensions. Among them, the "intelligent information chain" reflects a series of processes from data acquisition to processing. For example, it can be the general process of intelligent information perception, intelligent information representation and formation, intelligent reasoning, intelligent decision-making, intelligent execution and output. In this process, the data has gone through the condensation process of "data-information-knowledge-wisdom". The "IT value chain" reflects the value that artificial intelligence brings to the information technology industry, from the underlying infrastructure of human intelligence and information (providing and processing technology implementation) to the systematic industrial ecological process.
(1)基础设施(1)Infrastructure
基础设施为人工智能系统提供计算能力支持,实现与外部世界的沟通,并通过基础平台实现支撑。通过传感器与外部沟通;计算能力由智能芯片(CPU、NPU、GPU、ASIC、FPGA等硬件加速芯片)提供;基础平台包括分布式计算框架及网络等相关的平台保障和支持,可以包括云存储和计算、互联互通网络等。举例来说,传感器和外部沟通获取数据,这些数据提供给基础平台提供的分布式计算系统中的智能芯片进行计算。Infrastructure provides computing power support for artificial intelligence systems, enables communication with the external world, and supports it through basic platforms. Communicate with the outside through sensors; computing power is provided by smart chips (hardware acceleration chips such as CPU, NPU, GPU, ASIC, FPGA, etc.); the basic platform includes distributed computing framework and network and other related platform guarantees and support, which can include cloud storage and Computing, interconnection networks, etc. For example, sensors communicate with the outside world to obtain data, which are provided to smart chips in the distributed computing system provided by the basic platform for calculation.
(2)数据(2)Data
基础设施的上一层的数据用于表示人工智能领域的数据来源。数据涉及到图形、图像、语音、文本,还涉及到传统设备的物联网数据,包括已有系统的业务数据以及力、位移、液位、温度、湿度等感知数据。Data from the upper layer of the infrastructure is used to represent data sources in the field of artificial intelligence. The data involves graphics, images, voice, and text, as well as IoT data of traditional devices, including business data of existing systems and sensory data such as force, displacement, liquid level, temperature, and humidity.
(3)数据处理(3)Data processing
数据处理通常包括数据训练,机器学习,深度学习,搜索,推理,决策等方式。Data processing usually includes data training, machine learning, deep learning, search, reasoning, decision-making and other methods.
其中,机器学习和深度学习可以对数据进行符号化和形式化的智能信息建模、抽取、预处理、训练等。Among them, machine learning and deep learning can perform symbolic and formal intelligent information modeling, extraction, preprocessing, training, etc. on data.
推理是指在计算机或智能系统中,模拟人类的智能推理方式,依据推理控制策略,利用形式化的信息进行机器思维和求解问题的过程,典型的功能是搜索与匹配。Reasoning refers to the process of simulating human intelligent reasoning in computers or intelligent systems, using formal information to perform machine thinking and problem solving based on reasoning control strategies. Typical functions are search and matching.
决策是指智能信息经过推理后进行决策的过程,通常提供分类、排序、预测等功能。Decision-making refers to the process of decision-making after intelligent information is reasoned, and usually provides functions such as classification, sorting, and prediction.
(4)通用能力(4) General ability
对数据经过上面提到的数据处理后,进一步基于数据处理的结果可以形成一些通用的能力,比如可以是算法或者一个通用系统,例如,翻译,文本的分析,计算机视觉的处理,语音识别,图像的识别等等。After the data is processed as mentioned above, some general capabilities can be formed based on the results of further data processing, such as algorithms or a general system, such as translation, text analysis, computer vision processing, speech recognition, and image processing. identification, etc.
(5)智能产品及行业应用 (5) Intelligent products and industry applications
智能产品及行业应用指人工智能系统在各领域的产品和应用,是对人工智能整体解决方案的封装,将智能信息决策产品化、实现落地应用,其应用领域主要包括:智能终端、智能交通、智能医疗、自动驾驶、智慧城市等。Intelligent products and industry applications refer to the products and applications of artificial intelligence systems in various fields. They are the encapsulation of overall artificial intelligence solutions, productizing intelligent information decision-making and realizing practical applications. Its application fields mainly include: intelligent terminals, intelligent transportation, Smart healthcare, autonomous driving, smart cities, etc.
接下来介绍几种本申请的应用场景。Next, several application scenarios of this application will be introduced.
图2a为本申请实施例提供的用户行为预测系统的一个结构示意图,该用户行为预测系统包括用户设备以及数据处理设备。其中,用户设备包括手机、个人电脑或者信息处理中心等智能终端。用户设备为针对页面的用户行为预测的发起端,作为用户行为预测请求的发起方,通常由用户通过用户设备发起请求。Figure 2a is a schematic structural diagram of a user behavior prediction system provided by an embodiment of the present application. The user behavior prediction system includes user equipment and data processing equipment. Among them, user equipment includes smart terminals such as mobile phones, personal computers, or information processing centers. The user device is the initiator of user behavior prediction for the page. As the initiator of the user behavior prediction request, the user usually initiates the request through the user device.
上述数据处理设备可以是云服务器、网络服务器、应用服务器以及管理服务器等具有数据处理功能的设备或服务器。数据处理设备通过交互接口接收来自智能终端对阵页面的用户行为预测请求,再通过存储数据的存储器以及数据处理的处理器环节进行机器学习,深度学习,搜索,推理,决策等方式的页面处理。数据处理设备中的存储器可以是一个统称,包括本地存储以及存储历史数据的数据库,数据库可以在数据处理设备上,也可以在其它网络服务器上。The above-mentioned data processing equipment may be a cloud server, a network server, an application server, a management server, and other equipment or servers with data processing functions. The data processing device receives the user behavior prediction request from the smart terminal for the page through the interactive interface, and then performs page processing through machine learning, deep learning, search, reasoning, decision-making and other methods through the memory for storing data and the processor for data processing. The memory in the data processing device can be a general term, including local storage and a database that stores historical data. The database can be on the data processing device or on other network servers.
在图2a所示的用户行为预测系统中,用户设备可以接收用户的指令,例如用户设备可以获取用户输入/选择的一个页面,然后向数据处理设备发起请求,使得数据处理设备针对用户设备得到的该页面执行用户行为预测应用,从而得到针对该页面的处理结果。示例性的,用户设备可以获取用户输入的一个页面,然后向数据处理设备发起页面的用户行为预测请求,使得数据处理设备对该页面中各个项目的特征进行处理,从而得到该页面的处理结果,即该页面中各个项目被用户点击的概率。In the user behavior prediction system shown in Figure 2a, the user device can receive instructions from the user. For example, the user device can obtain a page input/selected by the user, and then initiate a request to the data processing device, so that the data processing device can respond to the information obtained by the user device. This page executes the user behavior prediction application to obtain the processing results for this page. For example, the user device can obtain a page input by the user, and then initiate a user behavior prediction request for the page to the data processing device, so that the data processing device processes the characteristics of each item in the page, thereby obtaining the processing result of the page. That is, the probability that each item on the page is clicked by the user.
在图2a中,数据处理设备可以执行本申请实施例的有向无环图构建方法以及用户行为预测方法。In Figure 2a, the data processing device can execute the directed acyclic graph construction method and the user behavior prediction method of the embodiment of the present application.
图2b为本申请实施例提供的用户行为预测系统的另一结构示意图,在图2b中,用户设备自身就可以执行用户行为预测应用,该用户设备能够直接获取来自用户的输入并直接由用户设备本身的硬件进行处理,具体过程与图2a相似,可参考上面的描述,在此不再赘述。Figure 2b is another schematic structural diagram of a user behavior prediction system provided by an embodiment of the present application. In Figure 2b, the user equipment itself can execute the user behavior prediction application. The user equipment can directly obtain input from the user and directly obtain input from the user equipment. The hardware itself is used for processing. The specific process is similar to Figure 2a. Please refer to the above description and will not be repeated here.
在图2b所示的用户行为预测系统中,用户设备可以接收用户的指令,例如用户设备可以获取用户在用户设备中所选择的一个页面,然后再由用户设备自身针对该页面中各个项目的特征进行处理,从而得到该页面的处理结果,即该页面中各个项目被用户点击的概率。In the user behavior prediction system shown in Figure 2b, the user device can receive instructions from the user. For example, the user device can obtain a page selected by the user on the user device, and then the user device itself can target the characteristics of each item in the page. Perform processing to obtain the processing result of the page, that is, the probability of each item on the page being clicked by the user.
在图2b中,用户设备自身就可以执行本申请实施例的有向无环图构建方法以及用户行为预测方法。In Figure 2b, the user equipment itself can execute the directed acyclic graph construction method and user behavior prediction method in the embodiment of the present application.
图2c为本申请实施例提供的用户行为预测处理的相关设备的一个示意图。Figure 2c is a schematic diagram of related equipment for user behavior prediction processing provided by the embodiment of the present application.
上述图2a和图2b中的用户设备具体可以是图2c中的本地设备301或者本地设备302,图2a中的数据处理设备具体可以是图2c中的执行设备210,其中,数据存储系统250可以存储执行设备210的待处理数据,数据存储系统250可以集成在执行设备210上,也可以设置在云上或其它网络服务器上。The user equipment in Figure 2a and Figure 2b can be the local device 301 or the local device 302 in Figure 2c, and the data processing device in Figure 2a can be the execution device 210 in Figure 2c, where the data storage system 250 can To store the data to be processed by the execution device 210, the data storage system 250 can be integrated on the execution device 210, or can be set up on the cloud or other network servers.
图2a和图2b中的处理器可以通过神经网络模型或者其它模型(例如,基于支持向量机的模型)进行数据训练/机器学习/深度学习,并利用数据最终训练或者学习得到的模型针对页面执行用户行为预测应用,从而得到相应的处理结果。 The processors in Figure 2a and Figure 2b can perform data training/machine learning/deep learning through neural network models or other models (for example, models based on support vector machines), and use the data to ultimately train or learn the model to execute on the page User behavior prediction application to obtain corresponding processing results.
图3为本申请实施例提供的系统100架构的一个示意图,在图3中,执行设备110配置输入/输出(input/output,I/O)接口112,用于与外部设备进行数据交互,用户可以通过客户设备140向I/O接口112输入数据,所述输入数据在本申请实施例中可以包括:各个待调度任务、可调用资源以及其他参数。Figure 3 is a schematic diagram of the architecture of the system 100 provided by the embodiment of the present application. In Figure 3, the execution device 110 is configured with an input/output (I/O) interface 112 for data interaction with external devices. The user Data can be input to the I/O interface 112 through the client device 140. In this embodiment of the present application, the input data may include: various to-be-scheduled tasks, callable resources, and other parameters.
在执行设备110对输入数据进行预处理,或者在执行设备110的计算模块111执行计算等相关的处理(比如进行本申请中神经网络的功能实现)过程中,执行设备110可以调用数据存储系统150中的数据、代码等以用于相应的处理,也可以将相应处理得到的数据、指令等存入数据存储系统150中。When the execution device 110 preprocesses the input data, or when the calculation module 111 of the execution device 110 performs calculation and other related processing (such as implementing the function of the neural network in this application), the execution device 110 can call the data storage system 150 The data, codes, etc. in the system can be used for corresponding processing, and the data, instructions, etc. obtained by corresponding processing can also be stored in the data storage system 150 .
最后,I/O接口112将处理结果返回给客户设备140,从而提供给用户。Finally, the I/O interface 112 returns the processing results to the client device 140, thereby providing them to the user.
值得说明的是,训练设备120可以针对不同的目标或称不同的任务,基于不同的训练数据生成相应的目标模型/规则,该相应的目标模型/规则即可以用于实现上述目标或完成上述任务,从而为用户提供所需的结果。其中,训练数据可以存储在数据库130中,且来自于数据采集设备160采集的训练样本。It is worth mentioning that the training device 120 can generate corresponding target models/rules based on different training data for different goals or different tasks, and the corresponding target models/rules can be used to achieve the above goals or complete the above tasks. , thereby providing users with the desired results. The training data may be stored in the database 130 and come from training samples collected by the data collection device 160 .
在图3中所示情况下,用户可以手动给定输入数据,该手动给定可以通过I/O接口112提供的界面进行操作。另一种情况下,客户设备140可以自动地向I/O接口112发送输入数据,如果要求客户设备140自动发送输入数据需要获得用户的授权,则用户可以在客户设备140中设置相应权限。用户可以在客户设备140查看执行设备110输出的结果,具体的呈现形式可以是显示、声音、动作等具体方式。客户设备140也可以作为数据采集端,采集如图所示输入I/O接口112的输入数据及输出I/O接口112的输出结果作为新的样本数据,并存入数据库130。当然,也可以不经过客户设备140进行采集,而是由I/O接口112直接将如图所示输入I/O接口112的输入数据及输出I/O接口112的输出结果,作为新的样本数据存入数据库130。In the case shown in FIG. 3 , the user can manually enter the input data, and the manual input can be operated through the interface provided by the I/O interface 112 . In another case, the client device 140 can automatically send input data to the I/O interface 112. If requiring the client device 140 to automatically send input data requires the user's authorization, the user can set corresponding permissions in the client device 140. The user can view the results output by the execution device 110 on the client device 140, and the specific presentation form may be display, sound, action, etc. The client device 140 can also be used as a data collection end to collect the input data of the input I/O interface 112 and the output results of the output I/O interface 112 as new sample data, and store them in the database 130 . Of course, it is also possible to collect without going through the client device 140. Instead, the I/O interface 112 directly uses the input data input to the I/O interface 112 and the output result of the output I/O interface 112 as a new sample as shown in the figure. The data is stored in database 130.
值得注意的是,图3仅是本申请实施例提供的一种系统架构的示意图,图中所示设备、器件、模块等之间的位置关系不构成任何限制,例如,在图3中,数据存储系统150相对执行设备110是外部存储器,在其它情况下,也可以将数据存储系统150置于执行设备110中。如图3所示,可以根据训练设备120训练得到神经网络。It is worth noting that Figure 3 is only a schematic diagram of a system architecture provided by an embodiment of the present application. The positional relationship between the devices, devices, modules, etc. shown in the figure does not constitute any limitation. For example, in Figure 3, the data The storage system 150 is an external memory relative to the execution device 110. In other cases, the data storage system 150 can also be placed in the execution device 110. As shown in Figure 3, the neural network can be trained according to the training device 120.
本申请实施例还提供的一种芯片,该芯片包括神经网络处理器NPU。该芯片可以被设置在如图3所示的执行设备110中,用以完成计算模块111的计算工作。该芯片也可以被设置在如图3所示的训练设备120中,用以完成训练设备120的训练工作并输出目标模型/规则。An embodiment of the present application also provides a chip, which includes a neural network processor NPU. The chip can be disposed in the execution device 110 as shown in FIG. 3 to complete the calculation work of the calculation module 111. The chip can also be installed in the training device 120 as shown in Figure 3 to complete the training work of the training device 120 and output the target model/rules.
神经网络处理器NPU,NPU作为协处理器挂载到主中央处理器(central processing unit,CPU)(host CPU)上,由主CPU分配任务。NPU的核心部分为运算电路,控制器控制运算电路提取存储器(权重存储器或输入存储器)中的数据并进行运算。Neural network processor NPU, NPU is mounted on the main central processing unit (CPU) (host CPU) as a co-processor, and the main CPU allocates tasks. The core part of the NPU is the arithmetic circuit. The controller controls the arithmetic circuit to extract the data in the memory (weight memory or input memory) and perform operations.
在一些实现中,运算电路内部包括多个处理单元(process engine,PE)。在一些实现中,运算电路是二维脉动阵列。运算电路还可以是一维脉动阵列或者能够执行例如乘法和加法这样的数学运算的其它电子线路。在一些实现中,运算电路是通用的矩阵处理器。In some implementations, the computing circuit includes multiple processing units (PE). In some implementations, the arithmetic circuit is a two-dimensional systolic array. The arithmetic circuit may also be a one-dimensional systolic array or other electronic circuit capable of performing mathematical operations such as multiplication and addition. In some implementations, the arithmetic circuit is a general-purpose matrix processor.
举例来说,假设有输入矩阵A,权重矩阵B,输出矩阵C。运算电路从权重存储器中取矩阵B相应的数据,并缓存在运算电路中每一个PE上。运算电路从输入存储器中取矩阵A数据与矩阵B进行矩阵运算,得到的矩阵的部分结果或最终结果,保存在累加器(accumulator)中。For example, assume there is an input matrix A, a weight matrix B, and an output matrix C. The arithmetic circuit fetches the corresponding data of matrix B from the weight memory and caches it on each PE in the arithmetic circuit. The operation circuit takes matrix A data and matrix B from the input memory to perform matrix operations, and the partial result or final result of the obtained matrix is stored in the accumulator (accumulator).
向量计算单元可以对运算电路的输出做进一步处理,如向量乘,向量加,指数运算,对 数运算,大小比较等等。例如,向量计算单元可以用于神经网络中非卷积/非FC层的网络计算,如池化(pooling),批归一化(batch normalization),局部响应归一化(local response normalization)等。The vector calculation unit can further process the output of the arithmetic circuit, such as vector multiplication, vector addition, exponential operation, etc. Numerical operations, size comparison, etc. For example, the vector computing unit can be used for network calculations in non-convolutional/non-FC layers in neural networks, such as pooling, batch normalization, local response normalization, etc.
在一些实现中,向量计算单元能将经处理的输出的向量存储到统一缓存器。例如,向量计算单元可以将非线性函数应用到运算电路的输出,例如累加值的向量,用以生成激活值。在一些实现中,向量计算单元生成归一化的值、合并值,或二者均有。在一些实现中,处理过的输出的向量能够用作到运算电路的激活输入,例如用于在神经网络中的后续层中的使用。In some implementations, the vector computation unit can store the processed output vectors into a unified buffer. For example, the vector calculation unit may apply a nonlinear function to the output of the arithmetic circuit, such as a vector of accumulated values, to generate activation values. In some implementations, the vector computation unit generates normalized values, merged values, or both. In some implementations, the processed output vector can be used as an activation input to an arithmetic circuit, such as for use in a subsequent layer in a neural network.
统一存储器用于存放输入数据以及输出数据。Unified memory is used to store input data and output data.
权重数据直接通过存储单元访问控制器(direct memory access controller,DMAC)将外部存储器中的输入数据搬运到输入存储器和/或统一存储器、将外部存储器中的权重数据存入权重存储器,以及将统一存储器中的数据存入外部存储器。The weight data directly transfers the input data in the external memory to the input memory and/or the unified memory through the storage unit access controller (direct memory access controller, DMAC), stores the weight data in the external memory into the weight memory, and transfers the weight data to the unified memory. The data in is stored in external memory.
总线接口单元(bus interface unit,BIU),用于通过总线实现主CPU、DMAC和取指存储器之间进行交互。The bus interface unit (BIU) is used to realize the interaction between the main CPU, DMAC and instruction memory through the bus.
与控制器连接的取指存储器(instruction fetch buffer),用于存储控制器使用的指令;The instruction fetch buffer connected to the controller is used to store instructions used by the controller;
控制器,用于调用指存储器中缓存的指令,实现控制该运算加速器的工作过程。The controller is used to call instructions cached in the memory to control the working process of the computing accelerator.
一般地,统一存储器,输入存储器,权重存储器以及取指存储器均为片上(On-Chip)存储器,外部存储器为该NPU外部的存储器,该外部存储器可以为双倍数据率同步动态随机存储器(double data rate synchronous dynamic random access memory,DDR SDRAM)、高带宽存储器(high bandwidth memory,HBM)或其他可读可写的存储器。Generally, the unified memory, input memory, weight memory and instruction memory are all on-chip memories, and the external memory is the memory outside the NPU. The external memory can be double data rate synchronous dynamic random access memory (double data). rate synchronous dynamic random access memory (DDR SDRAM), high bandwidth memory (high bandwidth memory (HBM)) or other readable and writable memory.
由于本申请实施例涉及大量神经网络的应用,为了便于理解,下面先对本申请实施例涉及的相关术语及神经网络等相关概念进行介绍。Since the embodiments of the present application involve the application of a large number of neural networks, in order to facilitate understanding, the relevant terms involved in the embodiments of the present application and related concepts such as neural networks are first introduced below.
(1)神经网络(1)Neural network
神经网络可以是由神经单元组成的,神经单元可以是指以xs和截距1为输入的运算单元,该运算单元的输出可以为:
The neural network can be composed of neural units. The neural unit can refer to an arithmetic unit that takes xs and intercept 1 as input. The output of the arithmetic unit can be:
其中,s=1、2、……n,n为大于1的自然数,Ws为xs的权重,b为神经单元的偏置。f为神经单元的激活函数(activation functions),用于将非线性特性引入神经网络中,来将神经单元中的输入信号转换为输出信号。该激活函数的输出信号可以作为下一层卷积层的输入。激活函数可以是sigmoid函数。神经网络是将许多个上述单一的神经单元联结在一起形成的网络,即一个神经单元的输出可以是另一个神经单元的输入。每个神经单元的输入可以与前一层的局部接受域相连,来提取局部接受域的特征,局部接受域可以是由若干个神经单元组成的区域。Among them, s=1, 2,...n, n is a natural number greater than 1, Ws is the weight of xs, and b is the bias of the neural unit. f is the activation function of the neural unit, which is used to introduce nonlinear characteristics into the neural network to convert the input signal in the neural unit into an output signal. The output signal of this activation function can be used as the input of the next convolutional layer. The activation function can be a sigmoid function. A neural network is a network formed by connecting many of the above-mentioned single neural units together, that is, the output of one neural unit can be the input of another neural unit. The input of each neural unit can be connected to the local receptive field of the previous layer to extract the features of the local receptive field. The local receptive field can be an area composed of several neural units.
神经网络中的每一层的工作可以用数学表达式y=a(Wx+b)来描述:从物理层面神经网络中的每一层的工作可以理解为通过五种对输入空间(输入向量的集合)的操作,完成输入空间到输出空间的变换(即矩阵的行空间到列空间),这五种操作包括:1、升维/降维;2、 放大/缩小;3、旋转;4、平移;5、“弯曲”。其中1、2、3的操作由Wx完成,4的操作由+b完成,5的操作则由a()来实现。这里之所以用“空间”二字来表述是因为被分类的对象并不是单个事物,而是一类事物,空间是指这类事物所有个体的集合。其中,W是权重向量,该向量中的每一个值表示该层神经网络中的一个神经元的权重值。该向量W决定着上文所述的输入空间到输出空间的空间变换,即每一层的权重W控制着如何变换空间。训练神经网络的目的,也就是最终得到训练好的神经网络的所有层的权重矩阵(由很多层的向量W形成的权重矩阵)。因此,神经网络的训练过程本质上就是学习控制空间变换的方式,更具体的就是学习权重矩阵。The work of each layer in the neural network can be described by the mathematical expression y=a(Wx+b): From the physical level, the work of each layer in the neural network can be understood as five pairs of input spaces (input vectors) Set) operations to complete the transformation from input space to output space (i.e., row space to column space of the matrix). These five operations include: 1. Dimension raising/dimension reduction; 2. Zoom in/out; 3. Rotate; 4. Translate; 5. "Bend". Among them, the operations of 1, 2, and 3 are completed by Wx, the operation of 4 is completed by +b, and the operation of 5 is implemented by a(). The reason why the word "space" is used here is because the object to be classified is not a single thing, but a class of things. Space refers to the collection of all individuals of this type of thing. Among them, W is a weight vector, and each value in the vector represents the weight value of a neuron in the neural network of this layer. This vector W determines the spatial transformation from the input space to the output space described above, that is, the weight W of each layer controls how to transform the space. The purpose of training a neural network is to finally obtain the weight matrix of all layers of the trained neural network (a weight matrix formed by the vector W of many layers). Therefore, the training process of neural network is essentially to learn how to control spatial transformation, and more specifically, to learn the weight matrix.
因为希望神经网络的输出尽可能的接近真正想要预测的值,所以可以通过比较当前网络的预测值和真正想要的目标值,再根据两者之间的差异情况来更新每一层神经网络的权重向量(当然,在第一次更新之前通常会有初始化的过程,即为神经网络中的各层预先配置参数),比如,如果网络的预测值高了,就调整权重向量让它预测低一些,不断的调整,直到神经网络能够预测出真正想要的目标值。因此,就需要预先定义“如何比较预测值和目标值之间的差异”,这便是损失函数(loss function)或目标函数(objective function),它们是用于衡量预测值和目标值的差异的重要方程。其中,以损失函数举例,损失函数的输出值(loss)越高表示差异越大,那么神经网络的训练就变成了尽可能缩小这个loss的过程。Because you want the output of the neural network to be as close as possible to the value you really want to predict, you can compare the predicted value of the current network with the really desired target value, and then update each layer of the neural network based on the difference between the two. weight vector (of course, there is usually an initialization process before the first update, that is, pre-configuring parameters for each layer in the neural network). For example, if the predicted value of the network is high, adjust the weight vector to make it predict lower Some, constant adjustments are made until the neural network can predict the truly desired target value. Therefore, it is necessary to define in advance "how to compare the difference between the predicted value and the target value". This is the loss function (loss function) or objective function (objective function), which is used to measure the difference between the predicted value and the target value. Important equations. Among them, taking the loss function as an example, the higher the output value (loss) of the loss function, the greater the difference. Then the training of the neural network becomes a process of reducing this loss as much as possible.
(2)反向传播算法(2)Back propagation algorithm
神经网络可以采用误差反向传播(back propagation,BP)算法在训练过程中修正初始的神经网络模型中参数的大小,使得神经网络模型的重建误差损失越来越小。具体地,前向传递输入信号直至输出会产生误差损失,通过反向传播误差损失信息来更新初始的神经网络模型中参数,从而使误差损失收敛。反向传播算法是以误差损失为主导的反向传播运动,旨在得到最优的神经网络模型的参数,例如权重矩阵。The neural network can use the error back propagation (BP) algorithm to modify the size of the parameters in the initial neural network model during the training process, so that the reconstruction error loss of the neural network model becomes smaller and smaller. Specifically, forward propagation of the input signal until the output will produce an error loss, and the parameters in the initial neural network model are updated by backpropagating the error loss information, so that the error loss converges. The backpropagation algorithm is a backpropagation movement dominated by error loss, aiming to obtain the optimal parameters of the neural network model, such as the weight matrix.
下面从神经网络的训练侧和神经网络的应用侧对本申请提供的方法进行描述。The method provided by this application is described below from the training side of the neural network and the application side of the neural network.
本申请实施例提供的模型训练方法,涉及数据序列的处理,具体可以应用于数据训练、机器学习、深度学习等方法,对训练数据(例如,将本申请中待处理页面的第一项目的第一特征等等)进行符号化和形式化的智能信息建模、抽取、预处理、训练等,最终得到训练好的神经网络(如本申请中的目标模型);并且,本申请实施例提供的用户行为预测方法可以运用上述训练好的神经网络,将输入数据(例如,将本申请中目标页面的第一项目的第一特征等等)输入到所述训练好的神经网络中,得到输出数据(如本申请提供的用户行为预测方法中,第一项目被用户点击的概率等等)。需要说明的是,本申请实施例提供的模型训练方法和用户行为预测方法是基于同一个构思产生的发明,也可以理解为一个系统中的两个部分,或一个整体流程的两个阶段:如模型训练阶段和模型应用阶段。The model training method provided by the embodiment of this application involves the processing of data sequences, and can be specifically applied to data training, machine learning, deep learning and other methods. For training data (for example, the first item of the page to be processed in this application is (features, etc.) to perform symbolic and formalized intelligent information modeling, extraction, preprocessing, training, etc., and finally obtain a trained neural network (such as the target model in this application); and, provided by the embodiments of this application The user behavior prediction method can use the above-mentioned trained neural network to input input data (for example, the first feature of the first item of the target page in this application, etc.) into the trained neural network to obtain output data. (For example, in the user behavior prediction method provided by this application, the probability of the first item being clicked by the user, etc.). It should be noted that the model training method and the user behavior prediction method provided in the embodiments of this application are inventions based on the same concept, and can also be understood as two parts of a system, or two stages of an overall process: such as Model training phase and model application phase.
值得注意的是,在对目标页面进行用户行为预测之前,可先构造针对目标页面的有向无环图,下文先对向无环图的构建过程进行介绍。图4为本申请实施例提供的有向无环图构建方法的一个流程示意图,如图4所示,该方法包括:It is worth noting that before predicting user behavior on the target page, a directed acyclic graph for the target page can be constructed first. The construction process of the directed acyclic graph is introduced below. Figure 4 is a schematic flow chart of a directed acyclic graph construction method provided by an embodiment of the present application. As shown in Figure 4, the method includes:
401、获取用户浏览目标页面的眼动数据。401. Obtain the eye movement data of the user browsing the target page.
本实施例中,当需要对目标页面进行用户行为预测时,可先获取目标页面,目标页面包含多个列表,对于任意一个列表而言,该列表包含多个项目,且这多个项目按照一定的顺序 进行排列(比如,若这多个项目排成一行,则该列表为横向列表,若着多个项目排成一列,则该列表为纵向列表)。例如,如图5所示(图5为本申请实施例提供的目标页面的一个示意图),设目标页面为一个应用商城的展示页面,该页面可向用户展示多个应用的信息,以便用户在该页面上浏览并下载自身所需的应用。该页面包含纵向列表B1、纵向列表B3、纵向列表B5、横向列表B2以及横向列表B4,这3个纵向列表和2个横向列表交错排列,即按纵向列表B1、横向列表B2、纵向列表B3、横向列表B4、纵向列表B5的顺序进行佩列。其中,横向列表B2和B4包含5个应用,纵向列表B1和B3包含3个应用。如此一来,该页面可向用户展示19个应用的信息。In this embodiment, when it is necessary to predict user behavior on the target page, the target page can be obtained first. The target page contains multiple lists. For any list, the list contains multiple items, and the multiple items are arranged according to a certain Order Arrange (for example, if multiple items are arranged in a row, the list is a horizontal list, if multiple items are arranged in a column, the list is a vertical list). For example, as shown in Figure 5 (Figure 5 is a schematic diagram of a target page provided by an embodiment of the present application), assume that the target page is a display page of an application mall. This page can display information about multiple applications to the user, so that the user can Browse and download the applications you need on this page. This page contains vertical list B 1 , vertical list B 3 , vertical list B 5 , horizontal list B 2 and horizontal list B 4 . These three vertical lists and two horizontal lists are staggered, that is, according to the vertical list B 1 and the horizontal list B 2, vertical list B 3 , horizontal list B 4 , and vertical list B 5 are arranged in this order. Among them, horizontal lists B 2 and B 4 contain 5 applications, and vertical lists B 1 and B 3 contain 3 applications. In this way, the page can display information about 19 applications to users.
得到目标页面后,可邀请至少一位用户来浏览目标页面,并获取这些用户浏览目标页面时所产生的眼动数据。After obtaining the target page, you can invite at least one user to browse the target page, and obtain the eye movement data generated by these users when browsing the target page.
具体地,可通过以下方式来获取用户浏览目标页面的眼动数据:Specifically, the eye movement data of users browsing the target page can be obtained through the following methods:
如图6所示(图6为本申请实施例提供的眼动仪的一个示意图),在用于显示目标页面的用户设备附近,可部署面向用户的眼动仪,眼动仪可与用户设备电性连接。此外,还可部署有辅助工具,该辅助工具用于稳定用户的头部,使得眼动仪可以准确追踪用户的视线,且为了更好地模拟用户与用户设备之间的交互环境,可以设定用户与用户设备的屏幕之间的距离(例如,该距离可以设置为30-40cm等等),以及用户设备的倾斜角度(例如,该倾斜角度可以设置为65°-70°等等)。As shown in Figure 6 (Figure 6 is a schematic diagram of an eye tracker provided by an embodiment of the present application), a user-oriented eye tracker can be deployed near the user equipment used to display the target page, and the eye tracker can be connected to the user equipment. Electrical connection. In addition, auxiliary tools can also be deployed. This auxiliary tool is used to stabilize the user's head so that the eye tracker can accurately track the user's line of sight. In order to better simulate the interactive environment between the user and the user device, it can be set The distance between the user and the screen of the user equipment (for example, the distance can be set to 30-40cm, etc.), and the tilt angle of the user equipment (for example, the tilt angle can be set to 65°-70°, etc.).
当用户借助辅助工具开始浏览用户设备显示的目标页面时,眼动仪可追踪并记录用户在目标页面上的视线位置和视线移动方式,生成用户浏览目标页面的眼动数据,并发送至用户设备。如此一来,用户设备则成功获取了用户浏览目标页面的眼动数据。When the user starts browsing the target page displayed on the user device with the help of assistive tools, the eye tracker can track and record the user's gaze position and gaze movement on the target page, generate eye movement data of the user browsing the target page, and send it to the user device . In this way, the user device successfully obtains the eye movement data of the user browsing the target page.
应理解,此处的用户设备可以为前述图2a或图2b所示系统中的用户设备,那么,用户设备在获取用户浏览目标页面的眼动数据后,可自行对眼动数据进行分析并构建针对目标页面的有向无环图,也可将眼动数据发送至图2a或图2b所示系统中的数据处理设备,以使得数据处理设备对眼动数据进行分析并构建针对目标页面的有向无环图,后续不再赘述。It should be understood that the user equipment here can be the user equipment in the system shown in Figure 2a or Figure 2b. Then, after obtaining the eye movement data of the user browsing the target page, the user equipment can analyze and construct the eye movement data on its own. For the directed acyclic graph of the target page, the eye movement data can also be sent to the data processing device in the system shown in Figure 2a or Figure 2b, so that the data processing equipment can analyze the eye movement data and build an effective target page. Towards acyclic graph, we will not go into details later.
402、基于眼动数据,确定用户对多个项目的浏览行为,多个项目位于目标页面的多个列表中。402. Based on the eye movement data, determine the user's browsing behavior for multiple items, and the multiple items are located in multiple lists on the target page.
得到用户浏览目标页面的眼动数据后,可对眼动数据进行分析,从而得到用户对目标页面中多个项目的浏览行为,这多个项目分别位于目标页面的多个列表中,故这多个项目可以表示为集合i=[i1,1,i1,2,...,io,q],其中,it,j表示目标页面中第t个列表中的第j个项目,t=1,...,o,j=1,...,q。After obtaining the eye movement data of the user browsing the target page, the eye movement data can be analyzed to obtain the user's browsing behavior of multiple items in the target page. These multiple items are located in multiple lists on the target page, so there are many items can be expressed as a set i=[i 1,1 ,i 1,2 ,..., io,q ], where i t,j represents the j-th item in the t-th list in the target page, t=1,...,o, j=1,...,q.
具体地,得到用户浏览目标页面的眼动数据,可基于眼动数据进行以下分析:Specifically, after obtaining the eye movement data of the user browsing the target page, the following analysis can be performed based on the eye movement data:
(1)在集合i中任取位置连续的三个项目,分别称为项目A、项目B和项目C。在目标页面中,项目A位于项目B之前且项目A与项目B相邻(项目A和项目B可以是位于相邻两个列表中的两个项目,即tA<tB,例如,项目A为目标页面中第1个列表的最后1个项目,项目B为目标页面中第2个列表的第1个项目等等。项目A和项目B还可以是位于同一列表中的两个项目,且项目A的位置更靠前,即tA=tB,且jA<jB,例如,项目A为目标页面中第1个列表的第1个项目,项目B为目标页面中第1个列表的第2个项目等等),项目B位于项目C之前且项目B与项目C相邻。 (1) Take any three items with consecutive positions in set i, and call them item A, item B and item C respectively. In the target page, item A is before item B and item A is adjacent to item B (item A and item B can be two items located in two adjacent lists, that is, t A < t B , for example, item A is the last item of the first list on the target page, item B is the first item of the second list on the target page, etc. Item A and item B can also be two items in the same list, and The position of item A is higher, that is, t A = t B , and j A < j B . For example, item A is the first item of the first list on the target page, and item B is the first list on the target page. the 2nd item and so on), item B is before item C and item B is adjacent to item C.
基于眼动数据,可统计用户对项目A、项目B和项目C的浏览顺序,统计结果如表1所示:Based on the eye movement data, the user's browsing order of Item A, Item B and Item C can be counted. The statistical results are shown in Table 1:
表1
Table 1
基于表1可知,项目A→项目B→项目C的浏览顺序占了最大比重,说明用户浏览在浏览目标页面的多个项目时,主要是按照这多个项目在目标页面中的顺序(排序)来进行浏览的,这种浏览行为可称为顺序浏览行为(sequential examination),顺序浏览行为包含两大类,下文将分别进行介绍:Based on Table 1, it can be seen that the browsing order of item A→item B→item C accounts for the largest proportion, indicating that when users browse multiple items on the target page, they mainly follow the order (sorting) of these multiple items on the target page. To browse, this browsing behavior can be called sequential examination. Sequential browsing behavior includes two categories, which will be introduced separately below:
(1.1)第一类顺序浏览行为,指对于目标页面的任意一个列表而言,若该列表为横向列表,则按从左到右的顺序来浏览该列表中的所有项目,若该列表为纵向列表,则按从上到下的顺序来浏览该列表中的所有项目。依旧如图5所示的例子,对于列表B2,第一类顺序浏览行为是:按i2,1→i2,2→i2,3→i2,4→i2,5的顺序,来浏览列表B2中i2,1、i2,2、i2,3、i2,4以及i2,5这5个项目。对于列表B1,第一类顺序浏览行为是:按i1,1→i1,2→i1,3的顺序,来浏览列表B1中i1,1、i1,2以及i1,3这3个项目。(1.1) The first type of sequential browsing behavior refers to any list on the target page. If the list is a horizontal list, all items in the list will be browsed in order from left to right. If the list is vertical list, browse all items in the list in order from top to bottom. Still using the example shown in Figure 5, for list B 2 , the first type of sequential browsing behavior is: in the order of i 2,1 →i 2,2 →i 2,3 →i 2,4 →i 2,5 , Let's browse the five items i 2,1 , i 2,2 , i 2,3 , i 2,4 and i 2,5 in list B 2 . For list B 1 , the first type of sequential browsing behavior is: browsing i 1,1 , i 1,2 and i 1 in list B 1 in the order of i 1,1 → i 1,2 →i 1,3 3 these 3 items.
(1.2)第二类顺序浏览行为,指对于目标页面中相邻的两个列表而言,按这两个列表中相邻的项目的前后顺序,来浏览这些相邻的项目。需要说明的是,若这两个列表中,前一个列表为纵向列表,后一个列表为横向列表,两个列表中相邻的项目包括纵向列表的最后一个项目以及横向列表的所有项目,即横向列表的所有项目均可视为与纵向列表的最后一个项目相邻的项目。若这两个列表中,前一个列表为横向列表,后一个列表为纵向列表,两个列表中相邻的项目包括纵向列表的第一个项目以及横向列表的所有项目,即横向列表的所有项目均可视为与纵向列表的第一个项目相邻的项目。依旧如图5所示的例子,对于列表B1和B2,第二类顺序浏览行为是:按i1,3→i2,1、i1,3→i2,2、i1,3→i2,3、i1,3→i2,4、i1,3→i2,5的顺序,来浏览列表B1和B2中i1,3、i2,1、i2,2、i2,3、i2,4以及i2,5这6个相邻的项目。对于列表B2和B3,第二类顺序浏览行为是:按i2,1→i3,1、i2,2→i3,1、i2,3→i3,1、i2,4→i3,1、i2,5→i3,1的顺序,来浏览列表B2和B3中i2,1、i2,2、i2,3、i2,4、i2,5以及i3,1这6个相邻的项目。(1.2) The second type of sequential browsing behavior means that for two adjacent lists on the target page, the adjacent items in the two lists are browsed in the order of front and back. It should be noted that if among the two lists, the former list is a vertical list and the latter list is a horizontal list, the adjacent items in the two lists include the last item in the vertical list and all the items in the horizontal list, that is, horizontal All items of a list can be considered items adjacent to the last item of the vertical list. If among the two lists, the former list is a horizontal list and the latter list is a vertical list, the adjacent items in the two lists include the first item of the vertical list and all the items of the horizontal list, that is, all the items of the horizontal list. Both can be considered as items adjacent to the first item in the vertical list. Still using the example shown in Figure 5, for lists B 1 and B 2 , the second type of sequential browsing behavior is: i 1,3 →i 2,1 , i 1,3 →i 2,2 , i 1,3 →i 2,3 , i 1,3 →i 2,4 , i 1,3 →i 2,5 in the order to browse i 1,3 , i 2,1 , i 2 in lists B 1 and B 2 , 2 , i 2,3 , i 2,4 and i 2,5 are six adjacent items. For lists B 2 and B 3 , the second type of sequential browsing behavior is: i 2,1 →i 3,1 , i 2,2 →i 3,1 , i 2,3 →i 3,1 , i 2, 4 →i 3,1 , i 2,5 →i 3,1 in the order to browse i 2,1 , i 2,2 , i 2,3 , i 2,4 , i 2 in lists B 2 and B 3 ,5 and i 3,1 are six adjacent items.
应理解,若相邻的这两个列表均为横向列表,那么,对于前一个列表的任意一个项目而言,后一个列表中的所有项目均可视为与该项目相邻的项目,同样地,对于后一个列表的任意一个项目而言,前一个列表中的所有项目均可视为与该项目相邻的项目。 It should be understood that if the two adjacent lists are both horizontal lists, then for any item in the former list, all items in the latter list can be regarded as items adjacent to the item, and similarly , for any item in the latter list, all items in the previous list can be regarded as items adjacent to the item.
(2)用户在浏览目标页面时,可能从当前列表,直接跳转至另一个列表进行浏览,且当前列表与另一个列表之间相隔至少一个列表。为了便于介绍,下文将当前列表称为列表D,将另一个列表称为列表E,列表D与列表E之间相隔至少一个列表,定义列表D与列表E之间的列表跳过长度(skip length)l=tD-tE,例如,当列表D为目标页面中第2个列表,列表D为目标页面中第4个列表,则l=4-2=2。基于眼动数据,可统计不同的列表跳过长度,统计结果如表2所示:(2) When browsing the target page, the user may jump directly from the current list to another list for browsing, and the current list and the other list are separated by at least one list. For ease of introduction, the current list is called list D below, and the other list is called list E. List D and list E are separated by at least one list. The list skip length between list D and list E is defined. )l=t D -t E , for example, when list D is the second list in the target page and list D is the fourth list in the target page, then l=4-2=2. Based on eye movement data, different list skip lengths can be counted. The statistical results are shown in Table 2:
表2
Table 2
基于表2可知,列表跳过长度为2的浏览方式占据了最大的比重,说明用户除了顺序浏览行为,还会经常发送跳过一整个列表,直接浏览下一个列表的行为,这种浏览行为可以称为跳过行为(block skip)。值得注意的是,若目标页面为多个横向列表(也可以称为横划(horizontal block))和多个纵向列表(也可以称为竖划(Vertical block))交错排列的页面(即F型页面),列表跳过长度为2的跳过行为包括两大类行为,第一类跳过行为指从横向列表跳到横向列表,第二类跳过行为指从纵向列表跳到纵向列表,其中,几乎所有的跳过行为是从纵向列表跳到纵向列表,占比94.5%,而从横向列表跳到横向列表的占比,仅有5.5%,说明用户更倾向于从纵向列表跳到纵向列表。Based on Table 2, it can be seen that the browsing method with a list skip length of 2 accounts for the largest proportion, indicating that in addition to sequential browsing behavior, users also often send behaviors of skipping an entire list and browsing the next list directly. This browsing behavior can This is called block skip. It is worth noting that if the target page is a page with multiple horizontal lists (also called horizontal blocks) and multiple vertical lists (also called vertical blocks) staggered (i.e. F-type page), the skipping behavior with a list skip length of 2 includes two major types of behavior. The first type of skipping behavior refers to jumping from a horizontal list to a horizontal list, and the second type of skipping behavior refers to jumping from a vertical list to a vertical list, where , almost all skipping behaviors are from vertical lists to vertical lists, accounting for 94.5%, while jumping from horizontal lists to horizontal lists only account for 5.5%, indicating that users are more inclined to jump from vertical lists to vertical lists. .
基于此,可进一步统计从纵向列表跳到纵向列表这一类跳过行为中,主要发生在两个列表的哪一个项目上,即统计这一类跳过行为的起始项目和终止项目,统计结果如表3所示:Based on this, we can further calculate which item of the two lists the skipping behavior of jumping from the vertical list to the vertical list mainly occurs on, that is, count the starting item and ending item of this type of skipping behavior, and count The results are shown in Table 3:
表3
table 3
基于表3可知,用户的跳过行为的起始项目最大概率为某个列表中的最后一个项目,用户的跳过行为的终止项目最大概率为另一列表的第一个项目。Based on Table 3, it can be seen that the maximum probability of the starting item of the user's skipping behavior is the last item in a certain list, and the maximum probability of the ending item of the user's skipping behavior is the first item of another list.
那么,基于以上分析,用户的跳过行为可总结为:用户在非相邻的两个列表中(这两个列表之间相隔一个列表),从前一个列表的最后一个项目,跳转至后一个列表的第一个项目继续浏览,这两个项目之间的浏览顺序可称为跳转顺序。依旧如图5所示的例子,用户在浏览i1,3时,直接跳过B2,浏览i3,1Then, based on the above analysis, the user's skipping behavior can be summarized as: in two non-adjacent lists (these two lists are separated by one list), the user jumps from the last item of the previous list to the next one. Browsing continues with the first item in the list, and the browsing sequence between these two items can be called a jump sequence. Still using the example shown in Figure 5, when the user browses i 1,3 , he directly skips B 2 and browses i 3,1 .
(3)基于表1可知,除了顺序浏览行为外,非顺序浏览行为中占比最多的是按B→A→B的顺序所进行的浏览行为以及按A→B→A的顺序所进行的浏览行为,说明用户倾向于反复浏览相邻的两个项目,以进行项目之间的对比,此种浏览行为可以称为对比行为(comparison behaviors)。(3) Based on Table 1, it can be seen that in addition to sequential browsing behaviors, the largest proportion of non-sequential browsing behaviors are browsing behaviors in the order of B→A→B and browsing in the order of A→B→A. Behavior indicates that users tend to browse two adjacent items repeatedly to compare between items. This browsing behavior can be called comparison behaviors.
可见,对统计数据进行分析后,可确定用户对目标页面中多个项目的浏览行为,包含顺序浏览行为、跳过行为和对比行为等等。It can be seen that after analyzing the statistical data, the user's browsing behavior for multiple items in the target page can be determined, including sequential browsing behavior, skipping behavior, comparison behavior, etc.
403、基于浏览行为,对多个项目进行连接,得到有向无环图。403. Based on browsing behavior, connect multiple items to obtain a directed acyclic graph.
得到用户对目标页面中多个项目的浏览行为后,可基于这些浏览行为,对目标页面中多个项目进行连接,得到针对目标页面的有向无环图。After obtaining the user's browsing behavior for multiple items in the target page, based on these browsing behaviors, multiple items in the target page can be connected to obtain a directed acyclic graph for the target page.
具体地,这些浏览行为包含两大类浏览行为。第一类浏览行为指用户浏览同一列表中的项目,包含前述的第一类顺序浏览行为,故可将用户在同一列表中的浏览顺序称为第一顺序,第一顺序包含第一类顺序浏览行为中,用户浏览同一列表中所有项目所按照的从上到下的顺序以及从左到右的顺序。第二类浏览行为指用户浏览不同列表之间的项目,包含前述的第二类顺序浏览行为以及对比行为,故可将用户在不同列表之间的浏览顺序称为第二顺序,第二顺序包含第二类顺序浏览行为中,用户浏览相邻两个列表中相邻的若干个项目所按照的前后顺序,以及对比行为中,用户浏览非相邻两个列表中的两个项目所按照的跳转顺序。那么,可通过以下方式来获取针对目标页面的有向无环图:Specifically, these browsing behaviors include two major categories of browsing behaviors. The first type of browsing behavior refers to the user browsing items in the same list, including the aforementioned first type of sequential browsing behavior. Therefore, the user's browsing order in the same list can be called the first order, and the first order includes the first type of sequential browsing. Behavior, the order in which the user browses all items in the same list, from top to bottom and from left to right. The second type of browsing behavior refers to the user browsing items between different lists, including the aforementioned second type of sequential browsing behavior and comparison behavior. Therefore, the user's browsing order between different lists can be called the second order, and the second order includes In the second type of sequential browsing behavior, the user browses several adjacent items in two adjacent lists in the order in which they browse, and in contrast behavior, the user browses two items in two non-adjacent lists according to the jump. Turn order. Then, the directed acyclic graph for the target page can be obtained in the following way:
(1)在目标页面中,将用户以第一顺序浏览的同一列表中的项目,按第一顺序进行连接,即对于目标页面中的任意一个列表,若该列表为横向列表,则按从左到右的顺序来连接该列表中的所有项目,若该列表为纵向列表,则按从上到下的顺序来连接该列表中的所有项目,如此一来,可以完成目标页面中各个列表内部的连接。例如,如图7所示(图7为本申请实施例提供的有向无环图的一个示意图,图7是在图5的基础上绘制得到的),对于列表B1,可按照i1,1→i1,2→i1,3的顺序,来连接列表B1中i1,1、i1,2以及i1,3这3个项目。对于列表B2,可按照i2,1→i2,2→i2,3→i2,4→i2,5的顺序,来连接列表B2中i2,1、i2,2、i2,3、i2,4以及i2,5这5个项目。对于列表B3、B4以及B5也是如此,此处不再赘述,如此一来,则完成了目标页面中这5个列表内部的连接。(1) In the target page, connect the items in the same list that the user browsed in the first order, that is, for any list in the target page, if the list is a horizontal list, press from left to right Connect all the items in the list in order from right to right. If the list is a vertical list, connect all the items in the list in order from top to bottom. In this way, you can complete the internal content of each list on the target page. connect. For example, as shown in Figure 7 (Figure 7 is a schematic diagram of a directed acyclic graph provided by the embodiment of the present application, and Figure 7 is drawn based on Figure 5), for list B 1 , it can be calculated according to i 1, The order 1 →i 1,2 →i 1,3 is used to connect the three items i 1,1 , i 1,2 and i 1,3 in list B 1 . For list B 2 , i 2,1 , i 2,2 , in list B 2 can be connected in the order of i 2,1 →i 2,2 →i 2,3 →i 2,4 →i 2,5 There are five items: i 2,3 , i 2,4 and i 2,5 . The same is true for lists B 3 , B 4 and B 5 , which will not be repeated here. In this way, the internal connections of these five lists in the target page are completed.
(2)将用户以第二顺序浏览的不同列表中的项目,按第二顺序进行连接,得到有向无环图,即对于目标页面中相邻的两个列表,可按照这两个列表中相邻的项目的前后顺序,来连接这些相邻的项目,且对于目标页面中非相邻的两个列表(这两个列表之间相隔一个列表),可按照用户浏览这两个列表时的跳转顺序,来连接这两个列表中前一个列表的最后一个项目与后一个列表的第一个项目,如此一来,可完成目标页面中列表之间的连接,得懂啊有向无环图。依旧如图7所示的例子,对于列表B1和B2,可按照i1,3→i2,1、i1,3→i2,2、i1,3→i2,3、i1,3→i2,4、i1,3→i2,5的顺序,将i1,3与i2,1连接,将i1,3与i2,2连接,将i1,3与i2,3连接,将i1,3与i2,4连接,将i1,3与i2,5连接,对于列表B2和B3、列表B3和B4、列表B4和B5也是如此,此处不再赘述。进一步地,对于对于列表B1和B3,可按照i1,3→i3,1的顺序,将i1,3与i3,1连接,对于列表B3和B5,也是如此,此处不再赘述。如此一来,可得到针对目标页面的有向无环图。(2) Connect the items in different lists browsed by the user in the second order to obtain a directed acyclic graph. That is, for two adjacent lists in the target page, the items in the two lists can be connected according to the second order. The order of adjacent items is used to connect these adjacent items, and for two non-adjacent lists in the target page (one list is separated between the two lists), the user can browse the two lists according to the Jump sequence to connect the last item of the previous list and the first item of the next list in the two lists. In this way, the connection between the lists in the target page can be completed. Understand, directed acyclic picture. Still using the example shown in Figure 7, for lists B 1 and B 2 , i 1,3 →i 2,1 , i 1,3 →i 2,2 , i 1,3 →i 2,3 , i In the order of 1,3 →i 2,4 and i 1,3 →i 2,5 , connect i 1,3 to i 2,1 , connect i 1,3 to i 2,2 , connect i 1,3 Connect with i 2,3 , connect i 1,3 with i 2,4 , connect i 1,3 with i 2,5 , for lists B 2 and B 3 , lists B 3 and B 4 , lists B 4 and The same is true for B 5 , which will not be described again here. Furthermore, for lists B 1 and B 3 , i 1,3 can be connected to i 3,1 in the order of i 1,3 →i 3,1 , and the same is true for lists B 3 and B 5 , this No further details will be given. In this way, a directed acyclic graph for the target page can be obtained.
本申请实施例中,可基于用户浏览目标页面时所产生的眼动数据,来确定用户对目标页 面中多个项目的浏览行为,那么,这些浏览行为(例如,顺序浏览行为以及跳过行为),往往决定了用户对项目的浏览顺序(例如,用户在同一列表中的浏览顺序以及用户在不同列表之间的浏览顺序),从而按照这些浏览来连接目标页面的多个项目,得到针对目标页面的有向无环图,该有向无环图可用于后续对目标页面的用户行为预测中,由于该有向无环图涉及了用户复杂多样的浏览行为,有利于提高对目标页面的用户行为预测的准确度。In the embodiment of the present application, the user's interest in the target page can be determined based on the eye movement data generated when the user browses the target page. The browsing behavior of multiple items in the list, then these browsing behaviors (for example, sequential browsing behavior and skipping behavior) often determine the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order in different items). Browsing order between lists), thereby connecting multiple items of the target page according to these browses, and obtaining a directed acyclic graph for the target page. This directed acyclic graph can be used in subsequent user behavior predictions for the target page. Since this directed acyclic graph involves users' complex and diverse browsing behaviors, it is helpful to improve the accuracy of predicting user behavior on the target page.
以上是对本申请实施例提供的有向无环图构建方法所进行的详细说明,以下将对本申请实施例提供的用户行为预测方法进行介绍。图8为本申请实施例提供的用户行为预测方法的一个流程示意图,如图8所示,该方法包括:The above is a detailed description of the directed acyclic graph construction method provided by the embodiment of the present application. The user behavior prediction method provided by the embodiment of the present application will be introduced below. Figure 8 is a schematic flow chart of a user behavior prediction method provided by an embodiment of the present application. As shown in Figure 8, the method includes:
801、通过目标模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。801. Obtain the first characteristic of the first item and the second characteristic of the second item through the target model. The first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item.
本实施例中,当需要目标页面进行用户行为预测时,即获取目标页面中各个项目被用户点击的概率,可先提取目标页面中各个项目的第一特征,需要说明的是,对于任意一个项目,该项目的第一特征指该项目自身的属性信息,例如,当该项目为应用商城的页面中的某个应用时,该应用的第一特征可包含该应用的开发商、该应用的大小、该应用的类型、该应用的图标等等,当该项目为购物商城的页面中的某个商品时,该商品的第一特征可包含该商品的价格、该商品的类型、该商品的颜色等等。In this embodiment, when the target page is required to predict user behavior, that is, to obtain the probability that each item in the target page is clicked by the user, the first feature of each item in the target page can be extracted first. It should be noted that for any item , the first characteristic of the project refers to the attribute information of the project itself. For example, when the project is an application on the page of the application mall, the first characteristic of the application may include the developer of the application, the size of the application , the type of the application, the icon of the application, etc., when the item is a product in the shopping mall page, the first feature of the product may include the price of the product, the type of the product, the color of the product etc.
值得注意的是,由于目标页面包含多个项目,故可针对目标页面执行多轮次的操作,一个轮次针对目标页面中的一个项目进行操作(即每个轮次均会执行一次步骤801至步骤805,也就是会对每个项目均执行步骤801至步骤805),由于一个轮次的操作可以得到一个项目被用户点击的概率,故完成所有轮次后,可得到目标页面中所有项目被用户点击的概率。基于此,本实施例以目标页面中的其中一个项目进行示意性介绍,并将该项目称为第一项目。It is worth noting that since the target page contains multiple items, multiple rounds of operations can be performed on the target page. One round operates on one item in the target page (that is, steps 801 to 801 are performed once in each round). Step 805, that is, steps 801 to 805 will be executed for each item. Since one round of operation can obtain the probability of an item being clicked by the user, after completing all rounds, the probability of all items in the target page being clicked can be obtained. The probability of a user clicking. Based on this, this embodiment makes a schematic introduction using one of the items in the target page, and calls this item the first item.
那么,当要预估第一项目被用户点击的概率时,可将第一项目的第一特征以及第二项目的第二特征,输入至目标模型(已训练的神经网络模型)。其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前,即第一项目和第二项目的位置关系存在以下两种情况:(1)第一项目和第二项目可以为同一列表中的项目,第二项目位于第一项目之前且第二项目和第一项目相邻。(2)第一项目和第二项目可以为不同列表中的项目,第二项目所在的列表位于第一项目所在的列表之前,第二项目既可以和第一项目相邻,也可以不和第一项目相邻。Then, when it is necessary to estimate the probability that the first item is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model (trained neural network model). Among them, the first item and the second item are located in different lists or the same list on the target page, and the second item is located before the first item, that is, the positional relationship between the first item and the second item exists in the following two situations: (1) The first item and the second item may be items in the same list, the second item is located before the first item and the second item is adjacent to the first item. (2) The first item and the second item can be items in different lists. The list where the second item is located is located before the list where the first item is located. The second item can be adjacent to the first item or not. One item is adjacent.
由于目标页面中的所有项目可构成有向无环图,故这些项目的任意一个项目可视为有向无环图中的一个节点,且有向无环图中的节点之间存在单向的连接关系。如此一来,有向无环图则存在父节点和子节点,且父节点和子节点之间的连接方向由父节点指向子节点。那么,前述的第一项目可视为针对目标页面的有向无环图中的某个子节点,第二项目则为该子节点的所有父节点,依旧如图7所示的例子,当第一项目为i1,2时,第二项目则为i1,1。当第一项目为i2,1时,第二项目则为i1,3。当第一项目为i2,2时,第二项目则为i2,1和i1,3。当第一项目为i3,1时,第二项目则为i2,1、i2,2、i2,3、i2,4、i2,5和i1,3等等。Since all the items in the target page can form a directed acyclic graph, any one of these items can be regarded as a node in the directed acyclic graph, and there is a one-way connection between the nodes in the directed acyclic graph. connection relationship. In this way, the directed acyclic graph has parent nodes and child nodes, and the connection direction between the parent node and the child node is from the parent node to the child node. Then, the aforementioned first item can be regarded as a child node in the directed acyclic graph for the target page, and the second item is all the parent nodes of the child node. Still as shown in Figure 7, when the first When the item is i 1,2 , the second item is i 1,1 . When the first item is i 2,1 , the second item is i 1,3 . When the first item is i 2,2 , the second item is i 2,1 and i 1,3 . When the first item is i 3,1 , the second item is i 2,1 , i 2,2 , i 2,3 , i 2,4 , i 2,5 , i 1,3 and so on.
应理解,第二项目的第二特征的获取过程可参考后续第一项目的第二特征的获取过程,此处不做赘述。It should be understood that the acquisition process of the second feature of the second item may refer to the subsequent acquisition process of the second feature of the first item, and will not be described again here.
还应理解,当第一项目为目标页面中的首个项目时(例如,图7所示例子中的i1,1),则 不存在第二项目,此时第二项目的第二特征可理解为是一个预置值(该预置值的大小可根据实际需要来设置,此处不做限制),故也可将第一项目的第一特征和该预置值输入至目标模型中。It should also be understood that when the first item is the first item in the target page (for example, i 1,1 in the example shown in Figure 7), then There is no second item. At this time, the second characteristic of the second item can be understood as a preset value (the size of the preset value can be set according to actual needs, and there is no limit here). Therefore, the first item can also be The first characteristic of the project and this preset value are input into the target model.
802、通过目标模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征。802. Obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item through the target model.
将第一项目的第一特征以及第二项目的第二特征输入目标模型后,可通过目标模型对第一项目的第一特征以及第二项目的第二特征进行处理,从而得到第一项目的第二特征。After inputting the first feature of the first item and the second feature of the second item into the target model, the first feature of the first item and the second feature of the second item can be processed by the target model to obtain the first feature of the first item. Second characteristic.
具体地,可通过以下方式获取第一项目的第二特征:Specifically, the second feature of the first item can be obtained in the following way:
(1)提取用户对目标页面的请求以及第二项目被用户点击的概率,其中,用户对目标页面的请求可以包含用户在目标页面输入的用于搜索某些项目的关键词等等,第二项目被用户点击的概率可以理解为上一轮次所针对的项目(即第一项目的前一个项目)被用户点击的概率。(1) Extract the user's request for the target page and the probability that the second item is clicked by the user. The user's request for the target page can include keywords entered by the user on the target page to search for certain items, etc., second The probability that an item is clicked by the user can be understood as the probability that the item targeted in the previous round (that is, the item before the first item) is clicked by the user.
(2)在已将第一项目的第一特征和第二项目的第二特征输入到目标模型的基础上,还可将用户对目标页面的请求以及第二项目被用户点击的概率输入目标模型,以使得目标模型分别将第一项目的第一特征、用户对目标页面的请求以及第二项目被用户点击的概率在隐空间上进行映射(即前述的映射处理),相应得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征,再对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行拼接(即前述的第二融合处理),得到第一项目的第四特征。与此同时,目标模型还可对第二项目的第二特征进行基于自注意力机制(self attention)的处理,得到第一项目的第五特征。(2) After the first feature of the first item and the second feature of the second item have been input into the target model, the user's request for the target page and the probability that the second item is clicked by the user can also be input into the target model. , so that the target model maps the first feature of the first item, the user's request for the target page, and the probability of the second item being clicked by the user on the latent space (i.e., the aforementioned mapping process), and accordingly obtains the first item's The sixth characteristic, the seventh characteristic of the first item and the eighth characteristic of the first item are then spliced together with the sixth characteristic of the first item, the seventh characteristic of the first item and the eighth characteristic of the first item (that is, the aforementioned second fusion process) to obtain the fourth feature of the first item. At the same time, the target model can also process the second feature of the second item based on the self-attention mechanism to obtain the fifth feature of the first item.
例如,如图9所示的例子(图9为本申请实施例提供的目标模型的一个结构示意图),设第一项目为it,j,第二项目的集合为Pt,j,该集合中第k个第二项目为ik(k=1,...,n),第一项目的第一特征为I,第k个第二项目的第二特征为hk,用户对目标页面的请求为Q,第二项目被用户点击的概率为C。For example, in the example shown in Figure 9 (Figure 9 is a schematic structural diagram of the target model provided by the embodiment of the present application), assume that the first item is i t,j , and the set of second items is P t,j , and the set The k-th second item in is i k (k=1,...,n), the first feature of the first item is I, the second feature of the k-th second item is h k , the user’s target page The request is Q, and the probability of the second item being clicked by the user is C.
将第一项目的第一特征I,第二项目的第二特征h1、...、hn,用户对目标页面的请求Q,第二项目被用户点击的概率C输入目标模型后,目标模型可将第一项目的第一特征I映射在隐空间上,得到第一项目的第六特征VI,将用户对目标页面的请求Q映射在隐空间上,得到第一项目的第七特征VQ,将第二项目被用户点击的概率C映射在隐空间上,得到第一项目的第八特征VC。然后,目标模型可将第一项目的第六特征VI、第一项目的第七特征VQ、第一项目的第八特征VC进行拼接,得到第一项目的第四特征xt,jAfter inputting the first feature I of the first item, the second feature h 1 ,..., h n of the second item, the user's request Q for the target page, and the probability C of the second item being clicked by the user into the target model, the target The model can map the first feature I of the first item on the latent space to obtain the sixth feature V I of the first item, and map the user's request Q for the target page on the latent space to obtain the seventh feature of the first item. V Q , map the probability C of the second item clicked by the user on the latent space, and obtain the eighth feature V C of the first item. Then, the target model can splice the sixth feature V I of the first item, the seventh feature V Q of the first item, and the eighth feature V C of the first item to obtain the fourth feature x t,j of the first item. .
与此同时,目标模型还可利用自注意力机制对第二项目的第二特征h1、...、hn进行计算,得到第一项目的第五特征et,j。其中,基于自注意力机制的计算如以下公式所示:
At the same time, the target model can also use the self-attention mechanism to calculate the second features h 1 ,..., h n of the second item, and obtain the fifth feature e t,j of the first item. Among them, the calculation based on the self-attention mechanism is as shown in the following formula:
(3)得到第一项目的第四特征以及第一项目的第五特征,目标模型可利用循环神经单元 (GRUcell)对第一项目的第四特征以及第一项目的第五特征进行处理(即前述的第一融合处理),得到第一项目的第二特征。(3) Obtain the fourth feature of the first item and the fifth feature of the first item. The target model can use recurrent neural units (GRUcell) processes the fourth feature of the first item and the fifth feature of the first item (ie, the aforementioned first fusion process) to obtain the second feature of the first item.
依旧如图9所示的例子,得到第一项目的第四特征xt,j和第一项目的第五特征et,j,目标模型可将这两个特征输入循环神经单元中进行处理,得到第一项目的第二特征ht,j。其中,循环神经单元所实现的处理如以下公式所示:
ht,j=GRUcell(xt,j,et,j)        (3)
Still using the example shown in Figure 9, the fourth feature x t,j of the first item and the fifth feature e t,j of the first item are obtained. The target model can input these two features into the recurrent neural unit for processing. Get the second feature h t,j of the first item. Among them, the processing implemented by the recurrent neural unit is as shown in the following formula:
h t,j =GRUcell(x t,j ,e t,j ) (3)
可以理解的是,第一项目的第二特征可表征第二项目对第一项目所产生的影响(也可以理解为第二项目与第一项目之间的关系),即用户使用顺序浏览行为以及跳过行为浏览到第一项目时,用户在进行这些行为的过程中所浏览的项目对第一项目所产生的影响。It can be understood that the second characteristic of the first item can represent the impact of the second item on the first item (it can also be understood as the relationship between the second item and the first item), that is, the user's sequential browsing behavior and When skipping behaviors and browsing to the first item, the impact of the items browsed by the user on the first item while performing these behaviors.
应理解,在获取第一特征的第四特征的过程中,也可不向目标模型输入用户对目标页面的请求以及第二项目被用户点击的概率,使得目标模型直接对第一项目的第一特征继续宁映射处理后,得到第一特征的第四特征。It should be understood that in the process of obtaining the fourth feature of the first feature, the user's request for the target page and the probability of the second item being clicked by the user may not be input to the target model, so that the target model directly obtains the first feature of the first item. After continuing the Ning mapping process, the fourth feature of the first feature is obtained.
803、通过目标模型获取第三项目的第一特征,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻。803. Obtain the first characteristic of the third item through the target model. The first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item.
此外,还可将第三项目的第一特征输入至目标模型,其中,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻,即第一项目和第三项目的位置关系存在以下两种情况:(1)第一项目和第三项目可以为同一列表中的项目,第三项目和第一项目相邻。(2)第一项目和第三项目可以为不同列表中的项目,第三项目和第一项目相邻。In addition, the first feature of the third item can also be input to the target model, wherein the first item and the third item are located in different lists or the same list on the target page, and the third item is adjacent to the first item, that is, the third item is adjacent to the first item. There are two situations in the positional relationship between the first item and the third item: (1) The first item and the third item can be items in the same list, and the third item and the first item are adjacent. (2) The first item and the third item can be items in different lists, and the third item and the first item are adjacent.
依旧如图7所示的例子,当第一项目为i1,2时,第三项目则为i1,1。当第一项目为i1,3时,第三项目则为i1,2和i2,1。当第一项目为i2,1时,第二项目则为i1,3、i2,2以及i3,1等等。Still as shown in the example in Figure 7, when the first item is i 1,2 , the third item is i 1,1 . When the first item is i 1,3 , the third item is i 1,2 and i 2,1 . When the first item is i 2,1 , the second item is i 1,3 , i 2,2 , i 3,1 and so on.
804、通过目标模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征。804. Obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model.
将第三项目的第一特征输入目标模型后,可通过目标模型对第一项目的第一特征以及第三项目的第一特征进行处理,从而得到第一项目的第三特征。After the first feature of the third item is input into the target model, the first feature of the first item and the first feature of the third item can be processed by the target model, thereby obtaining the third feature of the first item.
具体地,可通过以下方式获取第一项目的第三特征:Specifically, the third feature of the first item can be obtained in the following way:
(1)目标模型分别将第一项目的第一特征、用户对目标页面的请求以及第三项目的第一特征在隐空间上进行映射(即前述的映射处理),相应得到第一项目的第六特征、第一项目的第七特征以及第一项目的第九特征。(1) The target model maps the first feature of the first item, the user's request for the target page, and the first feature of the third item on the latent space respectively (i.e., the aforementioned mapping process), and accordingly obtains the first feature of the first item. Six characteristics, the seventh characteristic of the first item and the ninth characteristic of the first item.
依旧如图9所示的例子,设第三项目的集合为Nt,j,该集合中第f个第三项目为if(f=1,...,m),第f个第三项目的第一特征为IfStill using the example shown in Figure 9, let the set of third items be N t,j , the f-th third item in the set is if ( f =1,...,m), the f-th third item in the set is The first characteristic of the project is If .
将第三项目的第一特征I1、...、Im输入目标模型后,目标模型可将第一项目的第一特征I映射在隐空间上,得到第一项目的第六特征VI,将用户对目标页面的请求Q映射在隐空间上,得到第一项目的第七特征VQ,将第三项目的第一特征I1、...、Im映射在隐空间上,得到第一项目的第九特征V1、...、VmAfter inputting the first feature I 1 ,..., I m of the third item into the target model, the target model can map the first feature I of the first item on the latent space to obtain the sixth feature V I of the first item. , map the user's request Q for the target page on the latent space, and obtain the seventh feature V Q of the first item, and map the first feature I 1 ,..., I m of the third item on the latent space, and obtain The ninth characteristic V 1 , ..., V m of the first item.
(2)然后,目标模型可通过对比函数对第一项目的第六特征和第一项目的第九特征进行 计算,再基于计算结果进行加权求和(对比函数的计算以及加权求和计算即前述的第三融合处理),得到第一项目的第十特征。(2) Then, the target model can perform the comparison function on the sixth feature of the first item and the ninth feature of the first item. Calculate, and then perform weighted summation based on the calculation results (the calculation of the comparison function and the weighted summation calculation are the aforementioned third fusion process) to obtain the tenth feature of the first item.
依旧如图9所示的例子,得到第一项目的第六特征VI和第一项目的第九特征V1、...、Vm后,可对这些特征进行基于对比函数g的计算和加权求和计算,得到第一项目的第十特征dt,j。其中,该计算过程如以下公式所示:
Still using the example shown in Figure 9, after obtaining the sixth feature V I of the first item and the ninth feature V 1 ,..., V m of the first item, these features can be calculated and summed based on the contrast function g. Weighted summation calculation is performed to obtain the tenth feature d t,j of the first item. Among them, the calculation process is as shown in the following formula:
其中,对比函数g可以为以下三种函数中的其中一种:内积函数神经网络函数核函数 Among them, the comparison function g can be one of the following three functions: inner product function neural network function kernel function
(3)最后,目标模型可对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行同或运算(即前述的第四融合处理),得到第一项目的第三特征。(3) Finally, the target model can perform an exclusive OR operation (i.e., the aforementioned fourth fusion process) on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item to obtain the first item The third characteristic.
依旧如图9所示的例子,得到第一项目的第十特征dt,j,可将第一项目的第六特征VI、第一项目的第七特征VQ以及第一项目的第十特征dt,j进行同或运算,得到第一项目的第三特征cpt,j。其中,该运算过程如以下公式所示:
cpt,j=dt,j⊙VI⊙VQ    (5)
Still using the example shown in Figure 9, to obtain the tenth feature d t,j of the first item, the sixth feature V I of the first item, the seventh feature V Q of the first item, and the tenth feature of the first item can be The features d t,j are subjected to an exclusive OR operation to obtain the third feature cp t,j of the first item. Among them, the operation process is shown in the following formula:
cp t, j = d t, j ⊙V I ⊙V Q (5)
可以理解的是,第一项目的第三特征可表征第三项目对第一项目所产生的影响(也可以理解为第三项目与第一项目之间的关系),即用户使用对比行为浏览到第一项目时,用户在进行该行为的过程中所浏览的项目对第一项目所产生的影响。It can be understood that the third characteristic of the first item can represent the impact of the third item on the first item (it can also be understood as the relationship between the third item and the first item), that is, the user uses the comparison behavior to browse to The first item refers to the impact of the items browsed by the user on the first item during the behavior.
应理解,在获取第一项目的第三特征时,也可不向目标模型输入用户对目标页面的请求,故目标模型可仅对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。It should be understood that when obtaining the third feature of the first item, the user's request for the target page may not be input to the target model. Therefore, the target model may only perform the third feature on the sixth feature of the first item and the tenth feature of the first item. Four fusion processes are performed to obtain the third feature of the first item.
805、通过目标模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。805. Use the target model to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item.
得到第一项目的第二特征以及第一项目的第三特征后,目标模型可对第一项目的第二特征以及第一项目的第三特征进行计算,从而得到第一项目被用户点击的概率。After obtaining the second feature of the first item and the third feature of the first item, the target model can calculate the second feature of the first item and the third feature of the first item, thereby obtaining the probability that the first item is clicked by the user. .
依旧如图9所示的例子,得到第一项目的第二特征ht,j以及第一项目的第三特征cpt,j后,可对这两个特征进行计算,得到第一项目被用户点击的概率Ct,j。其中,该计算的过程如以下公式所示:
Still using the example shown in Figure 9, after obtaining the second feature h t,j of the first item and the third feature cp t,j of the first item, the two features can be calculated to obtain the first item that is used by the user. Probability of click C t,j . Among them, the calculation process is as shown in the following formula:
同理,对于目标页面中处第一项目之外的其余项目,也可执行如同对第一项目所执行的操作,故可得到目标页面中所有项目被用户点击的概率,从而完成对目标页面的用户行为预测。In the same way, for the remaining items in the target page other than the first item, the same operations as those performed on the first item can also be performed. Therefore, the probability of all items in the target page being clicked by the user can be obtained, thereby completing the analysis of the target page. User behavior prediction.
应理解,在获取第一项目被用户点击的概率时,也可不执行步骤803和步骤804,从而令目标模型直接对第一项目的第二特征进行计算,得到第一项目被用户点击的概率。It should be understood that when obtaining the probability that the first item is clicked by the user, steps 803 and 804 may not be performed, so that the target model directly calculates the second feature of the first item to obtain the probability that the first item is clicked by the user.
此外,还可将本申请实施例提供的目标模型的预测结果与相关技术的模型的预测结果进行比较,比较结果如表4所示:In addition, the prediction results of the target model provided by the embodiment of the present application can also be compared with the prediction results of the model of related technologies. The comparison results are shown in Table 4:
表4
Table 4
基于表4可知,本申请实施例提供的目标模型所展示出来的预测能力,相比于相关技术提供的模型,在两种指标上均有显著的提升。Based on Table 4, it can be seen that the prediction ability displayed by the target model provided by the embodiment of the present application is significantly improved in both indicators compared with the model provided by the related technology.
本申请实施例中,当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二 特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。In the embodiment of the present application, when it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model. The two items are in different lists or in the same list on the target page, and the second item is before the first item. Then, the target model can obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item, and then based on the second feature of the first item Feature, obtain the probability that the first item is clicked by the user. In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
进一步地,本申请实施例提供的目标模型,不仅引入了常规的顺序浏览行为,还引入到跳转行为和对比行为等等浏览行为,也就是说,目标模型会考虑用户使用这些复杂多样的浏览行为浏览至第一项目时,用户在进行这些行为的过程中所浏览的项目对第一项目所产生的影响,可进一步提高目标模型最终得到的第一项目被用户点击的概率的准确度。Furthermore, the target model provided by the embodiments of this application not only introduces conventional sequential browsing behaviors, but also introduces browsing behaviors such as jump behaviors and comparison behaviors. In other words, the target model will consider the user's use of these complex and diverse browsing behaviors. When the user browses to the first item, the impact of the item browsed by the user on the first item during the behavior can further improve the accuracy of the probability of the first item being clicked by the user finally obtained by the target model.
更进一步地,本申请实施例提供的目标模型在对第一项目进行分析时,不仅考虑到第一项目自身的属性信息的影响,还考虑到用户对目标页面的请求以及第二项目被用户点击的概率等外界因素所产生的影响,从而进一步提高目标模型最终得到的第一项目被用户点击的概率的准确度。Furthermore, when analyzing the first item, the target model provided by the embodiment of the present application not only takes into account the influence of the attribute information of the first item itself, but also considers the user's request for the target page and the second item clicked by the user. The influence of external factors such as probability can further improve the accuracy of the probability of the first item being clicked by the user finally obtained by the target model.
以上是对本申请实施例提供的用户行为预测方法所进行的详细说明,以下将对本申请实施例提供的模型训练方法进行介绍。图10为本申请实施例提供的模型训练方法的一个流程示意图,如图10所示,该方法包括:The above is a detailed description of the user behavior prediction method provided by the embodiment of the present application. The model training method provided by the embodiment of the present application will be introduced below. Figure 10 is a schematic flow chart of the model training method provided by the embodiment of the present application. As shown in Figure 10, the method includes:
1001、通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于待处理页面的不同列表或同一列表中,且第二项目位于第一项目之前。1001. Obtain the first feature of the first item and the second feature of the second item through the model to be trained. The first item and the second item are located in different lists or the same list of the page to be processed, and the second item is located in the first item. Before.
本实施例中,需要对待训练模型进行训练时,可先获取一批训练数据,该批训练数据包含待处理页面,待处理页面包含多个列表,且每个列表包含至少一个项目。值得注意的是,在待处理页面中,任意一个项目被用户点击的真实概率是已知的。In this embodiment, when the model to be trained needs to be trained, a batch of training data can be obtained first. The batch of training data includes pages to be processed. The pages to be processed include multiple lists, and each list contains at least one item. It is worth noting that in the page to be processed, the true probability of any item being clicked by the user is known.
需要说明的是,关于待处理页面的第一项目、第二项目、第一项目的第一特征以及第二项目的第二特征,可参考图8所示实施例中步骤801中目标页面的第一项目、第二项目、第一项目的第一特征以及第二项目的第二特征的相关说明部分,此处不再赘述。It should be noted that, regarding the first item, the second item, the first feature of the first item, and the second feature of the second item of the page to be processed, reference may be made to the third item of the target page in step 801 in the embodiment shown in FIG. 8 The relevant descriptions of the first item, the second item, the first feature of the first item, and the second feature of the second item will not be described again here.
可以理解的是,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息。It can be understood that the first feature of the first item is the attribute information of the first item, the second feature of the first item is the information obtained by fusion based on the attribute information of the first item, and the second feature of the second item is the information obtained based on the fusion of the attribute information of the first item. Information obtained by fusing the attribute information of the second item (that is, the first feature of the second item).
1002、通过待训练模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征。1002. Obtain the second feature of the first item based on the first feature of the first item and the second feature of the second item through the model to be trained.
将第一项目的第一特征以及第二项目的第二特征输入待训练模型后,可通过待训练模型对第一项目的第一特征以及第二项目的第二特征进行处理,从而得到第一项目的第二特征。After inputting the first feature of the first item and the second feature of the second item into the model to be trained, the first feature of the first item and the second feature of the second item can be processed by the model to be trained, thereby obtaining the first Secondary characteristics of the project.
在一种可能的实现方式中,基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征包括:对第一项目的第一特征进行映射处理,得到第一项目的第四特征;对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, based on the first feature of the first item and the second feature of the second item, obtaining the second feature of the first item includes: mapping the first feature of the first item to obtain the second feature of the first item. The fourth feature of the first item; the second feature of the second item is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are processed The first fusion process is to obtain the second feature of the first item.
在一种可能的实现方式中,对第一项目的第一特征进行映射处理,得到第一项目的第四特征:对第一项目的第一特征、用户对待处理页面的请求以及第二项目被用户点击的概率进 行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;In a possible implementation, the first feature of the first item is mapped to obtain the fourth feature of the first item: the first feature of the first item, the user's request for the page to be processed, and the second item being processed. The probability of a user clicking on Perform mapping processing to obtain the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item;
对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。Perform a second fusion process on the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item to obtain the fourth feature of the first item.
需要说明的是,步骤1002的介绍,可参考图8所示实施例中步骤802的相关说明部分,此处不再赘述。It should be noted that for the introduction of step 1002, reference may be made to the relevant description of step 802 in the embodiment shown in FIG. 8 , which will not be described again here.
1003、通过待训练模型获取第三项目的第一特征,第一项目和第三项目位于待处理页面的不同列表或同一列表中,且第三项目与第一项目相邻。1003. Obtain the first feature of the third item through the model to be trained. The first item and the third item are located in different lists or the same list on the page to be processed, and the third item is adjacent to the first item.
本实施例中,还可将第三项目的第一特征输入至目标模型,需要说明的是,关于待处理页面的第三项目以及第三项目的第一特征,可参考图8所示实施例中步骤803中目标页面的第三项目以及第三项目的第一特征的相关说明部分,此处不再赘述。In this embodiment, the first feature of the third item can also be input to the target model. It should be noted that, regarding the third item of the page to be processed and the first feature of the third item, reference can be made to the embodiment shown in Figure 8 The relevant description of the third item of the target page and the first feature of the third item in step 803 will not be described again here.
1004、通过待训练模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征。1004. Obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the model to be trained.
将第三项目的第一特征输入待训练模型后,可通过待训练模型对第一项目的第一特征以及第三项目的第一特征进行处理,从而得到第一项目的第三特征。After the first feature of the third item is input into the model to be trained, the first feature of the first item and the first feature of the third item can be processed by the model to be trained, thereby obtaining the third feature of the first item.
在一种可能的实现方式中,基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征包括:对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, based on the first feature of the first item and the first feature of the third item, obtaining the third feature of the first item includes: comparing the first feature of the first item and the third feature of the third item. Perform mapping processing on one feature to obtain the sixth feature of the first item and the ninth feature of the first item; perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the sixth feature of the first item. The tenth feature; performs the fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
在一种可能的实现方式中,第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征包括:对用户对待处理页面的请求进行映射处理,得到第一项目的第七特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, performing a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item includes: mapping the user's request for the page to be processed , the seventh feature of the first item is obtained; the fourth fusion process is performed on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item, to obtain the third feature of the first item.
需要说明的是,步骤1004的介绍,可参考图8所示实施例中步骤804的相关说明部分,此处不再赘述。It should be noted that for the introduction of step 1004, reference may be made to the relevant description of step 804 in the embodiment shown in FIG. 8 and will not be described again here.
1005、通过待训练模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。1005. Obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the model to be trained.
得到第一项目的第二特征以及第一项目的第三特征后,可通过待训练模型对第一项目的第二特征以及第一项目的第三特征进行处理,从而得到第一项目被用户点击的概率(也可以称为第一项目被用户点击的预测概率)。After obtaining the second feature of the first item and the third feature of the first item, the second feature of the first item and the third feature of the first item can be processed by the model to be trained, thereby obtaining that the first item was clicked by the user. The probability (can also be called the predicted probability that the first item is clicked by the user).
需要说明的是,步骤1005的介绍,可参考图8所示实施例中步骤805的相关说明部分,此处不再赘述。It should be noted that for the introduction of step 1005, reference may be made to the relevant description of step 805 in the embodiment shown in FIG. 8 , which will not be described again here.
1006、基于第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,目标损失用于指示第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异。1006. Based on the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user, obtain the target loss. The target loss is used to indicate the probability of the first item being clicked by the user and the real probability of the first item being clicked by the user. difference between.
得到第一项目被用户点击的预测概率后,由于第一项目被用户点击的真实概率已知,故可通过预置的目标损失函数对第一项目被用户点击的预测概率以及第一项目被用户点击的真实概率进行计算,得到目标损失,目标损失用于指示第一项目被用户点击的预测概率以及第 一项目被用户点击的真实概率之间的差异。After obtaining the predicted probability of the first item being clicked by the user, since the real probability of the first item being clicked by the user is known, the predicted probability of the first item being clicked by the user and the predicted probability of the first item being clicked by the user can be calculated through the preset target loss function The true probability of clicking is calculated to obtain the target loss. The target loss is used to indicate the predicted probability of the first item being clicked by the user and the predicted probability of the first item being clicked. The difference between the true probability of an item being clicked by the user.
1007、基于目标损失,更新待训练模型的参数,直至满足模型训练条件,得到目标模型。1007. Based on the target loss, update the parameters of the model to be trained until the model training conditions are met and the target model is obtained.
得到目标损失后,可基于目标损失来更新待训练模型的参数,并获取下一批训练数据,利用下一批训练数据继续对更新参数后的待训练模型进行训练(即重新执行步骤1001至步骤1007),直至满足模型训练条件(例如,目标损失达到收敛等等),可得到图8所示实施例中的目标模型。After obtaining the target loss, the parameters of the model to be trained can be updated based on the target loss, and the next batch of training data can be obtained, and the next batch of training data can be used to continue training the model to be trained after the updated parameters (i.e., re-execute steps 1001 to 1001). 1007), until the model training conditions are met (for example, the target loss reaches convergence, etc.), the target model in the embodiment shown in Figure 8 can be obtained.
本申请实施例训练得到的目标模型,具备对页面进行用户行为预测的能力。当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The target model trained in the embodiment of this application has the ability to predict user behavior on the page. When it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
以上是对本申请实施例提供的模型训练方法所进行的详细说明,以下将对本申请实施例提供的装置和设备进行介绍。图11为本申请实施例提供的用户行为预测装置的一个结构示意图,如图11所示,该装置包括:The above is a detailed description of the model training method provided by the embodiment of the present application. The device and equipment provided by the embodiment of the present application will be introduced below. Figure 11 is a schematic structural diagram of a user behavior prediction device provided by an embodiment of the present application. As shown in Figure 11, the device includes:
第一获取模块1101,用于通过目标模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前;The first acquisition module 1101 is used to acquire the first characteristics of the first item and the second characteristics of the second item through the target model. The first item and the second item are located in different lists or the same list of the target page, and the second item Located before the first item;
第二获取模块1102,用于通过目标模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,其中,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息;The second acquisition module 1102 is configured to acquire the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item through the target model, where the first characteristic of the first item is the first item. Attribute information of Information obtained by fusion;
第三获取模块1103,用于通过目标模型基于第一项目的第二特征,获取第一项目被用户点击的概率。The third acquisition module 1103 is configured to acquire the probability that the first item is clicked by the user based on the second feature of the first item through the target model.
本申请实施例中,当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。 In the embodiment of the present application, when it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second feature can be input to the target model. The two items are in different lists or in the same list on the target page, and the second item is before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该装置还包括:第四获取模块,用于通过目标模型获取第三项目的第一特征,第一项目和第三项目位于目标页面的不同列表或同一列表中,且第三项目与第一项目相邻;第五获取模块,用于通过目标模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;第三获取模块1103,用于通过目标模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。In a possible implementation, the device further includes: a fourth acquisition module, configured to acquire the first feature of the third item through the target model, where the first item and the third item are located in different lists or the same list of the target page. , and the third item is adjacent to the first item; the fifth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model; the third The acquisition module 1103 is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the target model.
在一种可能的实现方式中,第二获取模块1102,用于:通过目标模型对第一项目的第一特征进行映射处理,得到第一项目的第四特征;通过目标模型对第二项目的第二特征进行基于自注意力机制的处理,得到第一项目的第五特征;通过目标模型对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, the second acquisition module 1102 is configured to: map the first feature of the first item through the target model to obtain the fourth feature of the first item; map the first feature of the second item through the target model. The second feature is processed based on the self-attention mechanism to obtain the fifth feature of the first item; the fourth feature of the first item and the fifth feature of the first item are first fused through the target model to obtain the first item the second characteristic.
在一种可能的实现方式中,第二获取模块1102,用于:通过目标模型对第一项目的第一特征、用户对目标页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;通过目标模型对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。In a possible implementation, the second acquisition module 1102 is configured to perform mapping processing on the first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user through the target model, to obtain The sixth characteristic of the first item, the seventh characteristic of the first item, and the eighth characteristic of the first item; through the target model, the sixth characteristic of the first item, the seventh characteristic of the first item, and the eighth characteristic of the first item are The features undergo a second fusion process to obtain the fourth feature of the first item.
在一种可能的实现方式中,第五获取模块,用于:通过目标模型对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;通过目标模型对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;通过目标模型对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation manner, the fifth acquisition module is used to map the first feature of the first item and the first feature of the third item through the target model to obtain the sixth feature of the first item and the first feature of the third item. The ninth feature of the first item; the third fusion process is performed on the sixth feature of the first item and the ninth feature of the first item through the target model to obtain the tenth feature of the first item; the third feature of the first item is obtained through the target model The six features and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,第五获取模块,用于:通过目标模型对用户对目标页面的请求进行映射处理,得到第一项目的第七特征;通过目标模型对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the fifth acquisition module is used to: map the user's request for the target page through the target model to obtain the seventh feature of the first item; map the sixth feature of the first item through the target model Features, the seventh feature of the first item, and the tenth feature of the first item are subjected to a fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,若第一项目为目标页面中的首个项目,则第二项目的第二特征为预置值。In a possible implementation, if the first item is the first item in the target page, the second characteristic of the second item is a preset value.
在一种可能的实现方式中,目标页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the target page contains multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
图12为本申请实施例提供的有向无环图构建装置的一个结构示意图,如图12所示,该装置包括:Figure 12 is a schematic structural diagram of a directed acyclic graph construction device provided by an embodiment of the present application. As shown in Figure 12, the device includes:
获取模块1201,用于获取用户浏览目标页面的眼动数据;The acquisition module 1201 is used to obtain the eye movement data of the user browsing the target page;
确定模块1202,用于基于眼动数据,确定用户对多个项目的浏览行为,多个项目位于目标页面的多个列表中;The determination module 1202 is used to determine the user's browsing behavior for multiple items based on eye movement data, and the multiple items are located in multiple lists on the target page;
连接模块1203,用于基于浏览行为,对多个项目进行连接,得到有向无环图。The connection module 1203 is used to connect multiple items based on browsing behavior to obtain a directed acyclic graph.
本申请实施例中,可基于用户浏览目标页面时所产生的眼动数据,来确定用户对目标页面中多个项目的浏览行为,那么,这些浏览行为(例如,顺序浏览行为以及跳过行为),往往决定了用户对项目的浏览顺序(例如,用户在同一列表中的浏览顺序以及用户在不同列表之间的浏览顺序),从而按照这些浏览来连接目标页面的多个项目,得到针对目标页面的有向无环图,该有向无环图可用于后续对目标页面的用户行为预测中,由于该有向无环图涉及了用户复杂多样的浏览行为,有利于提高对目标页面的用户行为预测的准确度。 In the embodiment of the present application, the user's browsing behavior for multiple items in the target page can be determined based on the eye movement data generated when the user browses the target page. Then, these browsing behaviors (for example, sequential browsing behavior and skipping behavior) , often determines the user's browsing order of items (for example, the user's browsing order in the same list and the user's browsing order between different lists), thereby connecting multiple items of the target page according to these browsing, and obtaining the target page A directed acyclic graph, which can be used in the subsequent prediction of user behavior on the target page. Since the directed acyclic graph involves users' complex and diverse browsing behaviors, it is conducive to improving user behavior on the target page. Prediction accuracy.
在一种可能的实现方式中,连接模块1203,用于将用户以第一顺序浏览的同一列表中的项目,按第一顺序进行连接,并将用户以第二顺序浏览的不同列表中的项目,按第二顺序进行连接,得到有向无环图。In a possible implementation, the connection module 1203 is used to connect items in the same list that the user browses in the first order, and connect items in different lists that the user browses in the second order. , connect in the second order to obtain a directed acyclic graph.
在一种可能的实现方式中,获取模块1201,用于通过眼动仪采集用户浏览目标页面的眼动数据。In a possible implementation, the acquisition module 1201 is configured to collect eye movement data of the user browsing the target page through an eye tracker.
图13为本申请实施例提供的模型训练装置的一个结构示意图,如图13所示,该装置包括:Figure 13 is a schematic structural diagram of a model training device provided by an embodiment of the present application. As shown in Figure 13, the device includes:
第一获取模块1301,用于通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,第一项目和第二项目位于待处理页面的不同列表或同一列表中,且第二项目位于第一项目之前;The first acquisition module 1301 is used to acquire the first feature of the first item and the second feature of the second item through the model to be trained. The first item and the second item are located in different lists or the same list of the page to be processed, and the first item The second item precedes the first item;
第二获取模块1302,用于通过待训练模型基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,其中,第一项目的第一特征为第一项目的属性信息,第一项目的第二特征为基于第一项目的属性信息进行融合得到的信息,第二项目的第二特征为基于第二项目的属性信息(即第二项目的第一特征)进行融合得到的信息;The second acquisition module 1302 is configured to acquire the second feature of the first item based on the first feature of the first item and the second feature of the second item through the model to be trained, where the first feature of the first item is the first Attribute information of the item, the second feature of the first item is the information obtained by fusion based on the attribute information of the first item, and the second feature of the second item is based on the attribute information of the second item (i.e., the first feature of the second item ) information obtained by fusion;
第三获取模块1303,用于通过待训练模型基于第一项目的第二特征,获取第一项目被用户点击的概率;The third acquisition module 1303 is used to obtain the probability that the first item is clicked by the user based on the second feature of the first item through the model to be trained;
第四获取模块1304,用于基于第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,目标损失用于指示第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异;The fourth acquisition module 1304 is used to obtain the target loss based on the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user. The target loss is used to indicate the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user. The difference between the true probability of a user clicking;
更新模块1305,用于基于目标损失,更新待训练模型的参数,直至满足模型训练条件,得到目标模型。The update module 1305 is used to update the parameters of the model to be trained based on the target loss until the model training conditions are met and the target model is obtained.
本申请实施例训练得到的目标模型,具备对页面进行用户行为预测的能力。当需要预测目标页面中第一项目被用户点击的概率时,可向目标模型输入第一项目的第一特征以及第二项目的第二特征,其中,第一项目和第二项目位于目标页面的不同列表或同一列表中,且第二项目位于第一项目之前。那么,目标模型可基于第一项目的第一特征以及第二项目的第二特征,获取第一项目的第二特征,再基于第一项目的第二特征,获取第一项目被用户点击的概率。前述过程中,目标模型在获取第一项目被用户点击的概率时,考虑了位于第一项目之前的第二项目对第一项目所产生的影响,由于第二项目不仅可以是第一项目所在列表中的项目,还可以是其余列表中的项目,故目标模型所考虑的因素较为全面,能够贴合用户在目标页面中浏览至第一项目时的实际情况,故目标模型最终得到的第一项目被用户点击的概率,具备较高的准确度,有利于后续精准为用户推荐其感兴趣的项目。The target model trained in the embodiment of this application has the ability to predict user behavior on the page. When it is necessary to predict the probability that the first item in the target page is clicked by the user, the first feature of the first item and the second feature of the second item can be input to the target model, where the first item and the second item are located on the target page. Different lists or the same list, with the second item before the first item. Then, the target model can obtain the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item, and then obtain the probability that the first item is clicked by the user based on the second characteristic of the first item. . In the aforementioned process, when obtaining the probability that the first item is clicked by the user, the target model considers the impact of the second item before the first item on the first item. Since the second item can not only be the list where the first item is located, The items in can also be items in other lists. Therefore, the factors considered by the target model are relatively comprehensive and can fit the actual situation when the user browses to the first item in the target page. Therefore, the first item finally obtained by the target model The probability of being clicked by the user has a high accuracy, which is conducive to accurately recommending items of interest to the user in the future.
在一种可能的实现方式中,该装置包括:第五获取模块,用于通过待训练模型获取第三项目的第一特征,第一项目和第三项目位于待处理页面的不同列表或同一列表中,且第三项目与第一项目相邻;第六获取模块,用于通过待训练模型基于第一项目的第一特征以及第三项目的第一特征,获取第一项目的第三特征;第三获取模块1303,用于通过待训练模型基于第一项目的第二特征以及第一项目的第三特征,获取第一项目被用户点击的概率。In a possible implementation, the device includes: a fifth acquisition module, configured to acquire the first feature of the third item through the model to be trained, and the first item and the third item are located in different lists or the same list of the page to be processed. , and the third item is adjacent to the first item; the sixth acquisition module is used to obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item through the model to be trained; The third acquisition module 1303 is configured to obtain the probability that the first item is clicked by the user based on the second feature of the first item and the third feature of the first item through the model to be trained.
在一种可能的实现方式中,第二获取模块1302,用于:对第一项目的第一特征进行映射处理,得到第一项目的第四特征;对第二项目的第二特征进行基于自注意力机制的处理,得 到第一项目的第五特征;对第一项目的第四特征以及第一项目的第五特征进行第一融合处理,得到第一项目的第二特征。In a possible implementation, the second acquisition module 1302 is configured to: perform mapping processing on the first feature of the first item to obtain the fourth feature of the first item; perform self-based mapping on the second feature of the second item. Attention mechanism processing, got to the fifth feature of the first item; perform a first fusion process on the fourth feature of the first item and the fifth feature of the first item to obtain the second feature of the first item.
在一种可能的实现方式中,第二获取模块1302,用于:对第一项目的第一特征、用户对待处理页面的请求以及第二项目被用户点击的概率进行映射处理,得到第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第八特征进行第二融合处理,得到第一项目的第四特征。In a possible implementation, the second acquisition module 1302 is configured to perform mapping processing on the first feature of the first item, the user's request for the page to be processed, and the probability that the second item is clicked by the user, to obtain the first item. The sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item; performing a second fusion process on the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item , get the fourth feature of the first item.
在一种可能的实现方式中,第六获取模块,用于:对第一项目的第一特征以及第三项目的第一特征进行映射处理,得到第一项目的第六特征以及第一项目的第九特征;对第一项目的第六特征和第一项目的第九特征进行第三融合处理,得到第一项目的第十特征;对第一项目的第六特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the sixth acquisition module is configured to map the first feature of the first item and the first feature of the third item to obtain the sixth feature of the first item and the first feature of the first item. The ninth characteristic; perform a third fusion process on the sixth characteristic of the first item and the ninth characteristic of the first item to obtain the tenth characteristic of the first item; perform the sixth characteristic of the first item and the tenth characteristic of the first item The features are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,第六获取模块,用于:对用户对待处理页面的请求进行映射处理,得到第一项目的第七特征;对第一项目的第六特征、第一项目的第七特征以及第一项目的第十特征进行第四融合处理,得到第一项目的第三特征。In a possible implementation, the sixth acquisition module is used to: map the user's request for the page to be processed to obtain the seventh feature of the first item; map the sixth feature of the first item, the first item's The seventh feature and the tenth feature of the first item are subjected to the fourth fusion process to obtain the third feature of the first item.
在一种可能的实现方式中,若第一项目为待处理页面中的首个项目,则第二项目的第二特征为预置值。In a possible implementation, if the first item is the first item in the page to be processed, the second characteristic of the second item is a preset value.
在一种可能的实现方式中,待处理页面包含多个列表,位于多个列表中的多个项目构成有向无环图,多个项目包含第一项目、第二项目以及第三项目。In a possible implementation, the page to be processed includes multiple lists, multiple items located in the multiple lists form a directed acyclic graph, and the multiple items include a first item, a second item, and a third item.
需要说明的是,上述装置各模块/单元之间的信息交互、执行过程等内容,由于与本申请方法实施例基于同一构思,其带来的技术效果与本申请方法实施例相同,具体内容可参考本申请实施例前述所示的方法实施例中的叙述,此处不再赘述。It should be noted that the information interaction, execution process, etc. between the modules/units of the above-mentioned device are based on the same concept as the method embodiments of the present application, and the technical effects they bring are the same as those of the method embodiments of the present application. The specific content can be Refer to the description in the method embodiments shown above in the embodiments of the present application, which will not be described again here.
本申请实施例还涉及一种执行设备,图14为本申请实施例提供的执行设备的一个结构示意图。如图14所示,执行设备1400具体可以表现为手机、平板、笔记本电脑、智能穿戴设备、服务器等,此处不做限定。其中,执行设备1400上可部署有图11对应实施例中所描述的用户行为预测装置以及图12对应实施例中所描述的有向无环图构建装置,用于实现图4对应实施例中有向无环图构建的功能以及图8对应实施例中用户行为预测的功能。具体的,执行设备1400包括:接收器1401、发射器1402、处理器1403和存储器1404(其中执行设备1400中的处理器1403的数量可以一个或多个,图14中以一个处理器为例),其中,处理器1403可以包括应用处理器14031和通信处理器14032。在本申请的一些实施例中,接收器1401、发射器1402、处理器1403和存储器1404可通过总线或其它方式连接。The embodiment of the present application also relates to an execution device. Figure 14 is a schematic structural diagram of the execution device provided by the embodiment of the present application. As shown in Figure 14, the execution device 1400 can be embodied as a mobile phone, a tablet, a laptop, a smart wearable device, a server, etc., and is not limited here. Among them, the user behavior prediction device described in the corresponding embodiment of FIG. 11 and the directed acyclic graph construction device described in the corresponding embodiment of FIG. 12 may be deployed on the execution device 1400 to implement the user behavior prediction device described in the corresponding embodiment of FIG. 4 The function of constructing an acyclic graph and the function of predicting user behavior in the corresponding embodiment of FIG. 8 . Specifically, the execution device 1400 includes: a receiver 1401, a transmitter 1402, a processor 1403 and a memory 1404 (the number of processors 1403 in the execution device 1400 can be one or more, one processor is taken as an example in Figure 14) , wherein the processor 1403 may include an application processor 14031 and a communication processor 14032. In some embodiments of the present application, the receiver 1401, the transmitter 1402, the processor 1403, and the memory 1404 may be connected by a bus or other means.
存储器1404可以包括只读存储器和随机存取存储器,并向处理器1403提供指令和数据。存储器1404的一部分还可以包括非易失性随机存取存储器(non-volatile random access memory,NVRAM)。存储器1404存储有处理器和操作指令、可执行模块或者数据结构,或者它们的子集,或者它们的扩展集,其中,操作指令可包括各种操作指令,用于实现各种操作。Memory 1404 may include read-only memory and random access memory and provides instructions and data to processor 1403 . A portion of memory 1404 may also include non-volatile random access memory (NVRAM). The memory 1404 stores processor and operating instructions, executable modules or data structures, or a subset thereof, or an extended set thereof, where the operating instructions may include various operating instructions for implementing various operations.
处理器1403控制执行设备的操作。具体的应用中,执行设备的各个组件通过总线系统耦合在一起,其中总线系统除包括数据总线之外,还可以包括电源总线、控制总线和状态信号总线等。但是为了清楚说明起见,在图中将各种总线都称为总线系统。The processor 1403 controls the execution of operations of the device. In specific applications, various components of the execution device are coupled together through a bus system. In addition to the data bus, the bus system may also include a power bus, a control bus, a status signal bus, etc. However, for the sake of clarity, various buses are called bus systems in the figure.
上述本申请实施例揭示的方法可以应用于处理器1403中,或者由处理器1403实现。处理器1403可以是一种集成电路芯片,具有信号的处理能力。在实现过程中,上述方法的各步 骤可以通过处理器1403中的硬件的集成逻辑电路或者软件形式的指令完成。上述的处理器1403可以是通用处理器、数字信号处理器(digital signal processing,DSP)、微处理器或微控制器,还可进一步包括专用集成电路(application specific integrated circuit,ASIC)、现场可编程门阵列(field-programmable gate array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。该处理器1403可以实现或者执行本申请实施例中的公开的各方法、步骤及逻辑框图。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。结合本申请实施例所公开的方法的步骤可以直接体现为硬件译码处理器执行完成,或者用译码处理器中的硬件及软件模块组合执行完成。软件模块可以位于随机存储器,闪存、只读存储器,可编程只读存储器或者电可擦写可编程存储器、寄存器等本领域成熟的存储介质中。该存储介质位于存储器1404,处理器1403读取存储器1404中的信息,结合其硬件完成上述方法的步骤。The methods disclosed in the above embodiments of the present application can be applied to the processor 1403 or implemented by the processor 1403. The processor 1403 may be an integrated circuit chip with signal processing capabilities. During the implementation process, each step of the above method The steps may be completed by instructions in the form of hardware integrated logic circuits or software in the processor 1403 . The above-mentioned processor 1403 can be a general-purpose processor, a digital signal processor (DSP), a microprocessor or a microcontroller, and can further include an application specific integrated circuit (ASIC), a field programmable Gate array (field-programmable gate array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components. The processor 1403 can implement or execute each method, step and logical block diagram disclosed in the embodiment of this application. A general-purpose processor may be a microprocessor or the processor may be any conventional processor, etc. The steps of the method disclosed in conjunction with the embodiments of the present application can be directly implemented by a hardware decoding processor, or executed by a combination of hardware and software modules in the decoding processor. The software module can be located in random access memory, flash memory, read-only memory, programmable read-only memory or electrically erasable programmable memory, registers and other mature storage media in this field. The storage medium is located in the memory 1404. The processor 1403 reads the information in the memory 1404 and completes the steps of the above method in combination with its hardware.
接收器1401可用于接收输入的数字或字符信息,以及产生与执行设备的相关设置以及功能控制有关的信号输入。发射器1402可用于通过第一接口输出数字或字符信息;发射器1402还可用于通过第一接口向磁盘组发送指令,以修改磁盘组中的数据;发射器1402还可以包括显示屏等显示设备。The receiver 1401 may be configured to receive input numeric or character information and generate signal inputs related to performing relevant settings and functional controls of the device. The transmitter 1402 can be used to output numeric or character information through the first interface; the transmitter 1402 can also be used to send instructions to the disk group through the first interface to modify the data in the disk group; the transmitter 1402 can also include a display device such as a display screen .
本申请实施例中,在一种情况下,处理器1403,用于通过图8对应实施例中的目标模型,对目标页面进行用户行为预测。In the embodiment of the present application, in one case, the processor 1403 is used to predict user behavior for the target page through the target model in the corresponding embodiment of FIG. 8 .
本申请实施例还涉及一种训练设备,图15为本申请实施例提供的训练设备的一个结构示意图。如图15所示,训练设备1500由一个或多个服务器实现,训练设备1500可因配置或性能不同而产生比较大的差异,可以包括一个或一个以上中央处理器(central processing units,CPU)1514(例如,一个或一个以上处理器)和存储器1532,一个或一个以上存储应用程序1542或数据1544的存储介质1530(例如一个或一个以上海量存储设备)。其中,存储器1532和存储介质1530可以是短暂存储或持久存储。存储在存储介质1530的程序可以包括一个或一个以上模块(图示没标出),每个模块可以包括对训练设备中的一系列指令操作。更进一步地,中央处理器1514可以设置为与存储介质1530通信,在训练设备1500上执行存储介质1530中的一系列指令操作。The embodiment of the present application also relates to a training device. Figure 15 is a schematic structural diagram of the training device provided by the embodiment of the present application. As shown in Figure 15, the training device 1500 is implemented by one or more servers. The training device 1500 can vary greatly due to different configurations or performance, and can include one or more central processing units (CPU) 1514 (eg, one or more processors) and memory 1532, one or more storage media 1530 (eg, one or more mass storage devices) storing applications 1542 or data 1544. Among them, the memory 1532 and the storage medium 1530 may be short-term storage or persistent storage. The program stored in the storage medium 1530 may include one or more modules (not shown in the figure), and each module may include a series of instruction operations in the training device. Furthermore, the central processor 1514 may be configured to communicate with the storage medium 1530 and execute a series of instruction operations in the storage medium 1530 on the training device 1500 .
训练设备1500还可以包括一个或一个以上电源1526,一个或一个以上有线或无线网络接口1550,一个或一个以上输入输出接口1558;或,一个或一个以上操作系统1541,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM等等。The training device 1500 may also include one or more power supplies 1526, one or more wired or wireless network interfaces 1550, one or more input and output interfaces 1558; or, one or more operating systems 1541, such as Windows ServerTM, Mac OS XTM , UnixTM, LinuxTM, FreeBSDTM and so on.
具体的,训练设备可以执行图10对应实施例中的模型训练方法。Specifically, the training device can execute the model training method in the corresponding embodiment of Figure 10.
本申请实施例还涉及一种计算机存储介质,该计算机可读存储介质中存储有用于进行信号处理的程序,当其在计算机上运行时,使得计算机执行如前述执行设备所执行的步骤,或者,使得计算机执行如前述训练设备所执行的步骤。Embodiments of the present application also relate to a computer storage medium. The computer-readable storage medium stores a program for performing signal processing. When the program is run on a computer, it causes the computer to perform the steps performed by the aforementioned execution device, or, The computer is caused to perform the steps performed by the aforementioned training device.
本申请实施例还涉及一种计算机程序产品,该计算机程序产品存储有指令,该指令在由计算机执行时使得计算机执行如前述执行设备所执行的步骤,或者,使得计算机执行如前述训练设备所执行的步骤。Embodiments of the present application also relate to a computer program product that stores instructions that, when executed by a computer, cause the computer to perform the steps performed by the foregoing execution device, or cause the computer to perform the steps performed by the foregoing training device. A step of.
本申请实施例提供的执行设备、训练设备或终端设备具体可以为芯片,芯片包括:处理单元和通信单元,所述处理单元例如可以是处理器,所述通信单元例如可以是输入/输出接口、 管脚或电路等。该处理单元可执行存储单元存储的计算机执行指令,以使执行设备内的芯片执行上述实施例描述的数据处理方法,或者,以使训练设备内的芯片执行上述实施例描述的数据处理方法。可选地,所述存储单元为所述芯片内的存储单元,如寄存器、缓存等,所述存储单元还可以是所述无线接入设备端内的位于所述芯片外部的存储单元,如只读存储器(read-only memory,ROM)或可存储静态信息和指令的其他类型的静态存储设备,随机存取存储器(random access memory,RAM)等。The execution device, training device or terminal device provided by the embodiment of the present application may specifically be a chip. The chip includes: a processing unit and a communication unit. The processing unit may be, for example, a processor. The communication unit may be, for example, an input/output interface. Pins or circuits, etc. The processing unit can execute the computer execution instructions stored in the storage unit, so that the chip in the execution device executes the data processing method described in the above embodiment, or so that the chip in the training device executes the data processing method described in the above embodiment. Optionally, the storage unit is a storage unit within the chip, such as a register, cache, etc. The storage unit may also be a storage unit located outside the chip in the wireless access device, such as Read-only memory (ROM) or other types of static storage devices that can store static information and instructions, random access memory (random access memory, RAM), etc.
具体的,请参阅图16,图16为本申请实施例提供的芯片的一个结构示意图,所述芯片可以表现为神经网络处理器NPU 1600,NPU 1600作为协处理器挂载到主CPU(Host CPU)上,由Host CPU分配任务。NPU的核心部分为运算电路1603,通过控制器1604控制运算电路1603提取存储器中的矩阵数据并进行乘法运算。Specifically, please refer to Figure 16. Figure 16 is a schematic structural diagram of a chip provided by an embodiment of the present application. The chip can be represented as a neural network processor NPU 1600. The NPU 1600 serves as a co-processor and is mounted to the host CPU (Host CPU). ), tasks are allocated by the Host CPU. The core part of the NPU is the arithmetic circuit 1603. The arithmetic circuit 1603 is controlled by the controller 1604 to extract the matrix data in the memory and perform multiplication operations.
在一些实现中,运算电路1603内部包括多个处理单元(Process Engine,PE)。在一些实现中,运算电路1603是二维脉动阵列。运算电路1603还可以是一维脉动阵列或者能够执行例如乘法和加法这样的数学运算的其它电子线路。在一些实现中,运算电路1603是通用的矩阵处理器。In some implementations, the computing circuit 1603 includes multiple processing units (Process Engine, PE). In some implementations, arithmetic circuit 1603 is a two-dimensional systolic array. The arithmetic circuit 1603 may also be a one-dimensional systolic array or other electronic circuit capable of performing mathematical operations such as multiplication and addition. In some implementations, arithmetic circuit 1603 is a general-purpose matrix processor.
举例来说,假设有输入矩阵A,权重矩阵B,输出矩阵C。运算电路从权重存储器1602中取矩阵B相应的数据,并缓存在运算电路中每一个PE上。运算电路从输入存储器1601中取矩阵A数据与矩阵B进行矩阵运算,得到的矩阵的部分结果或最终结果,保存在累加器(accumulator)1608中。For example, assume there is an input matrix A, a weight matrix B, and an output matrix C. The arithmetic circuit obtains the corresponding data of matrix B from the weight memory 1602 and caches it on each PE in the arithmetic circuit. The operation circuit takes matrix A data and matrix B from the input memory 1601 to perform matrix operations, and the partial result or final result of the matrix is stored in an accumulator (accumulator) 1608 .
统一存储器1606用于存放输入数据以及输出数据。权重数据直接通过存储单元访问控制器(Direct Memory Access Controller,DMAC)1605,DMAC被搬运到权重存储器1602中。输入数据也通过DMAC被搬运到统一存储器1606中。The unified memory 1606 is used to store input data and output data. The weight data directly passes through the storage unit access controller (Direct Memory Access Controller, DMAC) 1605, and the DMAC is transferred to the weight memory 1602. Input data is also transferred to unified memory 1606 via DMAC.
BIU为Bus Interface Unit即,总线接口单元1613,用于AXI总线与DMAC和取指存储器(Instruction Fetch Buffer,IFB)1609的交互。BIU is the Bus Interface Unit, that is, the bus interface unit 1613, which is used for the interaction between the AXI bus and the DMAC and the Instruction Fetch Buffer (IFB) 1609.
总线接口单元1613(Bus Interface Unit,简称BIU),用于取指存储器1609从外部存储器获取指令,还用于存储单元访问控制器1605从外部存储器获取输入矩阵A或者权重矩阵B的原数据。The bus interface unit 1613 (Bus Interface Unit, BIU for short) is used to fetch the memory 1609 to obtain instructions from the external memory, and is also used for the storage unit access controller 1605 to obtain the original data of the input matrix A or the weight matrix B from the external memory.
DMAC主要用于将外部存储器DDR中的输入数据搬运到统一存储器1606或将权重数据搬运到权重存储器1602中或将输入数据数据搬运到输入存储器1601中。DMAC is mainly used to transfer the input data in the external memory DDR to the unified memory 1606 or the weight data to the weight memory 1602 or the input data to the input memory 1601 .
向量计算单元1607包括多个运算处理单元,在需要的情况下,对运算电路1603的输出做进一步处理,如向量乘,向量加,指数运算,对数运算,大小比较等等。主要用于神经网络中非卷积/全连接层网络计算,如Batch Normalization(批归一化),像素级求和,对预测标签平面进行上采样等。The vector calculation unit 1607 includes multiple arithmetic processing units, and if necessary, further processes the output of the arithmetic circuit 1603, such as vector multiplication, vector addition, exponential operation, logarithmic operation, size comparison, etc. It is mainly used for non-convolutional/fully connected layer network calculations in neural networks, such as Batch Normalization, pixel-level summation, upsampling of predicted label planes, etc.
在一些实现中,向量计算单元1607能将经处理的输出的向量存储到统一存储器1606。例如,向量计算单元1607可以将线性函数;或,非线性函数应用到运算电路1603的输出,例如对卷积层提取的预测标签平面进行线性插值,再例如累加值的向量,用以生成激活值。在一些实现中,向量计算单元1607生成归一化的值、像素级求和的值,或二者均有。在一些实现中,处理过的输出的向量能够用作到运算电路1603的激活输入,例如用于在神经网络中的后续层中的使用。 In some implementations, vector calculation unit 1607 can store the processed output vectors to unified memory 1606 . For example, the vector calculation unit 1607 can apply a linear function; or a nonlinear function to the output of the operation circuit 1603, such as linear interpolation on the prediction label plane extracted by the convolution layer, or a vector of accumulated values, to generate an activation value. . In some implementations, vector calculation unit 1607 generates normalized values, pixel-wise summed values, or both. In some implementations, the processed output vector can be used as an activation input to the arithmetic circuit 1603, such as for use in a subsequent layer in a neural network.
控制器1604连接的取指存储器(instruction fetch buffer)1609,用于存储控制器1604使用的指令;The instruction fetch buffer 1609 connected to the controller 1604 is used to store instructions used by the controller 1604;
统一存储器1606,输入存储器1601,权重存储器1602以及取指存储器1609均为On-Chip存储器。外部存储器私有于该NPU硬件架构。The unified memory 1606, the input memory 1601, the weight memory 1602 and the fetch memory 1609 are all On-Chip memories. External memory is private to the NPU hardware architecture.
其中,上述任一处提到的处理器,可以是一个通用中央处理器,微处理器,ASIC,或一个或多个用于控制上述程序执行的集成电路。The processor mentioned in any of the above places can be a general central processing unit, a microprocessor, an ASIC, or one or more integrated circuits used to control the execution of the above programs.
另外需说明的是,以上所描述的装置实施例仅仅是示意性的,其中所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部模块来实现本实施例方案的目的。另外,本申请提供的装置实施例附图中,模块之间的连接关系表示它们之间具有通信连接,具体可以实现为一条或多条通信总线或信号线。In addition, it should be noted that the device embodiments described above are only illustrative. The units described as separate components may or may not be physically separated, and the components shown as units may or may not be physically separate. The physical unit can be located in one place, or it can be distributed across multiple network units. Some or all of the modules can be selected according to actual needs to achieve the purpose of the solution of this embodiment. In addition, in the drawings of the device embodiments provided in this application, the connection relationship between modules indicates that there are communication connections between them, which can be specifically implemented as one or more communication buses or signal lines.
通过以上的实施方式的描述,所属领域的技术人员可以清楚地了解到本申请可借助软件加必需的通用硬件的方式来实现,当然也可以通过专用硬件包括专用集成电路、专用CPU、专用存储器、专用元器件等来实现。一般情况下,凡由计算机程序完成的功能都可以很容易地用相应的硬件来实现,而且,用来实现同一功能的具体硬件结构也可以是多种多样的,例如模拟电路、数字电路或专用电路等。但是,对本申请而言更多情况下软件程序实现是更佳的实施方式。基于这样的理解,本申请的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来,该计算机软件产品存储在可读取的存储介质中,如计算机的软盘、U盘、移动硬盘、ROM、RAM、磁碟或者光盘等,包括若干指令用以使得一台计算机设备(可以是个人计算机,训练设备,或者网络设备等)执行本申请各个实施例所述的方法。Through the above description of the embodiments, those skilled in the art can clearly understand that the present application can be implemented by software plus necessary general hardware. Of course, it can also be implemented by dedicated hardware including dedicated integrated circuits, dedicated CPUs, dedicated memories, Special components, etc. to achieve. In general, all functions performed by computer programs can be easily implemented with corresponding hardware. Moreover, the specific hardware structures used to implement the same function can also be diverse, such as analog circuits, digital circuits or special-purpose circuits. circuit etc. However, for this application, software program implementation is a better implementation in most cases. Based on this understanding, the technical solution of the present application can be embodied in the form of a software product in essence or that contributes to the existing technology. The computer software product is stored in a readable storage medium, such as a computer floppy disk. , U disk, mobile hard disk, ROM, RAM, magnetic disk or optical disk, etc., including several instructions to cause a computer device (which can be a personal computer, training device, or network device, etc.) to execute the steps described in various embodiments of this application. method.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product.
所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本申请实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、训练设备或数据中心通过有线(例如同轴电缆、光纤、数字用户线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、训练设备或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存储的任何可用介质或者是包含一个或多个可用介质集成的训练设备、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘(Solid State Disk,SSD))等。 The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on a computer, the processes or functions described in the embodiments of the present application are generated in whole or in part. The computer may be a general-purpose computer, a special-purpose computer, a computer network, or other programmable device. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another, for example, the computer instructions may be transferred from a website, computer, training device, or data The center transmits to another website site, computer, training equipment or data center through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means. The computer-readable storage medium may be any available medium that a computer can store, or a data storage device such as a training device or a data center integrated with one or more available media. The available media may be magnetic media (eg, floppy disk, hard disk, magnetic tape), optical media (eg, DVD), or semiconductor media (eg, solid state disk (Solid State Disk, SSD)), etc.

Claims (29)

  1. 一种用户行为预测方法,其特征在于,所述方法通过目标模型实现,所述方法包括:A user behavior prediction method, characterized in that the method is implemented through a target model, and the method includes:
    获取第一项目的第一特征以及第二项目的第二特征,所述第一项目和所述第二项目位于目标页面的不同列表或同一列表中,且所述第二项目位于所述第一项目之前;Obtain the first characteristic of the first item and the second characteristic of the second item, the first item and the second item are located in different lists or the same list of the target page, and the second item is located in the first Before the project;
    基于所述第一项目的第一特征以及所述第二项目的第二特征,获取所述第一项目的第二特征,第一特征为属性信息,第二特征为基于所述属性信息进行融合后得到的信息;Based on the first feature of the first item and the second feature of the second item, the second feature of the first item is obtained. The first feature is attribute information, and the second feature is fusion based on the attribute information. information obtained later;
    基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率。Based on the second characteristic of the first item, a probability that the first item is clicked by the user is obtained.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1, further comprising:
    获取所述第三项目的第一特征,所述第一项目和所述第三项目位于所述目标页面的不同列表或同一列表中,且所述第三项目与所述第一项目相邻;Obtain the first characteristic of the third item, the first item and the third item are located in different lists or the same list of the target page, and the third item is adjacent to the first item;
    基于所述第一项目的第一特征以及所述第三项目的第一特征,获取所述第一项目的第三特征;Obtaining a third characteristic of the first item based on the first characteristic of the first item and the first characteristic of the third item;
    所述基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率包括:Obtaining the probability that the first item is clicked by the user based on the second characteristic of the first item includes:
    基于所述第一项目的第二特征以及所述第一项目的第三特征,获取所述第一项目被用户点击的概率。Based on the second characteristic of the first item and the third characteristic of the first item, a probability that the first item is clicked by the user is obtained.
  3. 根据权利要求1所述的方法,其特征在于,所述基于所述第一项目的第一特征以及所述第二项目的第二特征,获取所述第一项目的第二特征包括:The method according to claim 1, characterized in that, based on the first characteristic of the first item and the second characteristic of the second item, obtaining the second characteristic of the first item includes:
    对所述第一项目的第一特征进行映射处理,得到所述第一项目的第四特征;Perform mapping processing on the first feature of the first item to obtain a fourth feature of the first item;
    对所述第二项目的第二特征进行基于自注意力机制的处理,得到所述第一项目的第五特征;Perform processing based on the self-attention mechanism on the second feature of the second item to obtain the fifth feature of the first item;
    对所述第一项目的第四特征以及所述第一项目的第五特征进行第一融合处理,得到所述第一项目的第二特征。Perform a first fusion process on the fourth feature of the first item and the fifth feature of the first item to obtain the second feature of the first item.
  4. 根据权利要求3所述的方法,其特征在于,所述对所述第一项目的第一特征进行映射处理,得到所述第一项目的第四特征:The method according to claim 3, characterized in that the mapping process is performed on the first feature of the first item to obtain a fourth feature of the first item:
    对所述第一项目的第一特征、用户对所述目标页面的请求以及所述第二项目被用户点击的概率进行映射处理,得到所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第八特征;Perform mapping processing on the first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user, to obtain the sixth feature of the first item, the first item The seventh characteristic and the eighth characteristic of the first item;
    对所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第八特征进行第二融合处理,得到所述第一项目的第四特征。Perform a second fusion process on the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item to obtain the fourth feature of the first item.
  5. 根据权利要求2所述的方法,其特征在于,所述基于所述第一项目的第一特征以及所述第三项目的第一特征,获取所述第一项目的第三特征包括:The method of claim 2, wherein obtaining the third characteristic of the first item based on the first characteristic of the first item and the first characteristic of the third item includes:
    对所述第一项目的第一特征以及所述第三项目的第一特征进行映射处理,得到所述第一项目的第六特征以及所述第一项目的第九特征;Perform mapping processing on the first feature of the first item and the first feature of the third item to obtain the sixth feature of the first item and the ninth feature of the first item;
    对所述第一项目的第六特征和所述第一项目的第九特征进行第三融合处理,得到所述第一项目的第十特征;Perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item to obtain the tenth feature of the first item;
    对所述第一项目的第六特征以及所述第一项目的第十特征进行第四融合处理,得到所述第一项目的第三特征。Perform a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
  6. 根据权利要求5所述的方法,其特征在于,所述第一项目的第六特征以及所述第一项 目的第十特征进行第四融合处理,得到所述第一项目的第三特征包括:The method of claim 5, wherein the sixth characteristic of the first item and the first item The tenth feature of the object is subjected to the fourth fusion process to obtain the third feature of the first item including:
    对所述用户对所述目标页面的请求进行映射处理,得到所述第一项目的第七特征;Perform mapping processing on the user's request for the target page to obtain the seventh feature of the first item;
    对所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第十特征进行第四融合处理,得到所述第一项目的第三特征。Perform a fourth fusion process on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item to obtain the third feature of the first item.
  7. 根据权利要求1至6任意一项所述的方法,其特征在于,若所述第一项目为目标页面中的首个项目,则所述第二项目的第二特征为预置值。The method according to any one of claims 1 to 6, characterized in that if the first item is the first item in the target page, the second characteristic of the second item is a preset value.
  8. 根据权利要求1至7任意一项所述的方法,其特征在于,所述目标页面包含多个列表,位于多个列表中的多个项目构成有向无环图,所述多个项目包含所述第一项目、所述第二项目以及所述第三项目。The method according to any one of claims 1 to 7, characterized in that the target page contains multiple lists, multiple items located in the multiple lists constitute a directed acyclic graph, and the multiple items include all the first item, the second item and the third item.
  9. 一种有向无环图构建方法,其特征在于,所述方法包括:A method for constructing a directed acyclic graph, characterized in that the method includes:
    获取用户浏览目标页面的眼动数据;Obtain the eye movement data of users browsing the target page;
    基于所述眼动数据,确定所述用户对多个项目的浏览行为,所述多个项目位于所述目标页面的多个列表中;Based on the eye movement data, determining the user's browsing behavior for a plurality of items located in a plurality of lists on the target page;
    基于所述浏览行为,对所述多个项目进行连接,得到有向无环图。Based on the browsing behavior, the multiple items are connected to obtain a directed acyclic graph.
  10. 根据权利要求9所述的方法,其特征在于,所述基于所述浏览行为,对所述多个项目进行连接,得到有向无环图包括:The method according to claim 9, characterized in that, based on the browsing behavior, connecting the plurality of items to obtain a directed acyclic graph includes:
    将所述用户以第一顺序浏览的同一列表中的项目,按所述第一顺序进行连接,并将所述用户以第二顺序浏览的不同列表中的项目,按所述第二顺序进行连接,得到有向无环图。The items in the same list browsed by the user in the first order are connected in the first order, and the items in different lists browsed by the user in the second order are connected in the second order. , get the directed acyclic graph.
  11. 根据权利要求9或10所述的方法,其特征在于,所述获取用户浏览目标页面的眼动数据:The method according to claim 9 or 10, characterized in that said obtaining the eye movement data of the user browsing the target page:
    通过眼动仪采集用户浏览目标页面的眼动数据。Use an eye tracker to collect eye movement data of users browsing the target page.
  12. 一种模型训练方法,其特征在于,所述方法包括:A model training method, characterized in that the method includes:
    通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,所述第一项目和所述第二项目位于待处理页面的不同列表或同一列表中,且所述第二项目位于所述第一项目之前;The first feature of the first item and the second feature of the second item are obtained through the model to be trained, the first item and the second item are located in different lists or the same list of the page to be processed, and the second item located before said first item;
    通过所述待训练模型基于所述第一项目的第一特征以及所述第二项目的第二特征,获取所述第一项目的第二特征,第一特征为属性信息,第二特征为基于所述属性信息进行融合后得到的信息;The model to be trained obtains the second feature of the first item based on the first feature of the first item and the second feature of the second item. The first feature is attribute information, and the second feature is based on Information obtained after fusion of the attribute information;
    通过所述待训练模型基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率;Obtain the probability that the first item is clicked by the user based on the second feature of the first item through the model to be trained;
    基于所述第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,所述目标损失用于指示所述第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异;Based on the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user, a target loss is obtained, the target loss is used to indicate the probability that the first item is clicked by the user and the first item is clicked by the user The difference between the true probabilities;
    基于所述目标损失,更新所述待训练模型的参数,直至满足模型训练条件,得到目标模型。Based on the target loss, the parameters of the model to be trained are updated until the model training conditions are met, and the target model is obtained.
  13. 根据权利要求12所述的方法,其特征在于,所述方法还包括:The method of claim 12, further comprising:
    通过所述待训练模型获取所述第三项目的第一特征,所述第一项目和所述第三项目位于所述待处理页面的不同列表或同一列表中,且所述第三项目与所述第一项目相邻; The first feature of the third item is obtained through the model to be trained, the first item and the third item are located in different lists or the same list of the page to be processed, and the third item is the same as the third item. The first item mentioned above is adjacent;
    通过所述待训练模型基于所述第一项目的第一特征以及所述第三项目的第一特征,获取所述第一项目的第三特征;Obtain the third feature of the first item based on the first feature of the first item and the first feature of the third item by the to-be-trained model;
    所述通过所述待训练模型基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率包括:The method of obtaining the probability that the first item is clicked by the user based on the second feature of the first item through the model to be trained includes:
    通过所述待训练模型基于所述第一项目的第二特征以及所述第一项目的第三特征,获取所述第一项目被用户点击的概率。The probability that the first item is clicked by the user is obtained by using the to-be-trained model based on the second feature of the first item and the third feature of the first item.
  14. 一种用户行为预测装置,其特征在于,所述装置包括:A user behavior prediction device, characterized in that the device includes:
    第一获取模块,用于通过目标模型获取第一项目的第一特征以及第二项目的第二特征,所述第一项目和所述第二项目位于目标页面的不同列表或同一列表中,且所述第二项目位于所述第一项目之前;A first acquisition module configured to acquire the first feature of the first item and the second feature of the second item through the target model, where the first item and the second item are located in different lists or the same list on the target page, and The second item is located before the first item;
    第二获取模块,用于通过目标模型基于所述第一项目的第一特征以及所述第二项目的第二特征,获取所述第一项目的第二特征,第一特征为属性信息,第二特征为基于所述属性信息进行融合后得到的信息;The second acquisition module is configured to acquire the second characteristics of the first item based on the first characteristics of the first item and the second characteristics of the second item through the target model. The first characteristics are attribute information. The second feature is information obtained after fusion based on the attribute information;
    第三获取模块,用于通过目标模型基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率。The third acquisition module is configured to acquire the probability that the first item is clicked by the user based on the second feature of the first item through the target model.
  15. 根据权利要求14所述的装置,其特征在于,所述装置还包括:The device according to claim 14, characterized in that the device further includes:
    第四获取模块,用于通过目标模型获取所述第三项目的第一特征,所述第一项目和所述第三项目位于所述目标页面的不同列表或同一列表中,且所述第三项目与所述第一项目相邻;The fourth acquisition module is used to acquire the first characteristics of the third item through the target model. The first item and the third item are located in different lists or the same list of the target page, and the third item an item adjacent to said first item;
    第五获取模块,用于通过目标模型基于所述第一项目的第一特征以及所述第三项目的第一特征,获取所述第一项目的第三特征;A fifth acquisition module, configured to acquire the third feature of the first item based on the first feature of the first item and the first feature of the third item through the target model;
    所述第三获取模块,用于通过目标模型基于所述第一项目的第二特征以及所述第一项目的第三特征,获取所述第一项目被用户点击的概率。The third acquisition module is configured to acquire the probability that the first item is clicked by the user based on the second characteristic of the first item and the third characteristic of the first item through a target model.
  16. 根据权利要求14所述的装置,其特征在于,所述第二获取模块,用于:The device according to claim 14, characterized in that the second acquisition module is used for:
    通过目标模型对所述第一项目的第一特征进行映射处理,得到所述第一项目的第四特征;Perform mapping processing on the first feature of the first item through the target model to obtain the fourth feature of the first item;
    通过目标模型对所述第二项目的第二特征进行基于自注意力机制的处理,得到所述第一项目的第五特征;The second feature of the second item is processed based on the self-attention mechanism through the target model to obtain the fifth feature of the first item;
    通过目标模型对所述第一项目的第四特征以及所述第一项目的第五特征进行第一融合处理,得到所述第一项目的第二特征。The fourth feature of the first item and the fifth feature of the first item are subjected to a first fusion process through the target model to obtain the second feature of the first item.
  17. 根据权利要求16所述的装置,其特征在于,所述第二获取模块,用于:The device according to claim 16, characterized in that the second acquisition module is used for:
    通过目标模型对所述第一项目的第一特征、用户对所述目标页面的请求以及所述第二项目被用户点击的概率进行映射处理,得到所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第八特征;The first feature of the first item, the user's request for the target page, and the probability that the second item is clicked by the user are mapped through the target model to obtain the sixth feature of the first item, the The seventh feature of the first item and the eighth feature of said first item;
    通过目标模型对所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第八特征进行第二融合处理,得到所述第一项目的第四特征。The fourth feature of the first item is obtained by performing a second fusion process on the sixth feature of the first item, the seventh feature of the first item, and the eighth feature of the first item through the target model.
  18. 根据权利要求17所述的装置,其特征在于,所述第五获取模块,用于:The device according to claim 17, characterized in that the fifth acquisition module is used for:
    通过目标模型对所述第一项目的第一特征以及所述第三项目的第一特征进行映射处理,得到所述第一项目的第六特征以及所述第一项目的第九特征;The first feature of the first item and the first feature of the third item are mapped through the target model to obtain the sixth feature of the first item and the ninth feature of the first item;
    通过目标模型对所述第一项目的第六特征和所述第一项目的第九特征进行第三融合处理, 得到所述第一项目的第十特征;Perform a third fusion process on the sixth feature of the first item and the ninth feature of the first item through the target model, Obtain the tenth characteristic of the first item;
    通过目标模型对所述第一项目的第六特征以及所述第一项目的第十特征进行第四融合处理,得到所述第一项目的第三特征。The target model performs a fourth fusion process on the sixth feature of the first item and the tenth feature of the first item to obtain the third feature of the first item.
  19. 根据权利要求18所述的装置,其特征在于,所述第五获取模块,用于:The device according to claim 18, characterized in that the fifth acquisition module is used for:
    通过目标模型对所述用户对所述目标页面的请求进行映射处理,得到所述第一项目的第七特征;Perform mapping processing on the user's request for the target page through a target model to obtain the seventh feature of the first item;
    通过目标模型对所述第一项目的第六特征、所述第一项目的第七特征以及所述第一项目的第十特征进行第四融合处理,得到所述第一项目的第三特征。The third feature of the first item is obtained by performing a fourth fusion process on the sixth feature of the first item, the seventh feature of the first item, and the tenth feature of the first item through the target model.
  20. 根据权利要求14至19任意一项所述的装置,其特征在于,若所述第一项目为目标页面中的首个项目,则所述第二项目的第二特征为预置值。The device according to any one of claims 14 to 19, characterized in that if the first item is the first item in the target page, the second characteristic of the second item is a preset value.
  21. 根据权利要求14至20任意一项所述的装置,其特征在于,所述目标页面包含多个列表,位于多个列表中的多个项目构成有向无环图,所述多个项目包含所述第一项目、所述第二项目以及所述第三项目。The device according to any one of claims 14 to 20, wherein the target page includes multiple lists, multiple items located in the multiple lists constitute a directed acyclic graph, and the multiple items include all the first item, the second item and the third item.
  22. 一种有向无环图构建装置,其特征在于,所述装置包括:A directed acyclic graph construction device, characterized in that the device includes:
    获取模块,用于获取用户浏览目标页面的眼动数据;The acquisition module is used to obtain eye movement data of users browsing the target page;
    确定模块,用于基于所述眼动数据,确定所述用户对多个项目的浏览行为,所述多个项目位于所述目标页面的多个列表中;A determining module, configured to determine the user's browsing behavior for multiple items based on the eye movement data, where the multiple items are located in multiple lists of the target page;
    连接模块,用于基于所述浏览行为,对所述多个项目进行连接,得到有向无环图。The connection module is used to connect the multiple items based on the browsing behavior to obtain a directed acyclic graph.
  23. 根据权利要求22所述的装置,其特征在于,所述连接模块,用于将所述用户以第一顺序浏览的同一列表中的项目,按所述第一顺序进行连接,并将所述用户以第二顺序浏览的不同列表中的项目,按所述第二顺序进行连接,得到有向无环图。The device according to claim 22, characterized in that the connection module is used to connect items in the same list browsed by the user in the first order in the first order, and connect the user Items in different lists browsed in the second order are connected according to the second order to obtain a directed acyclic graph.
  24. 根据权利要求22或23所述的装置,其特征在于,所述获取模块,用于通过眼动仪采集用户浏览目标页面的眼动数据。The device according to claim 22 or 23, characterized in that the acquisition module is used to collect the eye movement data of the user browsing the target page through an eye tracker.
  25. 一种模型训练装置,其特征在于,所述装置包括:A model training device, characterized in that the device includes:
    第一获取模块,用于通过待训练模型获取第一项目的第一特征以及第二项目的第二特征,所述第一项目和所述第二项目位于待处理页面的不同列表或同一列表中,且所述第二项目位于所述第一项目之前;A first acquisition module, configured to acquire the first feature of the first item and the second feature of the second item through the model to be trained, where the first item and the second item are located in different lists or the same list of the page to be processed. , and the second item is located before the first item;
    第二获取模块,用于通过所述待训练模型基于所述第一项目的第一特征以及所述第二项目的第二特征,获取所述第一项目的第二特征,第一特征为属性信息,第二特征为基于所述属性信息进行融合后得到的信息;The second acquisition module is used to acquire the second characteristic of the first item based on the first characteristic of the first item and the second characteristic of the second item through the to-be-trained model, where the first characteristic is an attribute. Information, the second feature is information obtained after fusion based on the attribute information;
    第三获取模块,用于通过所述待训练模型基于所述第一项目的第二特征,获取所述第一项目被用户点击的概率;A third acquisition module, configured to acquire the probability that the first item is clicked by the user based on the second feature of the first item through the model to be trained;
    第四获取模块,用于基于所述第一项目被用户点击的概率以及第一项目被用户点击的真实概率,获取目标损失,所述目标损失用于指示所述第一项目被用户点击的概率以及第一项目被用户点击的真实概率之间的差异;The fourth acquisition module is used to obtain a target loss based on the probability that the first item is clicked by the user and the real probability that the first item is clicked by the user. The target loss is used to indicate the probability that the first item is clicked by the user. and the difference between the real probability of the first item being clicked by the user;
    更新模块,用于基于所述目标损失,更新所述待训练模型的参数,直至满足模型训练条件,得到目标模型。An update module, configured to update the parameters of the model to be trained based on the target loss until the model training conditions are met to obtain the target model.
  26. 根据权利要求25所述的装置,其特征在于,所述装置包括: The device according to claim 25, characterized in that the device includes:
    第五获取模块,用于通过所述待训练模型获取所述第三项目的第一特征,所述第一项目和所述第三项目位于所述待处理页面的不同列表或同一列表中,且所述第三项目与所述第一项目相邻;A fifth acquisition module, configured to acquire the first feature of the third item through the model to be trained, where the first item and the third item are located in different lists or the same list of the page to be processed, and The third item is adjacent to the first item;
    第六获取模块,用于通过所述待训练模型基于所述第一项目的第一特征以及所述第三项目的第一特征,获取所述第一项目的第三特征;A sixth acquisition module, configured to acquire the third feature of the first item based on the first feature of the first item and the first feature of the third item through the to-be-trained model;
    所述第三获取模块,用于通过所述待训练模型基于所述第一项目的第二特征以及所述第一项目的第三特征,获取所述第一项目被用户点击的概率。The third acquisition module is configured to acquire the probability that the first item is clicked by the user based on the second characteristic of the first item and the third characteristic of the first item through the model to be trained.
  27. 一种用户行为预测装置,其特征在于,所述装置包括存储器和处理器;A user behavior prediction device, characterized in that the device includes a memory and a processor;
    所述存储器存储有代码,所述处理器被配置为执行所述代码,当所述代码被执行时,所述装置执行如权利要求1至13任一项所述的方法。The memory stores code, the processor is configured to execute the code, and when the code is executed, the device performs the method of any one of claims 1 to 13.
  28. 一种计算机存储介质,其特征在于,所述计算机存储介质存储有计算机程序,该程序由计算机执行时,使得所述计算机实施权利要求1至13任一项所述的方法。A computer storage medium, characterized in that the computer storage medium stores a computer program, which when executed by a computer causes the computer to implement the method described in any one of claims 1 to 13.
  29. 一种计算机程序产品,其特征在于,所述计算机程序产品存储有指令,所述指令在由计算机执行时,使得所述计算机实施权利要求1至13任一项所述的方法。 A computer program product, characterized in that the computer program product stores instructions, which when executed by a computer, cause the computer to implement the method described in any one of claims 1 to 13.
PCT/CN2023/086192 2022-04-12 2023-04-04 User behavior prediction method and related device thereof WO2023197910A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202210379948.7A CN114707070A (en) 2022-04-12 2022-04-12 User behavior prediction method and related equipment thereof
CN202210379948.7 2022-04-12

Publications (1)

Publication Number Publication Date
WO2023197910A1 true WO2023197910A1 (en) 2023-10-19

Family

ID=82175423

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2023/086192 WO2023197910A1 (en) 2022-04-12 2023-04-04 User behavior prediction method and related device thereof

Country Status (2)

Country Link
CN (1) CN114707070A (en)
WO (1) WO2023197910A1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114707070A (en) * 2022-04-12 2022-07-05 华为技术有限公司 User behavior prediction method and related equipment thereof

Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103500011A (en) * 2013-10-08 2014-01-08 百度在线网络技术(北京)有限公司 Eye movement track law analysis method and device
CN104146680A (en) * 2014-09-01 2014-11-19 北京工业大学 Eye movement measuring method and system
CN109816101A (en) * 2019-01-31 2019-05-28 中科人工智能创新技术研究院(青岛)有限公司 A kind of session sequence of recommendation method and system based on figure convolutional neural networks
CN109840782A (en) * 2017-11-24 2019-06-04 腾讯科技(深圳)有限公司 Clicking rate prediction technique, device, server and storage medium
CN110569437A (en) * 2019-09-05 2019-12-13 腾讯科技(深圳)有限公司 click probability prediction and page content recommendation methods and devices
CN111046257A (en) * 2019-12-09 2020-04-21 北京百度网讯科技有限公司 Session recommendation method and device and electronic equipment
CN111259222A (en) * 2020-01-22 2020-06-09 北京百度网讯科技有限公司 Article recommendation method, system, electronic device and storage medium
CN111259133A (en) * 2020-01-17 2020-06-09 成都信息工程大学 Personalized recommendation method integrating multiple information
CN111400592A (en) * 2020-03-12 2020-07-10 山东师范大学 Personalized course recommendation method and system based on eye movement technology and deep learning
CN112148975A (en) * 2020-09-21 2020-12-29 北京百度网讯科技有限公司 Session recommendation method, device and equipment
US20210133612A1 (en) * 2019-10-31 2021-05-06 Adobe Inc. Graph data structure for using inter-feature dependencies in machine-learning
CN112948681A (en) * 2021-03-12 2021-06-11 北京交通大学 Time series data recommendation method fusing multi-dimensional features
CN114240555A (en) * 2021-12-17 2022-03-25 北京沃东天骏信息技术有限公司 Click rate prediction model training method and device and click rate prediction method and device
CN114707070A (en) * 2022-04-12 2022-07-05 华为技术有限公司 User behavior prediction method and related equipment thereof

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103500011A (en) * 2013-10-08 2014-01-08 百度在线网络技术(北京)有限公司 Eye movement track law analysis method and device
CN104146680A (en) * 2014-09-01 2014-11-19 北京工业大学 Eye movement measuring method and system
CN109840782A (en) * 2017-11-24 2019-06-04 腾讯科技(深圳)有限公司 Clicking rate prediction technique, device, server and storage medium
CN109816101A (en) * 2019-01-31 2019-05-28 中科人工智能创新技术研究院(青岛)有限公司 A kind of session sequence of recommendation method and system based on figure convolutional neural networks
CN110569437A (en) * 2019-09-05 2019-12-13 腾讯科技(深圳)有限公司 click probability prediction and page content recommendation methods and devices
US20210133612A1 (en) * 2019-10-31 2021-05-06 Adobe Inc. Graph data structure for using inter-feature dependencies in machine-learning
CN111046257A (en) * 2019-12-09 2020-04-21 北京百度网讯科技有限公司 Session recommendation method and device and electronic equipment
CN111259133A (en) * 2020-01-17 2020-06-09 成都信息工程大学 Personalized recommendation method integrating multiple information
CN111259222A (en) * 2020-01-22 2020-06-09 北京百度网讯科技有限公司 Article recommendation method, system, electronic device and storage medium
CN111400592A (en) * 2020-03-12 2020-07-10 山东师范大学 Personalized course recommendation method and system based on eye movement technology and deep learning
CN112148975A (en) * 2020-09-21 2020-12-29 北京百度网讯科技有限公司 Session recommendation method, device and equipment
CN112948681A (en) * 2021-03-12 2021-06-11 北京交通大学 Time series data recommendation method fusing multi-dimensional features
CN114240555A (en) * 2021-12-17 2022-03-25 北京沃东天骏信息技术有限公司 Click rate prediction model training method and device and click rate prediction method and device
CN114707070A (en) * 2022-04-12 2022-07-05 华为技术有限公司 User behavior prediction method and related equipment thereof

Also Published As

Publication number Publication date
CN114707070A (en) 2022-07-05

Similar Documents

Publication Publication Date Title
US20210012198A1 (en) Method for training deep neural network and apparatus
WO2021047593A1 (en) Method for training recommendation model, and method and apparatus for predicting selection probability
CN111797893B (en) Neural network training method, image classification system and related equipment
EP4250189A1 (en) Model training method, data processing method and apparatus
EP4209965A1 (en) Data processing method and related device
US20230153615A1 (en) Neural network distillation method and apparatus
WO2023221928A1 (en) Recommendation method and apparatus, and training method and apparatus
WO2022016556A1 (en) Neural network distillation method and apparatus
US20230117973A1 (en) Data processing method and apparatus
WO2021129668A1 (en) Neural network training method and device
WO2024001806A1 (en) Data valuation method based on federated learning and related device therefor
WO2023185925A1 (en) Data processing method and related apparatus
WO2022111387A1 (en) Data processing method and related apparatus
WO2024002167A1 (en) Operation prediction method and related apparatus
WO2024083121A1 (en) Data processing method and apparatus
WO2023231753A1 (en) Neural network training method, data processing method, and device
WO2023197910A1 (en) User behavior prediction method and related device thereof
WO2023050143A1 (en) Recommendation model training method and apparatus
WO2023246735A1 (en) Item recommendation method and related device therefor
WO2024046144A1 (en) Video processing method and related device thereof
WO2021136058A1 (en) Video processing method and device
CN113627421A (en) Image processing method, model training method and related equipment
WO2023197857A1 (en) Model partitioning method and related device thereof
WO2024012360A1 (en) Data processing method and related apparatus
WO2023185541A1 (en) Model training method and related device

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 23787555

Country of ref document: EP

Kind code of ref document: A1