CN112800097A - Special topic recommendation method and device based on deep interest network - Google Patents

Special topic recommendation method and device based on deep interest network Download PDF

Info

Publication number
CN112800097A
CN112800097A CN202110054841.0A CN202110054841A CN112800097A CN 112800097 A CN112800097 A CN 112800097A CN 202110054841 A CN202110054841 A CN 202110054841A CN 112800097 A CN112800097 A CN 112800097A
Authority
CN
China
Prior art keywords
user
information
article
deep interest
thematic
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202110054841.0A
Other languages
Chinese (zh)
Inventor
刘志杰
陈鑫晶
蔡淇森
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gaoding Xiamen Technology Co Ltd
Original Assignee
Gaoding Xiamen Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Gaoding Xiamen Technology Co Ltd filed Critical Gaoding Xiamen Technology Co Ltd
Priority to CN202110961484.6A priority Critical patent/CN113688167A/en
Priority to CN202110054841.0A priority patent/CN112800097A/en
Publication of CN112800097A publication Critical patent/CN112800097A/en
Priority to PCT/CN2021/099766 priority patent/WO2022151649A1/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/20Information retrieval; Database structures therefor; File system structures therefor of structured data, e.g. relational data
    • G06F16/24Querying
    • G06F16/245Query processing
    • G06F16/2457Query processing with adaptation to user needs
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • G06F18/232Non-hierarchical techniques
    • G06F18/2321Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions
    • G06F18/23213Non-hierarchical techniques using statistics or function optimisation, e.g. modelling of probability density functions with fixed number of clusters, e.g. K-means clustering
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • G06F18/2411Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches based on the proximity to a decision surface, e.g. support vector machines

Landscapes

  • Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Probability & Statistics with Applications (AREA)
  • Computational Linguistics (AREA)
  • Databases & Information Systems (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a topic recommendation method, medium, equipment and device based on a deep interest network, wherein the method comprises the following steps: acquiring user information and historical click data of a user, and generating training data; training to obtain a deep interest capture model; acquiring article information corresponding to articles, outputting corresponding article vectors through a deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article; acquiring click data to be analyzed of a user, and outputting a corresponding user vector through a deep interest capture model; performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user; the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce manpower and material resources required to be consumed in the process of recommending the special topics.

Description

Special topic recommendation method and device based on deep interest network
Technical Field
The invention relates to the technical field of deep learning, in particular to a topic recommendation method based on a deep learning network, a computer readable storage medium, computer equipment and a topic recommendation device based on a deep interest network.
Background
In the related technology, when a corresponding topic is required to be recommended to a user, an image mode is mostly adopted; that is, first, the preference scores of the user for different topics are counted based on rules; then, preferentially displaying the special subject under the category most preferred by the user to complete the recommendation of the special subject; however, the dependency of the method on the label corresponding to the topic is strong, and in order to improve the accuracy of the topic recommendation, a large amount of manpower and material resources are inevitably required to be consumed to establish a high-quality label.
Disclosure of Invention
The present invention is directed to solving, at least to some extent, one of the technical problems in the art described above. Therefore, one object of the present invention is to provide a topic recommendation method based on a deep interest network, which can accurately recommend topics to a user without establishing labels corresponding to the topics, and reduce manpower and material resources required in the topic recommendation process.
A second object of the invention is to propose a computer-readable storage medium.
A third object of the invention is to propose a computer device.
The fourth purpose of the invention is to provide a special topic recommendation device based on the deep interest network.
In order to achieve the above object, an embodiment of a first aspect of the present invention provides a topic recommendation method based on a deep interest network, including the following steps: acquiring user information and historical click data of a user, and generating training data according to the user information and the historical click data; performing model training according to the training data to obtain a deep interest capture model; acquiring article information corresponding to articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article; acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capture model, and outputting a corresponding user vector through the deep interest capture model; and performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user.
According to the special topic recommendation method based on the deep interest network, firstly, user information and historical click data of a user are obtained, and training data are generated according to the user information and the historical click data; then, performing model training according to the training data to obtain a deep interest capture model; then, acquiring article information corresponding to articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article; then, click data to be analyzed of a user are obtained, the click data to be analyzed are input into the deep interest capture model, and a corresponding user vector is output through the deep interest capture model; then, carrying out similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user; therefore, the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce the manpower and material resources required to be consumed in the process of recommending the special topics.
In addition, the topic recommendation method based on the deep interest network proposed by the above embodiment of the present invention may further have the following additional technical features:
optionally, the historical click data of the user includes item information, time information, and ranking information between historical click behaviors, where the item information corresponds to each historical click behavior of the user.
Optionally, the training data comprises discrete type features, continuous type features and sequence features; the discrete features comprise time information, user attribute information and article classification information, the continuous features comprise historical click article classification statistical information of users, and the sequence features comprise article information sequences corresponding to historical click behaviors of users.
Optionally, the training data further includes a sample time characteristic, where generating training data according to the user information and the historical click data includes: generating a training sample according to the user information and the historical click data, calculating a time difference between the training sample and the current time, and judging whether the time difference is greater than a preset time threshold value so as to take a judgment result as a sample time characteristic.
Optionally, generating training data according to the user information and the historical click data includes: counting the clicked times corresponding to each article, determining the negative sample selection probability corresponding to each article according to the counting result, and randomly selecting the negative sample according to the negative sample selection probability corresponding to each article.
Optionally, determining a topic recommendation list according to the search result includes: clustering the topics according to a kmeas clustering algorithm to generate a plurality of topic categories; and generating a special subject list to be recommended according to the retrieval result, and scattering the special subject list to be recommended according to the plurality of special subject categories and the sliding window scattering method to generate a final special subject recommendation list.
In order to achieve the above object, a second aspect of the present invention provides a computer-readable storage medium, on which a deep interest network based topic recommendation program is stored, where the deep interest network based topic recommendation program, when executed by a processor, implements the deep interest network based topic recommendation method as described above.
According to the computer-readable storage medium of the embodiment of the invention, the deep interest network-based topic recommendation program is stored, so that the processor can realize the deep interest network-based topic recommendation method when executing the deep interest network-based topic recommendation program, and therefore, the topic recommendation can be accurately performed on the user on the premise of not establishing a label corresponding to the topic, and the manpower and material resources required to be consumed in the topic recommendation process are reduced.
In order to achieve the above object, a third embodiment of the present invention provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the processor implements the method for recommending topics based on a deep interest network as described above.
According to the computer equipment provided by the embodiment of the invention, the memory is used for storing the topic recommendation program based on the deep interest network, so that the processor can realize the topic recommendation method based on the deep interest network when executing the topic recommendation program based on the deep interest network, thereby accurately recommending topics to users on the premise of not establishing labels corresponding to the topics, and reducing manpower and material resources required to be consumed in the topic recommendation process.
In order to achieve the above object, a fourth aspect of the present invention provides a topic recommendation device based on a deep interest network, including: the acquisition module is used for acquiring user information and historical click data of a user and generating training data according to the user information and the historical click data; the training module is used for carrying out model training according to the training data to obtain a deep interest capture model; the interest capturing module is used for acquiring article information corresponding to articles, inputting the article information into the deep interest capturing model, outputting corresponding article vectors through the deep interest capturing model, and calculating thematic vectors according to the article vectors corresponding to the articles; the interest capturing module is further used for acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capturing model, and outputting a corresponding user vector through the deep interest capturing model; and the recommendation module is used for performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user.
According to the special topic recommendation device based on the deep interest network, the acquisition module is arranged to acquire user information and historical click data of a user, and training data is generated according to the user information and the historical click data; the training module is used for carrying out model training according to the training data to obtain a deep interest capture model; the interest capturing module is used for acquiring article information corresponding to articles, inputting the article information into the deep interest capturing model, outputting corresponding article vectors through the deep interest capturing model, and calculating thematic vectors according to the article vectors corresponding to the articles; the interest capturing module is further used for acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capturing model, and outputting a corresponding user vector through the deep interest capturing model; the recommendation module is used for carrying out similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result and pushing the thematic recommendation list to the user; therefore, the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce the manpower and material resources required to be consumed in the process of recommending the special topics.
In addition, the topic recommendation device based on the deep interest network according to the above embodiment of the present invention may further have the following additional technical features:
optionally, the historical click data of the user includes item information, time information, and ranking information between historical click behaviors, where the item information corresponds to each historical click behavior of the user.
Drawings
FIG. 1 is a flowchart illustrating a topic recommendation method based on a deep interest network according to an embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a deep interest capture model according to an embodiment of the present invention;
fig. 3 is a block diagram of a topic recommendation apparatus based on a deep interest network according to an embodiment of the present invention.
Detailed Description
Reference will now be made in detail to embodiments of the present invention, examples of which are illustrated in the accompanying drawings, wherein like or similar reference numerals refer to the same or similar elements or elements having the same or similar function throughout. The embodiments described below with reference to the drawings are illustrative and intended to be illustrative of the invention and are not to be construed as limiting the invention.
In the related art, when a topic is recommended, the dependency on a label corresponding to the topic is strong, and in order to improve the accuracy of topic recommendation, a large amount of manpower and material resources are inevitably required to be consumed to establish a high-quality label; according to the special topic recommendation method based on the deep interest network, firstly, user information and historical click data of a user are obtained, and training data are generated according to the user information and the historical click data; then, performing model training according to the training data to obtain a deep interest capture model; then, acquiring article information corresponding to articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article; then, click data to be analyzed of a user are obtained, the click data to be analyzed are input into the deep interest capture model, and a corresponding user vector is output through the deep interest capture model; then, carrying out similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user; therefore, the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce the manpower and material resources required to be consumed in the process of recommending the special topics.
In order to better understand the above technical solutions, exemplary embodiments of the present invention will be described in more detail below with reference to the accompanying drawings. While exemplary embodiments of the invention are shown in the drawings, it should be understood that the invention can be embodied in various forms and should not be limited to the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the invention to those skilled in the art.
In order to better understand the technical solution, the technical solution will be described in detail with reference to the drawings and the specific embodiments.
Fig. 1 is a schematic flowchart of a topic recommendation method based on a deep interest network according to an embodiment of the present invention, and as shown in fig. 1, the topic recommendation method based on the deep interest network includes the following steps:
s101, obtaining user information and historical click data of a user, and generating training data according to the user information and the historical click data.
The selection mode of the historical click data of the user can be various.
As an example, the historical click data of the user includes an exposure log and a click behavior log of the user, where the exposure log records whether a certain item is exposed to a certain user on the same day, and the click behavior log records information corresponding to the click behavior of the user.
As another example, the historical click data of the user may include item information, time information, and ranking information between historical click behaviors corresponding to each historical click behavior of the user. That is, the historical click data of the user only includes information corresponding to the user click behavior, but does not include exposure information; it should be noted that in an actual scene, because different pages have different depths, the click rate difference of the pages with different depths is large, and the click rate of the page with deeper access depth is often higher; therefore, in order to avoid the influence of the difference in click rate caused by the pages of different depths, the exposure information is not included in the historical click data.
The training data may be set in various ways.
As an example, the training data includes discrete type features, continuous type features, and sequence features; the discrete type features comprise time information, user attribute information and article classification information, the continuous type features comprise historical click article classification statistical information of users, and the sequence features comprise article information sequences corresponding to historical click behaviors of users. Specifically, the discrete features include date information (for example, day of the week, whether it is a work day, whether it is a work period, etc.), an article ID, an article classification ID, an article attribute ID, and the like, and it should be noted that, according to the less characteristic of discrete feature classification, encoding processing may be performed by ONE-HOT; the continuous type features comprise the click times of the user history to different attributes, and the continuous type values can be directly input without processing the continuous type features; the sequence features comprise a user historical click item ID sequence and a historical click classification ID sequence.
In some embodiments, the training data further includes a sample time signature, wherein generating the training data from the user information and the historical click data comprises: generating a training sample according to the user information and the historical click data, calculating a time difference between the training sample and the current time, and judging whether the time difference is greater than a preset time threshold value so as to take a judgment result as a sample time characteristic.
It can be understood that, because the training data includes time information, that is, the context characteristics include the day of the week, whether the day of the work, etc., in the training process of the model, the training samples are scattered sufficiently in time, and the training process is not stable enough; at the moment, due to the influence of time factors, the sample can have great influence on the model, and the closer the sample is to the testing date, the greater the effect of the sample is; therefore, the sample time characteristic is increased to ensure the stability in the model training process.
In some embodiments, generating training data from the user information and the historical click data includes:
counting the clicked times corresponding to each article, determining the negative sample selection probability corresponding to each article according to the counting result, and randomly selecting the negative sample according to the negative sample selection probability corresponding to each article. It can be understood that the selection of the negative sample is required in the training process to smoothly train the model. The negative sample can be selected in various ways. For example, a preset number of negative samples are directly selected from the positive samples in a random manner; preferably, statistics of the number of clicks corresponding to each item can be performed in the above manner to determine the probability that the item is selected as a negative sample; thus, the more popular items are made to have a greater probability of being selected as negative samples, resulting in a higher accuracy of the final trained model.
In some embodiments, to avoid the problem of excessive computation of the softmax function, the sequence features are set in the form of two classes; that is, when the input item ID is the item ID clicked next by the user, the tag is 1, otherwise, the tag is 0. Specifically, assuming that the item sequence clicked by the user is [1,2,3,4,5,6], the sequence feature structure is shown in table 1:
Figure BDA0002900227030000061
TABLE 1
And S102, performing model training according to the training data to obtain a deep interest capture model.
For convenience of understanding, taking fig. 2 as an example, fig. 2 is a schematic structural diagram of a deep interest capture model according to an embodiment of the present invention; as shown in fig. 2, in this embodiment, the sequence feature, the discrete feature, and the continuous feature are spliced, and then pass through the BatchNormalization layer after being spliced, and then are input into the multilayer full-link layer, and each layer of the full-link layer is then connected with the BatchNormalization layer and the Dice activation function, so as to obtain the user vector finally.
In some embodiments, the model is trained using an Adagrad optimizer with an initial learning rate of 0.1, which decays to 1/2 per 50000 steps and a Batch size of 128. And in order to make the model training more stable, adding an L2 regularization parameter to both an Embedding layer and a DNN layer, and adding regularization loss to a loss function together for optimization.
S103, acquiring article information corresponding to the articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to the articles.
It can be understood that each topic contains different numbers of articles, and the topic is a set of articles in the same classification; for example, if the topic is sports, the item corresponding to the topic may include: football, basketball, swimming, etc. The mode of calculating the thematic vector according to the article vector corresponding to each article can be various; for example, after the item vector is obtained, the item vectors of all items under the topic are averaged and pooled, so that the pooled result is used as the topic vector of the topic.
S104, acquiring click data to be analyzed of the user, inputting the click data to be analyzed into the deep interest capture model, and outputting a corresponding user vector through the deep interest capture model.
And S105, performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user.
In some embodiments, determining the topic recommendation list according to the retrieval result includes: clustering the topics according to a kmeas clustering algorithm to generate a plurality of topic categories; and generating a special subject list to be recommended according to the retrieval result, and scattering the special subject list to be recommended according to a plurality of special subject categories and a sliding window scattering method to generate a final special subject recommendation list.
It can be understood that after the topic list to be recommended is generated according to the retrieval result, a plurality of topics of the same category may appear in the topic list to be recommended under the same window, which brings bad experience to the user; therefore, in order to guarantee user experience, a to-be-recommended topic list is scattered through a sliding window scattering method and a clustering result of topics, so that the categories of the topics under the same window are different, and a final topic recommendation list is determined.
Specifically, as shown in table 2:
Figure BDA0002900227030000071
TABLE 2
As shown in Table 2, if the topic order obtained by user 001 is 1|2|3|4|5|6|7|8|9, the topic category sequence is A | A | A | B | C | B | D | D. Assuming that the size of the sliding window is 3, it means that the topic categories placed in adjacent 3 locations do not overlap. Then:
firstly, if the categories in the first sliding window are A, A and A, and the position index is counted from 0, the list of the 1 st and the 2 nd positions needs to be scattered, the 3 rd position is traversed backwards, if the first different category is B, the A of the 1 st position and the B of the 3 rd position are exchanged, the thematic category sequence is changed into A | B | A | A | C | B | D | D, and then the 1 st and the 3 rd positions of the thematic id sequence are exchanged.
Second, the category in the first sliding window is changed to A, B, A, then the 2 nd position needs processing, and it goes through from the 4 th position, the first different category is C of the 4 th position, so the position 2 and the position 4 are exchanged, then the topic Id sequence is changed to A | B | C | A | B | B | B | D | D, and the topic Id sequence is 1|4|5|2|3|6|7|8| 9.
And thirdly, the category sequence in the second sliding window is B, C and A, the processing is not needed, the window continues to slide forwards, the third window is C, A and A, the A at the 4 th position needs to be processed and is exchanged with the position 5, the thematic ID sequence is changed into A | B | C | A | B | A | B | D | D, the thematic ID sequence is changed into 1|4|5|2|6|3|7|8|9, and the like until the sequence is completed or the broken length reaches a threshold value.
In summary, according to the topic recommendation method based on the deep interest network of the embodiment of the present invention, first, user information and historical click data of a user are obtained, and training data is generated according to the user information and the historical click data; then, performing model training according to the training data to obtain a deep interest capture model; then, acquiring article information corresponding to articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article; then, click data to be analyzed of a user are obtained, the click data to be analyzed are input into the deep interest capture model, and a corresponding user vector is output through the deep interest capture model; then, carrying out similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user; therefore, the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce the manpower and material resources required to be consumed in the process of recommending the special topics.
In order to implement the foregoing embodiments, an embodiment of the present invention provides a computer-readable storage medium, on which a deep interest network-based topic recommendation program is stored, where the deep interest network-based topic recommendation program, when executed by a processor, implements the deep interest network-based topic recommendation method as described above.
According to the computer-readable storage medium of the embodiment of the invention, the deep interest network-based topic recommendation program is stored, so that the processor can realize the deep interest network-based topic recommendation method when executing the deep interest network-based topic recommendation program, and therefore, the topic recommendation can be accurately performed on the user on the premise of not establishing a label corresponding to the topic, and the manpower and material resources required to be consumed in the topic recommendation process are reduced.
In order to implement the foregoing embodiments, an embodiment of the present invention provides a computer device, which includes a memory, a processor, and a computer program stored in the memory and executable on the processor, and when the processor executes the computer program, the method for recommending topics based on a deep interest network as described above is implemented.
According to the computer equipment provided by the embodiment of the invention, the memory is used for storing the topic recommendation program based on the deep interest network, so that the processor can realize the topic recommendation method based on the deep interest network when executing the topic recommendation program based on the deep interest network, thereby accurately recommending topics to users on the premise of not establishing labels corresponding to the topics, and reducing manpower and material resources required to be consumed in the topic recommendation process.
In order to implement the foregoing embodiments, an embodiment of the present invention provides a topic recommendation device based on a deep interest network, and as shown in fig. 3, the topic recommendation device based on the deep interest network includes: an acquisition module 10, a training module 20, an interest capture module 30, and a recommendation module 40.
The acquisition module 10 is configured to acquire user information and historical click data of a user, and generate training data according to the user information and the historical click data;
the training module 20 is configured to perform model training according to training data to obtain a deep interest capture model;
the interest capturing module 30 is configured to obtain item information corresponding to an item, input the item information into the deep interest capturing model, output a corresponding item vector through the deep interest capturing model, and calculate a thematic vector according to the item vector corresponding to each item;
the interest capturing module 30 is further configured to obtain click data to be analyzed of the user, and input the click data to be analyzed into the deep interest capturing model, so as to output a corresponding user vector through the deep interest capturing model;
the recommendation module 40 is configured to perform similarity retrieval according to the user vector and the topic vector, determine a topic recommendation list according to a retrieval result, and push the topic recommendation list to the user.
In some embodiments, the historical click data of the user includes item information corresponding to each historical click behavior of the user, time information, and ranking information between the historical click behaviors.
It should be noted that the above description about the deep interest network-based topic recommendation method in fig. 1 is also applicable to the deep interest network-based topic recommendation apparatus, and is not repeated herein.
In summary, according to the topic recommendation device based on the deep interest network in the embodiment of the present invention, the acquisition module is configured to acquire user information and historical click data of a user, and generate training data according to the user information and the historical click data; the training module is used for carrying out model training according to the training data to obtain a deep interest capture model; the interest capturing module is used for acquiring article information corresponding to articles, inputting the article information into the deep interest capturing model, outputting corresponding article vectors through the deep interest capturing model, and calculating thematic vectors according to the article vectors corresponding to the articles; the interest capturing module is further used for acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capturing model, and outputting a corresponding user vector through the deep interest capturing model; the recommendation module is used for carrying out similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result and pushing the thematic recommendation list to the user; therefore, the method and the device can accurately recommend the special topics to the user on the premise of not establishing the labels corresponding to the special topics, and reduce the manpower and material resources required to be consumed in the process of recommending the special topics.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, system, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
The present invention is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing apparatus to cause a series of operational steps to be performed on the computer or other programmable apparatus to produce a computer implemented process such that the instructions which execute on the computer or other programmable apparatus provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
It should be noted that in the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention may be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In the unit claims enumerating several means, several of these means may be embodied by one and the same item of hardware. The usage of the words first, second and third, etcetera do not indicate any ordering. These words may be interpreted as names.
While preferred embodiments of the present invention have been described, additional variations and modifications in those embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the invention.
It will be apparent to those skilled in the art that various changes and modifications may be made in the present invention without departing from the spirit and scope of the invention. Thus, if such modifications and variations of the present invention fall within the scope of the claims of the present invention and their equivalents, the present invention is also intended to include such modifications and variations.
In the description of the present invention, it is to be understood that the terms "first", "second" and the like are used for descriptive purposes only and are not to be construed as indicating or implying relative importance or implying any number of technical features indicated. Thus, a feature defined as "first" or "second" may explicitly or implicitly include one or more of that feature. In the description of the present invention, "a plurality" means two or more unless specifically defined otherwise.
In the present invention, unless otherwise expressly stated or limited, the terms "mounted," "connected," "secured," and the like are to be construed broadly and can, for example, be fixedly connected, detachably connected, or integrally formed; can be mechanically or electrically connected; either directly or indirectly through intervening media, either internally or in any other relationship. The specific meanings of the above terms in the present invention can be understood by those skilled in the art according to specific situations.
In the present invention, unless otherwise expressly stated or limited, the first feature "on" or "under" the second feature may be directly contacting the first and second features or indirectly contacting the first and second features through an intermediate. Also, a first feature "on," "over," and "above" a second feature may be directly or diagonally above the second feature, or may simply indicate that the first feature is at a higher level than the second feature. A first feature being "under," "below," and "beneath" a second feature may be directly under or obliquely under the first feature, or may simply mean that the first feature is at a lesser elevation than the second feature.
In the description herein, references to the description of the term "one embodiment," "some embodiments," "an example," "a specific example," or "some examples," etc., mean that a particular feature, structure, material, or characteristic described in connection with the embodiment or example is included in at least one embodiment or example of the invention. In this specification, the schematic representations of the terms used above should not be understood to necessarily refer to the same embodiment or example. Furthermore, the particular features, structures, materials, or characteristics described may be combined in any suitable manner in any one or more embodiments or examples. Furthermore, various embodiments or examples and features of different embodiments or examples described in this specification can be combined and combined by one skilled in the art without contradiction.
Although embodiments of the present invention have been shown and described above, it is understood that the above embodiments are exemplary and should not be construed as limiting the present invention, and that variations, modifications, substitutions and alterations can be made to the above embodiments by those of ordinary skill in the art within the scope of the present invention.

Claims (10)

1. A topic recommendation method based on a deep interest network is characterized by comprising the following steps:
acquiring user information and historical click data of a user, and generating training data according to the user information and the historical click data;
performing model training according to the training data to obtain a deep interest capture model;
acquiring article information corresponding to articles, inputting the article information into the deep interest capture model, outputting corresponding article vectors through the deep interest capture model, and calculating thematic vectors according to the article vectors corresponding to each article;
acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capture model, and outputting a corresponding user vector through the deep interest capture model;
and performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user.
2. The deep interest network-based topic recommendation method according to claim 1, wherein the historical click data of the user comprises item information, time information and ranking information among the historical click behaviors corresponding to each historical click behavior of the user.
3. The deep interest network-based topic recommendation method of claim 1, wherein the training data comprises discrete features, continuous features and sequence features;
the discrete features comprise time information, user attribute information and article classification information, the continuous features comprise historical click article classification statistical information of users, and the sequence features comprise article information sequences corresponding to historical click behaviors of users.
4. The deep interest network-based topic recommendation method of claim 3, wherein the training data further comprises a sample time feature, wherein generating training data from the user information and the historical click data comprises:
generating a training sample according to the user information and the historical click data, calculating a time difference between the training sample and the current time, and judging whether the time difference is greater than a preset time threshold value so as to take a judgment result as a sample time characteristic.
5. The deep interest network-based topic recommendation method according to any one of claims 1-4, wherein generating training data according to the user information and the historical click data comprises:
counting the clicked times corresponding to each article, determining the negative sample selection probability corresponding to each article according to the counting result, and randomly selecting the negative sample according to the negative sample selection probability corresponding to each article.
6. The topic recommendation method based on the deep interest network of any one of claims 1 to 4, wherein determining a topic recommendation list according to the search result comprises:
clustering the topics according to a kmeas clustering algorithm to generate a plurality of topic categories;
and generating a special subject list to be recommended according to the retrieval result, and scattering the special subject list to be recommended according to the plurality of special subject categories and the sliding window scattering method to generate a final special subject recommendation list.
7. A computer-readable storage medium, on which a deep interest network-based topic recommendation program is stored, which, when executed by a processor, implements the deep interest network-based topic recommendation method according to any one of claims 1 to 6.
8. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, wherein the processor, when executing the program, implements the deep interest network based topic recommendation method according to any one of claims 1-6.
9. A topic recommendation device based on a deep interest network is characterized by comprising:
the acquisition module is used for acquiring user information and historical click data of a user and generating training data according to the user information and the historical click data;
the training module is used for carrying out model training according to the training data to obtain a deep interest capture model;
the interest capturing module is used for acquiring article information corresponding to articles, inputting the article information into the deep interest capturing model, outputting corresponding article vectors through the deep interest capturing model, and calculating thematic vectors according to the article vectors corresponding to the articles;
the interest capturing module is further used for acquiring click data to be analyzed of a user, inputting the click data to be analyzed into the deep interest capturing model, and outputting a corresponding user vector through the deep interest capturing model;
and the recommendation module is used for performing similarity retrieval according to the user vector and the thematic vector, determining a thematic recommendation list according to a retrieval result, and pushing the thematic recommendation list to the user.
10. The deep interest network-based topic recommendation device of claim 9, wherein the historical click data of the user comprises item information corresponding to each historical click behavior of the user, time information and ranking information among the historical click behaviors.
CN202110054841.0A 2021-01-15 2021-01-15 Special topic recommendation method and device based on deep interest network Pending CN112800097A (en)

Priority Applications (3)

Application Number Priority Date Filing Date Title
CN202110961484.6A CN113688167A (en) 2021-01-15 2021-01-15 Deep interest capture model construction method and device based on deep interest network
CN202110054841.0A CN112800097A (en) 2021-01-15 2021-01-15 Special topic recommendation method and device based on deep interest network
PCT/CN2021/099766 WO2022151649A1 (en) 2021-01-15 2021-06-11 Deep interest network-based topic recommendation method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110054841.0A CN112800097A (en) 2021-01-15 2021-01-15 Special topic recommendation method and device based on deep interest network

Related Child Applications (1)

Application Number Title Priority Date Filing Date
CN202110961484.6A Division CN113688167A (en) 2021-01-15 2021-01-15 Deep interest capture model construction method and device based on deep interest network

Publications (1)

Publication Number Publication Date
CN112800097A true CN112800097A (en) 2021-05-14

Family

ID=75809601

Family Applications (2)

Application Number Title Priority Date Filing Date
CN202110054841.0A Pending CN112800097A (en) 2021-01-15 2021-01-15 Special topic recommendation method and device based on deep interest network
CN202110961484.6A Pending CN113688167A (en) 2021-01-15 2021-01-15 Deep interest capture model construction method and device based on deep interest network

Family Applications After (1)

Application Number Title Priority Date Filing Date
CN202110961484.6A Pending CN113688167A (en) 2021-01-15 2021-01-15 Deep interest capture model construction method and device based on deep interest network

Country Status (2)

Country Link
CN (2) CN112800097A (en)
WO (1) WO2022151649A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113657975A (en) * 2021-09-03 2021-11-16 广州微行网络科技有限公司 Marketing method and system based on Internet E-commerce live broadcast platform
CN114187036A (en) * 2021-11-30 2022-03-15 深圳市喂车科技有限公司 Internet advertisement intelligent recommendation management system based on behavior characteristic recognition
CN114218476A (en) * 2021-11-12 2022-03-22 深圳前海鹏影数字软件运营有限公司 Content recommendation method and device and terminal equipment
WO2022151649A1 (en) * 2021-01-15 2022-07-21 稿定(厦门)科技有限公司 Deep interest network-based topic recommendation method and apparatus
WO2023082993A1 (en) * 2021-11-11 2023-05-19 北京沃东天骏信息技术有限公司 Information recommendation method, apparatus and system

Families Citing this family (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114567811B (en) * 2022-02-28 2024-02-09 广州欢聊网络科技有限公司 Multi-modal model training method, system and related equipment for voice sequencing
CN115587261B (en) * 2022-12-09 2023-04-07 思创数码科技股份有限公司 Government affair resource catalog recommendation method and system
CN115828107B (en) * 2023-01-09 2023-05-05 深圳市云积分科技有限公司 Model training method and device based on offline environment
CN115952359A (en) * 2023-03-10 2023-04-11 特斯联科技集团有限公司 Recommendation system recall method and device, electronic equipment and storage medium
CN116385048B (en) * 2023-06-06 2023-08-22 山东政信大数据科技有限责任公司 Intelligent marketing method and system for agricultural products
CN116521908B (en) * 2023-06-28 2024-01-09 图林科技(深圳)有限公司 Multimedia content personalized recommendation method based on artificial intelligence
CN117493677B (en) * 2023-11-10 2024-05-28 成达文化科技(广州)有限公司 Personalized search information recommendation system and method based on user portraits

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162700A (en) * 2019-04-23 2019-08-23 腾讯科技(深圳)有限公司 The training method of information recommendation and model, device, equipment and storage medium
CN111125521A (en) * 2019-12-13 2020-05-08 上海喜马拉雅科技有限公司 Information recommendation method, device, equipment and storage medium
CN111368210A (en) * 2020-05-27 2020-07-03 腾讯科技(深圳)有限公司 Information recommendation method and device based on artificial intelligence and electronic equipment
CN111651669A (en) * 2020-05-20 2020-09-11 拉扎斯网络科技(上海)有限公司 Information recommendation method and device, electronic equipment and computer-readable storage medium

Family Cites Families (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11269891B2 (en) * 2014-08-21 2022-03-08 Affectomatics Ltd. Crowd-based scores for experiences from measurements of affective response
US10404813B2 (en) * 2016-09-14 2019-09-03 Oath Inc. Baseline interest profile for recommendations using a geographic location
CN108182621A (en) * 2017-12-07 2018-06-19 合肥美的智能科技有限公司 The Method of Commodity Recommendation and device for recommending the commodity, equipment and storage medium
CN111767459A (en) * 2019-10-16 2020-10-13 北京京东尚科信息技术有限公司 Item recommendation method and device
CN111046285B (en) * 2019-12-11 2023-04-25 拉扎斯网络科技(上海)有限公司 Recommendation ordering determining method, device, server and storage medium
CN111310056B (en) * 2020-03-11 2023-04-14 深圳市雅阅科技有限公司 Information recommendation method, device, equipment and storage medium based on artificial intelligence
CN111737578B (en) * 2020-06-22 2024-04-02 陕西师范大学 Recommendation method and system
CN112800097A (en) * 2021-01-15 2021-05-14 稿定(厦门)科技有限公司 Special topic recommendation method and device based on deep interest network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110162700A (en) * 2019-04-23 2019-08-23 腾讯科技(深圳)有限公司 The training method of information recommendation and model, device, equipment and storage medium
CN111125521A (en) * 2019-12-13 2020-05-08 上海喜马拉雅科技有限公司 Information recommendation method, device, equipment and storage medium
CN111651669A (en) * 2020-05-20 2020-09-11 拉扎斯网络科技(上海)有限公司 Information recommendation method and device, electronic equipment and computer-readable storage medium
CN111368210A (en) * 2020-05-27 2020-07-03 腾讯科技(深圳)有限公司 Information recommendation method and device based on artificial intelligence and electronic equipment

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
裔隽 等: "《Python机器学习实战》", 31 January 2018, pages: 145 - 146 *
闲鱼技术-华采: "打散算法的三种解决方案及其选型场景", 《HTTPS://ZHUANLAN.ZHIHU.COM/P/191017039》 *
闲鱼技术-华采: "打散算法的三种解决方案及其选型场景", 《HTTPS://ZHUANLAN.ZHIHU.COM/P/191017039》, 20 August 2020 (2020-08-20), pages 1 - 5 *

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2022151649A1 (en) * 2021-01-15 2022-07-21 稿定(厦门)科技有限公司 Deep interest network-based topic recommendation method and apparatus
CN113657975A (en) * 2021-09-03 2021-11-16 广州微行网络科技有限公司 Marketing method and system based on Internet E-commerce live broadcast platform
CN113657975B (en) * 2021-09-03 2024-03-26 西安稻叶山供应链管理有限公司 Marketing method and system based on Internet E-commerce live broadcast platform
WO2023082993A1 (en) * 2021-11-11 2023-05-19 北京沃东天骏信息技术有限公司 Information recommendation method, apparatus and system
CN114218476A (en) * 2021-11-12 2022-03-22 深圳前海鹏影数字软件运营有限公司 Content recommendation method and device and terminal equipment
CN114187036A (en) * 2021-11-30 2022-03-15 深圳市喂车科技有限公司 Internet advertisement intelligent recommendation management system based on behavior characteristic recognition
CN114187036B (en) * 2021-11-30 2022-10-11 深圳市喂车科技有限公司 Internet advertisement intelligent recommendation management system based on behavior characteristic recognition

Also Published As

Publication number Publication date
CN113688167A (en) 2021-11-23
WO2022151649A1 (en) 2022-07-21

Similar Documents

Publication Publication Date Title
CN112800097A (en) Special topic recommendation method and device based on deep interest network
CN110222267B (en) Game platform information pushing method, system, storage medium and equipment
CN110149540B (en) Recommendation processing method and device for multimedia resources, terminal and readable medium
CN106503025B (en) Application recommendation method and system
CN111126495B (en) Model training method, information prediction device, storage medium and equipment
KR20180011221A (en) Select representative video frames for videos
CN106446078A (en) Information recommendation method and recommendation apparatus
CN109460512A (en) Recommendation information processing method, device, equipment and storage medium
CN110737859A (en) UP main matching method and device
CN109902823B (en) Model training method and device based on generation countermeasure network
CN108376164B (en) Display method and device of potential anchor
CN111783712A (en) Video processing method, device, equipment and medium
CN111737558A (en) Information recommendation method and device and computer readable storage medium
CN104636407A (en) Parameter choice training and search request processing method and device
CN110458600A (en) Portrait model training method, device, computer equipment and storage medium
CN112163614A (en) Anchor classification method and device, electronic equipment and storage medium
Carlier et al. Assessment of crowdsourcing and gamification loss in user-assisted object segmentation
CN113918806A (en) Method for automatically recommending training courses and related equipment
CN111581435A (en) Video cover image generation method and device, electronic equipment and storage medium
CN109308332A (en) A kind of target user's acquisition methods, device and server
CN109460474B (en) User preference trend mining method
CN111639485A (en) Course recommendation method based on text similarity and related equipment
CN111985616B (en) Image feature extraction method, image retrieval method, device and equipment
CN112749313B (en) Label labeling method, label labeling device, computer equipment and storage medium
CN115422346A (en) Knowledge graph-based multi-turn topic recommendation method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20220224

Address after: 102400 zone 1, Guba Road, Chengguan Street, Fangshan District, Beijing - c337

Applicant after: Wanzhongxiang (Beijing) Technology Co.,Ltd.

Address before: G15-2, 3rd floor, No.10, Punan 1st Road, Dongpu Road, Siming District, Xiamen City, Fujian Province

Applicant before: GAODING (XIAMEN) TECHNOLOGY Co.,Ltd.

TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20220627

Address after: G15-2, 3rd floor, No.10, Punan 1st Road, Dongpu Road, Siming District, Xiamen City, Fujian Province

Applicant after: GAODING (XIAMEN) TECHNOLOGY Co.,Ltd.

Address before: 102400 zone 1, Guba Road, Chengguan Street, Fangshan District, Beijing - c337

Applicant before: Wanzhongxiang (Beijing) Technology Co.,Ltd.

TA01 Transfer of patent application right
RJ01 Rejection of invention patent application after publication

Application publication date: 20210514

RJ01 Rejection of invention patent application after publication