WO2022148038A1 - Information recommendation method and device - Google Patents

Information recommendation method and device Download PDF

Info

Publication number
WO2022148038A1
WO2022148038A1 PCT/CN2021/116964 CN2021116964W WO2022148038A1 WO 2022148038 A1 WO2022148038 A1 WO 2022148038A1 CN 2021116964 W CN2021116964 W CN 2021116964W WO 2022148038 A1 WO2022148038 A1 WO 2022148038A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
behavior data
recommendation
sample
target
Prior art date
Application number
PCT/CN2021/116964
Other languages
French (fr)
Chinese (zh)
Inventor
牛亚男
冷德维
惠轶群
宋洋
Original Assignee
北京达佳互联信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京达佳互联信息技术有限公司 filed Critical 北京达佳互联信息技术有限公司
Publication of WO2022148038A1 publication Critical patent/WO2022148038A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present disclosure is based on a Chinese patent application with an application date of January 11, 2021 and an application number of 202110034339.3, and claims the priority of the Chinese patent application.
  • the disclosure of the above Chinese patent application is hereby incorporated by reference in its entirety as a part of this disclosure.
  • the present disclosure relates to the field of Internet technologies, and in particular, to an information recommendation method, apparatus, electronic device, and storage medium.
  • the present disclosure provides an information recommendation method, device, electronic device and storage medium.
  • the technical solutions of the present disclosure are as follows:
  • an information recommendation method including:
  • the target object In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
  • the target information in the recommendation information set is recommended to the target object based on the interest index.
  • determining the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information includes:
  • target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
  • acquiring the long-term behavior data set of the target object includes:
  • Receive the information acquisition request of the target object and the information acquisition request includes the target object identification of the target object;
  • a long-term behavior data set of the target object is acquired based on the target object identifier.
  • acquiring the long-term behavior data set of the target object based on the object identifier includes:
  • the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between the object identifiers of the multiple objects and the corresponding historical behavior data;
  • the corresponding historical behavior data is used as the long-term behavior data set of the target object.
  • the determining the characteristic information of each recommendation information in the recommendation information set includes:
  • the feature information of each recommendation information is generated based on the text information of each recommendation information.
  • the recommending the target information in the recommendation information set to the target object based on the interest index includes:
  • the target information is recommended to the target object.
  • the method further includes:
  • the neural network is trained based on the target loss to obtain the interest recognition network.
  • the method further includes:
  • the sample behavior data corresponding to each sample recommendation information determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information includes:
  • the sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  • an information recommendation apparatus including:
  • the first behavior data set acquisition module is configured to obtain a long-term behavior data set of the target object in response to an information acquisition request of the target object, and the long-term behavior data set represents the number of the target object in a preset time period.
  • historical behavioral data
  • a feature information determination module configured to determine feature information of each recommendation information in the recommendation information set
  • a target behavior data determination module configured to determine target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information
  • a first interest identification module configured to input the target behavior data into an interest identification network for interest identification, and obtain an interest index of the target object for each recommendation information;
  • An information recommendation module configured to recommend target information in the recommended information set to the target object based on the interest index.
  • the target behavior data determination module includes:
  • a behavior feature information acquisition unit configured to acquire behavior feature information of each historical behavior data in the long-term behavior data set
  • a first similarity calculation unit configured to calculate a first similarity between the feature information of each recommendation information and the behavior feature information
  • the target behavior data determining unit is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set according to the first similarity.
  • the first behavior data set acquisition module includes:
  • an information acquisition request receiving unit configured to receive an information acquisition request of the target object, where the information acquisition request includes a target object identifier of the target object;
  • the long-term behavior data set acquisition unit is configured to acquire the long-term behavior data set of the target object based on the target object identifier.
  • the long-term behavior data set acquisition unit includes:
  • a historical behavior database acquisition unit configured to acquire a pre-built historical behavior database, the historical behavior database includes historical behavior data of a plurality of objects, and a mapping relationship between the object identifiers of the plurality of objects and the corresponding historical behavior data ;
  • a historical behavior data acquisition unit configured to acquire historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship
  • the long-term behavior data set determination unit is configured to use the corresponding historical behavior data as the long-term behavior data set of the target object.
  • the feature information determination module includes:
  • a first topic tag acquiring unit configured to acquire the topic tag of each of the recommended information
  • a first feature information generating unit configured to generate feature information of each recommendation information based on the subject tag of each recommendation information
  • a first text information extraction unit configured to extract the text information of each recommended information
  • the second feature information generating unit is configured to generate feature information of each recommendation information based on the text information of each recommendation information.
  • the information recommendation module includes:
  • a target information determination unit configured to determine target information from the recommended information set according to the interest index
  • An information pushing unit configured to recommend the target information to the target object.
  • the apparatus further includes:
  • an information acquisition module configured to acquire information identifiers and sample feature information of multiple sample recommendation information
  • an object information determination module configured to determine a historical recommendation object corresponding to each sample recommendation information and an interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
  • the second behavior data set obtaining module is configured to obtain the long-term sample behavior data set of the historical recommendation object
  • a sample behavior number determination module configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information
  • the second interest identification module is configured to input the sample behavior data into the neural network for interest identification, so as to obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
  • a target loss determination module configured to determine a target loss according to the interest prediction index and the interest labeling index
  • a network training module configured to train the neural network based on the target loss to obtain the interest recognition network.
  • the apparatus further includes:
  • a recommendation time determination module configured to determine the recommendation time of the recommended information for each sample
  • a data filtering module configured to perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtering behavior data set
  • the sample behavior data determination module is further configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  • an electronic device comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions to achieve The method of any one of the first aspects above.
  • a computer-readable storage medium when instructions in the storage medium are executed by a processor of an electronic device, the electronic device can execute the first embodiment of the present disclosure.
  • a computer program product comprising instructions which, when run on a computer, cause the computer to perform any of the methods described in the first aspect of the embodiments of the present disclosure.
  • the long-term behavior data set of the target object is obtained, which ensures the comprehensiveness of the behavior data and can effectively reflect the interest and preference of the object.
  • the target behavior data used for interest recognition can effectively reflect the target object's real interest preference for recommended information, and can greatly reduce the amount of data in the process of interest recognition, effectively improve the efficiency and accuracy of interest recognition, and greatly improve the recommendation.
  • Information recommendation accuracy and recommendation performance in the system are examples of
  • FIG. 1 is a schematic diagram of an application environment according to some embodiments.
  • FIG. 2 is a flowchart of an information recommendation method according to some embodiments
  • FIG. 3 is a schematic flowchart of determining target behavior data corresponding to each recommendation information from a long-term behavior data set based on feature information of each recommendation information according to some embodiments;
  • FIG. 4 is a schematic flowchart of a pre-trained interest recognition network according to some embodiments.
  • FIG. 5 is a schematic flowchart of a pre-trained interest recognition network according to some embodiments.
  • FIG. 6 is a block diagram of an information recommendation apparatus according to some embodiments.
  • Fig. 7 is a block diagram of an electronic device for information recommendation according to some embodiments.
  • FIG. 1 is a schematic diagram of an application environment according to an exemplary embodiment.
  • the application environment may include a log database 01, a retrieval server 02, a historical behavior database 03, and a training server 04 , terminal 05 , content distribution server 06 and service server 07 .
  • the log database 01 may be used to store recommendation log information generated during the recommendation process of the recommendation system.
  • the historical behavior database 03 can be used to store long-term historical behavior data of a large number of objects in the recommender system.
  • the historical behavior database 03 uses the object identifier as the key (retrieval parameter), and the historical behavior data set corresponding to the object as the value (return data corresponding to the retrieval parameter) for data storage.
  • the retrieval server 02 may initiate a long-term behavior data set acquisition request in combination with the recommendation log information in the log database 01, and acquire long-term samples of historical recommendation objects corresponding to the sample recommendation information recommended by the recommendation system from the historical behavior database 03 Behavior data set; and send the long-term sample behavior data set and sample recommendation information to the training server 04 as training data.
  • the training server 04 may train the interest recognition network in combination with the long-term sample behavior data set and the sample recommendation information sent by the retrieval server 02 , and send the trained interest recognition network to the service server 07 .
  • the terminal 05 may provide a user-oriented information recommendation service; accordingly, the user may trigger an information acquisition request based on the terminal 05 .
  • the information acquisition request triggered by the terminal 05 may be sent to the content distribution server 06; correspondingly, the content distribution server 06 forwards the information acquisition request to the service server 07, and then the service server 07 combines the information acquisition request with the information contained in the request.
  • the target object identifier of the target object obtains the target behavior data of the target object corresponding to the recommendation information from the historical behavior database 03, and combines the interest recognition network to perform interest recognition, and then information recommendation can be performed based on the result of the interest recognition.
  • the retrieval server 02, the training server 04, the content distribution server 06, and the service server 07 may be independent physical servers, or may be server clusters or distributed systems composed of multiple physical servers, or may provide cloud services. Services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN (Content Delivery Network), and big data and artificial intelligence platforms Cloud servers for cloud computing services.
  • Cloud databases cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN (Content Delivery Network), and big data and artificial intelligence platforms Cloud servers for cloud computing services.
  • the terminal 05 may include, but is not limited to, smartphones, desktop computers, tablet computers, laptop computers, smart speakers, digital assistants, augmented reality (AR)/virtual reality (VR) devices, Electronic devices such as smart wearable devices.
  • the operating system running on the electronic device may include, but is not limited to, the Android system, the IOS system, linux, windows, and the like.
  • the log database 01 and the historical behavior database 03 may be relational databases.
  • FIG. 1 is only an application environment provided by the present disclosure. In practical applications, other application environments may also be included.
  • the information acquisition request of the terminal 05 can be directly sent to the service server 07.
  • the content distribution server 06 set between the terminal 05 and the service server 07 can effectively reduce the request pressure of the service server, thereby improving the efficiency of information recommendation.
  • the above-mentioned log database 01, retrieval server 02, historical behavior database 03, training server 04, terminal 05, content distribution server 06 and service server 07 can be directly or indirectly connected through wired or wireless communication.
  • the disclosure is not limited here.
  • FIG. 2 is a flowchart of an information recommendation method according to an exemplary embodiment. As shown in FIG. 2 , the information recommendation method is used in electronic devices such as servers and edge computing nodes, and includes the following steps:
  • the long-term behavioral data set represents a plurality of historical behavioral data of the target object within a preset time period.
  • the preset time period may be set in combination with actual application requirements, such as 1 year.
  • each historical behavior data may reflect the associated data in the process of the behavior of the corresponding object to the recommended information.
  • each historical behavior data may include: information identifier of historical recommendation information, publisher identifier of historical recommendation information, recommendation time, information viewing time, topic tag of historical recommendation information, behavior of objects on historical recommendation information Tags (e.g. click, like, follow, retweet, etc.).
  • acquiring the long-term behavior data set of the target object in response to the information acquisition request of the target object includes: receiving an information acquisition request of the target object, where the information acquisition request includes the target object identifier of the target object; acquiring based on the target object identifier A long-term behavioral dataset of the target object.
  • the above method may further include:
  • the historical behavior database may include historical behavior data of multiple objects, and a mapping relationship between object identifiers of the multiple objects and corresponding historical behavior data.
  • each object often corresponds to a large amount of historical behavior data.
  • the historical behavior database can support backfilling in two formats: historical data backfilling and automatic backfilling triggered by recommended events.
  • historical behavior data can be extracted from a large amount of historical log data and backfilled into the historical behavior database.
  • the automatic backfill triggered by the recommendation event can update the behavior data in the real-time information recommendation process to the historical behavior database.
  • deduplication processing in combination with the recommendation time and the information identifier of the historical recommendation information.
  • the above-mentioned acquisition of the long-term behavior data set of the target object based on the object identifier may include:
  • the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between object identifiers of multiple objects and the corresponding historical behavior data;
  • the corresponding historical behavior data is taken as the long-term behavior data set of the target object.
  • the historical behavior database including the historical behavior data of a large number of objects and the mapping relationship between the object identifiers of the large number of objects and the corresponding historical behavior data, it is convenient to receive the information acquisition request of the target object. Quickly query long-term behavioral data sets of objects directly combined with object identification.
  • the feature information of each recommendation information in the recommendation information set is determined.
  • the recommended information set may include a large number of recommended information in the recommendation system, and in some embodiments, the recommended information may be videos, pictures, text information, and the like.
  • determining the feature information of each recommendation information in the recommendation information set includes:
  • the subject tag of the recommended information may be text information that can represent the subject content of the recommended information.
  • the user when a user publishes information in some platforms, the user is often asked to fill in the subject tag of the information in combination with the preset symbols.
  • the hashtags are distinguished by means of preset symbols, and correspondingly, the hashtags can be obtained from the relevant text information of the information by matching the preset symbols.
  • the preset symbol may be "##", of course, other symbols or manners may also be used to distinguish hashtags in practical applications.
  • a pre-trained hashtag recognition network can also be used to extract the hashtags of the recommended information.
  • the hashtag is obtained by the network performing hashtag recognition training on a preset neural network based on a large amount of sample recommendation information and hashtags corresponding to the sample recommendation information.
  • generating the feature information of the recommendation information based on the topic tag of each recommendation information may include acquiring the word vector of the topic tag, and correspondingly, the word vector of the topic tag may be used as the feature information of the recommendation information.
  • a word vector representation model obtained by pre-training a preset word vector model based on preset training text information may be pre-trained.
  • the preset training text information may be text information in a recommendation system.
  • the preset training text information may be subjected to word segmentation processing, and the classification information after word segmentation processing may be input into the preset word vector model for training.
  • Each word is mapped into a K-dimensional real number vector, and a word vector set representing the semantic similarity between words can be obtained while the word vector representation model is obtained.
  • the preset word vector model is trained with the preset training text information in a certain system, and the obtained word vector representation model can effectively represent the semantic similarity between words in the system.
  • the hashtag when the word vector representation model has been trained, the hashtag can be segmented and input into the word vector representation model, and the word vector representation model can determine the above-mentioned hashtag based on the word vector in the word vector set The word vector of the corresponding word segmentation information. Further, in the case that the word segmentation information corresponding to the topic tag has multiple words, the mean value of the word vectors of the multiple words can be taken as the word vector of the topic tag; in the case that the word segmentation information corresponding to the topic tag has one word , the word vector of this word can be used as the word vector of the hashtag.
  • the preset word vector model may include, but is not limited to, word2vec, BERT, glove and other models.
  • determining the feature information of each recommendation information in the recommendation information set above may include:
  • the recommendation information when the recommendation information itself is text information, the recommendation information may be used as text information.
  • the text information of the recommended information may include text information corresponding to the voice information in the video, text information of the video cover, title information of the video, and entity text (entity text) extracted from the video. It can be a person, an object, etc.) and the search information of the video (that is, the video can be recalled based on the search information in the recommender system), etc.
  • generating the feature information of each recommendation information based on the text information of each recommendation information may be a word vector corresponding to the text information for generating the recommendation information, and correspondingly, the corresponding word vector recommends the feature information of the information.
  • the word vector corresponding to the text information of the generated recommendation information For the specific refinement of the word vector corresponding to the text information of the generated recommendation information, reference may be made to the above-mentioned relevant steps, which will not be repeated here.
  • feature representation networks such as one-hot (one-hot) coding network and N-Gram (Chinese language model) can also be combined to generate a feature vector corresponding to the text information of the recommendation information.
  • the feature vector can be generated. The vector is used as the feature information of recommendation information.
  • the feature information of the recommendation information is generated by the subject tag or text information of the recommendation information, which can realize the effective representation of the recommendation information, thereby ensuring the subsequent recall of behavior data related to the recommendation information.
  • target behavior data corresponding to each recommendation information is determined from the long-term behavior data set based on the feature information of each recommendation information.
  • determining the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information may include the following steps:
  • target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
  • the behavior feature information of the historical behavior data is a feature vector corresponding to the historical behavior data generated by a feature representation network such as a one-hot coding network, an N-Gram (Chinese language model), and the like.
  • a feature representation network such as a one-hot coding network, an N-Gram (Chinese language model), and the like.
  • the first similarity between the behavior feature information and the feature information of the recommendation information may represent the degree of association between the historical behavior data and the recommendation information.
  • the relationship between the behavior feature information and the feature information of the recommendation information The higher the first similarity, the higher the degree of correlation between the corresponding historical behavior data and the recommendation information; conversely, the lower the first similarity between the behavior feature information and the feature information of the recommendation information, the lower the corresponding historical behavior data and the recommendation information. The lower the degree of correlation between the information.
  • the first similarity between the behavior feature information and the feature information of the recommendation information may include, but is not limited to, cosine distance, Euclidean distance, and Manhattan distance between the behavior feature information and the feature information of the recommendation information.
  • historical behavior data corresponding to behavior feature information whose first degree of similarity to the feature information of the recommendation information is greater than or equal to a preset threshold may be used as the target behavior data corresponding to the recommendation information.
  • the first similarity between the behavior feature information of each historical behavior data in the long-term behavior data set and the feature information of the recommendation information can be sorted in descending order, and correspondingly, a preset number of behaviors can be sorted first.
  • the historical behavior data corresponding to the feature information is used as the target behavior data corresponding to the recommendation information.
  • the preset threshold and the preset number can be set in advance in combination with practical applications to set the information recommendation accuracy requirements (the higher the degree of association between the recommended information and the historical behavior data, the better the information recommendation accuracy).
  • the historical behavior data with a higher degree of correlation can be selected from the long-term behavior data set of the target object for each recommendation information.
  • the target behavior data corresponding to the recommendation information it ensures that the behavior data used for interest identification can fully reflect the interests and preferences of the object; at the same time, combined with the feature information of different recommendation information, the historical behavior data is screened in a targeted manner, so that the target behavior
  • the data can effectively reflect the interest and preference of the target object to the recommended information, greatly reduce the amount of data in the process of interest identification, and effectively improve the efficiency and accuracy of interest identification.
  • the target behavior data is input into the interest recognition network for interest recognition, and the target object's interest index for each recommendation information is obtained.
  • the interest index can represent the preference of the target object to the recommended information.
  • the interest recognition network may be pre-trained, and accordingly, the above method may further include the step of pre-training the interest recognition network. As shown in Figure 4, pre-training an interest recognition network can include the following steps:
  • the multiple sample recommendation information may be information recommended in the recommendation system.
  • the recommendation log information stored in the log database may include sample recommendation information corresponding to multiple historical recommendation events, information identifiers of the sample recommendation information corresponding to multiple historical recommendation events, and recommendation time and other basic information about historical recommendation events. information.
  • information identifiers of multiple sample recommendation information may be obtained from recommendation log information.
  • the sample feature information of the above-mentioned multiple sample recommendation information may be obtained in the following manner:
  • the sample feature information of each sample recommendation information is generated based on the text information of each sample recommendation information.
  • the sample feature information of the sample recommendation information is generated by the subject tag or text information of the sample recommendation information, which can realize the effective representation of the sample recommendation information, thereby ensuring the subsequent recall of behavior data related to the sample recommendation information.
  • a historical recommendation object corresponding to each sample recommendation information and an interest labeling index corresponding to the historical recommendation object are determined based on the information identifier of each sample recommendation information.
  • each historical behavior data stored in the historical behavior database can reflect the behavior of the corresponding object after the recommendation information is recommended.
  • each historical behavior data can include the information identifier of the recommendation information (sample recommendation information) and The object's behavior label for this recommendation.
  • the object corresponding to the historical behavior data pair including the information identifier of the sample recommendation information may be used as the historical recommendation object corresponding to the sample recommendation information.
  • the interest labeling index can be generated in combination with the behavior label of the object's recommendation information about the sample in the historical behavior data.
  • the interest labeling index may include, in combination with the actual application scenario, the preference of the object on the recommendation information reflected by one or more behaviors.
  • the click behavior to reflect the object's preference for the recommended information as an example, in response to the behavior label in the historical behavior data being a click, in some embodiments, the interest labeling indicator may be 1 (represented by 1). clicked, 0 means not clicked).
  • the object identifier of the historical recommendation object may be obtained.
  • corresponding historical behavior data may be obtained from a historical behavior database in combination with the object identifier, as a long-term sample behavior data set of the historical recommendation object.
  • the sample behavior data corresponding to each sample recommendation information is determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information.
  • determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information may include:
  • the sample behavior data corresponding to each sample recommendation information is determined from the long-term sample behavior data set.
  • the specific refinement of the relevant steps for determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information please refer to the above-mentioned based on each sample behavior data.
  • the feature information of the recommendation information is determined from the long-term behavior data set to determine the specific refinement of the relevant steps of the target behavior data corresponding to each recommendation information, which will not be repeated here.
  • each sample recommendation information can be selected from the long-term sample behavior data set of the historical recommendation object.
  • the historical behavior data with a high degree of correlation is used as the sample behavior data corresponding to the sample recommendation information, which ensures that the sample behavior data used for interest identification can fully reflect the interests and preferences of the object; at the same time, combined with different sample recommendation information, targeted
  • the screening of historical behavior data enables the sample behavior data to effectively reflect the interest and preference of historical recommendation objects to sample recommendation information, and greatly reduces the amount of data in the process of interest recognition, thereby improving the efficiency and accuracy of interest recognition.
  • the sample behavior data is input into the neural network for interest identification, so as to obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information.
  • the neural network may identify the network for the interest to be trained.
  • neural networks may include, but are not limited to, convolutional neural networks, recurrent neural networks, and the like.
  • the interest prediction index may represent the preference of each historical recommendation object predicted by the trained neural network to the corresponding sample recommendation information.
  • the interest predictor may be a value greater than or equal to 0 and less than or equal to 1. Correspondingly, the larger the value of the interest predictor is, the greater the value of the interest predictor, represents the difference between the historical recommendation objects predicted by the trained neural network and the corresponding sample recommendation information. more like.
  • the target loss is determined according to the interest prediction index and the interest labeling index.
  • determining the target loss may include calculating the loss between the interest prediction index corresponding to each historical recommendation object and the corresponding interest labeling index based on a preset loss function, and analyzing multiple The losses corresponding to the historical recommendation objects corresponding to the sample recommendation information are summed to obtain the above target loss.
  • the preset loss function may include, but is not limited to, a cross-entropy loss function, a logistic loss function, a Hinge (hinge) loss function, an exponential loss function, and the like, and the embodiment of this specification is not limited to the above.
  • a neural network is trained based on the target loss to obtain an interest recognition network.
  • training a neural network based on the target loss to obtain an interest recognition network may include
  • the target loss is updated based on the updated neural network, until the target loss meets the preset condition, and the current neural network is used as the above-mentioned interest recognition network.
  • the target loss satisfying the preset condition may be that the target loss is less than or equal to a specified threshold, or the difference between the corresponding target losses in the two training processes before and after is less than a certain threshold.
  • the specified threshold and a certain threshold may be set in combination with actual training requirements.
  • the interest annotation information may be task annotation information corresponding to multiple tasks
  • the interest prediction indicators may be multiple Task prediction information corresponding to the task.
  • the identification information of the sample recommendation information is obtained to determine the historical recommendation object and the interest labeling index corresponding to the historical recommendation object, and the long-term sample behavior data set of the historical recommendation object is obtained, which ensures the The comprehensiveness of the behavior data in the training process can effectively reflect the interests and preferences of the objects, and at the same time, combined with different sample recommendation information, the historical behavior data is screened in a targeted manner, so that the sample behavior data used for interest identification can effectively reflect the historical recommendation objects.
  • the real interest preference of sample recommendation information can greatly reduce the amount of data in the interest recognition process, thereby effectively improving the interest recognition efficiency and recognition accuracy of the trained interest recognition network.
  • the above method before determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information, as shown in FIG. 5 , the above method further includes:
  • determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information may include:
  • the sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  • the recommendation time of the sample recommendation information may be obtained from the corresponding recommendation log information.
  • performing data filtering on the corresponding long-term sample behavior data set based on the recommendation time, and obtaining the filtered behavior data set may include filtering out the behavior time from the long-term sample behavior data set of the historical recommendation object corresponding to each sample recommendation information After the sample behavior data of the recommendation information of each sample, the above filtering behavior data set is obtained.
  • data filtering is performed on the corresponding long-term sample behavior data set in combination with the recommendation time of the sample recommendation information, which can effectively prevent data traversal, realize the filtering of historical behavior data generated after the recommendation time, and ensure that the information is used for interest purposes. Validity of identified behavioral data.
  • the target information in the recommendation information set is recommended to the target object based on the interest index.
  • recommending the target information in the recommendation information set to the target object based on the interest index may include: determining the target information from the recommendation information set according to the interest index; recommending the target information to the target object.
  • the interest index represents the preference of the target object for each recommendation information in the recommendation information set.
  • the interest index may be a numerical value proportional to the degree of preference, or may be a characterized representation representing the degree of preference of the target object for each recommendation information in the recommendation information set, such as "medium".
  • the character symbol representation can be quantified into corresponding numerical values in combination with certain rules.
  • a confidence threshold may be set in advance in combination with the information recommendation accuracy requirement (generally, the higher the confidence threshold, the more accurate the recommended information).
  • the value corresponding to the interest index is greater than or equal to the confidence threshold.
  • Recommendation information can be used as target information.
  • the long-term behavior data set of the target object is obtained, which ensures the comprehensiveness of the behavior data, and can effectively reflect the interests and preferences of the object.
  • the historical behavior data is screened in a targeted manner, so that the target behavior data used for interest identification can effectively reflect the target object's real interest preference for the recommendation information, and can greatly reduce the interest identification process. It can effectively improve the efficiency and accuracy of interest recognition, and then greatly improve the accuracy and performance of information recommendation in the recommendation system.
  • Fig. 6 is a block diagram of an information recommendation apparatus according to an exemplary embodiment.
  • the device includes:
  • the first behavior data set obtaining module 610 is configured to obtain the long-term behavior data set of the target object in response to the information obtaining request of the target object;
  • a feature information determination module 620 configured to determine feature information of each recommendation information in the recommendation information set
  • the target behavior data determination module 630 is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
  • the first interest identification module 640 is configured to input the target behavior data into the interest identification network for interest identification, and obtain the interest index of the target object for each recommendation information;
  • the information recommendation module 650 is configured to recommend the target information in the recommended information set to the target object based on the interest index.
  • target behavior data determination module 630 includes:
  • a behavior feature information acquisition unit configured to acquire behavior feature information of each historical behavior data in the long-term behavior data set
  • a first similarity calculation unit configured to calculate a first similarity between the feature information of each recommendation information and the behavior feature information
  • the target behavior data determining unit is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set according to the first similarity.
  • the first behavior data set acquisition module 610 includes:
  • an information acquisition request receiving unit configured to receive an information acquisition request of a target object, where the information acquisition request includes a target object identifier of the target object;
  • the long-term behavior data set acquisition unit is configured to acquire the long-term behavior data set of the target object based on the target object identifier.
  • the long-term behavioral data set acquisition unit includes:
  • the historical behavior database acquisition unit is configured to acquire a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and a mapping relationship between object identifiers of multiple objects and corresponding historical behavior data;
  • the historical behavior data acquisition unit is configured to acquire historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
  • the long-term behavior data set determining unit is configured to take the corresponding historical behavior data as the long-term behavior data set of the target object.
  • characteristic information determination module 620 includes:
  • a first topic tag acquiring unit configured to acquire a topic tag of each recommended information
  • a first feature information generating unit configured to generate feature information of each recommendation information based on the subject tag of each recommendation information
  • a first text information extraction unit configured to extract the text information of each recommendation information
  • the second feature information generating unit is configured to generate feature information of each recommendation information based on the text information of each recommendation information.
  • the information recommendation module 650 includes:
  • a target information determination unit configured to determine target information from the recommended information set according to the interest index
  • the information pushing unit is configured to recommend target information to the target object.
  • the above-mentioned apparatus further comprises:
  • an information acquisition module configured to acquire information identifiers and sample feature information of multiple sample recommendation information
  • the object information determination module is configured to determine the historical recommendation object corresponding to each sample recommendation information and the interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
  • the second behavior data set acquisition module is configured to acquire long-term sample behavior data sets of historical recommendation objects
  • the sample behavior number determination module is configured to determine the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information;
  • the second interest identification module is configured to input the sample behavior data into the neural network for interest identification, and obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
  • the target loss determination module is configured to determine the target loss according to the interest prediction index and the interest labeling index
  • the network training module is configured to train the neural network based on the target loss to obtain the interest recognition network.
  • the above-mentioned apparatus further comprises:
  • a recommendation time determination module configured to determine the recommendation time of each sample recommendation information
  • the data filtering module is configured to perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain the filtering behavior data set;
  • the sample behavior data determination module is further configured to determine sample behavior data corresponding to each sample recommendation information from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  • FIG. 7 is a block diagram of an electronic device for information recommendation according to an exemplary embodiment.
  • the electronic device may be a server, and its internal structure diagram may be as shown in FIG. 7 .
  • the electronic device includes a processor, memory, and a network interface connected by a system bus. Among them, the processor of the electronic device is used to provide computing and control capabilities.
  • the memory of the electronic device includes a computer-readable storage medium and an internal memory.
  • the computer-readable storage medium stores an operating system and a computer program.
  • the internal memory provides an environment for the execution of the operating system and computer programs in the computer-readable storage medium.
  • the network interface of the electronic device is used to communicate with an external terminal through a network connection.
  • the computer program when executed by the processor, implements an information recommendation method.
  • FIG. 7 is only a block diagram of a partial structure related to the solution of the present disclosure, and does not constitute a limitation on the electronic device to which the solution of the present disclosure is applied.
  • the specific electronic device may be Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
  • an electronic device comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to the instructions to implement embodiments as disclosed in the present disclosure The information in the recommended method.
  • a computer-readable storage medium is also provided, when the instructions in the computer-readable storage medium are executed by the processor of the electronic device, the electronic device can perform the information recommendation in the embodiments of the present disclosure method.
  • the computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
  • a computer program product containing instructions, which, when executed on a computer, cause the computer to execute the information recommendation method in the embodiments of the present disclosure.
  • any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Biomedical Technology (AREA)
  • Computational Linguistics (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

An information recommendation method and device, belonging to the technical field of the Internet. The information recommendation method comprises: in response to an information acquisition request of a target object, acquiring a long-term behavior data set of the target object (S201); determining feature information of each recommendation information in a recommendation information set (S203); determining target behavior data corresponding to each recommendation information from the long-term behavior data set on the basis of the feature information of each recommendation information (S205); inputting the target behavior data into an interest recognition network for interest recognition to obtain an interest index of the target object for each recommendation information (S207); and recommending target information in the recommendation information set to the target object on the basis of the interest index (S209).

Description

信息推荐方法及装置Information recommendation method and device
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本公开基于申请日为2021年1月11日、申请号为202110034339.3的中国专利申请,并要求该中国专利申请的优先权,在此全文引用上述中国专利申请公开的内容以作为本公开的一部分。The present disclosure is based on a Chinese patent application with an application date of January 11, 2021 and an application number of 202110034339.3, and claims the priority of the Chinese patent application. The disclosure of the above Chinese patent application is hereby incorporated by reference in its entirety as a part of this disclosure.
技术领域technical field
本公开涉及互联网技术领域,尤其涉及信息推荐方法、装置、电子设备及存储介质。The present disclosure relates to the field of Internet technologies, and in particular, to an information recommendation method, apparatus, electronic device, and storage medium.
背景技术Background technique
随着互联网技术的发展,大量网络平台也在不断的升级,除了可以发布一些图文信息之外,也可以供个人用户随时分享日常的短视频等,而如何精准的捕捉用户的兴趣是大量推荐系统遇到的挑战。With the development of Internet technology, a large number of network platforms are constantly being upgraded. In addition to publishing some graphic information, individual users can also share daily short videos, etc., and how to accurately capture the interests of users is a large number of recommendations. system challenges.
发明内容SUMMARY OF THE INVENTION
本公开提供一种信息推荐方法、装置、电子设备及存储介质。本公开的技术方案如下:The present disclosure provides an information recommendation method, device, electronic device and storage medium. The technical solutions of the present disclosure are as follows:
根据本公开实施例的第一方面,提供一种信息推荐方法,包括:According to a first aspect of the embodiments of the present disclosure, an information recommendation method is provided, including:
响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
确定推荐信息集中每个推荐信息的特征信息;Determine the feature information of each recommended information in the recommended information set;
基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;Determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
将所述目标行为数据输入兴趣识别网络进行兴趣识别,得到所述目标对象对每个推荐信息的兴趣指标;Inputting the target behavior data into an interest recognition network for interest recognition, to obtain the target object's interest index for each recommendation information;
基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。The target information in the recommendation information set is recommended to the target object based on the interest index.
在一些实施例中,所述基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据包括:In some embodiments, determining the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information includes:
获取所述长期行为数据集中每个历史行为数据的行为特征信息;Obtaining behavior feature information of each historical behavior data in the long-term behavior data set;
计算每个推荐信息的特征信息与所述行为特征信息间的第一相似度;calculating the first similarity between the feature information of each recommendation information and the behavior feature information;
根据所述第一相似度,从所述长期行为数据集中确定所述每个推荐信息对应的目标行为数据。According to the first similarity, target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
在一些实施例中,所述响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集包括:In some embodiments, in response to the information acquisition request of the target object, acquiring the long-term behavior data set of the target object includes:
接收所述目标对象的信息获取请求,所述信息获取请求包括所述目标对象的目标对象 标识;Receive the information acquisition request of the target object, and the information acquisition request includes the target object identification of the target object;
基于所述目标对象标识获取所述目标对象的长期行为数据集。A long-term behavior data set of the target object is acquired based on the target object identifier.
在一些实施例中,所述基于所述对象标识获取所述目标对象的长期行为数据集包括:In some embodiments, acquiring the long-term behavior data set of the target object based on the object identifier includes:
获取预先构建的历史行为数据库,所述历史行为数据库包括多个对象的历史行为数据,以及所述多个对象的对象标识与对应的历史行为数据间的映射关系;Obtaining a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between the object identifiers of the multiple objects and the corresponding historical behavior data;
根据所述映射关系从所述历史行为数据库中,获取所述目标对象标识对应的历史行为数据;Obtain historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
将所述对应的历史行为数据作为所述目标对象的长期行为数据集。The corresponding historical behavior data is used as the long-term behavior data set of the target object.
在一些实施例中,所述确定推荐信息集中每个推荐信息的特征信息包括:In some embodiments, the determining the characteristic information of each recommendation information in the recommendation information set includes:
获取所述每个推荐信息的主题标签;obtaining the subject tag of each of the recommended information;
基于所述每个推荐信息的主题标签生成所述每个推荐信息的特征信息;generating feature information of each recommendation information based on the subject tag of each recommendation information;
或,or,
提取所述每个推荐信息的文本信息;extracting the text information of each recommended information;
基于所述每个推荐信息的文本信息生成所述每个推荐信息的特征信息。The feature information of each recommendation information is generated based on the text information of each recommendation information.
在一些实施例中,所述基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象包括:In some embodiments, the recommending the target information in the recommendation information set to the target object based on the interest index includes:
根据所述兴趣指标从所述推荐信息集中确定目标信息;determining target information from the recommendation information set according to the interest index;
将所述目标信息推荐给所述目标对象。The target information is recommended to the target object.
在一些实施例中,所述方法还包括:In some embodiments, the method further includes:
获取多个样本推荐信息的信息标识和样本特征信息;Obtain information identification and sample feature information of multiple sample recommendation information;
基于每个样本推荐信息的信息标识确定所述每个样本推荐信息对应的历史推荐对象和所述历史推荐对象对应的兴趣标注指标;Determine the historical recommendation object corresponding to each sample recommendation information and the interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
获取所述历史推荐对象的长期样本行为数据集;obtaining a long-term sample behavior data set of the historical recommendation object;
基于所述每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据;Determine the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of the each sample recommendation information;
将所述样本行为数据输入神经网络进行兴趣识别,得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;Inputting the sample behavior data into the neural network for interest identification, and obtaining the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
根据所述兴趣预测指标和所述兴趣标注指标,确定目标损失;determining a target loss according to the interest prediction index and the interest labeling index;
基于所述目标损失训练所述神经网络,得到所述兴趣识别网络。The neural network is trained based on the target loss to obtain the interest recognition network.
在一些实施例中,,所述方法还包括:In some embodiments, the method further includes:
确定所述每个样本推荐信息的推荐时间;determining the recommendation time of the recommendation information for each sample;
基于所述推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;Perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtered behavior data set;
所述基于所述每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据包括:The sample behavior data corresponding to each sample recommendation information determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information includes:
基于所述每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定所述每个样本推荐信息对应的样本行为数据。The sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
根据本公开实施例的第二方面,提供一种信息推荐装置,包括:According to a second aspect of the embodiments of the present disclosure, there is provided an information recommendation apparatus, including:
第一行为数据集获取模块,被配置为响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;The first behavior data set acquisition module is configured to obtain a long-term behavior data set of the target object in response to an information acquisition request of the target object, and the long-term behavior data set represents the number of the target object in a preset time period. historical behavioral data;
特征信息确定模块,被配置为确定推荐信息集中每个推荐信息的特征信息;a feature information determination module, configured to determine feature information of each recommendation information in the recommendation information set;
目标行为数据确定模块,被配置为基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;a target behavior data determination module, configured to determine target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
第一兴趣识别模块,被配置为将所述目标行为数据输入兴趣识别网络进行兴趣识别,得到所述目标对象对每个推荐信息的兴趣指标;a first interest identification module, configured to input the target behavior data into an interest identification network for interest identification, and obtain an interest index of the target object for each recommendation information;
信息推荐模块,被配置为基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。An information recommendation module, configured to recommend target information in the recommended information set to the target object based on the interest index.
在一些实施例中,所述目标行为数据确定模块包括:In some embodiments, the target behavior data determination module includes:
行为特征信息获取单元,被配置为获取所述长期行为数据集中每个历史行为数据的行为特征信息;a behavior feature information acquisition unit, configured to acquire behavior feature information of each historical behavior data in the long-term behavior data set;
第一相似度计算单元,被配置为计算每个推荐信息的特征信息与所述行为特征信息间的第一相似度;a first similarity calculation unit, configured to calculate a first similarity between the feature information of each recommendation information and the behavior feature information;
目标行为数据确定单元,被配置为根据所述第一相似度,从所述长期行为数据集中确定所述每个推荐信息对应的目标行为数据。The target behavior data determining unit is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set according to the first similarity.
在一些实施例中,所述第一行为数据集获取模块包括:In some embodiments, the first behavior data set acquisition module includes:
信息获取请求接收单元,被配置为接收所述目标对象的信息获取请求,所述信息获取请求包括所述目标对象的目标对象标识;an information acquisition request receiving unit, configured to receive an information acquisition request of the target object, where the information acquisition request includes a target object identifier of the target object;
长期行为数据集获取单元,被配置为基于所述目标对象标识获取所述目标对象的长期行为数据集。The long-term behavior data set acquisition unit is configured to acquire the long-term behavior data set of the target object based on the target object identifier.
在一些实施例中,所述长期行为数据集获取单元包括:In some embodiments, the long-term behavior data set acquisition unit includes:
历史行为数据库获取单元,被配置为获取预先构建的历史行为数据库,所述历史行为数据库包括多个对象的历史行为数据,以及所述多个对象的对象标识与对应的历史行为数据间的映射关系;A historical behavior database acquisition unit, configured to acquire a pre-built historical behavior database, the historical behavior database includes historical behavior data of a plurality of objects, and a mapping relationship between the object identifiers of the plurality of objects and the corresponding historical behavior data ;
历史行为数据获取单元,被配置为根据所述映射关系从所述历史行为数据库中,获取所述目标对象标识对应的历史行为数据;a historical behavior data acquisition unit, configured to acquire historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
长期行为数据集确定单元,被配置为将所述对应的历史行为数据作为所述目标对象的长期行为数据集。The long-term behavior data set determination unit is configured to use the corresponding historical behavior data as the long-term behavior data set of the target object.
在一些实施例中,所述特征信息确定模块包括:In some embodiments, the feature information determination module includes:
第一主题标签获取单元,被配置为获取所述每个推荐信息的主题标签;a first topic tag acquiring unit, configured to acquire the topic tag of each of the recommended information;
第一特征信息生成单元,被配置为基于所述每个推荐信息的主题标签生成所述每个推荐信息的特征信息;a first feature information generating unit, configured to generate feature information of each recommendation information based on the subject tag of each recommendation information;
或,or,
第一文本信息提取单元,被配置为提取所述每个推荐信息的文本信息;a first text information extraction unit, configured to extract the text information of each recommended information;
第二特征信息生成单元,被配置为基于所述每个推荐信息的文本信息生成所述每个推荐信息的特征信息。The second feature information generating unit is configured to generate feature information of each recommendation information based on the text information of each recommendation information.
在一些实施例中,所述信息推荐模块包括:In some embodiments, the information recommendation module includes:
目标信息确定单元,被配置为根据所述兴趣指标从所述推荐信息集中确定目标信息;a target information determination unit, configured to determine target information from the recommended information set according to the interest index;
信息推单元,被配置为将所述目标信息推荐给所述目标对象。An information pushing unit, configured to recommend the target information to the target object.
在一些实施例中,所述装置还包括:In some embodiments, the apparatus further includes:
信息获取模块,被配置为获取多个样本推荐信息的信息标识和样本特征信息;an information acquisition module, configured to acquire information identifiers and sample feature information of multiple sample recommendation information;
对象信息确定模块,被配置为基于每个样本推荐信息的信息标识确定所述每个样本推荐信息对应的历史推荐对象和所述历史推荐对象对应的兴趣标注指标;an object information determination module, configured to determine a historical recommendation object corresponding to each sample recommendation information and an interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
第二行为数据集获取模块,被配置为获取所述历史推荐对象的长期样本行为数据集;The second behavior data set obtaining module is configured to obtain the long-term sample behavior data set of the historical recommendation object;
样本行为数确定模块,被配置为基于所述每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据;a sample behavior number determination module, configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information;
第二兴趣识别模块,被配置为将所述样本行为数据输入神经网络进行兴趣识别,以得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;The second interest identification module is configured to input the sample behavior data into the neural network for interest identification, so as to obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
目标损失确定模块,被配置为根据所述兴趣预测指标和所述兴趣标注指标,确定目标损失;a target loss determination module, configured to determine a target loss according to the interest prediction index and the interest labeling index;
网络训练模块,被配置为基于所述目标损失训练所述神经网络,得到所述兴趣识别网络。A network training module configured to train the neural network based on the target loss to obtain the interest recognition network.
在一些实施例中,所述装置还包括:In some embodiments, the apparatus further includes:
推荐时间确定模块,被配置为确定所述每个样本推荐信息的推荐时间;a recommendation time determination module, configured to determine the recommendation time of the recommended information for each sample;
数据过滤模块,被配置为基于所述推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;a data filtering module, configured to perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtering behavior data set;
所述样本行为数据确定模块还被配置为基于所述每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定所述每个样本推荐信息对应的样本行为数据。The sample behavior data determination module is further configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
根据本公开实施例的第三方面,提供一种电子设备,包括:处理器;用于存储所述处理器可执行指令的存储器;其中,所述处理器被配置为执行所述指令,以实现如上述第一方面中任一项所述的方法。According to a third aspect of embodiments of the present disclosure, there is provided an electronic device, comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to execute the instructions to achieve The method of any one of the first aspects above.
根据本公开实施例的第四方面,提供一种计算机可读存储介质,当所述存储介质中的指令由电子设备的处理器执行时,使得所述电子设备能够执行本公开实施例的第一方面中任一所述方法。According to a fourth aspect of the embodiments of the present disclosure, a computer-readable storage medium is provided, when instructions in the storage medium are executed by a processor of an electronic device, the electronic device can execute the first embodiment of the present disclosure. The method of any of the aspects.
根据本公开实施例的第五方面,提供一种包含指令的计算机程序产品,当其在计算机 上运行时,使得计算机执行本公开实施例的第一方面中任一所述方法。According to a fifth aspect of embodiments of the present disclosure, there is provided a computer program product comprising instructions which, when run on a computer, cause the computer to perform any of the methods described in the first aspect of the embodiments of the present disclosure.
在信息推荐过程中,获取目标对象的长期行为数据集,保证了行为数据的全面性,能够有效反映对象的兴趣喜好,同时结合不同推荐信息的特征信息有针对性的进行历史行为数据的筛选,使得用于进行兴趣识别的目标行为数据可以有效反映目标对象对推荐信息的真实兴趣偏好情况,且可以大大降低兴趣识别过程中的数据量,有效提高兴趣识别效率和识别精准性,进而大大提升推荐系统中信息推荐精准性和推荐性能。In the process of information recommendation, the long-term behavior data set of the target object is obtained, which ensures the comprehensiveness of the behavior data and can effectively reflect the interest and preference of the object. The target behavior data used for interest recognition can effectively reflect the target object's real interest preference for recommended information, and can greatly reduce the amount of data in the process of interest recognition, effectively improve the efficiency and accuracy of interest recognition, and greatly improve the recommendation. Information recommendation accuracy and recommendation performance in the system.
附图说明Description of drawings
图1是根据一些实施例示出的一种应用环境的示意图;1 is a schematic diagram of an application environment according to some embodiments;
图2是根据一些实施例示出的一种信息推荐方法的流程图;FIG. 2 is a flowchart of an information recommendation method according to some embodiments;
图3是根据一些实施例示出的一种基于每个推荐信息的特征信息从长期行为数据集中,确定每个推荐信息对应的目标行为数据的流程示意图;3 is a schematic flowchart of determining target behavior data corresponding to each recommendation information from a long-term behavior data set based on feature information of each recommendation information according to some embodiments;
图4是根据一些实施例示出的一种预先训练兴趣识别网络的流程示意图;4 is a schematic flowchart of a pre-trained interest recognition network according to some embodiments;
图5是根据一些实施例示出的一种预先训练兴趣识别网络的流程示意图;5 is a schematic flowchart of a pre-trained interest recognition network according to some embodiments;
图6是根据一些实施例示出的一种信息推荐装置框图;6 is a block diagram of an information recommendation apparatus according to some embodiments;
图7是根据一些实施例示出的一种用于信息推荐的电子设备的框图。Fig. 7 is a block diagram of an electronic device for information recommendation according to some embodiments.
具体实施方式Detailed ways
为了使本领域普通人员更好地理解本公开的技术方案,下面将结合附图,对本公开实施例中的技术方案进行清楚、完整地描述。In order to make those skilled in the art better understand the technical solutions of the present disclosure, the technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings.
需要说明的是,本公开的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本公开的实施例能够以除了在这里图示或描述的那些以外的顺序实施。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。It should be noted that the terms "first", "second" and the like in the description and claims of the present disclosure and the above drawings are used to distinguish similar objects, and are not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used may be interchanged under appropriate circumstances such that the embodiments of the disclosure described herein can be practiced in sequences other than those illustrated or described herein. The implementations described in the illustrative examples below are not intended to represent all implementations consistent with this disclosure. Rather, they are merely examples of apparatus and methods consistent with some aspects of the present disclosure, as recited in the appended claims.
请参阅图1,图1是根据一示例性实施例示出的一种应用环境的示意图,如图1所示,该应用环境可以包括日志数据库01、检索服务器02、历史行为数据库03、训练服务器04、终端05、内容分发服务器06和业务服务器07。Please refer to FIG. 1, which is a schematic diagram of an application environment according to an exemplary embodiment. As shown in FIG. 1, the application environment may include a log database 01, a retrieval server 02, a historical behavior database 03, and a training server 04 , terminal 05 , content distribution server 06 and service server 07 .
在一些实施例中,日志数据库01可以用于存储推荐系统推荐过程中产生的推荐日志信息。历史行为数据库03可以用于存储推荐系统中大量对象的长期历史行为数据。在一些实施例中,历史行为数据库03以对象标识为key(检索参数),对象对应的历史行为数据集为value(检索参数对应的返回数据)来进行数据存储。In some embodiments, the log database 01 may be used to store recommendation log information generated during the recommendation process of the recommendation system. The historical behavior database 03 can be used to store long-term historical behavior data of a large number of objects in the recommender system. In some embodiments, the historical behavior database 03 uses the object identifier as the key (retrieval parameter), and the historical behavior data set corresponding to the object as the value (return data corresponding to the retrieval parameter) for data storage.
在一些实施例中,检索服务器02可以结合日志数据库01中的推荐日志信息发起长期 行为数据集获取请求,从历史行为数据库03中获取推荐系统推荐过的样本推荐信息对应的历史推荐对象的长期样本行为数据集;并将长期样本行为数据集和样本推荐信息作为训练数据发送给训练服务器04。In some embodiments, the retrieval server 02 may initiate a long-term behavior data set acquisition request in combination with the recommendation log information in the log database 01, and acquire long-term samples of historical recommendation objects corresponding to the sample recommendation information recommended by the recommendation system from the historical behavior database 03 Behavior data set; and send the long-term sample behavior data set and sample recommendation information to the training server 04 as training data.
在一些实施例中,训练服务器04可以结合检索服务器02发送的长期样本行为数据集和样本推荐信息进行兴趣识别网络的训练,并将训练好的兴趣识别网络发送给业务服务器07。In some embodiments, the training server 04 may train the interest recognition network in combination with the long-term sample behavior data set and the sample recommendation information sent by the retrieval server 02 , and send the trained interest recognition network to the service server 07 .
在一些实施例中,终端05可以提供面向用户的信息推荐服务;相应的,用户可以基于终端05触发信息获取请求。In some embodiments, the terminal 05 may provide a user-oriented information recommendation service; accordingly, the user may trigger an information acquisition request based on the terminal 05 .
在一些实施例中,终端05触发的信息获取请求可以发送至内容分发服务器06;相应的,由内容分发服务器06向业务服务器07转发该信息获取请求,进而由业务服务器07结合信息获取请求中携带的目标对象的目标对象标识从历史行为数据库03中获取推荐信息对应的目标对象的目标行为数据,并结合兴趣识别网络进行兴趣识别,进而可以基于兴趣识别的结果进行信息推荐。In some embodiments, the information acquisition request triggered by the terminal 05 may be sent to the content distribution server 06; correspondingly, the content distribution server 06 forwards the information acquisition request to the service server 07, and then the service server 07 combines the information acquisition request with the information contained in the request. The target object identifier of the target object obtains the target behavior data of the target object corresponding to the recommendation information from the historical behavior database 03, and combines the interest recognition network to perform interest recognition, and then information recommendation can be performed based on the result of the interest recognition.
在一些实施例中,检索服务器02、训练服务器04、内容分发服务器06和业务服务器07可以是独立的物理服务器,也可以是多个物理服务器构成的服务器集群或者分布式系统,还可以是提供云服务、云数据库、云计算、云函数、云存储、网络服务、云通信、中间件服务、域名服务、安全服务、CDN(Content Delivery Network,内容分发网络)、以及大数据和人工智能平台等基础云计算服务的云服务器。In some embodiments, the retrieval server 02, the training server 04, the content distribution server 06, and the service server 07 may be independent physical servers, or may be server clusters or distributed systems composed of multiple physical servers, or may provide cloud services. Services, cloud databases, cloud computing, cloud functions, cloud storage, network services, cloud communications, middleware services, domain name services, security services, CDN (Content Delivery Network), and big data and artificial intelligence platforms Cloud servers for cloud computing services.
在一些实施例中,终端05可以包括但不限于智能手机、台式计算机、平板电脑、笔记本电脑、智能音箱、数字助理、增强现实(augmented reality,AR)/虚拟现实(virtual reality,VR)设备、智能可穿戴设备等类型的电子设备。在一些实施例中,电子设备上运行的操作系统可以包括但不限于安卓系统、IOS系统、linux、windows等。In some embodiments, the terminal 05 may include, but is not limited to, smartphones, desktop computers, tablet computers, laptop computers, smart speakers, digital assistants, augmented reality (AR)/virtual reality (VR) devices, Electronic devices such as smart wearable devices. In some embodiments, the operating system running on the electronic device may include, but is not limited to, the Android system, the IOS system, linux, windows, and the like.
在一些实施例中,日志数据库01和历史行为数据库03可以为关系型数据库。In some embodiments, the log database 01 and the historical behavior database 03 may be relational databases.
此外,需要说明的是,图1所示的仅仅是本公开提供的一种应用环境,在实际应用中,还可以包括其他应用环境,例如终端05的信息获取请求可以直接发送给业务服务器07,本说明书实施例中,在终端05与业务服务器07间设置的内容分发服务器06可以有效降低业务服务器的请求压力,进而提升信息推荐效率。In addition, it should be noted that what is shown in FIG. 1 is only an application environment provided by the present disclosure. In practical applications, other application environments may also be included. For example, the information acquisition request of the terminal 05 can be directly sent to the service server 07, In the embodiment of this specification, the content distribution server 06 set between the terminal 05 and the service server 07 can effectively reduce the request pressure of the service server, thereby improving the efficiency of information recommendation.
本说明书实施例中,上述日志数据库01、检索服务器02、历史行为数据库03、训练服务器04、终端05、内容分发服务器06和业务服务器07可以通过有线或无线通信方式进行直接或间接地连接,本公开在此不做限制。In the embodiment of this specification, the above-mentioned log database 01, retrieval server 02, historical behavior database 03, training server 04, terminal 05, content distribution server 06 and service server 07 can be directly or indirectly connected through wired or wireless communication. The disclosure is not limited here.
图2是根据一示例性实施例示出的一种信息推荐方法的流程图,如图2所示,该信息推荐方法用于服务器、边缘计算节点等电子设备中,包括以下步骤:FIG. 2 is a flowchart of an information recommendation method according to an exemplary embodiment. As shown in FIG. 2 , the information recommendation method is used in electronic devices such as servers and edge computing nodes, and includes the following steps:
在S201中,响应于目标对象的信息获取请求,获取目标对象的长期行为数据集。In S201, in response to the information acquisition request of the target object, a long-term behavior data set of the target object is acquired.
在一些实施例中,长期行为数据集表征目标对象在预设时间段内的多个历史行为数据。在一些实施例中,预设时间段可以结合实际应用需求进行设置,例如1年等。本说明 书实施例中,每一历史行为数据可以反映相应的对象对推荐信息行为过程中的关联数据。在一些实施例中,每一历史行为数据可以包括:历史推荐信息的信息标识、历史推荐信息的发布者标识、推荐时间、信息查看时间、历史推荐信息的主题标签、对象对历史推荐信息的行为标签(例如点击,喜欢,关注,转发等)。In some embodiments, the long-term behavioral data set represents a plurality of historical behavioral data of the target object within a preset time period. In some embodiments, the preset time period may be set in combination with actual application requirements, such as 1 year. In the embodiment of this specification, each historical behavior data may reflect the associated data in the process of the behavior of the corresponding object to the recommended information. In some embodiments, each historical behavior data may include: information identifier of historical recommendation information, publisher identifier of historical recommendation information, recommendation time, information viewing time, topic tag of historical recommendation information, behavior of objects on historical recommendation information Tags (e.g. click, like, follow, retweet, etc.).
在一些实施例中,上述响应于目标对象的信息获取请求,获取目标对象的长期行为数据集包括:接收目标对象的信息获取请求,信息获取请求包括目标对象的目标对象标识;基于目标对象标识获取目标对象的长期行为数据集。In some embodiments, acquiring the long-term behavior data set of the target object in response to the information acquisition request of the target object includes: receiving an information acquisition request of the target object, where the information acquisition request includes the target object identifier of the target object; acquiring based on the target object identifier A long-term behavioral dataset of the target object.
在一些实施例中,上述方法还可以包括:In some embodiments, the above method may further include:
预先构建历史行为数据库。Pre-built historical behavior database.
在一些实施例中,历史行为数据库可以包括多个对象的历史行为数据,以及多个对象的对象标识与对应的历史行为数据间的映射关系。一般的,每个对象往往对应的大量的历史行为数据。In some embodiments, the historical behavior database may include historical behavior data of multiple objects, and a mapping relationship between object identifiers of the multiple objects and corresponding historical behavior data. Generally, each object often corresponds to a large amount of historical behavior data.
在实际应用中,为了尽快存储足够的历史行为数据,历史行为数据库可以支持两种格式的回填:历史数据回填和推荐事件触发的自动回填。历史数据回填过程中,可以从大量历史日志数据中提取历史行为数据,并回填到历史行为数据库,推荐事件触发的自动回填可以为将实时信息推荐过程中的行为数据更新至历史行为数据库。此外,为了保证数据的一致性,在更新的时候,可以结合推荐时间和历史推荐信息的信息标识进行去重处理。In practical applications, in order to store enough historical behavior data as soon as possible, the historical behavior database can support backfilling in two formats: historical data backfilling and automatic backfilling triggered by recommended events. In the process of historical data backfilling, historical behavior data can be extracted from a large amount of historical log data and backfilled into the historical behavior database. The automatic backfill triggered by the recommendation event can update the behavior data in the real-time information recommendation process to the historical behavior database. In addition, in order to ensure the consistency of the data, when updating, it is possible to perform deduplication processing in combination with the recommendation time and the information identifier of the historical recommendation information.
相应的,上述基于对象标识获取目标对象的长期行为数据集可以包括:Correspondingly, the above-mentioned acquisition of the long-term behavior data set of the target object based on the object identifier may include:
获取预先构建的历史行为数据库,历史行为数据库包括多个对象的历史行为数据,以及多个对象的对象标识与对应的历史行为数据间的映射关系;Obtain a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between object identifiers of multiple objects and the corresponding historical behavior data;
根据映射关系从历史行为数据库中,获取目标对象标识对应的历史行为数据;Obtain the historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
将对应的历史行为数据作为目标对象的长期行为数据集。The corresponding historical behavior data is taken as the long-term behavior data set of the target object.
上述实施例中,通过预先构建包括大量对象的历史行为数据,以及大量对象的对象标识与对应的历史行为数据间的映射关系的历史行为数据库,可以便于在接收到目标对象的信息获取请求时,直接结合对象标识快速查询到对象的长期行为数据集。In the above embodiment, by pre-constructing the historical behavior database including the historical behavior data of a large number of objects and the mapping relationship between the object identifiers of the large number of objects and the corresponding historical behavior data, it is convenient to receive the information acquisition request of the target object. Quickly query long-term behavioral data sets of objects directly combined with object identification.
在S203中,确定推荐信息集中每个推荐信息的特征信息。In S203, the feature information of each recommendation information in the recommendation information set is determined.
本说明书实施例中,推荐信息集中可以包括大量推荐系统中的推荐信息,在一些实施例中,推荐信息可以为视频、图片、文本信息等。In the embodiments of this specification, the recommended information set may include a large number of recommended information in the recommendation system, and in some embodiments, the recommended information may be videos, pictures, text information, and the like.
在一些实施例中,确定推荐信息集中每个推荐信息的特征信息包括:In some embodiments, determining the feature information of each recommendation information in the recommendation information set includes:
1)获取每个推荐信息的主题标签;1) Get the hashtag of each recommendation;
2)基于每个推荐信息的主题标签生成每个推荐信息的特征信息;2) generating feature information of each recommendation information based on the subject tag of each recommendation information;
在实际应用中,推荐信息的主题标签可以为能够表征推荐信息主题内容的文本信息。在一些实施例中,一些平台中在用户进行信息发布时,往往会让用户结合预设符号填写信息的主题标签,例如在信息发布编辑页面和后续的发布的信息中,可以通过将主题标签放在预设符号间的方式来区分主题标签,相应的,可以通过匹配预设符号的方式,从信息的 相关文本信息中获取主题标签。在一些实施例中,预设符号可以为“##”,当然在实际应用中还可以通过其他符号或方式来区分主题标签。In practical applications, the subject tag of the recommended information may be text information that can represent the subject content of the recommended information. In some embodiments, when a user publishes information in some platforms, the user is often asked to fill in the subject tag of the information in combination with the preset symbols. The hashtags are distinguished by means of preset symbols, and correspondingly, the hashtags can be obtained from the relevant text information of the information by matching the preset symbols. In some embodiments, the preset symbol may be "##", of course, other symbols or manners may also be used to distinguish hashtags in practical applications.
在一些实施例中,也可以通过预先训练好的主题标签识别网络来提取推荐信息的主题标签。在一些实施例中,该主题标签是被网络可以为基于大量样本推荐信息和样本推荐信息对应的主题标签对预设神经网络进行主题标签识别训练得到的。In some embodiments, a pre-trained hashtag recognition network can also be used to extract the hashtags of the recommended information. In some embodiments, the hashtag is obtained by the network performing hashtag recognition training on a preset neural network based on a large amount of sample recommendation information and hashtags corresponding to the sample recommendation information.
进一步的,基于每个推荐信息的主题标签生成该推荐信息的特征信息可以包括获取该主题标签的词向量,相应的,可以将主题标签的词向量作为该推荐信息的特征信息。Further, generating the feature information of the recommendation information based on the topic tag of each recommendation information may include acquiring the word vector of the topic tag, and correspondingly, the word vector of the topic tag may be used as the feature information of the recommendation information.
在一些实施例中,可以预先基于预设训练文本信息对预设词向量模型进行训练得到的词向量表征模型,在一些实施例中,预设训练文本信息可以为推荐系统中的文本信息。In some embodiments, a word vector representation model obtained by pre-training a preset word vector model based on preset training text information may be pre-trained. In some embodiments, the preset training text information may be text information in a recommendation system.
在一些实施例中,在进行词向量表征模型训练过程中,可以将预设训练文本信息进行分词处理,将分词处理后的分次信息输入预设词向量模型进行训练,在训练过程中可以将每个词语映射成K维实数向量,得到词向量表征模型的同时可以得到表征词语之间的语义相似度的词向量集合。以某一系统中的预设训练文本信息对预设词向量模型进行训练,得到的词向量表征模型,可以有效表征该系统中词语之间的语义相似度。In some embodiments, during the training of the word vector representation model, the preset training text information may be subjected to word segmentation processing, and the classification information after word segmentation processing may be input into the preset word vector model for training. Each word is mapped into a K-dimensional real number vector, and a word vector set representing the semantic similarity between words can be obtained while the word vector representation model is obtained. The preset word vector model is trained with the preset training text information in a certain system, and the obtained word vector representation model can effectively represent the semantic similarity between words in the system.
在一些实施例中,在词向量表征模型被训练好的情况下,可以将主题标签进行分词后输入该词向量表征模型,该词向量表征模型可以基于词向量集合中的词向量确定上述主题标签对应的分词信息的词向量。进一步的,在主题标签对应的分词信息有多个词的情况下,可以取这多个词的词向量的均值,作为主题标签的词向量;在主题标签对应的分词信息有一个词的情况下,可以将这个词的词向量作为主题标签的词向量。In some embodiments, when the word vector representation model has been trained, the hashtag can be segmented and input into the word vector representation model, and the word vector representation model can determine the above-mentioned hashtag based on the word vector in the word vector set The word vector of the corresponding word segmentation information. Further, in the case that the word segmentation information corresponding to the topic tag has multiple words, the mean value of the word vectors of the multiple words can be taken as the word vector of the topic tag; in the case that the word segmentation information corresponding to the topic tag has one word , the word vector of this word can be used as the word vector of the hashtag.
本说明书实施例中,预设词向量模型可以包括但不限于word2vec、BERT、glove等模型。In the embodiment of this specification, the preset word vector model may include, but is not limited to, word2vec, BERT, glove and other models.
在一些实施例中,上述确定推荐信息集中每个推荐信息的特征信息可以包括:In some embodiments, determining the feature information of each recommendation information in the recommendation information set above may include:
1)提取每个推荐信息的文本信息;1) Extract the text information of each recommendation information;
2)基于每个推荐信息的文本信息生成每个推荐信息的特征信息。2) Generate feature information of each recommendation information based on the text information of each recommendation information.
在一些实施例中,在推荐信息本身为文本信息的情况下,可以将推荐信息作为文本信息。在一些实施例中,在推荐信息为视频的情况下,推荐信息的文本信息可以包括视频中语音信息对应的文本信息、视频封面的文本信息、视频的标题信息、视频中提取的实体文本(实体可以为人物、物体等)以及视频的搜索信息(即在推荐系统中可以基于该搜索信息召回该视频)等。In some embodiments, when the recommendation information itself is text information, the recommendation information may be used as text information. In some embodiments, when the recommended information is a video, the text information of the recommended information may include text information corresponding to the voice information in the video, text information of the video cover, title information of the video, and entity text (entity text) extracted from the video. It can be a person, an object, etc.) and the search information of the video (that is, the video can be recalled based on the search information in the recommender system), etc.
在一些实施例中,基于每个推荐信息的文本信息生成每个推荐信息的特征信息可以为生成推荐信息的文本信息对应的词向量,相应的,将该对应的词向量推荐信息的特征信息。生成推荐信息的文本信息对应的词向量的具体细化可以参见上述相关步骤,在此不再赘述。In some embodiments, generating the feature information of each recommendation information based on the text information of each recommendation information may be a word vector corresponding to the text information for generating the recommendation information, and correspondingly, the corresponding word vector recommends the feature information of the information. For the specific refinement of the word vector corresponding to the text information of the generated recommendation information, reference may be made to the above-mentioned relevant steps, which will not be repeated here.
在一些实施例中,也可以结合one-hot(独热)编码网络、N-Gram(汉语语言模型)等特征表征网络来生成推荐信息的文本信息对应的特征向量,相应的,可以将该特征向量 作为推荐信息的特征信息。In some embodiments, feature representation networks such as one-hot (one-hot) coding network and N-Gram (Chinese language model) can also be combined to generate a feature vector corresponding to the text information of the recommendation information. Correspondingly, the feature vector can be generated. The vector is used as the feature information of recommendation information.
上述实施例中,通过推荐信息的主题标签或文本信息来生成推荐信息的特征信息,可以实现对推荐信息的有效表征,进而保证后续召回与推荐信息相关的行为数据。In the above-mentioned embodiment, the feature information of the recommendation information is generated by the subject tag or text information of the recommendation information, which can realize the effective representation of the recommendation information, thereby ensuring the subsequent recall of behavior data related to the recommendation information.
在S205中,基于每个推荐信息的特征信息从长期行为数据集中,确定每个推荐信息对应的目标行为数据。In S205, target behavior data corresponding to each recommendation information is determined from the long-term behavior data set based on the feature information of each recommendation information.
在一些实施例中,如图3所示,上述基于每个推荐信息的特征信息从长期行为数据集中,确定每个推荐信息对应的目标行为数据可以包括以下步骤:In some embodiments, as shown in FIG. 3 , determining the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information may include the following steps:
在S2051中,获取长期行为数据集中每个历史行为数据的行为特征信息;In S2051, obtain behavior feature information of each historical behavior data in the long-term behavior data set;
在S2053中,计算每个推荐信息的特征信息与行为特征信息间的第一相似度;In S2053, calculate the first similarity between the feature information of each recommendation information and the behavior feature information;
在S2055中,根据第一相似度,从长期行为数据集中确定每个推荐信息对应的目标行为数据。In S2055, according to the first similarity, target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
在一些实施例中,历史行为数据的行为特征信息为可以结合one-hot编码网络、N-Gram(汉语语言模型)等特征表征网络生成的历史行为数据对应的特征向量。In some embodiments, the behavior feature information of the historical behavior data is a feature vector corresponding to the historical behavior data generated by a feature representation network such as a one-hot coding network, an N-Gram (Chinese language model), and the like.
在一些实施例中,行为特征信息与推荐信息的特征信息间的第一相似度可以表征历史行为数据与推荐信息间的关联程度,在一些实施例中,行为特征信息与推荐信息的特征信息间的第一相似度越高,对应的历史行为数据与推荐信息间的关联程度越高;反之,行为特征信息与推荐信息的特征信息间的第一相似度越低,对应的历史行为数据与推荐信息间的关联程度越低。In some embodiments, the first similarity between the behavior feature information and the feature information of the recommendation information may represent the degree of association between the historical behavior data and the recommendation information. In some embodiments, the relationship between the behavior feature information and the feature information of the recommendation information The higher the first similarity, the higher the degree of correlation between the corresponding historical behavior data and the recommendation information; conversely, the lower the first similarity between the behavior feature information and the feature information of the recommendation information, the lower the corresponding historical behavior data and the recommendation information. The lower the degree of correlation between the information.
在一些实施例中,行为特征信息与推荐信息的特征信息间的第一相似度可以包括但不限于行为特征信息与推荐信息的特征信息间的余弦距离、欧式距离、曼哈顿距离。In some embodiments, the first similarity between the behavior feature information and the feature information of the recommendation information may include, but is not limited to, cosine distance, Euclidean distance, and Manhattan distance between the behavior feature information and the feature information of the recommendation information.
在一些实施例中,可以将与推荐信息的特征信息的第一相似度大于等于预设阈值的行为特征信息所对应的历史行为数据作为该推荐信息对应的目标行为数据。In some embodiments, historical behavior data corresponding to behavior feature information whose first degree of similarity to the feature information of the recommendation information is greater than or equal to a preset threshold may be used as the target behavior data corresponding to the recommendation information.
在一些实施例中,可以将长期行为数据集中每个历史行为数据的行为特征信息与推荐信息的特征信息间的第一相似度进行降序排序,相应的,可以将排序在前预设数量个行为特征信息所对应的历史行为数据作为该推荐信息对应的目标行为数据。In some embodiments, the first similarity between the behavior feature information of each historical behavior data in the long-term behavior data set and the feature information of the recommendation information can be sorted in descending order, and correspondingly, a preset number of behaviors can be sorted first. The historical behavior data corresponding to the feature information is used as the target behavior data corresponding to the recommendation information.
本说明书实施例中,预设阈值和预设数量可以预先结合实际应用对信息推荐精准性需求进行设置(推荐信息与历史行为数据间的关联程度越高,信息推荐的精准性越好)。In the embodiment of this specification, the preset threshold and the preset number can be set in advance in combination with practical applications to set the information recommendation accuracy requirements (the higher the degree of association between the recommended information and the historical behavior data, the better the information recommendation accuracy).
上述实施例中,通过推荐信息的特征信息与历史行为数据的行为特征信息间的相似度,可以从目标对象的长期行为数据集中为每个推荐信息筛选出关联程度较高的历史行为数据,来作为推荐信息对应的目标行为数据,保证了用于进行兴趣识别的行为数据可以全面反映对象的兴趣喜好;同时,结合不同推荐信息的特征信息有针对性的进行历史行为数据的筛选,使得目标行为数据可以有效反映目标对象对推荐信息的兴趣偏好情况且大大降低兴趣识别过程中的数据量,有效提高兴趣识别效率和识别精准性。In the above-mentioned embodiment, through the similarity between the feature information of the recommendation information and the behavior feature information of the historical behavior data, the historical behavior data with a higher degree of correlation can be selected from the long-term behavior data set of the target object for each recommendation information. As the target behavior data corresponding to the recommendation information, it ensures that the behavior data used for interest identification can fully reflect the interests and preferences of the object; at the same time, combined with the feature information of different recommendation information, the historical behavior data is screened in a targeted manner, so that the target behavior The data can effectively reflect the interest and preference of the target object to the recommended information, greatly reduce the amount of data in the process of interest identification, and effectively improve the efficiency and accuracy of interest identification.
在S207中,将目标行为数据输入兴趣识别网络进行兴趣识别,得到目标对象对每个推荐信息的兴趣指标。In S207, the target behavior data is input into the interest recognition network for interest recognition, and the target object's interest index for each recommendation information is obtained.
在一些实施例中,兴趣指标可以表征目标对象对推荐信息的喜好情况。在一些实施例中,可以预先训练兴趣识别网络,相应的,上述方法还可以包括预先训练兴趣识别网络的步骤。如图4所示,预先训练兴趣识别网络可以包括以下步骤:In some embodiments, the interest index can represent the preference of the target object to the recommended information. In some embodiments, the interest recognition network may be pre-trained, and accordingly, the above method may further include the step of pre-training the interest recognition network. As shown in Figure 4, pre-training an interest recognition network can include the following steps:
在S401中,获取多个样本推荐信息的信息标识和样本特征信息。In S401, information identifiers and sample feature information of multiple sample recommendation information are acquired.
本说明书实施例中,多个样本推荐信息可以为推荐系统中推荐过的信息。在一些实施例中,日志数据库中存储的推荐日志信息中可以包括多个历史推荐事件对应的样本推荐信息、多个历史推荐事件对应的样本推荐信息的信息标识和推荐时间等历史推荐事件的基本信息。相应的,本说明书实施例中,可以从推荐日志信息中获取多个样本推荐信息的信息标识。In the embodiment of this specification, the multiple sample recommendation information may be information recommended in the recommendation system. In some embodiments, the recommendation log information stored in the log database may include sample recommendation information corresponding to multiple historical recommendation events, information identifiers of the sample recommendation information corresponding to multiple historical recommendation events, and recommendation time and other basic information about historical recommendation events. information. Correspondingly, in the embodiment of this specification, information identifiers of multiple sample recommendation information may be obtained from recommendation log information.
在一些实施例中,上述多个样本推荐信息的样本特征信息可以采用下述方式获取:In some embodiments, the sample feature information of the above-mentioned multiple sample recommendation information may be obtained in the following manner:
获取每个样本推荐信息的主题标签;Get the hashtag of each sample recommendation information;
基于每个样本推荐信息的主题标签生成每个样本推荐信息的样本特征信息;Generate sample feature information of each sample recommendation information based on the subject tag of each sample recommendation information;
或,or,
提取每个样本推荐信息的文本信息;Extract the text information of each sample recommendation information;
基于每个样本推荐信息的文本信息生成每个样本推荐信息的样本特征信息。The sample feature information of each sample recommendation information is generated based on the text information of each sample recommendation information.
本说明书实施例中,获取样本推荐信息的样本特征信息的相关步骤的具体细化可以参见上述确定推荐信息集中每个推荐信息的特征信息的相关步骤的具体细化,在此不再赘述。In the embodiment of this specification, for the specific refinement of the steps related to obtaining the sample feature information of the sample recommendation information, reference may be made to the specific refinement of the related steps for determining the feature information of each recommendation information in the recommendation information set above, which will not be repeated here.
上述实施例中,通过样本推荐信息的主题标签或文本信息来生成样本推荐信息的样本特征信息,可以实现对样本推荐信息的有效表征,进而保证后续召回与样本推荐信息相关的行为数据。In the above embodiment, the sample feature information of the sample recommendation information is generated by the subject tag or text information of the sample recommendation information, which can realize the effective representation of the sample recommendation information, thereby ensuring the subsequent recall of behavior data related to the sample recommendation information.
在S403中,基于每个样本推荐信息的信息标识确定每个样本推荐信息对应的历史推荐对象和历史推荐对象对应的兴趣标注指标。In S403, a historical recommendation object corresponding to each sample recommendation information and an interest labeling index corresponding to the historical recommendation object are determined based on the information identifier of each sample recommendation information.
在实际应用中,历史行为数据库中存储的大量历史行为数据可以反映推荐信息被推荐后相应的对象的行为,相应的,每一历史行为数据中可以包括推荐信息(样本推荐信息)的信息标识和对象对该推荐信息的行为标签。In practical applications, a large amount of historical behavior data stored in the historical behavior database can reflect the behavior of the corresponding object after the recommendation information is recommended. Correspondingly, each historical behavior data can include the information identifier of the recommendation information (sample recommendation information) and The object's behavior label for this recommendation.
本说明书实施例中,可以将包括样本推荐信息的信息标识的历史行为数据对对应的对象作为该样本推荐信息对应的历史推荐对象。相应的,可以结合历史行为数据中对象对该样本荐信息的行为标签生成兴趣标注指标。In the embodiment of this specification, the object corresponding to the historical behavior data pair including the information identifier of the sample recommendation information may be used as the historical recommendation object corresponding to the sample recommendation information. Correspondingly, the interest labeling index can be generated in combination with the behavior label of the object's recommendation information about the sample in the historical behavior data.
在实际应用中,不同业务场景下,表征对象对一个推荐信息的喜好情况的行为可以不同,且同一业务场景下,也可以由多种表征对象对一个推荐信息的喜好情况的行为。相应的,本说明书实施例中,兴趣标注指标可以结合实际应用场景包括一种或多种行为所反映的对象对推荐信息的喜好情况。在一些实施例中,以通过点击行为来反映对象对推荐信息的喜好情况为例,响应于历史行为数据中的行为标签为点击,在一些实施例中,兴趣标注指标可以为1(通过1表示点击,0表示未点击)。In practical applications, in different business scenarios, behaviors that characterize an object's preference for a recommendation information may be different, and in the same business scenario, there may also be multiple behaviors that characterize an object's preference for a recommendation information. Correspondingly, in the embodiment of the present specification, the interest labeling index may include, in combination with the actual application scenario, the preference of the object on the recommendation information reflected by one or more behaviors. In some embodiments, taking the click behavior to reflect the object's preference for the recommended information as an example, in response to the behavior label in the historical behavior data being a click, in some embodiments, the interest labeling indicator may be 1 (represented by 1). clicked, 0 means not clicked).
在S405中,获取历史推荐对象的长期样本行为数据集。In S405, a long-term sample behavior data set of the historical recommendation object is acquired.
在一些实施例中,在确定样本推荐信息对应的历史推荐对象的情况下,可以获取该历史推荐对象的对象标识。在一些实施例中,可以结合该对象标识从历史行为数据库中,获取对应的历史行为数据,以作为该历史推荐对象的长期样本行为数据集。In some embodiments, in the case of determining the historical recommendation object corresponding to the sample recommendation information, the object identifier of the historical recommendation object may be obtained. In some embodiments, corresponding historical behavior data may be obtained from a historical behavior database in combination with the object identifier, as a long-term sample behavior data set of the historical recommendation object.
在S407中,基于每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据。In S407, the sample behavior data corresponding to each sample recommendation information is determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information.
在一些实施例中,基于每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据可以包括:In some embodiments, determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information may include:
获取长期样本行为数据集中每个样本行为数据的样本行为特征信息;Obtain the sample behavior feature information of each sample behavior data in the long-term sample behavior data set;
计算样本行为特征信息与每个样本推荐信息的样本特征信息间的第二相似度;calculating the second similarity between the sample behavior feature information and the sample feature information of each sample recommendation information;
根据第二相似度,从长期样本行为数据集中确定每个样本推荐信息对应的样本行为数据。According to the second similarity, the sample behavior data corresponding to each sample recommendation information is determined from the long-term sample behavior data set.
本说明书实施例中,基于每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据的相关步骤的具体细化,可以参见上述基于每个推荐信息的特征信息从长期行为数据集中,确定每个推荐信息对应的目标行为数据的相关步骤的具体细化,在此不再赘述。In the embodiment of this specification, for the specific refinement of the relevant steps for determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information, please refer to the above-mentioned based on each sample behavior data. The feature information of the recommendation information is determined from the long-term behavior data set to determine the specific refinement of the relevant steps of the target behavior data corresponding to each recommendation information, which will not be repeated here.
上述实施例中,通过样本推荐信息的样本特征信息与对应历史推荐对象的历史行为数据的行为特征信息间的相似度,可以从历史推荐对象的长期样本行为数据集中为每个样本推荐信息筛选出关联程度较高的历史行为数据,来作为样本推荐信息对应的样本行为数据,保证了用于进行兴趣识别的样本行为数据可以全面反映对象兴趣喜好;同时,结合不同样本推荐信息有针对性的进行历史行为数据的筛选,使得样本行为数据可以有效反映历史推荐对象对样本推荐信息的兴趣偏好情况,且大大降低兴趣识别过程中的数据量,进而可以提高兴趣识别效率和识别精准性。In the above-mentioned embodiment, through the similarity between the sample feature information of the sample recommendation information and the behavior feature information of the historical behavior data corresponding to the historical recommendation object, each sample recommendation information can be selected from the long-term sample behavior data set of the historical recommendation object. The historical behavior data with a high degree of correlation is used as the sample behavior data corresponding to the sample recommendation information, which ensures that the sample behavior data used for interest identification can fully reflect the interests and preferences of the object; at the same time, combined with different sample recommendation information, targeted The screening of historical behavior data enables the sample behavior data to effectively reflect the interest and preference of historical recommendation objects to sample recommendation information, and greatly reduces the amount of data in the process of interest recognition, thereby improving the efficiency and accuracy of interest recognition.
在S409中,将样本行为数据输入神经网络进行兴趣识别,以得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标。In S409, the sample behavior data is input into the neural network for interest identification, so as to obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information.
在一些实施例中,神经网络可以为要被训练的兴趣识别网络。在一些实施例中,神经网络可以包括但不限于卷积神经网络,递归神经网络等。在一些实施例中,兴趣预测指标可以表征被训练的神经网络预测的每个历史推荐对象对对应的样本推荐信息的喜好情况。在一些实施例中,兴趣预测指标可以为大于等于0小于等于1的数值,相应的,兴趣预测指标的数值越大,表征被训练的神经网络所预测的历史推荐对象对对应的样本推荐信息的越喜欢。In some embodiments, the neural network may identify the network for the interest to be trained. In some embodiments, neural networks may include, but are not limited to, convolutional neural networks, recurrent neural networks, and the like. In some embodiments, the interest prediction index may represent the preference of each historical recommendation object predicted by the trained neural network to the corresponding sample recommendation information. In some embodiments, the interest predictor may be a value greater than or equal to 0 and less than or equal to 1. Correspondingly, the larger the value of the interest predictor is, the greater the value of the interest predictor, represents the difference between the historical recommendation objects predicted by the trained neural network and the corresponding sample recommendation information. more like.
在S411中,根据兴趣预测指标和兴趣标注指标,确定目标损失。In S411, the target loss is determined according to the interest prediction index and the interest labeling index.
在一些实施例中,根据兴趣预测指标和兴趣标注指标,确定目标损失可以包括基于预设损失函数计算每个历史推荐对象对应的兴趣预测指标和对应的兴趣标注指标间的损失,并对多个样本推荐信息对应的历史推荐对象对应的损失进行求和,得到上述目标损失。In some embodiments, according to the interest prediction index and the interest labeling index, determining the target loss may include calculating the loss between the interest prediction index corresponding to each historical recommendation object and the corresponding interest labeling index based on a preset loss function, and analyzing multiple The losses corresponding to the historical recommendation objects corresponding to the sample recommendation information are summed to obtain the above target loss.
本说明书实施例中,预设损失函数可以包括但不限于交叉熵损失函数、逻辑损失函数、Hinge(铰链)损失函数、指数损失函数等,本说明书实施例并不以上述为限。In the embodiment of this specification, the preset loss function may include, but is not limited to, a cross-entropy loss function, a logistic loss function, a Hinge (hinge) loss function, an exponential loss function, and the like, and the embodiment of this specification is not limited to the above.
在S413中,基于目标损失训练神经网络,得到兴趣识别网络。In S413, a neural network is trained based on the target loss to obtain an interest recognition network.
在一些实施例中,基于目标损失训练神经网络,得到兴趣识别网络可以包括In some embodiments, training a neural network based on the target loss to obtain an interest recognition network may include
在目标损失不满足预设条件的情况下,更新神经网络中的网络参数;When the target loss does not meet the preset conditions, update the network parameters in the neural network;
基于更新后的神经网络来更新目标损失,至目标损失满足预设条件,将当前的神经网络作为上述兴趣识别网络。The target loss is updated based on the updated neural network, until the target loss meets the preset condition, and the current neural network is used as the above-mentioned interest recognition network.
在一些实施例中,目标损失满足预设条件可以为目标损失小于等于指定阈值,或前后两次训练过程中对应的目标损失间的差值小于一定阈值。本说明书实施例中,指定阈值和一定阈值可以为结合实际训练需求进行设置。In some embodiments, the target loss satisfying the preset condition may be that the target loss is less than or equal to a specified threshold, or the difference between the corresponding target losses in the two training processes before and after is less than a certain threshold. In the embodiment of this specification, the specified threshold and a certain threshold may be set in combination with actual training requirements.
此外,需要说明书的是,在实际应用中,响应于神经网络为要被训练的多任务神经网络,相应的,兴趣标注信息可以为多个任务对应的任务标注信息,兴趣预测指标可以为多个任务对应的任务预测信息。In addition, it should be noted that, in practical applications, in response to the neural network being a multi-task neural network to be trained, correspondingly, the interest annotation information may be task annotation information corresponding to multiple tasks, and the interest prediction indicators may be multiple Task prediction information corresponding to the task.
上述实施例中,在兴趣识别网络训练过程中,获取结合样本推荐信息的标识信息确定历史推荐对象及该历史推荐对象对应的兴趣标注指标,并获取历史推荐对象的长期样本行为数据集,保证了训练过程中行为数据的全面性,能够有效反映对象的兴趣喜好,同时结合不同样本推荐信息有针对性的进行历史行为数据的筛选,使得用于进行兴趣识别的样本行为数据可以有效反映历史推荐对象对样本推荐信息的真实兴趣偏好情况,且可以大大降低兴趣识别过程中的数据量,进而有效提高训练出的兴趣识别网络的兴趣识别效率和识别精准性。In the above embodiment, in the training process of the interest recognition network, the identification information of the sample recommendation information is obtained to determine the historical recommendation object and the interest labeling index corresponding to the historical recommendation object, and the long-term sample behavior data set of the historical recommendation object is obtained, which ensures the The comprehensiveness of the behavior data in the training process can effectively reflect the interests and preferences of the objects, and at the same time, combined with different sample recommendation information, the historical behavior data is screened in a targeted manner, so that the sample behavior data used for interest identification can effectively reflect the historical recommendation objects. The real interest preference of sample recommendation information can greatly reduce the amount of data in the interest recognition process, thereby effectively improving the interest recognition efficiency and recognition accuracy of the trained interest recognition network.
在一些实施例中,在基于每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据之前,如图5所示,上述方法还包括:In some embodiments, before determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information, as shown in FIG. 5 , the above method further includes:
在S415中,确定每个样本推荐信息的推荐时间;In S415, the recommendation time of each sample recommendation information is determined;
在S417中,基于推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;In S417, data filtering is performed on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtering behavior data set;
相应的,基于每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据可以包括:Correspondingly, determining the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information may include:
基于每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定每个样本推荐信息对应的样本行为数据。The sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
在一些实施例中,样本推荐信息的推荐时间可以从对应的推荐日志信息中获取。In some embodiments, the recommendation time of the sample recommendation information may be obtained from the corresponding recommendation log information.
在一些实施例中,基于推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集可以包括从每个样本推荐信息对应的历史推荐对象的长期样本行为数据集中,过滤掉行为时间晚于该每个样本推荐信息的样本行为数据,得到上述过滤行为数据集。In some embodiments, performing data filtering on the corresponding long-term sample behavior data set based on the recommendation time, and obtaining the filtered behavior data set may include filtering out the behavior time from the long-term sample behavior data set of the historical recommendation object corresponding to each sample recommendation information After the sample behavior data of the recommendation information of each sample, the above filtering behavior data set is obtained.
上述实施例中,结合样本推荐信息的推荐时间对对应的长期样本行为数据集进行数据过滤,可以有效防止数据穿越,实现对推荐时间之后产生的历史行为数据的过滤,保证了 信息用于进行兴趣识别的行为数据的有效性。In the above embodiment, data filtering is performed on the corresponding long-term sample behavior data set in combination with the recommendation time of the sample recommendation information, which can effectively prevent data traversal, realize the filtering of historical behavior data generated after the recommendation time, and ensure that the information is used for interest purposes. Validity of identified behavioral data.
在S209中,基于兴趣指标将推荐信息集中的目标信息推荐给目标对象。In S209, the target information in the recommendation information set is recommended to the target object based on the interest index.
在一些实施例中,基于兴趣指标将推荐信息集中的目标信息推荐给目标对象可以包括:根据兴趣指标从推荐信息集中确定目标信息;将目标信息推荐给目标对象。In some embodiments, recommending the target information in the recommendation information set to the target object based on the interest index may include: determining the target information from the recommendation information set according to the interest index; recommending the target information to the target object.
在一些实施例中,兴趣指标表征目标对象对推荐信息集中每个推荐信息的喜好情况。在一些实施例中,该兴趣指标可以为与喜好程度成正比的数值,也可以为表征目标对象对推荐信息集中每个推荐信息的喜好程度的字符化表征,例如“中”。在一些实施例中,可以结合一定的规则,将字符号表征量化为相应的数值。In some embodiments, the interest index represents the preference of the target object for each recommendation information in the recommendation information set. In some embodiments, the interest index may be a numerical value proportional to the degree of preference, or may be a characterized representation representing the degree of preference of the target object for each recommendation information in the recommendation information set, such as "medium". In some embodiments, the character symbol representation can be quantified into corresponding numerical values in combination with certain rules.
在一些实施例中,可以预先结合信息推荐精准性需求设置一个置信度阈值(一般的置信度阈值越高,推荐的信息越精准),相应的,兴趣指标对应的数值大于等于该置信度阈值的推荐信息可以作为目标信息。In some embodiments, a confidence threshold may be set in advance in combination with the information recommendation accuracy requirement (generally, the higher the confidence threshold, the more accurate the recommended information). Correspondingly, the value corresponding to the interest index is greater than or equal to the confidence threshold. Recommendation information can be used as target information.
由以上本说明书实施例提供的技术方案可见,本说明书实施例中,在信息推荐过程中,获取目标对象的长期行为数据集,保证了行为数据的全面性,能够有效反映对象的兴趣喜好,同时结合不同推荐信息的特征信息有针对性的进行历史行为数据的筛选,使得用于进行兴趣识别的目标行为数据可以有效反映目标对象对推荐信息的真实兴趣偏好情况,且可以大大降低兴趣识别过程中的数据量,有效提高兴趣识别效率和识别精准性,进而大大提升推荐系统中信息推荐精准性和推荐性能。It can be seen from the technical solutions provided by the above embodiments of this specification that in the embodiments of this specification, in the process of information recommendation, the long-term behavior data set of the target object is obtained, which ensures the comprehensiveness of the behavior data, and can effectively reflect the interests and preferences of the object. Combined with the feature information of different recommendation information, the historical behavior data is screened in a targeted manner, so that the target behavior data used for interest identification can effectively reflect the target object's real interest preference for the recommendation information, and can greatly reduce the interest identification process. It can effectively improve the efficiency and accuracy of interest recognition, and then greatly improve the accuracy and performance of information recommendation in the recommendation system.
图6是根据一示例性实施例示出的一种信息推荐装置框图。参照图6,该装置包括:Fig. 6 is a block diagram of an information recommendation apparatus according to an exemplary embodiment. Referring to Figure 6, the device includes:
第一行为数据集获取模块610,被配置为响应于目标对象的信息获取请求,获取目标对象的长期行为数据集;The first behavior data set obtaining module 610 is configured to obtain the long-term behavior data set of the target object in response to the information obtaining request of the target object;
特征信息确定模块620,被配置为确定推荐信息集中每个推荐信息的特征信息;A feature information determination module 620, configured to determine feature information of each recommendation information in the recommendation information set;
目标行为数据确定模块630,被配置为基于每个推荐信息的特征信息从长期行为数据集中,确定每个推荐信息对应的目标行为数据;The target behavior data determination module 630 is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
第一兴趣识别模块640,被配置为将目标行为数据输入兴趣识别网络进行兴趣识别,得到目标对象对每个推荐信息的兴趣指标;The first interest identification module 640 is configured to input the target behavior data into the interest identification network for interest identification, and obtain the interest index of the target object for each recommendation information;
信息推荐模块650,被配置为基于兴趣指标将推荐信息集中的目标信息推荐给目标对象。The information recommendation module 650 is configured to recommend the target information in the recommended information set to the target object based on the interest index.
在一些实施例中目标行为数据确定模块630包括:In some embodiments target behavior data determination module 630 includes:
行为特征信息获取单元,被配置为获取长期行为数据集中每个历史行为数据的行为特征信息;a behavior feature information acquisition unit, configured to acquire behavior feature information of each historical behavior data in the long-term behavior data set;
第一相似度计算单元,被配置为计算每个推荐信息的特征信息与行为特征信息间的第一相似度;a first similarity calculation unit, configured to calculate a first similarity between the feature information of each recommendation information and the behavior feature information;
目标行为数据确定单元,被配置为根据第一相似度,从长期行为数据集中确定每个推荐信息对应的目标行为数据。The target behavior data determining unit is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set according to the first similarity.
在一些实施例中,第一行为数据集获取模块610包括:In some embodiments, the first behavior data set acquisition module 610 includes:
信息获取请求接收单元,被配置为接收目标对象的信息获取请求,信息获取请求包括目标对象的目标对象标识;an information acquisition request receiving unit, configured to receive an information acquisition request of a target object, where the information acquisition request includes a target object identifier of the target object;
长期行为数据集获取单元,被配置为基于目标对象标识获取目标对象的长期行为数据集。The long-term behavior data set acquisition unit is configured to acquire the long-term behavior data set of the target object based on the target object identifier.
在一些实施例中,长期行为数据集获取单元包括:In some embodiments, the long-term behavioral data set acquisition unit includes:
历史行为数据库获取单元,被配置为获取预先构建的历史行为数据库,历史行为数据库包括多个对象的历史行为数据,以及多个对象的对象标识与对应的历史行为数据间的映射关系;The historical behavior database acquisition unit is configured to acquire a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and a mapping relationship between object identifiers of multiple objects and corresponding historical behavior data;
历史行为数据获取单元,被配置为根据映射关系从历史行为数据库中,获取目标对象标识对应的历史行为数据;The historical behavior data acquisition unit is configured to acquire historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
长期行为数据集确定单元,被配置为将对应的历史行为数据作为目标对象的长期行为数据集。The long-term behavior data set determining unit is configured to take the corresponding historical behavior data as the long-term behavior data set of the target object.
在一些实施例中,特征信息确定模块620包括:In some embodiments, characteristic information determination module 620 includes:
第一主题标签获取单元,被配置为获取每个推荐信息的主题标签;a first topic tag acquiring unit, configured to acquire a topic tag of each recommended information;
第一特征信息生成单元,被配置为基于每个推荐信息的主题标签生成每个推荐信息的特征信息;a first feature information generating unit, configured to generate feature information of each recommendation information based on the subject tag of each recommendation information;
或,or,
第一文本信息提取单元,被配置为提取每个推荐信息的文本信息;a first text information extraction unit, configured to extract the text information of each recommendation information;
第二特征信息生成单元,被配置为基于每个推荐信息的文本信息生成每个推荐信息的特征信息。The second feature information generating unit is configured to generate feature information of each recommendation information based on the text information of each recommendation information.
在一些实施例中,信息推荐模块650包括:In some embodiments, the information recommendation module 650 includes:
目标信息确定单元,被配置为根据兴趣指标从推荐信息集中确定目标信息;a target information determination unit, configured to determine target information from the recommended information set according to the interest index;
信息推单元,被配置为将目标信息推荐给目标对象。The information pushing unit is configured to recommend target information to the target object.
在一些实施例中,上述装置还包括:In some embodiments, the above-mentioned apparatus further comprises:
信息获取模块,被配置为获取多个样本推荐信息的信息标识和样本特征信息;an information acquisition module, configured to acquire information identifiers and sample feature information of multiple sample recommendation information;
对象信息确定模块,被配置为基于每个样本推荐信息的信息标识确定每个样本推荐信息对应的历史推荐对象和历史推荐对象对应的兴趣标注指标;The object information determination module is configured to determine the historical recommendation object corresponding to each sample recommendation information and the interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
第二行为数据集获取模块,被配置为获取历史推荐对象的长期样本行为数据集;The second behavior data set acquisition module is configured to acquire long-term sample behavior data sets of historical recommendation objects;
样本行为数确定模块,被配置为基于每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定每个样本推荐信息对应的样本行为数据;The sample behavior number determination module is configured to determine the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information;
第二兴趣识别模块,被配置为将样本行为数据输入神经网络进行兴趣识别,得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;The second interest identification module is configured to input the sample behavior data into the neural network for interest identification, and obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
目标损失确定模块,被配置为根据兴趣预测指标和兴趣标注指标,确定目标损失;The target loss determination module is configured to determine the target loss according to the interest prediction index and the interest labeling index;
网络训练模块,被配置为基于目标损失训练神经网络,得到兴趣识别网络。The network training module is configured to train the neural network based on the target loss to obtain the interest recognition network.
在一些实施例中,上述装置还包括:In some embodiments, the above-mentioned apparatus further comprises:
推荐时间确定模块,被配置为确定每个样本推荐信息的推荐时间;a recommendation time determination module, configured to determine the recommendation time of each sample recommendation information;
数据过滤模块,被配置为基于推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;The data filtering module is configured to perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain the filtering behavior data set;
样本行为数据确定模块还被配置为基于每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定每个样本推荐信息对应的样本行为数据。The sample behavior data determination module is further configured to determine sample behavior data corresponding to each sample recommendation information from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the apparatus in the above-mentioned embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment of the method, and will not be described in detail here.
图7是根据一示例性实施例示出的一种用于信息推荐的电子设备的框图,该电子设备可以是服务器,其内部结构图可以如图7所示。该电子设备包括通过系统总线连接的处理器、存储器和网络接口。其中,该电子设备的处理器用于提供计算和控制能力。该电子设备的存储器包括计算机可读存储介质、内存储器。该计算机可读存储介质存储有操作系统和计算机程序。该内存储器为计算机可读存储介质中的操作系统和计算机程序的运行提供环境。该电子设备的网络接口用于与外部的终端通过网络连接通信。该计算机程序被处理器执行时以实现一种信息推荐方法。FIG. 7 is a block diagram of an electronic device for information recommendation according to an exemplary embodiment. The electronic device may be a server, and its internal structure diagram may be as shown in FIG. 7 . The electronic device includes a processor, memory, and a network interface connected by a system bus. Among them, the processor of the electronic device is used to provide computing and control capabilities. The memory of the electronic device includes a computer-readable storage medium and an internal memory. The computer-readable storage medium stores an operating system and a computer program. The internal memory provides an environment for the execution of the operating system and computer programs in the computer-readable storage medium. The network interface of the electronic device is used to communicate with an external terminal through a network connection. The computer program, when executed by the processor, implements an information recommendation method.
本领域技术人员可以理解,图7中示出的结构,仅仅是与本公开方案相关的部分结构的框图,并不构成对本公开方案所应用于其上的电子设备的限定,具体的电子设备可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。Those skilled in the art can understand that the structure shown in FIG. 7 is only a block diagram of a partial structure related to the solution of the present disclosure, and does not constitute a limitation on the electronic device to which the solution of the present disclosure is applied. The specific electronic device may be Include more or fewer components than shown in the figures, or combine certain components, or have a different arrangement of components.
在示例性实施例中,还提供了一种电子设备,包括:处理器;用于存储该处理器可执行指令的存储器;其中,该处理器被配置为该指令,以实现如本公开实施例中的信息推荐方法。In an exemplary embodiment, an electronic device is also provided, comprising: a processor; a memory for storing instructions executable by the processor; wherein the processor is configured to the instructions to implement embodiments as disclosed in the present disclosure The information in the recommended method.
在示例性实施例中,还提供了一种计算机可读存储介质,当该计算机可读存储介质中的指令由电子设备的处理器执行时,使得电子设备能够执行本公开实施例中的信息推荐方法。在一些实施例中,计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, a computer-readable storage medium is also provided, when the instructions in the computer-readable storage medium are executed by the processor of the electronic device, the electronic device can perform the information recommendation in the embodiments of the present disclosure method. In some embodiments, the computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
在示例性实施例中,还提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行本公开实施例中的信息推荐方法。In an exemplary embodiment, there is also provided a computer program product containing instructions, which, when executed on a computer, cause the computer to execute the information recommendation method in the embodiments of the present disclosure.
本公开所有实施例均可以单独被执行,也可以于其他实施例相结合被执行,均视为本公开要求的保护范围。All the embodiments of the present disclosure may be implemented independently or in combination with other embodiments, which are all regarded as the protection scope required by the present disclosure.
本领域普通技术人员可以理解实现上述实施例方法中的全部或部分流程,是可以通过计算机程序来指令相关的硬件来完成,该计算机程序可存储于一计算机可读取存储介质中,该计算机程序在执行时,可包括如上述各方法的实施例的流程。其中,本申请所提供的各实施例中所使用的对存储器、存储、数据库或其它介质的任何引用,均可包括非易失性和/或易失性存储器。Those of ordinary skill in the art can understand that all or part of the processes in the methods of the above embodiments can be implemented by instructing relevant hardware through a computer program, and the computer program can be stored in a computer-readable storage medium. During execution, the processes of the embodiments of the above-mentioned methods may be included. Wherein, any reference to memory, storage, database or other medium used in the various embodiments provided in this application may include non-volatile and/or volatile memory.

Claims (26)

  1. 一种信息推荐方法,包括:An information recommendation method including:
    响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
    确定推荐信息集中每个推荐信息的特征信息;Determine the feature information of each recommended information in the recommended information set;
    基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;Determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
    将所述目标行为数据输入到兴趣识别网络中进行兴趣识别,以得到所述目标对象对每个推荐信息的兴趣指标;Inputting the target behavior data into an interest recognition network for interest recognition to obtain the target object's interest index for each recommendation information;
    基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。The target information in the recommendation information set is recommended to the target object based on the interest index.
  2. 根据权利要求1所述的信息推荐方法,其中,所述基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据包括:The information recommendation method according to claim 1, wherein the determining the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information comprises:
    获取所述长期行为数据集中每个历史行为数据的行为特征信息;Obtaining behavior feature information of each historical behavior data in the long-term behavior data set;
    计算每个推荐信息的特征信息与所述行为特征信息间的第一相似度;calculating the first similarity between the feature information of each recommendation information and the behavior feature information;
    根据所述第一相似度,从所述长期行为数据集中确定所述每个推荐信息对应的目标行为数据。According to the first similarity, target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
  3. 根据权利要求1所述的信息推荐方法,其中,所述响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集包括:The information recommendation method according to claim 1, wherein, in response to the information acquisition request of the target object, acquiring the long-term behavior data set of the target object comprises:
    接收所述目标对象的信息获取请求,所述信息获取请求包括所述目标对象的目标对象标识;receiving an information acquisition request from the target object, where the information acquisition request includes a target object identifier of the target object;
    基于所述目标对象标识获取所述目标对象的长期行为数据集。A long-term behavior data set of the target object is acquired based on the target object identifier.
  4. 根据权利要求3所述的信息推荐方法,其中,所述基于所述对象标识获取所述目标对象的长期行为数据集包括:The information recommendation method according to claim 3, wherein the acquiring the long-term behavior data set of the target object based on the object identifier comprises:
    获取预先构建的历史行为数据库,所述历史行为数据库包括多个对象的历史行为数据,以及所述多个对象的对象标识与对应的历史行为数据间的映射关系;Obtaining a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between the object identifiers of the multiple objects and the corresponding historical behavior data;
    根据所述映射关系从所述历史行为数据库中,获取所述目标对象标识对应的历史行为数据;Obtain historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
    将所述对应的历史行为数据作为所述目标对象的长期行为数据集。The corresponding historical behavior data is used as the long-term behavior data set of the target object.
  5. 根据权利要求1所述的信息推荐方法,其中,所述确定推荐信息集中每个推荐信息的特征信息包括:The information recommendation method according to claim 1, wherein the determining the characteristic information of each recommendation information in the recommendation information set comprises:
    获取所述每个推荐信息的主题标签;obtaining the subject tag of each of the recommended information;
    基于所述每个推荐信息的主题标签生成所述每个推荐信息的特征信息;generating feature information of each recommendation information based on the subject tag of each recommendation information;
    或,or,
    提取所述每个推荐信息的文本信息;extracting the text information of each recommended information;
    基于所述每个推荐信息的文本信息生成所述每个推荐信息的特征信息。The feature information of each recommendation information is generated based on the text information of each recommendation information.
  6. 根据权利要求1所述的信息推荐方法,其中,所述基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象包括:The information recommendation method according to claim 1, wherein the recommending the target information in the recommended information set to the target object based on the interest index comprises:
    根据所述兴趣指标从所述推荐信息集中确定目标信息;determining target information from the recommendation information set according to the interest index;
    将所述目标信息推荐给所述目标对象。The target information is recommended to the target object.
  7. 根据权利要求1至6任一所述的信息推荐方法,还包括:The information recommendation method according to any one of claims 1 to 6, further comprising:
    获取多个样本推荐信息的信息标识和样本特征信息;Obtain information identification and sample feature information of multiple sample recommendation information;
    基于每个样本推荐信息的信息标识确定所述每个样本推荐信息对应的历史推荐对象和所述历史推荐对象对应的兴趣标注指标;Determine the historical recommendation object corresponding to each sample recommendation information and the interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
    获取所述历史推荐对象的长期样本行为数据集;obtaining a long-term sample behavior data set of the historical recommendation object;
    基于所述每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据;Determine the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of the each sample recommendation information;
    将所述样本行为数据输入神经网络中进行兴趣识别,得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;Inputting the sample behavior data into the neural network for interest identification, and obtaining the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
    根据所述兴趣预测指标和所述兴趣标注指标,确定目标损失;determining a target loss according to the interest prediction index and the interest labeling index;
    基于所述目标损失训练所述神经网络,得到所述兴趣识别网络。The neural network is trained based on the target loss to obtain the interest recognition network.
  8. 根据权利要求7所述的信息推荐方法,还包括:The information recommendation method according to claim 7, further comprising:
    确定所述每个样本推荐信息的推荐时间;determining the recommendation time of the recommendation information for each sample;
    基于所述推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;Perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtered behavior data set;
    所述基于所述每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据包括:The sample behavior data corresponding to each sample recommendation information determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information includes:
    基于所述每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定所述每个样本推荐信息对应的样本行为数据。The sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  9. 一种信息推荐装置,包括:An information recommendation device, comprising:
    第一行为数据集获取模块,被配置为响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;The first behavior data set acquisition module is configured to obtain a long-term behavior data set of the target object in response to an information acquisition request of the target object, and the long-term behavior data set represents the number of the target object in a preset time period. historical behavioral data;
    特征信息确定模块,被配置为确定推荐信息集中每个推荐信息的特征信息;a feature information determination module, configured to determine feature information of each recommendation information in the recommendation information set;
    目标行为数据确定模块,被配置为基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;a target behavior data determination module, configured to determine target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
    第一兴趣识别模块,被配置为将所述目标行为数据输入兴趣识别网络进行兴趣识别,得到所述目标对象对每个推荐信息的兴趣指标;a first interest identification module, configured to input the target behavior data into an interest identification network for interest identification, and obtain an interest index of the target object for each recommendation information;
    信息推荐模块,被配置为基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。An information recommendation module, configured to recommend target information in the recommended information set to the target object based on the interest index.
  10. 根据权利要求9所述的信息推荐装置,其中,所述目标行为数据确定模块包括:The information recommendation device according to claim 9, wherein the target behavior data determination module comprises:
    行为特征信息获取单元,被配置为获取所述长期行为数据集中每个历史行为数据的行为特征信息;a behavior feature information acquisition unit, configured to acquire behavior feature information of each historical behavior data in the long-term behavior data set;
    第一相似度计算单元,被配置为计算每个推荐信息的特征信息与所述行为特征信息间的第一相似度;a first similarity calculation unit, configured to calculate a first similarity between the feature information of each recommendation information and the behavior feature information;
    目标行为数据确定单元,被配置为根据所述第一相似度,从所述长期行为数据集中确定所述每个推荐信息对应的目标行为数据。The target behavior data determining unit is configured to determine the target behavior data corresponding to each recommendation information from the long-term behavior data set according to the first similarity.
  11. 根据权利要求9所述的信息推荐装置,其中,所述第一行为数据集获取模块包括:The information recommendation device according to claim 9, wherein the first behavior data set acquisition module comprises:
    信息获取请求接收单元,被配置为接收所述目标对象的信息获取请求,所述信息获取请求包括所述目标对象的目标对象标识;an information acquisition request receiving unit, configured to receive an information acquisition request of the target object, where the information acquisition request includes a target object identifier of the target object;
    长期行为数据集获取单元,被配置为基于所述目标对象标识获取所述目标对象的长期行为数据集。The long-term behavior data set acquisition unit is configured to acquire the long-term behavior data set of the target object based on the target object identifier.
  12. 根据权利要求11所述的信息推荐装置,其中,所述长期行为数据集获取单元包括:The information recommendation device according to claim 11, wherein the long-term behavior data set acquisition unit comprises:
    历史行为数据库获取单元,被配置为获取预先构建的历史行为数据库,所述历史行为数据库包括多个对象的历史行为数据,以及所述多个对象的对象标识与对应的历史行为数据间的映射关系;A historical behavior database acquisition unit, configured to acquire a pre-built historical behavior database, the historical behavior database includes historical behavior data of a plurality of objects, and a mapping relationship between the object identifiers of the plurality of objects and the corresponding historical behavior data ;
    历史行为数据获取单元,被配置为根据所述映射关系从所述历史行为数据库中,获取所述目标对象标识对应的历史行为数据;a historical behavior data acquisition unit, configured to acquire historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
    长期行为数据集确定单元,被配置为将所述对应的历史行为数据作为所述目标对象的长期行为数据集。The long-term behavior data set determination unit is configured to use the corresponding historical behavior data as the long-term behavior data set of the target object.
  13. 根据权利要求9所述的信息推荐装置,其中,所述特征信息确定模块包括:The information recommendation device according to claim 9, wherein the feature information determination module comprises:
    第一主题标签获取单元,被配置为获取所述每个推荐信息的主题标签;a first topic tag acquiring unit, configured to acquire the topic tag of each of the recommended information;
    第一特征信息生成单元,被配置为基于所述每个推荐信息的主题标签生成所述每个推荐信息的特征信息;a first feature information generating unit, configured to generate feature information of each recommendation information based on the subject tag of each recommendation information;
    或,or,
    第一文本信息提取单元,被配置为提取所述每个推荐信息的文本信息;a first text information extraction unit, configured to extract the text information of each recommended information;
    第二特征信息生成单元,被配置为基于所述每个推荐信息的文本信息生成所述每个推荐信息的特征信息。The second feature information generating unit is configured to generate feature information of each recommendation information based on the text information of each recommendation information.
  14. 根据权利要求9所述的信息推荐装置,其中,所述信息推荐模块包括:The information recommendation device according to claim 9, wherein the information recommendation module comprises:
    目标信息确定单元,被配置为根据所述兴趣指标从所述推荐信息集中确定目标信息;a target information determination unit, configured to determine target information from the recommended information set according to the interest index;
    信息推单元,被配置为将所述目标信息推荐给所述目标对象。An information pushing unit, configured to recommend the target information to the target object.
  15. 根据权利要求9至14任一所述的信息推荐装置,其中,所述装置还包括:The information recommendation device according to any one of claims 9 to 14, wherein the device further comprises:
    信息获取模块,被配置为获取多个样本推荐信息的信息标识和样本特征信息;an information acquisition module, configured to acquire information identifiers and sample feature information of multiple sample recommendation information;
    对象信息确定模块,被配置为基于每个样本推荐信息的信息标识确定所述每个样本推荐信息对应的历史推荐对象和所述历史推荐对象对应的兴趣标注指标;an object information determination module, configured to determine a historical recommendation object corresponding to each sample recommendation information and an interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
    第二行为数据集获取模块,被配置为获取所述历史推荐对象的长期样本行为数据集;The second behavior data set obtaining module is configured to obtain the long-term sample behavior data set of the historical recommendation object;
    样本行为数确定模块,被配置为基于所述每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据;a sample behavior number determination module, configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information;
    第二兴趣识别模块,被配置为将所述样本行为数据输入神经网络中进行兴趣识别,得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;The second interest identification module is configured to input the sample behavior data into the neural network for interest identification, and obtain the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
    目标损失确定模块,被配置为根据所述兴趣预测指标和所述兴趣标注指标,确定目标损失;a target loss determination module, configured to determine a target loss according to the interest prediction index and the interest labeling index;
    网络训练模块,被配置为基于所述目标损失训练所述神经网络,以得到所述兴趣识别网络。A network training module configured to train the neural network based on the target loss to obtain the interest recognition network.
  16. 根据权利要求15所述的信息推荐装置,还包括:The information recommendation device according to claim 15, further comprising:
    推荐时间确定模块,被配置为确定所述每个样本推荐信息的推荐时间;a recommendation time determination module, configured to determine the recommendation time of the recommended information for each sample;
    数据过滤模块,被配置为基于所述推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;a data filtering module, configured to perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtering behavior data set;
    所述样本行为数据确定模块还被配置为基于所述每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定所述每个样本推荐信息对应的样本行为数据。The sample behavior data determination module is further configured to determine sample behavior data corresponding to each sample recommendation information from a corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  17. 一种电子设备,包括:An electronic device comprising:
    处理器;processor;
    用于存储所述处理器可执行指令的存储器;a memory for storing the processor-executable instructions;
    其中,所述处理器被配置为:wherein the processor is configured to:
    响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
    确定推荐信息集中每个推荐信息的特征信息;Determine the feature information of each recommended information in the recommended information set;
    基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;Determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
    将所述目标行为数据输入到兴趣识别网络中进行兴趣识别,以得到所述目标对象对每个推荐信息的兴趣指标;Inputting the target behavior data into an interest recognition network for interest recognition to obtain the target object's interest index for each recommendation information;
    基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。The target information in the recommendation information set is recommended to the target object based on the interest index.
  18. 根据权利要求17所述的电子设备,所述处理器被配置为:The electronic device of claim 17, the processor configured to:
    获取所述长期行为数据集中每个历史行为数据的行为特征信息;Obtaining behavior feature information of each historical behavior data in the long-term behavior data set;
    计算每个推荐信息的特征信息与所述行为特征信息间的第一相似度;calculating the first similarity between the feature information of each recommendation information and the behavior feature information;
    根据所述第一相似度,从所述长期行为数据集中确定所述每个推荐信息对应的目标行为数据。According to the first similarity, target behavior data corresponding to each recommendation information is determined from the long-term behavior data set.
  19. 根据权利要求17所述的电子设备,所述处理器被配置为:The electronic device of claim 17, the processor configured to:
    接收所述目标对象的信息获取请求,所述信息获取请求包括所述目标对象的目标对象标识;receiving an information acquisition request from the target object, where the information acquisition request includes a target object identifier of the target object;
    基于所述目标对象标识获取所述目标对象的长期行为数据集。A long-term behavior data set of the target object is acquired based on the target object identifier.
  20. 根据权利要求19所述的电子设备,所述处理器被配置为:The electronic device of claim 19, the processor configured to:
    获取预先构建的历史行为数据库,所述历史行为数据库包括多个对象的历史行为数据,以及所述多个对象的对象标识与对应的历史行为数据间的映射关系;Obtaining a pre-built historical behavior database, the historical behavior database includes historical behavior data of multiple objects, and the mapping relationship between the object identifiers of the multiple objects and the corresponding historical behavior data;
    根据所述映射关系从所述历史行为数据库中,获取所述目标对象标识对应的历史行为数据;Obtain historical behavior data corresponding to the target object identifier from the historical behavior database according to the mapping relationship;
    将所述对应的历史行为数据作为所述目标对象的长期行为数据集。The corresponding historical behavior data is used as the long-term behavior data set of the target object.
  21. 根据权利要求17所述的电子设备,所述处理器被配置为:The electronic device of claim 17, the processor configured to:
    获取所述每个推荐信息的主题标签;obtaining the subject tag of each of the recommended information;
    基于所述每个推荐信息的主题标签生成所述每个推荐信息的特征信息;generating feature information of each recommendation information based on the subject tag of each recommendation information;
    或,or,
    提取所述每个推荐信息的文本信息;extracting the text information of each recommended information;
    基于所述每个推荐信息的文本信息生成所述每个推荐信息的特征信息。The feature information of each recommendation information is generated based on the text information of each recommendation information.
  22. 根据权利要求17所述的电子设备,所述处理器被配置为:The electronic device of claim 17, the processor configured to:
    根据所述兴趣指标从所述推荐信息集中确定目标信息;determining target information from the recommendation information set according to the interest index;
    将所述目标信息推荐给所述目标对象。The target information is recommended to the target object.
  23. 根据权利要求17所述的电子设备,所述处理器还被配置为:The electronic device of claim 17, the processor further configured to:
    获取多个样本推荐信息的信息标识和样本特征信息;Obtain information identification and sample feature information of multiple sample recommendation information;
    基于每个样本推荐信息的信息标识确定所述每个样本推荐信息对应的历史推荐对象和所述历史推荐对象对应的兴趣标注指标;Determine the historical recommendation object corresponding to each sample recommendation information and the interest labeling index corresponding to the historical recommendation object based on the information identifier of each sample recommendation information;
    获取所述历史推荐对象的长期样本行为数据集;obtaining a long-term sample behavior data set of the historical recommendation object;
    基于所述每个样本推荐信息的样本特征信息从对应长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据;Determine the sample behavior data corresponding to each sample recommendation information from the corresponding long-term sample behavior data set based on the sample feature information of the each sample recommendation information;
    将所述样本行为数据输入神经网络中进行兴趣识别,得到每个历史推荐对象对对应的样本推荐信息的兴趣预测指标;Inputting the sample behavior data into the neural network for interest identification, and obtaining the interest prediction index of each historical recommendation object to the corresponding sample recommendation information;
    根据所述兴趣预测指标和所述兴趣标注指标,确定目标损失;determining a target loss according to the interest prediction index and the interest labeling index;
    基于所述目标损失训练所述神经网络,得到所述兴趣识别网络。The neural network is trained based on the target loss to obtain the interest recognition network.
  24. 根据权利要求23所述的电子设备,所述处理器还被配置为:The electronic device of claim 23, the processor further configured to:
    确定所述每个样本推荐信息的推荐时间;determining the recommendation time of the recommendation information for each sample;
    基于所述推荐时间对对应的长期样本行为数据集进行数据过滤,得到过滤行为数据集;Perform data filtering on the corresponding long-term sample behavior data set based on the recommended time to obtain a filtered behavior data set;
    所述基于所述每个样本推荐信息的样本特征信息从对应的长期样本行为数据集中,确定所述每个样本推荐信息对应的样本行为数据包括:The sample behavior data corresponding to each sample recommendation information determined from the corresponding long-term sample behavior data set based on the sample feature information of each sample recommendation information includes:
    基于所述每个样本推荐信息的样本特征信息从对应的过滤行为数据集中,确定所述每个样本推荐信息对应的样本行为数据。The sample behavior data corresponding to each sample recommendation information is determined from the corresponding filtering behavior data set based on the sample feature information of each sample recommendation information.
  25. 一种计算机可读存储介质,当所述存储介质中的指令由电子设备的处理器执行时,使得电子设备能够执行:A computer-readable storage medium, when instructions in the storage medium are executed by a processor of an electronic device, enabling the electronic device to execute:
    响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
    确定推荐信息集中每个推荐信息的特征信息;Determine the feature information of each recommended information in the recommended information set;
    基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;Determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
    将所述目标行为数据输入到兴趣识别网络中进行兴趣识别,以得到所述目标对象对每个推荐信息的兴趣指标;Inputting the target behavior data into an interest recognition network for interest recognition to obtain the target object's interest index for each recommendation information;
    基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。The target information in the recommendation information set is recommended to the target object based on the interest index.
  26. 一种计算机程序产品,包括计算机程序,其中,所述计算机程序被处理器执行时实现信息推荐方法,A computer program product, comprising a computer program, wherein the computer program implements an information recommendation method when executed by a processor,
    该信息推荐方法包括:This information recommendation method includes:
    响应于目标对象的信息获取请求,获取所述目标对象的长期行为数据集,所述长期行为数据集表征所述目标对象在预设时间段内的多个历史行为数据;In response to the information acquisition request of the target object, acquiring a long-term behavior data set of the target object, where the long-term behavior data set represents a plurality of historical behavior data of the target object within a preset time period;
    确定推荐信息集中每个推荐信息的特征信息;Determine the feature information of each recommended information in the recommended information set;
    基于所述每个推荐信息的特征信息从所述长期行为数据集中,确定所述每个推荐信息对应的目标行为数据;Determine the target behavior data corresponding to each recommendation information from the long-term behavior data set based on the feature information of each recommendation information;
    将所述目标行为数据输入到兴趣识别网络中进行兴趣识别,以得到所述目标对象对每个推荐信息的兴趣指标;Inputting the target behavior data into an interest recognition network for interest recognition to obtain the target object's interest index for each recommendation information;
    基于所述兴趣指标将所述推荐信息集中的目标信息推荐给所述目标对象。The target information in the recommendation information set is recommended to the target object based on the interest index.
PCT/CN2021/116964 2021-01-11 2021-09-07 Information recommendation method and device WO2022148038A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202110034339.3 2021-01-11
CN202110034339.3A CN112883258B (en) 2021-01-11 2021-01-11 Information recommendation method and device, electronic equipment and storage medium

Publications (1)

Publication Number Publication Date
WO2022148038A1 true WO2022148038A1 (en) 2022-07-14

Family

ID=76045063

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/116964 WO2022148038A1 (en) 2021-01-11 2021-09-07 Information recommendation method and device

Country Status (2)

Country Link
CN (1) CN112883258B (en)
WO (1) WO2022148038A1 (en)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112883258B (en) * 2021-01-11 2024-05-21 北京达佳互联信息技术有限公司 Information recommendation method and device, electronic equipment and storage medium
CN113672746B (en) * 2021-07-27 2024-03-26 北京达佳互联信息技术有限公司 Multimedia resource recommendation method and device, electronic equipment and storage medium
CN113806568B (en) * 2021-08-10 2023-11-03 中国人民大学 Multimedia resource recommendation method and device, electronic equipment and storage medium
CN113761364B (en) * 2021-08-17 2024-02-09 武汉卓尔数字传媒科技有限公司 Multimedia data pushing method and device
CN113918738B (en) * 2021-12-07 2023-05-09 北京达佳互联信息技术有限公司 Multimedia resource recommendation method and device, electronic equipment and storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105913273A (en) * 2015-12-14 2016-08-31 乐视网信息技术(北京)股份有限公司 Intelligent recommendation all-terminal display method and device
CN107038226A (en) * 2017-03-31 2017-08-11 努比亚技术有限公司 A kind of information recommendation method and the network equipment
CN107306355A (en) * 2016-04-20 2017-10-31 华为技术有限公司 A kind of content recommendation method and server
US20180150305A1 (en) * 2016-11-29 2018-05-31 Samsung Electronics Co., Ltd. Method for recommending content and apparatus therefor
CN110781391A (en) * 2019-10-22 2020-02-11 腾讯科技(深圳)有限公司 Information recommendation method, device, equipment and storage medium
CN112883258A (en) * 2021-01-11 2021-06-01 北京达佳互联信息技术有限公司 Information recommendation method and device, electronic equipment and storage medium

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US10685065B2 (en) * 2012-03-17 2020-06-16 Haizhi Wangju Network Technology (Beijing) Co., Ltd. Method and system for recommending content to a user
US10474932B2 (en) * 2016-09-01 2019-11-12 Uptake Technologies, Inc. Detection of anomalies in multivariate data

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105913273A (en) * 2015-12-14 2016-08-31 乐视网信息技术(北京)股份有限公司 Intelligent recommendation all-terminal display method and device
CN107306355A (en) * 2016-04-20 2017-10-31 华为技术有限公司 A kind of content recommendation method and server
US20180150305A1 (en) * 2016-11-29 2018-05-31 Samsung Electronics Co., Ltd. Method for recommending content and apparatus therefor
CN107038226A (en) * 2017-03-31 2017-08-11 努比亚技术有限公司 A kind of information recommendation method and the network equipment
CN110781391A (en) * 2019-10-22 2020-02-11 腾讯科技(深圳)有限公司 Information recommendation method, device, equipment and storage medium
CN112883258A (en) * 2021-01-11 2021-06-01 北京达佳互联信息技术有限公司 Information recommendation method and device, electronic equipment and storage medium

Also Published As

Publication number Publication date
CN112883258B (en) 2024-05-21
CN112883258A (en) 2021-06-01

Similar Documents

Publication Publication Date Title
WO2022148038A1 (en) Information recommendation method and device
AU2018383346B2 (en) Domain-specific natural language understanding of customer intent in self-help
JP6986527B2 (en) How and equipment to process video
CN107193974B (en) Regional information determination method and device based on artificial intelligence
CN110069698B (en) Information pushing method and device
US10606910B2 (en) Ranking search results using machine learning based models
US11263208B2 (en) Context-sensitive cross-lingual searches
US10262041B2 (en) Scoring mechanism for discovery of extremist content
US11436446B2 (en) Image analysis enhanced related item decision
CN112883257B (en) Behavior sequence data processing method and device, electronic equipment and storage medium
CN115329131A (en) Material label recommendation method and device, electronic equipment and storage medium
CN113254671B (en) Atlas optimization method, device, equipment and medium based on query analysis
CN114678137A (en) Big data mining method and server based on digital medical treatment
CN113420203A (en) Object recommendation method and device, electronic equipment and storage medium
CN114491093B (en) Multimedia resource recommendation and object representation network generation method and device
CN112883256B (en) Multitasking method, apparatus, electronic device and storage medium
CN113610215B (en) Task processing network generation method, task processing device and electronic equipment
CN115756821A (en) Online task processing model training and task processing method and device
CN116822491A (en) Log analysis method and device, equipment and storage medium
CN114385918A (en) Message pushing method and device, computer equipment and storage medium
CN113868516A (en) Object recommendation method and device, electronic equipment and storage medium
CN114550157A (en) Bullet screen gathering identification method and device
CN114357242A (en) Training evaluation method and device based on recall model, equipment and storage medium
CN111127057B (en) Multi-dimensional user portrait recovery method
CN114564653A (en) Information recommendation method and device, server and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21917081

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

32PN Ep: public notification in the ep bulletin as address of the adressee cannot be established

Free format text: NOTING OF LOSS OF RIGHTS PURSUANT TO RULE 112(1) EPC (EPO FORM 1205 DATED 20/10/2023)