WO2022116862A1 - Information pushing method and system, model training method, and related devices - Google Patents

Information pushing method and system, model training method, and related devices Download PDF

Info

Publication number
WO2022116862A1
WO2022116862A1 PCT/CN2021/132104 CN2021132104W WO2022116862A1 WO 2022116862 A1 WO2022116862 A1 WO 2022116862A1 CN 2021132104 W CN2021132104 W CN 2021132104W WO 2022116862 A1 WO2022116862 A1 WO 2022116862A1
Authority
WO
WIPO (PCT)
Prior art keywords
model
target domain
domain prediction
user
training
Prior art date
Application number
PCT/CN2021/132104
Other languages
French (fr)
Chinese (zh)
Inventor
李天浩
陈大乾
Original Assignee
京东科技控股股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 京东科技控股股份有限公司 filed Critical 京东科技控股股份有限公司
Publication of WO2022116862A1 publication Critical patent/WO2022116862A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/95Retrieval from the web
    • G06F16/953Querying, e.g. by the use of web search engines
    • G06F16/9535Search customisation based on user profiles and personalisation
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/06Buying, selling or leasing transactions
    • G06Q30/0601Electronic shopping [e-shopping]
    • G06Q30/0631Item recommendations

Definitions

  • the present disclosure relates to the field of data processing, and in particular, to an information push method and system, a model training method and related equipment.
  • model training is usually performed based on actual own business scene data.
  • an information push method including: acquiring a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is a prediction result using a source domain prediction model , and the user training data of the target domain are obtained by training, the source domain prediction model is obtained by training the user training data of the source domain; using the target domain prediction model, the user data to be tested in the target domain is predicted to Obtain the information push result to the corresponding user.
  • the target domain prediction model and the encoded value of the target domain prediction model that have been trained and sent by the offline system are periodically acquired, and the information push method further includes: comparing the acquired encoded value of the prediction network model with the currently used target If the coding values of the domain prediction models are different, verify the version of the obtained target domain prediction model; if the online verification of the obtained target domain prediction model passes, use the obtained target domain prediction model to replace the currently used one.
  • the target domain prediction model wherein the online verification includes version verification.
  • the online verification further includes model verification
  • the information push method further includes: in the case that the acquired value of the preset parameter of the target domain prediction model is within a preset range, by model validation.
  • the obtained target domain prediction model is a solidification map file, in which the parameters of the target domain prediction model determined through training are converted into constants.
  • the user data to be measured includes user characteristics and product characteristics
  • the information push result to the corresponding user is a result of whether the user recommends a product.
  • a model training method for information push including: training a source domain prediction model by using user training data of a source domain; The domain feature data is input into the source domain prediction model and the target domain prediction model respectively, and the corresponding source domain prediction results and target domain prediction results of the user are obtained; according to the difference between the source domain prediction result and the target domain prediction result, and the corresponding tag value of the user Adjust the parameters of the target domain prediction model based on the difference between the prediction result of the target domain and the target domain.
  • adjusting the parameters of the target domain prediction model includes: adjusting the parameters of the target domain prediction model based on a loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the source domain prediction result The cross-entropy with the target domain prediction result, and the cross-entropy between the user-corresponding tag value and the target domain prediction result.
  • the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model.
  • the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
  • the model training method further includes: acquiring first feature data and second feature data corresponding to each of the multiple users, wherein the second feature data corresponding to the same user includes a partial dimension of the first feature data feature; use the first feature data to train the first preliminary model; input the first feature data and the second feature data corresponding to the same user into the first preliminary model and the second preliminary model respectively, and obtain the first preliminary model corresponding to the user
  • the prediction result and the prediction result of the second preparatory model according to the difference between the prediction result of the first preparatory model and the prediction result of the second preparatory model, and the difference between the mark value corresponding to the user and the prediction result of the first preparatory model, the parameters of the second preparatory model Make adjustments; use the adjusted second preliminary model as the source domain prediction model.
  • the first preliminary model and the second preliminary model have the same network model structure except that the input layer is different.
  • the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
  • an apparatus for pushing information comprising: an acquisition module configured to acquire a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is obtained by using a source The prediction result of the domain prediction model and the user training data of the target domain are obtained by training, and the source domain prediction model is obtained by training the user training data of the source domain; the prediction module is configured to use the target domain prediction model, Predict the user data to be measured in the target domain to obtain information push results for the corresponding users.
  • a model training apparatus for information push including: a source domain training module configured to train a source domain prediction model using user training data of the source domain; target domain training The module is configured to input the source domain feature data and the target domain feature data corresponding to the same user into the source domain prediction model and the target domain prediction model, respectively, to obtain the source domain prediction result and the target domain prediction result corresponding to the user; and, according to The difference between the prediction result of the source domain and the prediction result of the target domain, as well as the difference between the mark value corresponding to the user and the prediction result of the target domain, adjust the parameters of the prediction model of the target domain.
  • an information push system including: an information push apparatus; and a model training apparatus for information push.
  • an information pushing apparatus comprising: a memory; and a processor coupled to the memory, the processor being configured to execute any one of the foregoing based on instructions stored in the memory Information push method.
  • a model training apparatus for information push comprising: a memory; and a processor coupled to the memory, the processor being configured to, based on instructions stored in the memory, Perform any one of the aforementioned model training methods for information push.
  • a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements any one of the foregoing information pushing methods, or any of the foregoing usage methods.
  • Model training method for information push is provided.
  • FIG. 1 shows a schematic flowchart of an information push method according to some embodiments of the present disclosure.
  • FIG. 2 shows a schematic flowchart of a model training method for information push according to some embodiments of the present disclosure.
  • FIG. 3 shows a schematic flowchart of a pre-training method according to some embodiments of the present disclosure.
  • FIG. 4 shows a schematic flowchart of a model verification method according to some embodiments of the present disclosure.
  • FIG. 5 shows a schematic structural diagram of an information pushing apparatus according to some embodiments of the present disclosure.
  • FIG. 6 shows a schematic structural diagram of a model training apparatus for information push according to some embodiments of the present disclosure.
  • FIG. 7 shows a schematic structural diagram of an information push system according to some embodiments of the present disclosure.
  • FIG. 8 shows a schematic structural diagram of a data processing apparatus according to some embodiments of the present disclosure.
  • FIG. 9 shows a schematic structural diagram of a data processing apparatus according to other embodiments of the present disclosure.
  • a technical problem to be solved by the embodiments of the present disclosure is: how to improve the accuracy of user recommendation in a new business scenario.
  • FIG. 1 shows a schematic flowchart of an information push method according to some embodiments of the present disclosure. As shown in FIG. 1 , the information pushing method of this embodiment includes steps S102 to S104.
  • step S102 the prediction network model sent by the offline system and the training has been completed is obtained, wherein the prediction network model is obtained by using the prediction result of the teacher network model of the prediction network model and the user training data of the target domain for training.
  • the model's teacher network model is obtained by training with user training data from the source domain.
  • domains represent different business scenarios.
  • the source domain is, for example, a mature and usable business scenario with a large amount of user data
  • the target domain is, for example, a new business scenario with a small amount of user data.
  • a company's main business is e-commerce, and its e-commerce platform has sufficient user purchase data. If the likelihood of a user purchasing an item is predicted, so as to recommend an item for the user, the prediction model can be trained using the user purchase data. After that, the company expanded its financial category business, and some users of the original e-commerce platform also opened financial category business.
  • the user purchase data can be regarded as the data of the source domain, and the user's financial business data can be regarded as the data of the target domain.
  • a commonly used training method of a neural network model is: training according to the difference between the prediction result of the training data and the marked value by the model.
  • a source domain prediction model obtained by training with sufficient data volume is considered in the training process.
  • the target domain prediction model not only the prediction accuracy of the input data of the target domain prediction model itself is considered, but also the difference between the prediction results of the target domain prediction model and the prediction results of the source domain prediction model, so that the target domain prediction model Can learn the knowledge learned by the source domain prediction model. Therefore, even when the amount of data in the target domain is small, a target domain prediction model with higher accuracy can be obtained.
  • the specific training method of the target domain prediction model will be further introduced in the following embodiments.
  • the acquired target domain prediction model is a solidified map file, and in the solidified map file, the parameters of the target domain prediction model determined through training are converted into constants. Therefore, the online system obtains a lighter model file, which helps to improve the online efficiency of the new version of the model.
  • step S104 the prediction network model is used to predict the user data to be measured in the target domain, so as to obtain an information push result for the corresponding user.
  • the user data to be measured includes user characteristics, product characteristics, and environmental characteristics (eg, time characteristics, characteristics of other users, characteristics of related products, characteristics of platform activities, etc.) and the like.
  • the information push result to the corresponding user is: the result of whether the product is recommended for the user.
  • the information push result for the corresponding user includes a first indicator and a second indicator, the first indicator indicates that the product is recommended for the user, and the second indicator indicates that the product is not recommended for the user. said product.
  • the user data to be tested of user A includes the characteristics of user A and the characteristics of a certain shampoo, and the information push result is whether the shampoo is recommended for user A.
  • the prediction network model outputs the judgment probability, and the recommendation result is determined according to the comparison result between the judgment probability and the preset probability.
  • a push interface between the front-end application module and the back-end server can be used to send the recommendation result to the user's terminal in a preset information format.
  • the information push result is to recommend a corresponding product to the user
  • the information of the corresponding product is sent to the user terminal; when the information push result is that the corresponding product is not recommended to the user, the corresponding product is not sent to the user. product information.
  • the above embodiment effectively utilizes cross-domain data information, aggregates the knowledge that can be learned from multiple data islands, and assists the training of the target domain prediction model in combination with the training results of the source domain prediction model, which further improves the generalization ability of the model. . Therefore, when a new service or a new application scenario is launched, the embodiments of the present disclosure can quickly and accurately provide a corresponding prediction model.
  • the following describes an embodiment of the model training method for information push of the present disclosure with reference to FIG. 2 .
  • FIG. 2 shows a schematic flowchart of a model training method for information push according to some embodiments of the present disclosure.
  • the model training method for information push in this embodiment includes steps S202 to S206.
  • the user's exposure logs, click logs, product content forward index, user profile feature logs, and the like are collected. After collecting these data, for example, data fusion is performed through identifications such as device numbers, and dirty data lacking effective features is removed, thereby obtaining multiple pieces of user data.
  • positive samples and negative samples can also be extracted from the obtained data according to a preset ratio; in addition, samples that do not meet preset conditions can also be filtered out, for example, the browsing of products is often lower than a certain threshold. samples etc.
  • step S202 the source domain prediction model is trained using the user training data of the source domain.
  • step S204 the source domain feature data and the target domain feature data corresponding to the same user are input into the source domain prediction model and the target domain prediction model, respectively, to obtain the source domain prediction result and the target domain prediction result corresponding to the user. That is, input the source domain feature data corresponding to a user into the source domain prediction model to obtain the source domain prediction result; then input the target domain feature data corresponding to the same user into the target domain prediction model to obtain the target domain prediction result.
  • step S206 the parameters of the target domain prediction model are adjusted according to the difference between the prediction result of the source domain and the prediction result of the target domain, and the difference between the mark value corresponding to the user and the prediction result of the target domain.
  • the parameters of the target domain prediction model are adjusted based on the loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the cross entropy of the source domain prediction result and the target domain prediction result, and the user The cross-entropy of the corresponding label value and the prediction result of the target domain. For example, take Equation (1) as the loss function of the target domain prediction model.
  • L represents the value of the loss function
  • CE(*,*) represents the calculation of cross entropy for the two variables in parentheses
  • y represents the label value
  • pred represents the prediction result of the target domain
  • q represents the prediction result of the source domain
  • represents a preset parameter.
  • the value of q is determined by the softmax layer represented by equation (2).
  • qi represents the probability corresponding to the i-th class in the classification result of the source domain prediction model
  • z i represents the result input to the softmax layer corresponding to the i -th class
  • j represents the source domain prediction model given zj represents the result input to the softmax layer corresponding to the jth class
  • the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
  • a certain user has user purchase data on the e-commerce platform as the source domain and financial business data on the financial business platform as the target domain. Let's make a recommendation for a certain mobile phone.
  • the source domain prediction result corresponding to the user is, for example, whether a mobile phone is recommended for the user
  • the target domain prediction result corresponding to the user is, for example, whether the user recommends a financial service for purchasing mobile phones by installments. If for the same user and the same item, both the source domain prediction result and the target domain prediction result are recommended, it can be considered that the output values of the source domain prediction model and the target domain prediction model are the same.
  • the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model.
  • the complexity of the model is measured, for example, by the number of layers of the model, the number of layers with preset mechanisms (eg, attention mechanisms), the number of parameters, and so on.
  • the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
  • the complexity of the model trained later is often lower than the complexity of the model trained earlier, or the model trained later can process lower-dimensional data than the model trained earlier. This is to facilitate the post-trained model to meet the requirements of lightweight operation. For example, when transplanting a server-side model to a mobile terminal to run, the computing capability, storage capability, and data processing capability of the mobile terminal need to be considered.
  • the embodiments of the present disclosure are applied in the scenario of data cross-domain learning, so the complexity of the target domain prediction model may be higher than that of the source domain prediction model, and the input data of the target domain prediction model may also be more complex data . Therefore, even for new business scenarios, more complex information push ideas can be implemented.
  • the source domain prediction model uses as few input dimensions as possible.
  • the source domain prediction model is obtained through a pre-training process. An embodiment of the pre-training method of the present disclosure is described below with reference to FIG. 3 .
  • the training idea of this embodiment is similar to the idea of training the target domain prediction model, both of which use the training result of one model to improve the training accuracy of another model.
  • FIG. 3 shows a schematic flowchart of a pre-training method according to some embodiments of the present disclosure. As shown in FIG. 3 , the pre-training method of this embodiment includes steps S302 to S310.
  • step S302 first feature data and second feature data corresponding to each of the multiple users are acquired, wherein the second feature data corresponding to the same user includes features of a partial dimension of the first feature data.
  • the first feature data and the second feature data are both data of the source domain, and the difference lies in the number of features of the two. For example, if the first feature data is 1000-dimensional user purchase data, the second feature data takes part of the dimensions to form 100-dimensional user purchase data.
  • step S304 a first preliminary model is trained by using the first feature data.
  • step S306 the first feature data and the second feature data corresponding to the same user are input into the first preliminary model and the second preliminary model respectively, and the first preliminary model prediction result and the second preliminary model prediction result corresponding to the user are obtained .
  • step S308 the parameters of the second preliminary model are adjusted according to the difference between the prediction result of the first preliminary model and the prediction result of the second preliminary model, and the difference between the mark value corresponding to the user and the prediction result of the first preliminary model.
  • this gap can be represented by cross-entropy.
  • the parameters of the second preliminary model are adjusted based on the loss function of the second preliminary model, wherein the loss function of the second preliminary model includes the intersection of the prediction result of the first preliminary model and the prediction result of the second preliminary model entropy, and the cross entropy between the label value corresponding to the user and the prediction result of the second preliminary model. For example, take Equation (3) as the loss function of the target domain prediction model.
  • L pre represents the value of the loss function
  • CE(*,*) represents the calculation of the cross entropy for the two variables in brackets
  • y' represents the label value
  • pred' represents the prediction result of the second preliminary model
  • q' Indicates the prediction result of the first preliminary model.
  • the second preparatory network has fewer input dimensions, it also learns the training results of the first preparatory network obtained by training data with more dimensions during the training process. Therefore, the second preparatory network also has a higher prediction accuracy.
  • the first preliminary model and the second preliminary model have the same network model structure except for different input layers.
  • the training process of the second preparatory model can be made to focus more on knowledge extraction of unused features.
  • step S310 the adjusted second preliminary model is used as the source domain prediction model. After a number of tuning iterations, the second preparatory model completes training.
  • the trained second preliminary model may also be tested. If the test accuracy rate is greater than the preset value, the second preliminary model is used as the source domain prediction model. If the test accuracy rate is not greater than the preset value, retraining may be selected; or it may be considered that the input features of the second preliminary model are insufficient to characterize the user, and the features need to be re-selected as the input features of the second preliminary model.
  • the source domain prediction model can be made to use fewer input features but have a prediction accuracy comparable to the model represented by multi-dimensional features, thereby indirectly improving The training efficiency of the target domain prediction model.
  • the training process and the information pushing process of the target domain prediction model can be deployed in an offline system and an online system, respectively.
  • the offline system can periodically update the trained target domain prediction model through the data accumulated during the business process, and send it to the online system for application.
  • the online system may also verify the target domain prediction model before updating it.
  • An embodiment of the model verification method of the present disclosure is described below with reference to FIG. 4 .
  • FIG. 4 shows a schematic flowchart of a model verification method according to some embodiments of the present disclosure. As shown in FIG. 4 , the model verification method of this embodiment includes steps S402 to S408.
  • step S402 regularly acquire the target domain prediction model and the encoded value of the target domain prediction model sent by the offline system and completed training.
  • the encoded value is an MD5 encoded value.
  • step S404 in the case that the obtained coded value of the prediction network model is different from the coded value of the currently used target domain prediction model, verify the version of the obtained target domain prediction model.
  • the offline system sends the newly trained version of the target domain prediction model twice. After the first transmission, the online system has already put it online for use. When sending for the second time, if the online system repeatedly executes the online process of the same model, it will affect the system efficiency and waste system resources. Therefore, by verifying the coded value, the situation of repeated online access is avoided, and system resources are saved.
  • step S406 based on the version verification result, determine the online verification result of the acquired target domain prediction model.
  • the online verification further includes model verification, which is used to verify whether the acquired value of the preset parameter of the target domain prediction model is within the preset range, for example, checking whether the key parameter is empty, and so on. Therefore, it is possible to find out the error of the sending object or the transmission error in time, which improves the stability of the system.
  • model verification is used to verify whether the acquired value of the preset parameter of the target domain prediction model is within the preset range, for example, checking whether the key parameter is empty, and so on. Therefore, it is possible to find out the error of the sending object or the transmission error in time, which improves the stability of the system.
  • step S408 if the online verification of the acquired target domain prediction model is passed, the currently used target domain prediction model is replaced with the acquired target domain prediction model.
  • FIG. 5 shows a schematic structural diagram of an information pushing apparatus according to some embodiments of the present disclosure.
  • the information pushing apparatus 500 of this embodiment includes: an obtaining module 5100 configured to obtain a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is obtained by using a source domain prediction model The prediction result and the user training data of the target domain are obtained by training, and the source domain prediction model is obtained by training the user training data of the source domain; the prediction module 5200 is configured to use the target domain prediction model to perform the target domain The user data to be tested is predicted to obtain the information push results for the corresponding users.
  • the acquiring module 5100 is further configured to periodically acquire the target domain prediction model and the encoded value of the target domain prediction model that are sent by the offline system and have completed training;
  • the information pushing apparatus 500 further includes: a verification module 5300, which is be configured to verify the version of the obtained target domain prediction model under the condition that the obtained coding value of the prediction network model is different from the coding value of the currently used target domain prediction model; and, in the obtained target domain prediction When the online verification of the model is passed, the acquired target domain prediction model is used to replace the currently used target domain prediction model, wherein the online verification includes the version verification.
  • the online verification further includes model verification
  • the verification module 5300 is further configured to, in the case that the acquired value of the preset parameter of the target domain prediction model is within a preset range, by Model validation of predictive models for the target domain.
  • the acquired target domain prediction model is a solidification map file, in which the parameters of the target domain prediction model determined through training are converted into constants.
  • the user data to be measured includes user characteristics and product characteristics
  • the information push result to the corresponding user is a result of whether the user recommends the product.
  • the following describes an embodiment of the model training apparatus for information push of the present disclosure with reference to FIG. 6 .
  • FIG. 6 shows a schematic structural diagram of a model training apparatus for information push according to some embodiments of the present disclosure.
  • the model training device 600 of this embodiment includes: a source domain training module 6100 configured to train a source domain prediction model using user training data in the source domain; and a target domain training module 6200 configured to The source domain feature data and target domain feature data corresponding to the same user are respectively input into the source domain prediction model and the target domain prediction model, and the source domain prediction result and the target domain prediction result corresponding to the user are obtained; and, according to the The difference between the prediction result of the source domain and the prediction result of the target domain, and the difference between the mark value corresponding to the user and the prediction result of the target domain, adjust the parameters of the prediction model of the target domain.
  • the target domain training module 6200 is further configured to adjust parameters of the target domain prediction model based on a loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the source domain The cross entropy between the prediction result and the target domain prediction result, and the cross entropy between the tag value corresponding to the user and the target domain prediction result.
  • the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model.
  • the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
  • the source domain training module 6100 is further configured to obtain first feature data and second feature data corresponding to each of the plurality of users, wherein the second feature data corresponding to the same user includes the Partial dimension features of the first feature data; using the first feature data to train the first preliminary model; inputting the first feature data and the second feature data corresponding to the same user into the first preliminary model and the second preliminary model respectively , obtain the first preliminary model prediction result and the second preliminary model prediction result corresponding to the user; according to the difference between the first preliminary model prediction result and the second preliminary model prediction result, and the mark value corresponding to the user According to the difference between the prediction result of the first preliminary model and the first preliminary model, the parameters of the second preliminary model are adjusted; the adjusted second preliminary model is used as the source domain prediction model.
  • the first preliminary model and the second preliminary model have the same network model structure except that the input layer is different.
  • the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
  • FIG. 7 shows a schematic structural diagram of an information push system according to some embodiments of the present disclosure.
  • the information pushing system 70 of this embodiment includes an information pushing apparatus 500 and a model training apparatus 600 for information pushing.
  • the information push apparatus 500 is deployed in the online system of the information push system 70
  • the model training apparatus 600 is deployed in the offline system of the information push system 70 .
  • FIG. 8 shows a schematic structural diagram of a data processing apparatus according to some embodiments of the present disclosure, where the data processing apparatus is an information push apparatus or a model training apparatus for information push.
  • the data processing apparatus 80 of this embodiment includes: a memory 810 and a processor 820 coupled to the memory 810 , and the processor 820 is configured to execute any one of the foregoing implementations based on instructions stored in the memory 810
  • the information push method in the example or the model training method for information push.
  • the memory 810 may include, for example, a system memory, a fixed non-volatile storage medium, and the like.
  • the system memory stores, for example, an operating system, an application program, a boot loader (Boot Loader), and other programs.
  • FIG. 9 shows a schematic structural diagram of a data processing apparatus according to other embodiments of the present disclosure, where the data processing apparatus is an information push apparatus or a model training apparatus for information push.
  • the data processing apparatus 90 in this embodiment includes: a memory 910 and a processor 920, and may further include an input/output interface 930, a network interface 940, a storage interface 950, and the like. These interfaces 930 , 940 , 950 and the memory 910 and the processor 920 can be connected, for example, through a bus 960 .
  • the input and output interface 930 provides a connection interface for input and output devices such as a display, a mouse, a keyboard, and a touch screen.
  • Network interface 940 provides a connection interface for various networked devices.
  • the storage interface 950 provides a connection interface for external storage devices such as SD cards and U disks.
  • Embodiments of the present disclosure further provide a computer-readable storage medium on which a computer program is stored, characterized in that, when the program is executed by a processor, any one of the foregoing information push methods or a model training method for information push is implemented. .
  • embodiments of the present disclosure may be provided as a method, system, or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present disclosure may take the form of a computer program product embodied on one or more computer-usable non-transitory storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein .
  • computer-usable non-transitory storage media including, but not limited to, disk storage, CD-ROM, optical storage, etc.
  • These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory result in an article of manufacture comprising instruction means, the instructions
  • the apparatus implements the functions specified in the flow or flows of the flowcharts and/or the block or blocks of the block diagrams.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Databases & Information Systems (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Finance (AREA)
  • General Engineering & Computer Science (AREA)
  • Accounting & Taxation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Computation (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Evolutionary Biology (AREA)
  • Artificial Intelligence (AREA)
  • Development Economics (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Strategic Management (AREA)
  • General Business, Economics & Management (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present disclosure relates to an information pushing method and system, a model training method, and related devices, which relate to the field of data processing. The information pushing method comprises: acquiring a trained target domain prediction model sent by an offline system, wherein the target domain prediction model is obtained by means of training using a prediction result of a source domain prediction model and user training data of a target domain, and the source domain prediction model is obtained by means of training using user training data of a source domain; and using the target domain prediction model to predict data to be detected of a user of the target domain, so as to obtain an information pushing result for the corresponding user. Therefore, when a new service or a new application scenario goes online, a corresponding prediction model can be quickly and accurately provided in the embodiments of the present disclosure.

Description

信息推送方法和系统、模型训练方法及相关设备Information push method and system, model training method and related equipment
相关申请的交叉引用CROSS-REFERENCE TO RELATED APPLICATIONS
本申请是以CN申请号为202011397968.4,申请日为2020年12月3日的申请为基础,并主张其优先权,该CN申请的公开内容在此作为整体引入本申请中。This application is based on the CN application number 202011397968.4 and the filing date is December 3, 2020, and claims its priority. The disclosure of the CN application is hereby incorporated into this application as a whole.
技术领域technical field
本公开涉及数据处理领域,特别涉及一种信息推送方法和系统、模型训练方法及相关设备。The present disclosure relates to the field of data processing, and in particular, to an information push method and system, a model training method and related equipment.
背景技术Background technique
在企业内部,往往会有一系列产品矩阵,其中每项产品都有特定业务场景的用户数据。在相关技术的个性化推荐方法中,通常基于实际自身的业务场景数据进行模型训练。Within an enterprise, there is often a series of product matrices, each of which has user data for a specific business scenario. In the personalized recommendation method of the related art, model training is usually performed based on actual own business scene data.
发明内容SUMMARY OF THE INVENTION
根据本公开一些实施例的第一个方面,提供一种信息推送方法,包括:获取离线系统发送的、完成训练的目标域预测模型,其中,目标域预测模型是利用源域预测模型的预测结果、以及目标域的用户训练数据进行训练而获得的,源域预测模型是利用源域的用户训练数据进行训练而获得的;利用目标域预测模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。According to a first aspect of some embodiments of the present disclosure, an information push method is provided, including: acquiring a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is a prediction result using a source domain prediction model , and the user training data of the target domain are obtained by training, the source domain prediction model is obtained by training the user training data of the source domain; using the target domain prediction model, the user data to be tested in the target domain is predicted to Obtain the information push result to the corresponding user.
在一些实施例中,定期获取离线系统发送的、完成训练的目标域预测模型以及目标域预测模型的编码值,并且信息推送方法还包括:在获取的预测网络模型的编码值与当前使用的目标域预测模型的编码值不同的情况下,通过对获取的目标域预测模型的版本验证;在对获取的目标域预测模型的上线验证通过的情况下,采用获取的目标域预测模型替换当前使用的目标域预测模型,其中,上线验证包括版本验证。In some embodiments, the target domain prediction model and the encoded value of the target domain prediction model that have been trained and sent by the offline system are periodically acquired, and the information push method further includes: comparing the acquired encoded value of the prediction network model with the currently used target If the coding values of the domain prediction models are different, verify the version of the obtained target domain prediction model; if the online verification of the obtained target domain prediction model passes, use the obtained target domain prediction model to replace the currently used one. The target domain prediction model, wherein the online verification includes version verification.
在一些实施例中,上线验证还包括模型验证,并且信息推送方法还包括:在获取的目标域预测模型的预设参数的值在预设范围内的情况下,通过对获取的目标域预测模型的模型验证。In some embodiments, the online verification further includes model verification, and the information push method further includes: in the case that the acquired value of the preset parameter of the target domain prediction model is within a preset range, by model validation.
在一些实施例中,获取的目标域预测模型为固化图文件,在固化图文件中,通过 训练确定的、目标域预测模型的参数被转换为常量。In some embodiments, the obtained target domain prediction model is a solidification map file, in which the parameters of the target domain prediction model determined through training are converted into constants.
在一些实施例中,用户待测数据包括用户的特征、产品的特征,对相应用户的信息推送结果为是否为用户推荐产品的结果。In some embodiments, the user data to be measured includes user characteristics and product characteristics, and the information push result to the corresponding user is a result of whether the user recommends a product.
根据本公开一些实施例的第二个方面,提供一种用于信息推送的模型训练方法,包括:采用源域的用户训练数据训练源域预测模型;将同一用户对应的源域特征数据和目标域特征数据分别输入到源域预测模型和目标域预测模型中,获得用户对应的源域预测结果和目标域预测结果;根据源域预测结果与目标域预测结果的差距、以及用户对应的标记值与目标域预测结果的差距,对目标域预测模型的参数进行调整。According to a second aspect of some embodiments of the present disclosure, there is provided a model training method for information push, including: training a source domain prediction model by using user training data of a source domain; The domain feature data is input into the source domain prediction model and the target domain prediction model respectively, and the corresponding source domain prediction results and target domain prediction results of the user are obtained; according to the difference between the source domain prediction result and the target domain prediction result, and the corresponding tag value of the user Adjust the parameters of the target domain prediction model based on the difference between the prediction result of the target domain and the target domain.
在一些实施例中,对目标域预测模型的参数进行调整包括:基于目标域预测模型的损失函数,对目标域预测模型的参数进行调整,其中,目标域预测模型的损失函数包括源域预测结果与目标域预测结果的交叉熵、以及用户对应的标记值与目标域预测结果的交叉熵。In some embodiments, adjusting the parameters of the target domain prediction model includes: adjusting the parameters of the target domain prediction model based on a loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the source domain prediction result The cross-entropy with the target domain prediction result, and the cross-entropy between the user-corresponding tag value and the target domain prediction result.
在一些实施例中,目标域预测模型的复杂度高于源域预测模型的复杂度。In some embodiments, the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model.
在一些实施例中,目标域预测模型的输入数据的维度数大于源域预测模型的输入数据的维度数。In some embodiments, the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
在一些实施例中,模型训练方法还包括:获取多个用户中的每一个对应的第一特征数据和第二特征数据,其中,同一用户对应的第二特征数据包括第一特征数据的部分维度的特征;采用第一特征数据训练第一预备模型;将同一用户对应的第一特征数据和第二特征数据分别输入到第一预备模型和第二预备模型中,获得用户对应的第一预备模型预测结果和第二预备模型预测结果;根据第一预备模型预测结果与第二预备模型预测结果的差距、以及用户对应的标记值与第一预备模型预测结果的差距,对第二预备模型的参数进行调整;将调整后的第二预备模型作为源域预测模型。In some embodiments, the model training method further includes: acquiring first feature data and second feature data corresponding to each of the multiple users, wherein the second feature data corresponding to the same user includes a partial dimension of the first feature data feature; use the first feature data to train the first preliminary model; input the first feature data and the second feature data corresponding to the same user into the first preliminary model and the second preliminary model respectively, and obtain the first preliminary model corresponding to the user The prediction result and the prediction result of the second preparatory model; according to the difference between the prediction result of the first preparatory model and the prediction result of the second preparatory model, and the difference between the mark value corresponding to the user and the prediction result of the first preparatory model, the parameters of the second preparatory model Make adjustments; use the adjusted second preliminary model as the source domain prediction model.
在一些实施例中,第一预备模型和第二预备模型除输入层不同以外,具有相同的网络模型结构。In some embodiments, the first preliminary model and the second preliminary model have the same network model structure except that the input layer is different.
在一些实施例中,用户对应的源域预测结果和目标域预测结果是同一物品所关联的推荐结果。In some embodiments, the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
根据本公开一些实施例的第三个方面,提供一种信息推送装置,包括:获取模块,被配置为获取离线系统发送的、完成训练的目标域预测模型,其中,目标域预测模型是利用源域预测模型的预测结果、以及目标域的用户训练数据进行训练而获得的,源域预测模型是利用源域的用户训练数据进行训练而获得的;预测模块,被配置为利用 目标域预测模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。According to a third aspect of some embodiments of the present disclosure, there is provided an apparatus for pushing information, comprising: an acquisition module configured to acquire a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is obtained by using a source The prediction result of the domain prediction model and the user training data of the target domain are obtained by training, and the source domain prediction model is obtained by training the user training data of the source domain; the prediction module is configured to use the target domain prediction model, Predict the user data to be measured in the target domain to obtain information push results for the corresponding users.
根据本公开一些实施例的第四个方面,提供一种用于信息推送的模型训练装置,包括:源域训练模块,被配置为采用源域的用户训练数据训练源域预测模型;目标域训练模块,被配置为将同一用户对应的源域特征数据和目标域特征数据分别输入到源域预测模型和目标域预测模型中,获得用户对应的源域预测结果和目标域预测结果;以及,根据源域预测结果与目标域预测结果的差距、以及用户对应的标记值与目标域预测结果的差距,对目标域预测模型的参数进行调整。According to a fourth aspect of some embodiments of the present disclosure, there is provided a model training apparatus for information push, including: a source domain training module configured to train a source domain prediction model using user training data of the source domain; target domain training The module is configured to input the source domain feature data and the target domain feature data corresponding to the same user into the source domain prediction model and the target domain prediction model, respectively, to obtain the source domain prediction result and the target domain prediction result corresponding to the user; and, according to The difference between the prediction result of the source domain and the prediction result of the target domain, as well as the difference between the mark value corresponding to the user and the prediction result of the target domain, adjust the parameters of the prediction model of the target domain.
根据本公开一些实施例的第五个方面,提供一种信息推送系统,包括:信息推送装置;以及用于信息推送的模型训练装置。According to a fifth aspect of some embodiments of the present disclosure, an information push system is provided, including: an information push apparatus; and a model training apparatus for information push.
根据本公开一些实施例的第六个方面,提供一种信息推送装置,包括:存储器;以及耦接至存储器的处理器,处理器被配置为基于存储在存储器中的指令,执行前述任意一种信息推送方法。According to a sixth aspect of some embodiments of the present disclosure, there is provided an information pushing apparatus, comprising: a memory; and a processor coupled to the memory, the processor being configured to execute any one of the foregoing based on instructions stored in the memory Information push method.
根据本公开一些实施例的第七个方面,提供一种用于信息推送的模型训练装置,包括:存储器;以及耦接至存储器的处理器,处理器被配置为基于存储在存储器中的指令,执行前述任意一种用于信息推送的模型训练方法。According to a seventh aspect of some embodiments of the present disclosure, there is provided a model training apparatus for information push, comprising: a memory; and a processor coupled to the memory, the processor being configured to, based on instructions stored in the memory, Perform any one of the aforementioned model training methods for information push.
根据本公开一些实施例的第八个方面,提供一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现前述任意一种信息推送方法、或者前述任意一种用于信息推送的模型训练方法。According to an eighth aspect of some embodiments of the present disclosure, there is provided a computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, implements any one of the foregoing information pushing methods, or any of the foregoing usage methods. Model training method for information push.
通过以下参照附图对本公开的示例性实施例的详细描述,本公开的其它特征及其优点将会变得清楚。Other features of the present disclosure and advantages thereof will become apparent from the following detailed description of exemplary embodiments of the present disclosure with reference to the accompanying drawings.
附图说明Description of drawings
为了更清楚地说明本公开实施例或现有技术中的技术方案,下面将对实施例或现有技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本公开的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present disclosure or the technical solutions in the prior art, the following briefly introduces the accompanying drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present disclosure, and for those of ordinary skill in the art, other drawings can also be obtained from these drawings without any creative effort.
图1示出了根据本公开一些实施例的信息推送方法的流程示意图。FIG. 1 shows a schematic flowchart of an information push method according to some embodiments of the present disclosure.
图2示出了根据本公开一些实施例的用于信息推送的模型训练方法的流程示意图。FIG. 2 shows a schematic flowchart of a model training method for information push according to some embodiments of the present disclosure.
图3示出了根据本公开一些实施例的预训练方法的流程示意图。FIG. 3 shows a schematic flowchart of a pre-training method according to some embodiments of the present disclosure.
图4示出了根据本公开一些实施例的模型验证方法的流程示意图。FIG. 4 shows a schematic flowchart of a model verification method according to some embodiments of the present disclosure.
图5示出了根据本公开一些实施例的信息推送装置的结构示意图。FIG. 5 shows a schematic structural diagram of an information pushing apparatus according to some embodiments of the present disclosure.
图6示出了根据本公开一些实施例的用于信息推送的模型训练装置的结构示意图。FIG. 6 shows a schematic structural diagram of a model training apparatus for information push according to some embodiments of the present disclosure.
图7示出了根据本公开一些实施例的信息推送系统的结构示意图。FIG. 7 shows a schematic structural diagram of an information push system according to some embodiments of the present disclosure.
图8示出了根据本公开一些实施例的数据处理装置的结构示意图。FIG. 8 shows a schematic structural diagram of a data processing apparatus according to some embodiments of the present disclosure.
图9示出了根据本公开另一些实施例的数据处理装置的结构示意图。FIG. 9 shows a schematic structural diagram of a data processing apparatus according to other embodiments of the present disclosure.
具体实施方式Detailed ways
下面将结合本公开实施例中的附图,对本公开实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本公开一部分实施例,而不是全部的实施例。以下对至少一个示例性实施例的描述实际上仅仅是说明性的,决不作为对本公开及其应用或使用的任何限制。基于本公开中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本公开保护的范围。The technical solutions in the embodiments of the present disclosure will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present disclosure. Obviously, the described embodiments are only a part of the embodiments of the present disclosure, but not all of the embodiments. The following description of at least one exemplary embodiment is merely illustrative in nature and is in no way intended to limit the disclosure, its application or uses in any way. Based on the embodiments in the present disclosure, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present disclosure.
除非另外具体说明,否则在这些实施例中阐述的部件和步骤的相对布置、数字表达式和数值不限制本公开的范围。The relative arrangement of the components and steps, the numerical expressions and numerical values set forth in these embodiments do not limit the scope of the present disclosure unless specifically stated otherwise.
同时,应当明白,为了便于描述,附图中所示出的各个部分的尺寸并不是按照实际的比例关系绘制的。Meanwhile, it should be understood that, for the convenience of description, the dimensions of various parts shown in the accompanying drawings are not drawn in an actual proportional relationship.
对于相关领域普通技术人员已知的技术、方法和设备可能不作详细讨论,但在适当情况下,所述技术、方法和设备应当被视为授权说明书的一部分。Techniques, methods, and devices known to those of ordinary skill in the relevant art may not be discussed in detail, but where appropriate, such techniques, methods, and devices should be considered part of the authorized description.
在这里示出和讨论的所有示例中,任何具体值应被解释为仅仅是示例性的,而不是作为限制。因此,示例性实施例的其它示例可以具有不同的值。In all examples shown and discussed herein, any specific value should be construed as illustrative only and not as limiting. Accordingly, other examples of exemplary embodiments may have different values.
应注意到:相似的标号和字母在下面的附图中表示类似项,因此,一旦某一项在一个附图中被定义,则在随后的附图中不需要对其进行进一步讨论。It should be noted that like numerals and letters refer to like items in the following figures, so once an item is defined in one figure, it does not require further discussion in subsequent figures.
在部分业务场景下,一些处于成长期的产品往往由于用户数据不够丰富,导致进行该业务场景下的模型训练时,模型较难获得很好的泛化能力,从而导致个性化效果较差。In some business scenarios, some products in the growing stage often have insufficient user data, which makes it difficult for the model to obtain good generalization ability during model training in this business scenario, resulting in poor personalization effect.
本公开实施例所要解决的一个技术问题是:如何提高新业务场景下用户推荐的准确性。A technical problem to be solved by the embodiments of the present disclosure is: how to improve the accuracy of user recommendation in a new business scenario.
图1示出了根据本公开一些实施例的信息推送方法的流程示意图。如图1所示,该实施例的信息推送方法包括步骤S102~S104。FIG. 1 shows a schematic flowchart of an information push method according to some embodiments of the present disclosure. As shown in FIG. 1 , the information pushing method of this embodiment includes steps S102 to S104.
在步骤S102中,获取离线系统发送的、完成训练的预测网络模型,其中,预测网络模型是利用预测网络模型的教师网络模型的预测结果、以及目标域的用户训练数据进行训练而获得的,预测模型的教师网络模型是利用源域的用户训练数据进行训练而获得的。In step S102, the prediction network model sent by the offline system and the training has been completed is obtained, wherein the prediction network model is obtained by using the prediction result of the teacher network model of the prediction network model and the user training data of the target domain for training. The model's teacher network model is obtained by training with user training data from the source domain.
在一些实施例中,“域”代表不同的业务场景。源域例如为成熟的、可使用的用户数据量较大的业务场景,目标域例如为用户数据量较小的新业务场景。In some embodiments, "domains" represent different business scenarios. The source domain is, for example, a mature and usable business scenario with a large amount of user data, and the target domain is, for example, a new business scenario with a small amount of user data.
例如,某公司主营业务为电子商务,其电子商务平台具有充足的用户购买数据。如果预测用户购买商品的可能性、从而为用户推荐商品,则可以利用用户购买数据训练预测模型。之后,该公司拓展了金融品类业务,原电子商务平台的部分用户也开通了金融品类业务。然而,由于开设此类业务的用户有限,如果仅依赖于用户金融业务数据训练预测模型,则很难得到高准确率的模型。此时,可以将用户购买数据视为源域的数据,将用户金融业务数据视为目标域的数据。通过借助源域的数据的训练结果,辅助目标域预测模型的训练。For example, a company's main business is e-commerce, and its e-commerce platform has sufficient user purchase data. If the likelihood of a user purchasing an item is predicted, so as to recommend an item for the user, the prediction model can be trained using the user purchase data. After that, the company expanded its financial category business, and some users of the original e-commerce platform also opened financial category business. However, due to the limited number of users who open such services, it is difficult to obtain models with high accuracy if only relying on user financial business data to train predictive models. At this time, the user purchase data can be regarded as the data of the source domain, and the user's financial business data can be regarded as the data of the target domain. By using the training results of the data in the source domain, the training of the prediction model in the target domain is assisted.
在相关技术中,通常使用的神经网络模型的训练方法为:根据模型对训练数据的预测结果与标记值的差距进行训练。然而,由于目标域存在数据量不足的问题,因此考虑在训练过程中进一步借助采用数据量充足的训练数据进行训练而获得的源域预测模型。在对目标域预测模型进行训练时,不仅考虑对目标域预测模型输入数据本身的预测准确性,还考虑目标域预测模型的预测结果与源域预测模型的预测结果的差距,使得目标域预测模型能够学习到源域预测模型学习的知识。从而,在目标域的数据量较少的情况下,也能够得到具有较高准确率的目标域预测模型。关于目标域预测模型的具体训练方法将在后面的实施例中进行进一步介绍。In the related art, a commonly used training method of a neural network model is: training according to the difference between the prediction result of the training data and the marked value by the model. However, due to the problem of insufficient data volume in the target domain, a source domain prediction model obtained by training with sufficient data volume is considered in the training process. When training the target domain prediction model, not only the prediction accuracy of the input data of the target domain prediction model itself is considered, but also the difference between the prediction results of the target domain prediction model and the prediction results of the source domain prediction model, so that the target domain prediction model Can learn the knowledge learned by the source domain prediction model. Therefore, even when the amount of data in the target domain is small, a target domain prediction model with higher accuracy can be obtained. The specific training method of the target domain prediction model will be further introduced in the following embodiments.
在一些实施例中,获取的目标域预测模型为固化图文件,在固化图文件中,通过训练确定的、所述目标域预测模型的参数被转换为常量。从而,在线系统获得了更轻量的模型文件,有助于提高新版本模型的上线效率。In some embodiments, the acquired target domain prediction model is a solidified map file, and in the solidified map file, the parameters of the target domain prediction model determined through training are converted into constants. Therefore, the online system obtains a lighter model file, which helps to improve the online efficiency of the new version of the model.
在步骤S104中,利用预测网络模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。In step S104, the prediction network model is used to predict the user data to be measured in the target domain, so as to obtain an information push result for the corresponding user.
在一些实施例中,用户待测数据包括用户的特征、产品的特征,此外还可以包括环境特征(例如时间特征、其他用户的特征、相关产品的特征、平台活动的特征等等)等等。In some embodiments, the user data to be measured includes user characteristics, product characteristics, and environmental characteristics (eg, time characteristics, characteristics of other users, characteristics of related products, characteristics of platform activities, etc.) and the like.
对相应用户的信息推送结果为:是否为用户推荐产品的结果。例如,对相应用户 的信息推送结果包括第一指示符和第二指示符,所述第一指示符表示为所述用户推荐所述产品,所述第二指示符表示不为所述用户推荐所述产品。例如,用户A的用户待测数据包括用户A的特征以及某洗发水的特征,则信息推送结果为是否为用户A推荐该洗发水。在一些实施例中,预测网络模型输出判断概率,根据判断概率和预设概率的比较结果,确定推荐结果。The information push result to the corresponding user is: the result of whether the product is recommended for the user. For example, the information push result for the corresponding user includes a first indicator and a second indicator, the first indicator indicates that the product is recommended for the user, and the second indicator indicates that the product is not recommended for the user. said product. For example, the user data to be tested of user A includes the characteristics of user A and the characteristics of a certain shampoo, and the information push result is whether the shampoo is recommended for user A. In some embodiments, the prediction network model outputs the judgment probability, and the recommendation result is determined according to the comparison result between the judgment probability and the preset probability.
在确定推荐结果后,可以利用前端应用模块和后台服务器之间的推送接口,将推荐结果以预设的信息格式发送到用户的终端。After the recommendation result is determined, a push interface between the front-end application module and the back-end server can be used to send the recommendation result to the user's terminal in a preset information format.
在一些实施例中,在信息推送结果为向用户推荐相应产品的情况下,将相应产品的信息发送到用户终端;在信息推送结果为不向用户推荐相应产品的情况下,不向用户发送相应产品的信息。In some embodiments, when the information push result is to recommend a corresponding product to the user, the information of the corresponding product is sent to the user terminal; when the information push result is that the corresponding product is not recommended to the user, the corresponding product is not sent to the user. product information.
上述实施例有效地利用了跨域数据信息,聚合了多个数据孤岛所能学习到的知识,并结合源域预测模型的训练结果辅助目标域预测模型的训练,进一步提升了模型的泛化能力。从而,当新业务或新的应用场景上线时,本公开的实施例能够快速、准确地提供相应的预测模型。The above embodiment effectively utilizes cross-domain data information, aggregates the knowledge that can be learned from multiple data islands, and assists the training of the target domain prediction model in combination with the training results of the source domain prediction model, which further improves the generalization ability of the model. . Therefore, when a new service or a new application scenario is launched, the embodiments of the present disclosure can quickly and accurately provide a corresponding prediction model.
下面参考图2描述本公开用于信息推送的模型训练方法的实施例。The following describes an embodiment of the model training method for information push of the present disclosure with reference to FIG. 2 .
图2示出了根据本公开一些实施例的用于信息推送的模型训练方法的流程示意图。如图2所示,该实施例的用于信息推送的模型训练方法包括步骤S202~S206。FIG. 2 shows a schematic flowchart of a model training method for information push according to some embodiments of the present disclosure. As shown in FIG. 2 , the model training method for information push in this embodiment includes steps S202 to S206.
在一些实施例中,在进行训练过程之前,收集用户的曝光日志、点击日志、产品内容正排索引、用户画像特征日志等等。在收集这些数据后,例如通过设备号等标识进行数据融合,并去除缺少了有效的特征的脏数据,从而获得多条用户数据。在一些实施例中,还可以按照预设的比例从获得的数据中抽取正样本和负样本;此外,还可以过滤掉不符合预设条件的样本,例如对产品的浏览时常低于一定阈值的样本等等。In some embodiments, prior to the training process, the user's exposure logs, click logs, product content forward index, user profile feature logs, and the like are collected. After collecting these data, for example, data fusion is performed through identifications such as device numbers, and dirty data lacking effective features is removed, thereby obtaining multiple pieces of user data. In some embodiments, positive samples and negative samples can also be extracted from the obtained data according to a preset ratio; in addition, samples that do not meet preset conditions can also be filtered out, for example, the browsing of products is often lower than a certain threshold. samples etc.
在步骤S202中,采用源域的用户训练数据训练源域预测模型。In step S202, the source domain prediction model is trained using the user training data of the source domain.
在步骤S204中,将同一用户对应的源域特征数据和目标域特征数据分别输入到源域预测模型和目标域预测模型中,获得用户对应的源域预测结果和目标域预测结果。即,将某用户对应的源域特征数据输入到源域预测模型中,获得源域预测结果;再将同一用户对应的目标域特征数据输入到目标域预测模型中,获得目标域预测结果。In step S204, the source domain feature data and the target domain feature data corresponding to the same user are input into the source domain prediction model and the target domain prediction model, respectively, to obtain the source domain prediction result and the target domain prediction result corresponding to the user. That is, input the source domain feature data corresponding to a user into the source domain prediction model to obtain the source domain prediction result; then input the target domain feature data corresponding to the same user into the target domain prediction model to obtain the target domain prediction result.
在步骤S206中,根据源域预测结果与目标域预测结果的差距、以及用户对应的标记值与目标域预测结果的差距,对目标域预测模型的参数进行调整。In step S206, the parameters of the target domain prediction model are adjusted according to the difference between the prediction result of the source domain and the prediction result of the target domain, and the difference between the mark value corresponding to the user and the prediction result of the target domain.
在一些实施例中,基于目标域预测模型的损失函数,对目标域预测模型的参数进 行调整,其中,目标域预测模型的损失函数包括源域预测结果与目标域预测结果的交叉熵、以及用户对应的标记值与目标域预测结果的交叉熵。例如,将公式(1)作为目标域预测模型的损失函数。In some embodiments, the parameters of the target domain prediction model are adjusted based on the loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the cross entropy of the source domain prediction result and the target domain prediction result, and the user The cross-entropy of the corresponding label value and the prediction result of the target domain. For example, take Equation (1) as the loss function of the target domain prediction model.
L=CE(y,pred)+αCE(q,pred)     (1)L=CE(y,pred)+αCE(q,pred) (1)
在公式(1)中,L表示损失函数的值;CE(*,*)表示对括号内两个变量计算交叉熵;y表示标记值;pred表示目标域预测结果;q表示源域预测结果;α表示预设参数。In formula (1), L represents the value of the loss function; CE(*,*) represents the calculation of cross entropy for the two variables in parentheses; y represents the label value; pred represents the prediction result of the target domain; q represents the prediction result of the source domain; α represents a preset parameter.
在一些实施例中,q的值通过公式(2)代表的softmax层确定。In some embodiments, the value of q is determined by the softmax layer represented by equation (2).
Figure PCTCN2021132104-appb-000001
Figure PCTCN2021132104-appb-000001
在公式(2)中,q i表示源域预测模型分类结果中的第i类所对应的概率;z i表示第i类所对应的、向softmax层输入的结果;j表示源域预测模型给出的各个类别的标识,z j表示第j类所对应的、向softmax层输入的结果;T表示预设的“温度值”参数,用于表示对源域预测模型预测结果的软化程度。在一些实施例中,T=10,其中,经过发明人的测试,该数值能够获得较好的训练效果。在确定各个类所对应的q i后,将其中的最大值作为q的值。 In formula (2), qi represents the probability corresponding to the i-th class in the classification result of the source domain prediction model; z i represents the result input to the softmax layer corresponding to the i -th class; j represents the source domain prediction model given zj represents the result input to the softmax layer corresponding to the jth class; T represents the preset "temperature value" parameter, which is used to represent the softening degree of the prediction result of the source domain prediction model. In some embodiments, T=10, wherein, after the inventor's test, this value can obtain a better training effect. After determining the q i corresponding to each class, the maximum value among them is taken as the value of q.
在一些实施例中,用户对应的源域预测结果和目标域预测结果是同一物品所关联的推荐结果。In some embodiments, the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
例如,某个用户即在作为源域的电子商务平台具有用户购买数据,又在作为目标域的金融业务平台具有金融业务数据。设针对某手机进行推荐。则该用户对应的源域预测结果例如为是否为用户推荐手机,该用户对应的目标域预测结果例如为是否为用户推荐分期购买手机的金融业务。如果针对同一用户和同一物品,源域预测结果和目标域预测结果均为推荐,则可以认为源域预测模型和目标域预测模型的输出值是相同的。For example, a certain user has user purchase data on the e-commerce platform as the source domain and financial business data on the financial business platform as the target domain. Let's make a recommendation for a certain mobile phone. The source domain prediction result corresponding to the user is, for example, whether a mobile phone is recommended for the user, and the target domain prediction result corresponding to the user is, for example, whether the user recommends a financial service for purchasing mobile phones by installments. If for the same user and the same item, both the source domain prediction result and the target domain prediction result are recommended, it can be considered that the output values of the source domain prediction model and the target domain prediction model are the same.
在一些实施例中,目标域预测模型的复杂度高于源域预测模型的复杂度。模型的复杂度例如通过模型的层数、具有预设机制(例如注意力机制)的层的数量、参数数量等等。In some embodiments, the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model. The complexity of the model is measured, for example, by the number of layers of the model, the number of layers with preset mechanisms (eg, attention mechanisms), the number of parameters, and so on.
在一些实施例中,目标域预测模型的输入数据的维度数大于源域预测模型的输入数据的维度数。In some embodiments, the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
在相关技术中,往往在后训练的模型的复杂度低于在先训练的模型的复杂度,或者在后训练的模型相较于在先训练的模型,处理更低维度的数据。这是为了便于在后训练的模型符合轻量级运行的要求,例如将服务器端的模型移植到移动终端中运行时, 需要考虑移动终端的计算能力、存储能力和数据处理能力。然而,本公开的实施例应用于数据跨域学习的场景中,因此目标域预测模型的复杂度可以高于源域预测模型的复杂度,目标域预测模型的输入数据也可以是更复杂的数据。从而,即使对于新的业务场景,也能够实现较为复杂的信息推送思路。In the related art, the complexity of the model trained later is often lower than the complexity of the model trained earlier, or the model trained later can process lower-dimensional data than the model trained earlier. This is to facilitate the post-trained model to meet the requirements of lightweight operation. For example, when transplanting a server-side model to a mobile terminal to run, the computing capability, storage capability, and data processing capability of the mobile terminal need to be considered. However, the embodiments of the present disclosure are applied in the scenario of data cross-domain learning, so the complexity of the target domain prediction model may be higher than that of the source domain prediction model, and the input data of the target domain prediction model may also be more complex data . Therefore, even for new business scenarios, more complex information push ideas can be implemented.
在一些实施例中,为了进一步提升训练目标域预测模型的效率,源域预测模型使用尽可能少的输入维度。为了在提升训练效率的同时还保证训练的准确性,在一些实施例中,通过预训练的过程获得源域预测模型。下面参考图3描述本公开预训练方法的实施例。该实施例的训练思想与训练目标域预测模型的思路类似,都是借助一个模型的训练结果提升另一个模型的训练准确度。In some embodiments, in order to further improve the efficiency of training the target domain prediction model, the source domain prediction model uses as few input dimensions as possible. In order to improve the training efficiency and also ensure the accuracy of the training, in some embodiments, the source domain prediction model is obtained through a pre-training process. An embodiment of the pre-training method of the present disclosure is described below with reference to FIG. 3 . The training idea of this embodiment is similar to the idea of training the target domain prediction model, both of which use the training result of one model to improve the training accuracy of another model.
图3示出了根据本公开一些实施例的预训练方法的流程示意图。如图3所示,该实施例的预训练方法包括步骤S302~S310。FIG. 3 shows a schematic flowchart of a pre-training method according to some embodiments of the present disclosure. As shown in FIG. 3 , the pre-training method of this embodiment includes steps S302 to S310.
在步骤S302中,获取多个用户中的每一个对应的第一特征数据和第二特征数据,其中,同一用户对应的第二特征数据包括第一特征数据的部分维度的特征。In step S302, first feature data and second feature data corresponding to each of the multiple users are acquired, wherein the second feature data corresponding to the same user includes features of a partial dimension of the first feature data.
在一些实施例中,第一特征数据和第二特征数据都是源域的数据,区别在于二者的特征数量不同。例如,第一特征数据为1000维的用户购买数据,则第二特征数据取其中的部分维度,形成100维的用户购买数据。In some embodiments, the first feature data and the second feature data are both data of the source domain, and the difference lies in the number of features of the two. For example, if the first feature data is 1000-dimensional user purchase data, the second feature data takes part of the dimensions to form 100-dimensional user purchase data.
在步骤S304中,采用第一特征数据训练第一预备模型。In step S304, a first preliminary model is trained by using the first feature data.
在步骤S306中,将同一用户对应的第一特征数据和第二特征数据分别输入到第一预备模型和第二预备模型中,获得用户对应的第一预备模型预测结果和第二预备模型预测结果。In step S306, the first feature data and the second feature data corresponding to the same user are input into the first preliminary model and the second preliminary model respectively, and the first preliminary model prediction result and the second preliminary model prediction result corresponding to the user are obtained .
在步骤S308中,根据第一预备模型预测结果与第二预备模型预测结果的差距、以及用户对应的标记值与第一预备模型预测结果的差距,对第二预备模型的参数进行调整。In step S308, the parameters of the second preliminary model are adjusted according to the difference between the prediction result of the first preliminary model and the prediction result of the second preliminary model, and the difference between the mark value corresponding to the user and the prediction result of the first preliminary model.
例如,参考目标域预测模型的训练过程,该差距可以通过交叉熵来体现。在一些实施例中,基于第二预备模型的损失函数,对第二预备模型的参数进行调整,其中,第二预备模型的损失函数包括第一预备模型预测结果与第二预备模型预测结果的交叉熵、以及该用户对应的标记值与第二预备模型预测结果的交叉熵。例如,将公式(3)作为目标域预测模型的损失函数。For example, referring to the training process of the target domain prediction model, this gap can be represented by cross-entropy. In some embodiments, the parameters of the second preliminary model are adjusted based on the loss function of the second preliminary model, wherein the loss function of the second preliminary model includes the intersection of the prediction result of the first preliminary model and the prediction result of the second preliminary model entropy, and the cross entropy between the label value corresponding to the user and the prediction result of the second preliminary model. For example, take Equation (3) as the loss function of the target domain prediction model.
L pre=(1-λ)CE(y′,pred′)+λCE(q′,pred′)   (3) L pre =(1-λ)CE(y',pred')+λCE(q',pred') (3)
在公式(3)中,L pre表示损失函数的值;CE(*,*)表示对括号内两个变量计算交叉熵; y’表示标记值;pred’表示第二预备模型预测结果;q’表示第一预备模型预测结果。 In formula (3), L pre represents the value of the loss function; CE(*,*) represents the calculation of the cross entropy for the two variables in brackets; y' represents the label value; pred' represents the prediction result of the second preliminary model; q' Indicates the prediction result of the first preliminary model.
第二预备网络虽然具有较少的输入维度,但其在训练过程中还学习了由更多维度的数据训练获得的第一预备网络的训练结果。因此,第二预备网络也有较高的预测准确率。Although the second preparatory network has fewer input dimensions, it also learns the training results of the first preparatory network obtained by training data with more dimensions during the training process. Therefore, the second preparatory network also has a higher prediction accuracy.
在一些实施例中,第一预备模型和第二预备模型除输入层不同以外,模型的其他部分具有相同的网络模型结构。从而,可以使得第二预备模型的训练过程更关注对未使用特征的知识提取。In some embodiments, the first preliminary model and the second preliminary model have the same network model structure except for different input layers. Thus, the training process of the second preparatory model can be made to focus more on knowledge extraction of unused features.
在步骤S310中,将调整后的第二预备模型作为源域预测模型。在经过多次调整迭代后,第二预备模型完成训练。In step S310, the adjusted second preliminary model is used as the source domain prediction model. After a number of tuning iterations, the second preparatory model completes training.
在一些实施例中,还可以对完成训练的第二预备模型进行测试。如果测试准确率大于预设值,则将第二预备模型作为源域预测模型。如果测试准确率不大于预设值,可以选择重新训练;或者可以认为第二预备模型的输入特征不足以表征用户,需要重新选择特征作为第二预备模型的输入特征。In some embodiments, the trained second preliminary model may also be tested. If the test accuracy rate is greater than the preset value, the second preliminary model is used as the source domain prediction model. If the test accuracy rate is not greater than the preset value, retraining may be selected; or it may be considered that the input features of the second preliminary model are insufficient to characterize the user, and the features need to be re-selected as the input features of the second preliminary model.
在通过上述预训练过程获得第二预备模型、进而确定源域预测模型后,可以使得源域预测模型使用较少的输入特征、但具备与多维特征表征的模型相当的预测准确率,从而间接提高了目标域预测模型的训练效率。After the second preliminary model is obtained through the above pre-training process and the source domain prediction model is determined, the source domain prediction model can be made to use fewer input features but have a prediction accuracy comparable to the model represented by multi-dimensional features, thereby indirectly improving The training efficiency of the target domain prediction model.
在一些实施例中,目标域预测模型的训练流程以及信息推送流程可以分别部署在离线系统和在线系统。离线系统可以通过在业务进行过程中积累的数据定期更新训练的目标域预测模型,并将其发送给在线系统进行应用。In some embodiments, the training process and the information pushing process of the target domain prediction model can be deployed in an offline system and an online system, respectively. The offline system can periodically update the trained target domain prediction model through the data accumulated during the business process, and send it to the online system for application.
在一些实施例中,在线系统在更新目标域预测模型前,还可以对其先行校验。下面参考图4描述本公开模型验证方法的实施例。In some embodiments, the online system may also verify the target domain prediction model before updating it. An embodiment of the model verification method of the present disclosure is described below with reference to FIG. 4 .
图4示出了根据本公开一些实施例的模型验证方法的流程示意图。如图4所示,该实施例的模型验证方法包括步骤S402~S408。FIG. 4 shows a schematic flowchart of a model verification method according to some embodiments of the present disclosure. As shown in FIG. 4 , the model verification method of this embodiment includes steps S402 to S408.
在步骤S402中,定期获取离线系统发送的、完成训练的目标域预测模型以及目标域预测模型的编码值。在一些实施例中,该编码值为MD5编码值。In step S402, regularly acquire the target domain prediction model and the encoded value of the target domain prediction model sent by the offline system and completed training. In some embodiments, the encoded value is an MD5 encoded value.
在步骤S404中,在获取的预测网络模型的编码值与当前使用的目标域预测模型的编码值不同的情况下,通过对获取的目标域预测模型的版本验证。In step S404, in the case that the obtained coded value of the prediction network model is different from the coded value of the currently used target domain prediction model, verify the version of the obtained target domain prediction model.
例如,离线系统将最新训练的某版本的目标域预测模型发送了两次。在第一次发送后,在线系统就已经将其上线使用。当第二次发送时,如果在线系统将同样的模型又重复执行了上线流程,则会影响系统效率、浪费系统资源。因此,通过验证编码值, 避免了重复上线的情况,节约了系统资源。For example, the offline system sends the newly trained version of the target domain prediction model twice. After the first transmission, the online system has already put it online for use. When sending for the second time, if the online system repeatedly executes the online process of the same model, it will affect the system efficiency and waste system resources. Therefore, by verifying the coded value, the situation of repeated online access is avoided, and system resources are saved.
在步骤S406中,基于版本验证结果,确定对获取的目标域预测模型的上线验证结果。In step S406, based on the version verification result, determine the online verification result of the acquired target domain prediction model.
在一些实施例中,上线验证还包括模型验证,用于验证获取的目标域预测模型的预设参数的值是否在预设范围内,例如查看关键参数是否为空等等。从而,可以及时发现发送对象错误或者传输错误,提高了系统的稳定性。In some embodiments, the online verification further includes model verification, which is used to verify whether the acquired value of the preset parameter of the target domain prediction model is within the preset range, for example, checking whether the key parameter is empty, and so on. Therefore, it is possible to find out the error of the sending object or the transmission error in time, which improves the stability of the system.
在步骤S408中,在对获取的目标域预测模型的上线验证通过的情况下,采用获取的目标域预测模型替换当前使用的目标域预测模型。In step S408, if the online verification of the acquired target domain prediction model is passed, the currently used target domain prediction model is replaced with the acquired target domain prediction model.
通过上述验证过程,可以提高模型上线过程的稳定性。Through the above verification process, the stability of the model online process can be improved.
下面参考图5描述信息推送装置的实施例。An embodiment of the information pushing apparatus is described below with reference to FIG. 5 .
图5示出了根据本公开一些实施例的信息推送装置的结构示意图。如图5所示,该实施例的信息推送装置500包括:获取模块5100,被配置为获取离线系统发送的、完成训练的目标域预测模型,其中,目标域预测模型是利用源域预测模型的预测结果、以及目标域的用户训练数据进行训练而获得的,源域预测模型是利用源域的用户训练数据进行训练而获得的;预测模块5200,被配置为利用目标域预测模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。FIG. 5 shows a schematic structural diagram of an information pushing apparatus according to some embodiments of the present disclosure. As shown in FIG. 5 , the information pushing apparatus 500 of this embodiment includes: an obtaining module 5100 configured to obtain a target domain prediction model sent by an offline system and completed training, wherein the target domain prediction model is obtained by using a source domain prediction model The prediction result and the user training data of the target domain are obtained by training, and the source domain prediction model is obtained by training the user training data of the source domain; the prediction module 5200 is configured to use the target domain prediction model to perform the target domain The user data to be tested is predicted to obtain the information push results for the corresponding users.
在一些实施例中,获取模块5100进一步被配置为定期获取离线系统发送的、完成训练的目标域预测模型以及所述目标域预测模型的编码值;信息推送装置500还包括:验证模块5300,被配置为在获取的预测网络模型的编码值与当前使用的目标域预测模型的编码值不同的情况下,通过对获取的目标域预测模型的版本验证;以及,在对所述获取的目标域预测模型的上线验证通过的情况下,采用所述获取的目标域预测模型替换所述当前使用的目标域预测模型,其中,所述上线验证包括所述版本验证。In some embodiments, the acquiring module 5100 is further configured to periodically acquire the target domain prediction model and the encoded value of the target domain prediction model that are sent by the offline system and have completed training; the information pushing apparatus 500 further includes: a verification module 5300, which is be configured to verify the version of the obtained target domain prediction model under the condition that the obtained coding value of the prediction network model is different from the coding value of the currently used target domain prediction model; and, in the obtained target domain prediction When the online verification of the model is passed, the acquired target domain prediction model is used to replace the currently used target domain prediction model, wherein the online verification includes the version verification.
在一些实施例中,上线验证还包括模型验证,验证模块5300进一步被配置为在所述获取的目标域预测模型的预设参数的值在预设范围内的情况下,通过对所述获取的目标域预测模型的模型验证。In some embodiments, the online verification further includes model verification, and the verification module 5300 is further configured to, in the case that the acquired value of the preset parameter of the target domain prediction model is within a preset range, by Model validation of predictive models for the target domain.
在一些实施例中,获取的目标域预测模型为固化图文件,在所述固化图文件中,通过训练确定的、所述目标域预测模型的参数被转换为常量。In some embodiments, the acquired target domain prediction model is a solidification map file, in which the parameters of the target domain prediction model determined through training are converted into constants.
在一些实施例中,用户待测数据包括用户的特征、产品的特征,所述对相应用户的信息推送结果为是否为所述用户推荐所述产品的结果。In some embodiments, the user data to be measured includes user characteristics and product characteristics, and the information push result to the corresponding user is a result of whether the user recommends the product.
下面参考图6描述本公开用于信息推送的模型训练装置的实施例。The following describes an embodiment of the model training apparatus for information push of the present disclosure with reference to FIG. 6 .
图6示出了根据本公开一些实施例的用于信息推送的模型训练装置的结构示意图。如图6所示,该实施例的模型训练装置600包括:源域训练模块6100,被配置为采用源域的用户训练数据训练源域预测模型;以及,目标域训练模块6200,被配置为将同一用户对应的源域特征数据和目标域特征数据分别输入到所述源域预测模型和目标域预测模型中,获得所述用户对应的源域预测结果和目标域预测结果;以及,根据所述源域预测结果与所述目标域预测结果的差距、以及所述用户对应的标记值与所述目标域预测结果的差距,对所述目标域预测模型的参数进行调整。FIG. 6 shows a schematic structural diagram of a model training apparatus for information push according to some embodiments of the present disclosure. As shown in FIG. 6 , the model training device 600 of this embodiment includes: a source domain training module 6100 configured to train a source domain prediction model using user training data in the source domain; and a target domain training module 6200 configured to The source domain feature data and target domain feature data corresponding to the same user are respectively input into the source domain prediction model and the target domain prediction model, and the source domain prediction result and the target domain prediction result corresponding to the user are obtained; and, according to the The difference between the prediction result of the source domain and the prediction result of the target domain, and the difference between the mark value corresponding to the user and the prediction result of the target domain, adjust the parameters of the prediction model of the target domain.
在一些实施例中,目标域训练模块6200进一步被配置为基于目标域预测模型的损失函数,对所述目标域预测模型的参数进行调整,其中,目标域预测模型的损失函数包括所述源域预测结果与所述目标域预测结果的交叉熵、以及所述用户对应的标记值与所述目标域预测结果的交叉熵。In some embodiments, the target domain training module 6200 is further configured to adjust parameters of the target domain prediction model based on a loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the source domain The cross entropy between the prediction result and the target domain prediction result, and the cross entropy between the tag value corresponding to the user and the target domain prediction result.
在一些实施例中,目标域预测模型的复杂度高于所述源域预测模型的复杂度。In some embodiments, the complexity of the target domain prediction model is higher than the complexity of the source domain prediction model.
在一些实施例中,目标域预测模型的输入数据的维度数大于所述源域预测模型的输入数据的维度数。In some embodiments, the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
在一些实施例中,源域训练模块6100进一步被配置为获取多个用户中的每一个对应的第一特征数据和第二特征数据,其中,同一用户对应的所述第二特征数据包括所述第一特征数据的部分维度的特征;采用第一特征数据训练第一预备模型;将同一用户对应的第一特征数据和第二特征数据分别输入到所述第一预备模型和第二预备模型中,获得所述用户对应的第一预备模型预测结果和第二预备模型预测结果;根据所述第一预备模型预测结果与所述第二预备模型预测结果的差距、以及所述用户对应的标记值与所述第一预备模型预测结果的差距,对所述第二预备模型的参数进行调整;将调整后的第二预备模型作为所述源域预测模型。In some embodiments, the source domain training module 6100 is further configured to obtain first feature data and second feature data corresponding to each of the plurality of users, wherein the second feature data corresponding to the same user includes the Partial dimension features of the first feature data; using the first feature data to train the first preliminary model; inputting the first feature data and the second feature data corresponding to the same user into the first preliminary model and the second preliminary model respectively , obtain the first preliminary model prediction result and the second preliminary model prediction result corresponding to the user; according to the difference between the first preliminary model prediction result and the second preliminary model prediction result, and the mark value corresponding to the user According to the difference between the prediction result of the first preliminary model and the first preliminary model, the parameters of the second preliminary model are adjusted; the adjusted second preliminary model is used as the source domain prediction model.
在一些实施例中,第一预备模型和所述第二预备模型除输入层不同以外,具有相同的网络模型结构。In some embodiments, the first preliminary model and the second preliminary model have the same network model structure except that the input layer is different.
在一些实施例中,用户对应的源域预测结果和目标域预测结果是同一物品所关联的推荐结果。In some embodiments, the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
下面参考图7描述本公开信息推送系统的实施例。The following describes an embodiment of the information push system of the present disclosure with reference to FIG. 7 .
图7示出了根据本公开一些实施例的信息推送系统的结构示意图。如图7所示,该实施例的信息推送系统70包括信息推送装置500和用于信息推送的模型训练装置600。FIG. 7 shows a schematic structural diagram of an information push system according to some embodiments of the present disclosure. As shown in FIG. 7 , the information pushing system 70 of this embodiment includes an information pushing apparatus 500 and a model training apparatus 600 for information pushing.
在一些实施例中,信息推送装置500部署于信息推送系统70的在线系统,模型训练装置600部署于信息推送系统70的离线系统。In some embodiments, the information push apparatus 500 is deployed in the online system of the information push system 70 , and the model training apparatus 600 is deployed in the offline system of the information push system 70 .
图8示出了根据本公开一些实施例的数据处理装置的结构示意图,该数据处理装置为信息推送装置或者用于信息推送的模型训练装置。如图8所示,该实施例的数据处理装置80包括:存储器810以及耦接至该存储器810的处理器820,处理器820被配置为基于存储在存储器810中的指令,执行前述任意一个实施例中的信息推送方法或者用于信息推送的模型训练方法。FIG. 8 shows a schematic structural diagram of a data processing apparatus according to some embodiments of the present disclosure, where the data processing apparatus is an information push apparatus or a model training apparatus for information push. As shown in FIG. 8 , the data processing apparatus 80 of this embodiment includes: a memory 810 and a processor 820 coupled to the memory 810 , and the processor 820 is configured to execute any one of the foregoing implementations based on instructions stored in the memory 810 The information push method in the example or the model training method for information push.
其中,存储器810例如可以包括系统存储器、固定非易失性存储介质等。系统存储器例如存储有操作系统、应用程序、引导装载程序(Boot Loader)以及其他程序等。The memory 810 may include, for example, a system memory, a fixed non-volatile storage medium, and the like. The system memory stores, for example, an operating system, an application program, a boot loader (Boot Loader), and other programs.
图9示出了根据本公开另一些实施例的数据处理装置的结构示意图,该数据处理装置为信息推送装置或者用于信息推送的模型训练装置。如图9所示,该实施例的数据处理装置90包括:存储器910以及处理器920,还可以包括输入输出接口930、网络接口940、存储接口950等。这些接口930,940,950以及存储器910和处理器920之间例如可以通过总线960连接。其中,输入输出接口930为显示器、鼠标、键盘、触摸屏等输入输出设备提供连接接口。网络接口940为各种联网设备提供连接接口。存储接口950为SD卡、U盘等外置存储设备提供连接接口。FIG. 9 shows a schematic structural diagram of a data processing apparatus according to other embodiments of the present disclosure, where the data processing apparatus is an information push apparatus or a model training apparatus for information push. As shown in FIG. 9 , the data processing apparatus 90 in this embodiment includes: a memory 910 and a processor 920, and may further include an input/output interface 930, a network interface 940, a storage interface 950, and the like. These interfaces 930 , 940 , 950 and the memory 910 and the processor 920 can be connected, for example, through a bus 960 . The input and output interface 930 provides a connection interface for input and output devices such as a display, a mouse, a keyboard, and a touch screen. Network interface 940 provides a connection interface for various networked devices. The storage interface 950 provides a connection interface for external storage devices such as SD cards and U disks.
本公开的实施例还提供一种计算机可读存储介质,其上存储有计算机程序,其特征在于,该程序被处理器执行时实现前述任意一种信息推送方法或者用于信息推送的模型训练方法。Embodiments of the present disclosure further provide a computer-readable storage medium on which a computer program is stored, characterized in that, when the program is executed by a processor, any one of the foregoing information push methods or a model training method for information push is implemented. .
本领域内的技术人员应当明白,本公开的实施例可提供为方法、系统、或计算机程序产品。因此,本公开可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本公开可采用在一个或多个其中包含有计算机可用程序代码的计算机可用非瞬时性存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。As will be appreciated by one skilled in the art, embodiments of the present disclosure may be provided as a method, system, or computer program product. Accordingly, the present disclosure may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, the present disclosure may take the form of a computer program product embodied on one or more computer-usable non-transitory storage media (including, but not limited to, disk storage, CD-ROM, optical storage, etc.) having computer-usable program code embodied therein .
本公开是参照根据本公开实施例的方法、设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解为可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方 框或多个方框中指定的功能的装置。The present disclosure is described with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems), and computer program products according to embodiments of the disclosure. It will be understood that each process and/or block in the flowchart illustrations and/or block diagrams, and combinations of processes and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing device to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing device produce Means for implementing the functions specified in a flow or flow of a flowchart and/or a block or blocks of a block diagram.
这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing apparatus to function in a particular manner, such that the instructions stored in the computer-readable memory result in an article of manufacture comprising instruction means, the instructions The apparatus implements the functions specified in the flow or flows of the flowcharts and/or the block or blocks of the block diagrams.
这些计算机程序指令也可装载到计算机或其他可编程数据处理设备上,使得在计算机或其他可编程设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing device to cause a series of operational steps to be performed on the computer or other programmable device to produce a computer-implemented process such that The instructions provide steps for implementing the functions specified in the flow or blocks of the flowcharts and/or the block or blocks of the block diagrams.
以上所述仅为本公开的较佳实施例,并不用以限制本公开,凡在本公开的精神和原则之内,所作的任何修改、等同替换、改进等,均应包含在本公开的保护范围之内。The above descriptions are only preferred embodiments of the present disclosure, and are not intended to limit the present disclosure. Any modifications, equivalent replacements, improvements, etc. made within the spirit and principles of the present disclosure shall be included in the protection of the present disclosure. within the range.

Claims (18)

  1. 一种信息推送方法,包括:An information push method, comprising:
    获取离线系统发送的、完成训练的目标域预测模型,其中,所述目标域预测模型是利用源域预测模型的预测结果、以及目标域的用户训练数据进行训练而获得的,所述源域预测模型是利用源域的用户训练数据进行训练而获得的;Obtain the target domain prediction model sent by the offline system and completed training, wherein the target domain prediction model is obtained by using the prediction result of the source domain prediction model and the user training data of the target domain for training, and the source domain prediction model is obtained. The model is obtained by training with user training data from the source domain;
    利用所述目标域预测模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。Using the target domain prediction model, the user data to be measured in the target domain is predicted to obtain the information push result for the corresponding user.
  2. 根据权利要求1所述的信息推送方法,其中,定期获取离线系统发送的、完成训练的目标域预测模型以及所述目标域预测模型的编码值,并且所述信息推送方法还包括:The information push method according to claim 1, wherein the target domain prediction model and the encoded value of the target domain prediction model sent by the offline system and completed training are regularly obtained, and the information push method further comprises:
    在获取的预测网络模型的编码值与当前使用的目标域预测模型的编码值不同的情况下,通过对获取的目标域预测模型的版本验证;In the case that the coding value of the obtained prediction network model is different from the coding value of the currently used target domain prediction model, verify the version of the obtained target domain prediction model;
    在对所述获取的目标域预测模型的上线验证通过的情况下,采用所述获取的目标域预测模型替换所述当前使用的目标域预测模型,其中,所述上线验证包括所述版本验证。In the case that the online verification of the acquired target domain prediction model is passed, the currently used target domain prediction model is replaced with the acquired target domain prediction model, wherein the online verification includes the version verification.
  3. 根据权利要求2所述的信息推送方法,其中,所述上线验证还包括模型验证,并且所述信息推送方法还包括:The information push method according to claim 2, wherein the online verification further includes model verification, and the information push method further includes:
    在所述获取的目标域预测模型的预设参数的值在预设范围内的情况下,通过对所述获取的目标域预测模型的模型验证。When the value of the preset parameter of the acquired target domain prediction model is within a preset range, model verification of the acquired target domain prediction model is performed.
  4. 根据权利要求1所述的信息推送方法,其中,所述获取的目标域预测模型为固化图文件,在所述固化图文件中,通过训练确定的、所述目标域预测模型的参数被转换为常量。The information push method according to claim 1, wherein the obtained target domain prediction model is a solidified map file, and in the solidified map file, the parameters of the target domain prediction model determined through training are converted into constant.
  5. 根据权利要求1所述的信息推送方法,其中,所述用户待测数据包括用户的特征、产品的特征,所述对相应用户的信息推送结果为是否为所述用户推荐所述产品的结果。The information push method according to claim 1, wherein the user data to be measured includes user characteristics and product characteristics, and the information push result to the corresponding user is a result of whether the user recommends the product.
  6. 一种用于信息推送的模型训练方法,包括:A model training method for information push, comprising:
    采用源域的用户训练数据训练源域预测模型;Use the user training data of the source domain to train the source domain prediction model;
    将同一用户对应的源域特征数据和目标域特征数据分别输入到所述源域预测模型和目标域预测模型中,获得所述用户对应的源域预测结果和目标域预测结果;Inputting the source domain feature data and target domain feature data corresponding to the same user into the source domain prediction model and the target domain prediction model, respectively, to obtain the source domain prediction result and the target domain prediction result corresponding to the user;
    根据所述源域预测结果与所述目标域预测结果的差距、以及所述用户对应的标记值与所述目标域预测结果的差距,对所述目标域预测模型的参数进行调整。The parameters of the target domain prediction model are adjusted according to the difference between the source domain prediction result and the target domain prediction result, and the difference between the tag value corresponding to the user and the target domain prediction result.
  7. 根据权利要求6所述的模型训练方法,其中,所述对所述目标域预测模型的参数进行调整包括:The model training method according to claim 6, wherein the adjusting the parameters of the target domain prediction model comprises:
    基于所述目标域预测模型的损失函数,对所述目标域预测模型的参数进行调整,其中,所述目标域预测模型的损失函数包括所述源域预测结果与所述目标域预测结果的交叉熵、以及所述用户对应的标记值与所述目标域预测结果的交叉熵。Adjust the parameters of the target domain prediction model based on the loss function of the target domain prediction model, wherein the loss function of the target domain prediction model includes the intersection of the source domain prediction result and the target domain prediction result entropy, and the cross-entropy between the tag value corresponding to the user and the prediction result of the target domain.
  8. 根据权利要求6所述的模型训练方法,其中,所述目标域预测模型的复杂度高于所述源域预测模型的复杂度。The model training method according to claim 6, wherein the complexity of the target domain prediction model is higher than that of the source domain prediction model.
  9. 根据权利要求6所述的模型训练方法,其中,所述目标域预测模型的输入数据的维度数大于所述源域预测模型的输入数据的维度数。The model training method according to claim 6, wherein the number of dimensions of the input data of the target domain prediction model is greater than the number of dimensions of the input data of the source domain prediction model.
  10. 根据权利要求6所述的模型训练方法,其中,所述采用源域的用户训练数据训练源域预测模型包括:The model training method according to claim 6, wherein the training of the source domain prediction model using the user training data of the source domain comprises:
    获取多个用户中的每一个对应的第一特征数据和第二特征数据,其中,同一用户对应的所述第二特征数据包括所述第一特征数据的部分维度的特征;acquiring first feature data and second feature data corresponding to each of the multiple users, wherein the second feature data corresponding to the same user includes features of some dimensions of the first feature data;
    采用第一特征数据训练第一预备模型;using the first feature data to train the first preliminary model;
    将同一用户对应的第一特征数据和第二特征数据分别输入到所述第一预备模型和第二预备模型中,获得所述用户对应的第一预备模型预测结果和第二预备模型预测结果;Inputting the first feature data and the second feature data corresponding to the same user into the first preliminary model and the second preliminary model, respectively, to obtain the first preliminary model prediction result and the second preliminary model prediction result corresponding to the user;
    根据所述第一预备模型预测结果与所述第二预备模型预测结果的差距、以及所述用户对应的标记值与所述第一预备模型预测结果的差距,对所述第二预备模型的参数 进行调整;According to the difference between the prediction result of the first preparatory model and the prediction result of the second preparatory model, and the difference between the mark value corresponding to the user and the prediction result of the first preparatory model, the parameters of the second preparatory model are determined. make adjustments;
    将调整后的第二预备模型作为所述源域预测模型。The adjusted second preliminary model is used as the source domain prediction model.
  11. 根据权利要求10所述的模型训练方法,其中,所述第一预备模型和所述第二预备模型除输入层不同以外,具有相同的网络模型结构。The model training method according to claim 10, wherein the first preliminary model and the second preliminary model have the same network model structure except for different input layers.
  12. 根据权利要求6所述的模型训练方法,其中,所述用户对应的源域预测结果和目标域预测结果是同一物品所关联的推荐结果。The model training method according to claim 6, wherein the source domain prediction result and the target domain prediction result corresponding to the user are recommendation results associated with the same item.
  13. 一种信息推送装置,包括:An information push device, comprising:
    获取模块,被配置为获取离线系统发送的、完成训练的目标域预测模型,其中,所述目标域预测模型是利用源域预测模型的预测结果、以及目标域的用户训练数据进行训练而获得的,所述源域预测模型是利用源域的用户训练数据进行训练而获得的;The acquisition module is configured to acquire the target domain prediction model sent by the offline system and completed the training, wherein the target domain prediction model is obtained by using the prediction result of the source domain prediction model and the user training data of the target domain for training , the source domain prediction model is obtained by using the user training data of the source domain for training;
    预测模块,被配置为利用所述目标域预测模型,对目标域的用户待测数据进行预测,以获得对相应用户的信息推送结果。The prediction module is configured to use the target domain prediction model to predict the user data to be measured in the target domain to obtain information push results for the corresponding users.
  14. 一种用于信息推送的模型训练装置,包括:A model training device for information push, comprising:
    源域训练模块,被配置为采用源域的用户训练数据训练源域预测模型;a source domain training module, configured to train a source domain prediction model using user training data in the source domain;
    目标域训练模块,被配置为将同一用户对应的源域特征数据和目标域特征数据分别输入到所述源域预测模型和目标域预测模型中,获得所述用户对应的源域预测结果和目标域预测结果;以及,根据所述源域预测结果与所述目标域预测结果的差距、以及所述用户对应的标记值与所述目标域预测结果的差距,对所述目标域预测模型的参数进行调整。The target domain training module is configured to input the source domain feature data and target domain feature data corresponding to the same user into the source domain prediction model and the target domain prediction model respectively, and obtain the source domain prediction result and target corresponding to the user. domain prediction result; and, according to the difference between the source domain prediction result and the target domain prediction result, and the difference between the mark value corresponding to the user and the target domain prediction result, the parameters of the target domain prediction model make adjustments.
  15. 一种信息推送系统,包括:An information push system, comprising:
    权利要求13所述的信息推送装置;以及The information push device of claim 13; and
    权力要求14所述的用于信息推送的模型训练装置。The model training device for information push according to claim 14.
  16. 一种信息推送装置,包括:An information push device, comprising:
    存储器;以及memory; and
    耦接至所述存储器的处理器,所述处理器被配置为基于存储在所述存储器中的指令,执行如权利要求1~5中任一项所述的信息推送方法。A processor coupled to the memory, the processor configured to perform the information pushing method of any one of claims 1-5 based on instructions stored in the memory.
  17. 一种用于信息推送的模型训练装置,包括:A model training device for information push, comprising:
    存储器;以及memory; and
    耦接至所述存储器的处理器,所述处理器被配置为基于存储在所述存储器中的指令,执行如权利要求6~12中任一项所述的用于信息推送的模型训练方法。A processor coupled to the memory, the processor configured to execute the model training method for information push according to any one of claims 6-12 based on the instructions stored in the memory.
  18. 一种计算机可读存储介质,其上存储有计算机程序,该程序被处理器执行时实现权利要求1~5中任一项所述的信息推送方法、或者权利要求6~12中任一项所述的用于信息推送的模型训练方法。A computer-readable storage medium on which a computer program is stored, and when the program is executed by a processor, realizes the information pushing method described in any one of claims 1 to 5, or any one of claims 6 to 12. The described model training method for information push.
PCT/CN2021/132104 2020-12-03 2021-11-22 Information pushing method and system, model training method, and related devices WO2022116862A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011397968.4 2020-12-03
CN202011397968.4A CN112417293A (en) 2020-12-03 2020-12-03 Information pushing method and system, model training method and related equipment

Publications (1)

Publication Number Publication Date
WO2022116862A1 true WO2022116862A1 (en) 2022-06-09

Family

ID=74829847

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/132104 WO2022116862A1 (en) 2020-12-03 2021-11-22 Information pushing method and system, model training method, and related devices

Country Status (2)

Country Link
CN (1) CN112417293A (en)
WO (1) WO2022116862A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116502271A (en) * 2023-06-21 2023-07-28 杭州金智塔科技有限公司 Privacy protection cross-domain recommendation method based on generation model

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112417293A (en) * 2020-12-03 2021-02-26 京东数字科技控股股份有限公司 Information pushing method and system, model training method and related equipment
CN117609887B (en) * 2024-01-19 2024-05-10 腾讯科技(深圳)有限公司 Data enhancement model training and data processing method, device, equipment and medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109544306A (en) * 2018-11-30 2019-03-29 苏州大学 A kind of cross-cutting recommended method and device based on user behavior sequence signature
CN110796232A (en) * 2019-10-12 2020-02-14 腾讯科技(深圳)有限公司 Attribute prediction model training method, attribute prediction method and electronic equipment
CN111966914A (en) * 2020-10-26 2020-11-20 腾讯科技(深圳)有限公司 Content recommendation method and device based on artificial intelligence and computer equipment
CN112417293A (en) * 2020-12-03 2021-02-26 京东数字科技控股股份有限公司 Information pushing method and system, model training method and related equipment

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108108821B (en) * 2017-12-29 2022-04-22 Oppo广东移动通信有限公司 Model training method and device
US10885383B2 (en) * 2018-05-16 2021-01-05 Nec Corporation Unsupervised cross-domain distance metric adaptation with feature transfer network
US20200104710A1 (en) * 2018-09-27 2020-04-02 Google Llc Training machine learning models using adaptive transfer learning
CN109447149B (en) * 2018-10-25 2021-11-02 腾讯医疗健康(深圳)有限公司 Training method and device for detection model and terminal equipment
CN109544190A (en) * 2018-11-28 2019-03-29 北京芯盾时代科技有限公司 A kind of fraud identification model training method, fraud recognition methods and device
CN110378726A (en) * 2019-07-02 2019-10-25 阿里巴巴集团控股有限公司 A kind of recommended method of target user, system and electronic equipment
CN110400169B (en) * 2019-07-02 2022-12-23 创新先进技术有限公司 Information pushing method, device and equipment
CN110399856B (en) * 2019-07-31 2021-09-14 上海商汤临港智能科技有限公司 Feature extraction network training method, image processing method, device and equipment
CN111582449B (en) * 2020-05-07 2023-08-04 广州视源电子科技股份有限公司 Training method, device, equipment and storage medium of target domain detection network

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109544306A (en) * 2018-11-30 2019-03-29 苏州大学 A kind of cross-cutting recommended method and device based on user behavior sequence signature
CN110796232A (en) * 2019-10-12 2020-02-14 腾讯科技(深圳)有限公司 Attribute prediction model training method, attribute prediction method and electronic equipment
CN111966914A (en) * 2020-10-26 2020-11-20 腾讯科技(深圳)有限公司 Content recommendation method and device based on artificial intelligence and computer equipment
CN112417293A (en) * 2020-12-03 2021-02-26 京东数字科技控股股份有限公司 Information pushing method and system, model training method and related equipment

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116502271A (en) * 2023-06-21 2023-07-28 杭州金智塔科技有限公司 Privacy protection cross-domain recommendation method based on generation model
CN116502271B (en) * 2023-06-21 2023-09-19 杭州金智塔科技有限公司 Privacy protection cross-domain recommendation method based on generation model

Also Published As

Publication number Publication date
CN112417293A (en) 2021-02-26

Similar Documents

Publication Publication Date Title
WO2022116862A1 (en) Information pushing method and system, model training method, and related devices
US11048762B2 (en) User-defined automated document feature modeling, extraction and optimization
CN111768285A (en) Credit wind control model construction system and method, wind control system and storage medium
WO2019100635A1 (en) Editing method and apparatus for automated test script, terminal device and storage medium
CN104102917A (en) Construction method of domain self-adaptive classifier, construction device for domain self-adaptive classifier, data classification method and data classification device
CN112734104B (en) Cross-domain recommendation method fusing generation countermeasure network and self-encoder
US20200272845A1 (en) End-to-End Fuzzy Entity Matching
CN107229614A (en) Method and apparatus for grouped data
CN113449753B (en) Service risk prediction method, device and system
CN113011895A (en) Associated account sample screening method, device and equipment and computer storage medium
CN109242165A (en) A kind of model training and prediction technique and device based on model training
CN110458600A (en) Portrait model training method, device, computer equipment and storage medium
CN114492601A (en) Resource classification model training method and device, electronic equipment and storage medium
Lawrence et al. Explaining neural matrix factorization with gradient rollback
CN116522912B (en) Training method, device, medium and equipment for package design language model
CN116880867A (en) Policy big model-based decision engine updating method and device
CN116842263A (en) Training processing method and device for intelligent question-answering financial advisor model
CN115599990A (en) Knowledge perception and deep reinforcement learning combined cross-domain recommendation method and system
CN115310004A (en) Graph nerve collaborative filtering recommendation method fusing project time sequence relation
CN112348318A (en) Method and device for training and applying supply chain risk prediction model
CN111401754A (en) Method and system for intelligently selecting evaluation index weight determination method
CN111737554A (en) Scoring model training method, electronic book scoring method and device
CN113836907B (en) Text clustering picture identification method, device and system
US20240169255A1 (en) Systems and methods for integrating disparate feature groups during feature engineering of training data for artificial intelligence models
US20240169254A1 (en) Systems and methods for generating integrated feature graphs during feature engineering of training data for artificial intelligence models

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21899896

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21899896

Country of ref document: EP

Kind code of ref document: A1