CN113420866A - Score prediction method based on dual generation countermeasure network - Google Patents

Score prediction method based on dual generation countermeasure network Download PDF

Info

Publication number
CN113420866A
CN113420866A CN202110698814.7A CN202110698814A CN113420866A CN 113420866 A CN113420866 A CN 113420866A CN 202110698814 A CN202110698814 A CN 202110698814A CN 113420866 A CN113420866 A CN 113420866A
Authority
CN
China
Prior art keywords
samples
user
gan
sample generator
negative sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202110698814.7A
Other languages
Chinese (zh)
Other versions
CN113420866B (en
Inventor
秦继伟
武步尘
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xinjiang University
Original Assignee
Xinjiang University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xinjiang University filed Critical Xinjiang University
Priority to CN202110698814.7A priority Critical patent/CN113420866B/en
Publication of CN113420866A publication Critical patent/CN113420866A/en
Application granted granted Critical
Publication of CN113420866B publication Critical patent/CN113420866B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Business, Economics & Management (AREA)
  • Software Systems (AREA)
  • Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • General Health & Medical Sciences (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Biophysics (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Game Theory and Decision Science (AREA)
  • Development Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Marketing (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • General Business, Economics & Management (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention discloses a scoring prediction method based on a dual-generation countermeasure network, which mainly relates to the field of deep learning; the method comprises the following steps: s1, dividing the samples into three types, namely samples which are liked by the user, samples which are not purchased and samples which are disliked by the user; s2, two GANs are used, wherein G of the first GAN is a negative sample generator, G of the second GAN is a positive sample generator, and the negative sample generator is used for generating high-quality negative samples; s3, inputting the samples generated by the negative sample generator into the positive sample generator as additional marking data, and randomly selecting some samples which are not purchased as input into the negative sample generator to generate positive samples; s4, inputting a favorite purchase vector of the user by the second GAN and requiring to generate an output close to 0 on the negative sample element generated by the first GAN, and generating x (0 < x < 1) from the randomly sampled unpurchased vector as much as possible; the method can improve the accuracy of the model to the recommended prediction and the generalization force of the model.

Description

Score prediction method based on dual generation countermeasure network
Technical Field
The invention relates to the field of deep learning, in particular to a scoring prediction method based on a dual-generation countermeasure network.
Background
Collaborative Filtering (CF) is one of the most mature recommendation technologies, and the similarity of scores among users is calculated based on the historical score records of the users to establish a user preference model; the establishment of the user preference model is a key factor influencing the quality of the recommendation algorithm. When the user sparsely scores the recommended resource scoring matrix, a user model is difficult to establish; the problem of cold start can be brought when facing new users and new resources, and resource recommendation cannot be effectively completed. Therefore, how to build a user preference model by fully utilizing user and resource information, especially implicit information between them, has been a focus of research.
With the continuous development of deep learning, generation of antagonistic neural Networks (GAN) has been successfully applied to the field of recommendation. The generation of the countermeasure network is realized by continuously playing a game on a generation model G (Generator) and a discrimination model D (discriminator), so that G learns the distribution of data, and in the training process, the generator tries to cheat a discriminator and considers the data in the generation model to be real data. The discriminator attempts to correctly judge the true data. During GAN training, the generative model attempts to generate true data by inputting false data, while the discriminative model estimates the likelihood of the data being true or false (from the generative model). Finally, the discriminator cannot distinguish the data in the generator, and the generator can generate the data we need.
CFGAN is a highly successful deep recommendation framework based on generating countermeasure networks and collaborative filtering. The CFGAN is a GAN framework which introduces GAN in the field of collaborative filtering, migrates the relation between implicit features and files to users and projects, adopts a vector mode to resist training in the aspect of data preprocessing and proposes a vector mode-based resist training. Since the data is easier to collect, it plans to focus on CF with implicit feedback. The generative model of CFGAN attempts to generate a true purchase vector. Likewise, D attempts to distinguish between the generated user purchase vector and the true purchase vector obtained from ground truth. The generator plays games with the discriminator by generating a vector similar to the purchase vector, but the final generator will train to obtain an all-1 vector, for this reason, a negative sampling technique is introduced into the CFGAN, and optimization is performed through a loss function, so that the generator G is ensured to learn that the purchased article of the user is 1, and the unpurchased article is 0, so as to generate a purchase vector which is not all-1. CFGAN has achieved great success in performance, but the selection of negative examples is too random, because in real life, articles that the user does not purchase often do not represent articles that the user does not like, but rather, there are articles that the user does not see, and meanwhile, the previous algorithms do not utilize information of samples that the user does not purchase in the data set, so that a large amount of hidden information is wasted by the model, and the limit of model performance is not reached.
Disclosure of Invention
The invention aims to solve the problems in the prior art, provides a scoring prediction method based on a dual-generation countermeasure network, utilizes user embedded information, enhances the intimacy between users and projects and between users, improves a model, and better adapts to the problems of data sparseness and user cold start so as to improve the precision of the model on recommendation prediction and the generalization force of the model.
In order to achieve the purpose, the invention is realized by the following technical scheme:
the scoring prediction method based on the dual generation countermeasure network comprises the following steps:
s1, dividing the samples into three types, namely samples which are liked by the user, samples which are not purchased and samples which are disliked by the user;
s2, two GANs are used, wherein G of the first GAN is a negative sample generator, G of the second GAN is a positive sample generator, and the negative sample generator is used for generating high-quality negative samples;
s3, inputting the samples generated by the negative sample generator into the positive sample generator as additional marking data, and randomly selecting some samples which are not purchased as input into the negative sample generator to generate positive samples;
and S4, inputting a favorite purchase vector of the user by the second GAN, requiring that the negative sample element generated by the first GAN generates an output close to 0, and generating x (0 < x < 1) by using the randomly sampled unpurchased vector as much as possible.
Compared with the prior art, the invention has the beneficial effects that:
1. the invention fully utilizes the user embedded matrix and enhances the potential relationship between users; the negative sample and the generated countermeasure network are combined, so that the accuracy of the model is improved; samples which are not purchased by the user are utilized, so that the model is applied to the implicit relation between the user items;
2. the invention provides a general CF frame name DGAN based on GAN, which makes full use of the implicit information of user-resource, uses the unpurchased items, and proves the effectiveness of the implicit information used by the invention through comparison experiments; the experimental results obtained by carrying out a large number of experimental verifications on the two data sets not only prove the effectiveness of the invention, but also prove the superiority of the invention, and compared with the latest top-N, the accuracy is obviously improved.
Drawings
FIG. 1 is a general architecture diagram of the present invention;
FIG. 2 is a score prediction result (movielens-100K) based on the present invention and comparison algorithm;
FIG. 3 shows the score prediction results (movielens-1M) based on the present invention and comparison algorithm.
Detailed Description
The invention will be further illustrated with reference to the following specific examples. It should be understood that these examples are for illustrative purposes only and are not intended to limit the scope of the present invention. Further, it should be understood that various changes or modifications of the present invention may be made by those skilled in the art after reading the teaching of the present invention, and these equivalents also fall within the scope of the present application.
Example (b): as shown in fig. 1, the scoring prediction method based on the dual generation countermeasure network of the present invention includes the steps of:
s1, dividing the samples into three types, namely samples which are liked by the user, samples which are not purchased and samples which are disliked by the user;
s2, two GANs are used, wherein G of the first GAN is a negative sample generator, G of the second GAN is a positive sample generator, and the negative sample generator is used for generating high-quality negative samples;
s3, inputting the samples generated by the negative sample generator into the positive sample generator as additional marking data, and randomly selecting some samples which are not purchased as input into the negative sample generator to generate positive samples;
and S4, inputting a favorite purchase vector of the user by the second GAN, requiring that the negative sample element generated by the first GAN generates an output close to 0, and generating x (0 < x < 1) by using the randomly sampled unpurchased vector as much as possible.
Specifically, the method comprises the following steps:
1. negative sample generator
The method comprises the steps of collecting user historical data information, conducting data processing, marking samples which are three different types, namely samples which are liked by a user, samples which are not bought by the user and samples which are disliked by the user, and randomly selecting some samples which are disliked by the user as input to a negative sample generator.
2. Positive sample generator
Positive samples are generated by inputting the samples generated by the negative sample generator as additional marking data into the positive sample generator. The positive sample generator inputs the user's favorite purchase vector and requires the negative sample generator to generate an output close to 0 on the element, and simultaneously generates x (0 < x < 1) on the randomly sampled unpurchased vector as much as possible, thereby improving the accuracy of the scoring prediction model.
3. Scoring prediction recommendation model
Referring to fig. 1, a negative sample generator and a positive sample generator are fused and learned and score prediction is tested so as to better recommend favorite items for a user and improve user satisfaction.
The method comprises the following specific steps:
step 1: negative examples are extracted from the negative example generator.
Step 2: an item score is generated from the positive sample generator.
And step 3: the negative and positive sample generators are fused and learned and the score predictions are tested.
4. Experiments prove that
2-3, using 2 data sets movielens-100K and movielens-1M, the following 7 comparison algorithms were used: itempop, BPR, MPR, CDAE, IRGAN, CFGAN, GauphGAN, thereby verifying that the model performance is superior to other comparison recommendation algorithms.

Claims (1)

1. The scoring prediction method based on the dual generation countermeasure network is characterized by comprising the following steps of:
s1, dividing the samples into three types, namely samples which are liked by the user, samples which are not purchased and samples which are disliked by the user;
s2, two GANs are used, wherein G of the first GAN is a negative sample generator, G of the second GAN is a positive sample generator, and the negative sample generator is used for generating high-quality negative samples;
s3, inputting the sample generated by the negative sample generator into the positive sample generator as additional marking data to generate a positive sample;
s4, inputting a favorite purchase vector of the user by the second GAN and requiring to generate an output close to 0 on the negative sample element generated by the first GAN, and generating x (0 < x < 1) from the randomly sampled unpurchased vector as much as possible; the method can improve the accuracy of the model to the recommended prediction and the generalization force of the model.
CN202110698814.7A 2021-06-23 2021-06-23 Score prediction method based on dual generation countermeasure network Active CN113420866B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202110698814.7A CN113420866B (en) 2021-06-23 2021-06-23 Score prediction method based on dual generation countermeasure network

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202110698814.7A CN113420866B (en) 2021-06-23 2021-06-23 Score prediction method based on dual generation countermeasure network

Publications (2)

Publication Number Publication Date
CN113420866A true CN113420866A (en) 2021-09-21
CN113420866B CN113420866B (en) 2022-10-11

Family

ID=77716345

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202110698814.7A Active CN113420866B (en) 2021-06-23 2021-06-23 Score prediction method based on dual generation countermeasure network

Country Status (1)

Country Link
CN (1) CN113420866B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116610857A (en) * 2023-04-10 2023-08-18 南京邮电大学 Personalized post recommendation method based on user preference for post popularity

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109657156A (en) * 2019-01-22 2019-04-19 杭州师范大学 A kind of personalized recommendation method generating confrontation network based on circulation
CN110442781A (en) * 2019-06-28 2019-11-12 武汉大学 It is a kind of based on generate confrontation network to grade ranked items recommended method
US20200151940A1 (en) * 2018-11-13 2020-05-14 Nec Laboratories America, Inc. Pose-variant 3d facial attribute generation
CN111259264A (en) * 2020-01-15 2020-06-09 电子科技大学 Time sequence scoring prediction method based on generation countermeasure network
CN111382785A (en) * 2020-03-04 2020-07-07 武汉精立电子技术有限公司 GAN network model and method for realizing automatic cleaning and auxiliary marking of sample
US20200349393A1 (en) * 2019-04-30 2020-11-05 Agora Lab, Inc. Optimizing Supervised Generative Adversarial Networks via Latent Space Regularizations
CN111950619A (en) * 2020-08-05 2020-11-17 东北林业大学 Active learning method based on dual-generation countermeasure network

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20200151940A1 (en) * 2018-11-13 2020-05-14 Nec Laboratories America, Inc. Pose-variant 3d facial attribute generation
CN109657156A (en) * 2019-01-22 2019-04-19 杭州师范大学 A kind of personalized recommendation method generating confrontation network based on circulation
US20200349393A1 (en) * 2019-04-30 2020-11-05 Agora Lab, Inc. Optimizing Supervised Generative Adversarial Networks via Latent Space Regularizations
CN110442781A (en) * 2019-06-28 2019-11-12 武汉大学 It is a kind of based on generate confrontation network to grade ranked items recommended method
CN111259264A (en) * 2020-01-15 2020-06-09 电子科技大学 Time sequence scoring prediction method based on generation countermeasure network
CN111382785A (en) * 2020-03-04 2020-07-07 武汉精立电子技术有限公司 GAN network model and method for realizing automatic cleaning and auxiliary marking of sample
CN111950619A (en) * 2020-08-05 2020-11-17 东北林业大学 Active learning method based on dual-generation countermeasure network

Non-Patent Citations (4)

* Cited by examiner, † Cited by third party
Title
PEIFENG WANG 等: "Incorporating GAN for Negative Sampling in Knowledge Representation Learning", 《THIRTY-SECOND AAAI CONFERENCE ON ARTIFICIAL INTELLIGENCE》 *
林嘉豪 等: "基于生成对抗网络的模仿学习综述", 《计算机学报》 *
滑瑾: "基于跨模态语义挖掘与生成对抗网络的推荐系统研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
黄钢: "生成对抗算法及其在对抗样本生成上的应用研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116610857A (en) * 2023-04-10 2023-08-18 南京邮电大学 Personalized post recommendation method based on user preference for post popularity
CN116610857B (en) * 2023-04-10 2024-05-03 南京邮电大学 Personalized post recommendation method based on user preference for post popularity

Also Published As

Publication number Publication date
CN113420866B (en) 2022-10-11

Similar Documents

Publication Publication Date Title
Zhang et al. Human motion recognition based on SVM in VR art media interaction environment
CN110097094B (en) Multiple semantic fusion few-sample classification method for character interaction
US8768050B2 (en) Accurate text classification through selective use of image data
CN110442790A (en) Recommend method, apparatus, server and the storage medium of multi-medium data
CN105808590B (en) Search engine implementation method, searching method and device
CN109800785B (en) Data classification method and device based on self-expression correlation
CN107797998A (en) The recognition methods of user-generated content containing rumour and device
CN113221663B (en) Real-time sign language intelligent identification method, device and system
WO2020135642A1 (en) Model training method and apparatus employing generative adversarial network
Ahuja et al. Low-resource adaptation for personalized co-speech gesture generation
Cobo et al. Automatic task decomposition and state abstraction from demonstration
CN113420866B (en) Score prediction method based on dual generation countermeasure network
Li et al. Meta learning for task-driven video summarization
CN107749034A (en) A kind of safe friend recommendation method in social networks
CN111259264B (en) Time sequence scoring prediction method based on generation countermeasure network
Liu et al. Finding out reasons for low completion in MOOC environment: an explicable approach using hybrid data mining methods
CN113220856A (en) Multi-round dialogue system based on Chinese pre-training model
CN113806630A (en) Attention-based multi-view feature fusion cross-domain recommendation method and device
CN113449011A (en) Big data prediction-based information push updating method and big data prediction system
CN113641811B (en) Session recommendation method, system, equipment and storage medium for promoting purchasing behavior
CN116932778A (en) Teaching knowledge mining method and system based on VR technology
CN109740743A (en) Hierarchical neural network query recommendation method and device
Yang [Retracted] A Deep Learning and Clustering Extraction Mechanism for Recognizing the Actions of Athletes in Sports
CN109062900A (en) A kind of circle of friends generation method and device
CN114862496A (en) Session recommendation method, device and medium based on user personalized modeling

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant