WO2020093781A1 - Multimedia resource estimated click through rate determination method and apparatus, and server - Google Patents

Multimedia resource estimated click through rate determination method and apparatus, and server Download PDF

Info

Publication number
WO2020093781A1
WO2020093781A1 PCT/CN2019/105452 CN2019105452W WO2020093781A1 WO 2020093781 A1 WO2020093781 A1 WO 2020093781A1 CN 2019105452 W CN2019105452 W CN 2019105452W WO 2020093781 A1 WO2020093781 A1 WO 2020093781A1
Authority
WO
WIPO (PCT)
Prior art keywords
click
information
rate
multimedia
multimedia resource
Prior art date
Application number
PCT/CN2019/105452
Other languages
French (fr)
Chinese (zh)
Inventor
牛亚男
Original Assignee
北京达佳互联信息技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京达佳互联信息技术有限公司 filed Critical 北京达佳互联信息技术有限公司
Publication of WO2020093781A1 publication Critical patent/WO2020093781A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0202Market predictions or forecasting for commercial activities
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/08Learning methods

Definitions

  • the present disclosure relates to the field of information recommendation, in particular to a method, device and server for determining the estimated click rate of multimedia resources.
  • CTR clickthrough rate
  • the related art commonly uses a deep neural network (DNN) click rate prediction model to calculate the estimated click rate.
  • the input data of the click rate estimation model can be divided into different fields, for example, the field of multimedia resources and the field of users.
  • the input data can be converted into a low-dimensional embedding vector through an embedding layer, where different fields can correspond to different weight matrices in the embedding layer.
  • Each feature information can determine the embedding vector through the corresponding weight matrix.
  • the weight matrix corresponding to the domain can also be called the embedding mapping table of the domain.
  • the embedded vector can be input into the deep neural network to output the estimated click rate of the multimedia resource.
  • the number of certain feature information may be less.
  • the feature information in the user field includes the image identifier that the user clicked, if the user clicks the image less times, the collected image identifier The number is also small.
  • a small number of feature information will lead to insufficient learning of the corresponding weight matrix, that is, the obtained embedding vector is weakly represented, which leads to When estimating, the accuracy of the estimated click-through rate is low.
  • the present disclosure provides a method, device and server for determining the estimated click rate of multimedia resources, which can solve the problem of low accuracy of the estimated click rate.
  • a method for determining a multimedia resource estimated click rate includes:
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
  • inputting the user behavior information and the multimedia attribute information into the click-through rate estimation model, and outputting the user's estimated click-through rate to the first multimedia resource includes:
  • Input at least one embedding vector output by the embedding layer into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
  • the training method of the click-through rate prediction model includes:
  • the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
  • the initial model includes an initial embedding layer and an initial click rate prediction network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
  • the training the initial model based on the at least one training sample to obtain the estimated click-through rate model includes:
  • the click-through rate estimation model is obtained.
  • the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
  • the information type includes work identification, author identification, and / or style identification.
  • the user behavior information includes click history information, attention information and / or favorite information
  • the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user
  • the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource
  • the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
  • an apparatus for determining an estimated click rate of multimedia resources including:
  • the obtaining unit is configured to obtain user behavior information of the user
  • the obtaining unit is further configured to obtain multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
  • the calling unit is configured to call a click-through rate prediction model
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network
  • the embedding layer includes a weight matrix corresponding to at least one information type
  • the estimation network is used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource;
  • the determining unit is configured to input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the estimated click-through rate of the user to the first multimedia resource.
  • the determining unit is configured to:
  • Input at least one embedding vector output by the embedding layer into the click-through rate estimation network, and output the user's estimated click-through rate to the first multimedia resource.
  • the device further includes a training unit, the training unit is configured to:
  • the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
  • the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
  • the training unit is configured to:
  • the click-through rate estimation model is obtained.
  • the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
  • the information type includes work identification, author identification, and / or style identification.
  • the user behavior information includes click history information, attention information and / or favorite information
  • the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user
  • the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource
  • the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
  • a server including:
  • One or more processors are One or more processors;
  • One or more memories for storing one or more processor executable instructions
  • the one or more processors are configured as:
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
  • the user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
  • a non-transitory computer-readable storage medium which when the instructions in the storage medium are executed by the processor of the server, enables the server to execute a multimedia resource estimated click rate Determination method, the method includes:
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
  • an application program / computer program product is provided.
  • the server is caused to perform a method for determining a multimedia resource estimated click rate.
  • Methods include:
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
  • the user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
  • information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is estimated based on the method of this embodiment, the accuracy of the estimated click rate is improved.
  • Fig. 1 is a schematic diagram of a click-through rate estimation model according to an exemplary embodiment.
  • Fig. 2 is a diagram of an implementation environment according to an exemplary embodiment.
  • Fig. 3 is a flow chart of a method for determining a multimedia resource estimated click rate according to an exemplary embodiment.
  • Fig. 4 is a flow chart of a method for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
  • Fig. 5 is a schematic diagram of an application program interface according to an exemplary embodiment.
  • Fig. 6 is a schematic diagram of a click rate prediction model according to an exemplary embodiment.
  • Fig. 7 is a schematic diagram of a click rate prediction model according to an exemplary embodiment.
  • Fig. 8 is a flow chart of a method for training a click rate prediction model according to an exemplary embodiment.
  • Fig. 9 is a flow chart of a method for training a click rate prediction model according to an exemplary embodiment.
  • Fig. 10 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
  • Fig. 11 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
  • the implementation environment diagram is shown in FIG. 2.
  • the implementation environment may include multiple terminals 201 and a server 202 for providing services for the multiple terminals 201.
  • a plurality of terminals 201 are connected to the server 202 through a wireless or wired network, and the plurality of terminals 201 may be computer devices or smart terminals that can access the server 202.
  • the terminal 201 may be installed with an application program for recommending multimedia resources (such as pictures, short videos, etc.), and the user may log in to the above application program.
  • the server 202 can provide background services for the above-mentioned applications and record user behavior information of each user.
  • the server 202 may also have at least one database for storing click-through rate estimation models, multimedia resources and corresponding multimedia attribute information, user behavior information of various users, and so on.
  • This embodiment provides a method for determining the estimated click rate of multimedia resources.
  • This method may be implemented by a server.
  • FIG. 3 a flowchart of a method for determining the estimated click rate of multimedia resources.
  • the processing flow of the method may include the following A step of:
  • step S301 the server acquires user behavior information of the user.
  • step S302 the server acquires multimedia attribute information of the first multimedia resource.
  • the first multimedia resource is a multimedia resource to be recommended to the user.
  • step S303 the server calls the click-through rate estimation model.
  • the click-through rate prediction model includes an embedding layer and a click-through rate prediction network.
  • the embedding layer includes a weight matrix corresponding to at least one information type.
  • the click-through rate prediction network is used to take the embedding vector output by the embedding layer as input and output multimedia resources Estimated clickthrough rate.
  • step S304 the server inputs user behavior information and multimedia attribute information into the click-through rate estimation model, and outputs the user's estimated click-through rate to the first multimedia resource.
  • inputting user behavior information and multimedia attribute information into the click-through rate estimation network, and outputting the user's estimated click-through rate to the first multimedia resource include:
  • the at least one embedding vector output from the embedding layer is input into the click rate estimation model, and the user's estimated click rate of the first multimedia resource is output.
  • the training methods of the CTR prediction model include:
  • the training sample includes multimedia attribute information of the second multimedia resource, user behavior information when the sample user browses the second multimedia resource, and the click situation of the sample user on the second multimedia resource. Including clicked or not clicked;
  • the initial model is trained based on at least one training sample to obtain a click-through rate estimation model.
  • the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
  • Training the initial model based on at least one training sample to obtain a click-through rate estimation model including:
  • the initial weight matrix corresponding to each information type is adjusted based on the training samples containing the information type to obtain the weight matrix corresponding to the trained information type;
  • a click-through rate prediction model is obtained.
  • the first information type corresponds to The learning rate of is greater than the learning rate corresponding to the second information type.
  • the information type includes work identification, author identification and / or style identification.
  • the user behavior information includes click history information, attention information and / or favorite information
  • the click history information includes information recorded when the user clicks on any multimedia resource
  • the attention information includes information recorded when the user clicks on the attention option
  • the favorite information includes Information recorded when the user clicks the favorite option.
  • a method for determining the estimated click rate of multimedia resources will be introduced in conjunction with a specific implementation manner.
  • the method may be implemented by a server, as shown in the flowchart of the method for determining the estimated click rate of multimedia resources shown in FIG. 4, the processing flow of the method may include the following steps:
  • step S401 the server acquires user behavior information of the user.
  • the terminal can send a multimedia resource acquisition request to the server; or, when the user searches for multimedia resources through the application, the terminal can also send the multimedia resource acquisition request to the server.
  • the server receives the multimedia resource acquisition request sent by the terminal, it may trigger processing logic of the recommended multimedia resource, so as to be processed by the method for determining the estimated click rate of the multimedia resource provided in this embodiment.
  • This embodiment does not limit the specific manner of triggering the processing logic of recommending multimedia resources.
  • the server may obtain the user behavior information of each user from the stored user behavior information of each user according to the user's identification information.
  • the user behavior information may include click history information, attention information and / or favorite information
  • the click history information may be used to represent multimedia attribute information of the multimedia resource clicked by the user
  • the attention information may be used to represent the user
  • the multimedia attribute information and favorite information of the focused multimedia resource can be used to represent the multimedia attribute information of the user's favorite multimedia resource.
  • the information type may include work identification, author identification, and / or style identification.
  • the terminal can display the multimedia resources provided by the application program, and the display interface of the multimedia resources can include attention options and favorite options.
  • the server may receive a request to load the multimedia resource, and then may add corresponding multimedia attribute information to the user's click history information for storage. For example, when the user clicks to watch a short video, the server may add the work identification, author identification, and / or style identification of the short video to the user's click history information.
  • the server may receive a request to add attention to the multimedia resource, and add the corresponding multimedia attribute information to the attention information of the user for storage.
  • the attention option may be not only attention to multimedia resources, but also attention to authors or styles of multimedia resources, which is not limited in this embodiment.
  • the server can add the author ID of the multimedia resource to the user's attention information, and after that, when the author updates the work, the user can receive the corresponding Update notifications to achieve the effect of subscription.
  • the server adds the corresponding favorite information processing, which is similar to the processing of adding attention information, and will not be repeated here.
  • the above solutions for recording click history information, attention information, and favorite information are all optional solutions, that is, the user behavior information recorded by the server may be one or more of click history information, attention information, and favorite information.
  • the type of information contained therein may also be one or more of a work identification, an author identification and a style identification, and may also include other information types. The specific information type is not limited in this embodiment.
  • step S402 the server acquires multimedia attribute information of the first multimedia resource.
  • the first multimedia resource is a multimedia resource to be recommended to the user.
  • the multimedia attribute information may be obtained by the author when the author uploads the corresponding multimedia resource, or the server automatically generates the information based on the author and content of the multimedia resource. This embodiment does not limit the generation method of the multimedia attribute information.
  • the first multimedia resource may include multiple multimedia resources, for example, it may be a currently popular multimedia resource, or it may be a search result obtained by searching a user. Since the multimedia resources that the user can view are limited, the server can determine which first multimedia resources to display, and can also sort the displayed first multimedia resources in order to achieve the priority of displaying multimedia resources more in line with user needs .
  • the estimated click rate of the multimedia resource is used to achieve the above purpose. The higher the estimated click rate, the greater the possibility that the user clicks the multimedia resource, that is, the multimedia resource more meets the user's needs.
  • the server may, according to the identification information of the first multimedia resource, from the stored multimedia resource and the corresponding multimedia attribute information To obtain multimedia attribute information of the first multimedia resource.
  • step S403 the server calls the click-through rate estimation model.
  • the click-through rate prediction model provided in this embodiment may include an embedding layer and a click-through rate prediction network.
  • the embedding layer may include a weight matrix corresponding to at least one information type
  • the prediction network can be used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource.
  • the click prediction network may be a deep neural network, or a convolutional neural network, etc.
  • the technician can design the click rate prediction network according to requirements, and the specific network structure is not limited in this embodiment.
  • the server can also store multiple click-through rate prediction models, and the server can call the click-through rate prediction model that meets the preset conditions. For example, when the server also stores the accuracy of each click-through rate prediction model, Call any click rate prediction model that meets the accuracy rate greater than the preset threshold. This embodiment does not limit the specific method of calling the click rate model.
  • the click-through rate prediction model can be periodically trained, and the latest click-through rate prediction model obtained by training can be stored in the server.
  • the server needs to determine the estimated click rate of the first multimedia resource, it can call the latest click rate prediction model.
  • step S404 for each information type, the server inputs the information belonging to the information type in the user behavior information and multimedia attribute information into the weight matrix corresponding to the information type in the embedding layer, and outputs at least one embedding vector.
  • the server may input the user behavior information obtained in the above step S401 and the multimedia attribute information of the first multimedia resource obtained in step S402 into the embedding layer of the click rate prediction model.
  • the input data can be encoded in order to adapt to the calculation of the neural network.
  • the vector dimension obtained after encoding is large, so the dimensionality reduction process can be performed on the encoded vector through the weight matrix in the embedded layer, so as to reduce the IO (Input-Output) overhead in the processing process.
  • IO Input-Output
  • one-hot encoding of a feature information of the input data to obtain a vector (0,0,0,1,0,0,0,0,0,0), which can be converted into an embedded vector (0.145,0.152) through the weight matrix .
  • Each information type has a corresponding weight matrix.
  • the server can obtain the vector corresponding to the feature information of the same information type, input the vector corresponding to the same information type into the corresponding weight matrix, and output the embedded vector.
  • the information type may include a work identification, an author identification and / or a style identification.
  • the user behavior information is click history information, wherein each feature information is a work identifier clicked by the user, and the feature information of multimedia attribute information is a work of the multimedia resource logo.
  • the embedding mapping table shown in FIG. 7 is a weight matrix corresponding to the work identification.
  • the server can determine the embedding vector from the work identification in the user behavior information and the work identification of the multimedia attribute information through the embedding mapping table. That is, the weight matrix in the embedding layer used by the information of the work identification is the same, and it will not be distinguished by the domain of multimedia resources or the domain of users.
  • step S405 the server inputs at least one embedding vector output from the embedding layer into the click-through rate estimation network, and outputs the user's estimated click-through rate to the first multimedia resource.
  • the server After the server determines each embedding vector of the input data, it can input the embedding vector into the click-through rate estimation network, perform data processing through each network node in the click-through rate estimation network, and output the estimated click-through rate of the first multimedia resource.
  • the estimated click rate can be determined through the above steps S402-S405.
  • the plurality of first multimedia resources may be determined in parallel by means of parallel processing.
  • the order in which the plurality of first multimedia resources determine the estimated click rate is not limited.
  • the server may sort each first multimedia resource according to the order of the estimated click rate from large to small, and may divide each first multimedia resource
  • the physical resources and the corresponding order are sent to the terminal.
  • the terminal may display the received first multimedia resource according to the order.
  • the server may send the preset number of first multimedia resources ranked first and the corresponding order to the terminal, and the terminal may respond to the preset Set a number of first multimedia resources for display.
  • the first multimedia resource sent by the server to the terminal may be an abbreviated form corresponding to the first multimedia resource in order to reduce the consumption of network resources.
  • the server The image thumbnail may be sent; when the first multimedia resource is a short video, the server may send a preview image to the terminal.
  • the specific form of the first multimedia resource sent in this embodiment is not limited.
  • the method for determining the estimated click rate of the multimedia resource provided by this embodiment can improve the user's click rate of the recommended multimedia resource.
  • the user retention rate of the application can be improved.
  • the information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is estimated based on the method of this embodiment, the accuracy of the estimated click rate can be improved.
  • the above embodiment describes the process of determining the estimated click rate using the click rate prediction model.
  • the click rate prediction model can be trained.
  • This embodiment provides a training method for a click-through rate prediction model, which can be implemented by a server. As shown in the flowchart of the training method of the click-through rate prediction model shown in FIG. 8, the processing flow of this method may include the following steps:
  • step S801 the server acquires the initial model of the click-through rate estimation model.
  • the initial model may include an initial embedding layer and an initial click-through rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type.
  • the initial model of the click-through rate estimation model can be stored in the server.
  • the initial model may be a machine learning model designed by a technician to determine an estimated click rate, which takes user behavior information and multimedia attribute information as inputs, predicts the user's click rate on multimedia resources, and outputs the estimated click rate.
  • the model parameters in the initial model are all preset initial values, the accuracy of the predicted click-through rate is low, and the initial model needs to be trained.
  • step S802 the server acquires at least one training sample.
  • the training sample may include multimedia attribute information of the second multimedia resource, user behavior information of the sample user when browsing the second multimedia resource, and the click situation of the sample user on the second multimedia resource.
  • the click situation may include Clicked or not clicked. That is, the second multimedia resource may refer to a multimedia resource whose history is displayed to the sample user.
  • the server may record the multimedia attribute information of the second multimedia resource.
  • the server may also record the identification information of the second multimedia resource, and the identification information may be used to obtain multimedia attribute information of the second multimedia resource.
  • the server can also obtain the user behavior information of the user, and the user behavior information and the above information recorded by the server (such as multimedia attribute information of the second multimedia resource or identification information of the second multimedia resource) Record accordingly.
  • the server may receive a request to load the second multimedia resource, and further, may record the click of the second multimedia resource as clicked, as described above Recording corresponding to the information, for example, recording corresponding to multimedia attribute information, user behavior information, and click status of the second multimedia resource.
  • the terminal When the terminal closes the display interface, it can send a display close notification to the server.
  • the server receives the display close notification, it can obtain the unclicked second multimedia resource from the second multimedia resources sent, and send the corresponding Clicks are recorded as unclicked.
  • the server may also obtain the unclicked second multimedia resource when the preset duration is reached. This embodiment does not limit the specific manner in which the server obtains the unclicked second multimedia resource.
  • the server trains the click-through rate estimation model, it can obtain the multimedia attribute information of the second multimedia resource recorded in the above process, the user behavior information when the user browses the second multimedia resource, and the user ’s second multimedia
  • the click of physical resources is used as a training sample.
  • the server may use the training sample with the clicked condition as a positive sample, and the training sample with the clicked condition as an unclicked as a negative sample.
  • step S803 the server trains the initial model based on at least one training sample, and obtains a click-through rate estimation model.
  • the server can input the multimedia attribute information and user behavior information into the initial model, and perform data processing based on the model parameters of each network node in the initial model, to obtain the initial model for the second multimedia resource. Estimate click rate. Then, the server may determine the gradient of each model parameter in the initial model according to the user's click on the second multimedia resource in the training sample and the corresponding estimated click rate. The server can determine the correction value of each model parameter according to the gradient of each model parameter, and adjust the parameter of each model parameter based on the correction value, that is, the error back propagation.
  • step S803 may include steps S8031-S8033:
  • step S8031 for the initial weight matrix corresponding to each information type, the server adjusts the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the trained information type.
  • Each model parameter in the weight matrix can be adjusted during the training process. It has been introduced in the above embodiments that the feature information of the same information type can determine the embedding vector based on the same weight matrix, and then determine the estimated click rate.
  • the server can obtain training samples for determining the estimated click rate through the initial weight matrix, and the corresponding estimated click rate, which can then be estimated Click rate and actual user click, calculate the gradient of each model parameter in the initial weight matrix, determine the corresponding correction value according to the gradient, and adjust each model parameter.
  • the obtained weight matrix corresponding to each information type can be used to determine the embedding vector for the corresponding information.
  • the training matrix containing the information type can be fully utilized to train the weight matrix so that the weight matrix Do full learning.
  • step S8032 the server adjusts the parameters of the initial click-through rate estimation network based on at least one training sample to obtain the trained click-through rate estimation network.
  • each training sample can be processed through the click-through rate estimation network. Therefore, the server can obtain each training sample and the corresponding estimated click-through rate, and adjust the parameters of the initial click-through rate prediction network for specific processing. As mentioned above, it will not be repeated here.
  • step S8033 the server obtains a click rate prediction model based on the weight matrix corresponding to the at least one information type after training and the click rate prediction network after training.
  • the server can obtain each weight matrix in the current embedding layer, and the click-through rate estimation network, etc., to constitute a click-through rate estimation model , And the click-through rate estimation model can be stored.
  • the server needs to predict the multimedia resource, it can obtain the stored click rate prediction model for processing.
  • the server can also train the stored click-through rate estimation model again.
  • the training process is the same as the above process.
  • the server continuously updates the click-through rate estimation model, which can improve the accuracy of the click-through rate estimation model.
  • the server when determining the correction value of each model parameter in the initial weight matrix according to the gradient, the server may adjust the learning rate according to the gradient, for example, the learning rate adjustment method may be AdaGrad (Adaptive Gradient, adaptive learning rate) algorithm.
  • AdaGrad Adaptive Gradient, adaptive learning rate
  • the determined gradient is smaller, that is, the gradient changes more smoothly, and the server can increase the learning rate corresponding to the information type, that is, the amplitude of the correction value increases. Make the weight matrix get fuller gradient update.
  • the above method for adjusting the learning rate can also be applied in the above step S8032, so that the model parameters in the click-through rate estimation network can also be adjusted adaptively according to the gradient. .
  • the training goal of the initial model by the server may be to maximize AUC (Area Under the ROC Curve, area under the ROC curve; ROC, Receiver Operating Characteristic, receiver operating characteristics).
  • AUC may refer to the first training sample before the second training sample Probability.
  • the server After the server determines the estimated click-through rate for each training sample through the initial model, it can be arranged in order of the estimated click-through rate from large to small, and then according to the number of first training samples ranked before all second training samples, And the total number of training samples to determine the value of AUC.
  • the larger the AUC the more the first training samples are ranked before all the second training samples, that is, the higher the accuracy of the click rate prediction model.
  • the server can also determine the AUC based on other methods. For example, after the ROC curve is established based on the estimated click-through rate of the training sample, the area under the ROC curve is calculated by the integration method. This embodiment does not limit the specific method for determining the AUC.
  • the weight matrix corresponding to an information type can be trained based on training samples containing the information type. Since there is no field based on multimedia resources and user's field to divide the feature information of the same information type, the training samples can be fully utilized, so that the weight matrix of the embedding layer can be fully learned, the representativeness of the embedding vector can be improved, and then the click-through rate can be improved. Estimate the accuracy of the model.
  • Fig. 10 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
  • the device includes an acquisition unit 1010, a calling unit 1020, and a determination unit 1030.
  • the obtaining unit 1010 is configured to obtain user behavior information of the user
  • the obtaining unit 1010 is further configured to obtain multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
  • the calling unit 1020 is configured to call a click-through rate estimation model.
  • the click-through rate estimation model includes an embedding layer and a click-through rate estimation network.
  • the embedding layer includes a weight matrix corresponding to at least one information type.
  • the rate estimation network is used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource;
  • the determining unit 1030 is configured to input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
  • the determining unit 1030 is configured to:
  • Input at least one embedding vector output by the embedding layer into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
  • the device further includes a training unit, the training unit is configured to:
  • the training sample including multimedia attribute information of the second multimedia resource, user behavior information when the user browses the second multimedia resource, and the user's response to the second multimedia resource Clicks, including clicked or not clicked;
  • the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
  • the training unit is configured as:
  • the click-through rate estimation model is obtained.
  • the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
  • the information type includes work identification, author identification, and / or style identification.
  • the user behavior information includes click history information, attention information and / or favorite information
  • the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user
  • the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource
  • the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
  • the information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is predicted based on the method of this embodiment, the accuracy of the click rate prediction model is improved.
  • Fig. 11 is a block diagram of an apparatus 1100 for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
  • the device 1100 may be provided as a server.
  • the device 1100 includes a processing component 1122, which further includes one or more processors, and memory resources represented by the memory 1132, for storing instructions executable by the processing component 1122, such as application programs.
  • the application program stored in the memory 1132 may include one or more modules each corresponding to a set of instructions.
  • the processing component 1122 is configured to execute an instruction to execute the above method for determining the estimated click rate of the multimedia resource.
  • the device 1100 may also include a power component 1126 configured to perform power management of the device 1100, a wired or wireless network interface 1150 configured to connect the device 1100 to the network, and an input output (I / O) interface 1158.
  • the device 1100 can operate an operating system based on the memory 1132, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
  • a non-transitory computer-readable storage medium such as a memory including instructions, which can be executed by a processor in the server to complete the method for determining the estimated click rate of the multimedia resource.
  • the computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, or the like.
  • an application program / computer program product which includes one or more instructions, and the one or more instructions may be executed by a processor of the server to complete the above-mentioned estimated click rate of multimedia resources Determine the method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Strategic Management (AREA)
  • Finance (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Data Mining & Analysis (AREA)
  • Computational Linguistics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Computing Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Health & Medical Sciences (AREA)
  • Mathematical Physics (AREA)
  • Software Systems (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Artificial Intelligence (AREA)
  • Molecular Biology (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Game Theory and Decision Science (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • General Business, Economics & Management (AREA)
  • Information Transfer Between Computers (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

A multimedia resource estimated click through rate determination method and apparatus, and a server, relating to the field of information recommendation. The method comprises: obtaining user behavior information of the current user (301); obtaining multimedia attribute information of a first multimedia resource (302), the first multimedia resource being a multimedia resource to be recommended to the current user; calling a click through rate estimation model (303), the click through rate estimation model comprising an embedded layer and a click through rate estimation network, the embedded layer comprising a weight matrix corresponding to at least one information type, and the click through rate estimation network being used for taking an embedded vector output by the embedded layer as an input and outputting an estimated click through rate of the multimedia resource; and inputting the user behavior information and the multimedia attribute information into the click through rate estimation model, and outputting the estimated click through rate of the user on the first multimedia resource (304). According to the method, the accuracy of click through rate estimation can be improved.

Description

一种多媒体资源预估点击率的确定方法、装置及服务器Method, device and server for determining estimated click rate of multimedia resources
本公开要求于2018年11月06日提交中国专利局、申请号为201811314076.6发明名称为“多媒体资源预估点击率的确定方法、装置及服务器”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This disclosure requires the priority of the Chinese patent application filed on November 06, 2018 in the Chinese Patent Office with the application number 201811314076.6 and the invention titled "Multimedia Resource Estimated Click Rate Determination Method, Device and Server" Incorporated in this application.
技术领域Technical field
本公开涉及信息推荐领域,尤其涉及一种多媒体资源预估点击率的确定方法、装置及服务器。The present disclosure relates to the field of information recommendation, in particular to a method, device and server for determining the estimated click rate of multimedia resources.
背景技术Background technique
在信息推荐系统中,相关技术会对多媒体资源的点击率(Click Through Rate,CTR)进行预估,并将预估点击率较高的多媒体资源展示给用户,以便提高用户点击多媒体资源的概率,提高信息推荐的准确率。In the information recommendation system, related technologies will estimate the clickthrough rate (CTR) of multimedia resources, and display multimedia resources with a higher estimated clickthrough rate to users, so as to increase the probability of users clicking multimedia resources. Improve the accuracy of information recommendation.
对多媒体资源的点击率进行预估时,相关技术中常用基于深度神经网络(Deep Neural Network,DNN)的点击率预估模型,对预估点击率进行计算。点击率预估模型的输入数据可以分为不同的领域,例如,多媒体资源的领域和用户的领域。如图1所示的点击率预估模型示意图,输入数据可以通过嵌入(embedding)层转换为低维的嵌入向量,其中,不同的领域可以对应于嵌入层中不同的权重矩阵,一个领域中的各项特征信息可以通过对应的权重矩阵确定嵌入向量。特征信息与嵌入向量存在一一对应的关系,因此,领域对应的权重矩阵也可称为该领域的embedding映射表。最后,相关技术中可以将嵌入向量输入深度神经网络,输出多媒体资源的预估点击率。When predicting the click rate of multimedia resources, the related art commonly uses a deep neural network (DNN) click rate prediction model to calculate the estimated click rate. The input data of the click rate estimation model can be divided into different fields, for example, the field of multimedia resources and the field of users. As shown in the schematic diagram of the click rate prediction model shown in FIG. 1, the input data can be converted into a low-dimensional embedding vector through an embedding layer, where different fields can correspond to different weight matrices in the embedding layer. Each feature information can determine the embedding vector through the corresponding weight matrix. There is a one-to-one correspondence between feature information and embedding vectors. Therefore, the weight matrix corresponding to the domain can also be called the embedding mapping table of the domain. Finally, in the related art, the embedded vector can be input into the deep neural network to output the estimated click rate of the multimedia resource.
但是,在一个领域中,某些特征信息的数目可能较少,例如,当用户领域中的特征信息包括用户点击过的图片标识时,如果用户点击图片的次数较少,则采集到的图片标识的数目也较少。在对点击率预估模型进行训练时,数目较少的特征信息会导致对应的权重矩阵的学习不够充分,也即得到的嵌入向量的代表性较弱,从而导致在对多媒体资源的点击率进行预估时,得到的预估点击率的准确性较低。However, in a field, the number of certain feature information may be less. For example, when the feature information in the user field includes the image identifier that the user clicked, if the user clicks the image less times, the collected image identifier The number is also small. When training the click-through rate estimation model, a small number of feature information will lead to insufficient learning of the corresponding weight matrix, that is, the obtained embedding vector is weakly represented, which leads to When estimating, the accuracy of the estimated click-through rate is low.
发明内容Summary of the invention
本公开提供一种多媒体资源预估点击率的确定方法、装置及服务器,可以解决预估点击率的准确性较低的问题。The present disclosure provides a method, device and server for determining the estimated click rate of multimedia resources, which can solve the problem of low accuracy of the estimated click rate.
根据本公开实施例的第一方面,提供一种多媒体资源预估点击率的确定方法,包括:According to a first aspect of an embodiment of the present disclosure, a method for determining a multimedia resource estimated click rate includes:
获取用户的用户行为信息;Obtain user's user behavior information;
获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
将所述用户行为信息与所述多媒体属性信息输入所述点击率预估网络,输出所述用户对所述第一多媒体资源的预估点击率。Input the user behavior information and the multimedia attribute information into the click-through rate estimation network, and output the user's estimated click-through rate to the first multimedia resource.
可选的,所述将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率,包括:Optionally, inputting the user behavior information and the multimedia attribute information into the click-through rate estimation model, and outputting the user's estimated click-through rate to the first multimedia resource includes:
对于每种信息类型,将所述用户行为信息和所述多媒体属性信息中属于所述信息类型的信息,输入所述嵌入层中所述信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input the information of the user behavior information and the multimedia attribute information that belong to the information type into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
将所述嵌入层输出的至少一个嵌入向量输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。Input at least one embedding vector output by the embedding layer into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
可选的,所述点击率预估模型的训练方法包括:Optionally, the training method of the click-through rate prediction model includes:
获取所述点击率预估模型的初始模型;Acquiring the initial model of the click-through rate estimation model;
获取至少一个训练样本,所述训练样本包括第二多媒体资源的多媒体属性信息、样本用户浏览所述第二多媒体资源时的用户行为信息以及所述样本用户对所述第二多媒体资源的点击情况,所述点击情况包括已点击或未点击;Obtaining at least one training sample, the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型。Training the initial model based on the at least one training sample to obtain the click-through rate prediction model.
可选的,所述初始模型包括初始嵌入层以及初始点击率预估网络,所述 初始嵌入层包括至少一种信息类型对应的初始权重矩阵;Optionally, the initial model includes an initial embedding layer and an initial click rate prediction network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
所述基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型,包括:The training the initial model based on the at least one training sample to obtain the estimated click-through rate model includes:
对于每种信息类型对应的初始权重矩阵,基于包含所述信息类型的训练样本对所述初始权重矩阵进行参数调整,得到训练后的所述信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, adjust the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the information type after training;
基于所述至少一个训练样本对所述初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on the at least one training sample to obtain a trained click-through rate estimation network;
基于训练后的至少一种信息类型对应的权重矩阵以及所述训练后的点击率预估网络,得到所述点击率预估模型。Based on the weight matrix corresponding to at least one information type after training and the trained click-through rate estimation network, the click-through rate estimation model is obtained.
可选的,在对所述每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,所述第一信息类型对应的学习率大于所述第二信息类型对应的学习率。Optionally, during the parameter adjustment of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is less than the number of training samples containing the second information type, the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
可选的,所述信息类型包括作品标识、作者标识和/或风格标识。Optionally, the information type includes work identification, author identification, and / or style identification.
可选的,所述用户行为信息包括点击历史信息、关注信息和/或喜爱信息,所述点击历史信息用于表示用户点击的多媒体资源的多媒体属性信息,所述关注信息用于表示用户关注的多媒体资源的多媒体属性信息,所述喜爱信息用于表示用户喜爱的多媒体资源的多媒体属性信息。Optionally, the user behavior information includes click history information, attention information and / or favorite information, the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user, and the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource, the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
根据本公开实施例的第二方面,提供一种多媒体资源预估点击率的确定装置,包括:According to a second aspect of the embodiments of the present disclosure, there is provided an apparatus for determining an estimated click rate of multimedia resources, including:
获取单元,被配置为获取用户的用户行为信息;The obtaining unit is configured to obtain user behavior information of the user;
所述获取单元,还被配置为获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;The obtaining unit is further configured to obtain multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
调用单元,被配置为调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;The calling unit is configured to call a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction model The estimation network is used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource;
确定单元,被配置为将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The determining unit is configured to input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the estimated click-through rate of the user to the first multimedia resource.
可选的,所述确定单元,被配置为:Optionally, the determining unit is configured to:
对于每种信息类型,将所述用户行为信息和所述多媒体属性信息中属于所述信息类型的信息,输入所述嵌入层中所述信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input the information of the user behavior information and the multimedia attribute information that belong to the information type into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
将所述嵌入层输出的至少一个嵌入向量输入所述点击率预估网络,输出所述用户对所述第一多媒体资源的预估点击率。Input at least one embedding vector output by the embedding layer into the click-through rate estimation network, and output the user's estimated click-through rate to the first multimedia resource.
可选的,所述装置还包括训练单元,所述训练单元被配置为:Optionally, the device further includes a training unit, the training unit is configured to:
获取所述点击率预估模型的初始模型;Acquiring the initial model of the click-through rate estimation model;
获取至少一个训练样本,所述训练样本包括第二多媒体资源的多媒体属性信息、样本用户浏览所述第二多媒体资源时的用户行为信息以及所述样本用户对所述第二多媒体资源的点击情况,所述点击情况包括已点击或未点击;Obtaining at least one training sample, the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型。Training the initial model based on the at least one training sample to obtain the click-through rate prediction model.
可选的,所述初始模型包括初始嵌入层以及初始点击率预估网络,所述初始嵌入层包括至少一种信息类型对应的初始权重矩阵;Optionally, the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
所述训练单元被配置为:The training unit is configured to:
对于每种信息类型对应的初始权重矩阵,基于包含所述信息类型的训练样本对所述初始权重矩阵进行参数调整,得到训练后的所述信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, adjust the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the information type after training;
基于所述至少一个训练样本对所述初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on the at least one training sample to obtain a trained click-through rate estimation network;
基于训练后的至少一种信息类型对应的权重矩阵以及所述训练后的点击率预估网络,得到所述点击率预估模型。Based on the weight matrix corresponding to at least one information type after training and the trained click-through rate estimation network, the click-through rate estimation model is obtained.
可选的,在对所述每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,所述第一信息类型对应的学习率大于所述第二信息类型对应的学习率。Optionally, during the parameter adjustment of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is less than the number of training samples containing the second information type, the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
可选的,所述信息类型包括作品标识、作者标识和/或风格标识。Optionally, the information type includes work identification, author identification, and / or style identification.
可选的,所述用户行为信息包括点击历史信息、关注信息和/或喜爱信息,所述点击历史信息用于表示用户点击的多媒体资源的多媒体属性信息,所述关注信息用于表示用户关注的多媒体资源的多媒体属性信息,所述喜爱信息用于表示用户喜爱的多媒体资源的多媒体属性信息。Optionally, the user behavior information includes click history information, attention information and / or favorite information, the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user, and the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource, the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
根据本公开实施例的第三方面,提供一种服务器,包括:According to a third aspect of the embodiments of the present disclosure, a server is provided, including:
一个或多个处理器;One or more processors;
用于存储一个或多个处理器可执行指令的一个或多个存储器;One or more memories for storing one or more processor executable instructions;
其中,所述一个或多个处理器被配置为:Wherein, the one or more processors are configured as:
获取用户的用户行为信息;Obtain user's user behavior information;
获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
根据本公开实施例的第四方面,提供一种非临时性计算机可读存储介质,当所述存储介质中的指令由服务器的处理器执行时,使得服务器能够执行一种多媒体资源预估点击率的确定方法,所述方法包括:According to a fourth aspect of the embodiments of the present disclosure, there is provided a non-transitory computer-readable storage medium, which when the instructions in the storage medium are executed by the processor of the server, enables the server to execute a multimedia resource estimated click rate Determination method, the method includes:
获取用户的用户行为信息;Obtain user's user behavior information;
获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型, 输出所述用户对所述第一多媒体资源的预估点击率。Input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the estimated click-through rate of the user to the first multimedia resource.
根据本公开实施例的第五方面,提供一种应用程序/计算机程序产品,当应用程序/计算机程序产品在服务器在运行时,使得服务器执行一种多媒体资源预估点击率的确定方法,所述方法包括:According to a fifth aspect of the embodiments of the present disclosure, an application program / computer program product is provided. When the application program / computer program product is running on a server, the server is caused to perform a method for determining a multimedia resource estimated click rate. Methods include:
获取当前用户的用户行为信息;Obtain the user behavior information of the current user;
获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述当前用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the current user;
调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
本公开的实施例提供的技术方案可以包括以下有益效果:The technical solutions provided by the embodiments of the present disclosure may include the following beneficial effects:
在用户行为信息和多媒体属性信息中,相同信息类型的信息均可通过嵌入层中相同的权重矩阵确定嵌入向量,可以提高嵌入向量的代表性。从而,在基于本实施例的方法对多媒体资源的点击率进行预估时,提高预估点击率的准确性。In user behavior information and multimedia attribute information, information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is estimated based on the method of this embodiment, the accuracy of the estimated click rate is improved.
应当理解的是,以上的一般描述和后文的细节描述仅是示例性和解释性的,并不能限制本公开。It should be understood that the above general description and the following detailed description are only exemplary and explanatory, and do not limit the present disclosure.
附图说明BRIEF DESCRIPTION
为了更清楚地说明本公开实施例和现有技术的技术方案,下面对实施例和现有技术中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本公开的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly explain the embodiments of the present disclosure and the technical solutions of the prior art, the following briefly introduces the drawings required in the embodiments and the prior art. Obviously, the drawings in the following description are only For some of the disclosed embodiments, those of ordinary skill in the art can obtain other drawings based on these drawings without creative efforts.
图1是根据一示例性实施例示出的一种点击率预估模型示意图。Fig. 1 is a schematic diagram of a click-through rate estimation model according to an exemplary embodiment.
图2是根据一示例性实施例示出的一种实施环境图。Fig. 2 is a diagram of an implementation environment according to an exemplary embodiment.
图3是根据一示例性实施例示出的一种多媒体资源预估点击率的确定方 法流程图。Fig. 3 is a flow chart of a method for determining a multimedia resource estimated click rate according to an exemplary embodiment.
图4是根据一示例性实施例示出的一种多媒体资源预估点击率的确定方法流程图。Fig. 4 is a flow chart of a method for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
图5是根据一示例性实施例示出的一种应用程序界面示意图。Fig. 5 is a schematic diagram of an application program interface according to an exemplary embodiment.
图6是根据一示例性实施例示出的一种点击率预估模型示意图。Fig. 6 is a schematic diagram of a click rate prediction model according to an exemplary embodiment.
图7是根据一示例性实施例示出的一种点击率预估模型示意图。Fig. 7 is a schematic diagram of a click rate prediction model according to an exemplary embodiment.
图8是根据一示例性实施例示出的一种点击率预估模型的训练方法流程图。Fig. 8 is a flow chart of a method for training a click rate prediction model according to an exemplary embodiment.
图9是根据一示例性实施例示出的一种点击率预估模型的训练方法流程图。Fig. 9 is a flow chart of a method for training a click rate prediction model according to an exemplary embodiment.
图10是根据一示例性实施例示出的一种多媒体资源预估点击率的确定装置框图。Fig. 10 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
图11是根据一示例性实施例示出的一种用于确定多媒体资源预估点击率的装置的框图。Fig. 11 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment.
具体实施方式detailed description
为使本公开的目的、技术方案、及优点更加清楚明白,以下参照附图并举实施例,对本公开进一步详细说明。显然,所描述的实施例仅仅是本公开一部分实施例,而不是全部的实施例。基于本公开中的实施例,本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例,都属于本公开保护的范围。In order to make the objectives, technical solutions, and advantages of the present disclosure clearer, the disclosure will be further described in detail below with reference to the accompanying drawings and embodiments. Obviously, the described embodiments are only a part of the embodiments of the present disclosure, but not all the embodiments. Based on the embodiments in the present disclosure, all other embodiments obtained by a person of ordinary skill in the art without creative efforts fall within the protection scope of the present disclosure.
本实施例提供了一种多媒体资源预估点击率的确定方法的实施环境图,该实施环境图如图2所示。该实施环境可以包括多个终端201、用于为该多个终端201提供服务的服务器202。多个终端201通过无线或者有线网络和服务器202连接,该多个终端201可以为能够访问服务器202的计算机设备或智能终端等。终端201中可以安装有用于推荐多媒体资源(如图片、短视频等)的应用程序,用户可以登录上述应用程序。服务器202可以为上述应用程序提供后台服务,并记录各个用户的用户行为信息。服务器202中还可以具有至少一种数据库,用以存储点击率预估模型、多媒体资源及对应的多媒体属性信息、各个用户的用户行为信息等等。This embodiment provides an implementation environment diagram of a method for determining a multimedia resource estimated click rate. The implementation environment diagram is shown in FIG. 2. The implementation environment may include multiple terminals 201 and a server 202 for providing services for the multiple terminals 201. A plurality of terminals 201 are connected to the server 202 through a wireless or wired network, and the plurality of terminals 201 may be computer devices or smart terminals that can access the server 202. The terminal 201 may be installed with an application program for recommending multimedia resources (such as pictures, short videos, etc.), and the user may log in to the above application program. The server 202 can provide background services for the above-mentioned applications and record user behavior information of each user. The server 202 may also have at least one database for storing click-through rate estimation models, multimedia resources and corresponding multimedia attribute information, user behavior information of various users, and so on.
本实施例提供了一种多媒体资源预估点击率的确定方法,该方法可以由服务器实现,如图3所示的多媒体资源预估点击率的确定方法流程图,该方法的处理流程可以包括如下的步骤:This embodiment provides a method for determining the estimated click rate of multimedia resources. This method may be implemented by a server. As shown in FIG. 3, a flowchart of a method for determining the estimated click rate of multimedia resources. The processing flow of the method may include the following A step of:
在步骤S301中,服务器获取用户的用户行为信息。In step S301, the server acquires user behavior information of the user.
在步骤S302中,服务器获取第一多媒体资源的多媒体属性信息。In step S302, the server acquires multimedia attribute information of the first multimedia resource.
其中,第一多媒体资源为待推荐给用户的多媒体资源。The first multimedia resource is a multimedia resource to be recommended to the user.
在步骤S303中,服务器调用点击率预估模型。In step S303, the server calls the click-through rate estimation model.
其中,点击率预估模型包括嵌入层和点击率预估网络,嵌入层包括至少一种信息类型对应的权重矩阵,点击率预估网络用于将嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率。The click-through rate prediction model includes an embedding layer and a click-through rate prediction network. The embedding layer includes a weight matrix corresponding to at least one information type. The click-through rate prediction network is used to take the embedding vector output by the embedding layer as input and output multimedia resources Estimated clickthrough rate.
在步骤S304中,服务器将用户行为信息与多媒体属性信息输入点击率预估模型,输出用户对第一多媒体资源的预估点击率。In step S304, the server inputs user behavior information and multimedia attribute information into the click-through rate estimation model, and outputs the user's estimated click-through rate to the first multimedia resource.
可选的,将用户行为信息与多媒体属性信息输入点击率预估网络,输出用户对第一多媒体资源的预估点击率包括:Optionally, inputting user behavior information and multimedia attribute information into the click-through rate estimation network, and outputting the user's estimated click-through rate to the first multimedia resource include:
对于每种信息类型,将用户行为信息和多媒体属性信息中属于信息类型的信息,输入嵌入层中信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input information belonging to the information type in the user behavior information and multimedia attribute information into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
将嵌入层输出的至少一个嵌入向量输入点击率预估模型,输出用户对第一多媒体资源的预估点击率。The at least one embedding vector output from the embedding layer is input into the click rate estimation model, and the user's estimated click rate of the first multimedia resource is output.
可选的,点击率预估模型的训练方法包括:Optionally, the training methods of the CTR prediction model include:
获取点击率预估模型的初始模型;Get the initial model of the click-through rate estimation model;
获取至少一个训练样本,训练样本包括第二多媒体资源的多媒体属性信息、样本用户浏览第二多媒体资源时的用户行为信息以及样本用户对第二多媒体资源的点击情况,点击情况包括已点击或未点击;Obtain at least one training sample. The training sample includes multimedia attribute information of the second multimedia resource, user behavior information when the sample user browses the second multimedia resource, and the click situation of the sample user on the second multimedia resource. Including clicked or not clicked;
基于至少一个训练样本对初始模型进行训练,得到点击率预估模型。The initial model is trained based on at least one training sample to obtain a click-through rate estimation model.
可选的,初始模型包括初始嵌入层以及初始点击率预估网络,初始嵌入层包括至少一种信息类型对应的初始权重矩阵;Optionally, the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
基于至少一个训练样本对初始模型进行训练,得到点击率预估模型,包括:Training the initial model based on at least one training sample to obtain a click-through rate estimation model, including:
对于每种信息类型对应的初始权重矩阵,基于包含信息类型的训练样本 对初始权重矩阵进行参数调整,得到训练后的信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, the initial weight matrix is adjusted based on the training samples containing the information type to obtain the weight matrix corresponding to the trained information type;
基于至少一个训练样本对初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on at least one training sample to obtain the trained click-through rate estimation network;
基于训练后的至少一种信息类型对应的权重矩阵以及训练后的点击率预估网络,得到点击率预估模型。Based on the weight matrix corresponding to the at least one information type after training and the click-through rate prediction network after training, a click-through rate prediction model is obtained.
可选的,在对每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,第一信息类型对应的学习率大于所述第二信息类型对应的学习率。Optionally, during the parameter adjustment process of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is less than the number of training samples containing the second information type, the first information type corresponds to The learning rate of is greater than the learning rate corresponding to the second information type.
可选的,信息类型包括作品标识、作者标识和/或风格标识。Optionally, the information type includes work identification, author identification and / or style identification.
可选的,用户行为信息包括点击历史信息、关注信息和/或喜爱信息,点击历史信息包括用户点击任一多媒体资源时记录的信息,关注信息包括用户点击关注选项时记录的信息,喜爱信息包括用户点击喜爱选项时记录的信息。Optionally, the user behavior information includes click history information, attention information and / or favorite information, the click history information includes information recorded when the user clicks on any multimedia resource, the attention information includes information recorded when the user clicks on the attention option, and the favorite information includes Information recorded when the user clicks the favorite option.
本实施例将结合具体的实施方式,对多媒体资源预估点击率的确定方法进行介绍。该方法可以由服务器实现,如图4所示的多媒体资源预估点击率的确定方法流程图,该方法的处理流程可以包括如下的步骤:In this embodiment, a method for determining the estimated click rate of multimedia resources will be introduced in conjunction with a specific implementation manner. The method may be implemented by a server, as shown in the flowchart of the method for determining the estimated click rate of multimedia resources shown in FIG. 4, the processing flow of the method may include the following steps:
在步骤S401中,服务器获取用户的用户行为信息。In step S401, the server acquires user behavior information of the user.
用户开启应用程序时,终端可以向服务器发送多媒体资源的获取请求;或者,用户通过应用程序搜索多媒体资源时,终端也可以向服务器发送多媒体资源的获取请求。服务器接收到终端发送的多媒体资源的获取请求时,可以触发推荐多媒体资源的处理逻辑,以便通过本实施例提供的多媒体资源预估点击率的确定方法进行处理。本实施例对触发推荐多媒体资源的处理逻辑的具体方式不作限定。When the user starts the application, the terminal can send a multimedia resource acquisition request to the server; or, when the user searches for multimedia resources through the application, the terminal can also send the multimedia resource acquisition request to the server. When the server receives the multimedia resource acquisition request sent by the terminal, it may trigger processing logic of the recommended multimedia resource, so as to be processed by the method for determining the estimated click rate of the multimedia resource provided in this embodiment. This embodiment does not limit the specific manner of triggering the processing logic of recommending multimedia resources.
此时,服务器可以根据用户的标识信息,从存储的各个用户的用户行为信息中,获取该用户的用户行为信息。在一种可能的实施方式中,用户行为信息可以包括点击历史信息、关注信息和/或喜爱信息,点击历史信息可以用于表示用户点击的多媒体资源的多媒体属性信息,关注信息可以用于表示用户关注的多媒体资源的多媒体属性信息,喜爱信息可以用于表示用户喜爱的多媒体资源的多媒体属性信息。At this time, the server may obtain the user behavior information of each user from the stored user behavior information of each user according to the user's identification information. In a possible implementation manner, the user behavior information may include click history information, attention information and / or favorite information, the click history information may be used to represent multimedia attribute information of the multimedia resource clicked by the user, and the attention information may be used to represent the user The multimedia attribute information and favorite information of the focused multimedia resource can be used to represent the multimedia attribute information of the user's favorite multimedia resource.
可选的,对于服务器记录的多媒体资源的多媒体属性信息,信息类型可以包括作品标识、作者标识和/或风格标识。Optionally, for the multimedia attribute information of the multimedia resource recorded by the server, the information type may include work identification, author identification, and / or style identification.
下面对服务器记录上述信息的过程进行介绍:The following describes the process of the server recording the above information:
如图5所示的应用程序界面示意图,终端可以展示应用程序提供的多媒体资源,并且在多媒体资源的展示界面中,可以包括关注选项和喜爱选项。As shown in the schematic diagram of the application program interface shown in FIG. 5, the terminal can display the multimedia resources provided by the application program, and the display interface of the multimedia resources can include attention options and favorite options.
当用户点击查看该多媒体资源时,服务器可以接收到加载该多媒体资源的请求,进而可以将对应的多媒体属性信息添加到该用户的点击历史信息当中进行存储。例如,当用户点击观看一个短视频时,服务器可以将该短视频的作品标识、作者标识和/或风格标识添加到该用户的点击历史信息中。When the user clicks to view the multimedia resource, the server may receive a request to load the multimedia resource, and then may add corresponding multimedia attribute information to the user's click history information for storage. For example, when the user clicks to watch a short video, the server may add the work identification, author identification, and / or style identification of the short video to the user's click history information.
如果用户需要订阅该多媒体资源,则可以点击展示界面中的关注选项。进而,服务器可以接收到对该多媒体资源的关注添加请求,将对应的多媒体属性信息添加到该用户的关注信息当中进行存储。当然,关注选项除了可以是针对于多媒体资源的关注,也可以是针对于多媒体资源的作者或风格的关注,本实施例对此不作限定。例如,如果关注选项是针对于多媒体资源的作者的关注,则服务器可以将该多媒体资源的作者标识添加到用户的关注信息中,在此之后,当该作者更新作品时,用户可以接收到相应的更新通知,达到订阅的效果。If the user needs to subscribe to the multimedia resource, they can click the follow option in the display interface. Furthermore, the server may receive a request to add attention to the multimedia resource, and add the corresponding multimedia attribute information to the attention information of the user for storage. Of course, the attention option may be not only attention to multimedia resources, but also attention to authors or styles of multimedia resources, which is not limited in this embodiment. For example, if the follow option is for the attention of the author of the multimedia resource, the server can add the author ID of the multimedia resource to the user's attention information, and after that, when the author updates the work, the user can receive the corresponding Update notifications to achieve the effect of subscription.
如果用户喜爱该多媒体资源,则可以点击展示界面中的喜爱选项,服务器添加对应的喜爱信息的处理,与添加关注信息的处理类似,此处不再赘述。If the user likes the multimedia resource, he can click the favorite option in the display interface, and the server adds the corresponding favorite information processing, which is similar to the processing of adding attention information, and will not be repeated here.
当然,上述记录点击历史信息、关注信息和喜爱信息的方案均为可选方案,也即,服务器记录的用户行为信息可以是点击历史信息、关注信息和喜爱信息中的一种或多种。其中包含的信息类型同样也可以是作品标识、作者标识和风格标识中的一种或多种,还可以包括其它信息类型,本实施例对具体的信息类型不作限定。Of course, the above solutions for recording click history information, attention information, and favorite information are all optional solutions, that is, the user behavior information recorded by the server may be one or more of click history information, attention information, and favorite information. The type of information contained therein may also be one or more of a work identification, an author identification and a style identification, and may also include other information types. The specific information type is not limited in this embodiment.
在步骤S402中,服务器获取第一多媒体资源的多媒体属性信息。In step S402, the server acquires multimedia attribute information of the first multimedia resource.
其中,第一多媒体资源为待推荐给用户的多媒体资源。多媒体属性信息可以是在作者上传对应的多媒体资源时作者主动添加得到,或服务器根据多媒体资源的作者、内容等信息自动生成,本实施例对多媒体属性信息的生成方式不作限定。The first multimedia resource is a multimedia resource to be recommended to the user. The multimedia attribute information may be obtained by the author when the author uploads the corresponding multimedia resource, or the server automatically generates the information based on the author and content of the multimedia resource. This embodiment does not limit the generation method of the multimedia attribute information.
第一多媒体资源可以包括多个多媒体资源,例如,可以是当前热门的多媒体资源,或者可以是对用户的搜索得到的搜索结果。由于用户可以查看的多媒体资源有限,服务器可以确定展示哪些第一多媒体资源,并且还可以对展示的第一多媒体资源进行排序,以便达到将更加符合用户需求的多媒体资源优先展示的目的。本实施例中利用多媒体资源的预估点击率来实现上述目的,预估点击率越高,用户点击该多媒体资源的可能性越大,也即该多媒体资源越符合用户的需求。The first multimedia resource may include multiple multimedia resources, for example, it may be a currently popular multimedia resource, or it may be a search result obtained by searching a user. Since the multimedia resources that the user can view are limited, the server can determine which first multimedia resources to display, and can also sort the displayed first multimedia resources in order to achieve the priority of displaying multimedia resources more in line with user needs . In this embodiment, the estimated click rate of the multimedia resource is used to achieve the above purpose. The higher the estimated click rate, the greater the possibility that the user clicks the multimedia resource, that is, the multimedia resource more meets the user's needs.
在确定第一多媒体资源的预估点击率时,对于一个第一多媒体资源,服务器可以根据该第一多媒体资源的标识信息,从存储的多媒体资源及对应的多媒体属性信息中,获取该第一多媒体资源的多媒体属性信息。When determining the estimated click rate of the first multimedia resource, for a first multimedia resource, the server may, according to the identification information of the first multimedia resource, from the stored multimedia resource and the corresponding multimedia attribute information To obtain multimedia attribute information of the first multimedia resource.
在步骤S403中,服务器调用点击率预估模型。In step S403, the server calls the click-through rate estimation model.
如图6所示的点击率预估模型示意图,本实施例提供的点击率预估模型可以包括嵌入层和点击率预估网络,嵌入层可以包括至少一种信息类型对应的权重矩阵,点击率预估网络可以用于将嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率。点击预估网络可以是深度神经网络,也可以是卷积神经网络等,技术人员可以根据需求对点击率预估网络进行设计,本实施例对具体的网络结构不作限定。As shown in the schematic diagram of the click-through rate prediction model shown in FIG. 6, the click-through rate prediction model provided in this embodiment may include an embedding layer and a click-through rate prediction network. The embedding layer may include a weight matrix corresponding to at least one information type The prediction network can be used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource. The click prediction network may be a deep neural network, or a convolutional neural network, etc. The technician can design the click rate prediction network according to requirements, and the specific network structure is not limited in this embodiment.
当然,服务器中也可以存储有多个点击率预估模型,服务器可以调用满足预设条件的点击率预估模型,例如,当服务器中还存储有各个点击率预估模型的准确率时,可以调用满足准确率大于预设阈值的任一点击率预估模型。本实施例对调用点击率模型的具体方式不作限定。Of course, the server can also store multiple click-through rate prediction models, and the server can call the click-through rate prediction model that meets the preset conditions. For example, when the server also stores the accuracy of each click-through rate prediction model, Call any click rate prediction model that meets the accuracy rate greater than the preset threshold. This embodiment does not limit the specific method of calling the click rate model.
点击率预估模型可以周期性进行训练,服务器中可以存储有训练得到的最新的点击率预估模型。当服务器需要确定第一多媒体资源的预估点击率时,可以调用最新的点击率预估模型。The click-through rate prediction model can be periodically trained, and the latest click-through rate prediction model obtained by training can be stored in the server. When the server needs to determine the estimated click rate of the first multimedia resource, it can call the latest click rate prediction model.
在步骤S404中,对于每种信息类型,服务器将用户行为信息和多媒体属性信息中属于该信息类型的信息,输入嵌入层中该信息类型对应的权重矩阵,输出至少一个嵌入向量。In step S404, for each information type, the server inputs the information belonging to the information type in the user behavior information and multimedia attribute information into the weight matrix corresponding to the information type in the embedding layer, and outputs at least one embedding vector.
服务器可以将上述步骤S401中获取到的用户行为信息,以及步骤S402中获取到的第一多媒体资源的多媒体属性信息,输入点击率预估模型的嵌入 层。The server may input the user behavior information obtained in the above step S401 and the multimedia attribute information of the first multimedia resource obtained in step S402 into the embedding layer of the click rate prediction model.
在嵌入层中,可以将输入数据进行编码,以便适应神经网络的计算。一般而言,编码后得到的向量维度较大,因此可以通过嵌入层中的权重矩阵对编码后的向量进行降维处理,以便减少处理过程中的IO(Input-Output,输入/输出)开销。例如,对输入数据的一项特征信息进行独热编码后得到向量(0,0,0,1,0,0,0,0,0),通过权重矩阵可以转换为嵌入向量(0.145,0.152)。In the embedding layer, the input data can be encoded in order to adapt to the calculation of the neural network. Generally speaking, the vector dimension obtained after encoding is large, so the dimensionality reduction process can be performed on the encoded vector through the weight matrix in the embedded layer, so as to reduce the IO (Input-Output) overhead in the processing process. For example, one-hot encoding of a feature information of the input data to obtain a vector (0,0,0,1,0,0,0,0,0), which can be converted into an embedded vector (0.145,0.152) through the weight matrix .
每种信息类型具有对应的权重矩阵,在用户行为信息和多媒体属性信息中,服务器可以获取相同信息类型的特征信息对应的向量,将同一信息类型对应的向量输入对应的权重矩阵,输出嵌入向量。Each information type has a corresponding weight matrix. In user behavior information and multimedia attribute information, the server can obtain the vector corresponding to the feature information of the same information type, input the vector corresponding to the same information type into the corresponding weight matrix, and output the embedded vector.
上述步骤S401中已经介绍了,在一种可能的实施方式中,信息类型可以包括作品标识、作者标识和/或风格标识。示例性的,如图7所示的点击率预估模型示意图,用户行为信息为点击历史信息,其中每个特征信息为用户点击过的作品标识,多媒体属性信息的特征信息为该多媒体资源的作品标识。图7所示的嵌入(embedding)映射表为作品标识对应的权重矩阵,服务器可以将用户行为信息中的作品标识以及多媒体属性信息的作品标识,均通过该embedding映射表确定嵌入向量。也即,作品标识的信息所使用的嵌入层中的权重矩阵相同,不会以多媒体资源的领域或用户的领域进行区分。As mentioned above in step S401, in a possible implementation manner, the information type may include a work identification, an author identification and / or a style identification. Exemplarily, as shown in the schematic diagram of the click-through rate estimation model shown in FIG. 7, the user behavior information is click history information, wherein each feature information is a work identifier clicked by the user, and the feature information of multimedia attribute information is a work of the multimedia resource Logo. The embedding mapping table shown in FIG. 7 is a weight matrix corresponding to the work identification. The server can determine the embedding vector from the work identification in the user behavior information and the work identification of the multimedia attribute information through the embedding mapping table. That is, the weight matrix in the embedding layer used by the information of the work identification is the same, and it will not be distinguished by the domain of multimedia resources or the domain of users.
在步骤S405中,服务器将嵌入层输出的至少一个嵌入向量输入点击率预估网络,输出用户对第一多媒体资源的预估点击率。In step S405, the server inputs at least one embedding vector output from the embedding layer into the click-through rate estimation network, and outputs the user's estimated click-through rate to the first multimedia resource.
当服务器确定输入数据的各个嵌入向量后,可以将嵌入向量输入点击率预估网络,通过点击率预估网络中的各个网络节点进行数据处理,输出第一多媒体资源的预估点击率。After the server determines each embedding vector of the input data, it can input the embedding vector into the click-through rate estimation network, perform data processing through each network node in the click-through rate estimation network, and output the estimated click-through rate of the first multimedia resource.
对于每个第一多媒体资源,均可通过上述步骤S402-S405确定预估点击率。多个第一多媒体资源可以是通过并行处理的方式确定预估点击率,本实施例对多个第一多媒体资源确定预估点击率的顺序不作限定。For each first multimedia resource, the estimated click rate can be determined through the above steps S402-S405. The plurality of first multimedia resources may be determined in parallel by means of parallel processing. In this embodiment, the order in which the plurality of first multimedia resources determine the estimated click rate is not limited.
服务器确定每个第一多媒体资源的预估点击率后,可以按照预估点击率从大到小的顺序对每个第一多媒体资源进行排序,并且可以将每个第一多媒体资源以及对应的排序发送给终端。进而,终端可以按照该排序对接收到的第一多媒体资源进行展示。可选的,当展示的多媒体资源的数目为预设数目 时,服务器可以将排序靠前的预设数目个第一多媒体资源以及对应的排序发送给终端,终端可以按照该排序对该预设数目个第一多媒体资源进行展示。After the server determines the estimated click rate of each first multimedia resource, it may sort each first multimedia resource according to the order of the estimated click rate from large to small, and may divide each first multimedia resource The physical resources and the corresponding order are sent to the terminal. Furthermore, the terminal may display the received first multimedia resource according to the order. Optionally, when the number of displayed multimedia resources is a preset number, the server may send the preset number of first multimedia resources ranked first and the corresponding order to the terminal, and the terminal may respond to the preset Set a number of first multimedia resources for display.
当然,服务器向终端发送的第一多媒体资源可以是第一多媒体资源对应的缩略形式,以便减少网络资源的消耗,例如,当第一多媒体资源为图片时,服务器向终端发送的可以是图片的缩略图;当第一多媒体资源为短视频时,服务器向终端发送的可以是预览图。本实施例发送的第一多媒体资源的具体形式不作限定。Of course, the first multimedia resource sent by the server to the terminal may be an abbreviated form corresponding to the first multimedia resource in order to reduce the consumption of network resources. For example, when the first multimedia resource is a picture, the server The image thumbnail may be sent; when the first multimedia resource is a short video, the server may send a preview image to the terminal. The specific form of the first multimedia resource sent in this embodiment is not limited.
终端展示第一多媒体资源后,当前用户优先看到的可以是预估点击率较高的第一多媒体资源。因此,通过本实施例提供的多媒体资源预估点击率的确定方法,可以提高用户对推荐的多媒体资源的点击率。当上述方法应用在应用程序中时,可以提高应用程序的用户留存率。After the terminal displays the first multimedia resource, the current user may first see the first multimedia resource with a higher estimated click rate. Therefore, the method for determining the estimated click rate of the multimedia resource provided by this embodiment can improve the user's click rate of the recommended multimedia resource. When the above method is applied to an application, the user retention rate of the application can be improved.
本实施例中,在用户行为信息和多媒体属性信息中,相同信息类型的信息均可通过嵌入层中相同的权重矩阵确定嵌入向量,可以提高嵌入向量的代表性。从而,在基于本实施例的方法对多媒体资源的点击率进行预估时,可以提高预估点击率的准确率。In this embodiment, in the user behavior information and multimedia attribute information, the information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is estimated based on the method of this embodiment, the accuracy of the estimated click rate can be improved.
上述实施例中介绍了利用点击率预估模型确定预估点击率的过程,在使用点击率预估模型之前,可以对该点击率预估模型进行训练。本实施例提供了一种点击率预估模型的训练方法,该方法可以由服务器实现。如图8所示的点击率预估模型的训练方法流程图,该方法的处理流程可以包括如下的步骤:The above embodiment describes the process of determining the estimated click rate using the click rate prediction model. Before using the click rate prediction model, the click rate prediction model can be trained. This embodiment provides a training method for a click-through rate prediction model, which can be implemented by a server. As shown in the flowchart of the training method of the click-through rate prediction model shown in FIG. 8, the processing flow of this method may include the following steps:
在步骤S801中,服务器获取点击率预估模型的初始模型。In step S801, the server acquires the initial model of the click-through rate estimation model.
其中,与上述实施例中介绍的点击率预估模型相对应的,初始模型可以包括初始嵌入层以及初始点击率预估网络,初始嵌入层包括至少一种信息类型对应的初始权重矩阵。Corresponding to the click-through rate estimation model described in the above embodiment, the initial model may include an initial embedding layer and an initial click-through rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type.
服务器中可以存储有点击率预估模型的初始模型。该初始模型可以是技术人员设计的用于确定预估点击率的机器学习模型,以用户行为信息以及多媒体属性信息作为输入,预测用户对多媒体资源的点击率,并输出预估点击率。但由于初始模型中的模型参数均为预设的初始值,预测的点击率准确性 较低,需要对初始模型进行训练。The initial model of the click-through rate estimation model can be stored in the server. The initial model may be a machine learning model designed by a technician to determine an estimated click rate, which takes user behavior information and multimedia attribute information as inputs, predicts the user's click rate on multimedia resources, and outputs the estimated click rate. However, since the model parameters in the initial model are all preset initial values, the accuracy of the predicted click-through rate is low, and the initial model needs to be trained.
在步骤S802中,服务器获取至少一个训练样本。In step S802, the server acquires at least one training sample.
其中,训练样本可以包括第二多媒体资源的多媒体属性信息、样本用户浏览第二多媒体资源时的用户行为信息以及样本用户对第二多媒体资源的点击情况,点击情况可以包括已点击或未点击。也即,第二多媒体资源可以是指历史展示给样本用户的多媒体资源。The training sample may include multimedia attribute information of the second multimedia resource, user behavior information of the sample user when browsing the second multimedia resource, and the click situation of the sample user on the second multimedia resource. The click situation may include Clicked or not clicked. That is, the second multimedia resource may refer to a multimedia resource whose history is displayed to the sample user.
下面对服务器记录训练样本中的信息的过程进行介绍:The following describes the process by which the server records the information in the training samples:
每当服务器向终端发送用于展示的第二多媒体资源时,可以记录该第二多媒体资源的多媒体属性信息。当然,服务器也可以记录该第二多媒体资源的标识信息,该标识信息可以用于获取第二多媒体资源的多媒体属性信息。Whenever the server sends the second multimedia resource for presentation to the terminal, it may record the multimedia attribute information of the second multimedia resource. Of course, the server may also record the identification information of the second multimedia resource, and the identification information may be used to obtain multimedia attribute information of the second multimedia resource.
此时,服务器还可以获取用户的用户行为信息,并且可以将该用户行为信息与服务器记录的上述信息(如第二多媒体资源的多媒体属性信息,或者第二多媒体资源的标识信息)对应的进行记录。At this time, the server can also obtain the user behavior information of the user, and the user behavior information and the above information recorded by the server (such as multimedia attribute information of the second multimedia resource or identification information of the second multimedia resource) Record accordingly.
当用户点击查看任一第二多媒体资源时,服务器可以接收到加载该第二多媒体资源的请求,进而,可以将该第二多媒体资源的点击情况记录为已点击,与上述信息对应的进行记录,例如,将第二多媒体资源的多媒体属性信息、用户行为信息以及点击情况对应的进行记录。When the user clicks to view any second multimedia resource, the server may receive a request to load the second multimedia resource, and further, may record the click of the second multimedia resource as clicked, as described above Recording corresponding to the information, for example, recording corresponding to multimedia attribute information, user behavior information, and click status of the second multimedia resource.
终端关闭展示界面时,可以向服务器发送展示关闭通知,服务器接收到该展示关闭通知时,可以在发送的第二多媒体资源中,获取未点击的第二多媒体资源,并将相应的点击情况记录为未点击。或者,服务器还可以在向终端发送用于展示的第二多媒体资源之后,达到预设时长时,获取未点击的第二多媒体资源。本实施例对服务器获取未点击的第二多媒体资源的具体方式不作限定。When the terminal closes the display interface, it can send a display close notification to the server. When the server receives the display close notification, it can obtain the unclicked second multimedia resource from the second multimedia resources sent, and send the corresponding Clicks are recorded as unclicked. Alternatively, after sending the second multimedia resource for display to the terminal, the server may also obtain the unclicked second multimedia resource when the preset duration is reached. This embodiment does not limit the specific manner in which the server obtains the unclicked second multimedia resource.
当服务器对点击率预估模型进行训练时,可以获取上述过程中记录的第二多媒体资源的多媒体属性信息、用户浏览第二多媒体资源时的用户行为信息以及用户对第二多媒体资源的点击情况,作为训练样本。可选的,服务器可以将点击情况为已点击的训练样本作为正样本,将点击情况为未点击的训练样本作为负样本。When the server trains the click-through rate estimation model, it can obtain the multimedia attribute information of the second multimedia resource recorded in the above process, the user behavior information when the user browses the second multimedia resource, and the user ’s second multimedia The click of physical resources is used as a training sample. Optionally, the server may use the training sample with the clicked condition as a positive sample, and the training sample with the clicked condition as an unclicked as a negative sample.
在步骤S803中,服务器基于至少一个训练样本对初始模型进行训练,得 到点击率预估模型。In step S803, the server trains the initial model based on at least one training sample, and obtains a click-through rate estimation model.
对于每个训练样本,服务器可以将其中的多媒体属性信息和用户行为信息输入初始模型,并基于初始模型中各个网络节点的模型参数进行数据处理,得到该初始模型对第二多媒体资源的预估点击率。然后,服务器可以根据训练样本中用户对第二多媒体资源的点击情况,以及对应的预估点击率,确定初始模型中各个模型参数的梯度。服务器可以根据各个模型参数的梯度,确定各个模型参数的修正值,并基于修正值对各个模型参数进行参数调整,也即误差反向传播。For each training sample, the server can input the multimedia attribute information and user behavior information into the initial model, and perform data processing based on the model parameters of each network node in the initial model, to obtain the initial model for the second multimedia resource. Estimate click rate. Then, the server may determine the gradient of each model parameter in the initial model according to the user's click on the second multimedia resource in the training sample and the corresponding estimated click rate. The server can determine the correction value of each model parameter according to the gradient of each model parameter, and adjust the parameter of each model parameter based on the correction value, that is, the error back propagation.
在一种可能的实施方式中,如图9所示的点击率预估模型的训练方法流程图,步骤S803可以包括步骤S8031-S8033:In a possible implementation manner, as shown in the flowchart of the training method of the click-through rate estimation model shown in FIG. 9, step S803 may include steps S8031-S8033:
在步骤S8031中,对于每种信息类型对应的初始权重矩阵,服务器基于包含信息类型的训练样本对初始权重矩阵进行参数调整,得到训练后的信息类型对应的权重矩阵。In step S8031, for the initial weight matrix corresponding to each information type, the server adjusts the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the trained information type.
权重矩阵中的各个模型参数可以在训练过程中进行调整。上述实施例中已经介绍了,相同信息类型的特征信息可以基于相同的权重矩阵确定嵌入向量,进而确定预估点击率。相对应的,在训练过程中,对于每种信息类型对应的初始权重矩阵,服务器可以获取通过该初始权重矩阵确定预估点击率的训练样本,以及对应的预估点击率,进而可以根据预估点击率与用户实际的点击情况,计算该初始权重矩阵中各个模型参数的梯度,根据梯度确定相应的修正值,对各个模型参数进行调整。训练结束后,得到的各个信息类型对应的权重矩阵即可用于对相应的信息确定嵌入向量。Each model parameter in the weight matrix can be adjusted during the training process. It has been introduced in the above embodiments that the feature information of the same information type can determine the embedding vector based on the same weight matrix, and then determine the estimated click rate. Correspondingly, during the training process, for the initial weight matrix corresponding to each type of information, the server can obtain training samples for determining the estimated click rate through the initial weight matrix, and the corresponding estimated click rate, which can then be estimated Click rate and actual user click, calculate the gradient of each model parameter in the initial weight matrix, determine the corresponding correction value according to the gradient, and adjust each model parameter. After the training, the obtained weight matrix corresponding to each information type can be used to determine the embedding vector for the corresponding information.
相比于基于多媒体资源的领域和用户的领域划分权重矩阵的方案,由于相同信息类型的信息均使用同一权重矩阵,可以充分利用包含该信息类型的训练样本对权重矩阵进行训练,使得该权重矩阵进行充分的学习。Compared with the scheme of dividing the weight matrix based on the field of multimedia resources and the field of users, since the information of the same information type uses the same weight matrix, the training matrix containing the information type can be fully utilized to train the weight matrix so that the weight matrix Do full learning.
在步骤S8032中,服务器基于至少一个训练样本对初始点击率预估网络进行参数调整,得到训练后的点击率预估网络。In step S8032, the server adjusts the parameters of the initial click-through rate estimation network based on at least one training sample to obtain the trained click-through rate estimation network.
一般来说,每个训练样本均可以通过点击率预估网络进行数据处理,因此,服务器可以获取每个训练样本以及对应的预估点击率,对初始点击率预估网络进行参数调整,具体处理如上所述,此处不再赘述。In general, each training sample can be processed through the click-through rate estimation network. Therefore, the server can obtain each training sample and the corresponding estimated click-through rate, and adjust the parameters of the initial click-through rate prediction network for specific processing. As mentioned above, it will not be repeated here.
在步骤S8033中,服务器基于训练后的至少一种信息类型对应的权重矩阵以及训练后的点击率预估网络,得到点击率预估模型。In step S8033, the server obtains a click rate prediction model based on the weight matrix corresponding to the at least one information type after training and the click rate prediction network after training.
当达到训练结束的条件(如达到预设训练次数或损失函数的值小于目标数值)时,服务器可以获取当前嵌入层中的各个权重矩阵,以及点击率预估网络等,构成点击率预估模型,并且可以将该点击率预估模型进行存储。当服务器需要对多媒体资源进行预测时,可以获取已存储的点击率预估模型进行处理。When the conditions for the end of training are reached (for example, the preset number of trainings is reached or the value of the loss function is less than the target value), the server can obtain each weight matrix in the current embedding layer, and the click-through rate estimation network, etc., to constitute a click-through rate estimation model , And the click-through rate estimation model can be stored. When the server needs to predict the multimedia resource, it can obtain the stored click rate prediction model for processing.
当然,在此之后,服务器还可以对存储的点击率预估模型再次进行训练,训练过程与上述过程同理。服务器不断对点击率预估模型进行更新,可以提高点击率预估模型的准确性。Of course, after this, the server can also train the stored click-through rate estimation model again. The training process is the same as the above process. The server continuously updates the click-through rate estimation model, which can improve the accuracy of the click-through rate estimation model.
在一种可能的实施方式中,上述在对每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,第一信息类型对应的学习率大于第二信息类型对应的学习率。In a possible implementation manner, in the above parameter adjustment process of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is smaller than the number of training samples containing the second information type , The learning rate corresponding to the first information type is greater than the learning rate corresponding to the second information type.
上述步骤S8031中,在根据梯度确定初始权重矩阵中各个模型参数的修正值时,服务器可以根据梯度对学习率进行调整,例如,学习率调整的方法可以是AdaGrad(Adaptive Gradient,自适应学习率)算法。In the above step S8031, when determining the correction value of each model parameter in the initial weight matrix according to the gradient, the server may adjust the learning rate according to the gradient, for example, the learning rate adjustment method may be AdaGrad (Adaptive Gradient, adaptive learning rate) algorithm.
一般来说,如果包含信息类型的训练样本较少,则确定的梯度较小,也即梯度变化较为平缓,服务器可以增大该信息类型对应的学习率,也即使得修正值的幅度增大,令权重矩阵得到更充分的梯度更新。通过上述方法,可以使得模型参数在特征稀疏的情况下也可以得到充分的学习,提高模型的准确率。In general, if there are fewer training samples containing information types, the determined gradient is smaller, that is, the gradient changes more smoothly, and the server can increase the learning rate corresponding to the information type, that is, the amplitude of the correction value increases. Make the weight matrix get fuller gradient update. Through the above method, the model parameters can be fully learned even when the features are sparse, and the accuracy of the model can be improved.
当然,上述调整学习率的方法也可以应用在上述步骤S8032中,使得点击率预估网络中的模型参数也可以根据梯度适应性调整,本实施例对点击率预估网络的参数调整方法不作限定。Of course, the above method for adjusting the learning rate can also be applied in the above step S8032, so that the model parameters in the click-through rate estimation network can also be adjusted adaptively according to the gradient. .
在一种可能的实施方式中,服务器对初始模型的训练目标可以是使得AUC(Area Under the ROC Curve,ROC曲线下方面积;ROC,Receiver Operating Characteristic,受试者工作特征)最大化。In a possible implementation, the training goal of the initial model by the server may be to maximize AUC (Area Under the ROC Curve, area under the ROC curve; ROC, Receiver Operating Characteristic, receiver operating characteristics).
如果将点击情况为已点击的训练样本称为第一训练样本,将点击情况为 未点击的训练样本称为第二训练样本,则AUC可以是指第一训练样本排在第二训练样本之前的概率。If the training sample with clicks is called the first training sample and the training sample with clicks is not called the second training sample, then AUC may refer to the first training sample before the second training sample Probability.
服务器通过初始模型对每个训练样本确定预估点击率后,可以按照预估点击率从大到小的顺序进行排列,进而可以根据排在所有第二训练样本之前的第一训练样本的数目,以及训练样本总数,确定AUC的数值。AUC越大,表明越多的第一训练样本排在所有第二训练样本之前,也即点击率预估模型的准确率越高。After the server determines the estimated click-through rate for each training sample through the initial model, it can be arranged in order of the estimated click-through rate from large to small, and then according to the number of first training samples ranked before all second training samples, And the total number of training samples to determine the value of AUC. The larger the AUC, the more the first training samples are ranked before all the second training samples, that is, the higher the accuracy of the click rate prediction model.
当然,服务器还可以基于其它方式确定AUC,例如可以基于训练样本的预估点击率建立ROC曲线后,利用求积分的方法计算ROC曲线下方的面积。本实施例对确定AUC的具体方式不作限定。Of course, the server can also determine the AUC based on other methods. For example, after the ROC curve is established based on the estimated click-through rate of the training sample, the area under the ROC curve is calculated by the integration method. This embodiment does not limit the specific method for determining the AUC.
实验表明,本实施例提供的方法能够使得AUC得到明显提升,也即通过本实施例的方法得到的点击率预估模型准确率提高。Experiments show that the method provided in this embodiment can significantly improve AUC, that is, the accuracy of the click rate prediction model obtained by the method in this embodiment is improved.
本实施例中,对于一个信息类型对应的权重矩阵,均可基于包含该信息类型的训练样本进行训练。由于没有基于多媒体资源的领域和用户的领域对同一信息类型的特征信息进行划分,因此可以充分利用训练样本,使得嵌入层的权重矩阵得到充分的学习,提高嵌入向量的代表性,进而提高点击率预估模型的准确率。In this embodiment, the weight matrix corresponding to an information type can be trained based on training samples containing the information type. Since there is no field based on multimedia resources and user's field to divide the feature information of the same information type, the training samples can be fully utilized, so that the weight matrix of the embedding layer can be fully learned, the representativeness of the embedding vector can be improved, and then the click-through rate can be improved. Estimate the accuracy of the model.
图10是根据一示例性实施例示出的一种多媒体资源预估点击率的确定装置框图。参照图10,该装置包括获取单元1010,调用单元1020和确定单元1030。Fig. 10 is a block diagram of a device for determining an estimated click rate of a multimedia resource according to an exemplary embodiment. Referring to FIG. 10, the device includes an acquisition unit 1010, a calling unit 1020, and a determination unit 1030.
该获取单元1010,被配置为获取用户的用户行为信息;The obtaining unit 1010 is configured to obtain user behavior information of the user;
该获取单元1010,还被配置为获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;The obtaining unit 1010 is further configured to obtain multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
该调用单元1020,被配置为调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;The calling unit 1020 is configured to call a click-through rate estimation model. The click-through rate estimation model includes an embedding layer and a click-through rate estimation network. The embedding layer includes a weight matrix corresponding to at least one information type. The rate estimation network is used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource;
该确定单元1030,被配置为将所述用户行为信息与所述多媒体属性信息 输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The determining unit 1030 is configured to input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
可选的,该确定单元1030,被配置为:Optionally, the determining unit 1030 is configured to:
对于每种信息类型,将所述用户行为信息和所述多媒体属性信息中属于所述信息类型的信息,输入所述嵌入层中所述信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input the information of the user behavior information and the multimedia attribute information that belong to the information type into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
将所述嵌入层输出的至少一个嵌入向量输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。Input at least one embedding vector output by the embedding layer into the click-through rate estimation model, and output the user's estimated click-through rate to the first multimedia resource.
可选的,所述的装置还包括训练单元,该训练单元被配置为:Optionally, the device further includes a training unit, the training unit is configured to:
获取所述点击率预估模型的初始模型;Acquiring the initial model of the click-through rate estimation model;
获取至少一个训练样本,所述训练样本包括第二多媒体资源的多媒体属性信息、用户浏览所述第二多媒体资源时的用户行为信息以及所述用户对所述第二多媒体资源的点击情况,所述点击情况包括已点击或未点击;Obtaining at least one training sample, the training sample including multimedia attribute information of the second multimedia resource, user behavior information when the user browses the second multimedia resource, and the user's response to the second multimedia resource Clicks, including clicked or not clicked;
基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型。Training the initial model based on the at least one training sample to obtain the click-through rate prediction model.
可选的,所述初始模型包括初始嵌入层以及初始点击率预估网络,所述初始嵌入层包括至少一种信息类型对应的初始权重矩阵;Optionally, the initial model includes an initial embedding layer and an initial click rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
该训练单元被配置为:The training unit is configured as:
对于每种信息类型对应的初始权重矩阵,基于包含所述信息类型的训练样本对所述初始权重矩阵进行参数调整,得到训练后的所述信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, adjust the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the information type after training;
基于所述至少一个训练样本对所述初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on the at least one training sample to obtain a trained click-through rate estimation network;
基于训练后的至少一种信息类型对应的权重矩阵以及所述训练后的点击率预估网络,得到所述点击率预估模型。Based on the weight matrix corresponding to at least one information type after training and the trained click-through rate estimation network, the click-through rate estimation model is obtained.
可选的,在对所述每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,所述第一信息类型对应的学习率大于所述第二信息类型对应的学习率。Optionally, during the parameter adjustment of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is less than the number of training samples containing the second information type, the first The learning rate corresponding to one information type is greater than the learning rate corresponding to the second information type.
可选的,所述信息类型包括作品标识、作者标识和/或风格标识。Optionally, the information type includes work identification, author identification, and / or style identification.
可选的,所述用户行为信息包括点击历史信息、关注信息和/或喜爱信息,所述点击历史信息用于表示用户点击的多媒体资源的多媒体属性信息,所述关注信息用于表示用户关注的多媒体资源的多媒体属性信息,所述喜爱信息用于表示用户喜爱的多媒体资源的多媒体属性信息。Optionally, the user behavior information includes click history information, attention information and / or favorite information, the click history information is used to indicate multimedia attribute information of the multimedia resource clicked by the user, and the attention information is used to indicate the user's attention Multimedia attribute information of the multimedia resource, the favorite information is used to represent multimedia attribute information of the user's favorite multimedia resource.
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关该方法的实施例中进行了详细描述,此处将不做详细阐述说明。Regarding the device in the above embodiment, the specific manner in which each module performs operations has been described in detail in the embodiment related to the method, and will not be elaborated here.
本实施例中,在用户行为信息和多媒体属性信息中,相同信息类型的信息均可通过嵌入层中相同的权重矩阵确定嵌入向量,可以提高嵌入向量的代表性。从而,在基于本实施例的方法对多媒体资源的点击率进行预估时,提高点击率预估模型的准确率。In this embodiment, in the user behavior information and multimedia attribute information, the information of the same information type can be determined by the same weight matrix in the embedding layer, which can improve the representativeness of the embedding vector. Therefore, when the click rate of the multimedia resource is predicted based on the method of this embodiment, the accuracy of the click rate prediction model is improved.
图11是根据一示例性实施例示出的一种用于确定多媒体资源预估点击率的装置1100的框图。例如,装置1100可以被提供为一服务器。参照图11,装置1100包括处理组件1122,其进一步包括一个或多个处理器,以及由存储器1132所代表的存储器资源,用于存储可由处理组件1122的执行的指令,例如应用程序。存储器1132中存储的应用程序可以包括一个或一个以上的每一个对应于一组指令的模块。此外,处理组件1122被配置为执行指令,以执行上述多媒体资源预估点击率的确定方法。Fig. 11 is a block diagram of an apparatus 1100 for determining an estimated click rate of a multimedia resource according to an exemplary embodiment. For example, the device 1100 may be provided as a server. Referring to FIG. 11, the device 1100 includes a processing component 1122, which further includes one or more processors, and memory resources represented by the memory 1132, for storing instructions executable by the processing component 1122, such as application programs. The application program stored in the memory 1132 may include one or more modules each corresponding to a set of instructions. In addition, the processing component 1122 is configured to execute an instruction to execute the above method for determining the estimated click rate of the multimedia resource.
装置1100还可以包括一个电源组件1126被配置为执行装置1100的电源管理,一个有线或无线网络接口1150被配置为将装置1100连接到网络,和一个输入输出(I/O)接口1158。装置1100可以操作基于存储在存储器1132的操作系统,例如Windows ServerTM,Mac OS XTM,UnixTM,LinuxTM,FreeBSDTM或类似。The device 1100 may also include a power component 1126 configured to perform power management of the device 1100, a wired or wireless network interface 1150 configured to connect the device 1100 to the network, and an input output (I / O) interface 1158. The device 1100 can operate an operating system based on the memory 1132, such as Windows ServerTM, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM or the like.
在示例性实施例中,还提供了一种非临时性计算机可读存储介质,例如包括指令的存储器,上述指令可由服务器中的处理器执行以完成上述多媒体资源预估点击率的确定方法。例如,所述计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, a non-transitory computer-readable storage medium is also provided, such as a memory including instructions, which can be executed by a processor in the server to complete the method for determining the estimated click rate of the multimedia resource. For example, the computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, or the like.
在示例性实施例中,还提供了一种应用程序/计算机程序产品,包括一条或多条指令,该一条或多条指令可以由服务器的处理器执行,以完成上述多 媒体资源预估点击率的确定方法。In an exemplary embodiment, an application program / computer program product is also provided, which includes one or more instructions, and the one or more instructions may be executed by a processor of the server to complete the above-mentioned estimated click rate of multimedia resources Determine the method.
本领域技术人员在考虑说明书及实践这里公开的内容后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。Those skilled in the art will easily think of other embodiments of the present disclosure after considering the description and practicing the contents disclosed herein. This application is intended to cover any variations, uses, or adaptive changes of the present disclosure that follow the general principles of the present disclosure and include common general knowledge or customary technical means in the technical field not disclosed in the present disclosure . The description and examples are to be considered exemplary only, and the true scope and spirit of this disclosure are pointed out by the following claims.
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。It should be understood that the present disclosure is not limited to the precise structure that has been described above and shown in the drawings, and various modifications and changes can be made without departing from the scope thereof. The scope of the present disclosure is limited only by the appended claims.

Claims (16)

  1. 一种多媒体资源预估点击率的确定方法,其特征在于,包括:A method for determining the estimated click rate of multimedia resources, which includes:
    获取用户的用户行为信息;Obtain user's user behavior information;
    获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
    调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
    将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
  2. 根据权利要求1所述的方法,其特征在于,所述将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率,包括:The method according to claim 1, wherein the user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's information about the first multimedia resource is output Estimated CTR, including:
    对于每种信息类型,将所述用户行为信息和所述多媒体属性信息中属于所述信息类型的信息,输入所述嵌入层中所述信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input the information of the user behavior information and the multimedia attribute information that belong to the information type into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
    将所述嵌入层输出的至少一个嵌入向量输入所述点击率预估网络,输出所述用户对所述第一多媒体资源的预估点击率。Input at least one embedding vector output by the embedding layer into the click-through rate estimation network, and output the user's estimated click-through rate to the first multimedia resource.
  3. 根据权利要求1所述的方法,其特征在于,所述点击率预估模型的训练方法包括:The method according to claim 1, wherein the training method of the click-through rate prediction model comprises:
    获取所述点击率预估模型的初始模型;Acquiring the initial model of the click-through rate estimation model;
    获取至少一个训练样本,所述训练样本包括第二多媒体资源的多媒体属性信息、样本用户浏览所述第二多媒体资源时的用户行为信息以及所述样本用户对所述第二多媒体资源的点击情况,所述点击情况包括已点击或未点击;Obtaining at least one training sample, the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
    基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型。Training the initial model based on the at least one training sample to obtain the click-through rate prediction model.
  4. 根据权利要求3所述的方法,其特征在于,所述初始模型包括初始嵌入层以及初始点击率预估网络,所述初始嵌入层包括至少一种信息类型对应的 初始权重矩阵;The method according to claim 3, wherein the initial model includes an initial embedding layer and an initial click-through rate estimation network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
    所述基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型,包括:The training the initial model based on the at least one training sample to obtain the estimated click-through rate model includes:
    对于每种信息类型对应的初始权重矩阵,基于包含所述信息类型的训练样本对所述初始权重矩阵进行参数调整,得到训练后的所述信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, adjust the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the information type after training;
    基于所述至少一个训练样本对所述初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on the at least one training sample to obtain a trained click-through rate estimation network;
    基于训练后的至少一种信息类型对应的权重矩阵以及所述训练后的点击率预估网络,得到所述点击率预估模型。Based on the weight matrix corresponding to at least one information type after training and the trained click-through rate estimation network, the click-through rate estimation model is obtained.
  5. 根据权利要求4所述的方法,其特征在于,在对所述每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数目小于包含第二信息类型的训练样本的数目时,所述第一信息类型对应的学习率大于所述第二信息类型对应的学习率。The method according to claim 4, wherein during the parameter adjustment of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is smaller than that containing the second information type When the number of training samples, the learning rate corresponding to the first information type is greater than the learning rate corresponding to the second information type.
  6. 根据权利要求1-5任一所述的方法,其特征在于,所述信息类型包括作品标识、作者标识和/或风格标识。The method according to any one of claims 1-5, wherein the information type includes a work identification, an author identification, and / or a style identification.
  7. 根据权利要求1-5任一所述的方法,其特征在于,所述用户行为信息包括点击历史信息、关注信息和/或喜爱信息,所述点击历史信息用于表示用户点击的多媒体资源的多媒体属性信息,所述关注信息用于表示用户关注的多媒体资源的多媒体属性信息,所述喜爱信息用于表示用户喜爱的多媒体资源的多媒体属性信息。The method according to any one of claims 1-5, wherein the user behavior information includes click history information, attention information, and / or favorite information, and the click history information is used to represent multimedia of a multimedia resource clicked by the user Attribute information, the attention information is used to represent multimedia attribute information of the multimedia resource that the user focuses on, and the favorite information is used to represent multimedia attribute information of the multimedia resource that the user prefers.
  8. 一种多媒体资源预估点击率的确定装置,其特征在于,包括:A device for determining the estimated click rate of multimedia resources is characterized by comprising:
    获取单元,被配置为获取用户的用户行为信息;The obtaining unit is configured to obtain user behavior information of the user;
    所述获取单元,还被配置为获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;The obtaining unit is further configured to obtain multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
    调用单元,被配置为调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;The calling unit is configured to call a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction model The estimation network is used to take the embedding vector output by the embedding layer as an input and output the estimated click rate of the multimedia resource;
    确定单元,被配置为将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The determining unit is configured to input the user behavior information and the multimedia attribute information into the click-through rate estimation model, and output the estimated click-through rate of the user to the first multimedia resource.
  9. 根据权利要求8所述的装置,其特征在于,所述确定单元,被配置为:The apparatus according to claim 8, wherein the determining unit is configured to:
    对于每种信息类型,将所述用户行为信息和所述多媒体属性信息中属于所述信息类型的信息,输入所述嵌入层中所述信息类型对应的权重矩阵,输出至少一个嵌入向量;For each information type, input the information of the user behavior information and the multimedia attribute information that belong to the information type into the weight matrix corresponding to the information type in the embedding layer, and output at least one embedding vector;
    将所述嵌入层输出的至少一个嵌入向量输入所述点击率预估网络,输出所述用户对所述第一多媒体资源的预估点击率。Input at least one embedding vector output by the embedding layer into the click-through rate estimation network, and output the user's estimated click-through rate to the first multimedia resource.
  10. 根据权利要求8所述的装置,其特征在于,所述装置还包括训练单元,所述训练单元被配置为:The apparatus according to claim 8, wherein the apparatus further comprises a training unit, the training unit is configured to:
    获取所述点击率预估模型的初始模型;Acquiring the initial model of the click-through rate estimation model;
    获取至少一个训练样本,所述训练样本包括第二多媒体资源的多媒体属性信息、样本用户浏览所述第二多媒体资源时的用户行为信息以及所述样本用户对所述第二多媒体资源的点击情况,所述点击情况包括已点击或未点击;Obtaining at least one training sample, the training sample including multimedia attribute information of a second multimedia resource, user behavior information of a sample user when browsing the second multimedia resource, and the sample user's response to the second multimedia Clicks on physical resources, the clicks include clicked or not clicked;
    基于所述至少一个训练样本对所述初始模型进行训练,得到所述点击率预估模型。Training the initial model based on the at least one training sample to obtain the click-through rate prediction model.
  11. 根据权利要求10所述的装置,其特征在于,所述初始模型包括初始嵌入层以及初始点击率预估网络,所述初始嵌入层包括至少一种信息类型对应的初始权重矩阵;The apparatus according to claim 10, wherein the initial model includes an initial embedding layer and an initial click-through rate prediction network, and the initial embedding layer includes an initial weight matrix corresponding to at least one information type;
    所述训练单元被配置为:The training unit is configured to:
    对于每种信息类型对应的初始权重矩阵,基于包含所述信息类型的训练样本对所述初始权重矩阵进行参数调整,得到训练后的所述信息类型对应的权重矩阵;For the initial weight matrix corresponding to each information type, adjust the parameters of the initial weight matrix based on the training samples containing the information type to obtain the weight matrix corresponding to the information type after training;
    基于所述至少一个训练样本对所述初始点击率预估网络进行参数调整,得到训练后的点击率预估网络;Adjust the parameters of the initial click-through rate estimation network based on the at least one training sample to obtain a trained click-through rate estimation network;
    基于训练后的至少一种信息类型对应的权重矩阵以及所述训练后的点击率预估网络,得到所述点击率预估模型。Based on the weight matrix corresponding to at least one information type after training and the trained click-through rate estimation network, the click-through rate estimation model is obtained.
  12. 根据权利要求11所述的装置,其特征在于,在对所述每种信息类型对应的初始权重矩阵的参数调整过程中,当包含第一信息类型的训练样本的数 目小于包含第二信息类型的训练样本的数目时,所述第一信息类型对应的学习率大于所述第二信息类型对应的学习率。The apparatus according to claim 11, wherein during the parameter adjustment of the initial weight matrix corresponding to each information type, when the number of training samples containing the first information type is smaller than that containing the second information type When the number of training samples, the learning rate corresponding to the first information type is greater than the learning rate corresponding to the second information type.
  13. 根据权利要求8-12任一所述的装置,其特征在于,所述信息类型包括作品标识、作者标识和/或风格标识。The device according to any one of claims 8-12, wherein the information type includes a work identification, an author identification, and / or a style identification.
  14. 根据权利要求8-12任一所述的装置,其特征在于,所述用户行为信息包括点击历史信息、关注信息和/或喜爱信息,所述点击历史信息用于表示用户点击的多媒体资源的多媒体属性信息,所述关注信息用于表示用户关注的多媒体资源的多媒体属性信息,所述喜爱信息用于表示用户喜爱的多媒体资源的多媒体属性信息。The device according to any one of claims 8-12, wherein the user behavior information includes click history information, attention information, and / or favorite information, and the click history information is used to represent multimedia of a multimedia resource clicked by the user Attribute information, the attention information is used to represent multimedia attribute information of the multimedia resource that the user focuses on, and the favorite information is used to represent multimedia attribute information of the multimedia resource that the user prefers.
  15. 一种服务器,其特征在于,包括:A server, characterized in that it includes:
    一个或多个处理器;One or more processors;
    用于存储一个或多个处理器可执行指令的一个或多个存储器;One or more memories for storing one or more processor executable instructions;
    其中,所述一个或多个处理器被配置为:Wherein, the one or more processors are configured as:
    获取用户的用户行为信息;Obtain user's user behavior information;
    获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
    调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
    将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
  16. 一种非临时性计算机可读存储介质,其特征在于,当所述存储介质中的指令由服务器的处理器执行时,使得服务器能够执行一种多媒体资源预估点击率的确定方法,所述方法包括:A non-transitory computer-readable storage medium, characterized in that when instructions in the storage medium are executed by a processor of a server, the server is enabled to perform a method for determining a multimedia resource estimated click rate, the method include:
    获取用户的用户行为信息;Obtain user's user behavior information;
    获取第一多媒体资源的多媒体属性信息,所述第一多媒体资源为待推荐给所述用户的多媒体资源;Acquiring multimedia attribute information of a first multimedia resource, where the first multimedia resource is a multimedia resource to be recommended to the user;
    调用点击率预估模型,所述点击率预估模型包括嵌入层和点击率预估网 络,所述嵌入层包括至少一种信息类型对应的权重矩阵,所述点击率预估网络用于将所述嵌入层输出的嵌入向量作为输入,输出多媒体资源的预估点击率;Calling a click-through rate prediction model, the click-through rate prediction model includes an embedding layer and a click-through rate prediction network, the embedding layer includes a weight matrix corresponding to at least one information type, and the click-through rate prediction network is used to The embedding vector output from the embedding layer is used as an input to output the estimated click rate of the multimedia resource;
    将所述用户行为信息与所述多媒体属性信息输入所述点击率预估模型,输出所述用户对所述第一多媒体资源的预估点击率。The user behavior information and the multimedia attribute information are input to the click-through rate estimation model, and the user's estimated click-through rate for the first multimedia resource is output.
PCT/CN2019/105452 2018-11-06 2019-09-11 Multimedia resource estimated click through rate determination method and apparatus, and server WO2020093781A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201811314076.6 2018-11-06
CN201811314076.6A CN109408724B (en) 2018-11-06 2018-11-06 Method and device for determining pre-estimated click rate of multimedia resource and server

Publications (1)

Publication Number Publication Date
WO2020093781A1 true WO2020093781A1 (en) 2020-05-14

Family

ID=65471674

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2019/105452 WO2020093781A1 (en) 2018-11-06 2019-09-11 Multimedia resource estimated click through rate determination method and apparatus, and server

Country Status (2)

Country Link
CN (1) CN109408724B (en)
WO (1) WO2020093781A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111708944A (en) * 2020-06-17 2020-09-25 北京达佳互联信息技术有限公司 Multimedia resource identification method, device, equipment and storage medium
CN111737573A (en) * 2020-06-17 2020-10-02 北京三快在线科技有限公司 Resource recommendation method, device, equipment and storage medium
CN112364184A (en) * 2020-11-12 2021-02-12 北京达佳互联信息技术有限公司 Method, device, server and storage medium for ordering multimedia data
CN112732953A (en) * 2020-12-30 2021-04-30 上海众源网络有限公司 Recommendation method, sample analysis method, device, electronic equipment and storage medium
CN112839247A (en) * 2021-01-04 2021-05-25 聚好看科技股份有限公司 Content display method, display device and cloud server
CN113836391A (en) * 2020-06-24 2021-12-24 北京达佳互联信息技术有限公司 Resource recommendation method and device, computer equipment and storage medium
CN114548076A (en) * 2022-02-28 2022-05-27 深圳市和讯华谷信息技术有限公司 Intelligent scoring method for content file and related device

Families Citing this family (24)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109408724B (en) * 2018-11-06 2020-10-02 北京达佳互联信息技术有限公司 Method and device for determining pre-estimated click rate of multimedia resource and server
CN110297970B (en) * 2019-05-28 2021-04-23 北京达佳互联信息技术有限公司 Information recommendation model training method and device
CN112150182B (en) * 2019-06-28 2023-08-29 腾讯科技(深圳)有限公司 Multimedia file pushing method and device, storage medium and electronic device
CN110472071A (en) * 2019-07-03 2019-11-19 中移(杭州)信息技术有限公司 Multimedia file recommendation method, device, the network equipment and storage medium
CN112307313B (en) * 2019-07-31 2024-04-16 北京达佳互联信息技术有限公司 Information recommendation method, device and storage medium
CN110502715B (en) * 2019-08-28 2023-07-14 腾讯科技(深圳)有限公司 Click probability prediction method and device
CN110825956A (en) * 2019-09-17 2020-02-21 中国平安人寿保险股份有限公司 Information flow recommendation method and device, computer equipment and storage medium
CN110909182B (en) * 2019-11-29 2023-05-09 北京达佳互联信息技术有限公司 Multimedia resource searching method, device, computer equipment and storage medium
CN111191165A (en) * 2019-12-05 2020-05-22 腾讯科技(深圳)有限公司 Data processing method, device, equipment and computer readable storage medium
CN112925924A (en) * 2019-12-05 2021-06-08 北京达佳互联信息技术有限公司 Multimedia file recommendation method and device, electronic equipment and storage medium
CN111160959B (en) * 2019-12-20 2023-06-27 深圳前海微众银行股份有限公司 User click conversion prediction method and device
CN111340522B (en) * 2019-12-30 2024-03-08 支付宝实验室(新加坡)有限公司 Resource recommendation method, device, server and storage medium
CN113495966B (en) * 2020-03-18 2023-06-23 北京达佳互联信息技术有限公司 Interactive operation information determining method and device and video recommendation system
CN113742567B (en) * 2020-05-29 2023-08-22 北京达佳互联信息技术有限公司 Recommendation method and device for multimedia resources, electronic equipment and storage medium
CN114077701A (en) * 2020-08-13 2022-02-22 北京达佳互联信息技术有限公司 Method and device for determining resource information, computer equipment and storage medium
CN112256918B (en) * 2020-11-17 2021-07-06 中国计量大学 Short video click rate prediction method based on multi-mode dynamic routing
CN112395504B (en) * 2020-12-01 2021-11-23 中国计量大学 Short video click rate prediction method based on sequence capsule network
CN112950266B (en) * 2021-02-26 2024-05-03 北京达佳互联信息技术有限公司 Media resource display method and device
CN113254679B (en) * 2021-06-28 2021-09-28 北京达佳互联信息技术有限公司 Multimedia resource recommendation method and device, electronic equipment and storage medium
CN113722588B (en) * 2021-08-12 2023-09-05 北京达佳互联信息技术有限公司 Resource recommendation method and device and electronic equipment
CN113516522B (en) * 2021-09-14 2022-03-04 腾讯科技(深圳)有限公司 Media resource recommendation method, and training method and device of multi-target fusion model
CN114547417A (en) * 2022-02-25 2022-05-27 北京百度网讯科技有限公司 Media resource ordering method and electronic equipment
CN114677168A (en) * 2022-03-11 2022-06-28 百果园技术(新加坡)有限公司 Resource recommendation method, device, equipment and medium
CN114780843A (en) * 2022-04-20 2022-07-22 北京字节跳动网络技术有限公司 Model training method, medium resource recommendation method, device, medium and equipment

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105183772A (en) * 2015-08-07 2015-12-23 百度在线网络技术(北京)有限公司 Release information click rate estimation method and apparatus
CN105678335A (en) * 2016-01-08 2016-06-15 车智互联(北京)科技有限公司 Click rate pre-estimation method, device and calculating equipment
CN105975641A (en) * 2016-07-15 2016-09-28 合网络技术(北京)有限公司 Video recommendation method ad device
CN107463580A (en) * 2016-06-06 2017-12-12 腾讯科技(深圳)有限公司 Train clicking rate prediction model method and apparatus, clicking rate predictor method and device
CN109408724A (en) * 2018-11-06 2019-03-01 北京达佳互联信息技术有限公司 Multimedia resource estimates the determination method, apparatus and server of clicking rate

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102663617A (en) * 2012-03-20 2012-09-12 亿赞普(北京)科技有限公司 Method and system for prediction of advertisement clicking rate
CN104462594A (en) * 2014-12-29 2015-03-25 北京奇虎科技有限公司 Method and device for providing user personalized resource message pushing
CN104951965B (en) * 2015-06-26 2017-04-19 深圳市腾讯计算机系统有限公司 Advertisement delivery method and device
CN105095964B (en) * 2015-08-17 2017-10-20 杭州朗和科技有限公司 A kind of data processing method and device
CN106604068B (en) * 2016-12-30 2019-11-05 中广热点云科技有限公司 A kind of method and its system of more new media program
CN108304441B (en) * 2017-11-14 2022-04-26 腾讯科技(深圳)有限公司 Network resource recommendation method and device, electronic equipment, server and storage medium
CN108307207A (en) * 2017-12-28 2018-07-20 北京达佳互联信息技术有限公司 A kind of video pushing method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105183772A (en) * 2015-08-07 2015-12-23 百度在线网络技术(北京)有限公司 Release information click rate estimation method and apparatus
CN105678335A (en) * 2016-01-08 2016-06-15 车智互联(北京)科技有限公司 Click rate pre-estimation method, device and calculating equipment
CN107463580A (en) * 2016-06-06 2017-12-12 腾讯科技(深圳)有限公司 Train clicking rate prediction model method and apparatus, clicking rate predictor method and device
CN105975641A (en) * 2016-07-15 2016-09-28 合网络技术(北京)有限公司 Video recommendation method ad device
CN109408724A (en) * 2018-11-06 2019-03-01 北京达佳互联信息技术有限公司 Multimedia resource estimates the determination method, apparatus and server of clicking rate

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111708944A (en) * 2020-06-17 2020-09-25 北京达佳互联信息技术有限公司 Multimedia resource identification method, device, equipment and storage medium
CN111737573A (en) * 2020-06-17 2020-10-02 北京三快在线科技有限公司 Resource recommendation method, device, equipment and storage medium
CN113836391A (en) * 2020-06-24 2021-12-24 北京达佳互联信息技术有限公司 Resource recommendation method and device, computer equipment and storage medium
CN113836391B (en) * 2020-06-24 2023-07-11 北京达佳互联信息技术有限公司 Resource recommendation method, device, computer equipment and storage medium
CN112364184A (en) * 2020-11-12 2021-02-12 北京达佳互联信息技术有限公司 Method, device, server and storage medium for ordering multimedia data
CN112364184B (en) * 2020-11-12 2024-04-30 北京达佳互联信息技术有限公司 Method, device, server and storage medium for ordering multimedia data
CN112732953A (en) * 2020-12-30 2021-04-30 上海众源网络有限公司 Recommendation method, sample analysis method, device, electronic equipment and storage medium
CN112732953B (en) * 2020-12-30 2024-04-12 上海众源网络有限公司 Recommendation method, sample analysis device, electronic equipment and storage medium
CN112839247A (en) * 2021-01-04 2021-05-25 聚好看科技股份有限公司 Content display method, display device and cloud server
CN114548076A (en) * 2022-02-28 2022-05-27 深圳市和讯华谷信息技术有限公司 Intelligent scoring method for content file and related device

Also Published As

Publication number Publication date
CN109408724B (en) 2020-10-02
CN109408724A (en) 2019-03-01

Similar Documents

Publication Publication Date Title
WO2020093781A1 (en) Multimedia resource estimated click through rate determination method and apparatus, and server
US11727048B2 (en) Methods, systems, and media for presenting recommended content based on social cues
US9959506B1 (en) Predictive content retrieval using device movements
CN109547814B (en) Video recommendation method and device, server and storage medium
US9838743B2 (en) Techniques for context aware video recommendation
JP6767499B2 (en) Optimizing content delivery using models
US10733249B1 (en) Machine learning system for data selection
CN109783656B (en) Recommendation method and system of audio and video data, server and storage medium
WO2022165664A1 (en) Live broadcast interface display method and apparatus, terminal, server, and storage medium
EP4092545A1 (en) Content recommendation method and device
WO2020135189A1 (en) Product recommendation method, product recommendation system and storage medium
CN110209975B (en) Method, apparatus, device and storage medium for providing object
CN112241327A (en) Shared information processing method and device, storage medium and electronic equipment
CN112104505B (en) Application recommendation method, device, server and computer readable storage medium
EP3256966B1 (en) Relevance table aggregation in a database system
TWI602431B (en) Method and device for transmitting information
CN110933504B (en) Video recommendation method, device, server and storage medium
CN111988642B (en) Method, device, server and storage medium for recommending videos
CN107222386A (en) A kind of message back method and terminal
US20210234941A1 (en) Wireless Device, Computer Server Node, and Methods Thereof
US20160019400A1 (en) Recommendation system for protecting user privacy
CN113935251A (en) User behavior prediction model generation method and device and user behavior prediction method and device
CN117271043B (en) Data display method based on intelligent wearable equipment
US11870833B2 (en) Methods and systems for encoder parameter setting optimization
CN116541588A (en) Content recommendation method, device, electronic equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 19882829

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 19882829

Country of ref document: EP

Kind code of ref document: A1