CN115146717A

CN115146717A - Recommendation method, device, electronic device and storage medium based on multi-task model

Info

Publication number: CN115146717A
Application number: CN202210723718.8A
Authority: CN
Inventors: 汪利伟; 仲籽彦; 魏丫丫; 金伟德; 洪迪; 刘健
Original assignee: China Telecom Corp Ltd
Current assignee: China Telecom Corp Ltd
Priority date: 2022-06-24
Filing date: 2022-06-24
Publication date: 2022-10-04

Abstract

The embodiment of the invention provides a recommendation method, a recommendation device, electronic equipment and a storage medium based on a multitask model, which comprise the following steps: acquiring user behavior data from the log data according to the mobile phone number of the user; screening feature data from the user behavior data, wherein the feature data comprises basic features, interactive features and time slice features; adopting the characteristic data to carry out a prediction task aiming at voice use and a prediction task aiming at flow use through a multi-task model to obtain a prediction result; and determining the voice package and the flow package recommended for the user according to the prediction result. In the embodiment of the invention, the two tasks of the flow and the voice can be respectively predicted by using the multi-task model, so that the prediction efficiency is improved, the package of the most suitable user can be more accurately matched, the service is furthest realized, and the user is kept.

Description

Recommendation method, device, electronic device and storage medium based on multi-task model

技术领域technical field

本发明涉及线上营销技术领域，特别是涉及一种基于多任务模型的推荐方法、一种基于多任务模型的推荐装置、一种电子设备和一种计算机可读存储介质。The present invention relates to the technical field of online marketing, and in particular, to a recommendation method based on a multi-task model, a recommendation device based on the multi-task model, an electronic device and a computer-readable storage medium.

背景技术Background technique

由于用户入网时选择的套餐往往与后续使用习惯并不匹配，导致用户粘性不高，从而使得用户流失。并且，随着5G时代的到来，越来越多的用户需要将手机卡从4G更换至5G，因此，为了在用户携号转网时，能够留存更多的用户，如何跟用户推荐更适合的套餐，以增加用户对产品的信任和用户的粘度成为亟待解决的问题。Because the packages selected by users when they access the network often do not match their subsequent usage habits, user stickiness is not high, and users are lost. In addition, with the advent of the 5G era, more and more users need to change their mobile phone cards from 4G to 5G. Therefore, in order to retain more users when users port numbers to the Internet, how to recommend more suitable ones to users? Packages to increase users' trust in products and user stickiness have become an urgent problem to be solved.

发明内容SUMMARY OF THE INVENTION

鉴于上述问题，提出了本发明实施例以便提供一种克服上述问题或者至少部分地解决上述问题的一种基于多任务模型的推荐方法、一种基于多任务模型的推荐装置、一种电子设备和一种计算机可读存储介质。In view of the above problems, the embodiments of the present invention are proposed to provide a multi-task model-based recommendation method, a multi-task model-based recommendation device, an electronic device and A computer-readable storage medium.

为了解决上述问题，本发明实施例公开了一种基于多任务模型的推荐方法，所述方法包括：In order to solve the above problem, an embodiment of the present invention discloses a recommendation method based on a multi-task model, and the method includes:

根据用户手机号，从日志数据中获取用户行为数据；Obtain user behavior data from log data according to the user's mobile phone number;

从所述用户行为数据中筛选特征数据，所述特征数据包括基础特征、交互特征以及时间片特征；Screening feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

采用所述特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；Using the feature data, a multi-task model is used to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result;

根据所述预测结果，确定为所述用户推荐的语音套餐和流量套餐。According to the prediction result, a voice package and a data package recommended for the user are determined.

可选地，所述根据所述预测结果，确定为所述用户推荐的语音套餐和流量套餐，包括：Optionally, determining the voice package and data package recommended for the user according to the prediction result, including:

若所述预测结果为流量和语音时长，则根据预先设置的映射关系，将流量和语音时长映射为语音套餐和流量套餐；If the predicted result is traffic and voice duration, then map the traffic and voice duration to voice packages and traffic packages according to a preset mapping relationship;

根据所述语音套餐和所述流量套餐，组合成基础套餐，以推荐给所述用户。According to the voice package and the data package, a basic package is combined to be recommended to the user.

可选地，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述采用所述特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果，包括：Optionally, the multi-task model includes: two gate networks, multiple expert networks and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the use of the feature data , perform prediction tasks for speech usage and prediction tasks for traffic usage through a multi-task model, and obtain prediction results, including:

通过每个所述门网络对所述特征数据进行处理，以获取每个所述门网络对应的多个专家网络的权重；Process the feature data through each of the gate networks to obtain the weights of multiple expert networks corresponding to each of the gate networks;

通过每个所述门网络对应的多个专家网络，分别对所述特征数据进行特征提取，以得到多个第一特征；Perform feature extraction on the feature data through a plurality of expert networks corresponding to each of the gate networks to obtain a plurality of first features;

根据每个门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征，进行加权求和，并通过每个所述门网络对应的Tower网络得到所述预测结果。According to the weights of multiple expert networks corresponding to each gate network, weighted summation is performed on multiple first features corresponding to each gate network, and the prediction result is obtained through the Tower network corresponding to each gate network.

可选地，所述通过每个门网络对所述特征数据进行处理，以获取每个门网络对应的多个专家网络的权重，包括：Optionally, the feature data is processed through each gate network to obtain the weights of multiple expert networks corresponding to each gate network, including:

通过每个所述门网络对所述特征数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；Perform feature extraction on the feature data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

对多个所述专家网络被每个所述门网络选择的概率进行归一化处理，得到每个所述门网络对应的多个专家网络的权重。Normalize the probability that the plurality of expert networks are selected by each of the gate networks to obtain the weights of the multiple expert networks corresponding to each of the gate networks.

可选地，通过如下方式训练所述多任务模型：Optionally, the multi-task model is trained as follows:

获取特征样本数据，所述特征样本数据包括基础特征样本、交互特征样本以及时间片特征样本；acquiring feature sample data, where the feature sample data includes basic feature samples, interactive feature samples, and time slice feature samples;

将所述特征样本数据作为所述多任务模型的输入；using the feature sample data as the input of the multi-task model;

在所述多任务模型中，采用所述特征样本数据进行针对语音使用的预测任务和针对流量使用的预测任务，以得到预测结果；In the multi-task model, the feature sample data is used to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result;

根据所述预测结果，对所述多任务模型进行训练。According to the prediction result, the multi-task model is trained.

可选地，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述在所述多任务模型中，采用所述特征样本数据进行针对语音使用的预测任务和针对流量使用的预测任务，以得到预测结果，包括：Optionally, the multitasking model includes: two gate networks, multiple expert networks, and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; In the model, the feature sample data is used to perform prediction tasks for voice usage and prediction tasks for traffic usage to obtain prediction results, including:

通过每个所述门网络对所述特征样本数据进行处理，以获取每个所述门网络对应的多个所述专家网络的权重；Process the feature sample data through each of the gate networks to obtain the weights of a plurality of the expert networks corresponding to each of the gate networks;

通过每个所述门网络对应的多个专家网络，分别对所述特征样本数据进行特征提取，以得到多个第一特征样本；Perform feature extraction on the feature sample data through a plurality of expert networks corresponding to each of the gate networks to obtain a plurality of first feature samples;

根据每个所述门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征样本，进行加权求和，并通过每个所述门网络对应的Tower网络，以得到流量和语音时长的预测结果。According to the weights of multiple expert networks corresponding to each gate network, weighted summation is performed on multiple first feature samples corresponding to each gate network, and the Tower network corresponding to each gate network is used to obtain Predicted results of traffic and voice duration.

可选地，所述通过每个所述门网络对所述特征样本数据进行处理，以获取每个所述门网络对应的多个所述专家网络的权重，包括：Optionally, the processing of the feature sample data through each of the gate networks to obtain the weights of the multiple expert networks corresponding to each of the gate networks, including:

通过每个所述门网络对所述特征样本数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；Perform feature extraction on the feature sample data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

本发明实施例还公开了一种基于多任务模型的推荐装置，所述装置包括：The embodiment of the present invention also discloses a recommendation device based on a multi-task model, the device comprising:

获取数据模块，用于根据用户手机号，从日志数据中获取用户行为数据；The data acquisition module is used to acquire user behavior data from the log data according to the user's mobile phone number;

筛选特征模块，用于从所述用户行为数据中筛选特征数据，所述特征数据包括基础特征、交互特征以及时间片特征；a screening feature module, configured to screen feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

特征预测模块，用于采用所述特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；A feature prediction module, configured to use the feature data to perform a prediction task for voice usage and a prediction task for traffic usage through a multi-task model to obtain a prediction result;

套餐确定模块，用于根据所述预测结果，确定为所述用户推荐的语音套餐和流量套餐。A package determination module, configured to determine a voice package and a traffic package recommended for the user according to the prediction result.

可选地，所述套餐确定模块包括：Optionally, the package determination module includes:

套餐映射子模块，用于若所述预测结果为流量和语音时长，则根据预先设置的映射关系，将流量和语音时长映射为语音套餐和流量套餐；a package mapping submodule, configured to map the traffic and the voice duration to a voice package and a traffic package according to a preset mapping relationship if the predicted result is traffic and voice duration;

套餐推荐子模块，用于根据所述语音套餐和所述流量套餐，组合成基础套餐，以推荐给所述用户。The package recommendation submodule is configured to combine the voice package and the traffic package into a basic package to recommend to the user.

可选地，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述特征预测模块，包括：Optionally, the multi-task model includes: two gate networks, multiple expert networks and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the feature prediction module includes :

权重获取子模块，用于通过每个所述门网络对所述特征数据进行处理，以获取每个所述门网络对应的多个专家网络的权重；A weight acquisition sub-module, configured to process the feature data through each of the gate networks to obtain the weights of multiple expert networks corresponding to each of the gate networks;

特征提取子模块，用于通过每个所述门网络对应的多个专家网络，分别对所述特征数据进行特征提取，以得到多个第一特征；A feature extraction sub-module, configured to perform feature extraction on the feature data through a plurality of expert networks corresponding to each of the gate networks to obtain a plurality of first features;

预测结果子模块，用于根据每个门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征，进行加权求和，并通过每个所述门网络对应的Tower网络得到所述预测结果。The prediction result sub-module is used to perform weighted summation of multiple first features corresponding to each gate network according to the weights of multiple expert networks corresponding to each gate network, and pass the Tower corresponding to each gate network. The network obtains the predicted result.

可选地，所述权重获取子模块，包括：Optionally, the weight acquisition sub-module includes:

获取概率单元，用于通过每个所述门网络对所述特征数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；obtaining a probability unit for performing feature extraction on the feature data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

获取权重单元，用于对多个所述专家网络被每个所述门网络选择的概率进行归一化处理，得到每个所述门网络对应的多个专家网络的权重。A weight unit is obtained, which is used for normalizing the probability that multiple expert networks are selected by each of the gate networks to obtain the weights of multiple expert networks corresponding to each of the gate networks.

样本获取模块，用于获取特征样本数据，所述特征样本数据包括基础特征样本、交互特征样本以及时间片特征样本；a sample acquisition module, configured to acquire feature sample data, where the feature sample data includes basic feature samples, interactive feature samples, and time slice feature samples;

样本输入模块，用于将所述特征样本数据作为所述多任务模型的输入；a sample input module, configured to use the feature sample data as the input of the multi-task model;

样本预测模块，用于在所述多任务模型中，采用所述特征样本数据进行针对语音使用的预测任务和针对流量使用的预测任务，以得到预测结果；A sample prediction module, configured to use the feature sample data to perform a prediction task for voice usage and a prediction task for traffic usage in the multi-task model to obtain a prediction result;

模型训练模块，根据所述预测结果，对所述多任务模型进行训练。A model training module, for training the multi-task model according to the prediction result.

可选地，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述样本预测模块包括：Optionally, the multi-task model includes: two gate networks, multiple expert networks and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the sample prediction module includes:

样本权重子模块，用于通过每个所述门网络对所述特征样本数据进行处理，以获取每个所述门网络对应的多个所述专家网络的权重；a sample weight submodule, configured to process the feature sample data through each of the gate networks to obtain the weights of a plurality of the expert networks corresponding to each of the gate networks;

样本提取子模块，用于通过每个所述门网络对应的多个专家网络，分别对所述特征样本数据进行特征提取，以得到多个第一特征样本；A sample extraction sub-module, configured to perform feature extraction on the feature sample data respectively through a plurality of expert networks corresponding to each of the gate networks, so as to obtain a plurality of first feature samples;

样本预测子模块，用于根据每个所述门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征样本，进行加权求和，并通过每个所述门网络对应的Tower网络，以得到流量和语音时长的预测结果。The sample prediction sub-module is used to perform weighted summation on multiple first feature samples corresponding to each gate network according to the weights of the multiple expert networks corresponding to each gate network, and pass through each gate network Corresponding Tower network to get the prediction results of traffic and voice duration.

可选地，所述样本权重子模块包括：Optionally, the sample weight submodule includes:

样本概率单元，用于通过每个所述门网络对所述特征样本数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；a sample probability unit, configured to perform feature extraction on the feature sample data through each of the gate networks, so as to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

样本权重单元，用于对多个所述专家网络被每个所述门网络选择的概率进行归一化处理，得到每个所述门网络对应的多个专家网络的权重。The sample weight unit is used for normalizing the probability that the plurality of expert networks are selected by each of the gate networks to obtain the weights of the plurality of expert networks corresponding to each of the gate networks.

本发明实施例包括以下优点：The embodiments of the present invention include the following advantages:

在本发明实施例中，根据用户手机号，从日志数据中获取用户行为数据；从用户行为数据中筛选特征数据，采用特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；根据预测结果，确定为用户推荐的语音套餐和流量套餐。在本发明实施例中可以通过使用多任务模型，分别对流量和语音这两个任务进行预测，提高了预测效率，并且能够更加精准的匹配最合适用户的套餐，实现最大程度的服务并留存用户。In the embodiment of the present invention, user behavior data is obtained from log data according to the user's mobile phone number; feature data is filtered from user behavior data, and the feature data is used to perform prediction tasks for voice usage and prediction tasks for traffic usage through a multi-task model. Predict the task and get the prediction result; according to the prediction result, determine the voice package and data package recommended for the user. In the embodiment of the present invention, by using a multi-task model, the two tasks of traffic and voice can be predicted respectively, which improves the prediction efficiency, and can more accurately match the most suitable user package, so as to achieve maximum service and retain users. .

附图说明Description of drawings

图1是本发明实施例提供的一种基于多任务模型的推荐方法的步骤流程图；1 is a flowchart of steps of a multitask model-based recommendation method provided by an embodiment of the present invention;

图2为本发明实施例提供的一种多任务模型的结构图；2 is a structural diagram of a multi-task model provided by an embodiment of the present invention;

图3是本发明实施例提供的多任务模型的训练方法的流程图；3 is a flowchart of a training method for a multi-task model provided by an embodiment of the present invention;

图4是本发明实施例提供的一种基于多任务模型的推荐装置的结构框图。FIG. 4 is a structural block diagram of a recommendation device based on a multi-task model provided by an embodiment of the present invention.

具体实施方式Detailed ways

为使本发明的上述目的、特征和优点能够更加明显易懂，下面结合附图和具体实施方式对本发明作进一步详细的说明。In order to make the above objects, features and advantages of the present invention more clearly understood, the present invention will be described in further detail below with reference to the accompanying drawings and specific embodiments.

随着5G时代的到来，越来越多的用户需要将手机卡更换成5G的形式，为了能够最大程度服务并提高用户的携号转化率，则需要精准的预测用户最适合的套餐，但是目前预测用户最适合的套餐并不精准，且预测效率较低，因此，为了更加精准预测用户最适合的套餐，并且提高预测效率，以为用户提高更好的服务，从而提高用户的携号转化率。With the advent of the 5G era, more and more users need to replace their mobile phone cards with 5G. In order to maximize service and improve the conversion rate of users' number portability, it is necessary to accurately predict the most suitable package for users, but at present It is not accurate to predict the most suitable package for a user, and the prediction efficiency is low. Therefore, in order to more accurately predict the most suitable package for a user and improve the prediction efficiency, the user can improve the service and improve the conversion rate of the user's number portability.

本发明实施例的核心构思之一在于，通过用户使用套餐时的用户行为数据，进行数据处理，将数据处理后的用户行为数据输入至已经训练好的、基于流量和语音这两个任务进行预测的多任务模型进行处理，从而得到预测结果，在提高预测效率的同时，能够更加精准的为用户匹配最合适的套餐，进一步提升用户体验，以实现最大程度的服务并留存用户。One of the core concepts of the embodiments of the present invention is to perform data processing through the user behavior data when the user uses the package, and input the user behavior data after data processing into the trained, based on the two tasks of traffic and voice for prediction The multi-task model is used for processing, so as to obtain the prediction result. While improving the prediction efficiency, it can more accurately match the most suitable package for the user, further improve the user experience, and achieve the maximum service and retain the user.

参照图1，示出了本发明实施例提供的一种基于多任务模型的推荐方法的步骤流程图，所述方法具体可以包括如下步骤：Referring to FIG. 1, it shows a flowchart of steps of a recommendation method based on a multi-task model provided by an embodiment of the present invention. The method may specifically include the following steps:

步骤101，根据用户手机号，从日志数据中获取用户行为数据；Step 101, obtaining user behavior data from log data according to the user's mobile phone number;

示例性地，日志表中存储有日志数据，日志表中记载着用户的相关行为信息，可以根据用户的手机号，从日志表中获取用户行为数据，用户行为数据可以包括用户基础信息、用户行为信息以及用户终端信息等等。用户基础信息可以包括用户年龄、用户性别、当前所在省份以及当前的套餐类型等等；用户行为信息可以包括用户购买加油包次数、购买加油包类型以及用户点击行为、近N月出账金额、近N月流量使用量等；用户终端信息可以包括用户使用终端个数、终端品牌等。Exemplarily, log data is stored in the log table, and the relevant behavior information of the user is recorded in the log table. User behavior data can be obtained from the log table according to the user's mobile phone number, and the user behavior data can include user basic information, user behavior data. information and user terminal information, etc. User basic information can include user age, user gender, current province and current package type, etc.; user behavior information can include the number of times the user has purchased gas packs, the type of gas packs purchased, and the user's click behavior, the amount of billing in the past N months, the recent N-month traffic usage, etc.; user terminal information may include the number of terminals used by the user, the terminal brand, and the like.

步骤102，从所述用户行为数据中筛选特征数据，所述特征数据包括基础特征、交互特征以及时间片特征；Step 102, screening feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

示例性地，筛选特征数据的具体操作可以为，首先对用户行为数据进行初步选择，然后对初步选择的用户行为数据进行数据清洗。Exemplarily, the specific operation of screening the feature data may be, firstly performing preliminary selection on the user behavior data, and then performing data cleaning on the preliminarily selected user behavior data.

对用户行为数据进行初步选择可以为，对用户行为数据进行可视化分析，例如通过箱型图、热力图等对用户行为数据进行初步选择，去除相关性过强或者缺失值、异常值严重的用户行为数据；利用随机森林来检查用户行为数据以过滤掉无关数据。Preliminary selection of user behavior data can be performed by visual analysis of user behavior data, such as preliminary selection of user behavior data through box plots, heat maps, etc., to remove user behaviors with strong correlation, missing values, and serious outliers. Data; utilizes random forests to examine user behavior data to filter out irrelevant data.

对初步选择的用户行为数据进行数据清洗，例如，对数据的格式进行转换，如金额类标签统一转换成以元为单位；对缺失值不严重的数据根据实际情况利用中位数和众数进行补值处理；利用k-means聚类算法对异常值进行重新赋值；对年龄等连续特征，做id化处理；对年龄、性别等重要特征做交叉，构造交叉特征等等。具体地，如何筛选出特征数据，本领域技术人员可以根据实际情况而定，本发明在此不作限制。Perform data cleaning on the preliminarily selected user behavior data, for example, convert the format of the data, such as the unified conversion of the amount label into yuan; for the data with less serious missing values, use the median and mode according to the actual situation. Complement value processing; use k-means clustering algorithm to reassign outliers; perform id processing for continuous features such as age; cross important features such as age and gender to construct cross features, etc. Specifically, how to filter out the characteristic data can be determined by those skilled in the art according to the actual situation, which is not limited in the present invention.

在本发明实施例中，特征数据可以包括基础特征、交互特征以及时间片特征；其中，基础特征可以包括用户年龄、性别等；交互特征可以包括用户对套餐或者流量包的行为以及套餐热度等；时间片特征可以包括前一天，近三天，月初月中月末，近一个月，近三个月等。In this embodiment of the present invention, the feature data may include basic features, interactive features, and time slice features; wherein, the basic features may include user age, gender, etc.; the interactive features may include user behavior on packages or traffic packages, package popularity, etc.; The time slice features may include the previous day, the last three days, the beginning of the month, the middle of the month, and the end of the month, the last month, the last three months, and so on.

步骤103，采用所述特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；Step 103, using the feature data, perform a prediction task for voice usage and a prediction task for traffic usage through a multi-task model, and obtain a prediction result;

在本发明实施例中，通过用户行为数据筛选的特征数据，输入至训练好的多任务模型中，以得到相应的预测结果。其中，训练好的多任务模型是分别基于语音任务和流量任务进行预测建模。具体地，使用何种方式训练多任务模型可以根据实际情况来定，本发明在此不作限制。In the embodiment of the present invention, the feature data screened by the user behavior data is input into the trained multi-task model to obtain the corresponding prediction result. Among them, the trained multi-task model performs predictive modeling based on speech tasks and traffic tasks respectively. Specifically, which method to use to train the multi-task model can be determined according to the actual situation, which is not limited in the present invention.

参照图2所示，为本发明实施例提供的一种多任务模型的结构图。多任务模型可以包括两个门网络(Gate A和Gate B)、多个专家网络(Expert)以及与门网络一一对应的Tower(塔)网络；其中，每一个任务对应配置一个Gate(门网络)，专家网络的数量可以为多个，具体地，专家网络的数据根据实际情况而定，本发明在此不作限制。Referring to FIG. 2 , it is a structural diagram of a multi-task model provided by an embodiment of the present invention. The multi-task model can include two gate networks (Gate A and Gate B), multiple expert networks (Expert), and a Tower network corresponding to the gate network one-to-one; wherein, each task corresponds to a Gate (gate network) configuration. ), the number of expert networks may be multiple, and specifically, the data of expert networks is determined according to the actual situation, which is not limited in the present invention.

门网络可以对输入的特征数据进行处理，以获取门网络对应的专家网络被选择的概率，并将特征数据基于多个专家网络进行加权求和，输出至对应的Tower网络；Tower网络用于表示最终的输出，表达式如下所示：The gate network can process the input feature data to obtain the probability that the expert network corresponding to the gate network is selected, and the feature data is weighted and summed based on multiple expert networks, and output to the corresponding Tower network; the Tower network is used to represent The final output, the expression looks like this:

其中，g^k(x)表示Gate的输出，为多层感知机模型，实现为简单的线性变换加softmax(柔性最大传递函数)层。Among them, g ^k (x) represents the output of Gate, which is a multi-layer perceptron model, which is implemented as a simple linear transformation plus a softmax (flexible maximum transfer function) layer.

softmax用于为每个输出分类的结果都赋予一个概率值，表示属于每个类别的可能性。Softmax is used to assign a probability value to the outcome of each output classification, representing the likelihood of belonging to each class.

损失函数：分别计算两个任务的RMSE(Root Mean Square Error：均方根误差)，然后求和作为整个任务的损失函数。Loss function: Calculate the RMSE (Root Mean Square Error: root mean square error) of the two tasks separately, and then sum it up as the loss function of the entire task.

在一种可选的实施例中，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述步骤103可以包括以下子步骤S11-S13：In an optional embodiment, the multi-task model includes: two gate networks, multiple expert networks, and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the The step 103 may include the following sub-steps S11-S13:

子步骤S11，通过每个所述门网络对所述特征数据进行处理，以获取每个所述门网络对应的多个专家网络的权重；Sub-step S11, processing the feature data through each of the gate networks to obtain the weights of multiple expert networks corresponding to each of the gate networks;

在本发明实施例中，训练好的多任务模型可以包括两个门网络，多个专家网络以及与门网络一一对应的Tower网络；在具体实施中，可以将Gate A作为语音任务，Gate B作为流量任务；Gate A可以对输入的特征数据进行处理，从而获得语音任务对应的多个专家网络的权重；Gate B可以对输入的特征数据进行处理，从而获得流量任务对应的多个专家网络的权重。具体地，Gate A也可以作为流量任务，Gate B也可以作为语音任务，每个任务对应的门网络可以根据实际情况设定，本发明在此不作限制。In this embodiment of the present invention, the trained multi-task model may include two gate networks, multiple expert networks, and a Tower network corresponding to the gate network one-to-one; in specific implementation, Gate A may be used as a voice task, and Gate B As a traffic task; Gate A can process the input feature data to obtain the weights of multiple expert networks corresponding to the voice task; Gate B can process the input feature data to obtain the weights of multiple expert networks corresponding to the traffic task. Weights. Specifically, Gate A can also be used as a traffic task, and Gate B can also be used as a voice task, and the gate network corresponding to each task can be set according to the actual situation, which is not limited in the present invention.

子步骤S12，通过每个所述门网络对应的多个专家网络，分别对所述特征数据进行特征提取，以得到多个第一特征；Sub-step S12, through a plurality of expert networks corresponding to each of the gate networks, respectively perform feature extraction on the feature data to obtain a plurality of first features;

子步骤S13，根据每个门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征，进行加权求和，并通过每个所述门网络对应的Tower网络得到所述预测结果。Sub-step S13, according to the weights of the multiple expert networks corresponding to each gate network, perform a weighted summation on the multiple first features corresponding to each gate network, and obtain the obtained data through the Tower network corresponding to each gate network. the predicted results.

在本发明实施例中，每个门网络可以对应着多个专家网络，多个专家网络分别对输入的特征数据进行提取，从而得到对应的多个第一特征；基于每个门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征，进行加权求和，可以通过每个门网络对应的Tower网络，获得预测结果。其中，第一特征的数量与专家网络的数量相同。In the embodiment of the present invention, each gate network may correspond to multiple expert networks, and multiple expert networks respectively extract the input feature data, thereby obtaining multiple corresponding first features; based on the multiple expert networks corresponding to each gate network The weight of each expert network is weighted and summed for multiple first features corresponding to each gate network, and the prediction result can be obtained through the Tower network corresponding to each gate network. Among them, the number of first features is the same as the number of expert networks.

在一种可选的实施例中，所述子步骤S11可以包括：通过每个所述门网络对所述特征数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；对多个所述专家网络被每个所述门网络选择的概率进行归一化处理，得到每个所述门网络对应的多个专家网络的权重。In an optional embodiment, the sub-step S11 may include: performing feature extraction on the feature data through each of the gate networks, so as to obtain a plurality of the expert networks selected by each of the gate networks The probability that multiple expert networks are selected by each gate network is normalized to obtain the weights of multiple expert networks corresponding to each gate network.

在本发明实施例中，每个门网络可以对输入的特征数据进行特征提取，以得到该输入特征数据的第二特征；通过该门网络对第二特征进行softmax处理，以得到多个专家网络被该门网络选择的概率，并对每个门网络对应的多个专家网络的概率进行归一化处理，以得到每个门网络对应的多个专家网络的权重。其中，概率的大小可以为0。In the embodiment of the present invention, each gate network can perform feature extraction on the input feature data to obtain the second feature of the input feature data; perform softmax processing on the second feature through the gate network to obtain multiple expert networks The probability of being selected by the gate network is normalized to the probability of multiple expert networks corresponding to each gate network, so as to obtain the weight of multiple expert networks corresponding to each gate network. Among them, the size of the probability can be 0.

步骤104，根据所述预测结果，确定为所述用户推荐的语音套餐和流量套餐。Step 104: Determine, according to the prediction result, a voice package and a data package recommended for the user.

示例性地，基于不同的任务预测需求，设置不同的Tower网络，例如，如果是预测用户的点击率，则可以设置一个用于预测用户点击率的Tower网络，通过该Tower网络，得到用户的点击概率。Exemplarily, different Tower networks are set based on different task prediction requirements. For example, if it is to predict the click-through rate of users, a Tower network for predicting the click-through rate of users can be set, and the clicks of users can be obtained through the Tower network. probability.

在本发明实施例中，通过训练好的多任务模型可以进行针对语音使用的预测任务和针对流量使用的预测任务，通过设置的Tower网络得到预测结果；其中，预测结果可以通过设置Tower网络来进行调整。In the embodiment of the present invention, a prediction task for voice usage and a prediction task for traffic usage can be performed through the trained multi-task model, and the prediction result can be obtained through the set Tower network; wherein, the prediction result can be performed by setting the Tower network. Adjustment.

在一种可选的实施例中，所述步骤104可以包括以下子步骤S21-S22：In an optional embodiment, the step 104 may include the following sub-steps S21-S22:

子步骤S21，若所述预测结果为流量和语音时长，则根据预先设置的映射关系，将流量和语音时长映射为语音套餐和流量套餐；Sub-step S21, if the predicted result is traffic and voice duration, then according to the preset mapping relationship, the traffic and voice duration are mapped to a voice package and a traffic package;

子步骤S22，根据所述语音套餐和所述流量套餐，组合成基础套餐，以推荐给所述用户。Sub-step S22, combining the voice package and the traffic package into a basic package to recommend to the user.

在本发明实施例中，预测结果可以为流量和语音时长；根据预先设置的语音套餐和流量套餐，将流量和语音时长映射为流量套餐和语音套餐，并组成基础套餐，推荐给用户。例如，流量套餐分别为A1、A2和A3套餐，对应的流量分别为5G、10G以及30G；语音套餐分别为B1、B2和B3套餐，对应的语音时长分别为30分钟、80分钟、150分钟。当预测结果的流量为9G、语音时长为70分钟的时候，将流量套餐A2和语音套餐B2，组合成基础套餐，推荐给相应的用户。In the embodiment of the present invention, the prediction result may be traffic and voice duration; according to the preset voice package and traffic package, the traffic and voice duration are mapped to the traffic package and the voice package, and a basic package is formed and recommended to the user. For example, the data packages are A1, A2, and A3, and the corresponding traffic is 5G, 10G, and 30G; the voice packages are B1, B2, and B3, and the corresponding voice durations are 30 minutes, 80 minutes, and 150 minutes, respectively. When the predicted traffic is 9G and the voice duration is 70 minutes, the traffic package A2 and the voice package B2 are combined into a basic package and recommended to the corresponding users.

示例性地，推荐给用户的方式可以是通过短信、电话或者APP弹窗等方式。具体地，如何将套餐推荐给用户，本领域技术人员可以根据实际情况而定，本发明在此不作限制。Exemplarily, the method of recommending to the user may be through a text message, a phone call, or a pop-up window of an APP or the like. Specifically, how to recommend the package to the user can be determined by those skilled in the art according to the actual situation, which is not limited in the present invention.

在本发明实施例中，根据用户手机号，从日志数据中获取用户行为数据；从用户行为数据中筛选特征数据，采用特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；根据预测结果，确定为用户推荐的语音套餐和流量套餐。通过使用多任务模型，分别对流量和语音这两个任务进行预测，提高了预测效率，并且能够更加精准的匹配最合适用户的套餐，实现最大程度的服务并留存用户。In the embodiment of the present invention, user behavior data is obtained from log data according to the user's mobile phone number; feature data is filtered from user behavior data, and the feature data is used to perform prediction tasks for voice usage and prediction tasks for traffic usage through a multi-task model. Predict the task and get the prediction result; according to the prediction result, determine the voice package and data package recommended for the user. By using a multi-task model, the two tasks of traffic and voice are predicted respectively, which improves the prediction efficiency, and can more accurately match the most suitable user package, achieve maximum service and retain users.

参照图3，示出了本发明实施例中多任务模型的训练方法的流程图，多任务模型的训练方法包括：Referring to FIG. 3, a flowchart of a training method of a multi-task model in an embodiment of the present invention is shown, and the training method of a multi-task model includes:

步骤301，获取特征样本数据，所述特征样本数据包括基础特征样本、交互特征样本以及时间片特征样本；Step 301, acquiring feature sample data, the feature sample data includes basic feature samples, interactive feature samples, and time slice feature samples;

示例性地，特征样本数据可以包括基础特征样本、交互特征样本，以及时间片特征样本等，对于特征样本数据的获取可以通过用户的历史套餐使用数据进行筛选，具体地，如何获取特征样本数据，本领域技术人员可以根据实际情况而定，本发明在此不作限制。Exemplarily, the feature sample data may include basic feature samples, interactive feature samples, and time slice feature samples, etc. The acquisition of the feature sample data may be filtered through the user's historical package usage data, specifically, how to obtain the feature sample data, Those skilled in the art can decide according to the actual situation, and the present invention is not limited here.

步骤302，将所述特征样本数据作为所述多任务模型的输入；Step 302, using the feature sample data as the input of the multi-task model;

步骤303，在所述多任务模型中，采用所述特征样本数据进行针对语音使用的预测任务和针对流量使用的预测任务，以得到预测结果；Step 303, in the multi-task model, using the feature sample data to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result;

在本发明实施例中，可以将获取的特征样本数据作为多任务模型的输入，以通过多任务模型，得到针对语音和流量的预测结果。其中，多任务模型是分别基于语音任务和流量任务进行建模；其对应的预测结果可以为语音时长和流量，具体地，可以通过基于不同任务的预测需求，来设置Tower网络，从而获取与需求对应的预测结果。In the embodiment of the present invention, the acquired feature sample data may be used as the input of the multi-task model, so as to obtain the prediction results for speech and traffic through the multi-task model. Among them, the multi-task model is modeled based on voice tasks and traffic tasks respectively; the corresponding prediction results can be voice duration and traffic. Specifically, the Tower network can be set based on the predicted requirements of different tasks, so as to obtain and demand corresponding prediction results.

在一种可选的实施例中，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述步骤303可以包括以下子步骤S31-S33：In an optional embodiment, the multi-task model includes: two gate networks, multiple expert networks, and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the The step 303 may include the following sub-steps S31-S33:

子步骤S31，通过每个所述门网络对所述特征样本数据进行处理，以获取每个所述门网络对应的多个所述专家网络的权重；Sub-step S31, processing the feature sample data through each of the gate networks to obtain the weights of a plurality of the expert networks corresponding to each of the gate networks;

在一种可选的实施例中，所述子步骤S31可以包括：In an optional embodiment, the sub-step S31 may include:

通过每个所述门网络对所述特征样本数据进行特征提取，以得到多个所述专家网络被每个所述门网络选择的概率；对多个所述专家网络被每个所述门网络选择的概率进行归一化处理，得到每个所述门网络对应的多个专家网络的权重。Perform feature extraction on the feature sample data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks; The selected probability is normalized to obtain the weights of multiple expert networks corresponding to each gate network.

在本发明实施例中，每个门网络可以用于对输入的特征样本数据进行特征提取后，以得到该输入特征数据的第二特征样本；通过该门网络对第二特征样本进行softmax处理，以得到多个专家网络被每个门网络选择的概率；其中，概率的大小可以为0。In this embodiment of the present invention, each gate network can be used to perform feature extraction on the input feature sample data to obtain a second feature sample of the input feature data; perform softmax processing on the second feature sample through the gate network, to obtain the probability that multiple expert networks are selected by each gate network; wherein, the size of the probability can be 0.

每个门网络对应的多个专家网络的权重，是对每个门网络对应的多个专家网络的概率进行归一化处理得到的。The weights of the multiple expert networks corresponding to each gate network are obtained by normalizing the probabilities of the multiple expert networks corresponding to each gate network.

子步骤S32，通过每个所述门网络对应的多个专家网络，分别对所述特征样本数据进行特征提取，以得到多个第一特征样本；Sub-step S32, through a plurality of expert networks corresponding to each of the gate networks, respectively perform feature extraction on the feature sample data to obtain a plurality of first feature samples;

子步骤S33，根据每个所述门网络对应的多个专家网络的权重，对每个门网络对应的多个第一特征样本，进行加权求和，并通过每个所述门网络对应的Tower网络，以得到流量和语音时长的预测结果；Sub-step S33, according to the weights of the multiple expert networks corresponding to each of the gate networks, perform a weighted sum of the multiple first feature samples corresponding to each gate network, and pass the Tower corresponding to each of the gate networks. network to get traffic and voice duration predictions;

在本发明实施例中，每个门网络可以对应着多个专家网络，专家网络用于输入的特征样本数据进行特征提取，以得到对应的多个第一特征样本。其中，第一特征样本的数量与专家网络的数量相同。In the embodiment of the present invention, each gate network may correspond to multiple expert networks, and the expert network is used to perform feature extraction on the input feature sample data to obtain a plurality of corresponding first feature samples. Among them, the number of first feature samples is the same as the number of expert networks.

根据每个门网络对应的多个专家网络的权重，对多个第一特征样本进行加权求和，可以通过每个门网络对应的Tower网络，获得流量和语音时长的预测结果。According to the weights of multiple expert networks corresponding to each gate network, the weighted summation of multiple first feature samples is performed, and the prediction results of traffic and voice duration can be obtained through the Tower network corresponding to each gate network.

步骤304，根据所述预测结果，对所述多任务模型进行训练。Step 304: Train the multi-task model according to the prediction result.

在本发明实施例中，可以根据对应的预测结果，对多任务模型进行训练；示例性地，可以采用十折交叉验证方式或者五折交叉验证的方式对训练的多任务模型进行筛选。In the embodiment of the present invention, the multi-task model can be trained according to the corresponding prediction result; for example, the trained multi-task model can be screened by using a ten-fold cross-validation method or a five-fold cross-validation method.

在本发明实施例中，在多任务模型中，通过特征样本数据进行语音使用的预测任务和针对流量使用的预测任务，来训练多任务模型，可以提高模型鲁棒性，并且通过不同门网络来同时预测语音任务和流量任务，既提高了预测的精准度，也提高了预测效率。In the embodiment of the present invention, in the multi-task model, the multi-task model is trained by using the feature sample data for the prediction task of speech usage and the prediction task for traffic usage, which can improve the robustness of the model, and use different gate networks to Predicting voice tasks and traffic tasks at the same time not only improves the accuracy of prediction, but also improves the efficiency of prediction.

需要说明的是，对于方法实施例，为了简单描述，故将其都表述为一系列的动作组合，但是本领域技术人员应该知悉，本发明实施例并不受所描述的动作顺序的限制，因为依据本发明实施例，某些步骤可以采用其他顺序或者同时进行。其次，本领域技术人员也应该知悉，说明书中所描述的实施例均属于优选实施例，所涉及的动作并不一定是本发明实施例所必须的。It should be noted that, for the sake of simple description, the method embodiments are described as a series of action combinations, but those skilled in the art should know that the embodiments of the present invention are not limited by the described action sequences, because According to embodiments of the present invention, certain steps may be performed in other sequences or simultaneously. Secondly, those skilled in the art should also know that the embodiments described in the specification are all preferred embodiments, and the actions involved are not necessarily required by the embodiments of the present invention.

参照图4，示出了本发明实施例提供的一种基于多任务模型的推荐装置的结构框图，具体可以包括如下模块：Referring to FIG. 4, it shows a structural block diagram of a recommendation device based on a multi-task model provided by an embodiment of the present invention, which may specifically include the following modules:

获取数据模块401，用于根据用户手机号，从日志数据中获取用户行为数据；A data acquisition module 401, configured to acquire user behavior data from log data according to the user's mobile phone number;

筛选特征模块402，用于从所述用户行为数据中筛选特征数据，所述特征数据包括基础特征、交互特征以及时间片特征；A screening feature module 402, configured to screen feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

特征预测模块403，用于采用所述特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；The feature prediction module 403 is configured to use the feature data to perform a prediction task for voice usage and a prediction task for traffic usage through a multi-task model to obtain a prediction result;

套餐确定模块404，用于根据所述预测结果，确定为所述用户推荐的语音套餐和流量套餐。The package determination module 404 is configured to determine, according to the prediction result, a voice package and a data package recommended for the user.

在一种实施例中，所述套餐确定模块包括：In one embodiment, the package determination module includes:

在一种实施例中，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述特征预测模块，包括：In an embodiment, the multi-task model includes: two gate networks, multiple expert networks, and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the feature prediction modules, including:

在一种实施例中，所述权重获取子模块，包括：In one embodiment, the weight acquisition sub-module includes:

在一种实施例中，通过如下方式训练所述多任务模型：In one embodiment, the multi-task model is trained by:

在一种实施例中，所述多任务模型包括：两个门网络、多个专家网络以及两个Tower网络，所述两个门网络与所述两个Tower网络一一对应；所述样本预测模块包括：In an embodiment, the multi-task model includes: two gate networks, multiple expert networks and two Tower networks, the two gate networks are in one-to-one correspondence with the two Tower networks; the sample prediction Modules include:

在一种实施例中，所述样本权重子模块包括：In one embodiment, the sample weight sub-module includes:

综上，在本发明实施例中，根据用户手机号，从日志数据中获取用户行为数据；从用户行为数据中筛选特征数据，采用特征数据，通过多任务模型进行针对语音使用的预测任务和针对流量使用的预测任务，得到预测结果；根据预测结果，确定为用户推荐的语音套餐和流量套餐。通过使用多任务模型，分别对流量和语音这两个任务进行预测，提高了预测效率，并且能够更加精准的匹配最合适用户的套餐，实现最大程度的服务并留存用户。To sum up, in the embodiment of the present invention, user behavior data is obtained from log data according to the user's mobile phone number; feature data is screened from user behavior data, and feature data is used to perform prediction tasks for voice usage and target data through a multi-task model. According to the prediction task of traffic usage, the prediction result is obtained; according to the prediction result, the voice package and data package recommended for the user are determined. By using a multi-task model, the two tasks of traffic and voice are predicted respectively, which improves the prediction efficiency, and can more accurately match the most suitable user package, achieve maximum service and retain users.

对于装置实施例而言，由于其与方法实施例基本相似，所以描述的比较简单，相关之处参见方法实施例的部分说明即可。As for the apparatus embodiment, since it is basically similar to the method embodiment, the description is relatively simple, and reference may be made to the partial description of the method embodiment for related parts.

本发明实施例还提供了一种电子设备，包括：The embodiment of the present invention also provides an electronic device, including:

包括处理器、存储器及存储在所述存储器上并能够在所述处理器上运行的计算机程序，该计算机程序被处理器执行时实现上述一种基于多任务模型的推荐方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。It includes a processor, a memory, and a computer program stored on the memory and capable of running on the processor, and when the computer program is executed by the processor, each process of the above-mentioned embodiment of the recommendation method based on a multitasking model is implemented, And can achieve the same technical effect, in order to avoid repetition, it is not repeated here.

本发明实施例还提供了一种计算机可读存储介质，计算机可读存储介质上存储计算机程序，计算机程序被处理器执行时实现上述一种基于多任务模型的推荐方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。Embodiments of the present invention further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, each process of the above-mentioned embodiment of the multitask model-based recommendation method is implemented, and The same technical effect can be achieved, and in order to avoid repetition, details are not repeated here.

本说明书中的各个实施例均采用递进的方式描述，每个实施例重点说明的都是与其他实施例的不同之处，各个实施例之间相同相似的部分互相参见即可。The various embodiments in this specification are described in a progressive manner, and each embodiment focuses on the differences from other embodiments, and the same and similar parts between the various embodiments may be referred to each other.

本领域内的技术人员应明白，本发明实施例的实施例可提供为方法、装置、或计算机程序产品。因此，本发明实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且，本发明实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。It should be understood by those skilled in the art that the embodiments of the embodiments of the present invention may be provided as a method, an apparatus, or a computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product implemented on one or more computer-usable storage media having computer-usable program code embodied therein, including but not limited to disk storage, CD-ROM, optical storage, and the like.

本发明实施例是参照根据本发明实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、专用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器，使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。Embodiments of the present invention are described with reference to flowcharts and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the present invention. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, special purpose computer, embedded processor or other programmable data processing terminal equipment to produce a machine that causes the instructions to be executed by the processor of the computer or other programmable data processing terminal equipment Means are created for implementing the functions specified in the flow or flows of the flowcharts and/or the blocks or blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中，使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品，该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing terminal equipment to operate in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising instruction means, the The instruction means implement the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams.

这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上，使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理，从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions can also be loaded on a computer or other programmable data processing terminal equipment, so that a series of operational steps are performed on the computer or other programmable terminal equipment to produce a computer-implemented process, thereby executing on the computer or other programmable terminal equipment The instructions executed on the above provide steps for implementing the functions specified in the flowchart or blocks and/or the block or blocks of the block diagrams.

尽管已描述了本发明实施例的优选实施例，但本领域内的技术人员一旦得知了基本创造性概念，则可对这些实施例做出另外的变更和修改。所以，所附权利要求意欲解释为包括优选实施例以及落入本发明实施例范围的所有变更和修改。Although preferred embodiments of the embodiments of the present invention have been described, additional changes and modifications to these embodiments may be made by those skilled in the art once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiments as well as all changes and modifications that fall within the scope of the embodiments of the present invention.

最后，还需要说明的是，在本文中，诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来，而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。Finally, it should also be noted that in this document, relational terms such as first and second are used only to distinguish one entity or operation from another, and do not necessarily require or imply these entities or that there is any such actual relationship or sequence between operations. Moreover, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or terminal device comprising a list of elements includes not only those elements, but also a non-exclusive list of elements. other elements, or also include elements inherent to such a process, method, article or terminal equipment. Without further limitation, an element defined by the phrase "comprises a..." does not preclude the presence of additional identical elements in the process, method, article or terminal device comprising said element.

以上对本发明所提供的一种基于多任务模型的推荐方法和一种基于多任务模型的推荐装置，进行了详细介绍，本文中应用了具体个例对本发明的原理及实施方式进行了阐述，以上实施例的说明只是用于帮助理解本发明的方法及其核心思想；同时，对于本领域的一般技术人员，依据本发明的思想，在具体实施方式及应用范围上均会有改变之处，综上所述，本说明书内容不应理解为对本发明的限制。A method for recommending based on a multi-task model and a device for recommending based on a multi-task model provided by the present invention have been described in detail above. Specific examples are used in this paper to illustrate the principles and implementations of the present invention. The description of the embodiment is only used to help understand the method of the present invention and its core idea; meanwhile, for those of ordinary skill in the art, according to the idea of the present invention, there will be changes in specific embodiments and application scope. As mentioned above, the contents of this specification should not be construed as limiting the present invention.

Claims

1. A recommendation method based on a multi-task model, wherein the method comprises:

Obtain user behavior data from log data according to the user's mobile phone number;

Screening feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

Using the feature data, a multi-task model is used to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result;

According to the prediction result, a voice package and a data package recommended for the user are determined.

2. The method according to claim 1, wherein, according to the prediction result, determining the voice package and the traffic package recommended for the user, comprising:

If the predicted result is traffic and voice duration, then map the traffic and voice duration to voice packages and traffic packages according to a preset mapping relationship;

According to the voice package and the data package, a basic package is combined to be recommended to the user.

3. The method according to claim 1, wherein the multi-task model comprises: two gate networks, multiple expert networks and two Tower networks, the two gate networks and the two Tower networks One-to-one correspondence; the feature data is used to perform the prediction task for voice usage and the prediction task for traffic usage through a multi-task model to obtain prediction results, including:

Process the feature data through each of the gate networks to obtain the weights of multiple expert networks corresponding to each of the gate networks;

Perform feature extraction on the feature data through a plurality of expert networks corresponding to each of the gate networks to obtain a plurality of first features;

According to the weights of multiple expert networks corresponding to each gate network, weighted summation is performed on multiple first features corresponding to each gate network, and the prediction result is obtained through the Tower network corresponding to each gate network.

4. The method according to claim 3, wherein the feature data is processed through each gate network to obtain the weights of multiple expert networks corresponding to each gate network, comprising:

Perform feature extraction on the feature data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

Normalize the probability that the plurality of expert networks are selected by each of the gate networks to obtain the weights of the multiple expert networks corresponding to each of the gate networks.

5. The method of claim 1, wherein the multitasking model is trained in the following manner:

acquiring feature sample data, where the feature sample data includes basic feature samples, interactive feature samples, and time slice feature samples;

using the feature sample data as the input of the multi-task model;

In the multi-task model, the feature sample data is used to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result;

According to the prediction result, the multi-task model is trained.

6. The method according to claim 5, wherein the multi-task model comprises: two gate networks, multiple expert networks and two Tower networks, the two gate networks and the two Tower networks One-to-one correspondence; in the multi-task model, the feature sample data is used to perform a prediction task for voice usage and a prediction task for traffic usage to obtain a prediction result, including:

Process the feature sample data through each of the gate networks to obtain the weights of a plurality of the expert networks corresponding to each of the gate networks;

Perform feature extraction on the feature sample data through a plurality of expert networks corresponding to each of the gate networks to obtain a plurality of first feature samples;

According to the weights of multiple expert networks corresponding to each gate network, weighted summation is performed on multiple first feature samples corresponding to each gate network, and the Tower network corresponding to each gate network is used to obtain Predicted results of traffic and voice duration.

7 . The method according to claim 6 , wherein the feature sample data is processed through each of the gate networks to obtain the data of a plurality of the expert networks corresponding to each of the gate networks. 8 . weights, including:

Perform feature extraction on the feature sample data through each of the gate networks to obtain the probability that a plurality of the expert networks are selected by each of the gate networks;

8. A recommendation device based on a multi-task model, wherein the device comprises:

The data acquisition module is used to acquire user behavior data from the log data according to the user's mobile phone number;

a screening feature module, configured to screen feature data from the user behavior data, where the feature data includes basic features, interaction features, and time slice features;

A feature prediction module, configured to use the feature data to perform a prediction task for voice usage and a prediction task for traffic usage through a multi-task model to obtain a prediction result;

A package determination module, configured to determine a voice package and a traffic package recommended for the user according to the prediction result.

9. An electronic device, comprising: a processor, a memory, and a computer program stored on the memory and capable of running on the processor, the computer program being executed by the processor to achieve the following: The steps of the multi-task model-based recommendation method according to any one of claims 1-7.

10. A computer-readable storage medium, characterized in that, a computer program is stored on the computer-readable storage medium, and when the computer program is executed by a processor, the method according to any one of claims 1-7 is implemented. Steps of a recommended approach for a multi-task model.