WO2019120019A1 - User gender prediction method and apparatus, storage medium and electronic device - Google Patents

User gender prediction method and apparatus, storage medium and electronic device Download PDF

Info

Publication number
WO2019120019A1
WO2019120019A1 PCT/CN2018/116481 CN2018116481W WO2019120019A1 WO 2019120019 A1 WO2019120019 A1 WO 2019120019A1 CN 2018116481 W CN2018116481 W CN 2018116481W WO 2019120019 A1 WO2019120019 A1 WO 2019120019A1
Authority
WO
WIPO (PCT)
Prior art keywords
prediction
probability
user
sample
gender
Prior art date
Application number
PCT/CN2018/116481
Other languages
French (fr)
Chinese (zh)
Inventor
陈岩
刘耀勇
Original Assignee
Oppo广东移动通信有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Oppo广东移动通信有限公司 filed Critical Oppo广东移动通信有限公司
Publication of WO2019120019A1 publication Critical patent/WO2019120019A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition

Definitions

  • the present application belongs to the field of communications technologies, and in particular, to a user gender prediction method, apparatus, storage medium, and electronic device.
  • the present application provides a user gender prediction method, apparatus, storage medium, and electronic device capable of predicting a user's gender based on a BP neural network.
  • an embodiment of the present application provides a user gender prediction method, including:
  • a prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  • the embodiment of the present application provides a user gender prediction apparatus, including:
  • An acquisition module is configured to collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction;
  • a training module configured to train the BP neural network model by using the sample set to obtain a trained prediction model
  • An obtaining module configured to obtain multi-dimensional feature information of an electronic device used by an unknown gender user as a prediction sample
  • a generating module configured to generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: a first probability that the current user is a male, and a second probability that the current user is a female.
  • an embodiment of the present application provides a storage medium on which a computer program is stored, and when the computer program runs on a computer, causes the computer to execute the user gender prediction method described above.
  • an embodiment of the present application provides an electronic device, including a processor and a memory, where the memory stores a plurality of instructions, and the processor loads the instructions in the memory to perform the following steps:
  • a prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  • FIG. 1 is a schematic diagram of a system for a user gender prediction apparatus according to an embodiment of the present application.
  • FIG. 2 is a schematic diagram of an application scenario of a user gender prediction apparatus according to an embodiment of the present application.
  • FIG. 3 is a schematic flowchart of a user gender prediction method according to an embodiment of the present application.
  • FIG. 4 is another schematic flowchart of a user gender prediction method according to an embodiment of the present application.
  • FIG. 5 is a schematic diagram of another application scenario of a user gender prediction apparatus according to an embodiment of the present application.
  • FIG. 6 is a schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
  • FIG. 7 is another schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
  • FIG. 9 is another schematic structural diagram of an electronic device according to an embodiment of the present application.
  • the electronic device may be a smart phone, a tablet computer, a desktop computer, a notebook computer, or a handheld computer.
  • FIG. 1 is a schematic diagram of a system for predicting a user gender according to an embodiment of the present application.
  • the user gender prediction device is mainly used for collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, constructing a sample set of gender prediction, and training the BP neural network model by using the sample set to obtain a trained prediction model.
  • FIG. 2 is a schematic diagram of an application scenario of a user gender prediction apparatus according to an embodiment of the present application.
  • the user gender prediction device collects multi-dimensional feature information of the electronic device that the unknown gender user uses.
  • the multi-dimensional feature information may be used to acquire historical usage information in the electronic device and collect multi-dimensional feature information from the historical usage information.
  • the multi-dimensional feature information is input to the prediction model, and the current user is predicted by the prediction model to obtain the first probability.
  • the prediction model is used to predict the current user's female probability, and the second probability is obtained, and finally according to the first probability sum.
  • the second probability is entered into the final predicted result.
  • the embodiment of the present application provides a user gender prediction method, and the execution subject of the user gender prediction method may be the user gender prediction device provided by the embodiment of the present application, or an electronic device integrated with the user gender prediction device, wherein the user gender prediction The device can be implemented in hardware or software.
  • the embodiment of the present application will be described from the perspective of a user gender prediction device, which may be specifically integrated in an electronic device.
  • the user gender prediction method includes: collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, constructing a sample set of gender prediction, and training the BP neural network model by using the sample set to obtain a trained prediction model and acquiring an unknown.
  • the gender user uses the multi-dimensional feature information of the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: the first probability that the current user is a male, and the current user is a female Two probability.
  • the step of training the BP neural network model by using the sample set comprises:
  • the BP neural network model is trained according to the normalized sample parameters.
  • the step of training the BP neural network model according to the normalized sample parameters includes:
  • Training is performed based on the loss value to generate a target model parameter.
  • the step of training according to the loss value includes:
  • Training is performed using the gradient descent method based on the loss value.
  • the method further includes:
  • the final prediction result is output.
  • the step of outputting the final prediction result according to the comparison result includes:
  • the step of inputting the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results includes:
  • Z K is the intermediate value and C is the number of categories of the predicted result.
  • C is the number of categories of the predicted result.
  • the step of obtaining a loss value according to the two prediction results and the probability corresponding thereto includes:
  • C is the number of categories of prediction results
  • y k is the true value
  • E is the average.
  • FIG. 3 is a schematic flowchart diagram of a user gender prediction method according to an embodiment of the present application.
  • the user gender prediction method provided by the embodiment of the present application is applied to an electronic device, and the specific process may be as follows:
  • Step 101 Collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction.
  • the multi-dimensional feature information of the electronic user using the electronic device has a dimension of a certain length, and the parameters in each dimension correspond to one feature information of the characterization application, that is, the multi-dimensional feature information is composed of a plurality of feature information.
  • the plurality of feature information may include feature information of the user in using different types of applications, such as the number and duration of browsing the male-type goods (such as men's clothing) in the shopping application, and the user browsing the female-oriented products in the shopping application (such as The number and duration of cosmetics and women's wear; the length of time users read a male novel in a reading application, the length of time a user reads a female novel, the length of time a user reads a sports news, the length of time a user reads a constellation news; the user uses a different type
  • the usage information in the application such as the number of times the user uses the front camera self-timer, the number of times the user uses the beauty software, and the number and duration of the user playing different types of games.
  • the sample set of gender predictions may include multiple training samples collected during the historical time period.
  • the historical time period can be, for example, the past 7 days, 10 days, and the like. It can be understood that the known gender users collected at one time use the multi-dimensional feature data of the electronic device to form a sample set.
  • the feature set for each sample is recorded with a set of real numbers.
  • the sample values need to be normalized during sampling, such as normalization to a number between 0 and 1.
  • each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
  • each sample in the sample set can be marked to obtain a sample label of each sample. Since the implementation is to predict the gender of the current user, the labeled sample label is the registered account of the user.
  • the actual gender information provided including gender males and gender females, can be represented by a code of 0 or 1.
  • the sample categories can include gender males and gender females.
  • the user may mark the historical usage habits of the application in the electronic device, for example, within a length of one week, the user browses the male-type goods (such as men's clothing) in the shopping application for 20 times, and the accumulated duration is one hour.
  • the user can read a female novel for 8 hours, which can be marked as a gender female.
  • the value "1" can be used to indicate “gender male”.
  • sample set of the above gender prediction is used to train the BP neural network model, for example, the sample set may include the following features:
  • step 102 the BP neural network model is trained by using the sample set to obtain the trained prediction model.
  • BP neural network model is a classification model in machine learning. Its basic idea is gradient descent method, which uses gradient search technology to minimize the error mean square error between the actual output value of the network and the expected output value.
  • the basic BP algorithm includes two processes of forward propagation of signals and back propagation of errors. That is, the calculation of the error output is performed in the direction from the input to the output, and the adjustment weight and the threshold are performed from the output to the input.
  • the input signal acts on the output node through the hidden layer and undergoes a nonlinear transformation to produce an output signal. If the actual output does not match the expected output, the error propagates back into the error propagation process. Error back propagation is to pass the output error back to the input layer through the hidden layer, and distribute the error to all the units in each layer, so as to adjust the error value of each unit as the basis for adjusting the weight of each unit.
  • the error is decreased along the gradient direction, and after repeated learning training, the network parameters (weights and thresholds corresponding to the minimum error) are determined. ), the training will stop. At this time, the trained neural network can process the non-linear conversion information with the smallest output error for the input information of similar samples.
  • the BP neural network model can be trained by using the sample set to obtain the trained prediction model.
  • the training model of the BP neural network model is used to solve the model parameters in the BP neural network model.
  • the network structure of the BP neural network model in the embodiment of the present application includes three layers, which are an input layer, a hidden layer, and an output layer, respectively, wherein the input layer inputs the feature information in step 101, and of course the feature information is normalized.
  • the feature information for example, the above 9 feature information is introduced into 9 input nodes, and the hidden layer may include three layers, which may be 10 nodes, 5 nodes, 2 nodes respectively, and finally obtain 2D output through the SoftMax function.
  • the layers represent the probability that the user is male and female.
  • the above mini batch gradient descent method is an optimization algorithm between the fastest gradient descent method and the stochastic gradient descent method, and each time a certain amount of training samples are selected for iteration.
  • the cross-entropy (crossentropy) is used as the loss function to update the network weights in reverse propagation. After the loss value is less than the preset threshold or the number of iterations reaches the set number of training iterations, the training can be ended.
  • the loss function is used to estimate the degree of inconsistency between the predicted value f(x) of the model and the true value Y. It is a non-negative real-valued function, usually using L(Y, f(x)), or L(w) indicates that the smaller the loss function, the better the robustness of the model.
  • the loss function is the core part of the empirical risk function and an important part of the structural risk function.
  • Step 103 Acquire the current multi-dimensional feature information of the electronic device of the unknown gender user and use it as a prediction sample.
  • the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female.
  • the following characteristics can be included in the forecast sample:
  • Step 104 Generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: a first probability that the current user is a male, and a second probability that the current user is a female.
  • a corresponding probability is output, and the BP neural network model outputs a first probability that the current user is a male and a second probability that the current user is a female.
  • one probability may be selected for two probabilities in the prediction probability, and then the gender of the current user is predicted based on the selected probability.
  • the method may further include:
  • the final prediction result is output.
  • the step of outputting the final prediction result may include:
  • the second predicted result that the current user is a female is output.
  • x) is not greater than P(Y 0
  • the training process of the predictive model can be completed on the server side or on the electronic device side.
  • the training process and the actual prediction process of the predictive model are completed on the server side
  • the multi-dimensional feature information of the unknown gender user using the electronic device can be input to the server, and after the actual prediction of the server is completed, the prediction will be performed.
  • the result is sent to the electronic device end, and finally the electronic device outputs the predicted result.
  • the multi-dimensional feature information can be input to the electronic device, and after the actual prediction of the electronic device is completed, the electronic device outputs the predicted result. .
  • the embodiment of the present application collects multi-dimensional feature information of a known gender user using the electronic device as a sample, constructs a sample set of gender prediction, and uses the sample set to train the BP neural network model to obtain a trained prediction model and obtain
  • the unknown gender user uses the multi-dimensional feature information of the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, the predicted probability includes: the first probability that the current user is a male, and the current user is a female Second probability.
  • the application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
  • FIG. 4 is another schematic flowchart of a user gender prediction method according to an embodiment of the present application, where the user gender prediction method includes:
  • Step 201 Collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction.
  • the multi-dimensional feature information of the electronic user using the electronic device has a dimension of a certain length, and the parameters in each dimension correspond to one feature information of the characterization application, that is, the multi-dimensional feature information is composed of a plurality of feature information.
  • the plurality of feature information may include feature information of the user in using different types of applications, such as the number and duration of browsing the male-type goods (such as men's clothing) in the shopping application, and the user browsing the female-oriented products in the shopping application (such as The number and duration of cosmetics and women's wear; the length of time users read a male novel in a reading application, the length of time a user reads a female novel, the length of time a user reads a sports news, the length of time a user reads a constellation news; the user uses a different type
  • the usage information in the application such as the number of times the user uses the front camera self-timer, the number of times the user uses the beauty software, and the number and duration of the user playing different types of games.
  • Step S202 normalizing the sample parameters in the sample set.
  • the feature set for each sample is recorded with a set of real numbers.
  • the sample values need to be normalized during sampling, such as normalization to a number between 0 and 1.
  • each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
  • Step S203 input the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results.
  • the network structure of the BP neural network model in the embodiment of the present application includes three layers, which are an input layer, a hidden layer, and an output layer, respectively, wherein the input layer inputs the feature information in step 101, and of course the feature information is normalized.
  • the feature information for example, the above 9 feature information is introduced into 9 input nodes, and the hidden layer may include three layers, which may be 10 nodes, 5 nodes, 2 nodes respectively, and finally obtain 2D output through the SoftMax function.
  • the layers represent the probability that the user is male and female. For all samples collected, they were sent to the network for training in batches using the mini batch gradient descent method.
  • the probability of obtaining two prediction results is calculated based on the first preset formula, wherein the first preset formula is:
  • Z K is the intermediate value and C is the number of categories of the predicted result.
  • C is the number of categories of the predicted result.
  • Step S204 the loss value is obtained according to the two prediction results and the probability corresponding thereto.
  • the loss value is obtained according to the two prediction results and the probability corresponding thereto according to the second preset formula, wherein the second preset formula is:
  • C is the number of categories of prediction results
  • y k is the true value
  • E is the average.
  • step S205 training is performed according to the loss value to generate a target model parameter.
  • the gradient descent method is used for training according to the loss value.
  • all the samples collected are sent to the network for training in a batch manner using a mini batch gradient descent method.
  • the above mini batch gradient descent method is an optimization algorithm between the fastest gradient descent method and the stochastic gradient descent method, and each time a certain amount of training samples are selected for iteration.
  • the cross-entropy (crossentropy) is used as the loss function to update the network weights in reverse propagation. After the loss value is less than the preset threshold or the number of iterations reaches the set number of training iterations, the training can be ended.
  • Step S206 Acquire multi-dimensional feature information of the electronic device of the unknown gender user and use it as a prediction sample.
  • the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female.
  • the following characteristics can be included in the forecast sample:
  • Step S207 generating a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability comprises: a first probability that the current user is a male, and a second probability that the current user is a female.
  • step S208 the first probability that the current user is a male is compared with the second probability that the current user is a female, and a comparison result is obtained.
  • Step S209 according to the comparison result, output the final prediction result.
  • the step of outputting the final prediction result may include:
  • the second predicted result that the current user is a female is output.
  • x) is not greater than P(Y 0
  • the embodiment of the present application collects the multi-dimensional feature information of the known gender user using the electronic device as a sample, constructs a sample set of the gender prediction, normalizes the sample parameters in the sample set, and normalizes the sample parameters.
  • the BP neural network model is input to obtain the probability of two prediction results, the loss value is obtained according to the two prediction results and the probability corresponding thereto, the training is performed according to the loss value, the target model parameter is generated, and the multi-dimensional use of the electronic device by the unknown gender user is obtained.
  • the feature information is used as a prediction sample, and the prediction probability is generated according to the prediction sample and the trained prediction model.
  • the prediction probability includes: a first probability that the current user is a male, and a second probability that the current user is a female, and the current user is a male. A probability is compared with a second probability that the current user is a woman, a comparison result is obtained, and a final prediction result is output according to the comparison result.
  • the application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
  • FIG. 5 is a schematic diagram of another application scenario of a user gender prediction apparatus according to an embodiment of the present application.
  • the training process of the predictive model is completed on the server side, and the actual prediction process of the predictive model is completed on the electronic device side
  • the optimized predictive model needs to be used
  • the current multidimensional feature information of the electronic device can be input to the electronic device, and the electronic device actually After the prediction is completed, the electronic device outputs the predicted result based on the predicted result.
  • the trained predictive model file model file
  • the smart device If it is necessary to determine the gender of the current user, update the current sample set, input the trained predictive model file (model file), and calculate You can get the predicted value.
  • the method may further include:
  • the preset time is obtained. If the current system time reaches the preset time, the current multi-dimensional feature information of the electronic device is obtained.
  • the preset time can be a time point in the day, such as 9 am, or several time points in the day, such as 9 am, 6 pm, and the like. It can also be one or several time points in multiple days. Then, the prediction result is generated according to the prediction model and the optimization parameter, and the application is controlled according to the prediction result.
  • FIG. 6 is a schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
  • the user gender prediction device 300 is applied to an electronic device, and the user gender prediction device 300 includes an acquisition module 301, a training module 302, an acquisition module 303, and a generation module 304.
  • the collecting module 301 is configured to collect multi-dimensional feature information of the known gender user using the electronic device as a sample, and construct a sample set of the gender prediction.
  • the sample set of gender prediction may include multiple training samples collected during the historical time period.
  • the historical time period can be, for example, the past 7 days, 10 days, and the like.
  • the unknown gender user collected at one time uses the multi-dimensional feature information of the electronic device to form a sample set.
  • the feature set for each sample is recorded with a set of real numbers.
  • the sample values need to be normalized during sampling, such as normalization to a number between 0 and 1.
  • each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
  • each sample in the sample set can be marked to obtain a sample label of each sample. Since the implementation is to predict the gender of the current user, the labeled sample label is the registered account of the user.
  • the actual gender information provided including gender males and gender females, can be represented by a code of 0 or 1.
  • the sample categories can include gender males and gender females.
  • the user may mark the historical usage habits of the application in the electronic device, for example, within a length of one week, the user browses the male-type goods (such as men's clothing) in the shopping application for 20 times, and the accumulated duration is one hour.
  • the user can read a female novel for 8 hours, which can be marked as a gender female.
  • the value "1" can be used to indicate “gender male”.
  • sample set of the above gender prediction is used to train the BP neural network model, for example, the sample set may include the following features:
  • the training module 302 is configured to train the BP neural network model by using the sample set to obtain the trained prediction model.
  • the sample parameters may be input into the BP neural network model to obtain the probability of two prediction results, and the probability of obtaining two prediction results based on the first preset formula, wherein the first preset formula is for:
  • Z K is the intermediate value and C is the number of categories of the predicted result.
  • C is the number of categories of the predicted result.
  • the loss value is obtained according to the two prediction results and the probability corresponding thereto, and the loss value is obtained according to the two prediction results and the probability corresponding thereto according to the second preset formula, wherein the second preset formula is:
  • C is the number of categories of prediction results
  • y k is the true value
  • E is the average. Then, according to the loss value, training is performed to generate the target model parameters.
  • the obtaining module 303 is configured to obtain multi-dimensional feature information of the electronic device that the unknown gender user uses and use as a prediction sample.
  • the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female.
  • the generating module 304 is configured to generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability comprises: a first probability that the current user is a male, and a second probability that the current user is a female.
  • x) is greater than P(Y 0
  • x), then the output is The current user is the first predicted result of the male; assuming P(Y 1
  • x) is not greater than P(Y 0
  • FIG. 7 is another schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
  • the training module 302 can specifically include a processing sub-module 3021 and a training sub-module 3022.
  • the processing sub-module 3021 is configured to perform normalization processing on the sample parameters in the sample set
  • the training sub-module 3022 is configured to train the BP neural network model according to the normalized sample parameters.
  • the training sub-module 3022 is specifically configured to input the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results, according to the two prediction results and corresponding The probability obtains a loss value, and is trained according to the loss value to generate a target model parameter.
  • the apparatus 300 may further include:
  • the comparing module 305 is configured to compare, after the generating module 304 generates the predicted probability according to the predicted sample and the trained predictive model, a first probability that the current user is a male and a second probability that the current user is a female, to obtain a comparison result;
  • the output module 306 is configured to output a final prediction result according to the comparison result.
  • the output module 306 is specifically configured to: when the first probability is greater than the second probability, output a first prediction result that the current user is a male, when the first probability is not greater than the second probability , output the second predicted result that the current user is a female.
  • the training process of the predictive model can be completed on the server side or on the electronic device side.
  • the training process and the actual prediction process of the predictive model are completed on the server side
  • the multi-dimensional feature information of the electronic device can be input to the server by the user of unknown gender, and after the actual prediction of the server is completed, The prediction result is sent to the electronic device end, and finally the electronic device outputs the predicted result.
  • the multi-dimensional feature information can be input to the electronic device, and after the actual prediction of the electronic device is completed, the electronic device outputs the predicted result. .
  • the user gender prediction apparatus in the embodiment of the present application constructs a sample set of gender prediction by collecting multi-dimensional feature information of a known gender user using the electronic device, and uses the sample set to train the BP neural network model to obtain
  • the predicted model after the training obtains multi-dimensional feature information of the electronic device of the unknown gender and uses the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: the first probability that the current user is a male And the second probability that the current user is a woman.
  • the application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
  • the user gender prediction apparatus belongs to the same concept as the user gender prediction method in the above embodiment, and any method provided in the embodiment of the user gender prediction method may be run on the user gender prediction apparatus, and the specific implementation process thereof For details, refer to the embodiment of the user gender prediction method, and details are not described herein again.
  • module as used herein may be taken to mean a software object that is executed on the computing system.
  • the different components, modules, engines, and services described herein can be viewed as implementation objects on the computing system.
  • the apparatus and method described herein may be implemented in software, and may of course be implemented in hardware, all of which are within the scope of the present application.
  • the embodiment of the present application further provides a storage medium on which a computer program is stored, and when the computer program runs on a computer, causes the computer to execute the user gender prediction method described above.
  • the embodiment of the present application further provides an electronic device, such as an electronic device such as a tablet computer or a mobile phone.
  • the processor in the electronic device loads the instructions corresponding to the process of one or more applications into the memory according to the following steps, and the processor runs the application stored in the memory to implement various functions:
  • a prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  • the processor when training the BP neural network model with the sample set, the processor is configured to perform the following steps:
  • the BP neural network model is trained according to the normalized sample parameters.
  • the processor when the BP neural network model is trained according to the normalized sample parameters, the processor is configured to perform the following steps:
  • Training is performed based on the loss value to generate a target model parameter.
  • the processor when training is performed according to the loss value, the processor is configured to perform the following steps:
  • Training is performed using the gradient descent method based on the loss value.
  • the processor is further configured to perform the following steps:
  • the final prediction result is output.
  • the processor when the final prediction result is output according to the comparison result, the processor is configured to perform the following steps:
  • references to "an embodiment” herein mean that a particular feature, structure, or characteristic described in connection with the embodiments can be included in at least one embodiment of the present application.
  • the appearances of the phrases in various places in the specification are not necessarily referring to the same embodiments, and are not exclusive or alternative embodiments that are mutually exclusive. Those skilled in the art will understand and implicitly understand that the embodiments described herein can be combined with other embodiments.
  • the electronic device 400 includes a processor 401 and a memory 402.
  • the processor 401 is electrically connected to the memory 402.
  • the processor 400 is a control center of the electronic device 400 that connects various portions of the entire electronic device using various interfaces and lines, executes the electronic by running or loading a computer program stored in the memory 402, and recalling data stored in the memory 402.
  • the memory 402 can be used to store software programs and modules, and the processor 401 executes various functional applications and data processing by running computer programs and modules stored in the memory 402.
  • the memory 402 can mainly include a storage program area and a storage data area, wherein the storage program area can store an operating system, a computer program required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area can be stored according to Data created by the use of electronic devices, etc.
  • memory 402 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, memory 402 can also include a memory controller to provide processor 401 access to memory 402.
  • the processor 401 in the electronic device 400 loads the instructions corresponding to the process of one or more computer programs into the memory 402 according to the following steps, and is stored in the memory 402 by the processor 401.
  • the computer program in which to implement various functions, as follows:
  • the multi-dimensional feature information of the known gender users using the electronic device is collected as a sample, the sample set of gender prediction is constructed, and the BP neural network model is trained by using the sample set to obtain the predicted model after training, and the multi-dimensional use of the electronic device by the unknown gender user is obtained.
  • the feature information is used as a prediction sample, and a prediction probability is generated according to the prediction sample and the trained prediction model, the prediction probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  • the application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
  • the electronic device 400 may further include: a display 403, a radio frequency circuit 404, an audio circuit 405, and a power source 406.
  • the display 403, the radio frequency circuit 404, the audio circuit 405, and the power source 406 are electrically connected to the processor 401, respectively.
  • Display 403 can be used to display information entered by the user or information provided to the user, as well as various graphical user interfaces, which can be comprised of graphics, text, icons, video, and any combination thereof.
  • the display 403 can include a display panel.
  • the display panel can be configured in the form of a liquid crystal display (LCD), or an organic light-emitting diode (OLED).
  • LCD liquid crystal display
  • OLED organic light-emitting diode
  • the radio frequency circuit 404 can be used to transmit and receive radio frequency signals to establish wireless communication with network devices or other electronic devices through wireless communication, and to transmit and receive signals with network devices or other electronic devices.
  • the audio circuit 405 can be used to provide an audio interface between the user and the electronic device through the speaker and the microphone.
  • Power source 406 can be used to power various components of electronic device 400.
  • the power supply 406 can be logically coupled to the processor 401 through a power management system to enable functions such as managing charging, discharging, and power management through the power management system.
  • the electronic device 400 may further include a camera, a Bluetooth module, and the like, and details are not described herein.
  • the storage medium may be a magnetic disk, an optical disk, a read only memory (ROM), or a random access memory (RAM).
  • ROM read only memory
  • RAM random access memory
  • the user gender prediction method in the embodiment of the present application a general tester in the field can understand all or part of the process of implementing the user gender prediction method in the embodiment of the present application, and the related hardware can be controlled by a computer program.
  • the computer program can be stored in a computer readable storage medium, such as in a memory of the electronic device, and executed by at least one processor within the electronic device, and can include, for example, a user gender prediction method during execution.
  • the storage medium may be a magnetic disk, an optical disk, a read only memory, a random access memory, or the like.
  • each functional module may be integrated into one processing chip, or each module may exist physically separately, or two or more modules may be integrated into one module.
  • the above integrated modules can be implemented in the form of hardware or in the form of software functional modules.
  • An integrated module, if implemented in the form of a software functional module and sold or used as a standalone product, may also be stored in a computer readable storage medium such as a read only memory, a magnetic disk or an optical disk.

Abstract

Disclosed is a user gender prediction method, comprising: collecting multi-dimensional feature information of a user whose gender is already known using an electronic device so as to construct a sample set; training a BP neural network model by using the sample set to obtain a prediction model; obtaining multi-dimensional feature information of a user whose gender is unknown using the electronic device as a prediction sample; and generating a prediction probability according to the prediction sample and the prediction model. The present application further provides a user gender prediction apparatus, a storage medium, and an electronic device.

Description

用户性别预测方法、装置、存储介质及电子设备User gender prediction method, device, storage medium and electronic device
本申请要求于2017年12月20日提交中国专利局、申请号201711387919.0、发明名称为“用户性别预测方法、装置、存储介质及电子设备”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of the Chinese Patent Application filed on Dec. 20, 2017, the Chinese Patent Office, Application No. 201711387919.0, entitled "User Gender Prediction Method, Apparatus, Storage Media, and Electronic Equipment", the entire contents of which are incorporated by reference. In this application.
技术领域Technical field
本申请属于通信技术领域,尤其涉及一种用户性别预测方法、装置、存储介质及电子设备。The present application belongs to the field of communications technologies, and in particular, to a user gender prediction method, apparatus, storage medium, and electronic device.
背景技术Background technique
随着电子设备技术的发展,电子设备已经开始从以前简单地提供通话设备渐渐变成一个通用软件运行的平台。该平台不再以提供通话管理为主要目的,而是提供一个包括通话管理、游戏娱乐、办公记事、移动支付等各类应用软件在内的运行环境,随着大量的普及,已经深入至人们的生活、工作的方方面面。With the development of electronic device technology, electronic devices have begun to gradually become a platform for running general-purpose software from a simple provision of a call device. The platform is no longer to provide call management as the main purpose, but to provide an operating environment including call management, game entertainment, office notes, mobile payment and other application software, with a large number of popularization, has penetrated into people's Every aspect of life and work.
用户画像是近年来非常热门的一个研究方向。比如在智能手机上,如果有一种方法能够从用户行为习惯上准确地判断出用户的性别,从而对手机进行各方面的深度优化是非常有意义的。User portraits are a very popular research direction in recent years. For example, on a smart phone, if there is a way to accurately determine the user's gender from the user's behavioral habits, it is very meaningful to optimize the mobile phone in all aspects.
发明内容Summary of the invention
本申请提供一种用户性别预测方法、装置、存储介质及电子设备,能够基于BP神经网络来预测用户的性别。The present application provides a user gender prediction method, apparatus, storage medium, and electronic device capable of predicting a user's gender based on a BP neural network.
第一方面,本申请实施例提供一种用户性别预测方法,包括:In a first aspect, an embodiment of the present application provides a user gender prediction method, including:
采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;Collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, and constructing a sample set of gender prediction;
利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;Using the sample set to train the BP neural network model to obtain a trained prediction model;
获取未知性别用户使用电子设备的多维特征信息并作为预测样本;Obtaining multi-dimensional feature information of an electronic device that is used by an unknown gender user as a prediction sample;
根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。A prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
第二方面,本申请实施例提供一种用户性别预测装置,包括:In a second aspect, the embodiment of the present application provides a user gender prediction apparatus, including:
采集模块,用于采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;An acquisition module is configured to collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction;
训练模块,用于利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;a training module, configured to train the BP neural network model by using the sample set to obtain a trained prediction model;
获取模块,用于获取未知性别用户使用电子设备的多维特征信息并作为预测样本;An obtaining module, configured to obtain multi-dimensional feature information of an electronic device used by an unknown gender user as a prediction sample;
生成模块,用于根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。And a generating module, configured to generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: a first probability that the current user is a male, and a second probability that the current user is a female.
第三方面,本申请实施例提供一种存储介质,其上存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机执行上述的用户性别预测方法。In a third aspect, an embodiment of the present application provides a storage medium on which a computer program is stored, and when the computer program runs on a computer, causes the computer to execute the user gender prediction method described above.
第四方面,本申请实施例提供一种电子设备,包括处理器和存储器,所述存储器存储有多条指令,所述处理器加载所述存储器中的指令用于执行以下步骤:In a fourth aspect, an embodiment of the present application provides an electronic device, including a processor and a memory, where the memory stores a plurality of instructions, and the processor loads the instructions in the memory to perform the following steps:
采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;Collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, and constructing a sample set of gender prediction;
利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;Using the sample set to train the BP neural network model to obtain a trained prediction model;
获取未知性别用户使用电子设备的多维特征信息并作为预测样本;Obtaining multi-dimensional feature information of an electronic device that is used by an unknown gender user as a prediction sample;
根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。A prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
附图说明DRAWINGS
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍。显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the technical solutions in the embodiments of the present application, the drawings used in the description of the embodiments will be briefly described below. Obviously, the drawings in the following description are only some embodiments of the present application, and those skilled in the art can obtain other drawings according to the drawings without any creative work.
图1为本申请实施例提供的用户性别预测装置的系统示意图。FIG. 1 is a schematic diagram of a system for a user gender prediction apparatus according to an embodiment of the present application.
图2为本申请实施例提供的用户性别预测装置的应用场景示意图。FIG. 2 is a schematic diagram of an application scenario of a user gender prediction apparatus according to an embodiment of the present application.
图3为本申请实施例提供的用户性别预测方法的流程示意图。FIG. 3 is a schematic flowchart of a user gender prediction method according to an embodiment of the present application.
图4为本申请实施例提供的用户性别预测方法的另一流程示意图。FIG. 4 is another schematic flowchart of a user gender prediction method according to an embodiment of the present application.
图5为本申请实施例提供的用户性别预测装置的另一应用场景示意图。FIG. 5 is a schematic diagram of another application scenario of a user gender prediction apparatus according to an embodiment of the present application.
图6为本申请实施例提供的用户性别预测装置的结构示意图。FIG. 6 is a schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
图7为本申请实施例提供的用户性别预测装置的另一结构示意图。FIG. 7 is another schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application.
图8为本申请实施例提供的电子设备的结构示意图。FIG. 8 is a schematic structural diagram of an electronic device according to an embodiment of the present application.
图9为本申请实施例提供的电子设备的另一结构示意图。FIG. 9 is another schematic structural diagram of an electronic device according to an embodiment of the present application.
具体实施方式Detailed ways
请参照图式,其中相同的组件符号代表相同的组件,本申请的原理是以实施在一适当的运算环境中来举例说明。以下的说明是基于所例示的本申请具体实施例,其不应被视为限制本申请未在此详述的其它具体实施例。Referring to the drawings, wherein like reference numerals represent the same components, the principles of the present application are illustrated by the implementation in a suitable computing environment. The following description is based on the specific embodiments of the present invention as illustrated, and should not be construed as limiting the specific embodiments that are not described herein.
在以下的说明中,本申请的具体实施例将参考由一部或多部计算机所执行的步骤及符号来说明,除非另有述明。因此,这些步骤及操作将有数次提到由计算机执行,本文所指的计算机执行包括了由代表了以一结构化型式中的数据的电子信号的计算机处理单元的操作。此操作转换该数据或将其维持在该计算机的内存系统中的位置处,其可重新配置或另外以本领域测试人员所熟知的方式来改变该计算机的运作。该数据所维持的数据结构为该内存的实体位置,其具有由该数据格式所定义的特定特性。但是,本申请原理以上述文字来说明,其并不代表为一种限制,本领域测试人员将可了解到以下所述的多种步骤及操作亦可实施在硬件当中。In the following description, specific embodiments of the present application will be described with reference to the steps and symbols executed by one or more computers, unless otherwise stated. Thus, these steps and operations will be referred to several times by a computer, and the computer execution referred to herein includes the operation of a computer processing unit that represents an electronic signal in data in a structured version. This operation converts the data or maintains it at a location in the computer's memory system, which can be reconfigured or otherwise altered in a manner well known to those skilled in the art. The data structure maintained by the data is the physical location of the memory, which has specific characteristics defined by the data format. However, the principles of the present application are described in the above text, which is not intended to be a limitation, and those skilled in the art will appreciate that the various steps and operations described below can also be implemented in hardware.
本申请中的术语“第一”、“第二”和“第三”等是用于区别不同对象,而不是用于描述特定顺序。此外,术语“包括”和“具有”以及它们任何变形,意图在于覆盖不排他的包含。例如包含了一系列步骤或模块的过程、方法、系统、产品或设备没有限定于已列出的步骤或模块,而是某些实施例还包括没有列出的步骤或模块,或某些实施例还包括对于这些过程、方法、产品或设备固有的其它步骤或模块。The terms "first," "second," and "third," etc. in this application are used to distinguish different objects, and are not intended to describe a particular order. Furthermore, the terms "comprises" and "comprising" and "comprising" are intended to cover a non-exclusive inclusion. For example, a process, method, system, product, or device that comprises a series of steps or modules is not limited to the listed steps or modules, but some embodiments also include steps or modules not listed, or some embodiments Other steps or modules inherent to these processes, methods, products or devices are also included.
当今当流的厂商的电子设备系统里都会让用户注册绑定设备与用户帐号。但不是每个用户都愿意提供性别信息。因此这一方法不能解决一大部分用户画像的问题。其中,该电子设备可以是智能手机、平板电脑、台式电脑、笔记本电脑、或者掌上电脑等设备。In today's streaming electronic device systems, users are allowed to register binding devices and user accounts. But not every user is willing to provide gender information. Therefore, this method cannot solve the problem of a large part of the user's portrait. The electronic device may be a smart phone, a tablet computer, a desktop computer, a notebook computer, or a handheld computer.
请参阅图1,图1为本申请实施例提供的用户性别预测装置的系统示意图。该用户性别预测装置主要用于:采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据预测样本和训练后的预测模型生成预测概率,预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。Please refer to FIG. 1. FIG. 1 is a schematic diagram of a system for predicting a user gender according to an embodiment of the present application. The user gender prediction device is mainly used for collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, constructing a sample set of gender prediction, and training the BP neural network model by using the sample set to obtain a trained prediction model. Obtaining multi-dimensional feature information of the unknown gender user using the electronic device as a prediction sample, and generating a prediction probability according to the prediction sample and the trained prediction model, the prediction probability includes: a first probability that the current user is a male, and a second probability that the current user is a female Probability.
具体的,请参阅图2,图2为本申请实施例提供的用户性别预测装置的应用场景示意图。比如,用户性别预测装置在接收到预测请求时,采集未知性别用户使用电子设备的多维特征信息。其中上述多维特征信息可以以一周为时间长度,获取电子设备中的历史使用信息,从历史使用信息当中采集多维特征信息。将上述多维特征信息输入至预测模型,通过预测模型对当前用户为男性概率进行预测,得到第一概率,通过预测模型对当前用户为女性概率进行预测,得到第二概率,最终根据第一概率和第二概率输入最终的预测结果。Specifically, please refer to FIG. 2 , which is a schematic diagram of an application scenario of a user gender prediction apparatus according to an embodiment of the present application. For example, when receiving the prediction request, the user gender prediction device collects multi-dimensional feature information of the electronic device that the unknown gender user uses. The multi-dimensional feature information may be used to acquire historical usage information in the electronic device and collect multi-dimensional feature information from the historical usage information. The multi-dimensional feature information is input to the prediction model, and the current user is predicted by the prediction model to obtain the first probability. The prediction model is used to predict the current user's female probability, and the second probability is obtained, and finally according to the first probability sum. The second probability is entered into the final predicted result.
本申请实施例提供一种用户性别预测方法,该用户性别预测方法的执行主体可以是本申请实施例提供的用户性别预测装置,或者集成了该用户性别预测装置的电子设备,其中该用户性别预测装置可以采用硬件或者软件的方式实现。The embodiment of the present application provides a user gender prediction method, and the execution subject of the user gender prediction method may be the user gender prediction device provided by the embodiment of the present application, or an electronic device integrated with the user gender prediction device, wherein the user gender prediction The device can be implemented in hardware or software.
本申请实施例将从用户性别预测装置的角度进行描述,该用户性别预测装置具体可以集成在电子设备中。该用户性别预测方法包括:采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据该预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。The embodiment of the present application will be described from the perspective of a user gender prediction device, which may be specifically integrated in an electronic device. The user gender prediction method includes: collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, constructing a sample set of gender prediction, and training the BP neural network model by using the sample set to obtain a trained prediction model and acquiring an unknown. The gender user uses the multi-dimensional feature information of the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: the first probability that the current user is a male, and the current user is a female Two probability.
一实施例中,利用所述样本集对BP神经网络模型进行训练的步骤,包括:In an embodiment, the step of training the BP neural network model by using the sample set comprises:
对所述样本集中的样本参数进行归一化处理;Normalizing the sample parameters in the sample set;
根据归一化之后的样本参数对所述BP神经网络模型进行训练。The BP neural network model is trained according to the normalized sample parameters.
一实施例中,根据归一化之后的样本参数对所述BP神经网络模型进行训练的步骤,包括:In an embodiment, the step of training the BP neural network model according to the normalized sample parameters includes:
将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率;Entering the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results;
根据所述两个预测结果和与其对应的概率得到损失值;Obtaining a loss value according to the two prediction results and a probability corresponding thereto;
根据所述损失值进行训练,生成目标模型参数。Training is performed based on the loss value to generate a target model parameter.
一实施例中,所述根据所述损失值进行训练的步骤,包括:In an embodiment, the step of training according to the loss value includes:
根据所述损失值利用梯度下降法进行训练。Training is performed using the gradient descent method based on the loss value.
一实施例中,在根据所述预测样本和训练后的预测模型生成预测概率之后,所述方法还包括:In an embodiment, after generating the prediction probability according to the predicted sample and the trained prediction model, the method further includes:
对所述当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;Comparing a first probability that the current user is a male with a second probability that the current user is a female, and obtaining a comparison result;
根据所述比较结果,输出最终的预测结果。Based on the comparison result, the final prediction result is output.
一实施例中,所述根据所述比较结果,输出最终的预测结果的步骤,包括:In an embodiment, the step of outputting the final prediction result according to the comparison result includes:
当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, outputting a second prediction result that the current user is a female.
一实施例中,将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率的步骤,包括:In one embodiment, the step of inputting the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results includes:
基于第一预设公式计算以得到两个预测结果的概率,其中所述第一预设公式为:Calculating a probability based on a first preset formula to obtain two prediction results, wherein the first preset formula is:
Figure PCTCN2018116481-appb-000001
Figure PCTCN2018116481-appb-000001
其中,Z K为中间值,C为预测结果的类别数,
Figure PCTCN2018116481-appb-000002
为第j个中间值,基于所述第一预设公式得到两维的输出层,分别代表所述样本为男性和女性的概率。
Where Z K is the intermediate value and C is the number of categories of the predicted result.
Figure PCTCN2018116481-appb-000002
For the jth intermediate value, a two-dimensional output layer is obtained based on the first preset formula, respectively representing the probability that the sample is male and female.
一实施例中,根据所述两个预测结果和与其对应的概率得到损失值的步骤,包括:In an embodiment, the step of obtaining a loss value according to the two prediction results and the probability corresponding thereto includes:
基于第二预设公式根据所述两个预测结果和与其对应的概率得到损失值,其中所述第二预设公式为:And obtaining, according to the second preset formula, a loss value according to the two prediction results and a probability corresponding thereto, wherein the second preset formula is:
Figure PCTCN2018116481-appb-000003
Figure PCTCN2018116481-appb-000003
其中C为预测结果的类别数,y k为真实值,E为平均值。 Where C is the number of categories of prediction results, y k is the true value, and E is the average.
请参阅图3,图3为本申请实施例提供的用户性别预测方法的流程示意图。本申请实施例提供的用户性别预测方法应用于电子设备,具体流程可以如下:Please refer to FIG. 3. FIG. 3 is a schematic flowchart diagram of a user gender prediction method according to an embodiment of the present application. The user gender prediction method provided by the embodiment of the present application is applied to an electronic device, and the specific process may be as follows:
步骤101,采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集。Step 101: Collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction.
已知性别用户使用电子设备的多维特征信息具有一定长度的维度,其每个维度上的参数均对应表征应用的一种特征信息,即该多维特征信息由多个特征信息构成。该多个特征信息可以包括用户在使用不同类型应用程序中的特征信息,比如用户在购物应用中浏览偏 男性类商品(如男装)次数与时长,用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;用户在阅读应用中阅读偏男性类小说的时长,用户阅读偏女性类小说的时长,用户阅读体育类新闻的时长,用户阅读星座类新闻的时长;用户使用不同类型的应用程序中的使用信息,比如用户使用前置摄像头自拍的次数,用户使用美颜类软件的次数,用户玩不同类别游戏的次数与时长。It is known that the multi-dimensional feature information of the electronic user using the electronic device has a dimension of a certain length, and the parameters in each dimension correspond to one feature information of the characterization application, that is, the multi-dimensional feature information is composed of a plurality of feature information. The plurality of feature information may include feature information of the user in using different types of applications, such as the number and duration of browsing the male-type goods (such as men's clothing) in the shopping application, and the user browsing the female-oriented products in the shopping application (such as The number and duration of cosmetics and women's wear; the length of time users read a male novel in a reading application, the length of time a user reads a female novel, the length of time a user reads a sports news, the length of time a user reads a constellation news; the user uses a different type The usage information in the application, such as the number of times the user uses the front camera self-timer, the number of times the user uses the beauty software, and the number and duration of the user playing different types of games.
性别预测的样本集中,可以包括在历史时间段内采集的多个训练样本。历史时间段,例如可以是过去7天、10天等。可以理解的是,一次采集的已知性别用户使用电子设备的多维特征数据构成一个样本集。对于每个样本的特征集,均用一组实数予以记录。在采样时需要对样本的数值进行归一化处理,如归一化成0~1之间的数。这样每个用户是一个样本,N个用户组成了N个样本,每个样本的特征是(x 1,x 2,...,x n)。 The sample set of gender predictions may include multiple training samples collected during the historical time period. The historical time period can be, for example, the past 7 days, 10 days, and the like. It can be understood that the known gender users collected at one time use the multi-dimensional feature data of the electronic device to form a sample set. The feature set for each sample is recorded with a set of real numbers. The sample values need to be normalized during sampling, such as normalization to a number between 0 and 1. Thus each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
在构成样本集之后,可以对样本集中的每个样本进行标记,得到每个样本的样本标签,由于本实施要实现的是预测当前用户的性别,因此,所标记的样本标签是该用户注册帐号时实际提供的性别信息,包括性别男和性别女,可以用编码为0或者1来表示,此时,样本类别可以包括性别男和性别女。具体可根据用户对电子设备中应用的历史使用习惯进行标记,例如:在一周的时间长度之内用户在购物应用中浏览偏男性类商品(如男装)次数为20次、累计时长为一小时,则可以标记为性别男;再例如,在一周的时间长度之内用户阅读偏女性类小说的时长为8个小时,则可以标记为性别女,具体地,可以用数值“1”表示“性别男”,用数值“0”表示“性别女”,反之亦可。After constituting the sample set, each sample in the sample set can be marked to obtain a sample label of each sample. Since the implementation is to predict the gender of the current user, the labeled sample label is the registered account of the user. The actual gender information provided, including gender males and gender females, can be represented by a code of 0 or 1. In this case, the sample categories can include gender males and gender females. Specifically, the user may mark the historical usage habits of the application in the electronic device, for example, within a length of one week, the user browses the male-type goods (such as men's clothing) in the shopping application for 20 times, and the accumulated duration is one hour. It can be marked as a gender male; for example, within a period of one week, the user can read a female novel for 8 hours, which can be marked as a gender female. Specifically, the value "1" can be used to indicate "gender male". ", use the value "0" to indicate "gender female", and vice versa.
其中,上述性别预测的样本集用于对BP神经网络模型进行训练,例如样本集中可以包括以下特征:Wherein, the sample set of the above gender prediction is used to train the BP neural network model, for example, the sample set may include the following features:
用户在购物应用中浏览偏男性类商品(如男装)次数与时长;The number and duration of browsing of male-type items (such as men's clothing) in the shopping app;
用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;The number and duration of browsing of female-type items (such as cosmetics and women's clothing) in the shopping application;
用户阅读偏男性类小说的时长;The length of time users read a male-like novel;
用户阅读偏女性类小说的时长;The length of time users read a female novel;
用户阅读体育类新闻的时长;The length of time users read sports news;
用户阅读星座类新闻的时长;The length of time users read the news of the constellation;
用户使用前置摄像头自拍的次数;The number of times the user used the front camera to take a self-portrait;
用户使用美颜类软件的次数;The number of times the user uses the beauty software;
用户玩不同类别游戏的次数与时长。The number and duration of users playing different types of games.
步骤102,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型。In step 102, the BP neural network model is trained by using the sample set to obtain the trained prediction model.
其中,BP神经网络模型是机器学习中的一种分类模型,它的基本思想是梯度下降法,利用梯度搜索技术,以期使网络的实际输出值和期望输出值的误差均方差为最小。Among them, BP neural network model is a classification model in machine learning. Its basic idea is gradient descent method, which uses gradient search technology to minimize the error mean square error between the actual output value of the network and the expected output value.
基本BP算法包括信号的前向传播和误差的反向传播两个过程。即计算误差输出时按从输入到输出的方向进行,而调整权值和阈值则从输出到输入的方向进行。正向传播时, 输入信号通过隐含层作用于输出节点,经过非线性变换,产生输出信号,若实际输出与期望输出不相符,则转入误差的反向传播过程。误差反传是将输出误差通过隐含层向输入层逐层反传,并将误差分摊给各层所有单元,以从各层获得的误差信号作为调整各单元权值的依据。通过调整输入节点与隐层节点的联接强度和隐层节点与输出节点的联接强度以及阈值,使误差沿梯度方向下降,经过反复学习训练,确定与最小误差相对应的网络参数(权值和阈值),训练即告停止。此时经过训练的神经网络即能对类似样本的输入信息,自行处理输出误差最小的经过非线形转换的信息。The basic BP algorithm includes two processes of forward propagation of signals and back propagation of errors. That is, the calculation of the error output is performed in the direction from the input to the output, and the adjustment weight and the threshold are performed from the output to the input. In the case of forward propagation, the input signal acts on the output node through the hidden layer and undergoes a nonlinear transformation to produce an output signal. If the actual output does not match the expected output, the error propagates back into the error propagation process. Error back propagation is to pass the output error back to the input layer through the hidden layer, and distribute the error to all the units in each layer, so as to adjust the error value of each unit as the basis for adjusting the weight of each unit. By adjusting the connection strength between the input node and the hidden layer node and the connection strength and threshold of the hidden layer node and the output node, the error is decreased along the gradient direction, and after repeated learning training, the network parameters (weights and thresholds corresponding to the minimum error) are determined. ), the training will stop. At this time, the trained neural network can process the non-linear conversion information with the smallest output error for the input information of similar samples.
在本申请实施例当中,可以利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型。其中,对BP神经网络模型进行训练指的利用训练集求解BP神经网络模型中的模型参数。In the embodiment of the present application, the BP neural network model can be trained by using the sample set to obtain the trained prediction model. Among them, the training model of the BP neural network model is used to solve the model parameters in the BP neural network model.
本申请实施例中的BP神经网络模型的网络结构包括三层,分别为输入层、隐藏层以及输出层,其中,输入层输入步骤101中的特征信息,当然该特征信息为经过归一化处理后的特征信息,例如将上述的9个特征信息引入9个输入节点,隐藏层可以包含三层,可以分别为10个节点、5个节点、2个节点,最后通过SoftMax函数得到2维的输出层,分别代表的是用户为男性和女性的概率。The network structure of the BP neural network model in the embodiment of the present application includes three layers, which are an input layer, a hidden layer, and an output layer, respectively, wherein the input layer inputs the feature information in step 101, and of course the feature information is normalized. After the feature information, for example, the above 9 feature information is introduced into 9 input nodes, and the hidden layer may include three layers, which may be 10 nodes, 5 nodes, 2 nodes respectively, and finally obtain 2D output through the SoftMax function. The layers represent the probability that the user is male and female.
对于收集到的所有样本,采用mini batch梯度下降法的方式批量送入网络进行训练。其中上述mini batch梯度下降法是介于最快梯度下降法和随机梯度下降法之间的一种优化算法,每次选取一定量的训练样本进行迭代。采用交叉熵(crossentropy)做损失函数以反向传播更新网络权值,满足损失值小于预设阈值或是迭代次数达到设定的训练迭代次数后,可以结束训练。For all samples collected, they were sent to the network for training in batches using the mini batch gradient descent method. The above mini batch gradient descent method is an optimization algorithm between the fastest gradient descent method and the stochastic gradient descent method, and each time a certain amount of training samples are selected for iteration. The cross-entropy (crossentropy) is used as the loss function to update the network weights in reverse propagation. After the loss value is less than the preset threshold or the number of iterations reaches the set number of training iterations, the training can be ended.
其中,损失函数(loss function)是用来估量模型的预测值f(x)与真实值Y的不一致程度,它是一个非负实值函数,通常使用L(Y,f(x)),或者L(w)来表示,损失函数越小,模型的鲁棒性就越好。损失函数是经验风险函数的核心部分,也是结构风险函数重要组成部分。Wherein, the loss function is used to estimate the degree of inconsistency between the predicted value f(x) of the model and the true value Y. It is a non-negative real-valued function, usually using L(Y, f(x)), or L(w) indicates that the smaller the loss function, the better the robustness of the model. The loss function is the core part of the empirical risk function and an important part of the structural risk function.
步骤103,获取未知性别用户使用电子设备当前的多维特征信息并作为预测样本。Step 103: Acquire the current multi-dimensional feature information of the electronic device of the unknown gender user and use it as a prediction sample.
例如,统计一周内前文提到的用户特征值,得到用户特征向量,将归一化后的特征值输入网络中,做一次前向计算,网络的输出为该用户性别是男或女的概率,预测样本中可以包括以下特征:For example, the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female. The following characteristics can be included in the forecast sample:
用户在购物应用中浏览偏男性类商品(如男装)次数与时长;The number and duration of browsing of male-type items (such as men's clothing) in the shopping app;
用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;The number and duration of browsing of female-type items (such as cosmetics and women's clothing) in the shopping application;
用户阅读偏男性类小说的时长;The length of time users read a male-like novel;
用户阅读偏女性类小说的时长;The length of time users read a female novel;
用户阅读体育类新闻的时长;The length of time users read sports news;
用户阅读星座类新闻的时长;The length of time users read the news of the constellation;
用户使用前置摄像头自拍的次数;The number of times the user used the front camera to take a self-portrait;
用户使用美颜类软件的次数;The number of times the user uses the beauty software;
用户玩不同类别游戏的次数与时长。The number and duration of users playing different types of games.
步骤104,根据预测样本和训练后的预测模型生成预测概率,预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。Step 104: Generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: a first probability that the current user is a male, and a second probability that the current user is a female.
根据预测集机器及其对应的训练后的BP神经网络模型,输出相应的概率,该BP神经网络模型输出一个当前用户为男性的第一概率、和当前用户为女性的第二概率。According to the prediction set machine and its corresponding trained BP neural network model, a corresponding probability is output, and the BP neural network model outputs a first probability that the current user is a male and a second probability that the current user is a female.
在一实施例中,为了简化应用预测运行,提升预测速度,可以针对预测概率中两个概率选取一个概率,然后,基于选取的概率来预测当前用户的性别。比如,在根据预测样本和训练后的预测模型生成预测概率之后,该方法还可以包括:In an embodiment, in order to simplify the application prediction operation and increase the prediction speed, one probability may be selected for two probabilities in the prediction probability, and then the gender of the current user is predicted based on the selected probability. For example, after generating the prediction probability according to the prediction sample and the trained prediction model, the method may further include:
对当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;Comparing the first probability that the current user is a male with the second probability that the current user is a female, and obtaining a comparison result;
根据比较结果,输出最终的预测结果。According to the comparison result, the final prediction result is output.
在一实施例中,根据比较结果,输出最终的预测结果的步骤可以包括:In an embodiment, according to the comparison result, the step of outputting the final prediction result may include:
当第一概率大于第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
当第一概率不大于第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, the second predicted result that the current user is a female is output.
例如,对于某个预测概率,如果Y=1表示当前用户为男性、Y=0表示当前用户为女性,假设P(Y=1|x)大于P(Y=0|x),此时,输出当前用户为男性的第一预测结果;假设P(Y=1|x)不大于P(Y=0|x),此时,输出当前用户为女性的第二预测结果。For example, for a certain prediction probability, if Y=1 indicates that the current user is male, Y=0 indicates that the current user is female, and if P(Y=1|x) is greater than P(Y=0|x), then the output is The current user is the first predicted result of the male; assuming P(Y=1|x) is not greater than P(Y=0|x), at this time, the second predicted result that the current user is a female is output.
需要说明的是,预测模型的训练过程可以在服务器端也可以在电子设备端完成。当预测模型的训练过程、实际预测过程都在服务器端完成时,需要使用训练后的预测模型时,可以将未知性别用户使用电子设备的多维特征信息输入到服务器,服务器实际预测完成后,将预测结果发送至电子设备端,最后由电子设备输出预测结果。It should be noted that the training process of the predictive model can be completed on the server side or on the electronic device side. When the training process and the actual prediction process of the predictive model are completed on the server side, when the trained predictive model is needed, the multi-dimensional feature information of the unknown gender user using the electronic device can be input to the server, and after the actual prediction of the server is completed, the prediction will be performed. The result is sent to the electronic device end, and finally the electronic device outputs the predicted result.
当预测模型的训练过程、实际预测过程都在电子设备端完成时,需要使用训练后的预测模型时,可以将的多维特征信息输入到电子设备,电子设备实际预测完成后,电子设备输出预测结果。When the training process and the actual prediction process of the predictive model are completed on the electronic device end, when the trained predictive model is needed, the multi-dimensional feature information can be input to the electronic device, and after the actual prediction of the electronic device is completed, the electronic device outputs the predicted result. .
由上可知,本申请实施例采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据该预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。本申请可以基于BP神经网络模型对当前用户的性别进行预测,提高了性别预测的准确性,方便电子设备完成用户的精准画像。As can be seen from the above, the embodiment of the present application collects multi-dimensional feature information of a known gender user using the electronic device as a sample, constructs a sample set of gender prediction, and uses the sample set to train the BP neural network model to obtain a trained prediction model and obtain The unknown gender user uses the multi-dimensional feature information of the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, the predicted probability includes: the first probability that the current user is a male, and the current user is a female Second probability. The application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
下面将在上述实施例描述的方法基础上,对本申请的清理方法做进一步介绍。,参阅图4,图4为本申请实施例提供的用户性别预测方法的另一流程示意图,该用户性别预测方法包括:The cleaning method of the present application will be further described below based on the method described in the above embodiments. Referring to FIG. 4, FIG. 4 is another schematic flowchart of a user gender prediction method according to an embodiment of the present application, where the user gender prediction method includes:
步骤201,采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集。Step 201: Collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction.
已知性别用户使用电子设备的多维特征信息具有一定长度的维度,其每个维度上的参数均对应表征应用的一种特征信息,即该多维特征信息由多个特征信息构成。该多个特征信息可以包括用户在使用不同类型应用程序中的特征信息,比如用户在购物应用中浏览偏男性类商品(如男装)次数与时长,用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;用户在阅读应用中阅读偏男性类小说的时长,用户阅读偏女性类小说的时长,用户阅读体育类新闻的时长,用户阅读星座类新闻的时长;用户使用不同类型的应用程序中的使用信息,比如用户使用前置摄像头自拍的次数,用户使用美颜类软件的次数,用户玩不同类别游戏的次数与时长。It is known that the multi-dimensional feature information of the electronic user using the electronic device has a dimension of a certain length, and the parameters in each dimension correspond to one feature information of the characterization application, that is, the multi-dimensional feature information is composed of a plurality of feature information. The plurality of feature information may include feature information of the user in using different types of applications, such as the number and duration of browsing the male-type goods (such as men's clothing) in the shopping application, and the user browsing the female-oriented products in the shopping application (such as The number and duration of cosmetics and women's wear; the length of time users read a male novel in a reading application, the length of time a user reads a female novel, the length of time a user reads a sports news, the length of time a user reads a constellation news; the user uses a different type The usage information in the application, such as the number of times the user uses the front camera self-timer, the number of times the user uses the beauty software, and the number and duration of the user playing different types of games.
步骤S202,对样本集中的样本参数进行归一化处理。Step S202, normalizing the sample parameters in the sample set.
对于每个样本的特征集,均用一组实数予以记录。在采样时需要对样本的数值进行归一化处理,如归一化成0~1之间的数。这样每个用户是一个样本,N个用户组成了N个样本,每个样本的特征是(x 1,x 2,...,x n)。 The feature set for each sample is recorded with a set of real numbers. The sample values need to be normalized during sampling, such as normalization to a number between 0 and 1. Thus each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
步骤S203,将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率。Step S203, input the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results.
本申请实施例中的BP神经网络模型的网络结构包括三层,分别为输入层、隐藏层以及输出层,其中,输入层输入步骤101中的特征信息,当然该特征信息为经过归一化处理后的特征信息,例如将上述的9个特征信息引入9个输入节点,隐藏层可以包含三层,可以分别为10个节点、5个节点、2个节点,最后通过SoftMax函数得到2维的输出层,分别代表的是用户为男性和女性的概率。对于收集到的所有样本,采用mini batch梯度下降法的方式批量送入网络进行训练。The network structure of the BP neural network model in the embodiment of the present application includes three layers, which are an input layer, a hidden layer, and an output layer, respectively, wherein the input layer inputs the feature information in step 101, and of course the feature information is normalized. After the feature information, for example, the above 9 feature information is introduced into 9 input nodes, and the hidden layer may include three layers, which may be 10 nodes, 5 nodes, 2 nodes respectively, and finally obtain 2D output through the SoftMax function. The layers represent the probability that the user is male and female. For all samples collected, they were sent to the network for training in batches using the mini batch gradient descent method.
在本实施例中,基于第一预设公式计算以得到两个预测结果的概率,其中所述第一预设公式为:In this embodiment, the probability of obtaining two prediction results is calculated based on the first preset formula, wherein the first preset formula is:
Figure PCTCN2018116481-appb-000004
Figure PCTCN2018116481-appb-000004
其中,Z K为中间值,C为预测结果的类别数,
Figure PCTCN2018116481-appb-000005
为第j个中间值,基于所述第一预设公式得到两维的输出层,分别代表所述样本为男性和女性的概率。
Where Z K is the intermediate value and C is the number of categories of the predicted result.
Figure PCTCN2018116481-appb-000005
For the jth intermediate value, a two-dimensional output layer is obtained based on the first preset formula, respectively representing the probability that the sample is male and female.
步骤S204,根据两个预测结果和与其对应的概率得到损失值。Step S204, the loss value is obtained according to the two prediction results and the probability corresponding thereto.
在本实施例中,基于第二预设公式根据两个预测结果和与其对应的概率得到损失值,其中所述第二预设公式为:In this embodiment, the loss value is obtained according to the two prediction results and the probability corresponding thereto according to the second preset formula, wherein the second preset formula is:
Figure PCTCN2018116481-appb-000006
Figure PCTCN2018116481-appb-000006
其中C为预测结果的类别数,y k为真实值,E为平均值。 Where C is the number of categories of prediction results, y k is the true value, and E is the average.
步骤S205,根据损失值进行训练,生成目标模型参数。In step S205, training is performed according to the loss value to generate a target model parameter.
其中,根据损失值利用梯度下降法进行训练。在本实施例中,对于收集到的所有样本,采用mini batch梯度下降法的方式批量送入网络进行训练。其中上述mini batch梯度下降法是介于最快梯度下降法和随机梯度下降法之间的一种优化算法,每次选取一定量的训练样本进行迭代。采用交叉熵(crossentropy)做损失函数以反向传播更新网络权值,满足损失值小于预设阈值或是迭代次数达到设定的训练迭代次数后,可以结束训练。Among them, the gradient descent method is used for training according to the loss value. In this embodiment, all the samples collected are sent to the network for training in a batch manner using a mini batch gradient descent method. The above mini batch gradient descent method is an optimization algorithm between the fastest gradient descent method and the stochastic gradient descent method, and each time a certain amount of training samples are selected for iteration. The cross-entropy (crossentropy) is used as the loss function to update the network weights in reverse propagation. After the loss value is less than the preset threshold or the number of iterations reaches the set number of training iterations, the training can be ended.
步骤S206,获取未知性别用户使用电子设备的多维特征信息并作为预测样本。Step S206: Acquire multi-dimensional feature information of the electronic device of the unknown gender user and use it as a prediction sample.
例如,统计一周内前文提到的用户特征值,得到用户特征向量,将归一化后的特征值输入网络中,做一次前向计算,网络的输出为该用户性别是男或女的概率,预测样本中可以包括以下特征:For example, the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female. The following characteristics can be included in the forecast sample:
用户在购物应用中浏览偏男性类商品(如男装)次数与时长;The number and duration of browsing of male-type items (such as men's clothing) in the shopping app;
用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;The number and duration of browsing of female-type items (such as cosmetics and women's clothing) in the shopping application;
用户阅读偏男性类小说的时长;The length of time users read a male-like novel;
用户阅读偏女性类小说的时长;The length of time users read a female novel;
用户阅读体育类新闻的时长;The length of time users read sports news;
用户阅读星座类新闻的时长;The length of time users read the news of the constellation;
用户使用前置摄像头自拍的次数;The number of times the user used the front camera to take a self-portrait;
用户使用美颜类软件的次数;The number of times the user uses the beauty software;
用户玩不同类别游戏的次数与时长。The number and duration of users playing different types of games.
步骤S207,根据预测样本和训练后的预测模型生成预测概率,预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。Step S207, generating a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability comprises: a first probability that the current user is a male, and a second probability that the current user is a female.
步骤S208,对当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果。In step S208, the first probability that the current user is a male is compared with the second probability that the current user is a female, and a comparison result is obtained.
步骤S209,根据比较结果,输出最终的预测结果。Step S209, according to the comparison result, output the final prediction result.
在本实施例中,根据比较结果,输出最终的预测结果的步骤可以包括:In this embodiment, according to the comparison result, the step of outputting the final prediction result may include:
当第一概率大于第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
当第一概率不大于第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, the second predicted result that the current user is a female is output.
例如,对于某个预测概率,如果Y=1表示当前用户为男性、Y=0表示当前用户为女性,假设P(Y=1|x)大于P(Y=0|x),此时,输出当前用户为男性的第一预测结果;假设P(Y=1|x)不大于P(Y=0|x),此时,输出当前用户为女性的第二预测结果。For example, for a certain prediction probability, if Y=1 indicates that the current user is male, Y=0 indicates that the current user is female, and if P(Y=1|x) is greater than P(Y=0|x), then the output is The current user is the first predicted result of the male; assuming P(Y=1|x) is not greater than P(Y=0|x), at this time, the second predicted result that the current user is a female is output.
由上可知,本申请实施例采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,对样本集中的样本参数进行归一化处理,将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率,根据两个预测结果和与其对应 的概率得到损失值,根据损失值进行训练,生成目标模型参数,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据预测样本和训练后的预测模型生成预测概率,预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率,对当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果,根据比较结果,输出最终的预测结果。本申请可以基于BP神经网络模型对当前用户的性别进行预测,提高了性别预测的准确性,方便电子设备完成用户的精准画像。As can be seen from the above, the embodiment of the present application collects the multi-dimensional feature information of the known gender user using the electronic device as a sample, constructs a sample set of the gender prediction, normalizes the sample parameters in the sample set, and normalizes the sample parameters. The BP neural network model is input to obtain the probability of two prediction results, the loss value is obtained according to the two prediction results and the probability corresponding thereto, the training is performed according to the loss value, the target model parameter is generated, and the multi-dimensional use of the electronic device by the unknown gender user is obtained. The feature information is used as a prediction sample, and the prediction probability is generated according to the prediction sample and the trained prediction model. The prediction probability includes: a first probability that the current user is a male, and a second probability that the current user is a female, and the current user is a male. A probability is compared with a second probability that the current user is a woman, a comparison result is obtained, and a final prediction result is output according to the comparison result. The application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
请参阅图5,图5为本申请实施例提供的用户性别预测装置的另一应用场景示意图。当预测模型的训练过程在服务器端完成,预测模型的实际预测过程在电子设备端完成时,需要使用优化后的预测模型时,可以将电子设备当前的多维特征信息输入到电子设备,电子设备实际预测完成后,电子设备根据预测结果输出预测结果。可选的,可以将训练好的预测模型文件(model文件)移植到智能设备上,若需要判断当前用户的性别,更新当前的样本集,输入到训练好的预测模型文件(model文件),计算即可得到预测值。Referring to FIG. 5, FIG. 5 is a schematic diagram of another application scenario of a user gender prediction apparatus according to an embodiment of the present application. When the training process of the predictive model is completed on the server side, and the actual prediction process of the predictive model is completed on the electronic device side, when the optimized predictive model needs to be used, the current multidimensional feature information of the electronic device can be input to the electronic device, and the electronic device actually After the prediction is completed, the electronic device outputs the predicted result based on the predicted result. Optionally, the trained predictive model file (model file) can be transplanted to the smart device. If it is necessary to determine the gender of the current user, update the current sample set, input the trained predictive model file (model file), and calculate You can get the predicted value.
在一些实施例中,在获取电子设备当前的多维特征信息的步骤之前,还可以包括:In some embodiments, before the step of acquiring the current multi-dimensional feature information of the electronic device, the method may further include:
获取预设时间,若当前系统时间到达预设时间时,则获取电子设备当前的多维特征信息。其中预设时间可以为一天中的一个时间点,如上午9点,也可以为一天中的几个时间点,如上午9点、下午6点等。也可以为多天中的一个或几个时间点。然后根据预测模型、优化参数生成预测结果,并根据预测结果对应用程序进行管控。The preset time is obtained. If the current system time reaches the preset time, the current multi-dimensional feature information of the electronic device is obtained. The preset time can be a time point in the day, such as 9 am, or several time points in the day, such as 9 am, 6 pm, and the like. It can also be one or several time points in multiple days. Then, the prediction result is generated according to the prediction model and the optimization parameter, and the application is controlled according to the prediction result.
上述所有的技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All of the above technical solutions may be combined to form an optional embodiment of the present application, and will not be further described herein.
请参阅图6,图6为本申请实施例提供的用户性别预测装置的结构示意图。其中该用户性别预测装置300应用于电子设备,该用户性别预测装置300包括采集模块301、训练模块302、获取模块303以及生成模块304。Please refer to FIG. 6. FIG. 6 is a schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application. The user gender prediction device 300 is applied to an electronic device, and the user gender prediction device 300 includes an acquisition module 301, a training module 302, an acquisition module 303, and a generation module 304.
其中,采集模块301,用于采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集。The collecting module 301 is configured to collect multi-dimensional feature information of the known gender user using the electronic device as a sample, and construct a sample set of the gender prediction.
具体的,性别预测的样本集中,可以包括在历史时间段内采集的多个训练样本。历史时间段,例如可以是过去7天、10天等。可以理解的是,一次采集的未知性别用户使用电子设备的多维特征信息构成一个样本集。对于每个样本的特征集,均用一组实数予以记录。在采样时需要对样本的数值进行归一化处理,如归一化成0~1之间的数。这样每个用户是一个样本,N个用户组成了N个样本,每个样本的特征是(x 1,x 2,...,x n)。 Specifically, the sample set of gender prediction may include multiple training samples collected during the historical time period. The historical time period can be, for example, the past 7 days, 10 days, and the like. It can be understood that the unknown gender user collected at one time uses the multi-dimensional feature information of the electronic device to form a sample set. The feature set for each sample is recorded with a set of real numbers. The sample values need to be normalized during sampling, such as normalization to a number between 0 and 1. Thus each user is a sample, and N users make up N samples, each of which is characterized by (x 1 , x 2 , . . . , x n ).
在构成样本集之后,可以对样本集中的每个样本进行标记,得到每个样本的样本标签,由于本实施要实现的是预测当前用户的性别,因此,所标记的样本标签是该用户注册帐号时实际提供的性别信息,包括性别男和性别女,可以用编码为0或者1来表示,此时,样本类别可以包括性别男和性别女。具体可根据用户对电子设备中应用的历史使用习惯进行标记,例如:在一周的时间长度之内用户在购物应用中浏览偏男性类商品(如男装)次数 为20次、累计时长为一小时,则可以标记为性别男;再例如,在一周的时间长度之内用户阅读偏女性类小说的时长为8个小时,则可以标记为性别女,具体地,可以用数值“1”表示“性别男”,用数值“0”表示“性别女”,反之亦可。After constituting the sample set, each sample in the sample set can be marked to obtain a sample label of each sample. Since the implementation is to predict the gender of the current user, the labeled sample label is the registered account of the user. The actual gender information provided, including gender males and gender females, can be represented by a code of 0 or 1. In this case, the sample categories can include gender males and gender females. Specifically, the user may mark the historical usage habits of the application in the electronic device, for example, within a length of one week, the user browses the male-type goods (such as men's clothing) in the shopping application for 20 times, and the accumulated duration is one hour. It can be marked as a gender male; for example, within a period of one week, the user can read a female novel for 8 hours, which can be marked as a gender female. Specifically, the value "1" can be used to indicate "gender male". ", use the value "0" to indicate "gender female", and vice versa.
其中,上述性别预测的样本集用于对BP神经网络模型进行训练,例如样本集中可以包括以下特征:Wherein, the sample set of the above gender prediction is used to train the BP neural network model, for example, the sample set may include the following features:
用户在购物应用中浏览偏男性类商品(如男装)次数与时长;The number and duration of browsing of male-type items (such as men's clothing) in the shopping app;
用户在购物应用中浏览偏女性类商品(如化妆品、女装)次数与时长;The number and duration of browsing of female-type items (such as cosmetics and women's clothing) in the shopping application;
用户阅读偏男性类小说的时长;The length of time users read a male-like novel;
用户阅读偏女性类小说的时长;The length of time users read a female novel;
用户阅读体育类新闻的时长;The length of time users read sports news;
用户阅读星座类新闻的时长;The length of time users read the news of the constellation;
用户使用前置摄像头自拍的次数;The number of times the user used the front camera to take a self-portrait;
用户使用美颜类软件的次数;The number of times the user uses the beauty software;
用户玩不同类别游戏的次数与时长。The number and duration of users playing different types of games.
训练模块302,用于利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型。The training module 302 is configured to train the BP neural network model by using the sample set to obtain the trained prediction model.
在一实施例中,可以将样本参数输入所述BP神经网络模型以得到两个预测结果的概率,基于第一预设公式计算以得到两个预测结果的概率,其中所述第一预设公式为:In an embodiment, the sample parameters may be input into the BP neural network model to obtain the probability of two prediction results, and the probability of obtaining two prediction results based on the first preset formula, wherein the first preset formula is for:
Figure PCTCN2018116481-appb-000007
Figure PCTCN2018116481-appb-000007
其中,Z K为中间值,C为预测结果的类别数,
Figure PCTCN2018116481-appb-000008
为第j个中间值,基于所述第一预设公式得到两维的输出层,分别代表所述样本为男性和女性的概率。
Where Z K is the intermediate value and C is the number of categories of the predicted result.
Figure PCTCN2018116481-appb-000008
For the jth intermediate value, a two-dimensional output layer is obtained based on the first preset formula, respectively representing the probability that the sample is male and female.
然后根据两个预测结果和与其对应的概率得到损失值,基于第二预设公式根据两个预测结果和与其对应的概率得到损失值,其中所述第二预设公式为:Then, the loss value is obtained according to the two prediction results and the probability corresponding thereto, and the loss value is obtained according to the two prediction results and the probability corresponding thereto according to the second preset formula, wherein the second preset formula is:
Figure PCTCN2018116481-appb-000009
Figure PCTCN2018116481-appb-000009
其中C为预测结果的类别数,y k为真实值,E为平均值。然后再根据损失值进行训练,生成目标模型参数。 Where C is the number of categories of prediction results, y k is the true value, and E is the average. Then, according to the loss value, training is performed to generate the target model parameters.
获取模块303,用于获取未知性别用户使用电子设备的多维特征信息并作为预测样本。The obtaining module 303 is configured to obtain multi-dimensional feature information of the electronic device that the unknown gender user uses and use as a prediction sample.
例如,统计一周内前文提到的用户特征值,得到用户特征向量,将归一化后的特征值输入网络中,做一次前向计算,网络的输出为该用户性别是男或女的概率,预测样本中可以包括上述的一些特征。For example, the user characteristic value mentioned in the previous week is counted, the user feature vector is obtained, and the normalized feature value is input into the network to perform a forward calculation, and the output of the network is the probability that the user gender is male or female. Some of the features described above may be included in the predicted sample.
生成模块304,用于根据预测样本和训练后的预测模型生成预测概率,预测概率包括: 当前用户为男性的第一概率、和当前用户为女性的第二概率。The generating module 304 is configured to generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability comprises: a first probability that the current user is a male, and a second probability that the current user is a female.
为了简化应用预测运行,提升预测速度,可以针对预测概率中两个概率选取一个概率,然后,基于选取的概率来预测当前用户的性别。例如,对于某个预测概率,如果Y=1表示当前用户为男性、Y=0表示当前用户为女性,假设P(Y=1|x)大于P(Y=0|x),此时,输出当前用户为男性的第一预测结果;假设P(Y=1|x)不大于P(Y=0|x),此时,输出当前用户为女性的第二预测结果。In order to simplify the application prediction operation and improve the prediction speed, one probability can be selected for the two probabilities in the prediction probability, and then the gender of the current user is predicted based on the selected probability. For example, for a certain prediction probability, if Y=1 indicates that the current user is male, Y=0 indicates that the current user is female, and if P(Y=1|x) is greater than P(Y=0|x), then the output is The current user is the first predicted result of the male; assuming P(Y=1|x) is not greater than P(Y=0|x), at this time, the second predicted result that the current user is a female is output.
请一并参阅图7,图7为本申请实施例提供的用户性别预测装置的另一结构示意图。在一些实施方式中,训练模块302可以具体包括处理子模块3021和训练子模块3022。Please refer to FIG. 7. FIG. 7 is another schematic structural diagram of a user gender prediction apparatus according to an embodiment of the present application. In some embodiments, the training module 302 can specifically include a processing sub-module 3021 and a training sub-module 3022.
处理子模块3021,用于对样本集中的样本参数进行归一化处理;The processing sub-module 3021 is configured to perform normalization processing on the sample parameters in the sample set;
训练子模块3022,用于根据归一化之后的样本参数对所述BP神经网络模型进行训练。The training sub-module 3022 is configured to train the BP neural network model according to the normalized sample parameters.
在一实施例中,上述训练子模块3022,具体用于将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率,根据所述两个预测结果和与其对应的概率得到损失值,根据所述损失值进行训练,生成目标模型参数。In an embodiment, the training sub-module 3022 is specifically configured to input the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results, according to the two prediction results and corresponding The probability obtains a loss value, and is trained according to the loss value to generate a target model parameter.
在一实施例中,该装置300还可以包括:In an embodiment, the apparatus 300 may further include:
比较模块305,用于在生成模块304根据预测样本和训练后的预测模型生成预测概率之后,对当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;The comparing module 305 is configured to compare, after the generating module 304 generates the predicted probability according to the predicted sample and the trained predictive model, a first probability that the current user is a male and a second probability that the current user is a female, to obtain a comparison result;
输出模块306,用于根据比较结果,输出最终的预测结果。The output module 306 is configured to output a final prediction result according to the comparison result.
具体的,上述输出模块306,具体用于当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果,当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。Specifically, the output module 306 is specifically configured to: when the first probability is greater than the second probability, output a first prediction result that the current user is a male, when the first probability is not greater than the second probability , output the second predicted result that the current user is a female.
需要说明的是,预测模型的训练过程可以在服务器端也可以在电子设备端完成。当预测模型的训练过程、实际预测过程都在服务器端完成时,需要使用训练后的预测模型时,可以将未知性别的用户使用电子设备的多维特征信息输入到服务器,服务器实际预测完成后,将预测结果发送至电子设备端,最后由电子设备输出预测结果。It should be noted that the training process of the predictive model can be completed on the server side or on the electronic device side. When the training process and the actual prediction process of the predictive model are completed on the server side, when the trained predictive model is needed, the multi-dimensional feature information of the electronic device can be input to the server by the user of unknown gender, and after the actual prediction of the server is completed, The prediction result is sent to the electronic device end, and finally the electronic device outputs the predicted result.
当预测模型的训练过程、实际预测过程都在电子设备端完成时,需要使用训练后的预测模型时,可以将的多维特征信息输入到电子设备,电子设备实际预测完成后,电子设备输出预测结果。When the training process and the actual prediction process of the predictive model are completed on the electronic device end, when the trained predictive model is needed, the multi-dimensional feature information can be input to the electronic device, and after the actual prediction of the electronic device is completed, the electronic device outputs the predicted result. .
上述所有的技术方案,可以采用任意结合形成本申请的可选实施例,在此不再一一赘述。All of the above technical solutions may be combined to form an optional embodiment of the present application, and will not be further described herein.
由上述可知,本申请实施例的用户性别预测装置,通过采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据该预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当 前用户为男性的第一概率、和当前用户为女性的第二概率。本申请可以基于BP神经网络模型对当前用户的性别进行预测,提高了性别预测的准确性,方便电子设备完成用户的精准画像。It can be seen from the above that the user gender prediction apparatus in the embodiment of the present application constructs a sample set of gender prediction by collecting multi-dimensional feature information of a known gender user using the electronic device, and uses the sample set to train the BP neural network model to obtain The predicted model after the training obtains multi-dimensional feature information of the electronic device of the unknown gender and uses the electronic device as a prediction sample, and generates a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: the first probability that the current user is a male And the second probability that the current user is a woman. The application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
本申请实施例中,用户性别预测装置与上文实施例中的用户性别预测方法属于同一构思,在用户性别预测装置上可以运行用户性别预测方法实施例中提供的任一方法,其具体实现过程详见用户性别预测方法的实施例,此处不再赘述。In the embodiment of the present application, the user gender prediction apparatus belongs to the same concept as the user gender prediction method in the above embodiment, and any method provided in the embodiment of the user gender prediction method may be run on the user gender prediction apparatus, and the specific implementation process thereof For details, refer to the embodiment of the user gender prediction method, and details are not described herein again.
本文所使用的术语“模块”可看作为在该运算系统上执行的软件对象。本文所述的不同组件、模块、引擎及服务可看作为在该运算系统上的实施对象。而本文所述的装置及方法可以以软件的方式进行实施,当然也可在硬件上进行实施,均在本申请保护范围之内。The term "module" as used herein may be taken to mean a software object that is executed on the computing system. The different components, modules, engines, and services described herein can be viewed as implementation objects on the computing system. The apparatus and method described herein may be implemented in software, and may of course be implemented in hardware, all of which are within the scope of the present application.
本申请实施例还提供一种存储介质,其上存储有计算机程序,当所述计算机程序在计算机上运行时,使得所述计算机执行上述的用户性别预测方法。The embodiment of the present application further provides a storage medium on which a computer program is stored, and when the computer program runs on a computer, causes the computer to execute the user gender prediction method described above.
本申请实施例还提供一种电子设备,如平板电脑、手机等电子设备。电子设备中的处理器会按照如下的步骤,将一个或一个以上的应用程序的进程对应的指令加载到存储器中,并由处理器来运行存储在存储器中的应用程序,从而实现各种功能:The embodiment of the present application further provides an electronic device, such as an electronic device such as a tablet computer or a mobile phone. The processor in the electronic device loads the instructions corresponding to the process of one or more applications into the memory according to the following steps, and the processor runs the application stored in the memory to implement various functions:
采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;Collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, and constructing a sample set of gender prediction;
利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;Using the sample set to train the BP neural network model to obtain a trained prediction model;
获取未知性别用户使用电子设备的多维特征信息并作为预测样本;Obtaining multi-dimensional feature information of an electronic device that is used by an unknown gender user as a prediction sample;
根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。A prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
在一实施例中,利用所述样本集对BP神经网络模型进行训练时,所述处理器用于执行以下步骤:In an embodiment, when training the BP neural network model with the sample set, the processor is configured to perform the following steps:
对所述样本集中的样本参数进行归一化处理;Normalizing the sample parameters in the sample set;
根据归一化之后的样本参数对所述BP神经网络模型进行训练。The BP neural network model is trained according to the normalized sample parameters.
在一实施例中,根据归一化之后的样本参数对所述BP神经网络模型进行训练时,所述处理器用于执行以下步骤:In an embodiment, when the BP neural network model is trained according to the normalized sample parameters, the processor is configured to perform the following steps:
将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率;Entering the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results;
根据所述两个预测结果和与其对应的概率得到损失值;Obtaining a loss value according to the two prediction results and a probability corresponding thereto;
根据所述损失值进行训练,生成目标模型参数。Training is performed based on the loss value to generate a target model parameter.
在一实施例中,根据所述损失值进行训练时,所述处理器用于执行以下步骤:In an embodiment, when training is performed according to the loss value, the processor is configured to perform the following steps:
根据所述损失值利用梯度下降法进行训练。Training is performed using the gradient descent method based on the loss value.
在一实施例中,在根据所述预测样本和训练后的预测模型生成预测概率之后,所述处理器还用于执行以下步骤:In an embodiment, after generating the prediction probability according to the predicted sample and the trained prediction model, the processor is further configured to perform the following steps:
对所述当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;Comparing a first probability that the current user is a male with a second probability that the current user is a female, and obtaining a comparison result;
根据所述比较结果,输出最终的预测结果。Based on the comparison result, the final prediction result is output.
在一实施例中,根据所述比较结果,输出最终的预测结果时,所述处理器用于执行以下步骤:In an embodiment, when the final prediction result is output according to the comparison result, the processor is configured to perform the following steps:
当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, outputting a second prediction result that the current user is a female.
在本文中提及“实施例”意味着,结合实施例描述的特定特征、结构或特性可以包含在本申请的至少一个实施例中。在说明书中的各个位置出现该短语并不一定均是指相同的实施例,也不是与其它实施例互斥的独立的或备选的实施例。本领域技术人员显式地和隐式地理解的是,本文所描述的实施例可以与其它实施例相结合。References to "an embodiment" herein mean that a particular feature, structure, or characteristic described in connection with the embodiments can be included in at least one embodiment of the present application. The appearances of the phrases in various places in the specification are not necessarily referring to the same embodiments, and are not exclusive or alternative embodiments that are mutually exclusive. Those skilled in the art will understand and implicitly understand that the embodiments described herein can be combined with other embodiments.
请参阅图8,电子设备400包括处理器401以及存储器402。其中,处理器401与存储器402电性连接。Referring to FIG. 8, the electronic device 400 includes a processor 401 and a memory 402. The processor 401 is electrically connected to the memory 402.
处理器400是电子设备400的控制中心,利用各种接口和线路连接整个电子设备的各个部分,通过运行或加载存储在存储器402内的计算机程序,以及调用存储在存储器402内的数据,执行电子设备400的各种功能并处理数据,从而对电子设备400进行整体监控。The processor 400 is a control center of the electronic device 400 that connects various portions of the entire electronic device using various interfaces and lines, executes the electronic by running or loading a computer program stored in the memory 402, and recalling data stored in the memory 402. The various functions of device 400 and processing data to provide overall monitoring of electronic device 400.
存储器402可用于存储软件程序以及模块,处理器401通过运行存储在存储器402的计算机程序以及模块,从而执行各种功能应用以及数据处理。存储器402可主要包括存储程序区和存储数据区,其中,存储程序区可存储操作系统、至少一个功能所需的计算机程序(比如声音播放功能、图像播放功能等)等;存储数据区可存储根据电子设备的使用所创建的数据等。此外,存储器402可以包括高速随机存取存储器,还可以包括非易失性存储器,例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。相应地,存储器402还可以包括存储器控制器,以提供处理器401对存储器402的访问。The memory 402 can be used to store software programs and modules, and the processor 401 executes various functional applications and data processing by running computer programs and modules stored in the memory 402. The memory 402 can mainly include a storage program area and a storage data area, wherein the storage program area can store an operating system, a computer program required for at least one function (such as a sound playing function, an image playing function, etc.), and the like; the storage data area can be stored according to Data created by the use of electronic devices, etc. Moreover, memory 402 can include high speed random access memory, and can also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device. Accordingly, memory 402 can also include a memory controller to provide processor 401 access to memory 402.
在本申请实施例中,电子设备400中的处理器401会按照如下的步骤,将一个或一个以上的计算机程序的进程对应的指令加载到存储器402中,并由处理器401运行存储在存储器402中的计算机程序,从而实现各种功能,如下:In the embodiment of the present application, the processor 401 in the electronic device 400 loads the instructions corresponding to the process of one or more computer programs into the memory 402 according to the following steps, and is stored in the memory 402 by the processor 401. The computer program in which to implement various functions, as follows:
采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集,利用样本集对BP神经网络模型进行训练,以得到训练后的预测模型,获取未知性别用户使用电子设备的多维特征信息并作为预测样本,根据该预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。本申请可以基于BP神经网络模型对当前用户的性别进行预测,提高了性别预测的准确性,方便电子设备完成用户的精准画像。The multi-dimensional feature information of the known gender users using the electronic device is collected as a sample, the sample set of gender prediction is constructed, and the BP neural network model is trained by using the sample set to obtain the predicted model after training, and the multi-dimensional use of the electronic device by the unknown gender user is obtained. The feature information is used as a prediction sample, and a prediction probability is generated according to the prediction sample and the trained prediction model, the prediction probability including: a first probability that the current user is a male, and a second probability that the current user is a female. The application can predict the gender of the current user based on the BP neural network model, improve the accuracy of the gender prediction, and facilitate the electronic device to complete the accurate portrait of the user.
请一并参阅图9,在一些实施方式中,电子设备400还可以包括:显示器403、射频电路404、音频电路405以及电源406。其中,其中,显示器403、射频电路404、音频电路405以及电源406分别与处理器401电性连接。Referring to FIG. 9 together, in some embodiments, the electronic device 400 may further include: a display 403, a radio frequency circuit 404, an audio circuit 405, and a power source 406. The display 403, the radio frequency circuit 404, the audio circuit 405, and the power source 406 are electrically connected to the processor 401, respectively.
显示器403可以用于显示由用户输入的信息或提供给用户的信息以及各种图形用户接 口,这些图形用户接口可以由图形、文本、图标、视频和其任意组合来构成。显示器403可以包括显示面板,在一些实施方式中,可以采用液晶显示器(Liquid Crystal Display,LCD)、或者有机发光二极管(Organic Light-Emitting Diode,OLED)等形式来配置显示面板。 Display 403 can be used to display information entered by the user or information provided to the user, as well as various graphical user interfaces, which can be comprised of graphics, text, icons, video, and any combination thereof. The display 403 can include a display panel. In some embodiments, the display panel can be configured in the form of a liquid crystal display (LCD), or an organic light-emitting diode (OLED).
射频电路404可以用于收发射频信号,以通过无线通信与网络设备或其他电子设备建立无线通讯,与网络设备或其他电子设备之间收发信号。The radio frequency circuit 404 can be used to transmit and receive radio frequency signals to establish wireless communication with network devices or other electronic devices through wireless communication, and to transmit and receive signals with network devices or other electronic devices.
音频电路405可以用于通过扬声器、传声器提供用户与电子设备之间的音频接口。The audio circuit 405 can be used to provide an audio interface between the user and the electronic device through the speaker and the microphone.
电源406可以用于给电子设备400的各个部件供电。在一些实施例中,电源406可以通过电源管理系统与处理器401逻辑相连,从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。 Power source 406 can be used to power various components of electronic device 400. In some embodiments, the power supply 406 can be logically coupled to the processor 401 through a power management system to enable functions such as managing charging, discharging, and power management through the power management system.
尽管图9中未示出,电子设备400还可以包括摄像头、蓝牙模块等,在此不再赘述。Although not shown in FIG. 9, the electronic device 400 may further include a camera, a Bluetooth module, and the like, and details are not described herein.
在本申请实施例中,存储介质可以是磁碟、光盘、只读存储器(Read Only Memory,ROM)、或者随机存取记忆体(Random Access Memory,RAM)等。In the embodiment of the present application, the storage medium may be a magnetic disk, an optical disk, a read only memory (ROM), or a random access memory (RAM).
在上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above embodiments, the descriptions of the various embodiments are different, and the details that are not detailed in a certain embodiment can be referred to the related descriptions of other embodiments.
需要说明的是,对本申请实施例的用户性别预测方法而言,本领域普通测试人员可以理解实现本申请实施例用户性别预测方法的全部或部分流程,是可以通过计算机程序来控制相关的硬件来完成,计算机程序可存储于一计算机可读取存储介质中,如存储在电子设备的存储器中,并被该电子设备内的至少一个处理器执行,在执行过程中可包括如用户性别预测方法的实施例的流程。其中,的存储介质可为磁碟、光盘、只读存储器、随机存取记忆体等。It should be noted that, for the user gender prediction method in the embodiment of the present application, a general tester in the field can understand all or part of the process of implementing the user gender prediction method in the embodiment of the present application, and the related hardware can be controlled by a computer program. Upon completion, the computer program can be stored in a computer readable storage medium, such as in a memory of the electronic device, and executed by at least one processor within the electronic device, and can include, for example, a user gender prediction method during execution. The flow of the embodiment. The storage medium may be a magnetic disk, an optical disk, a read only memory, a random access memory, or the like.
对本申请实施例的用户性别预测装置而言,其各功能模块可以集成在一个处理芯片中,也可以是各个模块单独物理存在,也可以两个或两个以上模块集成在一个模块中。上述集成的模块既可以采用硬件的形式实现,也可以采用软件功能模块的形式实现。集成的模块如果以软件功能模块的形式实现并作为独立的产品销售或使用时,也可以存储在一个计算机可读取存储介质中,存储介质譬如为只读存储器,磁盘或光盘等。For the user gender prediction apparatus of the embodiment of the present application, each functional module may be integrated into one processing chip, or each module may exist physically separately, or two or more modules may be integrated into one module. The above integrated modules can be implemented in the form of hardware or in the form of software functional modules. An integrated module, if implemented in the form of a software functional module and sold or used as a standalone product, may also be stored in a computer readable storage medium such as a read only memory, a magnetic disk or an optical disk.
以上对本申请实施例所提供的一种用户性别预测方法、装置、存储介质及电子设备进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The user gender prediction method, apparatus, storage medium, and electronic device provided by the embodiments of the present application are described in detail. The principles and implementation manners of the present application are described in the specific examples, and the foregoing embodiments are described. It is only used to help understand the method of the present application and its core idea; at the same time, for those skilled in the art, according to the idea of the present application, there will be changes in the specific implementation manner and application scope. The contents of the description should not be construed as limiting the application.

Claims (20)

  1. 一种用户性别预测方法,其中,所述方法包括以下步骤:A user gender prediction method, wherein the method comprises the following steps:
    采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;Collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, and constructing a sample set of gender prediction;
    利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;Using the sample set to train the BP neural network model to obtain a trained prediction model;
    获取未知性别用户使用电子设备的多维特征信息并作为预测样本;Obtaining multi-dimensional feature information of an electronic device that is used by an unknown gender user as a prediction sample;
    根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。A prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  2. 根据权利要求1所述的用户性别预测方法,其中,利用所述样本集对BP神经网络模型进行训练的步骤,包括:The user gender prediction method according to claim 1, wherein the step of training the BP neural network model by using the sample set comprises:
    对所述样本集中的样本参数进行归一化处理;Normalizing the sample parameters in the sample set;
    根据归一化之后的样本参数对所述BP神经网络模型进行训练。The BP neural network model is trained according to the normalized sample parameters.
  3. 根据权利要求2所述的用户性别预测方法,其中,根据归一化之后的样本参数对所述BP神经网络模型进行训练的步骤,包括:The user gender prediction method according to claim 2, wherein the step of training the BP neural network model according to the normalized sample parameters comprises:
    将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率;Entering the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results;
    根据所述两个预测结果和与其对应的概率得到损失值;Obtaining a loss value according to the two prediction results and a probability corresponding thereto;
    根据所述损失值进行训练,生成目标模型参数。Training is performed based on the loss value to generate a target model parameter.
  4. 根据权利要求3所述的用户性别预测方法,其中,所述根据所述损失值进行训练的步骤,包括:The user gender prediction method according to claim 3, wherein the step of training according to the loss value comprises:
    根据所述损失值利用梯度下降法进行训练。Training is performed using the gradient descent method based on the loss value.
  5. 根据权利要求1所述的用户性别预测方法,其中,在根据所述预测样本和训练后的预测模型生成预测概率之后,所述方法还包括:The user gender prediction method according to claim 1, wherein after generating the prediction probability according to the prediction sample and the trained prediction model, the method further comprises:
    对所述当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;Comparing a first probability that the current user is a male with a second probability that the current user is a female, and obtaining a comparison result;
    根据所述比较结果,输出最终的预测结果。Based on the comparison result, the final prediction result is output.
  6. 根据权利要求5所述的用户性别预测方法,其中,所述根据所述比较结果,输出最终的预测结果的步骤,包括:The user gender prediction method according to claim 5, wherein the step of outputting the final prediction result according to the comparison result comprises:
    当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
    当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, outputting a second prediction result that the current user is a female.
  7. 根据权利要求3所述的用户性别预测方法,其中,将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率的步骤,包括:The user gender prediction method according to claim 3, wherein the step of inputting the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results comprises:
    基于第一预设公式计算以得到两个预测结果的概率,其中所述第一预设公式为:Calculating a probability based on a first preset formula to obtain two prediction results, wherein the first preset formula is:
    Figure PCTCN2018116481-appb-100001
    Figure PCTCN2018116481-appb-100001
    其中,Z K为中间值,C为预测结果的类别数,
    Figure PCTCN2018116481-appb-100002
    为第j个中间值,基于所述第一预设公式得到两维的输出层,分别代表所述样本为男性和女性的概率。
    Where Z K is the intermediate value and C is the number of categories of the predicted result.
    Figure PCTCN2018116481-appb-100002
    For the jth intermediate value, a two-dimensional output layer is obtained based on the first preset formula, respectively representing the probability that the sample is male and female.
  8. 根据权利要求3所述的用户性别预测方法,其中,根据所述两个预测结果和与其对应的概率得到损失值的步骤,包括:The user gender prediction method according to claim 3, wherein the step of obtaining a loss value according to the two prediction results and the probability corresponding thereto comprises:
    基于第二预设公式根据所述两个预测结果和与其对应的概率得到损失值,其中所述第二预设公式为:And obtaining, according to the second preset formula, a loss value according to the two prediction results and a probability corresponding thereto, wherein the second preset formula is:
    Figure PCTCN2018116481-appb-100003
    Figure PCTCN2018116481-appb-100003
    其中C为预测结果的类别数,y k为真实值,E为平均值。 Where C is the number of categories of prediction results, y k is the true value, and E is the average.
  9. 一种用户性别预测装置,其中,所述装置包括:A user gender prediction device, wherein the device comprises:
    采集模块,用于采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;An acquisition module is configured to collect multi-dimensional feature information of a known gender user using the electronic device as a sample, and construct a sample set of gender prediction;
    训练模块,用于利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;a training module, configured to train the BP neural network model by using the sample set to obtain a trained prediction model;
    获取模块,用于获取未知性别用户使用电子设备的多维特征信息并作为预测样本;An obtaining module, configured to obtain multi-dimensional feature information of an electronic device used by an unknown gender user as a prediction sample;
    生成模块,用于根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。And a generating module, configured to generate a prediction probability according to the predicted sample and the trained prediction model, where the predicted probability includes: a first probability that the current user is a male, and a second probability that the current user is a female.
  10. 根据权利要求9所述的用户性别预测装置,其中,所述训练模块,具体包括:The user gender prediction apparatus according to claim 9, wherein the training module specifically includes:
    处理子模块,用于对所述样本集中的样本参数进行归一化处理;Processing a sub-module for normalizing the sample parameters in the sample set;
    训练子模块,用于根据归一化之后的样本参数对所述BP神经网络模型进行训练。The training submodule is configured to train the BP neural network model according to the normalized sample parameters.
  11. 根据权利要求10所述的用户性别预测装置,其中,The user gender prediction apparatus according to claim 10, wherein
    所述训练子模块,具体用于将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率,根据所述两个预测结果和与其对应的概率得到损失值,根据所述损失值进行训练,生成目标模型参数。The training sub-module is specifically configured to input the normalized sample parameters into the BP neural network model to obtain a probability of two prediction results, and obtain a loss value according to the two prediction results and a probability corresponding thereto, according to The loss value is trained to generate a target model parameter.
  12. 根据权利要求9所述的用户性别预测装置,其中,所述装置还包括:The user gender prediction apparatus according to claim 9, wherein the apparatus further comprises:
    比较模块,用于在生成模块根据所述预测样本和训练后的预测模型生成预测概率之后,对所述当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;a comparison module, configured to compare a first probability that the current user is a male with a second probability that the current user is a female, and obtain a comparison result, after the generating module generates the predicted probability according to the predicted sample and the trained predicted model ;
    输出模块,用于根据所述比较结果,输出最终的预测结果。And an output module, configured to output a final prediction result according to the comparison result.
  13. 根据权利要求12所述的用户性别预测装置,其中,The user gender prediction apparatus according to claim 12, wherein
    所述输出模块,具体用于当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果,当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。The output module is configured to: when the first probability is greater than the second probability, output a first prediction result that the current user is a male, and when the first probability is not greater than the second probability, output the current The user is the second predicted result of the woman.
  14. 一种存储介质,其上存储有计算机程序,其中,当所述计算机程序在计算机上运行时,使得所述计算机执行如权利要求1至8任一项所述的用户性别预测方法。A storage medium having stored thereon a computer program, wherein when the computer program is run on a computer, the computer is caused to perform the user gender prediction method according to any one of claims 1 to 8.
  15. 一种电子设备,包括处理器和存储器,所述存储器存储有多条指令,其中,所述处理器加载所述存储器中的指令用于执行以下步骤:An electronic device includes a processor and a memory, the memory storing a plurality of instructions, wherein the processor loads instructions in the memory for performing the following steps:
    采集已知性别用户使用电子设备的多维特征信息作为样本,构建性别预测的样本集;Collecting multi-dimensional feature information of a known gender user using the electronic device as a sample, and constructing a sample set of gender prediction;
    利用所述样本集对BP神经网络模型进行训练,以得到训练后的预测模型;Using the sample set to train the BP neural network model to obtain a trained prediction model;
    获取未知性别用户使用电子设备的多维特征信息并作为预测样本;Obtaining multi-dimensional feature information of an electronic device that is used by an unknown gender user as a prediction sample;
    根据所述预测样本和训练后的预测模型生成预测概率,所述预测概率包括:当前用户为男性的第一概率、和当前用户为女性的第二概率。A prediction probability is generated according to the predicted sample and the trained prediction model, the predicted probability including: a first probability that the current user is a male, and a second probability that the current user is a female.
  16. 根据权利要求15所述的电子设备,其中,利用所述样本集对BP神经网络模型进行训练时,所述处理器用于执行以下步骤:The electronic device of claim 15, wherein when the BP neural network model is trained using the sample set, the processor is configured to perform the following steps:
    对所述样本集中的样本参数进行归一化处理;Normalizing the sample parameters in the sample set;
    根据归一化之后的样本参数对所述BP神经网络模型进行训练。The BP neural network model is trained according to the normalized sample parameters.
  17. 根据权利要求16所述的电子设备,其中,根据归一化之后的样本参数对所述BP神经网络模型进行训练时,所述处理器用于执行以下步骤:The electronic device of claim 16, wherein the processor is configured to perform the following steps when training the BP neural network model based on the normalized sample parameters:
    将归一化之后的样本参数输入所述BP神经网络模型以得到两个预测结果的概率;Entering the normalized sample parameters into the BP neural network model to obtain the probability of two prediction results;
    根据所述两个预测结果和与其对应的概率得到损失值;Obtaining a loss value according to the two prediction results and a probability corresponding thereto;
    根据所述损失值进行训练,生成目标模型参数。Training is performed based on the loss value to generate a target model parameter.
  18. 根据权利要求17所述的电子设备,其中,根据所述损失值进行训练时,所述处理器用于执行以下步骤:The electronic device of claim 17, wherein said processor is operative to perform the following steps when training is performed based on said loss value:
    根据所述损失值利用梯度下降法进行训练。Training is performed using the gradient descent method based on the loss value.
  19. 根据权利要求15所述的电子设备,其中,在根据所述预测样本和训练后的预测模型生成预测概率之后,所述处理器还用于执行以下步骤:The electronic device of claim 15, wherein the processor is further configured to perform the following steps after generating the prediction probability according to the predicted sample and the trained prediction model:
    对所述当前用户为男性的第一概率与当前用户为女性的第二概率进行比较,得到比较结果;Comparing a first probability that the current user is a male with a second probability that the current user is a female, and obtaining a comparison result;
    根据所述比较结果,输出最终的预测结果。Based on the comparison result, the final prediction result is output.
  20. 根据权利要求19所述的电子设备,其中,根据所述比较结果,输出最终的预测结果时,所述处理器用于执行以下步骤:The electronic device according to claim 19, wherein, when the final prediction result is output according to the comparison result, the processor is configured to perform the following steps:
    当所述第一概率大于所述第二概率时,输出当前用户为男性的第一预测结果;When the first probability is greater than the second probability, outputting a first prediction result that the current user is a male;
    当所述第一概率不大于所述第二概率时,输出当前用户为女性的第二预测结果。When the first probability is not greater than the second probability, outputting a second prediction result that the current user is a female.
PCT/CN2018/116481 2017-12-20 2018-11-20 User gender prediction method and apparatus, storage medium and electronic device WO2019120019A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711387919.0 2017-12-20
CN201711387919.0A CN109948633A (en) 2017-12-20 2017-12-20 User gender prediction method, apparatus, storage medium and electronic equipment

Publications (1)

Publication Number Publication Date
WO2019120019A1 true WO2019120019A1 (en) 2019-06-27

Family

ID=66993043

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/116481 WO2019120019A1 (en) 2017-12-20 2018-11-20 User gender prediction method and apparatus, storage medium and electronic device

Country Status (2)

Country Link
CN (1) CN109948633A (en)
WO (1) WO2019120019A1 (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111209173A (en) * 2020-01-02 2020-05-29 腾讯科技(深圳)有限公司 Performance prediction method, device, storage medium and electronic equipment
CN111652432A (en) * 2020-06-01 2020-09-11 北京达佳互联信息技术有限公司 Method and device for determining user attribute information, electronic equipment and storage medium
CN112632351A (en) * 2020-12-28 2021-04-09 北京百度网讯科技有限公司 Training method, classification method, device and equipment of classification model
CN113095473A (en) * 2020-01-09 2021-07-09 富士通株式会社 Neural network architecture search system and method, and computer-readable recording medium
CN113535700A (en) * 2021-07-19 2021-10-22 福建凯米网络科技有限公司 User information updating method for digital audio-visual place and computer readable storage medium

Families Citing this family (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112541010B (en) * 2019-09-23 2023-05-23 银橙(上海)信息技术有限公司 User gender prediction method based on logistic regression
CN110796234B (en) * 2019-10-21 2022-07-12 苏州浪潮智能科技有限公司 Method and device for predicting computer state
CN112825178A (en) * 2019-11-21 2021-05-21 北京沃东天骏信息技术有限公司 Method and device for predicting user gender portrait
CN111143441A (en) * 2019-12-30 2020-05-12 北京每日优鲜电子商务有限公司 Gender determination method, device, equipment and storage medium
CN111178983B (en) * 2020-01-03 2024-03-12 北京搜狐新媒体信息技术有限公司 User gender prediction method, device, equipment and storage medium
CN111291928A (en) * 2020-01-21 2020-06-16 深圳市华星光电半导体显示技术有限公司 Method and system for optimizing display panel process parameters
CN113392175A (en) * 2020-03-12 2021-09-14 北京有限元科技有限公司 Method, apparatus and storage medium for predicting dialect combinations
CN113657917A (en) * 2020-05-12 2021-11-16 上海佳投互联网技术集团有限公司 Visitor gender analysis method and system based on USER-AGENT
CN111737319B (en) * 2020-06-24 2024-03-01 平安科技(深圳)有限公司 User cluster prediction method, device, computer equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110222724A1 (en) * 2010-03-15 2011-09-15 Nec Laboratories America, Inc. Systems and methods for determining personal characteristics
CN103530540A (en) * 2013-09-27 2014-01-22 西安交通大学 User identity attribute detection method based on man-machine interaction behavior characteristics
CN103679139A (en) * 2013-11-26 2014-03-26 闻泰通讯股份有限公司 Face recognition method based on particle swarm optimization BP network
CN104573048A (en) * 2015-01-20 2015-04-29 电子科技大学 User basic attribute predicting method based on flow data of smart phone
CN106682686A (en) * 2016-12-09 2017-05-17 北京拓明科技有限公司 User gender prediction method based on mobile phone Internet-surfing behavior

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105469041B (en) * 2015-11-19 2019-05-24 上海交通大学 Face point detection system based on multitask regularization and layer-by-layer supervision neural network
CN107180044A (en) * 2016-03-09 2017-09-19 精硕科技(北京)股份有限公司 Recognize Internet user's sex method and system
CN107247947B (en) * 2017-07-07 2021-02-09 智慧眼科技股份有限公司 Face attribute identification method and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110222724A1 (en) * 2010-03-15 2011-09-15 Nec Laboratories America, Inc. Systems and methods for determining personal characteristics
CN103530540A (en) * 2013-09-27 2014-01-22 西安交通大学 User identity attribute detection method based on man-machine interaction behavior characteristics
CN103679139A (en) * 2013-11-26 2014-03-26 闻泰通讯股份有限公司 Face recognition method based on particle swarm optimization BP network
CN104573048A (en) * 2015-01-20 2015-04-29 电子科技大学 User basic attribute predicting method based on flow data of smart phone
CN106682686A (en) * 2016-12-09 2017-05-17 北京拓明科技有限公司 User gender prediction method based on mobile phone Internet-surfing behavior

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111209173A (en) * 2020-01-02 2020-05-29 腾讯科技(深圳)有限公司 Performance prediction method, device, storage medium and electronic equipment
CN111209173B (en) * 2020-01-02 2023-10-31 腾讯科技(深圳)有限公司 Gender prediction method and device, storage medium and electronic equipment
CN113095473A (en) * 2020-01-09 2021-07-09 富士通株式会社 Neural network architecture search system and method, and computer-readable recording medium
CN111652432A (en) * 2020-06-01 2020-09-11 北京达佳互联信息技术有限公司 Method and device for determining user attribute information, electronic equipment and storage medium
CN112632351A (en) * 2020-12-28 2021-04-09 北京百度网讯科技有限公司 Training method, classification method, device and equipment of classification model
CN112632351B (en) * 2020-12-28 2024-01-16 北京百度网讯科技有限公司 Classification model training method, classification method, device and equipment
CN113535700A (en) * 2021-07-19 2021-10-22 福建凯米网络科技有限公司 User information updating method for digital audio-visual place and computer readable storage medium

Also Published As

Publication number Publication date
CN109948633A (en) 2019-06-28

Similar Documents

Publication Publication Date Title
WO2019120019A1 (en) User gender prediction method and apparatus, storage medium and electronic device
US10846643B2 (en) Method and system for predicting task completion of a time period based on task completion rates and data trend of prior time periods in view of attributes of tasks using machine learning models
US10958748B2 (en) Resource push method and apparatus
WO2019062413A1 (en) Method and apparatus for managing and controlling application program, storage medium, and electronic device
WO2020092281A1 (en) Probabilistic neural network architecture generation
US10825072B2 (en) System for producing recommendations and predicting purchases of products based on usage patterns
WO2019062414A1 (en) Method and apparatus for managing and controlling application program, storage medium, and electronic device
JP2012058972A (en) Evaluation prediction device, evaluation prediction method, and program
CN109256147B (en) Audio beat detection method, device and storage medium
CN110069715A (en) A kind of method of information recommendation model training, the method and device of information recommendation
CN113284142B (en) Image detection method, image detection device, computer-readable storage medium and computer equipment
WO2019062405A1 (en) Application program processing method and apparatus, storage medium, and electronic device
CN112580733B (en) Classification model training method, device, equipment and storage medium
CN113538070B (en) User life value cycle detection method and device and computer equipment
CN111898675B (en) Credit wind control model generation method and device, scoring card generation method, machine readable medium and equipment
WO2019062317A1 (en) Application program control method and electronic device
WO2019085750A1 (en) Application program control method and apparatus, medium, and electronic device
WO2020000715A1 (en) Index feature extraction-based stock index prediction method, server and storage medium
CN115271931A (en) Credit card product recommendation method and device, electronic equipment and medium
CN113344647B (en) Information recommendation method and device
CN115034379A (en) Causal relationship determination method and related equipment
CN114282587A (en) Data processing method and device, computer equipment and storage medium
CN113763077A (en) Method and apparatus for detecting false trade orders
WO2020252925A1 (en) Method and apparatus for searching user feature group for optimized user feature, electronic device, and computer nonvolatile readable storage medium
US20240046065A1 (en) System, devices and/or processes for defining a search space for neural network processing device architectures

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18890199

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18890199

Country of ref document: EP

Kind code of ref document: A1