CN108280542B - User portrait model optimization method, medium and equipment - Google Patents

User portrait model optimization method, medium and equipment Download PDF

Info

Publication number
CN108280542B
CN108280542B CN201810035915.4A CN201810035915A CN108280542B CN 108280542 B CN108280542 B CN 108280542B CN 201810035915 A CN201810035915 A CN 201810035915A CN 108280542 B CN108280542 B CN 108280542B
Authority
CN
China
Prior art keywords
prediction model
sample data
training
prediction
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810035915.4A
Other languages
Chinese (zh)
Other versions
CN108280542A (en
Inventor
宋国庆
罗伟东
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Hexun Huagu Information Technology Co ltd
Original Assignee
Shenzhen Hexun Huagu Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Hexun Huagu Information Technology Co ltd filed Critical Shenzhen Hexun Huagu Information Technology Co ltd
Priority to CN201810035915.4A priority Critical patent/CN108280542B/en
Publication of CN108280542A publication Critical patent/CN108280542A/en
Application granted granted Critical
Publication of CN108280542B publication Critical patent/CN108280542B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0201Market modelling; Market analysis; Collecting market data
    • G06Q30/0203Market surveys; Market polls
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0255Targeted advertisements based on user history
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q30/00Commerce
    • G06Q30/02Marketing; Price estimation or determination; Fundraising
    • G06Q30/0241Advertisements
    • G06Q30/0251Targeted advertisements
    • G06Q30/0269Targeted advertisements based on user profile or attribute
    • G06Q30/0271Personalized advertisement

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Development Economics (AREA)
  • Accounting & Taxation (AREA)
  • Finance (AREA)
  • Economics (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Physics & Mathematics (AREA)
  • Marketing (AREA)
  • Game Theory and Decision Science (AREA)
  • General Business, Economics & Management (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Human Resources & Organizations (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Tourism & Hospitality (AREA)
  • Data Mining & Analysis (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention provides a method, medium and equipment for optimizing a user portrait model. The method comprises the following steps: acquiring user behavior data; obtaining a first prediction result based on a pre-established first prediction model according to the behavior data; and training and optimizing a second prediction model according to the first prediction result and the behavior data. The method comprises the steps of obtaining a first prediction result according to a first prediction model by obtaining user behavior data, training and optimizing a second prediction model according to the first prediction result and the behavior data, and improving the prediction accuracy of the second prediction model by adding an input sample data type of the training and optimizing model to the second prediction model; meanwhile, the second prediction model is trained and optimized by utilizing the prediction result of the first prediction model, and when the first prediction model is changed, automatic training and optimization of the second prediction model can be realized, so that time can be saved, and cost can be reduced.

Description

User portrait model optimization method, medium and equipment
Technical Field
The invention relates to the field of big data machine learning, in particular to an optimization method, medium and equipment of a user portrait model.
Background
Under the background of big data, advertisements recommended by purposive and classified products need to be pushed according to labels of behaviors, sexes, ages and the like of users, so that the purposes of user portrait subdivision and accurate marketing are achieved. With the continuous upgrading of internet technology, when a user uses user data to establish a user portrait, the user receives various information at any moment, the selection made at each moment is different, and the user is difficult to analyze and predict behaviors by using a fixed model. Therefore, the real-time performance is lost, the phenomena of message lag and inaccurate prediction result are caused, the marketing effect is poor, and the advertisement promotion conversion rate is low. If the model is manually modified and updated each time, the method is huge in time cost and labor cost and is not good in effect.
Disclosure of Invention
Aiming at the defects in the prior art, the invention provides an optimization method, medium and equipment of a user portrait model, which can improve the accuracy of a prediction model, save the model optimization time and reduce the cost.
In a first aspect, the present invention provides a method for optimizing a user portrait model, including:
acquiring user behavior data;
obtaining a first prediction result based on a pre-established first prediction model according to the behavior data;
and training and optimizing a second prediction model according to the first prediction result and the behavior data.
Optionally, before the step of obtaining a first prediction result based on a first pre-established prediction model according to the first behavior data, the method further includes:
acquiring sample data;
classifying the sample data;
performing data cleaning on the classified sample data to obtain characteristic sample data of the sample data;
and training a first prediction model according to the characteristic sample data.
Optionally, the classifying the sample data includes:
and classifying the sample data according to the behavior mode.
Optionally, before the step of training the first prediction model according to the feature sample data, the method further includes:
combining the characteristic sample data to obtain new characteristic sample data;
training a first prediction model according to the feature sample data, including:
and training a first prediction model according to the new characteristic sample data.
Optionally, after the step of combining the feature sample data to obtain new feature sample data, the method further includes:
carrying out normalization processing on the new characteristic sample data;
training a first prediction model according to the feature sample data, including:
and training a first prediction model according to the new characteristic sample data after normalization processing.
Optionally, after the step of training the first prediction model, the method further includes:
acquiring test data;
calculating an accuracy score of the first predictive model from the test data;
judging whether the accuracy score is larger than a preset accuracy threshold value or not;
if so, executing the step of obtaining a first prediction result based on a pre-established first prediction model according to the behavior data;
if not, the step of training the first prediction model according to the characteristic sample data is executed again.
Optionally, before the step of training and optimizing the second prediction model according to the first prediction result and the behavior data, the method further includes:
obtaining a confidence level of the first prediction result based on the first prediction model;
judging whether the confidence coefficient is smaller than a corresponding first threshold value; if the characteristic sample data is less than the preset characteristic sample data, the step of training the first prediction model according to the characteristic sample data is executed again;
if not, outputting the first prediction result; judging whether the confidence coefficient is larger than a corresponding second threshold value;
if so, judging that the first prediction result can be used as a training feature; if not, judging that the first prediction result can not be used as a training feature;
training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the training and optimizing the second prediction model comprises the following steps:
and training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the confidence coefficient is larger than the second threshold value.
In a second aspect, the present invention provides a computer readable storage medium having stored thereon a computer program which, when executed by a processor, implements a method of optimizing a user representation model as described above.
In a third aspect, the present invention provides an apparatus for optimizing a user profile model, comprising: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing a method of optimizing a user representation model as described above when executing the program.
The invention provides an optimization method of a user portrait model, which comprises the following steps: acquiring user behavior data; obtaining a first prediction result based on a pre-established first prediction model according to the behavior data; and training and optimizing a second prediction model according to the first prediction result and the behavior data. The method comprises the steps of obtaining a first prediction result according to a first prediction model by obtaining user behavior data, training and optimizing a second prediction model according to the first prediction result and the behavior data, and improving the prediction accuracy of the second prediction model by adding an input sample data type of the training and optimizing model to the second prediction model; meanwhile, the second prediction model is trained and optimized by utilizing the prediction result of the first prediction model, and when the first prediction model is changed, automatic training and optimization of the second prediction model can be realized, so that time can be saved, and cost can be reduced.
The invention provides a computer readable storage medium and a user portrait model optimization device, which have the same beneficial effects with the user portrait model optimization method based on the same inventive concept.
Drawings
In order to more clearly illustrate the detailed description of the invention or the technical solutions in the prior art, the drawings that are needed in the detailed description of the invention or the prior art will be briefly described below. Throughout the drawings, like elements or portions are generally identified by like reference numerals. In the drawings, elements or portions are not necessarily drawn to scale.
FIG. 1 is a flow chart of a method for optimizing a user profile model according to the present invention;
FIG. 2 is a schematic structural diagram of an optimizing apparatus for a user portrait model according to the present invention.
Detailed Description
Embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The following examples are only for illustrating the technical solutions of the present invention more clearly, and therefore are only examples, and the protection scope of the present invention is not limited thereby.
It is to be noted that, unless otherwise specified, technical or scientific terms used herein shall have the ordinary meaning as understood by those skilled in the art to which the invention pertains.
The invention provides a method, medium and equipment for optimizing a user portrait model. Embodiments of the present invention will be described below with reference to the drawings.
The first embodiment:
referring to fig. 1, fig. 1 is a schematic diagram of an optimization method of a user portrait model according to an embodiment of the present invention, where the optimization method of the user portrait model according to the embodiment includes:
step S101: and acquiring user behavior data.
Step S102: and obtaining a first prediction result based on a first pre-established prediction model according to the behavior data.
Step S103: and training and optimizing a second prediction model according to the first prediction result and the behavior data.
The method comprises the steps of obtaining a first prediction result according to a first prediction model by obtaining user behavior data, training and optimizing a second prediction model according to the first prediction result and the behavior data, and improving the prediction accuracy of the second prediction model by adding an input sample data type of the training and optimizing model to the second prediction model; meanwhile, the second prediction model is trained and optimized by utilizing the prediction result of the first prediction model, and when the first prediction model is changed, automatic training and optimization of the second prediction model can be realized, so that time can be saved, and cost can be reduced.
In the invention, the first prediction model can be a plurality of prediction models or one prediction model; the second predictive model may be a plurality of predictive models or may be one predictive model. The number of the first prediction model and the second prediction model is not limited herein and is within the scope of the present invention.
For example, the first prediction results of the plurality of first prediction models may be used as training optimization data for the plurality of second prediction models.
In a specific embodiment of the present invention, before the step of obtaining a first prediction result based on a first pre-established prediction model according to the first behavior data, the method further includes: acquiring sample data; classifying the sample data; performing data cleaning on the classified sample data to obtain characteristic sample data of the sample data; and training a first prediction model according to the characteristic sample data.
Before the first prediction result is obtained by using the first prediction model, the method further includes: a first predictive model is trained.
The process of training the first predictive model is as follows:
the method comprises the following steps of firstly, obtaining sample data, wherein the sample data comprises input sample data and output sample data. Wherein the sample data may be user behavior data. E.g. what APPs are installed, which APPs are turned on, which places to go, the house price of residential cells, etc.
Secondly, classifying the sample data; during classification, sample data can be classified according to the behavior pattern. For example, the sample data may be categorized according to online and offline behavior, with users purchasing goods online being categorized into one category and purchasing goods offline being categorized into another category.
And thirdly, performing data cleaning on the classified sample data to obtain characteristic sample data. And cleaning the data, including deleting null values, deleting error data and the like.
And fourthly, training the first prediction model according to the characteristic sample data. When the prediction model is trained, a machine learning algorithm can be used, and a deep learning algorithm, a random forest algorithm and the like can also be used, which are all suitable for the invention.
When the random forest algorithm is used for training the model, parameter searching can be performed according to data dimensionality, and a plurality of models with different parameters can be trained simultaneously.
By classifying the sample data, unnecessary sample data can be avoided from being trained, and the calculation amount is saved. By cleaning the sample data, the input of error data, null data and the like can be avoided, and the accuracy of the prediction model can be improved.
In an embodiment of the present invention, before the step of training the first prediction model according to the feature sample data, the method may further include: combining the characteristic sample data to obtain new characteristic sample data; training a first prediction model according to the feature sample data, including: and training a first prediction model according to the new characteristic sample data.
After the step of obtaining the feature sample data and before the step of training the first prediction model according to the feature sample data, the method may further include: and combining the characteristic sample data to obtain new characteristic sample data.
When the feature sample data is combined, the relevance among the features can be analyzed, various features are combined, multi-layer linear transformation is carried out, and the feature space is rotated, so that more valuable features can be obtained. For example, we have a lot of APPs, which may all be some kind of APPs, such as ofo, mobarae, small blue bicycle, etc., which are all shared bicycles, though different APPs, but represent the same meaning, all installed by those who have riding requirements; the invention can combine and summarize the characteristics of installation ofo, Mobai, small blue bicycle and the like to obtain a new characteristic of 'talent installation with riding requirement'. Similarly, the invention can also be applied to APP of financial class, which is in the protection scope of the invention.
By combining the feature sample data, more new features can be obtained, and the prediction model is obtained by training the new feature sets, so that the accuracy and the real-time performance of the prediction model can be improved, the time and the labor cost can be obviously reduced, correct guidance is provided for operation and advertisement putting, and the operation and advertisement putting effect is greatly improved.
In a specific embodiment provided by the present invention, after the step of combining the feature sample data to obtain new feature sample data, the method may further include: carrying out normalization processing on the new characteristic sample data; training a first prediction model according to the feature sample data, including: and training a first prediction model according to the new characteristic sample data after normalization processing.
After combining the feature sample data to obtain new feature sample data, there may be a case where the span of the value range between the same features is large. By normalizing the features, namely performing maximum and minimum normalization on each feature: (x-min (x)/(max (x) — min (x)) so that the characteristic values of the columns can be scaled to 0 to 1, thereby increasing the calculation speed.
For example, there are two features: A. the user has money, the house price of a residence is 150000 yuan/square meter, B, another user rents a house, the house rents only 200 yuan/month, the difference value between the two values is too large, the convergence speed of a prediction model is extremely low in the subsequent calculation, the working efficiency is influenced, and the calculation speed can be improved after normalization processing is carried out on all the characteristics.
In an embodiment of the present invention, after the step of training the first prediction model, the method may further include: acquiring test data; calculating an accuracy score of the first predictive model from the test data; judging whether the accuracy score is larger than a preset accuracy threshold value or not; if so, executing the step of obtaining a first prediction result based on a pre-established first prediction model according to the behavior data; if not, the step of training the first prediction model according to the characteristic sample data is executed again.
When the first prediction model is a binary model, when the first prediction model is tested by using test data, the prediction result of each binary model is compared with the real data in the test data, and the quality of the model is evaluated by calculating F1 score: f1 ═ 2TP/(2TP + FP + FN). Higher F1 values indicate better model and higher accuracy. Wherein the F1 value is the accuracy score of the binary model. TP is that the real sample is a positive sample, and the prediction result is the number of the positive samples; FP is that the real sample is the negative sample, the prediction result is the number of the positive sample; FN is that the real sample is a negative sample, and the prediction result is the number of the negative samples.
When the first prediction model is a multi-classification model, the accuracy score is an accuracy rate. The accuracy of the stochastic test of the predictive model can be used as the accuracy score of the multi-classification model.
The present invention is applicable to any prediction model, such as an incorporatability prediction model, a gender prediction model, an age prediction model, etc., but the accuracy threshold of each model is different, which is set according to the specific model.
When the calculated accuracy score is greater than a preset accuracy threshold, the first prediction model can be used for inputting behavior data into the first prediction model to obtain a first prediction result; and when the calculated accuracy score is not greater than the preset accuracy threshold, the first prediction model is indicated to be unavailable, and the first prediction model needs to be trained again according to the characteristic sample data until the calculated accuracy score is greater than the accuracy threshold.
In the invention, when the first prediction model is predicted by using the test data, the first prediction model can be predicted in a sampling mode, so that the accuracy of the accuracy score can be ensured.
By calculating the accuracy value of the first prediction model and judging the quality of the prediction model according to the accuracy value, a more accurate prediction result can be obtained when the prediction model is used.
In a specific embodiment of the present invention, before the step of training and optimizing the second prediction model according to the first prediction result and the behavior data, the method further includes: obtaining a confidence level of the first prediction result based on the first prediction model; judging whether the confidence coefficient is smaller than a corresponding first threshold value; if the characteristic sample data is less than the preset characteristic sample data, the step of training the first prediction model according to the characteristic sample data is executed again; if not, outputting the first prediction result; judging whether the confidence coefficient is larger than a corresponding second threshold value; if so, judging that the first prediction result can be used as a training feature; if not, judging that the first prediction result can not be used as a training feature; training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the training and optimizing the second prediction model comprises the following steps: and training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the confidence coefficient is larger than the second threshold value.
After the first prediction result is obtained, the second prediction model can be trained and optimized by using the first prediction result and the behavior data of the user, so that the second prediction model can be trained and optimized by adopting more characteristics, and the real-time performance and the accuracy of the second prediction model can be improved.
Before training and optimizing the second prediction model, the method further comprises the following steps: based on the first prediction model, a confidence level of the first prediction result is obtained. When each prediction result is output, the confidence of the prediction result can be output. Confidence may be used to indicate the accuracy of each prediction, with higher confidence indicating more accurate prediction.
When the confidence coefficient is smaller than a first threshold value, the prediction result is unreliable, and the first prediction model needs to be retrained; if not, the result can be used, and guidance can be provided for advertisement promotion; although the prediction result may be used, it is not necessary that the prediction result may be used as training sample data or optimization sample data of the second prediction model. And when the confidence of the prediction result is greater than a second threshold, judging that the prediction result can be used as a training feature, and when the confidence of the prediction result is not greater than the second threshold, indicating that the prediction result cannot be used as the training feature.
If the prediction result can be used as a training feature of the second prediction model, then the second prediction model can be trained and optimized according to the prediction result and the behavior data. In this way, by ensuring the accuracy of the training data, the accuracy of the second prediction model can be improved.
For example, the collected user behavior data does not have the characteristic of gender, but the first prediction model can be used for predicting gender characteristics, the accuracy of the prediction result of the first prediction model can be ensured through the verification of the first prediction model, then, the gender prediction result of the first prediction model can be used for training and optimizing the second prediction model, the interest and preference characteristics can be predicted by the second prediction model, and the gender characteristics are added for training and optimizing the second prediction model, so that the accuracy of the interest and preference characteristic prediction can be improved.
By the method, a plurality of abundant characteristics can be obtained, and along with the improvement of the accuracy of a certain model, the accuracy of other models can be improved, so that other models can be optimized in real time.
The above is a method for optimizing a user portrait model according to the present invention.
Second embodiment:
in the first embodiment, an optimization method for a user representation model is provided, and in combination with the first embodiment, a second embodiment of the present invention provides a computer-readable storage medium, on which a computer program is stored, wherein the computer program, when executed by a processor, implements the optimization method for a user representation model provided by the first embodiment.
The third embodiment:
in combination with the first embodiment, the present invention provides a method for optimizing a user portrait model, and further provides an apparatus for optimizing a user portrait model, including: a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the method for optimizing a user representation model as provided by the first embodiment described above when executing the program. FIG. 2 is a schematic diagram illustrating a hardware structure of an optimization apparatus for a user representation model according to an embodiment of the present invention.
Specifically, the processor 201 may include a Central Processing Unit (CPU), or an Application Specific Integrated Circuit (ASIC), or may be configured as one or more Integrated circuits implementing the embodiments of the present invention.
Memory 202 may include mass storage for data or instructions. By way of example, and not limitation, memory 202 may include a Hard Disk Drive (HDD), a floppy Disk Drive, flash memory, an optical Disk, a magneto-optical Disk, tape, or a Universal Serial Bus (USB) Drive or a combination of two or more of these. Memory 202 may include removable or non-removable (or fixed) media, where appropriate. The memory 202 may be internal or external to the data processing apparatus, where appropriate. In a particular embodiment, the memory 202 is a non-volatile solid-state memory. In a particular embodiment, the memory 202 includes Read Only Memory (ROM). Where appropriate, the ROM may be mask-programmed ROM, Programmable ROM (PROM), Erasable PROM (EPROM), Electrically Erasable PROM (EEPROM), electrically rewritable ROM (EAROM), or flash memory or a combination of two or more of these.
The processor 201 may implement any of the above described embodiments of the method for optimizing a user representation model by reading and executing computer program instructions stored in the memory 202.
In one example, the user representation model optimization device may also include a communication interface 203 and a bus 210. As shown in fig. 2, the processor 201, the memory 202, and the communication interface 203 are connected via a bus 210 to complete communication therebetween.
The communication interface 203 is mainly used for implementing communication between modules, apparatuses, units and/or devices in the embodiments of the present invention.
Bus 210 includes hardware, software, or both to couple the components of the optimization device of the user representation model to each other. By way of example, and not limitation, a bus may include an Accelerated Graphics Port (AGP) or other graphics bus, an Enhanced Industry Standard Architecture (EISA) bus, a Front Side Bus (FSB), a Hypertransport (HT) interconnect, an Industry Standard Architecture (ISA) bus, an infiniband interconnect, a Low Pin Count (LPC) bus, a memory bus, a Micro Channel Architecture (MCA) bus, a Peripheral Component Interconnect (PCI) bus, a PCI-Express (PCI-X) bus, a Serial Advanced Technology Attachment (SATA) bus, a video electronics standards association local (VLB) bus, or other suitable bus or a combination of two or more of these. Bus 210 may include one or more buses, where appropriate. Although specific buses have been described and shown in the embodiments of the invention, any suitable buses or interconnects are contemplated by the invention.
It is to be understood that the invention is not limited to the specific arrangements and instrumentality described above and shown in the drawings. A detailed description of known methods is omitted herein for the sake of brevity. In the above embodiments, several specific steps are described and shown as examples. However, the method processes of the present invention are not limited to the specific steps described and illustrated, and those skilled in the art can make various changes, modifications and additions or change the order between the steps after comprehending the spirit of the present invention.
The functional blocks shown in the above structural block diagrams may be implemented as hardware, software, firmware, or a combination thereof. When implemented in hardware, it may be, for example, an electronic circuit, an Application Specific Integrated Circuit (ASIC), suitable firmware, plug-in, function card, or the like. When implemented in software, the elements of the invention are the programs or code segments used to perform the required tasks. The program or code segments may be stored in a machine-readable medium or transmitted by a data signal carried in a carrier wave over a transmission medium or a communication link. A "machine-readable medium" may include any medium that can store or transfer information. Examples of a machine-readable medium include electronic circuits, semiconductor memory devices, ROM, flash memory, Erasable ROM (EROM), floppy disks, CD-ROMs, optical disks, hard disks, fiber optic media, Radio Frequency (RF) links, and so forth. The code segments may be downloaded via computer networks such as the internet, intranet, etc.
Finally, it should be noted that: the above embodiments are only used to illustrate the technical solution of the present invention, and not to limit the same; while the invention has been described in detail and with reference to the foregoing embodiments, it will be understood by those skilled in the art that: the technical solutions described in the foregoing embodiments may still be modified, or some or all of the technical features may be equivalently replaced; such modifications and substitutions do not depart from the spirit and scope of the present invention, and they should be construed as being included in the following claims and description.

Claims (4)

1. A method for optimizing a user portrait model, comprising:
acquiring user behavior data;
obtaining a first prediction result based on a pre-established first prediction model according to the behavior data;
training and optimizing a second prediction model according to the first prediction result and the behavior data;
before the step of obtaining a first prediction result based on a first pre-established prediction model according to the behavior data, the method further includes:
acquiring sample data;
classifying the sample data;
performing data cleaning on the classified sample data to obtain characteristic sample data of the sample data;
training a first prediction model according to the characteristic sample data;
before the step of training the first prediction model according to the feature sample data, the method further comprises:
combining the characteristic sample data to obtain new characteristic sample data;
training a first prediction model according to the feature sample data, including:
training a first prediction model according to the new characteristic sample data;
when the feature sample data is combined, the relevance among the features is analyzed, various features are combined, multi-layer linear transformation is carried out, and the feature space is rotated;
after the step of combining the feature sample data to obtain new feature sample data, the method further includes: carrying out normalization processing on the new characteristic sample data; training a first prediction model according to the feature sample data, including: training a first prediction model according to the new characteristic sample data after normalization processing;
after the step of training the first prediction model, the method further comprises:
acquiring test data;
calculating an accuracy score of the first predictive model from the test data;
judging whether the accuracy score is larger than a preset accuracy threshold value or not;
if so, executing the step of obtaining a first prediction result based on a pre-established first prediction model according to the behavior data;
if not, re-executing the step of training the first prediction model according to the characteristic sample data;
before the step of training and optimizing a second prediction model according to the first prediction result and the behavior data, the method further comprises the following steps:
obtaining a confidence level of the first prediction result based on the first prediction model;
judging whether the confidence coefficient is smaller than a corresponding first threshold value; if the characteristic sample data is less than the preset characteristic sample data, the step of training the first prediction model according to the characteristic sample data is executed again;
if not, outputting the first prediction result; judging whether the confidence coefficient is larger than a corresponding second threshold value;
if so, judging that the first prediction result can be used as a training feature; if not, judging that the first prediction result can not be used as a training feature;
training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the training and optimizing the second prediction model comprises the following steps:
and training and optimizing a second prediction model according to the first prediction result and the behavior data, wherein the confidence coefficient is larger than the second threshold value.
2. The method of claim 1, wherein said classifying said sample data comprises:
and classifying the sample data according to the behavior mode.
3. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method of one of claims 1-2.
4. An apparatus for optimizing a user representation model, comprising: memory, processor and computer program stored on the memory and executable on the processor, characterized in that the processor implements the method of one of claims 1-2 when executing the program.
CN201810035915.4A 2018-01-15 2018-01-15 User portrait model optimization method, medium and equipment Active CN108280542B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810035915.4A CN108280542B (en) 2018-01-15 2018-01-15 User portrait model optimization method, medium and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810035915.4A CN108280542B (en) 2018-01-15 2018-01-15 User portrait model optimization method, medium and equipment

Publications (2)

Publication Number Publication Date
CN108280542A CN108280542A (en) 2018-07-13
CN108280542B true CN108280542B (en) 2021-05-11

Family

ID=62803631

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810035915.4A Active CN108280542B (en) 2018-01-15 2018-01-15 User portrait model optimization method, medium and equipment

Country Status (1)

Country Link
CN (1) CN108280542B (en)

Families Citing this family (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109214448B (en) * 2018-08-27 2023-04-07 平安科技(深圳)有限公司 Non-performance person training method, system, terminal and computer readable storage medium
CN109447694B (en) * 2018-10-11 2022-04-12 上海瀚之友信息技术服务有限公司 User characteristic analysis method and system
CN109933698B (en) * 2019-02-27 2021-06-08 腾讯科技(深圳)有限公司 User portrait source verification method and device
CN110083517B (en) * 2019-04-29 2023-08-25 秒针信息技术有限公司 User image confidence optimization method and device
CN110134722A (en) * 2019-05-22 2019-08-16 北京小度信息科技有限公司 Target user determines method, apparatus, equipment and storage medium
CN110634044A (en) * 2019-08-09 2019-12-31 万翼科技有限公司 Park user behavior analysis method and device, computer equipment and storage medium
CN111091489B (en) * 2019-11-01 2024-05-07 平安科技(深圳)有限公司 Picture optimization method and device, electronic equipment and storage medium
CN113538029A (en) * 2020-04-22 2021-10-22 中国移动通信集团上海有限公司 User behavior data prediction method, device, equipment and medium
TWI777154B (en) * 2020-04-23 2022-09-11 和碩聯合科技股份有限公司 Electronic device and method for turnover rate prediction
CN112133431A (en) * 2020-08-27 2020-12-25 绿瘦健康产业集团有限公司 Health information message pushing method, device, medium and terminal equipment
CN112182333A (en) * 2020-09-25 2021-01-05 山东亿云信息技术有限公司 Talent space-time big data processing method and system based on random forest
CN112967424A (en) * 2021-02-02 2021-06-15 广州橙行智动汽车科技有限公司 Simulation method and device of vehicle-mounted Bluetooth key

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103502899A (en) * 2011-01-26 2014-01-08 谷歌公司 Dynamic predictive modeling platform
CN104464291A (en) * 2014-12-08 2015-03-25 杭州智诚惠通科技有限公司 Traffic flow predicting method and system
CN106776925A (en) * 2016-11-30 2017-05-31 腾云天宇科技(北京)有限公司 A kind of Forecasting Methodology of mobile terminal user's sex, server and system
CN107330445A (en) * 2017-05-31 2017-11-07 北京京东尚科信息技术有限公司 The Forecasting Methodology and device of user property

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN100541523C (en) * 2007-09-29 2009-09-16 华为技术有限公司 A kind of object video recognition methods and system based on support vector machine
US20090259534A1 (en) * 2008-04-11 2009-10-15 Microsoft Corporation Internal business arbitrage
CN101739555B (en) * 2009-12-01 2014-11-26 北京中星微电子有限公司 Method and system for detecting false face, and method and system for training false face model
WO2012075600A1 (en) * 2010-12-06 2012-06-14 Yahoo! Inc. System and method for list ranking and ads placement using interaction features
CN102567391B (en) * 2010-12-20 2014-05-07 中国移动通信集团广东有限公司 Method and device for building classification forecasting mixed model
CN104810017B (en) * 2015-04-08 2018-07-17 广东外语外贸大学 Oral evaluation method and system based on semantic analysis

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103502899A (en) * 2011-01-26 2014-01-08 谷歌公司 Dynamic predictive modeling platform
CN104464291A (en) * 2014-12-08 2015-03-25 杭州智诚惠通科技有限公司 Traffic flow predicting method and system
CN106776925A (en) * 2016-11-30 2017-05-31 腾云天宇科技(北京)有限公司 A kind of Forecasting Methodology of mobile terminal user's sex, server and system
CN107330445A (en) * 2017-05-31 2017-11-07 北京京东尚科信息技术有限公司 The Forecasting Methodology and device of user property

Also Published As

Publication number Publication date
CN108280542A (en) 2018-07-13

Similar Documents

Publication Publication Date Title
CN108280542B (en) User portrait model optimization method, medium and equipment
CN110147456B (en) Image classification method and device, readable storage medium and terminal equipment
CN106095942B (en) Strong variable extracting method and device
CN113159355A (en) Data prediction method, data prediction device, logistics cargo quantity prediction method, medium and equipment
CN109993183B (en) Network fault evaluation method and device, computing equipment and storage medium
US20200027034A1 (en) System and method for relationship identification
CN111582315B (en) Sample data processing method and device and electronic equipment
CN114549136A (en) Method, device and equipment for evaluating credit of supply chain user and computer storage medium
CN113919432A (en) Classification model construction method, data classification method and device
CN111385659A (en) Video recommendation method, device, equipment and storage medium
CN111144575B (en) Public opinion early warning model training method, early warning method, device, equipment and medium
CN117078016A (en) Enterprise risk assessment method, system, equipment and medium thereof
CN114417830A (en) Risk evaluation method, device, equipment and computer readable storage medium
CN115392787A (en) Enterprise risk assessment method, device, equipment, storage medium and program product
CN112749998A (en) Income information output method and device, electronic equipment and computer storage medium
CN111160419A (en) Electronic transformer data classification prediction method and device based on deep learning
CN112417959A (en) Picture generation method and device, electronic equipment and computer storage medium
CN113128810B (en) Training method, device, equipment and storage medium of scoring model
CN117035888B (en) New energy vehicle residual value acquisition method and device, electronic equipment and storage medium
CN111476281B (en) Information popularity prediction method and device
CN117093845A (en) Model training method, data processing method, device, equipment, medium and product
CN116910555A (en) Training and application methods, devices, equipment and media of user credit prediction model
CN116703608A (en) Training method, device, equipment and storage medium for evaluation factor selection model
CN116739606A (en) Credit risk prediction model construction method, apparatus, device and storage medium
CN117745413A (en) Enterprise credit assessment method, device, equipment and medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Optimization method, medium and device of a user portrait model

Effective date of registration: 20221031

Granted publication date: 20210511

Pledgee: Shenzhen small and medium sized small loan Co.,Ltd.

Pledgor: SHENZHEN HEXUN HUAGU INFORMATION TECHNOLOGY CO.,LTD.

Registration number: Y2022440020221

PE01 Entry into force of the registration of the contract for pledge of patent right