WO2021179715A1 - Hidden markov model-based resignation prediction method and related device - Google Patents

Hidden markov model-based resignation prediction method and related device Download PDF

Info

Publication number
WO2021179715A1
WO2021179715A1 PCT/CN2020/135184 CN2020135184W WO2021179715A1 WO 2021179715 A1 WO2021179715 A1 WO 2021179715A1 CN 2020135184 W CN2020135184 W CN 2020135184W WO 2021179715 A1 WO2021179715 A1 WO 2021179715A1
Authority
WO
WIPO (PCT)
Prior art keywords
job
hidden markov
markov model
resignation
employee
Prior art date
Application number
PCT/CN2020/135184
Other languages
French (fr)
Chinese (zh)
Inventor
夏婧
吴振宇
王建明
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2021179715A1 publication Critical patent/WO2021179715A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/23Clustering techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/29Graphical models, e.g. Bayesian networks
    • G06F18/295Markov models or related models, e.g. semi-Markov models; Markov random fields; Networks embedding Markov models
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/10Office automation; Time management
    • G06Q10/105Human resources
    • G06Q10/1053Employment or hiring

Definitions

  • This application relates to the field of big data technology, and in particular to a method and related devices for predicting turnover based on a hidden Markov model.
  • the inventor is aware of the evaluation indicators used in the existing methods of predicting agent turnover, such as the effectiveness of interview selection, the pertinence in the training process, the quantitative measurement of the usual high-intensity workload, and the prediction of long-term working conditions, etc. Most are based on human judgment, which inevitably brings subjective arbitrariness and random uncertainty to human resource management.
  • a method for predicting turnover based on a hidden Markov model comprising:
  • a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of job status and turnover probability;
  • the static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
  • a resignation prediction device based on a hidden Markov model comprising:
  • the first acquisition module is used to construct a pre-training hidden Markov model based on the employee’s static information and the influence of the work status on the employee’s intention to leave, and to use the sample data of the hired employee to hide the pre-training
  • the Markov model is trained to obtain the hidden Markov model used to predict the job status and the probability of leaving the job;
  • a second acquisition module is used to acquire job applicants’ application data, and build a static information feature data set of the job applicants based on the application data;
  • the prediction module is configured to input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant enters the job.
  • the present application also provides a computer device, including a memory and a processor, the memory storing program instructions for implementing the following steps, the steps including:
  • a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of work status and turnover probability; acquiring job applicant’s application data, and constructing a static information feature data set of the job applicant based on the application data; inputting the static information feature data set into the hidden Markov
  • the husband model predicts the job status and the probability of leaving the job after the job applicant enters the job.
  • the processor is configured to execute program instructions stored in the memory.
  • the present application also provides a storage medium storing computer readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
  • a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of work status and turnover probability; acquiring job applicant’s application data, and constructing a static information feature data set of the job applicant based on the application data; inputting the static information feature data set into the hidden Markov
  • the husband model predicts the job status and the probability of leaving the job after the job applicant enters the job.
  • the Hidden Markov Model-based Resignation Prediction Method and related devices of the present application construct a pre-trained Hidden Markov Model based on the influence of the employee’s static information and work status on the employee’s resignation intention, and
  • the pre-trained hidden Markov model is trained through the sample data of hired employees to obtain a hidden Markov model for predicting the job status and the probability of leaving the job; the application data of the job applicant is obtained, and the construction is based on the application data
  • the static information feature data set of the job applicant; the static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
  • the human resources management department can make a decision on whether to hire or not based on the predicted results, so as to avoid hiring unstable job applicants who are easy to resign into the company. , Thereby reducing the loss caused by employee resignation, while also reducing the cost of manpower recruitment.
  • the hidden Markov model used in this application is a time series model, and its prediction results also include the time of resignation. Therefore, the resignation prediction method provided by this application has strong scalability on the time scale, and the subsequent sequence can be further realized Respond to resignation warnings in real time.
  • FIG. 1 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to an embodiment of the present application
  • Fig. 2 is a schematic diagram of a structure of the pre-trained hidden Markov model in the method shown in Fig. 1;
  • FIG. 3 is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application
  • FIG. 4 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model in another embodiment of the present application
  • FIG. 5 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application
  • Fig. 6 is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application.
  • FIG. 7 is a schematic structural diagram of a resignation prediction device based on a hidden Markov model in an embodiment of the present application.
  • FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present application.
  • FIG. 9 is a schematic structural diagram of a storage medium according to an embodiment of the present application.
  • this application provides a method and related devices for resignation prediction based on a hidden Markov model.
  • FIG. 1 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to an embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 1. As shown in Figure 1, the method includes:
  • Step S101 Based on the influence of the employee’s static information and work status on the employee’s intention to leave, construct a pre-trained hidden Markov model, and train the pre-trained hidden Markov model based on the sample data of hired employees to obtain predictions Hidden Markov Model of Job Seeker's Job Status and Resignation Probability.
  • the static information of the employee includes some basic personal information of the employee, such as name, age, home address, telephone number, graduate school, education, major, project experience, internship experience, personal evaluation, and professional skills Etc., these basic information are generally not changed in general, and can be extracted and integrated from the application data generated when employees apply for a job.
  • the application data may include any one or a combination of resume data, interview video data, and written examination data.
  • the work status of employees includes employee performance, employee attendance, employee leave frequency, leadership scoring records, colleague evaluation records, and employee rewards and punishments.
  • the work status data generated after the employee enters the job can be stored in the employee management system, and can be extracted from it when constructing the data set required for training or validating the model.
  • the employee’s resignation intention that is, the employee’s likelihood of resignation, can be marked with a numerical value.
  • the value 1 means that the employee resigns
  • the value 0 means that the employee does not resign.
  • the employee’s work status and resignation intentions are arranged in a one-to-one correspondence according to time.
  • the resignation intention label value For incumbent employees, set their resignation intention label value to 0 after entry, that is, for incumbent employees, the resignation intention label value corresponding to their work status at any time can be assumed to be 0.
  • the resignation intention label value before resignation is set to 0
  • the resignation intention label value at the time of resignation is set to 1, that is, the resignation intention label value corresponding to the work status data generated at the time of resignation is required Assume it is 1.
  • the static information of the employee will have a certain impact on the employee's work status and resignation intention after entry. For example, if an employee has high job expectations for the position he is applying for when applying for a job, these characteristics may be manifested during the interview, and his working status after entering the job may also be better, resulting in a weaker intention to leave. Or, when an employee is a fresh graduate with no work experience, he can predict whether his professional knowledge is solid and his learning ability is strong through the school he attends, the type of activities he participates, etc. These characteristics will generally be directly recorded in the resume data middle.
  • the size of the city where employees are located will affect their acceptance of commuting distance. For example, in first-tier cities, one-hour commute per day is acceptable, while in third-tier and fourth-tier cities, one-hour commute per day is not easy to accept. That is, when the employee's city and commuting time are within the acceptable range, their intention to leave will be weaker; otherwise, their intention to leave will be stronger.
  • the employee’s static information is used as input
  • the employee’s working status is the implicit state
  • the employee’s resignation intention is the observation state
  • X represents the static information of the employee, which is the input item of the model
  • A, B, and C are all model parameters
  • is the initial state probability matrix.
  • FIG. 2 is a schematic structural diagram of the pre-training hidden Markov model constructed in step S101.
  • the employee’s work status Y is the implicit status, and the employee’s resignation intention Z is the observation status
  • A is the first observation probability matrix, and the elements in the first observation probability matrix Static information indicates when X i, Y j operating state of probability
  • B is the state transition probability matrix, the state transition probability matrix element Represents the probability of the transition from the working state Y j to the working state Y k
  • C is the second observation probability matrix, and the elements in the second observation probability matrix Static information is represented by X i, Z j turnover intention of probability, i, j and k are a natural number greater than or equal to 1.
  • the hidden state sequence ⁇ Y 1 , Y 2 ,..., Y n ⁇ can be determined by analyzing the work status data of a sample of hired employees, that is, each hidden state Y j in the hidden state sequence represents the employee’s position. This kind of working status is given.
  • the sample of hired employees includes samples of current employees and samples of resigned employees.
  • the samples of resigned employees include samples of resigned employees and samples of employees who have submitted resignation applications but have not formally resigned.
  • the hidden state sequence is determined by a statistical method. For example, first count the job status categories of each hired employee sample. Considering that different hired employee samples may have the same or similar job status categories, you can use any appropriate method in the field to de-duplicate the same job status categories. Or directly cluster the job status categories of each sample of hired employees, and finally establish an implicit status sequence based on all the job status categories after deduplication or the clustered job status categories.
  • the hired employee sample data includes application data generated when each employee sample applies for employment, job status data generated after entry, and resignation intention data that corresponds to the job status data in a chronological order.
  • the method for training the pre-trained hidden Markov model using the sample data of hired employees includes: first establishing training set data, the training set data includes static information feature data of a plurality of first hired employee samples Set, work status feature data set sequence, and resignation intention tag value sequence, where the resignation intention tag value in the resignation intention tag value sequence corresponds to the work status feature data set in the work status feature data set sequence in a one-to-one order.
  • a sample of hired employees includes a sample of the first incumbent employee and a sample of the first resigned employee. Then use the training set data to train the pre-trained hidden Markov model.
  • the hired employee sample data is divided into two parts: the first hired employee sample data and the second hired employee sample data.
  • the first hired employee sample data is used to establish training set data, and the training set data is used to train the pre-trained hidden Markov model constructed as described above to obtain model parameters.
  • the second hired employee sample data is used to establish test set data, and the test set data is used to test whether the model parameters obtained after training are better or optimal model parameters.
  • the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
  • the deep learning graph neural network model is used for resume text data, interview video data and written test text data. Carry out dimensionality reduction classification to realize the efficient conversion, analysis and integration of unstructured data to improve the accuracy of model prediction.
  • the method for obtaining resume information features includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs resume information features corresponding to the resume text.
  • the resume information features include educational background, age, project experience, internship experience, personal evaluation, professional skills, etc.
  • the method for acquiring features of the interview video includes: inputting at least one frame of images in the interview video into a deep learning facet neural network model, and the deep learning facet neural network model outputs interview video features corresponding to the image.
  • the interview video features include facial expressions, dress, etiquette, interview duration, etc.
  • the method for acquiring written test information features includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputs written test information features corresponding to the written test text.
  • the written test information features include personality test scores, professional skill scores, and the like.
  • the work status feature data set includes any one or a combination of employee performance, employee attendance, employee leave frequency, leadership scoring, colleague evaluation, employee rewards and punishments.
  • the resignation intention label value sequence is a sequence composed of 0 and 1, such as Indicates that the employee has not resigned at time t and the work status is Y j , and then It means that the employee resigns when the work status is Y j at time t+1.
  • the employee’s static information feature data set and the employee’s work status data set sequence are used as input, and the employee’s turnover intention value is output by adjusting the model parameters A, B, and C, and then the output intention value of the employee Compare with the corresponding actual resignation intention label value. If the output resignation intention is consistent with the actual resignation intention, determine the model parameters A, B, and C at this time; otherwise, continue to adjust the model parameters A, B, and C.
  • the employee turnover intention value output by the model ranges from 0 to 1.
  • the value of the employee's resignation intention output by the model is between 0-0.5, it indicates that the employee's resignation intention is low, and it is judged not to resign. If the corresponding actual resignation intention label value is 0 at this time, it can be judged that the employee resignation intention value output by the model is consistent with the actual resignation intention label value, otherwise the judgment is inconsistent.
  • the value of the employee's resignation intention output by the model is between 0.5 and 1, it indicates that the employee's resignation intention is high, and the employee is judged to resign. If the corresponding actual resignation intention label value is 1, it can be judged that the employee resignation intention value output by the model is consistent with the actual resignation intention label value, otherwise, the judgment is inconsistent.
  • the test inspection method includes: inputting the static information feature data set in the test set data into the pre-trained hidden Markov model after training for testing, and outputting the work status feature data of each second hired employee sample Set prediction sequence and resignation intention label value prediction sequence. Then the work status feature data set prediction sequence and the resignation intention label value prediction sequence are compared with the corresponding work status feature data set sequence and the resignation intention label value sequence in the test set data respectively, and the prediction accuracy rate is calculated.
  • the training ends and the model parameters are determined; when the prediction accuracy is less than the preset accuracy threshold, return to re-establish the training set data to repeat the training process to obtain the results after training
  • the model parameters are optimized until a Hidden Markov Model with high prediction accuracy can be used to predict the job status and turnover probability of job applicants.
  • the method for establishing the test set data is the same as the method for establishing the training set data.
  • the test set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of second hired employee samples, where the resignation intention label value in the resignation intention label value sequence There is a one-to-one correspondence with the work state characteristic data set in the work state characteristic data set sequence according to the time sequence; the second hired employee sample includes a second in-service employee sample and a second resigned employee sample.
  • a Hidden Markov Model for predicting the job status and turnover probability of job applicants can be constructed and trained in advance, and it can be called directly when needed. It can also be constructed and trained to obtain the hidden Markov model operation for predicting the job status and the probability of leaving the job, and then call it.
  • Step S102 Obtain the job applicant's application data, and construct the job applicant's static information feature data set based on the job applicant's application data.
  • step S102 the method of constructing the static information feature data set of job applicants is the same as the method of constructing the static information feature data set of the first hired employee sample in step S101. For the sake of simplicity, it will not be repeated here.
  • Step S103 Input the static information feature data set into the hidden Markov model for predicting the job seeker's work status and resignation probability, and predict the job seeker's work status and resignation probability after entering the job.
  • step S103 after the static information feature data set is input into the Hidden Markov Model, the probability of the hidden state sequence is solved by the forward-backward algorithm, and the job status of the job applicant and the probability of leaving the job are predicted.
  • the data used is the static information feature data set of job applicants, and no one is the evaluation index. Therefore, the subjective arbitrariness and random uncertainty brought about by human resource management can be avoided.
  • the hidden Markov model-based resignation prediction method of the embodiment shown in Figure 1 constructs a pre-trained hidden Markov model based on the employee’s static information and the influence of work status on the employee’s resignation intention, and uses the sample data of hired employees Train the pre-trained hidden Markov model to obtain a hidden Markov model for predicting the job status and turnover probability of the job applicant; obtain the job applicant’s application data, and construct the job applicant’s static information based on the job applicant’s application data Feature data set: Input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant.
  • the human resources management department can make a decision on whether to hire or not based on the predicted results, so as to avoid hiring unstable job applicants who are easy to resign into the company. , Thereby reducing the loss caused by employee resignation, while also reducing the cost of manpower recruitment.
  • the hidden Markov model used in this embodiment is a time series model, and its prediction result also includes the time of resignation. Therefore, the resignation prediction method provided by this embodiment is highly scalable on the time scale. Further realize real-time response to resignation warning, etc.
  • the hidden status sequence is determined based on the analysis of the work status data of all selected samples of hired employees.
  • the working years are classified, for example, including no working years (ie fresh graduates) category, 1-2 The category of annual working years, the category of working years of 2-5 years, the category of working years of 5-10 years, etc.
  • a hidden Markov model for predicting the future work status and the probability of leaving the job is constructed and trained.
  • the specific training method is the same as the training method described in step S101, for the sake of brevity, it will not be repeated here.
  • FIG. 3 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there are substantially the same results, the method of the present application is not limited to the sequence of the process shown in FIG. 3. As shown in Figure 3, the method includes:
  • Step S201 Obtain the application data of the job applicant, construct the static information feature data set of the job applicant based on the job applicant's application data, and determine the working years category of the job applicant.
  • step S201 the application data of the job applicant is obtained, and the step of constructing the static information feature data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG. Go into details again.
  • any suitable classification model in the art can be used to determine the working years category to which the job applicant belongs. It is also possible to manually classify the categories after identifying the working years.
  • Step S202 Invoking a Hidden Markov Model corresponding to the working years category for predicting the job status and the probability of leaving the job according to the working years category to which the job applicant belongs.
  • prediction models corresponding to different working years categories are constructed and trained in advance, and then in step S202, the corresponding prediction models can be called according to the working years category of the job seeker.
  • Step S203 Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker's work status and resignation probability after entering the job.
  • step S203 is similar to step S103 in the embodiment shown in FIG.
  • the Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 3 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the job applicant’s belongings Working life category; according to the working life category of the job applicant, call the hidden Markov model corresponding to the working life category for predicting the job status and turnover probability of the job applicant; input the static information feature data set to the job application prediction
  • the Hidden Markov Model of the job status and the probability of leaving the job predicts the job status and the probability of leaving the job applicant.
  • the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
  • the regions where the employees are located are classified, for example, into the south and the north, or According to the city, it is divided into first-tier cities, second-tier cities, third-tier cities, etc., and then constructs and trains hidden Markov models for predicting the future job status and the probability of leaving the job according to the regional category.
  • the specific training method is the same as the training method described in step S101, for the sake of simplicity, it will not be repeated here.
  • FIG. 4 is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 4. As shown in Figure 4, the method includes:
  • Step S301 Obtain application data of the job applicant, construct a static information feature data set of the job applicant based on the job applicant's application data, and determine the area category to which the job applicant belongs.
  • step S301 the application data of the job applicant is obtained, and the step of constructing the static information feature data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG. Go into details again.
  • step S301 the administrative region to which the job seeker belongs is determined according to the resume text information of the job seeker, and then the category of the region to which the job seeker belongs is determined according to the administrative region to which the job seeker belongs.
  • Step S302 According to the category of the area to which the job seeker belongs, a hidden Markov model corresponding to the category of the job seeker is called for predicting the job status of the job seeker and the probability of leaving the job.
  • prediction models corresponding to different area categories are constructed and trained in advance, and then in step S302, the corresponding prediction models can be called according to the area category to which the job applicant belongs.
  • Step S303 Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker’s work status and resignation probability after entering the job.
  • step S303 is similar to step S103 of the embodiment shown in FIG.
  • the Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 4 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the region where the job applicant belongs Category; call the hidden Markov model corresponding to the area category for predicting the job applicant’s working status and the probability of resignation according to the area category to which the job applicant belongs; input the static information feature data set into the job applicant’s working status and The Hidden Markov Model of Resignation Probability, which predicts the job status and resignation probability of the job applicant after entering the job.
  • the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
  • FIG. 5 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 5. As shown in Figure 5, the method includes:
  • Step S401 Obtain application data of the job applicant, construct a static information feature data set of the job applicant based on the job applicant's application data, and determine the area category and the working years category of the job applicant.
  • step S401 the application data of the job applicant is obtained, and the step of constructing the static information characteristic data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG.
  • the steps are similar to step S201 of the embodiment shown in FIG. 3, and the step of determining the working years category of the job applicant is similar to step S301 of the embodiment shown in FIG.
  • Step S402 According to the region category and the working years category of the job seeker, the hidden Markov model corresponding to the region category and the working years category is called for predicting the job seeker's working status and the probability of leaving the job.
  • the prediction models corresponding to different working years categories are pre-built and trained, and then in step S402, the job applicant can be based on the regional category and the working years category. Call the corresponding prediction model.
  • Step S403 Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker’s work status and resignation probability after entering the job.
  • step S403 is similar to step S103 of the embodiment shown in FIG.
  • the Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 5 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the region where the job applicant belongs Category and working years category; according to the region category and working years category of the job seeker, call the hidden Markov model corresponding to the region category to predict the job seeker’s working status and the probability of leaving the job; input the static information feature data set
  • the Hidden Markov Model for predicting the job status and the probability of leaving the job seeker predicts the job status and the probability of leaving the job after the job seeker.
  • the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
  • FIG. 6 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 6. As shown in Figure 6, the method includes:
  • Step S501 Based on the influence of the employee’s static information and work status on the employee’s intention to leave, construct a pre-trained hidden Markov model, and train the pre-trained hidden Markov model based on the sample data of hired employees to obtain predictions Hidden Markov Model of Job Seeker's Job Status and Resignation Probability.
  • step S501 is similar to step S101 of the embodiment shown in FIG.
  • Step S502 Obtain application data of the job applicant, construct a static information characteristic data set of the job applicant based on the application data of the job applicant, and create a job application portrait based on the static information characteristic data set of the job applicant.
  • step S502 the step of acquiring the job applicant’s application data in step S502, and constructing the job applicant’s static information feature data set based on the job applicant’s application data is similar to step S102 in the embodiment shown in FIG. 1 , For the sake of simplicity, I won’t repeat it here.
  • a personal file ie job application portrait
  • the personal file can be matched with the job applicant’s photo and application number
  • Easy to find all these personal files created can be stored in the database, because the applicant's application number is unique within the enterprise, so the files stored in the database can be searched according to the unique identification of the application number.
  • Step S503 Input the static information feature data set into the Hidden Markov Model for predicting the job status and resignation probability of the job applicant, and predict the job status and resignation probability of the job applicant after entering the job.
  • step S503 is similar to step S503 of the embodiment shown in FIG.
  • Step S504 Determine whether the corresponding entry time when the job applicant's resignation probability is greater than the preset probability threshold is greater than or equal to the preset time threshold.
  • the preset probability threshold and the preset time threshold may be set according to actual needs.
  • the preset probability threshold is set to 0.9
  • the preset time threshold is set to 2 years.
  • the preset time threshold can also be set separately according to the job position.
  • the preset time threshold for important or core project positions can be set slightly longer Some, such as 5 years, etc.
  • the aforementioned Hidden Markov Model has been introduced as a time series model.
  • the corresponding entry time at the time of resignation can be correspondingly known. If the entry time is greater than or equal to the preset time threshold, the job search is determined If the job seeker is a stable job seeker who is not easy to resign, step S505 is executed. Otherwise, it is determined that the job seeker is an unstable job seeker who is easy to resign, and step S506 is executed.
  • Step S505 Set an admission mark on the job-seeking portrait of the job-seeker.
  • Step S506 Set a non-admission flag on the job-seeking portrait of the job-seeker.
  • the hidden Markov model-based resignation prediction method of the embodiment shown in Figure 6 constructs a pre-trained hidden Markov model based on the employee’s static information and the influence of the work status on the employee’s resignation intention, and uses the sample data of hired employees Train the pre-trained hidden Markov model to obtain a hidden Markov model for predicting the job status and turnover probability of the job applicant; obtain the job applicant’s application data, and build the job applicant’s static state based on the job applicant’s application data Information feature data set, and create a job application portrait based on the job applicant’s static information feature data set; input the static information feature data set into the hidden Markov model used to predict the job applicant’s work status and the probability of leaving the job to predict the job applicant The job status and resignation probability after entering the job; judge whether the entry time corresponding to the job applicant’s resignation probability is greater than the preset probability threshold is greater than or equal to the preset time threshold; if so, set an admission mark on the job applicant’s portrait ; Otherwise, set a non-ad
  • FIG. 7 is a schematic structural diagram of a resignation prediction device based on a hidden Markov model in an embodiment of the present application.
  • the resignation prediction device 60 includes a first acquisition module 61, a second acquisition module 62 and a prediction module 63.
  • the first acquisition module 61 is used to construct a pre-training hidden Markov model based on the employee’s static information and work status’s influence on the employee’s intention to leave, and to perform the pre-training hidden Markov model on the sample data of hired employees. Train to obtain a hidden Markov model for predicting the job status and turnover probability of job applicants.
  • the second acquiring module 62 is used to acquire the job applicant's application data, and build the job applicant's static information feature data set based on the job applicant data.
  • the prediction module 63 is coupled with the first acquisition module 61 and the second acquisition module 62, and is used to input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant.
  • the first obtaining module 61 obtains the hidden Markov model for predicting the job status and the probability of leaving the job.
  • the intention to leave is the observation state, and a pre-training hidden Markov model is constructed.
  • the model parameters of the pre-training hidden Markov model include the first observation probability matrix A, the state transition probability matrix B, and the second observation probability matrix C;
  • An element of the observation probability matrix A Static information indicates when X i, Y j operating state probability;
  • the state transition probability matrix element B is Indicates the probability of the transition from the working state Y j to the working state Y k ;
  • the element in the second observation probability matrix C Static information is represented by X i, Z j turnover intention of probability, i, j and k are a natural number greater than or equal to 1;
  • the establishment of training data the training data set comprises a first plurality of samples has been hired employee Static information feature data set, work status feature data set sequence, and resignation intention label value sequence.
  • the resignation intention label value in the resignation intention label value sequence corresponds to the work status feature data set in the work status feature data set sequence according to the time sequence. ;
  • the first hired employee sample includes the first in-service employee sample and the first resigned employee sample;
  • the training set data is used to train the pre-training hidden Markov model.
  • the operation of the first obtaining module 61 to obtain the hidden Markov model for predicting the job status and the probability of leaving the job further includes: establishing a test set data, the test set data includes a plurality of second hired employee samples
  • the static information feature data set, the work status feature data set sequence and the resignation intention label value sequence, the resignation intention label value in the resignation intention label value sequence and the work status feature data set in the work status feature data set sequence are sorted by time one by one Correspondence; among them, the second sample of hired employees includes the second sample of incumbent employees and the second sample of resigned employees;
  • the static information feature data set in the test set data is input into the pre-trained hidden Markov model after training Test, output the prediction sequence of the work status feature data set and the expected sequence of the resignation intention label value of each second hired employee sample; compare the prediction sequence of the work status feature data set with the corresponding work status feature data set sequence in the test set data Compare and compare the resignation intention label value prediction sequence with the corresponding resignation intention label value sequence in the test set data to calculate the
  • the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
  • the method for obtaining resume information features includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs the resume information feature.
  • the method for acquiring features of the interview video includes: inputting at least one frame of images in the interview video into a deep-learning facial-attempt neural network model, and the deep-learning facial-attempt neural network model outputs the interview video features.
  • the method for acquiring written test information features includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputs the written test information feature.
  • the resignation prediction device 60 further includes a determination model 64 coupled to the second acquisition module 62 for determining the area category to which the job applicant belongs based on the job applicant's application data.
  • the first acquiring module 61 is also coupled to the determining module 64, and is configured to call a Hidden Markov Model corresponding to the region category for predicting the job status and the probability of leaving the job according to the region category to which the job seeker belongs.
  • the determination model 64 is also used to determine the type of working years to which the job applicant belongs based on the job application data of the job applicant.
  • the first acquisition module 61 is also configured to call a Hidden Markov Model corresponding to the working years category for predicting the job seeker's working status and the probability of leaving the job according to the working years category to which the job seeker belongs.
  • the resignation prediction device 60 further includes a creation module 65 coupled to the second acquisition module 62, and is configured to create a job-seeking portrait according to the static information feature data set of the job-seeker.
  • the prediction module 63 is also coupled with the creation module 65, and is used to display the job applicant’s job search portrait when it is predicted that the job applicant’s resignation probability is greater than the preset probability threshold and the corresponding entry time is less than the preset time threshold.
  • FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the application.
  • the computer device 70 includes a processor 71 and a memory 72 coupled to the processor 71.
  • the memory 72 stores computer-readable instructions, and when the computer-readable instructions are executed by the processor 71, the processor 71 executes the steps of the above-mentioned hidden Markov model-based resignation prediction method.
  • the processor 71 may also be referred to as a CPU (Central Processing Unit, central processing unit).
  • the processor 71 may be an integrated circuit chip with signal processing capabilities.
  • the processor 71 may also be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component .
  • the general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
  • FIG. 9 is a schematic structural diagram of a storage medium according to an embodiment of the application.
  • the storage medium 80 stores computer-readable instructions 81, which when executed by one or more processors, cause the one or more processors to execute the steps of the above-mentioned Hidden Markov Model-based Resignation Prediction Method .
  • the computer-readable instruction 81 may be stored in the above-mentioned storage medium in the form of a software product, including several instructions for enabling a computer device (may be a personal computer, a server, or a network device, etc.) or a processor (processor) Perform all or part of the steps of the methods described in the various embodiments of this application.
  • the aforementioned storage medium 80 includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks, etc., which can store program codes.
  • U disk mobile hard disk
  • read-only memory ROM, Read-Only Memory
  • RAM random access memory
  • magnetic disks or optical disks, etc. which can store program codes.
  • Media or terminal devices such as computers, servers, mobile phones, tablets, etc.
  • the storage medium may be non-volatile or volatile.
  • the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. The above are only implementations of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of this application, or directly or indirectly applied to other related technical fields, The same reasoning is included in the scope of patent protection of this application.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Human Resources & Organizations (AREA)
  • General Physics & Mathematics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Strategic Management (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Evolutionary Biology (AREA)
  • Economics (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Operations Research (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Quality & Reliability (AREA)
  • Marketing (AREA)
  • General Health & Medical Sciences (AREA)
  • Computational Linguistics (AREA)
  • Biophysics (AREA)
  • Biomedical Technology (AREA)
  • Molecular Biology (AREA)
  • Health & Medical Sciences (AREA)
  • Software Systems (AREA)
  • Mathematical Physics (AREA)
  • Computing Systems (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

A hidden Markov model-based resignation prediction method and a related device. Said method comprises: constructing a pre-trained hidden Markov model on the basis of employee static information and influences of a working status on employee resignation intention, and training the pre-trained hidden Markov model by means of sample data of hired employees, so as to acquire a hidden Markov model for predicting a working status and a resignation probability of a job applicant (S101); acquiring application data of the job applicant, and constructing a static information feature data set of the job applicant on the basis of the application data (S102); and inputting the static information feature data set into the hidden Markov model, so as to predict the working status and the resignation probability of the job applicant after being hired (S103). In this way, the present invention provides reference information for a human resource management department to make the decision of whether to hire, avoiding recruiting, to a company, job applicants who are unstable and liable to resign, reducing the costs of human recruitment. In addition, the method has strong expandability on the time scale, for example, resignation warning, etc. can be achieved subsequently.

Description

基于隐马尔可夫模型的离职预测方法及相关装置Resignation prediction method and related device based on hidden Markov model
本申请要求于2020年10月21日提交中国专利局、申请号为202011134554.2、申请名称为“基于隐马尔可夫模型的离职预测方法及相关装置”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on October 21, 2020, the application number is 202011134554.2, and the application name is "Hidden Markov Model-based Resignation Prediction Method and Related Devices". The entire content of the application is approved The reference is incorporated in this application.
技术领域Technical field
本申请涉及大数据技术领域,特别是涉及一种基于隐马尔可夫模型的离职预测方法及相关装置。This application relates to the field of big data technology, and in particular to a method and related devices for predicting turnover based on a hidden Markov model.
背景技术Background technique
人力资源管理是公司运营的重要组成,对员工离职的预测是人力资源领域一重要难题。员工离职对公司在经济成本、效率成本、文化流失成本上有着极大的负面影响,因此,现有技术提供了一些预测坐席离职方法,这些方法大多为逻辑回归或随机森林等方法,用于预测在职员工的离职概率和离职原因等。而对于潜在员工,如求职者,如何合理预测其入职后的工作状态和离职概率,为人力资源管理部门做出是否录用决定提供参考信息,目前还显有报道。发明人意识到现有的预测坐席离职方法中所使用的评估指标,如面试选拔的有效性、培训过程中的针对性、平时高强度工作量的定量化衡量以及长时间工作状态的预判等大都基于人为评判,不可避免给人力资源管理带来了主观随意性和随机不确定性。Human resource management is an important part of company operations, and the prediction of employee turnover is an important problem in the field of human resources. Employee resignation has a great negative impact on the company’s economic costs, efficiency costs, and cultural loss costs. Therefore, the existing technology provides some methods for predicting agent resignation. Most of these methods are logistic regression or random forest for forecasting. Probability and reason for resignation of incumbent employees. As for potential employees, such as job seekers, how to reasonably predict their working status and resignation probability after entry, and provide reference information for the human resources management department to make hiring decisions, there are still obvious reports. The inventor is aware of the evaluation indicators used in the existing methods of predicting agent turnover, such as the effectiveness of interview selection, the pertinence in the training process, the quantitative measurement of the usual high-intensity workload, and the prediction of long-term working conditions, etc. Most are based on human judgment, which inevitably brings subjective arbitrariness and random uncertainty to human resource management.
发明内容Summary of the invention
基于此,有必要提供一种基于隐马尔可夫模型的离职预测方法及相关装置,以实现合理预测求职者入职后的工作状态和离职概率,为人力资源管理部门做出是否录用决定提供参考信息。Based on this, it is necessary to provide a resignation prediction method and related devices based on a hidden Markov model to realize a reasonable prediction of the job status and resignation probability of job applicants after entry, and provide reference information for the human resources management department to make hiring decisions. .
一种基于隐马尔可夫模型的离职预测方法,所述方法包括:A method for predicting turnover based on a hidden Markov model, the method comprising:
基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of job status and turnover probability;
获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;Obtain job applicant's application data, and construct a static information feature data set of the job applicant based on the application data;
将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。The static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
一种基于隐马尔可夫模型的离职预测装置,所述装置包括:A resignation prediction device based on a hidden Markov model, the device comprising:
第一获取模块,所述第一获取模块用于基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;The first acquisition module is used to construct a pre-training hidden Markov model based on the employee’s static information and the influence of the work status on the employee’s intention to leave, and to use the sample data of the hired employee to hide the pre-training The Markov model is trained to obtain the hidden Markov model used to predict the job status and the probability of leaving the job;
第二获取模块,所述第二获取模块用于获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;A second acquisition module, the second acquisition module is used to acquire job applicants’ application data, and build a static information feature data set of the job applicants based on the application data;
预测模块,所述预测模块用于将所述静态信息特征数据集输入所述隐马尔可夫 模型,预测所述求职者入职后的工作状态和离职概率。The prediction module is configured to input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant enters the job.
此外,为实现上述目的,本申请还提供一种计算机设备,包括存储器和处理器,所述存储器存储有用于实现以下步骤的程序指令,所述步骤包括:In addition, in order to achieve the above object, the present application also provides a computer device, including a memory and a processor, the memory storing program instructions for implementing the following steps, the steps including:
基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of work status and turnover probability; acquiring job applicant’s application data, and constructing a static information feature data set of the job applicant based on the application data; inputting the static information feature data set into the hidden Markov The husband model predicts the job status and the probability of leaving the job after the job applicant enters the job.
所述处理器用于执行所述存储器存储的程序指令。此外,为实现上述目的,本申请还提供一种存储有计算机可读指令的存储介质,所述计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行如下步骤:The processor is configured to execute program instructions stored in the memory. In addition, in order to achieve the above object, the present application also provides a storage medium storing computer readable instructions, which when executed by one or more processors, cause the one or more processors to perform the following steps:
基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of work status and turnover probability; acquiring job applicant’s application data, and constructing a static information feature data set of the job applicant based on the application data; inputting the static information feature data set into the hidden Markov The husband model predicts the job status and the probability of leaving the job after the job applicant enters the job.
与现有技术相比,本申请的基于隐马尔可夫模型的离职预测方法及相关装置,通过基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。通过预测求职者入职后的工作状态和离职概率,使得人力资源管理部门在应聘流程结束后,可以结合预测结果做出是否录用决定,从而实现尽量避免将易离职的不稳定求职者招聘到企业中,进而降低员工离职所带来的损失,同时也能减少人力招聘成本。此外,本申请采用的隐马尔可夫模型为时序模型,其预测结果中还包含了离职时间,因此,本申请提供的离职预测方法在时间尺度上的可拓展性强,如后序可以进一步实现实时响应离职预警等。Compared with the prior art, the Hidden Markov Model-based Resignation Prediction Method and related devices of the present application construct a pre-trained Hidden Markov Model based on the influence of the employee’s static information and work status on the employee’s resignation intention, and The pre-trained hidden Markov model is trained through the sample data of hired employees to obtain a hidden Markov model for predicting the job status and the probability of leaving the job; the application data of the job applicant is obtained, and the construction is based on the application data The static information feature data set of the job applicant; the static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job. By predicting the job status and resignation probability of job applicants, after the application process is over, the human resources management department can make a decision on whether to hire or not based on the predicted results, so as to avoid hiring unstable job applicants who are easy to resign into the company. , Thereby reducing the loss caused by employee resignation, while also reducing the cost of manpower recruitment. In addition, the hidden Markov model used in this application is a time series model, and its prediction results also include the time of resignation. Therefore, the resignation prediction method provided by this application has strong scalability on the time scale, and the subsequent sequence can be further realized Respond to resignation warnings in real time.
附图说明Description of the drawings
图1是本申请一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图;FIG. 1 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to an embodiment of the present application;
图2为图1所示方法中预训练隐马尔可夫模型的一种结构示意图;Fig. 2 is a schematic diagram of a structure of the pre-trained hidden Markov model in the method shown in Fig. 1;
图3是本申请另一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图;FIG. 3 is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application;
图4是本申请再一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图;FIG. 4 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model in another embodiment of the present application;
图5是本申请又一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图;FIG. 5 is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application;
图6是本申请又一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图;Fig. 6 is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application;
图7是本申请一个实施例基于隐马尔可夫模型的离职预测装置的结构示意图;FIG. 7 is a schematic structural diagram of a resignation prediction device based on a hidden Markov model in an embodiment of the present application;
图8是本申请一个实施例计算机设备的结构示意图;FIG. 8 is a schematic structural diagram of a computer device according to an embodiment of the present application;
图9是本申请一个实施例存储介质的结构示意图。FIG. 9 is a schematic structural diagram of a storage medium according to an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅是本申请的一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。The following will clearly and completely describe the technical solutions in the embodiments of the present application in conjunction with the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, rather than all the embodiments. Based on the embodiments in this application, all other embodiments obtained by those of ordinary skill in the art without creative work shall fall within the protection scope of this application.
为了合理预测求职者入职后的工作状态和离职概率,为人力资源管理部门做出是否录用决定提供参考信息,本申请提供了一种基于隐马尔可夫模型的离职预测方法及相关装置。In order to reasonably predict the job status and resignation probability of job applicants after entering the job, and to provide reference information for the human resources management department to make a decision on whether to hire or not, this application provides a method and related devices for resignation prediction based on a hidden Markov model.
请参阅图1所示,图1是本申请一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图。需注意的是,若有实质上相同的结果,本申请的方法并不以图1所示的流程顺序为限。如图1所示,该方法包括:Please refer to FIG. 1, which is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to an embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 1. As shown in Figure 1, the method includes:
步骤S101:基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对该预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型。Step S101: Based on the influence of the employee’s static information and work status on the employee’s intention to leave, construct a pre-trained hidden Markov model, and train the pre-trained hidden Markov model based on the sample data of hired employees to obtain predictions Hidden Markov Model of Job Seeker's Job Status and Resignation Probability.
可选的,在步骤101中,员工的静态信息包括员工个人的一些基本信息,如姓名、年龄、家庭住址、电话、毕业院校、学历、专业、项目经历、实习经历、个人评价、专业技能等,这些基本信息一般都是不大会改动的,可以从员工应聘时产生的应聘数据中提取整合得到。进一步的,该应聘数据可以包括简历数据、面试视频数据和笔试数据中的任意一种或者几种的组合。Optionally, in step 101, the static information of the employee includes some basic personal information of the employee, such as name, age, home address, telephone number, graduate school, education, major, project experience, internship experience, personal evaluation, and professional skills Etc., these basic information are generally not changed in general, and can be extracted and integrated from the application data generated when employees apply for a job. Further, the application data may include any one or a combination of resume data, interview video data, and written examination data.
员工的工作状态包括员工绩效、员工考勤、员工请假频率、领导打分记录、同事评价记录以及员工奖惩等。员工入职后产生的工作状态数据可以存储在员工管理系统中,当构建用于训练或验证模型所需的数据集时可以从中进行提取。The work status of employees includes employee performance, employee attendance, employee leave frequency, leadership scoring records, colleague evaluation records, and employee rewards and punishments. The work status data generated after the employee enters the job can be stored in the employee management system, and can be extracted from it when constructing the data set required for training or validating the model.
员工的离职意向,即员工离职的可能性,可使用数值进行标注。如数值1表示员工离职,数值0表示员工不离职,数值越接近于1时,表示员工的离职意向越强烈,数值越接近于0时,表示员工的离职意向越弱。The employee’s resignation intention, that is, the employee’s likelihood of resignation, can be marked with a numerical value. For example, the value 1 means that the employee resigns, and the value 0 means that the employee does not resign. The closer the value is to 1, the stronger the employee’s resignation intention, and the closer the value is to 0, the weaker the employee’s resignation intention.
需要说明的是,在存储数据或者构建用于训练或验证模型所需的数据集时,将员工的工作状态和离职意向按照时间排序一一对应。对于在职员工,将其入职后的离职意向标注值均设置为0,即对于在职员工而言,其在任意时刻下的工作状态所对应的离职意向标注值均可以假设为0。对于离职员工,将其离职前的离职意向标注值均设置为0,而将其离职时的离职意向标注值设置为1,即其离职时所产生的工作状态数据所对应的离职意向标注值要假设为1。It should be noted that when storing data or constructing a data set required for training or validating a model, the employee’s work status and resignation intentions are arranged in a one-to-one correspondence according to time. For incumbent employees, set their resignation intention label value to 0 after entry, that is, for incumbent employees, the resignation intention label value corresponding to their work status at any time can be assumed to be 0. For resigning employees, the resignation intention label value before resignation is set to 0, and the resignation intention label value at the time of resignation is set to 1, that is, the resignation intention label value corresponding to the work status data generated at the time of resignation is required Assume it is 1.
一般情况下,员工的静态信息对员工入职后的工作状态和离职意向均会产生一定的影响。举例说明,若员工应聘时对其所应聘岗位的工作期待较高,这些特征在其面试时可能会表现出来,其入职后的工作状态也可能会比较好,从而离职意向较弱。再或者,员工应聘时是无工作经验的应届毕业生,可以通过其就读的学校、参加的活动类型等预判其专业知识是否扎实,学习能力强弱等,这些特征一般会直接记录在简历数据中。若其专业知识扎实并具备较强的学习能力,其入职后对第一份工作的适应能力就会比较强,从而工作状态也会比较好,离职意向弱。反之,其入职后对第一份工作的适应能力可能较弱,导致工作状态较差,进而离职意向就会比较强烈。又或者,员工所在城市大小会影响其对通勤距离的接受程度,如一线城市,每天单程通勤一小时属于可接受范围,而三四线城市,每天单程通勤一小时则不太容易被接受。即员工所在城市以及通勤时长属于可接受范围时,其离职意向就会比较弱,否则,其离职意向就会较强。Under normal circumstances, the static information of the employee will have a certain impact on the employee's work status and resignation intention after entry. For example, if an employee has high job expectations for the position he is applying for when applying for a job, these characteristics may be manifested during the interview, and his working status after entering the job may also be better, resulting in a weaker intention to leave. Or, when an employee is a fresh graduate with no work experience, he can predict whether his professional knowledge is solid and his learning ability is strong through the school he attends, the type of activities he participates, etc. These characteristics will generally be directly recorded in the resume data middle. If their professional knowledge is solid and they have strong learning ability, they will be more adaptable to the first job after entering the job, and their working status will be better, and their intention to leave will be weak. On the contrary, their ability to adapt to the first job may be weak after entering the job, resulting in a poor working status and a stronger intention to leave. Or, the size of the city where employees are located will affect their acceptance of commuting distance. For example, in first-tier cities, one-hour commute per day is acceptable, while in third-tier and fourth-tier cities, one-hour commute per day is not easy to accept. That is, when the employee's city and commuting time are within the acceptable range, their intention to leave will be weaker; otherwise, their intention to leave will be stronger.
因此,在本申请中,将员工的静态信息作为输入,以员工的工作状态为隐含状态,员工的离职意向为观测状态,构建预训练隐马尔可夫模型λ=((A,B,C),X,π)。其中,X表示员工的静态信息,为模型的输入项;A,B,C均为模型参数,π为初始状态概率矩阵。Therefore, in this application, the employee’s static information is used as input, the employee’s working status is the implicit state, and the employee’s resignation intention is the observation state, and a pre-trained hidden Markov model is constructed λ=((A,B,C ),X,π). Among them, X represents the static information of the employee, which is the input item of the model; A, B, and C are all model parameters, and π is the initial state probability matrix.
为了便于理解,请参阅图2所示,图2为步骤S101中构建的预训练隐马尔可夫模型的一种结构示意图。如图2中所示,员工的工作状态Y为隐含状态,员工的离职 意向Z为观测状态;A为第一观测概率矩阵,该第一观测概率矩阵中的元素
Figure PCTCN2020135184-appb-000001
Figure PCTCN2020135184-appb-000002
表示静态信息为X i时,工作状态为Y j的概率;B为状态转移概率矩阵,该状态转移概率矩阵中的元素
Figure PCTCN2020135184-appb-000003
表示工作状态Y j向工作状态Y k转移的概率;C为第二观测概率矩阵,该第二观测概率矩阵中的元素
Figure PCTCN2020135184-appb-000004
表示静态信息为X i时,离职意向为Z j的概率,i,j和k均为大于或等于1的自然数。
For ease of understanding, please refer to FIG. 2. FIG. 2 is a schematic structural diagram of the pre-training hidden Markov model constructed in step S101. As shown in Figure 2, the employee’s work status Y is the implicit status, and the employee’s resignation intention Z is the observation status; A is the first observation probability matrix, and the elements in the first observation probability matrix
Figure PCTCN2020135184-appb-000001
Figure PCTCN2020135184-appb-000002
Static information indicates when X i, Y j operating state of probability; B is the state transition probability matrix, the state transition probability matrix element
Figure PCTCN2020135184-appb-000003
Represents the probability of the transition from the working state Y j to the working state Y k ; C is the second observation probability matrix, and the elements in the second observation probability matrix
Figure PCTCN2020135184-appb-000004
Static information is represented by X i, Z j turnover intention of probability, i, j and k are a natural number greater than or equal to 1.
需要说明的是,在该预训练隐马尔可夫模型λ=((A,B,C),X,π)中,模型参数A,B,C均是未知的,需要经过训练得到。而隐含状态序列{Y 1,Y 2,…,Y n}可以通过对已录用员工样本的工作状态数据分析后确定,即隐含状态序列中的各个隐含状态Y j分别表示员工的何种工作状态是给定的。其中,已录用员工样本包括在职员工样本和离职员工样本,离职员工样本包括已离职员工样本和已提交离职申请但未正式离职的员工样本。 It should be noted that in the pre-trained hidden Markov model λ=((A,B,C),X,π), the model parameters A, B, and C are all unknown and need to be obtained through training. The hidden state sequence {Y 1 , Y 2 ,..., Y n } can be determined by analyzing the work status data of a sample of hired employees, that is, each hidden state Y j in the hidden state sequence represents the employee’s position. This kind of working status is given. Among them, the sample of hired employees includes samples of current employees and samples of resigned employees. The samples of resigned employees include samples of resigned employees and samples of employees who have submitted resignation applications but have not formally resigned.
可选的,通过统计的方法确定隐含状态序列。例如先统计每个已录用员工样本的工作状态类别,考虑到不同已录用员工样本可能具有相同或相近的工作状态类别,可以使用本领域中任何合适的方式将相同的工作状态类别进行去重,或者直接对各个已录用员工样本的工作状态类别进行聚类,最后基于去重后的所有工作状态类别或者聚类后的工作状态类别建立隐含状态序列。Optionally, the hidden state sequence is determined by a statistical method. For example, first count the job status categories of each hired employee sample. Considering that different hired employee samples may have the same or similar job status categories, you can use any appropriate method in the field to de-duplicate the same job status categories. Or directly cluster the job status categories of each sample of hired employees, and finally establish an implicit status sequence based on all the job status categories after deduplication or the clustered job status categories.
通过建立如上述的预训练隐马尔可夫模型,然后利用已录用员工样本数据对其进行训练,以获取可以用于预测求职者入职后工作状态和离职概率的隐马尔可夫模型。其中,已录用员工样本数据包括每个员工样本应聘时产生的应聘数据、入职后产生的工作状态数据、以及与工作状态数据按照时间排序一一对应的离职意向数据。By establishing the above-mentioned pre-trained hidden Markov model, and then using the sample data of hired employees to train it, a hidden Markov model can be obtained that can be used to predict the job status of job seekers and the probability of leaving the job. Among them, the hired employee sample data includes application data generated when each employee sample applies for employment, job status data generated after entry, and resignation intention data that corresponds to the job status data in a chronological order.
可选的,利用已录用员工样本数据对该预训练隐马尔可夫模型进行训练的方法包括:先建立训练集数据,该训练集数据中包括多个第一已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,其中,离职意向标注值序列中的离职意向标注值与工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应,第一已录用员工样本包括第一在职员工样本和第一离职员工样本。然后利用该训练集数据对该预训练隐马尔可夫模型进行训练。Optionally, the method for training the pre-trained hidden Markov model using the sample data of hired employees includes: first establishing training set data, the training set data includes static information feature data of a plurality of first hired employee samples Set, work status feature data set sequence, and resignation intention tag value sequence, where the resignation intention tag value in the resignation intention tag value sequence corresponds to the work status feature data set in the work status feature data set sequence in a one-to-one order. A sample of hired employees includes a sample of the first incumbent employee and a sample of the first resigned employee. Then use the training set data to train the pre-trained hidden Markov model.
需要说明的是,在本实施例中,将已录用员工样本数据分为第一已录用员工样本数据和第二已录用员工样本数据两部分。其中第一已录用员工样本数据用于建立训练集数据,该训练集数据用于训练前述构建的预训练隐马尔可夫模型,以获取模型参数。第二已录用员工样本数据用于建立测试集数据,该测试集数据用于检验训练后得到的模型参数是否为较优或最优模型参数。It should be noted that, in this embodiment, the hired employee sample data is divided into two parts: the first hired employee sample data and the second hired employee sample data. The first hired employee sample data is used to establish training set data, and the training set data is used to train the pre-trained hidden Markov model constructed as described above to obtain model parameters. The second hired employee sample data is used to establish test set data, and the test set data is used to test whether the model parameters obtained after training are better or optimal model parameters.
具体的,静态信息特征数据集中包括简历信息特征、面试视频特征、以及笔试信息特征中的任意一种或几种的组合。Specifically, the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
为了避免预测模型对数据形式的依赖性和数据维度过高产生的过拟合,优选的,在本实施例中,通过深度学习的图神经网络模型对简历文本数据、面试视频数据和笔试文本数据进行降维分类,实现对非结构化数据的高效转化、分析和整合,以提高模型预测准确率。In order to avoid the dependence of the prediction model on the data form and the over-fitting caused by the excessively high data dimension, preferably, in this embodiment, the deep learning graph neural network model is used for resume text data, interview video data and written test text data. Carry out dimensionality reduction classification to realize the efficient conversion, analysis and integration of unstructured data to improve the accuracy of model prediction.
具体的,简历信息特征的获取方法包括:将简历文本输入至深度学习的简历文本图神经网络模型中,该深度学习的简历文本图神经网络模型输出与该简历文本对应的简历信息特征。可选的,该简历信息特征包括学历、年龄、项目经历、实习经历、个人评价、专业技能等。Specifically, the method for obtaining resume information features includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs resume information features corresponding to the resume text. Optionally, the resume information features include educational background, age, project experience, internship experience, personal evaluation, professional skills, etc.
面试视频特征的获取方法包括:将面试视频中的至少一帧图像输入至深度学习的面试图神经网络模型中,该深度学习的面试图神经网络模型输出与该图像对应的面试视频特征。可选的,该面试视频特征包括表情、着装、礼仪、面试时长等。The method for acquiring features of the interview video includes: inputting at least one frame of images in the interview video into a deep learning facet neural network model, and the deep learning facet neural network model outputs interview video features corresponding to the image. Optionally, the interview video features include facial expressions, dress, etiquette, interview duration, etc.
笔试信息特征的获取方法包括:将笔试文本输入至深度学习的笔试文本图神经网络模型中,该深度学习的笔试文本图神经网络模型输出与该笔试文本对应的笔试信息特征。可选的,该笔试信息特征包括性格测试得分、专业技能得分等。The method for acquiring written test information features includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputs written test information features corresponding to the written test text. Optionally, the written test information features include personality test scores, professional skill scores, and the like.
具体的,工作状态特征数据集中包括员工绩效、员工考勤、员工请假频率、领导打分、同事评价、员工奖惩中的任意一种或者多种的组合。Specifically, the work status feature data set includes any one or a combination of employee performance, employee attendance, employee leave frequency, leadership scoring, colleague evaluation, employee rewards and punishments.
具体的,离职意向标注值序列为由0和1组成的序列,如
Figure PCTCN2020135184-appb-000005
表示员工在t时刻,工作状态为Y j时未离职,再如
Figure PCTCN2020135184-appb-000006
表示员工在t+1时刻,工作状态为Y j时离职。
Specifically, the resignation intention label value sequence is a sequence composed of 0 and 1, such as
Figure PCTCN2020135184-appb-000005
Indicates that the employee has not resigned at time t and the work status is Y j , and then
Figure PCTCN2020135184-appb-000006
It means that the employee resigns when the work status is Y j at time t+1.
具体的,训练时,以员工的静态信息特征数据集和员工的工作状态数据集序列作为输入,通过调整模型参数A,B,C,输出员工的离职意向值,然后再将输出的离职意向值与对应的实际的离职意向标注值进行比对,若输出的离职意向与实际的离职意向均一致,确定此时的模型参数A,B,C;反之,继续调整模型参数A,B,C。Specifically, during training, the employee’s static information feature data set and the employee’s work status data set sequence are used as input, and the employee’s turnover intention value is output by adjusting the model parameters A, B, and C, and then the output intention value of the employee Compare with the corresponding actual resignation intention label value. If the output resignation intention is consistent with the actual resignation intention, determine the model parameters A, B, and C at this time; otherwise, continue to adjust the model parameters A, B, and C.
需要说明的是,模型输出的员工离职意向值的范围为0~1。当模型输出的员工离职意向值介于0-0.5之间时,表明员工的离职意向较低,判定其不离职。若此时与其对应的实际的离职意向标注值为0,就可以判定为模型输出的员工离职意向值与实际的离职意向标注值一致,反之判定不一致。当模型输出的员工离职意向值介于0.5-1之间时,表明员工的离职意向较高,判定其离职。若此时与其对应的实际的离职意向标注值为1,就可以判定为模型输出的员工离职意向值与实际的离职意向标注值一致,反之,判定不一致。It should be noted that the employee turnover intention value output by the model ranges from 0 to 1. When the value of the employee's resignation intention output by the model is between 0-0.5, it indicates that the employee's resignation intention is low, and it is judged not to resign. If the corresponding actual resignation intention label value is 0 at this time, it can be judged that the employee resignation intention value output by the model is consistent with the actual resignation intention label value, otherwise the judgment is inconsistent. When the value of the employee's resignation intention output by the model is between 0.5 and 1, it indicates that the employee's resignation intention is high, and the employee is judged to resign. If the corresponding actual resignation intention label value is 1, it can be judged that the employee resignation intention value output by the model is consistent with the actual resignation intention label value, otherwise, the judgment is inconsistent.
在本申请的其他实施例中,在训练结束之后,还需要对训练得到的模型参数A,B,C进行测试检验,若检验不通过,还需要对训练后得到的模型参数A,B,C进行优化。可选的,测试检验方法包括:将测试集数据中的静态信息特征数据集输入至训练后的预训练隐马尔可夫模型中进行测试,输出每个第二已录用员工样本的工作状态特征数据集预测序列和离职意向标注值预测序列。然后将该工作状态特征数据集预测序列和离职意向标注值预测序列分别与测试集数据中对应的工作状态特征数据集序列以及离职意向标注值序列进行比对,计算预测准确率。当预测准确率大于或等于预设准确率阈值时,结束训练并确定模型参数;当预测准确率小于预设准确率阈值时,返回重新建立训练集数据,以重复训练过程,以对训练后得到的模型参数进行优化,直至得到预测精度高的、能够用于预测求职者工作状态和离职概率的隐马尔可夫模型。In other embodiments of the present application, after the training is completed, the model parameters A, B, and C obtained by training need to be tested and checked. If the test fails, the model parameters A, B, and C obtained after training need to be tested and checked. optimize. Optionally, the test inspection method includes: inputting the static information feature data set in the test set data into the pre-trained hidden Markov model after training for testing, and outputting the work status feature data of each second hired employee sample Set prediction sequence and resignation intention label value prediction sequence. Then the work status feature data set prediction sequence and the resignation intention label value prediction sequence are compared with the corresponding work status feature data set sequence and the resignation intention label value sequence in the test set data respectively, and the prediction accuracy rate is calculated. When the prediction accuracy is greater than or equal to the preset accuracy threshold, the training ends and the model parameters are determined; when the prediction accuracy is less than the preset accuracy threshold, return to re-establish the training set data to repeat the training process to obtain the results after training The model parameters are optimized until a Hidden Markov Model with high prediction accuracy can be used to predict the job status and turnover probability of job applicants.
需要说明的是,测试集数据的建立方法与训练集数据的建立方法相同,为简约起见,在此不再赘述。具体的,该测试集数据中包括多个第二已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,其中,离职意向标注值序列中的离职意向标注值与工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;该第二已录用员工样本包括第二在职员工样本和第二离职员工样本。It should be noted that the method for establishing the test set data is the same as the method for establishing the training set data. For the sake of brevity, it is not repeated here. Specifically, the test set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of second hired employee samples, where the resignation intention label value in the resignation intention label value sequence There is a one-to-one correspondence with the work state characteristic data set in the work state characteristic data set sequence according to the time sequence; the second hired employee sample includes a second in-service employee sample and a second resigned employee sample.
需要说明的是,在步骤S101中,可以预先构建并训练得到用于预测求职者工作状态和离职概率的隐马尔可夫模型,当需要使用时直接调用即可。也可以在使用时才进行构建并训练得到用于预测求职者工作状态和离职概率的隐马尔可夫模型操作,然后进行调用。It should be noted that in step S101, a Hidden Markov Model for predicting the job status and turnover probability of job applicants can be constructed and trained in advance, and it can be called directly when needed. It can also be constructed and trained to obtain the hidden Markov model operation for predicting the job status and the probability of leaving the job, and then call it.
步骤S102:获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集。Step S102: Obtain the job applicant's application data, and construct the job applicant's static information feature data set based on the job applicant's application data.
在步骤S102中,构建求职者的静态信息特征数据集的方法与步骤S101中构建第一已录用员工样本的静态信息特征数据集的方法相同,为简约起见,在此不再赘 述。In step S102, the method of constructing the static information feature data set of job applicants is the same as the method of constructing the static information feature data set of the first hired employee sample in step S101. For the sake of simplicity, it will not be repeated here.
步骤S103:将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Step S103: Input the static information feature data set into the hidden Markov model for predicting the job seeker's work status and resignation probability, and predict the job seeker's work status and resignation probability after entering the job.
可选的,在步骤S103中,将静态信息特征数据集输入隐马尔可夫模型后,通过前向后向算法求解隐含状态序列的概率,预测求职者入职后的工作状态和离职概率。Optionally, in step S103, after the static information feature data set is input into the Hidden Markov Model, the probability of the hidden state sequence is solved by the forward-backward algorithm, and the job status of the job applicant and the probability of leaving the job are predicted.
具体的求解过程如下:The specific solution process is as follows:
假设隐含状态序列为o 1,o 2,o 3,...,o t,...,o n,观测状态序列为s 1,s 2,s 3,...,s t,...,s nIs implicitly assumed state sequence o 1, o 2, o 3 , ..., o t, ..., o n, the state observation sequence s 1, s 2, s 3 , ..., s t,. ..,s n .
前向概率:Forward probability:
给定隐马尔可夫模型λ=((A,B,C),X,π),定义到t时刻部分观测状态序o 1,o 2,o 3,...,o t,且工作状态为s t的概率为前向概率,记作α t(i)=P(o 1,o 2,...,o t,s t|λ),根据递推公式 Given a hidden Markov model λ=((A,B,C),X,π), define the partial observation state sequence o 1 , o 2 , o 3 ,..., o t at time t , and the working state The probability of st is the forward probability, denoted as α t (i) = P(o 1 ,o 2 ,...,o t ,s t |λ), according to the recurrence formula
Figure PCTCN2020135184-appb-000007
Figure PCTCN2020135184-appb-000007
,求解前向概率。, Solve the forward probability.
同样地,给定隐马尔可夫模型λ=((A,B,C),X,π),定义在t时刻且工作状态为s t的条件下,从t+1到T的部分观测序列为部分观测状态序o t+1,o t+2,o t+3,...,o T的概率为后向概率,记为β t(i)=p(o t+1,o t+2,o t+3,...,o T|s t,λ),根据递推公式 Similarly, given hidden Markov model λ = ((A, B, C), X, π), defined at time t and the working state is s t , the partial observation sequence from t + 1 to T Is the partial observation state sequence o t+1 ,o t+2 ,o t+3 ,...,o The probability of T is the backward probability, denoted as β t (i) = p(o t+1 ,o t +2 ,o t+3 ,...,o T |s t ,λ), according to the recurrence formula
Figure PCTCN2020135184-appb-000008
Figure PCTCN2020135184-appb-000008
求解后向概率,即得到预测的员工工作状态。由于存在关系P(s t|o)∝α t(s tt(s t),因此可以预测出员工离职的概率。 Solve the backward probability, that is, get the predicted work status of the employee. Due to the relationship P(s t |o)∝α t (s tt (s t ), the probability of employee leaving can be predicted.
本申请中,预测时,使用的数据为求职者的静态信息特征数据集,无人为评判指标,因此,可以避免为人力资源管理带来的主观随意性和随机不确定性。In this application, when forecasting, the data used is the static information feature data set of job applicants, and no one is the evaluation index. Therefore, the subjective arbitrariness and random uncertainty brought about by human resource management can be avoided.
图1所示实施例的基于隐马尔可夫模型的离职预测方法,通过基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对该预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;获取求职者的应聘数据,基于该求职者的应聘数据构建求职者的静态信息特征数据集;将该静态信息特征数据集输入该隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。通过预测求职者入职后的工作状态和离职概率,使得人力资源管理部门在应聘流程结束后,可以结合预测结果做出是否录用决定,从而实现尽量避免将易离职的不稳定求职者招聘到企业中,进而降低员工离职所带来的损失,同时也能减少人力招聘成本。此外,本实施例采用的隐马尔可夫模型为时序模型,其预测结果中还包含了离职时间,因此,本实施例提供的离职预测方法在时间尺度上的可拓展性强,如后序可以进一步实现实时响应离职预警等。The hidden Markov model-based resignation prediction method of the embodiment shown in Figure 1 constructs a pre-trained hidden Markov model based on the employee’s static information and the influence of work status on the employee’s resignation intention, and uses the sample data of hired employees Train the pre-trained hidden Markov model to obtain a hidden Markov model for predicting the job status and turnover probability of the job applicant; obtain the job applicant’s application data, and construct the job applicant’s static information based on the job applicant’s application data Feature data set: Input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant. By predicting the job status and resignation probability of job applicants, after the application process is over, the human resources management department can make a decision on whether to hire or not based on the predicted results, so as to avoid hiring unstable job applicants who are easy to resign into the company. , Thereby reducing the loss caused by employee resignation, while also reducing the cost of manpower recruitment. In addition, the hidden Markov model used in this embodiment is a time series model, and its prediction result also includes the time of resignation. Therefore, the resignation prediction method provided by this embodiment is highly scalable on the time scale. Further realize real-time response to resignation warning, etc.
在前述实施例中,隐含状态序列是基于对选定的所有已录用员工样本的工作状态数据分析后确定的。考虑到不同工作年限的求职者入职后的工作状态差异较大,在本申请的另一个实施例中,对工作年限进行了分类,例如包括无工作年限(即应届毕业生)类别、1-2年工作年限类别、2-5年工作年限类别、5-10年工作年限类别等,然后按照该类别分别构建并训练用于预测求职者未来工作状态和离职概率的隐马尔可夫模型。在每个类别中,具体的训练方法同步骤S101中所述的训练方法,为 简约起见,在此不再赘述。In the foregoing embodiment, the hidden status sequence is determined based on the analysis of the work status data of all selected samples of hired employees. Considering that the working status of job seekers with different working years is quite different after entering the job, in another embodiment of this application, the working years are classified, for example, including no working years (ie fresh graduates) category, 1-2 The category of annual working years, the category of working years of 2-5 years, the category of working years of 5-10 years, etc. Then, according to the categories, a hidden Markov model for predicting the future work status and the probability of leaving the job is constructed and trained. In each category, the specific training method is the same as the training method described in step S101, for the sake of brevity, it will not be repeated here.
请参阅图3所示,图3是本申请另一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图。需注意的是,若有实质上相同的结果,本申请的方法并不以图3所示的流程顺序为限。如图3所示,该方法包括:Please refer to FIG. 3, which is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there are substantially the same results, the method of the present application is not limited to the sequence of the process shown in FIG. 3. As shown in Figure 3, the method includes:
步骤S201:获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属的工作年限类别。Step S201: Obtain the application data of the job applicant, construct the static information feature data set of the job applicant based on the job applicant's application data, and determine the working years category of the job applicant.
在该步骤S201中,获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集步骤与图1所示实施例的步骤S102类似,为简约起见,在此不再赘述。In this step S201, the application data of the job applicant is obtained, and the step of constructing the static information feature data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG. Go into details again.
在步骤S201中,可以使用本领域中任何合适的分类模型来确定求职者所属的工作年限类别。也可以人为识别工作年限后,人为进行类别分类。In step S201, any suitable classification model in the art can be used to determine the working years category to which the job applicant belongs. It is also possible to manually classify the categories after identifying the working years.
步骤S202:根据该求职者所属工作年限类别调用与该工作年限类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。Step S202: Invoking a Hidden Markov Model corresponding to the working years category for predicting the job status and the probability of leaving the job according to the working years category to which the job applicant belongs.
在本实施例中,预先构建并训练得到不同工作年限类别所对应的预测模型,然后在步骤S202中,就可以根据求职者所属工作年限类别调用对应的预测模型。In this embodiment, prediction models corresponding to different working years categories are constructed and trained in advance, and then in step S202, the corresponding prediction models can be called according to the working years category of the job seeker.
步骤S203:将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Step S203: Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker's work status and resignation probability after entering the job.
在本实施例中,步骤S203与图1所示实施例的步骤S103类似,为简约起见,在此不再赘述。In this embodiment, step S203 is similar to step S103 in the embodiment shown in FIG.
图3所示实施例的基于隐马尔可夫模型的离职预测方法,通过获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属的工作年限类别;根据该求职者所属工作年限类别调用与该工作年限类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型;将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。相比较于图1所示实施例,通过上述方式,可以提高模型预测准确率,以帮助人力资源管理部门做出更加正确的决定,进一步降低员工离职所带来的损失和减少人力招聘成本。The Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 3 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the job applicant’s belongings Working life category; according to the working life category of the job applicant, call the hidden Markov model corresponding to the working life category for predicting the job status and turnover probability of the job applicant; input the static information feature data set to the job application prediction The Hidden Markov Model of the job status and the probability of leaving the job predicts the job status and the probability of leaving the job applicant. Compared with the embodiment shown in FIG. 1, through the above method, the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
进一步的,考虑到不同区域的员工和求职者对工作的期望要求和可接受范围有差异,在本申请的另一个实施例中,对员工所在区域进行了分类,例如分成南方和北方,又或者按照城市划分为一线城市、二线城市、三线城市、……等,然后按照该区域类别分别构建并训练用于预测求职者未来工作状态和离职概率的隐马尔可夫模型。在每个类别中,具体的训练方法同步骤S101中所述的训练方法,为简约起见,在此不再赘述。Further, considering that employees and job applicants in different regions have different expectations and acceptable scope of work, in another embodiment of this application, the regions where the employees are located are classified, for example, into the south and the north, or According to the city, it is divided into first-tier cities, second-tier cities, third-tier cities, etc., and then constructs and trains hidden Markov models for predicting the future job status and the probability of leaving the job according to the regional category. In each category, the specific training method is the same as the training method described in step S101, for the sake of simplicity, it will not be repeated here.
请参阅图4所示,图4是本申请再一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图。需注意的是,若有实质上相同的结果,本申请的方法并不以图4所示的流程顺序为限。如图4所示,该方法包括:Please refer to FIG. 4, which is a schematic flow chart of a method for resignation prediction based on a hidden Markov model in another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 4. As shown in Figure 4, the method includes:
步骤S301:获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属区域类别。Step S301: Obtain application data of the job applicant, construct a static information feature data set of the job applicant based on the job applicant's application data, and determine the area category to which the job applicant belongs.
在该步骤S301中,获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集步骤与图1所示实施例的步骤S102类似,为简约起见,在此不再赘述。In this step S301, the application data of the job applicant is obtained, and the step of constructing the static information feature data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG. Go into details again.
可选的,在步骤S301中,先根据该求职者的简历文本信息确定该求职者所属行政区域,然后根据其所属行政区域确定其所属区域类别。Optionally, in step S301, the administrative region to which the job seeker belongs is determined according to the resume text information of the job seeker, and then the category of the region to which the job seeker belongs is determined according to the administrative region to which the job seeker belongs.
步骤S302:根据该求职者所属区域类别调用与该区域类别对应的用于预测求职 者工作状态和离职概率的隐马尔可夫模型。Step S302: According to the category of the area to which the job seeker belongs, a hidden Markov model corresponding to the category of the job seeker is called for predicting the job status of the job seeker and the probability of leaving the job.
在本实施例中,预先构建并训练得到不同区域类别所对应的预测模型,然后在步骤S302中,就可以根据求职者所属区域类别调用对应的预测模型。In this embodiment, prediction models corresponding to different area categories are constructed and trained in advance, and then in step S302, the corresponding prediction models can be called according to the area category to which the job applicant belongs.
步骤S303:将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Step S303: Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker’s work status and resignation probability after entering the job.
在本实施例中,步骤S303与图1所示实施例的步骤S103类似,为简约起见,在此不再赘述。In this embodiment, step S303 is similar to step S103 of the embodiment shown in FIG.
图4所示实施例的基于隐马尔可夫模型的离职预测方法,通过获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属区域类别;根据该求职者所属区域类别调用与该区域类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型;将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。相比较于图1所示实施例,通过上述方式,可以提高模型预测准确率,以帮助人力资源管理部门做出更加正确的决定,进一步降低员工离职所带来的损失和减少人力招聘成本。The Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 4 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the region where the job applicant belongs Category; call the hidden Markov model corresponding to the area category for predicting the job applicant’s working status and the probability of resignation according to the area category to which the job applicant belongs; input the static information feature data set into the job applicant’s working status and The Hidden Markov Model of Resignation Probability, which predicts the job status and resignation probability of the job applicant after entering the job. Compared with the embodiment shown in FIG. 1, through the above method, the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
请参阅图5所示,图5是本申请又一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图。需注意的是,若有实质上相同的结果,本申请的方法并不以图5所示的流程顺序为限。如图5所示,该方法包括:Please refer to FIG. 5, which is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 5. As shown in Figure 5, the method includes:
步骤S401:获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属区域类别和工作年限类别。Step S401: Obtain application data of the job applicant, construct a static information feature data set of the job applicant based on the job applicant's application data, and determine the area category and the working years category of the job applicant.
在该步骤S401中,获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集步骤与图1所示实施例的步骤S102类似,确定该求职者所属区域类别步骤与图3所示实施例的步骤S201类似,确定该求职者所属工作年限类别步骤与图4所示实施例的步骤S301类似,为简约起见,在此不再赘述。In this step S401, the application data of the job applicant is obtained, and the step of constructing the static information characteristic data set of the job applicant based on the application data of the job applicant is similar to the step S102 of the embodiment shown in FIG. The steps are similar to step S201 of the embodiment shown in FIG. 3, and the step of determining the working years category of the job applicant is similar to step S301 of the embodiment shown in FIG.
步骤S402:根据该求职者所属区域类别和工作年限类别调用与该区域类别以及工作年限类别均对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。Step S402: According to the region category and the working years category of the job seeker, the hidden Markov model corresponding to the region category and the working years category is called for predicting the job seeker's working status and the probability of leaving the job.
在本实施例中,按照区域分类,在每个区域类别中,预先构建并训练得到不同工作年限类别所对应的预测模型,然后在步骤S402中,就可以根据求职者所属区域类别和工作年限类别调用对应的预测模型。In this embodiment, according to the regional classification, in each regional category, the prediction models corresponding to different working years categories are pre-built and trained, and then in step S402, the job applicant can be based on the regional category and the working years category. Call the corresponding prediction model.
步骤S403:将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Step S403: Input the static information feature data set into the hidden Markov model for predicting the job-seeker's work status and resignation probability, and predict the job-seeker’s work status and resignation probability after entering the job.
在本实施例中,步骤S403与图1所示实施例的步骤S103类似,为简约起见,在此不再赘述。In this embodiment, step S403 is similar to step S103 of the embodiment shown in FIG.
图5所示实施例的基于隐马尔可夫模型的离职预测方法,通过获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集并确定该求职者所属区域类别和工作年限类别;根据该求职者所属区域类别和工作年限类别调用与该区域类别相对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型;将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。相比较于图1所示实施例,通过上述方式,可以提高模型预测准确率,以帮助人力资源管理部门做出更加正确的决定,进一步降低员工离职所带来的损失和减少人力招聘成本。The Hidden Markov Model-based Resignation Prediction Method of the embodiment shown in FIG. 5 obtains the job applicant’s application data, constructs the job applicant’s static information feature data set based on the job applicant’s application data and determines the region where the job applicant belongs Category and working years category; according to the region category and working years category of the job seeker, call the hidden Markov model corresponding to the region category to predict the job seeker’s working status and the probability of leaving the job; input the static information feature data set The Hidden Markov Model for predicting the job status and the probability of leaving the job seeker predicts the job status and the probability of leaving the job after the job seeker. Compared with the embodiment shown in FIG. 1, through the above method, the accuracy of model prediction can be improved to help the human resource management department make more correct decisions, and further reduce the loss caused by employee resignation and reduce the cost of human recruitment.
请参阅图6所示,图6是本申请又一个实施例基于隐马尔可夫模型的离职预测方法的流程示意图。需注意的是,若有实质上相同的结果,本申请的方法并不以图6所示的流程顺序为限。如图6所示,该方法包括:Please refer to FIG. 6, which is a schematic flowchart of a method for predicting turnover based on a hidden Markov model according to another embodiment of the present application. It should be noted that if there is substantially the same result, the method of the present application is not limited to the sequence of the process shown in FIG. 6. As shown in Figure 6, the method includes:
步骤S501:基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对该预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型。Step S501: Based on the influence of the employee’s static information and work status on the employee’s intention to leave, construct a pre-trained hidden Markov model, and train the pre-trained hidden Markov model based on the sample data of hired employees to obtain predictions Hidden Markov Model of Job Seeker's Job Status and Resignation Probability.
可选的,在本实施例中,步骤S501与图1所示实施例的步骤S101类似,为简约起见,在此不再赘述。Optionally, in this embodiment, step S501 is similar to step S101 of the embodiment shown in FIG.
步骤S502:获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集,并根据该求职者的静态信息特征数据集创建求职画像。Step S502: Obtain application data of the job applicant, construct a static information characteristic data set of the job applicant based on the application data of the job applicant, and create a job application portrait based on the static information characteristic data set of the job applicant.
可选的,在本实施例中,步骤S502中的获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集步骤与图1所示实施例的步骤S102类似,为简约起见,在此不再赘述。Optionally, in this embodiment, the step of acquiring the job applicant’s application data in step S502, and constructing the job applicant’s static information feature data set based on the job applicant’s application data is similar to step S102 in the embodiment shown in FIG. 1 , For the sake of simplicity, I won’t repeat it here.
可选的,当获取到求职者的静态信息特征数据集后,可以为该求职者建立专属于该求职者的个人档案(即求职画像),所述个人档案可以配合求职者的照片以及应聘编号,方便查找,所有建立的这些个人档案可以存储在数据库中,由于求职者的应聘编号在企业内部是唯一的,因此存储在数据库中的档案可以根据应聘编号唯一标识进行查找。Optionally, when the static information feature data set of the job applicant is obtained, a personal file (ie job application portrait) dedicated to the job applicant can be created for the job applicant, and the personal file can be matched with the job applicant’s photo and application number , Easy to find, all these personal files created can be stored in the database, because the applicant's application number is unique within the enterprise, so the files stored in the database can be searched according to the unique identification of the application number.
步骤S503:将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Step S503: Input the static information feature data set into the Hidden Markov Model for predicting the job status and resignation probability of the job applicant, and predict the job status and resignation probability of the job applicant after entering the job.
可选的,在本实施例中,步骤S503与图1所示实施例的步骤S503类似,为简约起见,在此不再赘述。Optionally, in this embodiment, step S503 is similar to step S503 of the embodiment shown in FIG.
步骤S504:判断该求职者的离职概率大于预设概率阈值时所对应的入职时间是否大于或等于预设时间阈值。Step S504: Determine whether the corresponding entry time when the job applicant's resignation probability is greater than the preset probability threshold is greater than or equal to the preset time threshold.
可选的,在步骤S504中,预设概率阈值和预设时间阈值可以根据实际需求进行设置。如将预设概率阈值设为0.9,将预设时间阈值设为2年,当然预设时间阈值还可以根据所应聘岗位分别设置,如重要或核心项目岗位的预设时间阈值可以设置得稍长一些,如5年等。Optionally, in step S504, the preset probability threshold and the preset time threshold may be set according to actual needs. For example, the preset probability threshold is set to 0.9, and the preset time threshold is set to 2 years. Of course, the preset time threshold can also be set separately according to the job position. For example, the preset time threshold for important or core project positions can be set slightly longer Some, such as 5 years, etc.
前述已经介绍了隐马尔可夫模型为时序模型,当预测到求职者要离职时,就能对应获知其离职时所对应的入职时间,如果该入职时间大于或等于预设时间阈值,判定该求职者属于不易离职的稳定型求职者,执行步骤S505。否则,判定该求职者属于易离职的不稳定型求职者,执行步骤S506。The aforementioned Hidden Markov Model has been introduced as a time series model. When it is predicted that the job applicant will leave the job, the corresponding entry time at the time of resignation can be correspondingly known. If the entry time is greater than or equal to the preset time threshold, the job search is determined If the job seeker is a stable job seeker who is not easy to resign, step S505 is executed. Otherwise, it is determined that the job seeker is an unstable job seeker who is easy to resign, and step S506 is executed.
步骤S505:在该求职者的求职画像上设置可以录取标识。Step S505: Set an admission mark on the job-seeking portrait of the job-seeker.
步骤S506:在该求职者的求职画像上设置不录取标识。Step S506: Set a non-admission flag on the job-seeking portrait of the job-seeker.
图6所示实施例的基于隐马尔可夫模型的离职预测方法,通过基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对该预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;获取求职者的应聘数据,基于该求职者的应聘数据构建该求职者的静态信息特征数据集,并根据该求职者的静态信息特征数据集创建求职画像;将该静态信息特征数据集输入该用于预测求职者工作状态和离职概率的隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率;判断该求职者的离职概率大于预设概率阈值时所对应的入职时间是否大于或等于预设时间阈值;若是,在该求职者的求职画像上设置可以录取标识;否则,在该求职者的求职画像上设置不录取标识。通过为求职者创建求职画像,根据预设条件判断其是否可以被录取,并在求职画像上设置对应的标识,更加方便人力资源管理部门直观获取可以录取的求职者资料,同时也为人力资源管理部门后序寻找后补员工提供参考信息,使得人力资源管理部门的工作更加便捷,可以提升人力资源管理部门的工作管理效率, 降低招聘成本等。The hidden Markov model-based resignation prediction method of the embodiment shown in Figure 6 constructs a pre-trained hidden Markov model based on the employee’s static information and the influence of the work status on the employee’s resignation intention, and uses the sample data of hired employees Train the pre-trained hidden Markov model to obtain a hidden Markov model for predicting the job status and turnover probability of the job applicant; obtain the job applicant’s application data, and build the job applicant’s static state based on the job applicant’s application data Information feature data set, and create a job application portrait based on the job applicant’s static information feature data set; input the static information feature data set into the hidden Markov model used to predict the job applicant’s work status and the probability of leaving the job to predict the job applicant The job status and resignation probability after entering the job; judge whether the entry time corresponding to the job applicant’s resignation probability is greater than the preset probability threshold is greater than or equal to the preset time threshold; if so, set an admission mark on the job applicant’s portrait ; Otherwise, set a non-admission mark on the job-seeking portrait of the job-seeker. By creating job search portraits for job applicants, judging whether they can be accepted according to preset conditions, and setting corresponding signs on the job search portraits, it is more convenient for the human resources management department to intuitively obtain the information of candidates that can be admitted, and it is also for human resources management The department will search for subsequent employees to provide reference information, which makes the work of the human resources management department more convenient, can improve the work management efficiency of the human resources management department, and reduce the cost of recruitment.
图7是本申请一个实施例基于隐马尔可夫模型的离职预测装置的结构示意图。如图7所示,该离职预测装置60包括第一获取模块61、第二获取模块62和预测模块63。FIG. 7 is a schematic structural diagram of a resignation prediction device based on a hidden Markov model in an embodiment of the present application. As shown in FIG. 7, the resignation prediction device 60 includes a first acquisition module 61, a second acquisition module 62 and a prediction module 63.
其中,第一获取模块61用于基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对该预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型。第二获取模块62用于获取求职者的应聘数据,基于该应聘数据构建该求职者的静态信息特征数据集。预测模块63与第一获取模块61以及第二获取模块62均耦接,用于将该静态信息特征数据集输入隐马尔可夫模型,预测该求职者入职后的工作状态和离职概率。Among them, the first acquisition module 61 is used to construct a pre-training hidden Markov model based on the employee’s static information and work status’s influence on the employee’s intention to leave, and to perform the pre-training hidden Markov model on the sample data of hired employees. Train to obtain a hidden Markov model for predicting the job status and turnover probability of job applicants. The second acquiring module 62 is used to acquire the job applicant's application data, and build the job applicant's static information feature data set based on the job applicant data. The prediction module 63 is coupled with the first acquisition module 61 and the second acquisition module 62, and is used to input the static information feature data set into the hidden Markov model to predict the job status and the probability of leaving the job after the job applicant.
可选的,第一获取模块61获取用于预测求职者工作状态和离职概率的隐马尔可夫模型的操作包括:将员工的静态信息作为输入,以员工的工作状态为隐含状态、员工的离职意向为观测状态,构建预训练隐马尔可夫模型,该预训练隐马尔可夫模型的模型参数包括第一观测概率矩阵A、状态转移概率矩阵B和第二观测概率矩阵C;其中,第一观测概率矩阵A中的元素
Figure PCTCN2020135184-appb-000009
表示静态信息为X i时,工作状态为Y j的概率;状态转移概率矩阵B中的元素
Figure PCTCN2020135184-appb-000010
表示工作状态Y j向工作状态Y k转移的概率;第二观测概率矩阵C中的元素
Figure PCTCN2020135184-appb-000011
表示静态信息为X i时,离职意向为Z j的概率,i,j和k均为大于或等于1的自然数;建立训练集数据,该训练集数据中包括多个第一已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,离职意向标注值序列中的离职意向标注值与工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,第一已录用员工样本中包括第一在职员工样本和第一离职员工样本;利用该训练集数据对该预训练隐马尔可夫模型进行训练。
Optionally, the first obtaining module 61 obtains the hidden Markov model for predicting the job status and the probability of leaving the job. The intention to leave is the observation state, and a pre-training hidden Markov model is constructed. The model parameters of the pre-training hidden Markov model include the first observation probability matrix A, the state transition probability matrix B, and the second observation probability matrix C; An element of the observation probability matrix A
Figure PCTCN2020135184-appb-000009
Static information indicates when X i, Y j operating state probability; the state transition probability matrix element B is
Figure PCTCN2020135184-appb-000010
Indicates the probability of the transition from the working state Y j to the working state Y k ; the element in the second observation probability matrix C
Figure PCTCN2020135184-appb-000011
Static information is represented by X i, Z j turnover intention of probability, i, j and k are a natural number greater than or equal to 1; the establishment of training data, the training data set comprises a first plurality of samples has been hired employee Static information feature data set, work status feature data set sequence, and resignation intention label value sequence. The resignation intention label value in the resignation intention label value sequence corresponds to the work status feature data set in the work status feature data set sequence according to the time sequence. ; Among them, the first hired employee sample includes the first in-service employee sample and the first resigned employee sample; the training set data is used to train the pre-training hidden Markov model.
可选的,第一获取模块61获取用于预测求职者工作状态和离职概率的隐马尔可夫模型的操作还包括:建立测试集数据,该测试集数据中包括多个第二已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,离职意向标注值序列中的离职意向标注值与工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,第二已录用员工样本中包括第二在职员工样本和第二离职员工样本;将该测试集数据中的静态信息特征数据集输入至训练后的预训练隐马尔可夫模型中进行测试,输出每个第二已录用员工样本的工作状态特征数据集预测序列和离职意向标注值预测序列;将工作状态特征数据集预测序列与测试集数据中对应的工作状态特征数据集序列进行比对、以及将离职意向标注值预测序列与测试集数据中对应的离职意向标注值序列进行比对,计算预测准确率;当预测准确率大于或等于预设准确率阈值时,结束训练并确定模型参数;当预测准确率小于预设准确率阈值时,返回建立训练集数据步骤,以对训练后得到的模型参数进行优化。Optionally, the operation of the first obtaining module 61 to obtain the hidden Markov model for predicting the job status and the probability of leaving the job further includes: establishing a test set data, the test set data includes a plurality of second hired employee samples The static information feature data set, the work status feature data set sequence and the resignation intention label value sequence, the resignation intention label value in the resignation intention label value sequence and the work status feature data set in the work status feature data set sequence are sorted by time one by one Correspondence; among them, the second sample of hired employees includes the second sample of incumbent employees and the second sample of resigned employees; the static information feature data set in the test set data is input into the pre-trained hidden Markov model after training Test, output the prediction sequence of the work status feature data set and the expected sequence of the resignation intention label value of each second hired employee sample; compare the prediction sequence of the work status feature data set with the corresponding work status feature data set sequence in the test set data Compare and compare the resignation intention label value prediction sequence with the corresponding resignation intention label value sequence in the test set data to calculate the prediction accuracy; when the prediction accuracy is greater than or equal to the preset accuracy threshold, the training ends and the model is determined Parameters; when the prediction accuracy is less than the preset accuracy threshold, return to the step of establishing the training set data to optimize the model parameters obtained after training.
可选的,静态信息特征数据集中包括:简历信息特征、面试视频特征、以及笔试信息特征中的任意一种或几种的组合。Optionally, the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
可选的,简历信息特征的获取方法包括:将简历文本输入至深度学习的简历文本图神经网络模型中,所述深度学习的简历文本图神经网络模型输出所述简历信息特征。Optionally, the method for obtaining resume information features includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs the resume information feature.
可选的,面试视频特征的获取方法包括:将面试视频中的至少一帧图像输入至深度学习的面试图神经网络模型中,所述深度学习的面试图神经网络模型输出所述 面试视频特征。Optionally, the method for acquiring features of the interview video includes: inputting at least one frame of images in the interview video into a deep-learning facial-attempt neural network model, and the deep-learning facial-attempt neural network model outputs the interview video features.
可选的,笔试信息特征的获取方法包括:将笔试文本输入至深度学习的笔试文本图神经网络模型中,所述深度学习的笔试文本图神经网络模型输出所述笔试信息特征。Optionally, the method for acquiring written test information features includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputs the written test information feature.
可选的,该离职预测装置60还包括与第二获取模块62耦接的确定模型64,用于基于求职者的应聘数据确定求职者所属区域类别。第一获取模块61还与确定模块64耦接,用于根据求职者所属区域类别调用与该区域类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。Optionally, the resignation prediction device 60 further includes a determination model 64 coupled to the second acquisition module 62 for determining the area category to which the job applicant belongs based on the job applicant's application data. The first acquiring module 61 is also coupled to the determining module 64, and is configured to call a Hidden Markov Model corresponding to the region category for predicting the job status and the probability of leaving the job according to the region category to which the job seeker belongs.
可选的,确定模型64还用于基于求职者的应聘数据确定求职者所属工作年限类别。第一获取模块61还用于根据求职者所属工作年限类别调用与该工作年限类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。Optionally, the determination model 64 is also used to determine the type of working years to which the job applicant belongs based on the job application data of the job applicant. The first acquisition module 61 is also configured to call a Hidden Markov Model corresponding to the working years category for predicting the job seeker's working status and the probability of leaving the job according to the working years category to which the job seeker belongs.
可选的,该离职预测装置60还包括与第二获取模块62耦接的创建模块65,用于根据求职者的静态信息特征数据集创建求职画像。Optionally, the resignation prediction device 60 further includes a creation module 65 coupled to the second acquisition module 62, and is configured to create a job-seeking portrait according to the static information feature data set of the job-seeker.
可选的,预测模块63还与创建模块65耦接,用于当预测到求职者的离职概率大于预设概率阈值时所对应的入职时间小于预设时间阈值时,在求职者的求职画像上设置不录取标识;当预测到求职者的离职概率大于预设概率阈值时所对应的入职时间大于或等于预设时间阈值时,在求职者的求职画像上设置可以录取标识。Optionally, the prediction module 63 is also coupled with the creation module 65, and is used to display the job applicant’s job search portrait when it is predicted that the job applicant’s resignation probability is greater than the preset probability threshold and the corresponding entry time is less than the preset time threshold. Set the non-admission flag; when it is predicted that the job applicant’s resignation probability is greater than the preset probability threshold and the corresponding entry time is greater than or equal to the preset time threshold, set an admission flag on the job applicant’s portrait.
请参阅图8,图8为本申请一个实施例计算机设备的结构示意图。如图8所示,该计算机设备70包括处理器71及和处理器71耦接的存储器72。存储器72中存储有计算机可读指令,该计算机可读指令被处理器71执行时,使得处理器71执行上述的基于隐马尔可夫模型的离职预测方法的步骤。Please refer to FIG. 8, which is a schematic structural diagram of a computer device according to an embodiment of the application. As shown in FIG. 8, the computer device 70 includes a processor 71 and a memory 72 coupled to the processor 71. The memory 72 stores computer-readable instructions, and when the computer-readable instructions are executed by the processor 71, the processor 71 executes the steps of the above-mentioned hidden Markov model-based resignation prediction method.
其中,处理器71还可以称为CPU(Central Processing Unit,中央处理单元)。处理器71可能是一种集成电路芯片,具有信号的处理能力。处理器71还可以是通用处理器、数字信号处理器(DSP)、专用集成电路(ASIC)、现场可编程门阵列(FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。The processor 71 may also be referred to as a CPU (Central Processing Unit, central processing unit). The processor 71 may be an integrated circuit chip with signal processing capabilities. The processor 71 may also be a general-purpose processor, a digital signal processor (DSP), an application specific integrated circuit (ASIC), a field programmable gate array (FPGA) or other programmable logic device, a discrete gate or transistor logic device, a discrete hardware component . The general-purpose processor may be a microprocessor or the processor may also be any conventional processor or the like.
参阅图9,图9为本申请一个实施例的存储介质的结构示意图。该存储介质80中存储有计算机可读指令81,该计算机可读指令被一个或多个处理器执行时,使得一个或多个处理器执行上述的基于隐马尔可夫模型的离职预测方法的步骤。其中,该计算机可读指令81可以以软件产品的形式存储在上述存储介质中,包括若干指令用以使得一台计算机设备(可以是个人计算机,服务器,或者网络设备等)或处理器(processor)执行本申请各个实施方式所述方法的全部或部分步骤。而前述的存储介质80包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁碟或者光盘等各种可以存储程序代码的介质,或者是计算机、服务器、手机、平板等终端设备。所述存储介质可以是非易失性的,也可以是易失性的。Refer to FIG. 9, which is a schematic structural diagram of a storage medium according to an embodiment of the application. The storage medium 80 stores computer-readable instructions 81, which when executed by one or more processors, cause the one or more processors to execute the steps of the above-mentioned Hidden Markov Model-based Resignation Prediction Method . Wherein, the computer-readable instruction 81 may be stored in the above-mentioned storage medium in the form of a software product, including several instructions for enabling a computer device (may be a personal computer, a server, or a network device, etc.) or a processor (processor) Perform all or part of the steps of the methods described in the various embodiments of this application. The aforementioned storage medium 80 includes: U disk, mobile hard disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), magnetic disks or optical disks, etc., which can store program codes. Media, or terminal devices such as computers, servers, mobile phones, tablets, etc. The storage medium may be non-volatile or volatile.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统,装置和方法,可以通过其它的方式实现。例如,以上所描述的装置实施例仅仅是示意性的,例如,单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式。In the several embodiments provided in this application, it should be understood that the disclosed system, device, and method can be implemented in other ways. For example, the device embodiments described above are only illustrative, for example, the division of units is only a logical function division, and there may be other division methods in actual implementation.
另外,在本申请各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。以上仅为本申请的实施方式,并非因此限制本申请的专利范围,凡是利用本申请说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技 术领域,均同理包括在本申请的专利保护范围内。In addition, the functional units in the various embodiments of the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units may be integrated into one unit. The above-mentioned integrated unit can be implemented in the form of hardware or software functional unit. The above are only implementations of this application, and do not limit the scope of this application. Any equivalent structure or equivalent process transformation made using the content of the description and drawings of this application, or directly or indirectly applied to other related technical fields, The same reasoning is included in the scope of patent protection of this application.

Claims (20)

  1. 一种基于隐马尔可夫模型的离职预测方法,其中,所述方法包括:A method for predicting turnover based on a hidden Markov model, wherein the method includes:
    基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of job status and turnover probability;
    获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;Obtain job applicant's application data, and construct a static information feature data set of the job applicant based on the application data;
    将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。The static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
  2. 根据权利要求1所述的方法,其中,所述基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The method according to claim 1, wherein the pre-training hidden Markov model is constructed based on the static information of the employee and the influence of the work status on the employee’s intention to leave, and the pre-training hidden Markov model is constructed based on the sample data of hired employees. The Markov model is trained to obtain the hidden Markov model used to predict the job status and turnover probability of job applicants, including:
    将员工的静态信息作为输入,以员工的工作状态为隐含状态、员工的离职意向为观测状态,构建预训练隐马尔可夫模型,所述预训练隐马尔可夫模型的模型参数包括第一观测概率矩阵A、状态转移概率矩阵B和第二观测概率矩阵C;其中,所述第一观测概率矩阵A中的元素
    Figure PCTCN2020135184-appb-100001
    表示静态信息为X i时,工作状态为Y j的概率;所述状态转移概率矩阵B中的元素
    Figure PCTCN2020135184-appb-100002
    表示工作状态Y j向工作状态Y k转移的概率;所述第二观测概率矩阵C中的元素
    Figure PCTCN2020135184-appb-100003
    表示静态信息为X i时,离职意向为Z j的概率,i,j和k均为正整数;
    Taking the employee’s static information as input, taking the employee’s working status as the hidden state and the employee’s turnover intention as the observation state, a pre-training hidden Markov model is constructed. The model parameters of the pre-training hidden Markov model include the first The observation probability matrix A, the state transition probability matrix B and the second observation probability matrix C; wherein, the elements in the first observation probability matrix A
    Figure PCTCN2020135184-appb-100001
    Static information indicates when X i, Y j probability working state; and the state transition probability matrix element B is
    Figure PCTCN2020135184-appb-100002
    Indicates the probability of the transition from the working state Y j to the working state Y k ; the element in the second observation probability matrix C
    Figure PCTCN2020135184-appb-100003
    Represents the static information is X i, intention to quit the probability of Z j, i, j and k are positive integers;
    建立训练集数据,所述训练集数据中包括多个第一已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第一已录用员工样本中包括第一在职员工样本和第一离职员工样本;The training set data is established, and the training set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of first hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work state feature data set in the work state feature data set sequence in a time sequence; wherein, the first hired employee sample includes the first in-service employee sample and the first resigned employee sample;
    利用所述训练集数据对所述预训练隐马尔可夫模型进行训练。Using the training set data to train the pre-training hidden Markov model.
  3. 根据权利要求2所述的方法,其中,所述构建预训练隐马尔可夫模型之后,还包括:The method according to claim 2, wherein after said constructing a pre-trained hidden Markov model, the method further comprises:
    建立测试集数据,所述测试集数据中包括多个第二已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第二已录用员工样本中包括第二在职员工样本和第二离职员工样本;The test set data is established, and the test set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of second hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work status feature data set in the work status feature data set sequence in a chronological order; wherein, the second hired employee sample includes a second in-service employee sample and a second resigned employee sample;
    所述利用所述训练集数据对所述预训练隐马尔可夫模型进行训练之后,还包括:After the training of the pre-training hidden Markov model using the training set data, the method further includes:
    将所述测试集数据中的静态信息特征数据集输入至训练后的预训练隐马尔可夫模型中进行测试,输出每个所述第二已录用员工样本的工作状态特征数据集预测序列和离职意向标注值预测序列;Input the static information feature data set in the test set data into the pre-trained hidden Markov model after training for testing, and output the work status feature data set prediction sequence and resignation of each second hired employee sample Intent label value prediction sequence;
    将所述工作状态特征数据集预测序列与所述测试集数据中对应的工作状态特征数据集序列进行比对、以及将所述离职意向标注值预测序列与所述测试集数据中对应的离职意向标注值序列进行比对,计算预测准确率;Compare the predicted sequence of the work state characteristic data set with the corresponding work state characteristic data set sequence in the test set data, and compare the resignation intention label value prediction sequence with the corresponding resignation intention in the test set data Align the labeled value sequence to calculate the prediction accuracy;
    当所述预测准确率大于或等于预设准确率阈值时,结束训练并确定模型参数;When the prediction accuracy rate is greater than or equal to the preset accuracy rate threshold, the training is ended and the model parameters are determined;
    当所述预测准确率小于所述预设准确率阈值时,返回所述建立训练集数据步骤,以对训练后得到的模型参数进行优化。When the prediction accuracy is less than the preset accuracy threshold, return to the step of establishing training set data to optimize the model parameters obtained after training.
  4. 根据权利要求1-3中任一项所述的方法,其中,所述静态信息特征数据集中包括:简历信息特征、面试视频特征、以及笔试信息特征中的任意一种或几种的组合。The method according to any one of claims 1 to 3, wherein the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
  5. 根据权利要求4所述的方法,其中,The method of claim 4, wherein:
    所述简历信息特征的获取方法包括:将简历文本输入至深度学习的简历文本图神经网络模型中,所述深度学习的简历文本图神经网络模型输出所述简历信息特征;和/或The method for acquiring the resume information feature includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs the resume information feature; and/or
    所述面试视频特征的获取方法包括:将面试视频中的至少一帧图像输入至深度学习的面试图神经网络模型中,所述深度学习的面试图神经网络模型输出所述面试视频特征;和/或The method for acquiring features of the interview video includes: inputting at least one frame of the image in the interview video into a deep learning facet neural network model, and the deep learning facet neural network model outputs the interview video feature; and/ or
    所述笔试信息特征的获取方法包括:将笔试文本输入至深度学习的笔试文本图神经网络模型中,所述深度学习的笔试文本图神经网络模型输出所述笔试信息特征。The method for acquiring the written test information feature includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputting the written test information feature.
  6. 根据权利要求1所述的方法,其中,The method of claim 1, wherein:
    所述方法还包括:基于所述求职者的应聘数据确定所述求职者所属区域类别;The method further includes: determining the area category to which the job seeker belongs based on the application data of the job seeker;
    所述获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The acquisition of the hidden Markov model used to predict the job status and turnover probability of job applicants includes:
    根据所述求职者所属区域类别调用与所述区域类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型;和/或According to the region category to which the job seeker belongs, call the hidden Markov model corresponding to the region category for predicting the job status and the probability of leaving the job; and/or
    所述方法还包括:基于所述求职者的应聘数据确定所述求职者所属工作年限类别;The method further includes: determining the type of working years to which the job applicant belongs based on the application data of the job applicant;
    所述获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The acquisition of the hidden Markov model used to predict the job status and turnover probability of job applicants includes:
    根据所述求职者所属工作年限类别调用与所述工作年限类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。According to the working years category of the job applicant, a hidden Markov model corresponding to the working years category for predicting the job status and the probability of leaving the job is invoked.
  7. 根据权利要求1所述的方法,其中,所述方法还包括:根据所述求职者的静态信息特征数据集创建求职画像;The method according to claim 1, wherein the method further comprises: creating a job application portrait according to the static information feature data set of the job applicant;
    所述预测所述求职者入职后的工作状态和离职概率之后,还包括:After predicting the job status and resignation probability of the job seeker after entering the job, it further includes:
    当预测到所述求职者的离职概率大于预设概率阈值时所对应的入职时间小于预设时间阈值时,在所述求职者的求职画像上设置不录取标识;When it is predicted that the resignation probability of the job seeker is greater than the preset probability threshold, the corresponding entry time is less than the preset time threshold, setting a non-admission flag on the job seeker portrait of the job seeker;
    当预测到所述求职者的离职概率大于预设概率阈值时所对应的入职时间大于或等于所述预设时间阈值时,在所述求职者的求职画像上设置可以录取标识。When it is predicted that the resignation probability of the job seeker is greater than the preset probability threshold, and the corresponding entry time is greater than or equal to the preset time threshold, an admission mark is set on the job seeker portrait of the job seeker.
  8. 一种基于隐马尔可夫模型的离职预测装置,其中,所述装置包括:A resignation prediction device based on a hidden Markov model, wherein the device includes:
    第一获取模块,所述第一获取模块用于基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;The first acquisition module is used to construct a pre-training hidden Markov model based on the employee’s static information and the influence of the work status on the employee’s intention to leave, and to use the sample data of the hired employee to hide the pre-training The Markov model is trained to obtain the hidden Markov model used to predict the job status and the probability of leaving the job;
    第二获取模块,所述第二获取模块用于获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;A second acquisition module, the second acquisition module is used to acquire job applicants’ application data, and build a static information feature data set of the job applicants based on the application data;
    预测模块,所述预测模块用于将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。A prediction module, which is used to input the static information feature data set into the hidden Markov model to predict the job status of the job applicant and the probability of leaving the job.
  9. 一种计算机设备,包括存储器和处理器,所述存储器存储有用于实现以下步骤的程序指令,所述步骤包括:A computer device includes a memory and a processor, the memory stores program instructions for implementing the following steps, the steps including:
    基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取 用于预测求职者工作状态和离职概率的隐马尔可夫模型;Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of job status and turnover probability;
    获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;Obtain job applicant's application data, and construct a static information feature data set of the job applicant based on the application data;
    将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。The static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
  10. 根据权利要求9所述的计算机设备,其中,所述基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The computer device according to claim 9, wherein the pre-training hidden Markov model is constructed based on the employee’s static information and the influence of the work status on the employee’s intention to leave, and the pre-training The hidden Markov model is trained to obtain the hidden Markov model used to predict the job status and the probability of leaving the job, including:
    将员工的静态信息作为输入,以员工的工作状态为隐含状态、员工的离职意向为观测状态,构建预训练隐马尔可夫模型,所述预训练隐马尔可夫模型的模型参数包括第一观测概率矩阵A、状态转移概率矩阵B和第二观测概率矩阵C;其中,所述第一观测概率矩阵A中的元素
    Figure PCTCN2020135184-appb-100004
    表示静态信息为X i时,工作状态为Y j的概率;所述状态转移概率矩阵B中的元素
    Figure PCTCN2020135184-appb-100005
    表示工作状态Y j向工作状态Y k转移的概率;所述第二观测概率矩阵C中的元素
    Figure PCTCN2020135184-appb-100006
    表示静态信息为X i时,离职意向为Z j的概率,i,j和k均为正整数;
    Taking the employee’s static information as input, taking the employee’s working status as the hidden state and the employee’s turnover intention as the observation state, a pre-training hidden Markov model is constructed. The model parameters of the pre-training hidden Markov model include the first The observation probability matrix A, the state transition probability matrix B and the second observation probability matrix C; wherein, the elements in the first observation probability matrix A
    Figure PCTCN2020135184-appb-100004
    Static information indicates when X i, Y j probability working state; and the state transition probability matrix element B is
    Figure PCTCN2020135184-appb-100005
    Indicates the probability of the transition from the working state Y j to the working state Y k ; the element in the second observation probability matrix C
    Figure PCTCN2020135184-appb-100006
    Represents the static information is X i, intention to quit the probability of Z j, i, j and k are positive integers;
    建立训练集数据,所述训练集数据中包括多个第一已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第一已录用员工样本中包括第一在职员工样本和第一离职员工样本;The training set data is established, and the training set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of first hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work state feature data set in the work state feature data set sequence in a time sequence; wherein, the first hired employee sample includes the first in-service employee sample and the first resigned employee sample;
    利用所述训练集数据对所述预训练隐马尔可夫模型进行训练。Using the training set data to train the pre-training hidden Markov model.
  11. 根据权利要求10所述的计算机设备,其中,所述构建预训练隐马尔可夫模型之后,还包括:The computer device according to claim 10, wherein after said constructing a pre-trained hidden Markov model, it further comprises:
    建立测试集数据,所述测试集数据中包括多个第二已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第二已录用员工样本中包括第二在职员工样本和第二离职员工样本;The test set data is established, and the test set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of second hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work status feature data set in the work status feature data set sequence in a chronological order; wherein, the second hired employee sample includes a second in-service employee sample and a second resigned employee sample;
    所述利用所述训练集数据对所述预训练隐马尔可夫模型进行训练之后,还包括:After the training of the pre-training hidden Markov model using the training set data, the method further includes:
    将所述测试集数据中的静态信息特征数据集输入至训练后的预训练隐马尔可夫模型中进行测试,输出每个所述第二已录用员工样本的工作状态特征数据集预测序列和离职意向标注值预测序列;Input the static information feature data set in the test set data into the pre-trained hidden Markov model after training for testing, and output the work status feature data set prediction sequence and resignation of each second hired employee sample Intent label value prediction sequence;
    将所述工作状态特征数据集预测序列与所述测试集数据中对应的工作状态特征数据集序列进行比对、以及将所述离职意向标注值预测序列与所述测试集数据中对应的离职意向标注值序列进行比对,计算预测准确率;Compare the predicted sequence of the work state characteristic data set with the corresponding work state characteristic data set sequence in the test set data, and compare the resignation intention label value prediction sequence with the corresponding resignation intention in the test set data Align the labeled value sequence to calculate the prediction accuracy;
    当所述预测准确率大于或等于预设准确率阈值时,结束训练并确定模型参数;When the prediction accuracy rate is greater than or equal to the preset accuracy rate threshold, the training is ended and the model parameters are determined;
    当所述预测准确率小于所述预设准确率阈值时,返回所述建立训练集数据步骤,以对训练后得到的模型参数进行优化。When the prediction accuracy is less than the preset accuracy threshold, return to the step of establishing training set data to optimize the model parameters obtained after training.
  12. 根据权利要求9-11中任一项所述的计算机设备,其中,所述静态信息特征数据集中包括:简历信息特征、面试视频特征、以及笔试信息特征中的任意一种或几种的组合。The computer device according to any one of claims 9-11, wherein the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
  13. 根据权利要求12所述的计算机设备,其中,The computer device according to claim 12, wherein:
    所述简历信息特征的获取方法包括:将简历文本输入至深度学习的简历文本图 神经网络模型中,所述深度学习的简历文本图神经网络模型输出所述简历信息特征;和/或The method for obtaining the resume information feature includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs the resume information feature; and/or
    所述面试视频特征的获取方法包括:将面试视频中的至少一帧图像输入至深度学习的面试图神经网络模型中,所述深度学习的面试图神经网络模型输出所述面试视频特征;和/或The method for acquiring features of the interview video includes: inputting at least one frame of the image in the interview video into a deep learning facet neural network model, and the deep learning facet neural network model outputs the interview video feature; and/ or
    所述笔试信息特征的获取方法包括:将笔试文本输入至深度学习的笔试文本图神经网络模型中,所述深度学习的笔试文本图神经网络模型输出所述笔试信息特征。The method for acquiring the written test information feature includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputting the written test information feature.
  14. 根据权利要求9所述的计算机设备,其中,The computer device according to claim 9, wherein:
    所述方法还包括:基于所述求职者的应聘数据确定所述求职者所属区域类别;The method further includes: determining the area category to which the job seeker belongs based on the application data of the job seeker;
    所述获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The acquisition of the hidden Markov model used to predict the job status and turnover probability of job applicants includes:
    根据所述求职者所属区域类别调用与所述区域类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型;和/或According to the region category to which the job seeker belongs, call the hidden Markov model corresponding to the region category for predicting the job status and the probability of leaving the job; and/or
    所述方法还包括:基于所述求职者的应聘数据确定所述求职者所属工作年限类别;The method further includes: determining the type of working years to which the job applicant belongs based on the application data of the job applicant;
    所述获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The acquiring a hidden Markov model used to predict the job status and turnover probability of job applicants includes:
    根据所述求职者所属工作年限类别调用与所述工作年限类别对应的用于预测求职者工作状态和离职概率的隐马尔可夫模型。According to the working years category of the job applicant, a hidden Markov model corresponding to the working years category for predicting the job status and the probability of leaving the job is invoked.
  15. 根据权利要求9所述的计算机设备,其中,所述计算机设备执行的步骤还包括:根据所述求职者的静态信息特征数据集创建求职画像;The computer device according to claim 9, wherein the steps performed by the computer device further comprise: creating a job application portrait according to the static information feature data set of the job applicant;
    所述预测所述求职者入职后的工作状态和离职概率之后,还包括:After predicting the job status and resignation probability of the job seeker after entering the job, it further includes:
    当预测到所述求职者的离职概率大于预设概率阈值时所对应的入职时间小于预设时间阈值时,在所述求职者的求职画像上设置不录取标识;When it is predicted that the resignation probability of the job seeker is greater than the preset probability threshold, the corresponding entry time is less than the preset time threshold, setting a non-admission flag on the job seeker portrait of the job seeker;
    当预测到所述求职者的离职概率大于预设概率阈值时所对应的入职时间大于或等于所述预设时间阈值时,在所述求职者的求职画像上设置可以录取标识。When it is predicted that the resignation probability of the job seeker is greater than the preset probability threshold, and the corresponding entry time is greater than or equal to the preset time threshold, an admission mark is set on the job seeker portrait of the job seeker.
  16. 一种存储有计算机可读指令的存储介质,所述计算机可读指令被一个或多个处理器执行时,实现如下步骤的计算机可读指令,所述步骤包括:A storage medium storing computer-readable instructions. When the computer-readable instructions are executed by one or more processors, the computer-readable instructions implement the following steps, the steps including:
    基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型;Based on the influence of the employee’s static information and work status on the employee’s intention to leave, a pre-trained hidden Markov model is constructed, and the pre-trained hidden Markov model is trained based on the sample data of hired employees to obtain candidates for predicting job applicants Hidden Markov model of job status and turnover probability;
    获取求职者的应聘数据,基于所述应聘数据构建所述求职者的静态信息特征数据集;Obtain job applicant's application data, and construct a static information feature data set of the job applicant based on the application data;
    将所述静态信息特征数据集输入所述隐马尔可夫模型,预测所述求职者入职后的工作状态和离职概率。The static information feature data set is input into the hidden Markov model to predict the job status and the resignation probability of the job applicant after entering the job.
  17. 根据权利要求16所述的存储介质,其中,所述基于员工的静态信息和工作状态对员工离职意向的影响,构建预训练隐马尔可夫模型,并通过已录用员工样本数据对所述预训练隐马尔可夫模型进行训练,获取用于预测求职者工作状态和离职概率的隐马尔可夫模型,包括:The storage medium according to claim 16, wherein the pre-training hidden Markov model is constructed based on the static information of the employee and the influence of the work status on the employee's intention to leave, and the pre-training is performed based on the sample data of hired employees The hidden Markov model is trained to obtain the hidden Markov model used to predict the job status and turnover probability of job applicants, including:
    将员工的静态信息作为输入,以员工的工作状态为隐含状态、员工的离职意向为观测状态,构建预训练隐马尔可夫模型,所述预训练隐马尔可夫模型的模型参数包括第一观测概率矩阵A、状态转移概率矩阵B和第二观测概率矩阵C;其中,所述第一观测概率矩阵A中的元素
    Figure PCTCN2020135184-appb-100007
    表示静态信息为X i时,工作状态为Y j的概率;所述状态转移概率矩阵B中的元素
    Figure PCTCN2020135184-appb-100008
    表示工作状态Y j向工作状态Y k转移的概率;所述第二观测概率矩阵C中的元素
    Figure PCTCN2020135184-appb-100009
    表示静态信息为X i时,离职意向为Z j的概率,i,j和k均为正整数;
    Taking the employee’s static information as input, taking the employee’s working status as the hidden state and the employee’s turnover intention as the observation state, a pre-training hidden Markov model is constructed. The model parameters of the pre-training hidden Markov model include the first The observation probability matrix A, the state transition probability matrix B and the second observation probability matrix C; wherein, the elements in the first observation probability matrix A
    Figure PCTCN2020135184-appb-100007
    Static information indicates when X i, Y j probability working state; and the state transition probability matrix element B is
    Figure PCTCN2020135184-appb-100008
    Indicates the probability of the transition from the working state Y j to the working state Y k ; the element in the second observation probability matrix C
    Figure PCTCN2020135184-appb-100009
    Represents the static information is X i, intention to quit the probability of Z j, i, j and k are positive integers;
    建立训练集数据,所述训练集数据中包括多个第一已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第一已录用员工样本中包括第一在职员工样本和第一离职员工样本;The training set data is established, and the training set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of first hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work state feature data set in the work state feature data set sequence in a time sequence; wherein, the first hired employee sample includes the first in-service employee sample and the first resigned employee sample;
    利用所述训练集数据对所述预训练隐马尔可夫模型进行训练。Using the training set data to train the pre-training hidden Markov model.
  18. 根据权利要求17所述的存储介质,其中,所述构建预训练隐马尔可夫模型之后,还包括:The storage medium according to claim 17, wherein, after the construction of the pre-training hidden Markov model, the method further comprises:
    建立测试集数据,所述测试集数据中包括多个第二已录用员工样本的静态信息特征数据集、工作状态特征数据集序列以及离职意向标注值序列,所述离职意向标注值序列中的离职意向标注值与所述工作状态特征数据集序列中的工作状态特征数据集按照时间排序一一对应;其中,所述第二已录用员工样本中包括第二在职员工样本和第二离职员工样本;The test set data is established, and the test set data includes a static information feature data set, a work status feature data set sequence, and a resignation intention label value sequence of a plurality of second hired employee samples, and resignation in the resignation intention label value sequence The intention label value corresponds to the work status feature data set in the work status feature data set sequence in a chronological order; wherein, the second hired employee sample includes a second in-service employee sample and a second resigned employee sample;
    所述利用所述训练集数据对所述预训练隐马尔可夫模型进行训练之后,还包括:After the training of the pre-training hidden Markov model using the training set data, the method further includes:
    将所述测试集数据中的静态信息特征数据集输入至训练后的预训练隐马尔可夫模型中进行测试,输出每个所述第二已录用员工样本的工作状态特征数据集预测序列和离职意向标注值预测序列;Input the static information feature data set in the test set data into the pre-trained hidden Markov model after training for testing, and output the work status feature data set prediction sequence and resignation of each second hired employee sample Intent label value prediction sequence;
    将所述工作状态特征数据集预测序列与所述测试集数据中对应的工作状态特征数据集序列进行比对、以及将所述离职意向标注值预测序列与所述测试集数据中对应的离职意向标注值序列进行比对,计算预测准确率;Compare the predicted sequence of the work state characteristic data set with the corresponding work state characteristic data set sequence in the test set data, and compare the resignation intention label value prediction sequence with the corresponding resignation intention in the test set data Align the labeled value sequence to calculate the prediction accuracy;
    当所述预测准确率大于或等于预设准确率阈值时,结束训练并确定模型参数;When the prediction accuracy rate is greater than or equal to the preset accuracy rate threshold, the training is ended and the model parameters are determined;
    当所述预测准确率小于所述预设准确率阈值时,返回所述建立训练集数据步骤,以对训练后得到的模型参数进行优化。When the prediction accuracy rate is less than the preset accuracy rate threshold, return to the step of establishing training set data to optimize the model parameters obtained after training.
  19. 根据权利要求16-18中任一项所述的计算机设备,其中,所述静态信息特征数据集中包括:简历信息特征、面试视频特征、以及笔试信息特征中的任意一种或几种的组合。The computer device according to any one of claims 16-18, wherein the static information feature data set includes any one or a combination of resume information features, interview video features, and written examination information features.
  20. 根据权利要求19所述的计算机设备,其中,The computer device according to claim 19, wherein:
    所述简历信息特征的获取方法包括:将简历文本输入至深度学习的简历文本图神经网络模型中,所述深度学习的简历文本图神经网络模型输出所述简历信息特征;和/或The method for acquiring the resume information feature includes: inputting resume text into a deep learning resume text graph neural network model, and the deep learning resume text graph neural network model outputs the resume information feature; and/or
    所述面试视频特征的获取方法包括:将面试视频中的至少一帧图像输入至深度学习的面试图神经网络模型中,所述深度学习的面试图神经网络模型输出所述面试视频特征;和/或The method for acquiring features of the interview video includes: inputting at least one frame of the image in the interview video into a deep learning facet neural network model, and the deep learning facet neural network model outputs the interview video feature; and/ or
    所述笔试信息特征的获取方法包括:将笔试文本输入至深度学习的笔试文本图神经网络模型中,所述深度学习的笔试文本图神经网络模型输出所述笔试信息特征。The method for acquiring the written test information feature includes: inputting the written test text into a deep learning written test text graph neural network model, and the deep learning written test text graph neural network model outputting the written test information feature.
PCT/CN2020/135184 2020-10-21 2020-12-10 Hidden markov model-based resignation prediction method and related device WO2021179715A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202011134554.2A CN112257777B (en) 2020-10-21 2020-10-21 Off-duty prediction method and related device based on hidden Markov model
CN202011134554.2 2020-10-21

Publications (1)

Publication Number Publication Date
WO2021179715A1 true WO2021179715A1 (en) 2021-09-16

Family

ID=74263263

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/135184 WO2021179715A1 (en) 2020-10-21 2020-12-10 Hidden markov model-based resignation prediction method and related device

Country Status (2)

Country Link
CN (1) CN112257777B (en)
WO (1) WO2021179715A1 (en)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822355A (en) * 2021-09-22 2021-12-21 华北电力科学研究院有限责任公司 Composite attack prediction method and device based on improved hidden Markov model
CN114282169A (en) * 2021-10-12 2022-04-05 腾讯科技(深圳)有限公司 Abnormal data detection method and related device
CN114676743A (en) * 2021-12-09 2022-06-28 上海无线电设备研究所 Low-slow small target track threat identification method based on hidden Markov model
CN116680590A (en) * 2023-07-28 2023-09-01 中国人民解放军国防科技大学 Post portrait label extraction method and device based on work instruction analysis
CN116776158A (en) * 2023-08-22 2023-09-19 长沙隼眼软件科技有限公司 Target classification model training method, target classification device and storage medium
CN117408660A (en) * 2023-12-15 2024-01-16 山东杰出人才发展集团有限公司 Human resource data service management system based on big data

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114925287B (en) * 2022-07-22 2022-11-18 天津大学 Intelligent knowledge management system and method based on big data

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106022708A (en) * 2016-05-09 2016-10-12 陈包容 Method for predicting employee resignation
CN108805413A (en) * 2018-05-21 2018-11-13 中国平安人寿保险股份有限公司 Labor turnover Risk Forecast Method, device, computer equipment and storage medium
CN109710930A (en) * 2018-12-20 2019-05-03 重庆邮电大学 A kind of Chinese Resume analytic method based on deep neural network
US20190385124A1 (en) * 2018-06-18 2019-12-19 Adp, Llc Targeting delivery of recruiting messages
CN110659757A (en) * 2018-06-29 2020-01-07 北京京东尚科信息技术有限公司 Employee departure prediction method and device
CN110782072A (en) * 2019-09-29 2020-02-11 广州荔支网络技术有限公司 Employee leave risk prediction method, device, equipment and readable storage medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140122355A1 (en) * 2012-10-26 2014-05-01 Bright Media Corporation Identifying candidates for job openings using a scoring function based on features in resumes and job descriptions
US10949762B2 (en) * 2015-09-30 2021-03-16 Tata Consultancy Services Limited Methods and systems for optimizing hidden Markov Model based land change prediction
CN105894253A (en) * 2016-05-09 2016-08-24 陈包容 Method and device for automatic pushing of job application demand
JP6762584B2 (en) * 2018-11-05 2020-09-30 株式会社アッテル Learning model construction device, post-employment evaluation prediction device, learning model construction method and post-employment evaluation prediction method
CN110288046B (en) * 2019-07-02 2022-11-18 南京恩瑞特实业有限公司 Fault prediction method based on wavelet neural network and hidden Markov model

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106022708A (en) * 2016-05-09 2016-10-12 陈包容 Method for predicting employee resignation
CN108805413A (en) * 2018-05-21 2018-11-13 中国平安人寿保险股份有限公司 Labor turnover Risk Forecast Method, device, computer equipment and storage medium
US20190385124A1 (en) * 2018-06-18 2019-12-19 Adp, Llc Targeting delivery of recruiting messages
CN110659757A (en) * 2018-06-29 2020-01-07 北京京东尚科信息技术有限公司 Employee departure prediction method and device
CN109710930A (en) * 2018-12-20 2019-05-03 重庆邮电大学 A kind of Chinese Resume analytic method based on deep neural network
CN110782072A (en) * 2019-09-29 2020-02-11 广州荔支网络技术有限公司 Employee leave risk prediction method, device, equipment and readable storage medium

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
XIE ZHI-HENG, LI QUAN-SHENG: "Forecasting of the Core Staff Flow in Enterprise Based on Markov Chain", EAST CHINA ECONOMIC MANAGEMENT, vol. 23, no. 10, 31 October 2009 (2009-10-31), pages 98 - 100, XP055854574, ISSN: 1007-5097 *

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113822355A (en) * 2021-09-22 2021-12-21 华北电力科学研究院有限责任公司 Composite attack prediction method and device based on improved hidden Markov model
CN114282169A (en) * 2021-10-12 2022-04-05 腾讯科技(深圳)有限公司 Abnormal data detection method and related device
CN114676743A (en) * 2021-12-09 2022-06-28 上海无线电设备研究所 Low-slow small target track threat identification method based on hidden Markov model
CN114676743B (en) * 2021-12-09 2024-04-26 上海无线电设备研究所 Low-speed small target track threat identification method based on hidden Markov model
CN116680590A (en) * 2023-07-28 2023-09-01 中国人民解放军国防科技大学 Post portrait label extraction method and device based on work instruction analysis
CN116680590B (en) * 2023-07-28 2023-10-20 中国人民解放军国防科技大学 Post portrait label extraction method and device based on work instruction analysis
CN116776158A (en) * 2023-08-22 2023-09-19 长沙隼眼软件科技有限公司 Target classification model training method, target classification device and storage medium
CN116776158B (en) * 2023-08-22 2023-11-14 长沙隼眼软件科技有限公司 Target classification method, device and storage medium
CN117408660A (en) * 2023-12-15 2024-01-16 山东杰出人才发展集团有限公司 Human resource data service management system based on big data

Also Published As

Publication number Publication date
CN112257777B (en) 2023-09-05
CN112257777A (en) 2021-01-22

Similar Documents

Publication Publication Date Title
WO2021179715A1 (en) Hidden markov model-based resignation prediction method and related device
US10152696B2 (en) Methods and systems for providing predictive metrics in a talent management application
CA3129745C (en) Neural network system for text classification
US20230342608A1 (en) Systems and processes for bias removal in a predictive performance model
US10127522B2 (en) Automatic profiling of social media users
JP5965557B1 (en) Similarity learning system and similarity learning method
US10147020B1 (en) System and method for computational disambiguation and prediction of dynamic hierarchical data structures
De Mauro et al. Beyond data scientists: a review of big data skills and job families
Jekel et al. How to teach open science principles in the undergraduate curriculum—The Hagen cumulative science project
Schalck et al. Predicting French SME failures: new evidence from machine learning techniques
Rhyn et al. A machine learning approach for classifying textual data in crowdsourcing
McCutcheon Reviewing pronatalism: a summary and critical analysis of prior research examining attitudes towards women without children
Rehan et al. Employees reviews classification and evaluation (ERCE) model using supervised machine learning approaches
JP2020129232A (en) Machine learning device, program, and machine learning method
Gendronneau et al. Measuring labour mobility and migration using big data
Bharadwaj et al. Resume Screening using NLP and LSTM
Park et al. Developing an advanced prediction model for new employee turnover intention utilizing machine learning techniques
Câmpeanu et al. The impact of higher education funding on socio-economic variables: Evidence from EU countries
US20210357699A1 (en) Data quality assessment for data analytics
Poon Relationships between demographic factors and employment prospects of architecture, construction and urban planning graduates
Wang et al. Intelligent Crowdsourced Testing
WO2021129368A1 (en) Method and apparatus for determining client type
Romanov et al. Applying AI in Education Creating a Grading Prediction System and Digitalizing Student Profiles
Tahamont et al. No ground truth? No problem: Improving administrative data linking using active learning and a little bit of guile
Wang A study of student performance under English teaching using a decision tree algorithm

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20923714

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20923714

Country of ref document: EP

Kind code of ref document: A1