CN105930353A

CN105930353A - Feature subdivision method for call center customer service staff in mass data

Info

Publication number: CN105930353A
Application number: CN201610216835.XA
Authority: CN
Inventors: 程宏亮; 卢耀宗; 强劲; 黄蓉
Original assignee: Xi'an Merit Data Technology Co Ltd
Current assignee: Xi'an Merit Data Technology Co Ltd
Priority date: 2016-04-08
Filing date: 2016-04-08
Publication date: 2016-09-07

Abstract

The invention discloses a feature subdivision method for call center customer service staff in mass data. The feature subdivision method includes: acquiring a preset number of workers; determining a cluster index, and determining data corresponding to the cluster index, of each worker as cluster index data of the worker; performing clustering through a K-Means algorithm according to the cluster index data of each worker so as to obtain a cluster result; and dividing the workers into different classes according to the cluster result to obtain a group result. A call center work management method can automatically adapt worker subdivision work in the mass data, is high in accuracy, and can avoid the situations that classification based on subjective experience does not confirm with actual task performance of the workers.

Description

A kind of feature divided method of the call center contact staff under mass data

Technical field

The present invention relates to call center management technical field, more particularly, it relates under a kind of mass data The feature divided method of call center contact staff.

Background technology

In the management to the operator of call center, generally by manager or teams and groups' pipe of call center Reason person's subjectivity determines to be broadly divided into operator a few class.But, in view of call center's seat personnel quantity Reach scale up to ten thousand, this will management level to the subjective understanding of some operator and its actual work performance Difference is relatively big, thus well can not remove arranged rational related work according to the ability to work of operator. Accordingly, it would be desirable to a kind of technology can help manager preferably to recognize the general performance of seat personnel of its subordinate, Preferably carry out such as run enhancing efficiency, attend a banquet arrange an order according to class and grade, the work of the related management such as skills training.

Summary of the invention

It is an object of the invention to provide the staff's feature segmentation side of call center under a kind of mass data Method, to solve present in prior art the cognitive Bias problem to staff, assists management personnel to enter Row aid decision.

To achieve these goals, the present invention provides following technical scheme:

A kind of management method of the call center's staff's feature segmentation under mass data, including:

Obtain the KPI achievement data that the staff of the some teams and groups of M row S row examined within some years；

For the pretreatment of described KPI achievement data, described pretreatment mode include missing values supplement, Replacement and last the process by data normalization of exceptional value obtain clustering target data；

The clustering target data separate distributed K-Means algorithm that will have processed, carries out client characteristics and hives off, Obtain the segmentation result of client.

Preferably, in described acquisition each described some years, staff examines number corresponding to KPI index According to, including: percent of call completed, incoming call project occupation rate, exhalation project work efficiency, average handling time, Average queuing time, rate of solving the problem once and for all, average cost per contact, the rate of attendance, business evaluating achievement And satisfaction rate；

Wherein, described percent of call completed refers to the connection of the whole level service unit of IVR (Interactive Voice Response) Measure the ratio of the connection amount sum with manual position and the calling total amount entering call center；

In described incoming call project occupation rate refers to certain section of timing statistics, seat person process take on the telephone more total time The long ratio with actual log system duration；

In described exhalation project work efficiency refers to certain section of timing statistics, total handling duration and login system duration Ratio；

Described average handling time: once get in touch with required average time；

Described average queuing time refers in certain section of timing statistics, and caller waits artificial seat after shortlisting The average waiting duration of the wait that seat is answered；

The described rate of solving the problem once and for all is in certain section of timing statistics, it is not necessary to client dials in calling again Telephone redialing or switching are accounted for seat person with regard to soluble phone amount and pick up electricity also without seat person by center The percentage ratio of words total amount；

The described rate of attendance refers in certain statistical time range, and the actual number turned out for work of certain teams and groups is turned out for work with plan The percentage rate of number；

In described average cost per contact refers to certain section of timing statistics, the full payment of call center divided by phone at Reason amount；

Described business evaluating achievement refers to that call center's seat is to the Grasping level of professional knowledge and described satisfaction Degree refers to the service satisfactory degree that call center is provided by client.

Preferably, supplementing of the described missing values that above-mentioned KPI performance assessment criteria data are carried out pretreatment, bag Include:

Determine total observation data S row, the KPI that the staff of corresponding some teams and groups examined within some years Index, these indexs are cluster input attribute field；

Data, its attribute field X is observed if there is L row_i(i≤M) has missing values or all inputs all Record or the observation field with missing values do not have the data corresponding with ATTRIBUTE INDEX, it is determined that this L Row data need to carry out missing values process；

Determine that residue S-L row data, for reference to data, add up every in these observation data M attribute fields Individual attribute X_iThe value that (i≤M) occurrence number is most, i.e. mode；

Missing data in above-mentioned L row M attribute field of observation is filled with, is replaced with mode, Complete corresponding attribute X_iThe missing values of (i≤M) processes.

Preferably, the replacement of the described abnormal data that above-mentioned KPI performance assessment criteria data are carried out pretreatment, Including:

Raw sample data S row, each attribute X_iThe average of (i≤M) isStandard deviation is δ_iIf existingThen this observation is an abnormity point.

For abnormal data, ifComplete the replacement of abnormal data.

Preferably, described above-mentioned KPI performance assessment criteria data are normalized, including:

ATTRIBUTE INDEX is carried out standardization processing, can be allowed to by ATTRIBUTE INDEX data bi-directional scaling Enter a little specific interval；By min-max method for normalizing, utilize following functional transformation formula:

Y_{i} = \frac{X_{i} - X_{m i n}}{X_{m a x} - X_{m i n}}

X_max,X_minMaximum and minima, the X of attribute field is concentrated for original sample_i,Y_iIt is respectively input Value before and after samples normalization；The numerical value x of each clustering target data is mapped to the interval [0,1] of correspondence, To realize the unitized of different clustering target, obtain the clustering target data with unified metric unit.

Preferably, the described parallel K-Means of clustering target data separate according to each described staff Algorithm clusters, and obtains cluster result, including:

Initial phase: first determine cluster number k, concentrate from initial data at random and choose k data conduct Initial classes center, is divided into several data blocks simultaneously by raw data set；Determine class center at random, such as Original have 10 points, determines that wherein K=2 point is as class center at random.

The MAP stage: each data block (comprising many data) correspondence is distributed to a map function, For this data block, calculate wherein every data generic；

The REDUCE stage: this stage obtains complete cluster result, specifically by each piece of cluster result of merger It is to calculate to belong to the average of same category data as Xin Lei center, and as next iteration Input, until algorithmic statement；Xin Lei center is to be obtained by the mean value computation of same category data.Ratio As comprised 10 points under a certain class, calculate the average of these 10 points, obtain class central point；

The results verification stage: according to described cluster result, described staff is divided into different classes, obtains spy Levy segmentation result；

The result application stage: determine the pipe of such staff according to the feature of the staff of each class Reason mode.

Preferably, in the described results verification stage, need to cluster according to the cluster result of described staff The determination of optimum number, including:

Initial range is chosen: preset certain cluster number scope [k₁,k₂], and cluster number is that this is interval Interior integer；

Inter-object distance calculates: to [k₁,k₂Any k in], calculates under current cluster result in same class Distance isWherein x_iFor clustering the data point in j,For clustering the apoplexy due to endogenous wind of j Heart point；

Between class distance calculates: to [k₁,k₂Any k in], calculates between classes different under current cluster result Distance beWhereinFor clustering the data point in i,For cluster j's Class central point；

Optimal cluster number determines: set up Cluster Validity FunctionCarry out cluster result evaluation, When L (k) is minimum, correspondence obtains optimal cluster number k, thus avoids clusters number and exist on choosing Subjectivity.

The manager of the call center's staff's feature segmentation under a kind of mass data that the present invention provides Method, including: obtain the KPI index that the staff of the some teams and groups of M row S row examined within some years Data；Carrying out the pretreatment of above-mentioned data, including the supplementing of missing values, the replacement of exceptional value, data are returned One change process etc.；Utilize distributed K-Means algorithm, carry out client characteristics and hive off, obtain the thin of client Divide result；

Compared with prior art, the application is not simply to set index weights and carries out KPI marking examination, But it is entered according to the examination KPI achievement data of each staff by parallel K-Means algorithm Row hives off, and obtains the general cognitive about seat personnel task performance.And K-Means algorithm is pin parallel Cluster under mass data, the iteration of calculating process can be completed more quickly, can be the completeest Become correlated characteristic segmentation work, therefore, work people in call center's under a kind of mass data disclosed in the present application The management method of member's feature segmentation, analytical performance is higher, it is to avoid manager uprushes because of crew numbers And cause the performance cognition of the real work to staff to occur that the situation of deviation occurs.

Accompanying drawing explanation

In order to be illustrated more clearly that the embodiment of the present invention or technical scheme of the prior art, below will be to reality Execute the required accompanying drawing used in example or description of the prior art to be briefly described, it should be apparent that below, Accompanying drawing in description is only embodiments of the invention, for those of ordinary skill in the art, not On the premise of paying creative work, it is also possible to obtain other accompanying drawing according to the accompanying drawing provided.

Staff's feature segmentation in call center's under a kind of mass data that Fig. 1 provides for the embodiment of the present invention The flow chart of management method；

The stream of the Missing Data Filling of a kind of examination KPI data pretreatment that Fig. 2 provides for the embodiment of the present invention Cheng Tu；

The replacement of the exceptional value of a kind of examination KPI data pretreatment that Fig. 3 provides for the embodiment of the present invention Flow chart；

The stream of the data normalization of a kind of examination KPI data pretreatment that Fig. 4 provides for the embodiment of the present invention Cheng Tu；

The Parallel K-Means Clustering Algorithm in Web that utilizes that Fig. 5 provides for the embodiment of the present invention is operated personnel characteristics Segmentation, obtains the flow chart of cluster result；

Fig. 6 determines the optimal stream clustering number for what the embodiment of the present invention provided according to Cluster Validity Function Cheng Tu；

Detailed description of the invention

Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is carried out Clearly and completely describe, it is clear that described embodiment is only a part of embodiment of the present invention, and It is not all, of embodiment.Based on the embodiment in the present invention, those of ordinary skill in the art are not doing Go out the every other embodiment obtained under creative work premise, broadly fall into the scope of protection of the invention.

Refer to Fig. 1, it illustrates the flow process of a kind of staff's management method that the embodiment of the present invention provides Figure, may include that

S11: obtain the KPI index number that the staff of the some teams and groups of M row S row examined within some years According to；Including percent of call completed, incoming call project occupation rate, exhalation project work efficiency, average handling time, put down All queuing time, rate of solving the problem once and for all, average cost per contact, rate of attendance business evaluating achievements and full Meaning rate；

Table 1KPI index and implication thereof

Choose the some employees in call center job performance data in specific duration, carry out feature clustering. Concrete sample data example such as following table:

The sample data of table 2KPI index

Predetermined amount can be determined according to actual needs, is not specifically limited at this.

S12: carry out the pretreatment of above-mentioned data, including missing values supplement, the replacement of exceptional value, data Normalized obtains clustering target data；

Wherein, clustering target is the index in terms of the work performance of staff, and its concrete meaning is according to work The position making personnel is different and otherwise varied.It addition, the quantity of clustering target can be 1, it is also possible to For multiple, it is determined according to actual needs, is not specifically limited at this.

S13: utilize distributed K-Means algorithm, carries out client characteristics and hives off, and obtains the segmentation knot of client Really；

Wherein, the object in set is divided into the data analysing method of several classes by cluster exactly, and it makes Between the object of each apoplexy due to endogenous wind the most similar, the object of different apoplexy due to endogenous wind is the most different, the application In object be staff.

The application by parallel K-means clustering algorithm according to the clustering target data pair of each staff It is classified, and staff is classified by the cluster result utilizing clustering algorithm to obtain, it is achieved each Staff's maximizing in class similar.Therefore, a kind of staff manager disclosed in the present application Method is not it can be avoided that staff's cognition of causing because of the subjective impression of upper-level leader meets practical situation and sends out Raw.

It addition, Parallel K-Means Clustering Algorithm in Web is applied to the classification for staff by the present invention, have It is beneficial to realize the lean management of staff, it is simple to the leader of call center understands the operation at center in time The problem existed during efficiency and service, and follow the tracks of action plan and performance result by correction strategy, Thus ensure the realization of development strategy.

Refer to Fig. 2, a kind of examination KPI data pretreatment of embodiment of the present invention offer is provided The flow chart of Missing Data Filling, may include that

S21: acquiring unit.Obtaining total observation data S row, the staff of corresponding some teams and groups is some The KPI index of examination in year, determines that these indexs are for cluster input attribute field；

S22: detector unit.Data, its attribute field X is observed if there is L row_i(i≤M) has disappearance Value or all inputs all have the record of missing values or observation field does not have the number corresponding with ATTRIBUTE INDEX According to, then detect and determine that these L row data need to carry out missing values process；

S23: computing unit.Determine that residue S-L row data, for reference to data, add up these observation data M Each attribute X in individual attribute field_iThe value that (i≤M) occurrence number is most, i.e. mode；

S24: fill unit.Missing data in above-mentioned L row M attribute field of observation is filled with, It is replaced with mode, completes corresponding attribute X_iThe missing values of (i≤M) processes.

Refer to Fig. 3, a kind of examination KPI data pretreatment of embodiment of the present invention offer is provided The flow chart of the replacement of exceptional value, may include that

S31: confirmation unit.Raw sample data S row, each attribute X_iThe average of (i≤M) isMark Quasi-difference is δ_iIf existingThen confirm that this observation is an abnormity point.

S32: replacement unit.For abnormal data, ifComplete abnormal number According to replacement.

Refer to Fig. 4, a kind of examination KPI data pretreatment of embodiment of the present invention offer is provided The flow chart of data normalization；

S41: computing unit.Calculate original sample and concentrate maximum and the minima of each attribute field X_max,X_min；

S42: normalization unit.By min-max method for normalizing, utilize following functional transformation formula:

Y_{i} = \frac{X_{i} - X_{m i n}}{X_{\max} - X_{m i n}}

Wherein X_i,Y_iIt is respectively the value before and after input samples normalization.Numerical value by each clustering target data X is mapped to the interval [0,1] of correspondence.

By ATTRIBUTE INDEX is carried out standardization processing, ATTRIBUTE INDEX data bi-directional scaling can be made Fall into a little specific interval.To realize the unitized of different clustering target, obtain that there is unified degree The clustering target data of amount unit.

Referring to Fig. 5, it illustrates embodiment of the present invention offer utilizes Parallel K-Means Clustering Algorithm in Web It is operated personnel characteristics's segmentation, obtains the flow chart of cluster result；

S51: initialization unit.First determine cluster number, determine initial all kinds of centers at random, simultaneously Raw data set is divided into several data blocks.

S52:MAP unit.Each data subset correspondence is distributed to a map function, for often Individual data block, calculates each data item generic.

S53:REDUCE unit.Each piece of cluster result of merger obtains complete cluster result, recalculates Class center is as the input of next iteration, until algorithmic statement.

S54: results verification unit.According to described cluster result, described staff is divided into different classes, Obtain feature segmentation result.

Illustrate above-mentioned steps.As operator can be divided into following 3 classes:

Cluster 1 (maker post type): the indexs such as its KPI of the operator related to do well, percent of call completed, incoming call item Mesh occupation rate, exhalation project work efficiency, rate of solving the problem once and for all, the rate of attendance, business evaluating achievement And satisfaction rate is the highest, but average handling time, average queuing time and average cost per contact are the most relatively low, say Bright this kind of people's work quality is the highest, and explanation work efficiency is high simultaneously.

Cluster 2 (treating concern type): the indexs such as its KPI of the operator related to do well, percent of call completed, incoming call Project occupation rate, exhalation project work efficiency, rate of solving the problem once and for all, the rate of attendance, business evaluating become Achievement and satisfaction rate are minimum, and inefficiency is described.Average handling time, average queuing time and the most single Exhale cost the highest, illustrate that work quality is minimum.

Cluster 3 (middle types): the percent of call completed of this kind of operator, incoming call project occupation rate, exhalation project work Make efficiency, rate of solving the problem once and for all, the rate of attendance, business evaluating achievement and satisfaction rate and be in by-level, Average handling time, average queuing time and average cost per contact are in by-level.

S55: result applying unit.The feature of the staff according to each class determines such staff Way to manage.

In a kind of staff's management method that above-described embodiment provides, according to cluster result by staff It is divided into different classes, after obtaining classification results, determines the staff's of each class according to classification results Feature carries out fine-grained management, promotes the efficiency of operation of call center；

Citing carries out above-mentioned steps explanation.

One is job content and the work that the feature of the staff according to each class determines such staff Make the time.For " maker post type ", " middle type " and the operator of " treating concern type ", take correspondence Excitation penal system, is operated the arrangement of content and working time, wherein, when job content and work Between specifically can include arranging an order according to class and grade the diversity arrangement etc. of duration and services client, such as " problem type " personnel's night shift Time of arranging an order according to class and grade is longer, the group customer of " maker post type " personnel service's Electric Power Customer Service Center and VIP client. Thus, according to the feature of different operating personnel, Reasonable adjustment order of classes or grades at school, and corresponding different demand (as Customer demand etc.), distribute the staff of different features, be conducive to the management for staff, with And improve the work efficiency of staff.

Two is the work defect that the feature of the staff according to each class determines such staff, and For the staff of each class, the staff under class of especially " treating concern type ", according to its work Such staff is giveed training by defect.The work of the staff of each class is determined based on grouping result Making defect, beneficially manager arranges by a definite date difference according to the work defect difference of the staff of each class Training, the beneficially raising of staff's level.On this basis, long-acting training mechanism, shape are set up Become the work climate of autonomic learning.

Refer to Fig. 6, determining most preferably according to Cluster Validity Function of embodiment of the present invention offer is provided The flow chart of cluster number；

S61: initial range is chosen.Preset certain cluster number scope [k₁,k₂], and cluster number be this Integer in interval；

S62: inter-object distance calculates.To [k₁,k₂Any k in], calculates under current cluster result same Distance in class isWherein x_iFor clustering the data point in j,For cluster j's Class central point；

S63: between class distance calculates.To [k₁,k₂Any k in], calculates under current cluster result different Distance between class isWhereinFor clustering the data point in i,For cluster The class central point of j；

S64: optimal cluster number determines.Set up Cluster Validity FunctionCarry out cluster knot Fruit is evaluated, and when L (k) is minimum, correspondence obtains optimal cluster number k, thus avoids clusters number in choosing Subjectivity present on taking.

Described above to the disclosed embodiments, makes those skilled in the art be capable of or uses this Bright.Multiple amendment to these embodiments will be apparent from, herein for a person skilled in the art Defined in General Principle can be real at other without departing from the spirit or scope of the present invention Execute in example and realize.Therefore, the present invention is not intended to be limited to the embodiments shown herein, and is intended to Meet the widest scope consistent with principles disclosed herein and features of novelty.

Claims

1. the method for the feature segmentation of call center contact staff under a mass data, it is characterised in that bag Include:

For the pretreatment of described KPI achievement data, described pretreatment mode includes the supplementary, different of missing values Replacement and last the process by data normalization of constant value obtain clustering target data；

Method the most according to claim 1, it is characterised in that described acquisition each described some years Interior staff examines the data that KPI index is corresponding, including: percent of call completed, incoming call project occupation rate, exhalation Project work efficiency, average handling time, average queuing time, rate of solving the problem once and for all, average individual calling Cost, the rate of attendance, business evaluating achievement and satisfaction rate；

Wherein, described percent of call completed refers to the connection amount of IVR level at end service unit and the connection amount sum of manual position Ratio with the calling total amount entering call center；In described incoming call project occupation rate refers to certain section of timing statistics, Seat person processes the ratio of total duration and the actual log system duration taken on the telephone more；Described exhalation project work In efficiency refers to certain section of timing statistics, total handling duration and the ratio of login system duration.During described average treatment Between: once get in touch with required average time；Described average queuing time refers in certain section of timing statistics, calling Person waits the average waiting duration of the wait of manual position answer after shortlisting；Described solve the problem once and for all Rate is in certain section of timing statistics, it is not necessary to client dials in call center also without seat person again by phone Clawback or switching account for seat person with regard to soluble phone amount and pick up the percentage ratio of phone total amount；The described rate of attendance Refer in certain statistical time range, the percentage rate of the number that the actual number turned out for work of certain teams and groups is turned out for work with plan； In described average cost per contact refers to certain section of timing statistics, the full payment of call center is divided by phone treating capacity； Described business evaluating achievement refers to call center's seat Grasping level to professional knowledge；Described satisfaction refers to client The service satisfactory degree that call center is provided.

3. want the method described in 1 according to right, it is characterised in that missing values is supplemented, including:

Data, its attribute field X is observed if there is L row_i(i≤M) has missing values or all inputs all have The record or the observation field that have missing values do not have the data corresponding with ATTRIBUTE INDEX, it is determined that this L line number Missing values process is carried out according to needs；

Determine that residue S-L row data, for reference to data, add up each in these observation data M attribute fields Attribute X_iThe value that (i≤M) occurrence number is most, i.e. mode；

Method the most according to claim 1, it is characterised in that for the replacement of abnormal data, including:

Raw sample data S row, each attribute X_iThe average of (i≤M) isStandard deviation is δ_iIf existingThen this observation is an abnormity point；

For abnormal data, ifComplete the replacement of abnormal data.

5. according to the arbitrary described method of Claims 1-4, it is characterised in that for above-mentioned KPI index Data are all normalized, including:

ATTRIBUTE INDEX is carried out standardization processing, can be allowed to fall into by ATTRIBUTE INDEX data bi-directional scaling One little specific interval, by min-max method for normalizing, utilizes following functional transformation formula:

Y_{i} = \frac{X_{i} - X_{m i n}}{X_{\max} - X_{m i n}}

X_max,X_minMaximum and minima, the X of attribute field is concentrated for original sample_i,Y_iRespectively input sample Value before and after this normalization.The numerical value x of each clustering target data is mapped to the interval [0,1] of correspondence, with Realize the unitized of different clustering target, obtain the clustering target data with unified metric unit.

Method the most according to claim 5, it is characterised in that described according to each described staff Clustering target data separate parallel K-Means algorithm cluster, obtain cluster result, including:

Initial phase: first determine cluster number, determines initial all kinds of centers, at random simultaneously by original number It is divided into several data blocks according to collection；

The MAP stage: each data block correspondence is distributed to a map function, for this data block, meter Calculating wherein every data generic, above-mentioned data base comprises many data；

The REDUCE stage: this stage obtains complete cluster result, specifically by each piece of cluster result of merger Calculate and belong to the average of same category data as Xin Lei center, and defeated as next iteration Enter, until algorithmic statement；

The results verification stage: according to described cluster result, described staff is divided into different classes, obtains feature Segmentation result；

The result application stage: determine the management of such staff according to the feature of the staff of each class Mode.

Method the most according to claim 6, it is characterised in that in the described results verification stage, need basis The cluster result of described staff carries out clustering the determination of optimum number, including:

Preset certain cluster number scope [k₁,k₂], and cluster number is the integer in this interval；

To [k₁,k₂Any k in], calculates the distance in same class under current cluster result；

To [k₁,k₂Any k in], calculates the distance between classes different under current cluster result；

Determine Cluster Validity Function, carry out cluster result evaluation, obtain optimal cluster number, thus avoid Clusters number subjectivity present on choose.

Method the most according to claim 7, it is characterised in that in the described results verification stage, need basis The cluster result of described staff carries out clustering the determination of optimum number, including:

Initial range is chosen: preset certain cluster number scope [k₁,k₂], and cluster is in number is this interval Integer；

Inter-object distance calculates: to [k₁,k₂Any k in], calculate under current cluster result in same class away from From forWherein x_iFor clustering the data point in j,For clustering the class central point of j；

Between class distance calculates: to [k₁,k₂Any k in], calculates between classes different under current cluster result Distance isWhereinFor clustering the data point in i,For clustering the apoplexy due to endogenous wind of j Heart point；

Optimal cluster number determines: set up Cluster Validity FunctionCarry out cluster result evaluation, When L (k) is minimum, correspondence obtains optimal cluster number k, thus avoids clusters number present on choose Subjectivity.