CN109754281A - A kind of supplier's attrition prediction method - Google Patents

A kind of supplier's attrition prediction method Download PDF

Info

Publication number
CN109754281A
CN109754281A CN201811397492.7A CN201811397492A CN109754281A CN 109754281 A CN109754281 A CN 109754281A CN 201811397492 A CN201811397492 A CN 201811397492A CN 109754281 A CN109754281 A CN 109754281A
Authority
CN
China
Prior art keywords
class
cluster
supplier
data
neural network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201811397492.7A
Other languages
Chinese (zh)
Other versions
CN109754281B (en
Inventor
须峰
张福斌
宋安平
施海鹰
李传中
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Construction Network Technology (shanghai) Co Ltd
Original Assignee
Construction Network Technology (shanghai) Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Construction Network Technology (shanghai) Co Ltd filed Critical Construction Network Technology (shanghai) Co Ltd
Priority to CN201811397492.7A priority Critical patent/CN109754281B/en
Publication of CN109754281A publication Critical patent/CN109754281A/en
Application granted granted Critical
Publication of CN109754281B publication Critical patent/CN109754281B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Landscapes

  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The present invention relates to a kind of supplier's attrition prediction methods, first according to the demand of practical problem, are integrated to the data of platform itself, determine the feature for being lost supplier;Secondly unbalanced dataset is sampled using MBCDK-means lack sampling method, unbalanced data is converted into equilibrium data collection;Then Genetic Artificial Neural Network method is utilized, equilibrium data collection is predicted;Finally, output prediction result.The present invention can carry out Accurate Prediction to the loss of supplier.

Description

A kind of supplier's attrition prediction method
Technical field
The present invention relates to supplier's attrition prediction technical fields, more particularly to a kind of supplier's attrition prediction method.
Background technique
Customer churn prediction is a major issue in customer relation management.In increasingly competitive business environment now In, client can easily switch between rival.It is some studies have shown that obtain the cost of new client usually more existing than retaining There are expensive 5 to 6 times of the cost of client.Meanwhile long-term customers are lower to the susceptibility of competitive marketing activity, profit is higher.In addition, The loss of client not only results in revenue losses, also results in brand loyalty decline and influences the morale of company.Therefore, public Emphasis is transferred to reservation existing customer group from new client is developed by department.Accurate customer churn prediction, which will facilitate company, to be closed Suitable client navigates in reserved-range, therefore is acknowledged as the marketing matter of priority.
Customer churn means that client's subsidiary company cancels service, and the existing customer for retaining enterprise plays an important role, to increase Add the overall income of company, and the company status being retained in market with keen competition.The reason of leading to customer churn, has very much, So determining these the reason is that considerably complicated, because they depend on personal view and the company that is utilizing of client of client Service, but determine customer churn be necessary again.
Summary of the invention
Technical problem to be solved by the invention is to provide a kind of supplier's attrition prediction methods, can be to the stream of supplier It loses and carries out Accurate Prediction.
The technical solution adopted by the present invention to solve the technical problems is: providing a kind of supplier's attrition prediction method, wraps Include following steps:
(1) supplier of acquisition reflection platform is lost the related data of feature;
(2) collected unbalanced dataset is divided into uneven training dataset and uneven test data set;
(3) balance training data set is converted for uneven training dataset using MBCDK-means lack sampling method;
(4) prediction model is established using Genetic Artificial Neural Network method;
(5) uneven test data set collection is predicted using the prediction model, exports prediction result.
It includes: that qualification certificates, company's type, registered capital, registration information are complete that supplier, which is lost feature, in the step (1) It is whole degree, concern bidding documents number, nearest attitude, company's qualification, service quality, product quality, delivery rate, credibility, in Mark number, bid number, agreed-upon price number, login times, reasonable price degree and contract agreement fulfillment rate.
Further include that pretreated step is carried out to the data of acquisition between the step (1) and step (2), specifically include: Data integration is carried out to data;Data are cleaned, including removing noise and deleting inconsistent data;Data are become It changes, including construction new feature and data normalization.
The step (3) specifically includes following sub-step:
(31) uneven training dataset is divided into M most class samples and N number of minority class sample;
(32) initialization cluster number K;
(33) K class is polymerized to using K-means algorithm to M most class samples, a kind of composition is polymerized to N number of minority class sample Minority class subset;
(34) cluster centre of classes most for i-th, calculates the distance of its cluster centre for arriving minority classWherein, XiIndicate the cluster center of i-th of cluster, XNIndicate the cluster center of minority class;
(35) average distance of the most class cluster centers of calculating to minority class cluster center
(36) sample is selected to constitute most class subsets from each cluster of most classes, wherein sample size ismiIndicate the sample number in the cluster of i-th of most class cluster;
(37) most class subsets and minority class subset are constituted into balance training data set.
The step (4) specifically includes following sub-step:
(41) n individual of the first generation is randomly generated;
(42) n neural network is initialized;
(43) training neural network;
(44) whether the neural network after training of judgement reaches setting target, goes to step (45) if not reaching, otherwise turns Step (46);
(45) genetic operation is carried out, duplication, intersection and variation including chromosome obtain n individual of new generation, and return Step (42);
(46) optimal neural network is selected;
(47) according to optimal neural network Genetic Neural Network Predictive Model.
Beneficial effect
Due to the adoption of the above technical solution, compared with prior art, the present invention having the following advantages that and actively imitating Fruit: the present invention is selected with most classes to the distance at minority class cluster center when sampling according to the sample distribution quantity in cluster Number of samples is taken, retains the distributed intelligence of initial data cluster and improves boundary sample sample rate simultaneously, help to improve final classification Performance;The present invention uses MBCDK-means lack sampling method, reduces time and the space of existing K-means lack sampling algorithm Complexity;Present invention introduces weights and biasing that genetic algorithm carrys out optimized artificial neural network, construct genetic neural network mould Type has better estimated performance compared to artificial neural network.
Detailed description of the invention
Fig. 1 is flow chart of the invention;
Fig. 2 is to convert balance instruction for uneven training dataset using MBCDK-means lack sampling method in the present invention Practice the flow chart of data set;
Fig. 3 is the flow chart for establishing prediction model in the present invention using Genetic Artificial Neural Network method.
Specific embodiment
Present invention will be further explained below with reference to specific examples.It should be understood that these embodiments are merely to illustrate the present invention Rather than it limits the scope of the invention.In addition, it should also be understood that, after reading the content taught by the present invention, those skilled in the art Member can make various changes or modifications the present invention, and such equivalent forms equally fall within the application the appended claims and limited Range.
Embodiments of the present invention are related to a kind of supplier's attrition prediction method, first according to the demand of practical problem, knot The data of platform itself are closed, determine the feature for being lost supplier;Secondly using MBCDK-means lack sampling method to imbalance Data set is sampled, and unbalanced data is converted to equilibrium data collection;Then Genetic Artificial Neural Network method is utilized, to flat Weighing apparatus data set is predicted;Finally, output prediction result.As shown in Figure 1, the specific steps of which are as follows:
Step A, according to the demand of practical problem, the data of platform itself are integrated to, determine the loss feature of supplier;This Embodiment is for building supplier, wherein the loss feature of determining supplier includes: qualification certificates, company's type, note Volume fund, registration information integrity degree, pay close attention to bidding documents number and nearest attitude (such as nearest two months, four months, Half a year), company's qualification, service quality, product quality, delivery rate, credibility, acceptance of the bid number, bid number, agreed-upon price number, Login times, reasonable price degree, contract agreement fulfillment rate.
Step B, reflect that supplier is lost the related data of feature on acquisition platform;
Step C, data prediction is carried out to the data of acquisition, specifically included;
C1, data integration is carried out to data;
C2, data are cleaned, including removing noise and deleting inconsistent data;
C3, data are converted, including construction new feature and data normalization.
Step D, unbalanced dataset is divided into uneven training dataset and uneven test data set;
Step E, balance training data are converted for uneven training dataset using MBCDK-means lack sampling method Collection, as shown in Fig. 2, specifically including:
E1, training set is divided into M most class samples and N number of minority class sample;
E2, initialization cluster number K;
E3, K class is polymerized to using K-means algorithm to most class samples;
E4, a kind of composition minority class subset is polymerized to minority class sample;
The cluster centre of E5, class most for i-th, calculate the distance of its cluster centre for arriving minority classWherein XiIndicate the cluster center of i-th of cluster, XNIndicate the cluster center of minority class;
E6, calculate most class cluster centers to minority class cluster center average distance
E7, for i-th of most class, the sample number selected in suchmiIndicate i-th of most class Sample number in the cluster of cluster;
E8, the most class subsets of sampling composition are carried out from each most classes according to sample number;
E9, most class subsets and minority class subset are constituted into balance training collection.
Step F, using Genetic Artificial Neural Network method, prediction model is established;
F1, n individual of the first generation is randomly generated;
N F2, initialization neural network;
F3, training neural network;
F4, judge whether to reach setting target, go to step F5 if not reaching, otherwise go to step F8;
F5, chromosome replication;
F6, chiasma;
F7, chromosomal variation, go to step F2;
F8, selection optimal neural network;
F9, Genetic Neural Network Predictive Model is obtained.
Model parameter in present embodiment is provided that
The number of iterations 1000
Learning rate 0.05
Target error 0.0001
Population Size N 40
Evolutionary generation T 100
Crossover probability Pc 0.8
Mutation probability Pm 0.02
Step G, test set is predicted, exports prediction result.
The present invention is when sampling, according to the distance of sample distribution quantity and most classes to minority class cluster center in cluster It chooses number of samples, retains the distributed intelligence of initial data cluster and improve boundary sample sample rate simultaneously, help to improve final Classification performance;The present invention use MBCDK-means lack sampling method, reduce existing K-means lack sampling algorithm time and Space complexity;Present invention introduces weights and biasing that genetic algorithm carrys out optimized artificial neural network, construct genetic neural network Network model has better estimated performance compared to artificial neural network.

Claims (5)

1. a kind of supplier's attrition prediction method, which comprises the following steps:
(1) supplier of acquisition reflection platform is lost the related data of feature;
(2) collected unbalanced dataset is divided into uneven training dataset and uneven test data set;
(3) balance training data set is converted for uneven training dataset using MBCDK-means lack sampling method;
(4) prediction model is established using Genetic Artificial Neural Network method;
(5) uneven test data set collection is predicted using the prediction model, exports prediction result.
2. supplier's attrition prediction method according to claim 1, which is characterized in that supply commodity-circulate in the step (1) Losing feature includes: qualification certificates, company's type, registered capital, registration information integrity degree, concern bidding documents number, nearest service state Degree, acceptance of the bid number, bid number, agreed-upon price number, logs in company's qualification, service quality, product quality, delivery rate, credibility Number, reasonable price degree and contract agreement fulfillment rate.
3. supplier's attrition prediction method according to claim 1, which is characterized in that the step (1) and step (2) it Between further include that pretreated step is carried out to the data of acquisition, specifically include: data integration carried out to data;Data are carried out clear It washes, including removing noise and deleting inconsistent data;Data are converted, including construction new feature and data normalizing Change.
4. supplier's attrition prediction method according to claim 1, which is characterized in that the step (3) specifically include with Lower sub-step:
(31) uneven training dataset is divided into M most class samples and N number of minority class sample;
(32) initialization cluster number K;
(33) K class is polymerized to using K-means algorithm to M most class samples, it is a small number of to be polymerized to a kind of composition to N number of minority class sample Class subset;
(34) cluster centre of classes most for i-th, calculates the distance of its cluster centre for arriving minority classWherein, XiIndicate the cluster center of i-th of cluster, XNIndicate the cluster center of minority class;
(35) average distance of the most class cluster centers of calculating to minority class cluster center
(36) sample is selected to constitute most class subsets from each cluster of most classes, wherein sample size ismiIndicate the sample number in the cluster of i-th of most class cluster;
(37) most class subsets and minority class subset are constituted into balance training data set.
5. supplier's attrition prediction method according to claim 1, which is characterized in that the step (4) specifically include with Lower sub-step:
(41) n individual of the first generation is randomly generated;
(42) n neural network is initialized;
(43) training neural network;
(44) whether the neural network after training of judgement reaches setting target, goes to step (45) if not reaching, otherwise goes to step (46);
(45) genetic operation is carried out, duplication, intersection and variation including chromosome obtain n individual of new generation, and return step (42);
(46) optimal neural network is selected;
(47) according to optimal neural network Genetic Neural Network Predictive Model.
CN201811397492.7A 2018-11-22 2018-11-22 Supplier loss prediction method Active CN109754281B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201811397492.7A CN109754281B (en) 2018-11-22 2018-11-22 Supplier loss prediction method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201811397492.7A CN109754281B (en) 2018-11-22 2018-11-22 Supplier loss prediction method

Publications (2)

Publication Number Publication Date
CN109754281A true CN109754281A (en) 2019-05-14
CN109754281B CN109754281B (en) 2021-11-19

Family

ID=66402533

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201811397492.7A Active CN109754281B (en) 2018-11-22 2018-11-22 Supplier loss prediction method

Country Status (1)

Country Link
CN (1) CN109754281B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112862546A (en) * 2021-04-25 2021-05-28 平安科技(深圳)有限公司 User loss prediction method and device, computer equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107944460A (en) * 2016-10-12 2018-04-20 甘肃农业大学 One kind is applied to class imbalance sorting technique in bioinformatics

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107944460A (en) * 2016-10-12 2018-04-20 甘肃农业大学 One kind is applied to class imbalance sorting technique in bioinformatics

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
刘晨晨: "基于数据挖掘的通信客户流失预警模型研究", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112862546A (en) * 2021-04-25 2021-05-28 平安科技(深圳)有限公司 User loss prediction method and device, computer equipment and storage medium

Also Published As

Publication number Publication date
CN109754281B (en) 2021-11-19

Similar Documents

Publication Publication Date Title
CN109034915B (en) Artificial intelligent electronic commerce system capable of using digital assets or points as transaction media
Corazza et al. Particle Swarm Optimization with non-smooth penalty reformulation, for a complex portfolio selection problem
Lin et al. Tourism demand forecasting: Econometric model based on multivariate adaptive regression splines, artificial neural network and support vector regression
CN112926651A (en) Enterprise credit assessment method and system
CN107609771A (en) A kind of supplier's value assessment method
CN110826886A (en) Electric power customer portrait construction method based on clustering algorithm and principal component analysis
CN110298574A (en) A kind of electricity consumption subscriber payment risk rating method based on convolutional neural networks
CN108230029A (en) Client trading behavior analysis method
CN110866782A (en) Customer classification method and system and electronic equipment
CN107481135A (en) A kind of personal credit evaluation method and system based on BP neural network
CN114943565A (en) Electric power spot price prediction method and device based on intelligent algorithm
Li et al. Predicting business risks of commercial banks based on BP-GA optimized model
CN115423538A (en) Method and device for predicting new product sales data, storage medium and electronic equipment
CN109754281A (en) A kind of supplier's attrition prediction method
CN112163781A (en) Park electricity utilization group life cycle evaluation method based on multi-dimensional index clustering
CN109190820B (en) Electric power market electricity selling quantity depth prediction method considering user loss rate
CN116611911A (en) Credit risk prediction method and device based on support vector machine
CN110807543A (en) Investment portfolio optimization method and device based on group decision intelligent search
Cao Research on the impact of artificial intelligence-based e-commerce personalization on traditional accounting methods
Sharma et al. Prediction of Real-Time Estate Pricing using Train-Test Splitting Techniques
Gonçalves et al. Credit risk analysis applying logistic regression, neural networks and genetic algorithms models
CN111667307A (en) Method and device for predicting financial product sales volume
Wei et al. A new dynamic credit scoring model based on clustering ensemble
Ohsato et al. Construction of an Input-Output Table Considering Business-to-Consumer Transactions by using Private Data
Wang Credit Strategy Design of Small and Medium-Sized Enterprises

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant