CN107169593A - A kind of universities and colleges based on neutral net file line Forecasting Methodology - Google Patents

A kind of universities and colleges based on neutral net file line Forecasting Methodology Download PDF

Info

Publication number
CN107169593A
CN107169593A CN201710273724.7A CN201710273724A CN107169593A CN 107169593 A CN107169593 A CN 107169593A CN 201710273724 A CN201710273724 A CN 201710273724A CN 107169593 A CN107169593 A CN 107169593A
Authority
CN
China
Prior art keywords
universities
colleges
rule
line
year
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710273724.7A
Other languages
Chinese (zh)
Inventor
周从华
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiangsu Tree Mdt Infotech Ltd
Original Assignee
Jiangsu Tree Mdt Infotech Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Jiangsu Tree Mdt Infotech Ltd filed Critical Jiangsu Tree Mdt Infotech Ltd
Priority to CN201710273724.7A priority Critical patent/CN107169593A/en
Publication of CN107169593A publication Critical patent/CN107169593A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • G06Q50/2053Education institution selection, admissions, or financial aid

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Strategic Management (AREA)
  • Human Resources & Organizations (AREA)
  • Educational Administration (AREA)
  • Educational Technology (AREA)
  • Tourism & Hospitality (AREA)
  • Economics (AREA)
  • Marketing (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Physics & Mathematics (AREA)
  • Primary Health Care (AREA)
  • Health & Medical Sciences (AREA)
  • General Health & Medical Sciences (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

Line Forecasting Methodology is filed the invention discloses a kind of universities and colleges based on neutral net, belongs to data analysis technique field, specifically includes following steps:Missing values and outlier processing are carried out to the data collected;Treated data are normalized;Data set is classified;Prediction is modeled respectively to the data set classified.This method can be filed to college entrance examination universities and colleges over the years point to be analyzed, so as to improve the accuracy rate that next year universities and colleges file prediction, be can be applied to universities and colleges and is filed line prediction.

Description

A kind of universities and colleges based on neutral net file line Forecasting Methodology
Technical field
The invention belongs to data mining and big data analysis technical field, it is related to the Predicting Technique that line is filed towards universities and colleges.
Background technology
The line of filing of each enrollment universities and colleges is the minimum control line (also known as saving control line) delimited in the provincial enrollment office of same batch On the basis of, according to the school then this province enrollment plan, file ratio, examinee enter oneself for the examination the universities and colleges aspiration distribution situation and What the achievement distribution situation of these examinees was determined jointly.It is provincial in the College Enrollment of each batch according to Ministry of Education's regulation The ratio that examinee's archives are delivered by universities and colleges of enrollment office should be controlled 1:1~1:1.2 between.
And have every year many examinees be faced with the result finally enrolled higher position and it is low with regard to the problem of.So-called higher position is just It is during first batch is enrolled, to be moved back shelves after filing and second lot can only be carried out due to the fraction of oneself not enough Solicit parallel wish.It is so-called it is low be just exactly the achievement of oneself than the school entered oneself for the examination to file line high.
With the development of internet and big data technology, exploitation universities and colleges file line Predicting Technique, can effectively reduce student's chance To higher position and low probability just, therefore, it is necessary to carry out brand-new research to this technology.
The content of the invention
Line Forecasting Methodology is filed it is an object of the invention to provide a kind of universities and colleges based on neutral net, to improve next year Universities and colleges file the predictablity rate of line.
In order to solve the above technical problems, the technical solution adopted by the present invention is as follows.
A kind of universities and colleges based on neutral net file line Forecasting Methodology, it is characterised in that comprise the following steps:
Step one, missing values are carried out to the data collected and outlier is handled;
Treated data are normalized by step 2;
Step 3, classifies to data set;
Step 4, sets up neural network model to the data set classified and is predicted respectively.
The missing values and outlier processing include following four rule:
Rule 1.1, the poor missing values of the line of universities and colleges two is arranged to the poor average of two lines in remaining each year;
Rule 1.2, the poor outlier of two lines of universities and colleges is arranged to the average of remaining each line of year two difference;
Universities and colleges are filed line missing values and are arranged to year two line difference and save control line sum with this year by rule 1.3;
The normalization includes following two rules:
Rule 2.1, uses range method, the minimum value and maximum taken is that each line of year two is poor to the difference normalization of each line of year two Minimum value and maximum;
Rule 2.2, range method is used to the normalization of each year precedence, and the minimum value and maximum taken is that each year precedence is minimum Value and maximum.
It is described that data set is carried out to classify comprising three below rule:
Rule 3.1, it is training set 1 to filter out the universities and colleges that nearly 4 years two line differences are 0;
Rule 3.2, filters out nearly 4 years two lines difference and is respectively less than 5 points, and two lines difference is not 0 universities and colleges, is training set 2;
Rule 3.3, calculates 4 years precedence deviations of remaining universities and colleges, and deviation is training set 3 in 0~0.45 universities and colleges, and deviation exists 0.45~1 universities and colleges are training set 4, and the universities and colleges that deviation is more than 1 are training set 5, and the deviation is that annual precedence subtracts 4 years positions Absolute value of the absolute value sum of secondary average than upper precedence average.
The neural network model of setting up includes following rule:
Rule 4.1, the two lines difference that the universities and colleges in training set 1 are given tacit consent to these universities and colleges' next years is 0 point;
Rule 4.2, to two lines poor historical data of the universities and colleges in training set 2 using the first four years, is carried out using neutral net Modeling, wherein the data of the 4th year are input as remaining is exported.
Rule 4.3, the precedence historical data that the first four years are utilized respectively to training set 35 is modeled using neutral net, its In the 4th year data as export remaining be input.
The neural network model is double-deck hidden layer, and the neuron of first layer hidden layer is not less than 20 neurons, the Two layers of hidden layer are not less than 15 neurons.
The present invention has beneficial effect.Data analysis is to ensure that universities and colleges file line and predict accurate key technology.The present invention A kind of data prediction technology proposed, is characterized in that the feature for taking into full account data is classified to data, by collecting Data carry out the processing of missing values and outlier, treated data are normalized, data set is classified, it is right The data set classified is modeled the technical methods such as prediction respectively, significantly improves the accuracy that universities and colleges file line prediction.
Embodiment
Technical scheme is described in further details with reference to embodiment.
With the data instance that 12 universities and colleges recruited student in Jiangsu Province are over the years, next year is carried out to this 12 universities and colleges and files line Prediction, table 1 is the original table of this 12 universities and colleges' data over the years.
1 12 universities and colleges of table data original table over the years
By the information of table 1, after step one under this invention is handled, table 2 is obtained.
Table after the missing values of table 2. and outlier processing
By the information of table 2, after step 2 under this invention is handled, table 3 is obtained.
Table after the data normalization of table 3
By the information of table 3, after step 3 under this invention is handled, table 4~8 is obtained.
The training set 1 of table 4
The training set 2 of table 5.
The training set 3 of table 6.
The training set 4 of table 7.
The training set 5 of table 8.
The two lines difference of universities and colleges' next year in training set 1 is predicted as 0.
By the universities and colleges in training set 2 using 2012-2015 the poor historical data of two lines (2012-2014 is to input, 2015 be output) it is modeled using neutral net, by the use of 2013-2015 data as input, utilize the mould established Type predicts that two lines of 2016 are poor.Wherein neural network model is double-deck hidden layer, and first layer hidden layer has 20 neurons, Second hidden layer has 15 neurons.
By the universities and colleges in training set 3 using 2012-2015 precedence historical data (2012-2014 be input, 2015 For output) it is modeled using neutral net, it is pre- using the model established by the use of 2013-2015 data as input The precedence of survey 2016.Wherein neural network model is double-deck hidden layer, and first layer hidden layer has 20 neurons, second Hidden layer has 15 neurons.
By the universities and colleges in training set 4 using 2012-2015 precedence historical data (2012-2014 be input, 2015 For output) it is modeled using neutral net, it is pre- using the model established by the use of 2013-2015 data as input The precedence of survey 2016.Wherein neural network model is double-deck hidden layer, and first layer hidden layer has 20 neurons, second Hidden layer has 15 neurons.
By the universities and colleges in training set 5 using 2012-2015 precedence historical data (2012-2014 be input, 2015 For output) it is modeled using neutral net, it is pre- using the model established by the use of 2013-2015 data as input The precedence of survey 2016.Wherein neural network model is double-deck hidden layer, and first layer hidden layer has 20 neurons, second Hidden layer has 15 neurons.
On actual test collection, the test effect of model is as follows.
Accuracy:Error=0,24.79%
Error≤1,51.24%
Error≤2,65.7%
Error≤1,75.2%
And the test effect being modeled of not classified with two lines difference purely is as follows.
Accuracy:Error=0,21.49%
Error≤1,36.37%
Error≤2,54.55%
Error≤1,65.71%
As can be seen here, a kind of universities and colleges based on neutral net that the application is proposed file line Forecasting Methodology with higher standard True rate.

Claims (6)

1. a kind of universities and colleges based on neutral net file line Forecasting Methodology, it is characterised in that comprise the following steps:
Step one, missing values are carried out to the data collected and outlier is handled;
Treated data are normalized by step 2;
Step 3, classifies to data set;
Step 4, sets up neural network model to the data set classified and is predicted respectively.
2. a kind of universities and colleges based on neutral net according to claim 1 file line Forecasting Methodology, it is characterised in that described Missing values and outlier processing include following four rule:
Rule 1.1, the poor missing values of the line of universities and colleges two is arranged to the poor average of two lines in remaining each year;
Rule 1.2, the poor outlier of two lines of universities and colleges is arranged to the average of remaining each line of year two difference;
Universities and colleges are filed line missing values and are arranged to year two line difference and save control line sum with this year by rule 1.3.
3. a kind of universities and colleges based on neutral net according to claim 1 file line Forecasting Methodology, it is characterised in that described Normalization includes following two rules:
Rule 2.1, range method is used to the difference normalization of each line of year two, and the minimum value and maximum taken is that each line of year two difference is minimum Value and maximum;
Rule 2.2, to each year precedence normalization use range method, the minimum value and maximum taken be each year precedence minimum value and Maximum.
4. a kind of universities and colleges based on neutral net according to claim 1 file line Forecasting Methodology, it is characterised in that described Classification is carried out to data set and includes three below rule:
Rule 3.1, it is training set 1 to filter out the universities and colleges that nearly 4 years two line differences are 0;
Rule 3.2, filters out nearly 4 years two lines difference and is respectively less than 5 points, and two lines difference is not 0 universities and colleges, is training set 2;
Rule 3.3, calculates 4 years precedence deviations of remaining universities and colleges, and deviation is training set 3 in 0~0.45 universities and colleges, deviation 0.45~ 1 universities and colleges are training set 4, and the universities and colleges that deviation is more than 1 are training set 5, and the deviation is that annual precedence subtracts 4 years precedence averages Absolute value of the absolute value sum than upper precedence average.
5. a kind of universities and colleges based on neutral net according to claim 1-4 file line Forecasting Methodology, it is characterised in that institute State and set up neural network model comprising following rule:
Rule 4.1, the two lines difference that the universities and colleges in training set 1 are given tacit consent to these universities and colleges' next years is 0 point;
Rule 4.2, to two lines poor historical data of the universities and colleges in training set 2 using the first four years, is modeled using neutral net, The data of wherein the 4th year are input as remaining is exported.
Rule 4.3, the precedence historical data that the first four years are utilized respectively to training set 3~5 is modeled using neutral net, wherein The data of the 4th year are input as remaining is exported.
6. a kind of universities and colleges based on neutral net according to claim 1 file line Forecasting Methodology, it is characterised in that described Neural network model is double-deck hidden layer, and the neuron of first layer hidden layer is not less than 20 neurons, and second layer hidden layer is not Less than 15 neurons.
CN201710273724.7A 2017-04-25 2017-04-25 A kind of universities and colleges based on neutral net file line Forecasting Methodology Pending CN107169593A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710273724.7A CN107169593A (en) 2017-04-25 2017-04-25 A kind of universities and colleges based on neutral net file line Forecasting Methodology

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710273724.7A CN107169593A (en) 2017-04-25 2017-04-25 A kind of universities and colleges based on neutral net file line Forecasting Methodology

Publications (1)

Publication Number Publication Date
CN107169593A true CN107169593A (en) 2017-09-15

Family

ID=59812679

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710273724.7A Pending CN107169593A (en) 2017-04-25 2017-04-25 A kind of universities and colleges based on neutral net file line Forecasting Methodology

Country Status (1)

Country Link
CN (1) CN107169593A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108629454A (en) * 2018-05-04 2018-10-09 上饶市普适科技有限公司 The method for filing line with two or three recipe prediction colleges and universities
CN108764718A (en) * 2018-05-28 2018-11-06 王春宁 Selection method, system are estimated and volunteered to college entrance examination score based on deep learning algorithm

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108629454A (en) * 2018-05-04 2018-10-09 上饶市普适科技有限公司 The method for filing line with two or three recipe prediction colleges and universities
CN108764718A (en) * 2018-05-28 2018-11-06 王春宁 Selection method, system are estimated and volunteered to college entrance examination score based on deep learning algorithm

Similar Documents

Publication Publication Date Title
CN110070713B (en) Traffic flow prediction method based on bidirectional nested LSTM neural network
Tsamboulas et al. Use of multicriteria methods for assessment of transport projects
CN103745273B (en) Semiconductor fabrication process multi-performance prediction method
CN103632203B (en) A kind of power distribution network division of the power supply area method based on overall merit
CN106909933A (en) A kind of stealing classification Forecasting Methodology of three stages various visual angles Fusion Features
CN104408562B (en) A kind of photovoltaic system generating efficiency comprehensive estimation method based on BP neural network
CN107315884A (en) A kind of building energy consumption modeling method based on linear regression
CN110111024A (en) Scientific and technological achievement market value evaluation method based on AHP fuzzy comprehensive evaluation model
CN105956779A (en) Power transformer running state evaluation method and power transformer running state evaluation device
CN104537433A (en) Sold electricity quantity prediction method based on inventory capacities and business expansion characteristics
CN110047291A (en) A kind of Short-time Traffic Flow Forecasting Methods considering diffusion process
CN109063892A (en) Industry watt-hour meter prediction technique based on BP-LSSVM combination optimization model
CN107203752A (en) A kind of combined depth study and the face identification method of the norm constraint of feature two
CN105956768A (en) Power generation enterprise competitiveness evaluation method based on combined weight determining and improved TOPSIS
CN103632306A (en) Distribution network power supply area division method based on clustering analysis
CN106803010A (en) For the Fuzzy Grey comprehensive evaluation method and device of low permeability reservoir quantitative assessment
CN108415393A (en) A kind of GaAs product quality consistency control method and system
CN109299434B (en) Cargo customs clearance big data is intelligently graded and sampling observation rate computing system
CN103700030A (en) Grey rough set-based power grid construction project post-evaluation index weight assignment method
CN111626559A (en) Main factor analysis method-based medium-voltage distribution network line loss key characteristic index extraction method and system
CN110059765A (en) A kind of mineral intelligent recognition categorizing system and method
CN107169593A (en) A kind of universities and colleges based on neutral net file line Forecasting Methodology
CN108229011A (en) A kind of shale lithofacies development Dominated Factors judgment method, equipment and storage device
CN107403391A (en) A kind of optimal screening method for oil field development pre-arranged project
CN116109212B (en) Airport operation efficiency evaluation index design and monitoring method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication
RJ01 Rejection of invention patent application after publication

Application publication date: 20170915