CN111815488A - Photovoltaic power generation learning assisting method based on decision tree - Google Patents

Photovoltaic power generation learning assisting method based on decision tree Download PDF

Info

Publication number
CN111815488A
CN111815488A CN202010618881.9A CN202010618881A CN111815488A CN 111815488 A CN111815488 A CN 111815488A CN 202010618881 A CN202010618881 A CN 202010618881A CN 111815488 A CN111815488 A CN 111815488A
Authority
CN
China
Prior art keywords
attribute
decision tree
characteristic
power generation
attributes
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010618881.9A
Other languages
Chinese (zh)
Inventor
葛愿
汤程
徐正伟
高文根
余诺
范晓东
林其友
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Polytechnic University
Original Assignee
Anhui Polytechnic University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Polytechnic University filed Critical Anhui Polytechnic University
Priority to CN202010618881.9A priority Critical patent/CN111815488A/en
Publication of CN111815488A publication Critical patent/CN111815488A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/10Services
    • G06Q50/20Education
    • G06Q50/205Education administration or guidance
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/24Classification techniques
    • G06F18/241Classification techniques relating to the classification model, e.g. parametric or non-parametric approaches
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Engineering & Computer Science (AREA)
  • Business, Economics & Management (AREA)
  • Theoretical Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • Tourism & Hospitality (AREA)
  • Strategic Management (AREA)
  • Economics (AREA)
  • Health & Medical Sciences (AREA)
  • Evolutionary Biology (AREA)
  • Human Resources & Organizations (AREA)
  • Evolutionary Computation (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Educational Technology (AREA)
  • Educational Administration (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Marketing (AREA)
  • Primary Health Care (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • General Business, Economics & Management (AREA)
  • Water Supply & Treatment (AREA)
  • Public Health (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a photovoltaic power generation learning assisting method based on a decision tree, which specifically comprises the following steps: s1, taking the characteristic attribute and the assistance grade of the historical assistance object as sample data in a training sample set S, and constructing a decision tree based on the sample data; s2, judging the assistance grade of the assistance object based on the decision tree; and S3, determining the power utilization auxiliary amount of each auxiliary grade based on the total daily power generation amount of the photovoltaic. The student characteristic attribute information is classified by using the decision tree, students with difficult life and excellent performance are accurately screened out, the economic benefit generated by photovoltaic power generation grid connection is directly transferred to the electricity utilization account of the corresponding student according to the campus power dispatching platform, the campus photovoltaic platform is used for generating electricity to bear partial or all electricity utilization of a dormitory, and a new learning-assisting mode is developed.

Description

Photovoltaic power generation learning assisting method based on decision tree
Technical Field
The invention belongs to the technical field of self-learning, and particularly relates to a photovoltaic power generation learning assisting method based on a decision tree.
Background
With the continuous improvement of the support of the cultural education cause of China. As an important component of cultural construction, colleges and universities also develop rapidly, the number of recruits is increased year by year, and more people can enter colleges and universities for study. However, still some students are in poverty, so that the nation establishes various forms of college and university family economic difficulty student subsidy policies such as national awards, national study aids, national study loan aids, work and study fee reduction and avoidance in the advanced education stage.
The method is characterized in that the auxiliary study subsidy condition is artificially established, and the auxiliary study subsidy grade is determined in a voting mode or subjective judgment mode among students meeting the auxiliary study subsidy condition, wherein the assessment mode of the subsidy grade is relatively strong in subjectivity, and the problem of inaccurate assessment of the subsidy grade is easy to occur.
Disclosure of Invention
The invention provides a photovoltaic power generation learning assisting method based on a decision tree, which is used for determining the assistance grade of an assistance object based on the decision tree and improving the objectivity and accuracy of assessment of the assistance grade.
The invention discloses a photovoltaic power generation learning-assisting method based on a decision tree, which comprises the following steps:
s1, taking the characteristic attribute and the subsidy grade of the historical subsidy object as sample data in a training sample set S, and constructing a decision tree based on the sample data;
s2, judging the assistance grade of the assistance object based on the decision tree;
and S3, determining the power utilization auxiliary amount of each auxiliary grade based on the total daily power generation amount of the photovoltaic.
Further, the method for constructing the decision tree specifically comprises the following steps:
s11, calculating the category information entropy of the subsidy level according to the training sample set S;
s12, dividing each characteristic attribute into a plurality of sub-attributes, calculating the conditional entropy of the classification mode,
s13, taking the difference value between the category information entropy of the subsidy level and the conditional entropy of each classification mode as the attribute information gain under the corresponding classification mode;
s14, calculating attribute segmentation information quantity of the characteristic attribute;
s15, calculating the information gain rate of each characteristic attribute based on the attribute segmentation information quantity and the attribute information gain, taking the characteristic attribute with the highest information gain rate as a root node of the decision tree, if a specific sub-attribute under the characteristic attribute corresponds to a subsidy level, taking the sub-attribute under the characteristic attribute as a leaf node, and if not, taking the sub-attribute as a branch to continue splitting;
s16, removing the characteristic attributes of the root node and the bifurcation node;
s17, executing steps S11 to S16, taking the feature attributes with the highest information gain rate as next-level bifurcation nodes until all the feature attributes are used, and finally forming a decision tree;
further, the category information entropy calculation formula of the subsidy level is specifically as follows:
Figure BDA0002562346690000021
wherein, | Sj| is the number of samples for each level of assistance, | S | is the total number of samples, βNM represents the number of subsidy classes as the sum of the weights of the feature attributes.
Further, the conditional entropy calculation formula of the classification mode is specifically as follows:
Figure BDA0002562346690000022
Figure BDA0002562346690000023
feature attribute AxIs divided into1、A2、…、APMapping the samples in the training sample set S to the sample subsets corresponding to the sub-attributes, wherein AiIs a characteristic attribute AxP is a characteristic attribute AxNumber of sub-attributes, | AiI is a characteristic attribute AxIth sub-attribute AiCorresponding to the number of samples in the subset of samples, | S | is the total number of samples, βxIs a characteristic attribute AxCorresponding weight parameter, βNIs the sum of the weights of the feature attributes, | AijI denotes the sample subset AiTo obtain the number of samples, beta, of the jth subsidy levelxiIs characterized by the attribute AxIth sub-attribute AiThe weight parameter of (A), Info (A)i) And representing the information entropy of the corresponding subset of the ith sub-attribute Ai of the characteristic attribute Ax.
Further, the attribute segmentation information amount calculation formula of the feature attribute is specifically as follows:
Figure BDA0002562346690000031
wherein, | AiI is a characteristic attribute AxIth sub-attribute AiCorresponding to the number of samples in the subset of samples, | S | is the total number of samples, βxIs a characteristic attribute AxCorresponding weight parameter, βNIs the sum of the weights of the characteristic attributes.
Further, the calculation formula of the information gain ratio is specifically as follows:
Figure BDA0002562346690000032
wherein, Gain (A)x) For the attribute information gain in the classification mode, SplitInfo (A)x) The information amount is divided for the attributes of the feature attributes.
Further, the characteristic attributes include: poverty degree, public welfare activity record, illegal electricity utilization record, campus consumption record and score level.
The student characteristic attribute information is classified by using the decision tree, students with difficult life and excellent performance are accurately screened out, the economic benefit generated by photovoltaic power generation grid connection is directly transferred to the electricity utilization account of the corresponding student according to the campus power dispatching platform, the campus photovoltaic platform is used for generating electricity to bear part or all of electricity in the dormitory, and a new learning-assisting mode is developed.
Drawings
FIG. 1 is a flow chart of a decision tree-based learning-aiding method for photovoltaic power generation provided by an embodiment of the invention;
fig. 2 is a schematic diagram of a simplified decision tree with feature attributes according to an embodiment of the present invention.
Detailed Description
The following detailed description of the embodiments of the present invention will be given with reference to the accompanying drawings for a more complete, accurate and thorough understanding of the inventive concept and technical solutions of the present invention by those skilled in the art.
Fig. 1 is a flowchart of a decision tree-based photovoltaic power generation learning assistance method provided in an embodiment of the present invention, and the method specifically includes the following steps:
s0, taking the characteristic attributes and the subsidy grades of the historical subsidy objects as sample data in a training sample set S, wherein the characteristic attributes comprise: poverty degree, public welfare activity record, illegal electricity utilization record, campus consumption record and score level.
S1, calculating the category information entropy of the subsidy level according to the training sample set S, wherein the calculation formula is as follows:
Figure BDA0002562346690000041
wherein, | Sj| is the number of samples for each level of assistance, | S | is the total number of samples, βNM represents the number of subsidy classes as the sum of the weights of the feature attributes.
In the embodiment of the invention, the category information entropy represents the sum of uncertainties of various auxiliary level categories in all samples, the larger the information entropy value is, the larger the uncertainty is, and when the information entropy value is 0, only one auxiliary level category is represented. Because the influence factors of various characteristic attributes of students on the judgment result of the subsidy level are different, if the achievement of the students has larger influence than the achievement of participating in public welfare activities and the excellent achievement of the students has larger influence, a weight parameter beta is given to each characteristic attribute and the sub-attribute under each characteristic attribute on the basis of the decision tree.
S2, dividing each characteristic attribute into a plurality of sub-attributes, and calculating the conditional entropy of the classification mode, wherein the calculation formula is as follows:
Figure BDA0002562346690000042
Figure BDA0002562346690000043
feature attribute AxIs divided into1、A2、…、APMapping the total sample data to a sample subset corresponding to each sub-attribute, wherein AiIs a characteristic attribute AxP is a feature attribute AxNumber of sub-attributes, | AiI is a characteristic attribute AxIth sub-attribute AiCorresponding to the number of samples in the subset of samples, | S | is the total number of samples, βxIs a characteristic attribute AxCorresponding weight parameter, βNIs the sum of the weights of the characteristic attributes, | AijI denotes the sample subset AiTo obtain the number of samples, beta, of the jth subsidy levelxiIs a characteristic attribute AxIth sub-attribute AiThe weight parameter of (A), Info (A)i) Representing a characteristic attribute AxIth sub-attribute AiThe information entropy of the corresponding subset.
The conditional entropy of each classification mode represents the sum of the uncertainty of the occurrence of various sub-attribute classes in the classification mode, and the larger the conditional entropy is, the more disorderly the sample classes owned by the attribute are.
S3, taking the difference value between the category information entropy of the subsidy level and the conditional entropy of each classification mode as the attribute information gain under the corresponding classification mode, wherein the calculation formula is as follows:
Gain(Ax)=Info(S)-Info(Ax)
the attribute information gain represents the degree of information uncertainty reduction. The larger the information gain is, the more uncertainty of the divided samples can be better reduced by the attribute, and the attribute is selected as the next-level splitting node, so that the classification target can be better completed.
S4, calculating attribute segmentation information quantity of the characteristic attribute, wherein the calculation formula is as follows:
Figure BDA0002562346690000051
the attribute segmentation information quantity is used for considering the quantity information and the size information of branches when a certain attribute is split, and is beneficial to improving the accuracy of qualification judgment.
S5, calculating the information gain rate of each characteristic attribute based on the attribute segmentation information quantity and the attribute information gain, taking the characteristic attribute with the highest information gain rate as a root node of the decision tree, if a specific sub-attribute under the characteristic attribute corresponds to a subsidy level, taking the sub-attribute under the characteristic attribute as a leaf node, and if not, taking the sub-attribute as a branch to continue splitting, wherein the calculation formula of the information gain rate is as follows:
Figure BDA0002562346690000052
s6, removing the characteristic attributes of the root node and the bifurcation node;
s7, executing steps S1 to S6, taking the feature attributes with the highest information gain rate as next-level bifurcation nodes until all the feature attributes are used, and finally forming a decision tree; FIG. 2 is a schematic diagram of a decision tree with simplified feature attributes.
S8, judging the assistance grade of the assistance object based on the decision tree;
and S9, determining the power utilization auxiliary amount of each auxiliary grade based on the total daily power generation amount of the photovoltaic.
In the embodiment of the invention, the method for acquiring the daily electricity supply auxiliary amount of each auxiliary level specifically comprises the following steps:
the number of the assistant grade numbers m is divided into i types, and the number of the assistant objects corresponding to each assistant grade is N1、N2、.....、Ni,Qi....:Q2:Q1For the power utilization auxiliary proportion corresponding to each auxiliary grade, the total photovoltaic power generation amount before one day is Q, and the following requirements are met:
Figure BDA0002562346690000061
because the number of students is large, all people can not enjoy the benefits brought by photovoltaic learning assistance even if the students are classified according to the assistance levels, the priority coefficient corresponding to each assistance level is generated, the power utilization assistance is issued based on the sequence of the priority coefficients from large to small, and the calculation formula of the priority coefficient corresponding to each assistance level is as follows:
Figure BDA0002562346690000062
if the subsidy object cannot obtain the power utilization subsidy in the application, updating the priority coefficient corresponding to the subsidy grade based on the following formula, wherein the calculation formula is as follows:
Figure BDA0002562346690000063
wherein, the first application without power assistance is taken as the first application, RDeThe difference between the application times of the current application and the first application.
The student characteristic attribute information is classified by using the decision tree, students with difficult life and excellent performance are accurately screened out, the economic benefit generated by photovoltaic power generation grid connection is directly transferred to the electricity utilization account of the corresponding student according to the campus power dispatching platform, the campus photovoltaic platform is used for generating electricity to bear part or all of electricity in the dormitory, and a new learning-assisting mode is developed.
The present invention has been described in connection with the accompanying drawings, and it is to be understood that the invention is not limited to the precise construction and instrumentalities shown, but is intended to cover various modifications, no matter how practical the invention may be embodied, and may be embodied in other forms without departing from the spirit or essential characteristics thereof.

Claims (7)

1. A photovoltaic power generation learning assisting method based on a decision tree is characterized by comprising the following steps:
s1, taking the characteristic attribute and the assistance grade of the historical assistance object as sample data in a training sample set S, and constructing a decision tree based on the sample data;
s2, judging the assistance grade of the assistance object based on the decision tree;
and S3, determining the power utilization auxiliary amount of each auxiliary grade based on the total daily power generation amount of the photovoltaic.
2. The decision tree-based photovoltaic power generation learning-assisting method according to claim 1, wherein the decision tree construction method specifically comprises the following steps:
s11, calculating the category information entropy of the subsidy level according to the training sample set S;
s12, dividing each characteristic attribute into a plurality of sub-attributes, calculating the conditional entropy of the classification mode,
s13, taking the difference value between the category information entropy of the subsidy level and the conditional entropy of each classification mode as the attribute information gain under the corresponding classification mode;
s14, calculating attribute segmentation information quantity of the characteristic attribute;
s15, calculating the information gain rate of each characteristic attribute based on the attribute segmentation information quantity and the attribute information gain, taking the characteristic attribute with the highest information gain rate as a root node of the decision tree, if a specific sub-attribute under the characteristic attribute corresponds to a subsidy level, taking the sub-attribute under the characteristic attribute as a leaf node, otherwise, taking the sub-attribute as a branch to continue splitting;
s16, removing the characteristic attributes of the root node and the bifurcation node;
and S17, executing the steps S11 to S16, taking the feature attributes with the highest information gain rate as the next-level bifurcation nodes until all the feature attributes are used, and finally forming a decision tree.
3. The decision tree-based photovoltaic power generation learning-assisting method according to claim 2, wherein the category information entropy calculation formula of the assistance level is specifically as follows:
Figure FDA0002562346680000021
wherein, | Sj| is the number of samples for each level of assistance, | S | is the total number of samples, βNM represents the number of subsidy classes as the sum of the weights of the feature attributes.
4. The decision tree-based photovoltaic power generation learning-assisting method according to claim 2, wherein the conditional entropy calculation formula of the classification manner is as follows:
Figure FDA0002562346680000022
Figure FDA0002562346680000023
feature attribute AxIs divided into1、A2、...、APMapping the samples in the training sample set S to the sample subsets corresponding to the sub-attributes, wherein AiIs a characteristic attribute AxP is a feature attribute AxNumber of sub-attributes, | AiI is a characteristic attribute AxIth sub-attribute AiCorresponding to the number of samples in the subset of samples, | S | is the total number of samples, βxIs a characteristic attribute AxCorresponding weight parameter, βNIs the sum of the weights of the characteristic attributes, | AijI denotes the sample subset AiTo obtainNumber of samples, β, of the jth subsidy levelxiIs a characteristic attribute AxIth sub-attribute AiThe weight parameter of (A), Info (A)i) Representing a characteristic attribute AxIth sub-attribute AiThe information entropy of the corresponding subset.
5. The decision tree-based photovoltaic power generation learning-assisting method according to claim 2, wherein the attribute segmentation information amount calculation formula of the feature attribute is specifically as follows:
Figure FDA0002562346680000024
wherein, | AiI is a characteristic attribute AxIth sub-attribute AiCorresponding to the number of samples in the subset of samples, | S | is the total number of samples, βxIs a characteristic attribute AxCorresponding weight parameter, βNIs the sum of the weights of the characteristic attributes.
6. The decision tree-based photovoltaic power generation learning-aiding method according to claim 2, wherein the calculation formula of the information gain rate is as follows:
Figure FDA0002562346680000025
wherein, Gain (A)x) For the attribute information gain in the classification mode, SplitInfo (A)x) The information amount is divided for the attributes of the feature attributes.
7. The decision tree-based photovoltaic power generation learning-aiding method according to claim 1, wherein the characteristic attributes comprise: poverty degree, public welfare activity record, illegal electricity utilization record, campus consumption record and score level.
CN202010618881.9A 2020-06-30 2020-06-30 Photovoltaic power generation learning assisting method based on decision tree Pending CN111815488A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010618881.9A CN111815488A (en) 2020-06-30 2020-06-30 Photovoltaic power generation learning assisting method based on decision tree

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010618881.9A CN111815488A (en) 2020-06-30 2020-06-30 Photovoltaic power generation learning assisting method based on decision tree

Publications (1)

Publication Number Publication Date
CN111815488A true CN111815488A (en) 2020-10-23

Family

ID=72855692

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010618881.9A Pending CN111815488A (en) 2020-06-30 2020-06-30 Photovoltaic power generation learning assisting method based on decision tree

Country Status (1)

Country Link
CN (1) CN111815488A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112861692A (en) * 2021-02-01 2021-05-28 电子科技大学中山学院 Room classification model construction method and device and room classification method and device

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112861692A (en) * 2021-02-01 2021-05-28 电子科技大学中山学院 Room classification model construction method and device and room classification method and device
CN112861692B (en) * 2021-02-01 2024-03-15 电子科技大学中山学院 Method and device for constructing room classification model, and method and device for classifying rooms

Similar Documents

Publication Publication Date Title
CN108256102A (en) A kind of Independent College Studentss based on cluster comment religion data analysing method
CN110309300A (en) A method of identification natural sciences knowledge-ID
CN102591929B (en) Library data processing system and data processing method thereof
CN111143539A (en) Knowledge graph-based question-answering method in teaching field
CN106355347A (en) Faculty member evaluation method and faculty member evaluation system
CN111339285B (en) BP neural network-based enterprise resume screening method and system
CN112396252A (en) Method for acquiring construction success evaluation values of double-creation park of power internet of things
Pedersen A ‘heart of goal’and the will to succeed: Goal commitment and task performance among teachers in public schools
CN114186983B (en) Video interview multidimensional scoring method, system, computer equipment and storage medium
Yang Decision tree algorithm based university graduate employment trend prediction
Feng et al. Dialect diversity and foreign direct investment in China
CN111815488A (en) Photovoltaic power generation learning assisting method based on decision tree
CN108961453B (en) Classroom roll call assistant decision-making system
CN110188958A (en) A kind of method that college entrance will intelligently makes a report on prediction recommendation
Fotheringham et al. Multiscale geographically weighted regression: Theory and practice
CN110738565A (en) Real estate finance artificial intelligence composite wind control model based on data set
CN112700356B (en) Method and system for managing information of training personnel for real-time online education and training
Siraj et al. Pre-selection of recruitment candidates using case based reasoning
Ma et al. Optimal evaluation index system and benefit evaluation model for agricultural informatization in Beijing
Zaidi et al. Employment transitions and earnings dynamics in the SAGE model
CN113064962B (en) Environment complaint reporting event similarity analysis method
Liu et al. Mobile Landscape: An Analysis of Chinese Foreign-Educated Young Scientists
CN117196899B (en) Internet information intelligent online management system and method based on vocational education
CN112686057B (en) College student psychological consultation appointment recommendation method and system
Shkolnykova et al. Systems of Innovation in Central and Eastern European countries: Path of Economic Transition and Differences in Institutions

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201023

RJ01 Rejection of invention patent application after publication