CN109214562A - A kind of power grid scientific research hotspot prediction and method for pushing based on RNN - Google Patents

A kind of power grid scientific research hotspot prediction and method for pushing based on RNN Download PDF

Info

Publication number
CN109214562A
CN109214562A CN201810970350.9A CN201810970350A CN109214562A CN 109214562 A CN109214562 A CN 109214562A CN 201810970350 A CN201810970350 A CN 201810970350A CN 109214562 A CN109214562 A CN 109214562A
Authority
CN
China
Prior art keywords
power grid
module
scientific
period
layer
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810970350.9A
Other languages
Chinese (zh)
Inventor
马艳
齐达立
陈玉峰
邹立达
陈素红
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd
Original Assignee
State Grid Corp of China SGCC
Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by State Grid Corp of China SGCC, Electric Power Research Institute of State Grid Shandong Electric Power Co Ltd filed Critical State Grid Corp of China SGCC
Priority to CN201810970350.9A priority Critical patent/CN109214562A/en
Publication of CN109214562A publication Critical patent/CN109214562A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q10/00Administration; Management
    • G06Q10/04Forecasting or optimisation specially adapted for administrative or management purposes, e.g. linear programming or "cutting stock problem"
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06QINFORMATION AND COMMUNICATION TECHNOLOGY [ICT] SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL OR SUPERVISORY PURPOSES, NOT OTHERWISE PROVIDED FOR
    • G06Q50/00Information and communication technology [ICT] specially adapted for implementation of business processes of specific business sectors, e.g. utilities or tourism
    • G06Q50/06Energy or water supply

Landscapes

  • Business, Economics & Management (AREA)
  • Engineering & Computer Science (AREA)
  • Economics (AREA)
  • Human Resources & Organizations (AREA)
  • Strategic Management (AREA)
  • Theoretical Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Marketing (AREA)
  • General Physics & Mathematics (AREA)
  • General Business, Economics & Management (AREA)
  • Tourism & Hospitality (AREA)
  • Public Health (AREA)
  • General Health & Medical Sciences (AREA)
  • Primary Health Care (AREA)
  • Water Supply & Treatment (AREA)
  • Development Economics (AREA)
  • Game Theory and Decision Science (AREA)
  • Entrepreneurship & Innovation (AREA)
  • Operations Research (AREA)
  • Quality & Reliability (AREA)
  • Management, Administration, Business Operations System, And Electronic Commerce (AREA)

Abstract

The invention discloses a kind of power grid science and technology hotspot prediction and method for pushing based on RNN, characterized by the following steps: hotspot prediction model crawls module, character representation module, feature extraction module and model training module by data and forms hotspot prediction model;Two, power grid science and technology hotspot predictions and push, A) t period power grid scientific and technical article of acquisition, above-mentioned hotspot prediction model is applied, obtains scientific research hot keyword;B) the power grid scientific and technical terms in database for natural language are clustered, generate the degree of association of scientific and technical terms;C) synthetic time series power grid science and technology hot spot, and it is pushed to user.The present invention can perceive the power grid Technological research hot spot being likely to occur in following a period of time, and for power grid, scientific research personnel provides research direction and Research Thinking;The potential technical application of the correlation of power grid science and technology hot spot can be recommended power grid researcher by the degree of association synthetic time series scientific research hot spot based on power grid scientific research vocabulary.

Description

A kind of power grid scientific research hotspot prediction and method for pushing based on RNN
Technical field
The power grid scientific research hotspot prediction and supplying system and its implementation that the present invention relates to a kind of based on RNN.
Background technique
Scientific and technological information has all played important function to country, society, the strategy of enterprise, the formulation of plan and implementation.Section Grinding hotspot prediction is the newer application demand in scientific and technological information field.Researcher, scientific research project manager the selected topic, project verification must Must have it is certain perspective, i.e., based on the contemporary scientific state of the art and social development situation, to the following issuable new reason By or generate application value new technology judge.
The method of scientific research hotspot prediction depends critically upon this field highly specialized personnel by Literature Consult and market at present The method of investigation determines the direction that hot spot occurs, in addition, associated application field is also after a new theory is born with technology It needs largely to work and goes to excavate.Therefore, need to design a kind of scientific research hotspot prediction and supplying system, can to it is one section following when Interior scientific research hotspot prediction comes out, and pushes it to scientific research clients, assists researcher and scientific research project management The work of person, especially electrical network field, the field reforming speed and employing new technology is more, thus needs promptly and accurately It is particularly important to hold current and following scientific research hot spot, power grid research work can be made to be correctly oriented.
Summary of the invention
In view of the above-mentioned problems, the invention proposes a kind of power grid scientific research hotspot prediction and method for pushing based on RNN, the party Method allows user to obtain following a period of time power grid scientific research hot keyword in time, and carries out accurate prediction and in real time Push.
To achieve the goals above, the present invention adopts the following technical scheme:
A kind of power grid science and technology hotspot prediction and method for pushing based on RNN, it is characterised in that the following steps are included:
One, hotspot prediction model
Using occur in the past period science and technology news website, bibliographic data base, and crawl module, spy by data It is pre- to levy representation module, the feature extraction module based on depth Boltzmann machine and the formation hot spot of the model training module based on RNN Survey model;
Two, power grid science and technology hotspot predictions and push
A t period power grid scientific and technical article) is acquired, by extracting power grid article TF-IDF vector, applying above-mentioned hotspot prediction Model obtains scientific research hot keyword;
B) the power grid scientific and technical terms in database for natural language are clustered, and generate the pass of scientific and technical terms in cluster process Connection degree;
C) according to power grid science and technology hot keyword and the scientific and technical terms degree of association degree synthetic time series power grid science and technology predicted Hot spot, and it is pushed to user.
Hotspot prediction model the following steps are included:
1) data crawl module, crawl power grid scientific and technological information in science and technology news website, bibliographic data base using crawler technology Article, the article textual that will be crawled, if the scientific and technological information article collection grabbed in a period of time is combined into Tt, wherein t indicates the period Serial number;
2) character representation module, this module provide a kind of scientific and technological information and technical paper Text Representation method, and right The text of a cycle carries out character representation, is characterized abstraction module and provides input, obtains T based on weight TF-IDF algorithmt's Crucial term vector, is denoted asWherein, Q is the quantity of word in scientific and technological dictionary, biFor vocabulary component in corresponding dictionary Gained weight TF-IDF value, biCalculate specific steps are as follows:
A2 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set asWithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B2 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β is closed The weight of keyword,WithAll article n in the period respectivelyjWith mjAverage value;
3) feature extraction module, this module, to a cycle Text character extraction, are mould based on depth Boltzmann machine Type training module, prediction and recommending module provide data input, and depth Boltzmann machine model structure and parameters are provided that
A3) depth Boltzmann machine uses three layers of limitation Boltzmann machine;
B3) first layer is visible element layer, it is seen that elementary layer is Q × B two values matrix.WhereinT is rhythmic set, and [] is to be rounded, and η is a positive integer, and B is all PeriodMiddle largest component is multiplied by a coefficient;
C3 v) is seti,jFor the unit of a visual layers, within the t period, forIf j=bi* η, then vi,j=1,
D3) second layer is hidden layer, and the second layer isTwo values matrix, βQWith βBFor the coefficient greater than 1.
Third layer is hidden layer, and third layer isTwo values matrix;
E3) depth Boltzmann machine is successively trained using limitation Boltzmann machine training method;
F3 it) is located at period t, the output based on depth Boltzmann machine is Xt
4) model training module, after hotspot prediction refers to prediction following a period of time e, the research hotspot of appearance, this module Prediction model is provided with recommending module for prediction, the hotspot prediction model of this module is improved based on RNN structure, knot Structure is as follows with training method:
A4) hotspot prediction model structure t loop cycle layer and 3 layers of BP neural network as shown in figure 3, be made of, circulation layer Input cycle data Xt, XtFor the output of t period depth Boltzmann machine,
U is input weight, and W is circulation weight, VlFor the weight of BP neural network l respective layer.VectorIndicate circulation mind Weighting through member in t moment inputs, and calculation method is as follows:
Wherein St-1For the value of t-1 moment circulation layer, andWherein g () is activation primitive;
VectorIndicate that the BP neural network l layers of weighting in t moment inputs, calculation method is
otFor final output, calculation method is
B4 o) is calculatedtDuring, it is iterated to calculate based on following formula forward:
C4) indicate that model error, error function are with E
Wherein, N indicates sample size, y(n)Indicate the actual value of sample n, o(n)Indicate the output valve of sample n.y(n)With o(n) It is vocabulary vector, if y(n)=[c1,c2,…,ci,…,cm], m is vocabulary total quantity, if i-th of vocabulary is hot keyword, ciLabeled as 1, other non-hot vocabulary are labeled as 0, and hot spot vocabulary can have multiple;If o(n)=[d1,d2,…,di,…,dm], So y(n)logo(n)It is calculated with following formula:
y(n)logo(n)=c1logd1+c2logd2+…+cilogdi+…+cmlogdm
4) in model training module,
D4 the error term of each layer or each cycle period) is indicated with δ by error back propagation direction again, each error term Calculation method is as follows:
Wherein, diag [x] indicates to create a diagonal matrix according to vector x,
I) error function E is calculated to any period k weight matrix WkGradient formula it is as follows:
WhereinIt indicatesI-th of component of error item vector, sk-1,jIndicate Sk-1J-th of neuron output valve,
The gradient of circulation layer weight matrix W is the sum of the gradient at each moment, and formula is as follows:
II) weight matrix U gradient calculation formula it is as follows:
Wherein xk,jIndicate XkJ-th of component value,
III) it setsFor the value of l layers of BP neural network layer of t moment, then:
Weight matrix VlGradient calculation formula it is as follows:
Wherein hk-1,jIndicate Hk-1J-th of neuron output valve.
It extracts in power grid article TF-IDF vector, a kind of power grid scientific and technological information and technical paper Text Representation side is provided Method, and character representation is carried out to the text of a cycle, it is characterized abstraction module and input is provided, obtained based on weight TF-IDF algorithm Obtain TtCrucial term vector, be denoted asWherein, Q is the quantity of word in power grid science and technology dictionary, biFor corresponding dictionary Weight TF-IDF value obtained by middle vocabulary component, biCalculate specific steps are as follows:
A1 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set asWithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B1 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β is closed The weight of keyword,WithAll article n in the period respectivelyjWith mjAverage value.
In this application, TF-IDF (term frequency-inverse document frequency) is a kind of use In the common weighting technique of information retrieval and data mining, this will not be detailed here.Above-mentioned RNN is Recognition with Recurrent Neural Network, Recurrent Neural Network, neural network are a kind of artificial neural network of node orientation connection cyclization, this net The internal state of network can show dynamic time sequence behavior, and different from feedforward neural network, RNN can use its internal note Recall to handle the list entries of arbitrary sequence, it can be easier to handle handwriting recognition, the speech recognition etc. if not being segmented for this. Limited Boltzmann machine can also be referred to as by limiting Boltzmann machine, it is a kind of can be distributed by input data set learning probability with Machine generates neural network, according to the difference of task, is limited the method that supervised learning or unsupervised learning can be used in Bo Ziman machine It is trained, limitation Boltzmann machine training method is the prior art, and this will not be detailed here.It is poly- in power grid scientific and technical terms cluster Class uses the prior art, and such as K-means, this will not be detailed here.
The invention has the following advantages that
1) present invention can perceive the power grid Technological research hot spot being likely to occur in following a period of time, be power grid scientific research people Member provides research direction and Research Thinking;
2) degree of association synthetic time series scientific research hot spot based on power grid scientific research vocabulary, the correlation of power grid science and technology hot spot can be dived Technical application recommend power grid researcher.
Detailed description of the invention
Fig. 1 is process frame diagram of the invention;
Fig. 2 is scientific research hotspot prediction and push frame diagram in the present invention;
Fig. 3 is the structure chart of depth Boltzmann machine in the present invention;
Fig. 4 is the hotspot prediction model structure in the present invention based on RNN.
Specific embodiment
Refering to what is shown in Fig. 1, the invention discloses a kind of power grid science and technology hotspot prediction and method for pushing based on RNN, this hair It is bright the following steps are included:
One, hotspot prediction model
With reference to Fig. 2, using occur in the past period science and technology news website, bibliographic data base, and climbed by data Modulus block, character representation module, the feature extraction module based on depth Boltzmann machine and the model training module shape based on RNN At hotspot prediction model;
Two, power grid science and technology hotspot predictions and push
A the power grid scientific and technical article for) crawling t period, by extracting, power grid article TF-IDF vector, to apply above-mentioned hot spot pre- Model is surveyed, obtains scientific research hot keyword;
B) the power grid scientific and technical terms in database for natural language are clustered, and generate the pass of scientific and technical terms in cluster process Connection degree;
C) according to power grid science and technology hot keyword and the scientific and technical terms degree of association degree synthetic time series power grid science and technology predicted Hot spot, and it is pushed to user.
Hotspot prediction model the following steps are included:
1) data crawl module, crawl power grid scientific and technological information in science and technology news website, bibliographic data base using crawler technology Article, the article textual that will be crawled, if the scientific and technological information article collection grabbed in a period of time is combined into Tt, wherein t indicates the period Serial number;
2) character representation module, this module provide a kind of scientific and technological information and technical paper Text Representation method, and right The text of a cycle carries out character representation, is characterized abstraction module and provides input, obtains T based on weight TF-IDF algorithmt's Crucial term vector, is denoted asWherein, Q is the quantity of word in scientific and technological dictionary, biFor vocabulary component in corresponding dictionary Gained weight TF-IDF value, biCalculate specific steps are as follows:
A2 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set asWithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B2 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β is closed The weight of keyword,WithAll article n in the period respectivelyjWith mjAverage value;
3) feature extraction module, refering to what is shown in Fig. 3, this module is based on the special to a cycle text of depth Boltzmann machine Sign is extracted, and provides data input, depth Boltzmann machine model structure and parameters for model training module, prediction and recommending module It is provided that
A3) depth Boltzmann machine uses three layers of limitation Boltzmann machine;
B3) first layer is visible element layer, it is seen that elementary layer is Q × B two values matrix.WhereinT is rhythmic set, and [] is to be rounded, and η is a biggish positive integer, by In biValue range is [0,1], so η determines the size of B, in this patent, η setting value is 10000;That is B is in all weeks PhaseMiddle largest component is multiplied by a coefficient;
C3 v) is seti,jFor the unit of a visual layers, within the t period, forIf j=bi* η, then vi,j=1,
D3) second layer is hidden layer, and the second layer isTwo values matrix, βQWith βBFor the coefficient greater than 1.Third Layer is hidden layer, and third layer isTwo values matrix;
E3) depth Boltzmann machine is successively trained using limitation Boltzmann machine training method;
F3 it) is located at period t, the output based on depth Boltzmann machine is Xt
4) model training module, with reference to Fig. 4, the fat full arrows line expression in figure calculates forward direction, for calculating ot, empty After line arrow line indicates that error back propagation direction, hotspot prediction refer to prediction following a period of time e, the research hotspot of appearance, This module provides prediction model for prediction and recommending module, and the hotspot prediction model of this module is improved based on RNN structure , structure is as follows with training method:
A4) hotspot prediction model structure t loop cycle layer and 3 layers of BP neural network as shown in figure 3, be made of, circulation layer Input cycle data Xt, XtFor the output of t period depth Boltzmann machine,
U is input weight, and W is circulation weight, VlFor the weight of BP neural network l respective layer.VectorIndicate circulation mind Weighting through member in t moment inputs, and calculation method is as follows:
Wherein St-1For the value of t-1 moment circulation layer, andWherein g () is activation primitive;
VectorIndicate that BP neural network is inputted in the weighting of t moment, calculation method is
otFor final output, calculation method is
B4 o) is calculatedtDuring, it is iterated to calculate based on following formula forward:
C4) indicate that model error, error function are with E
Wherein, N indicates sample size, y(n)Indicate the actual value of sample n, o(n)Indicate the output valve of sample n.y(n)With o(n) It is vocabulary vector, if y(n)=[c1,c2,…,ci,…,cm], m is vocabulary total quantity, if i-th of vocabulary is hot keyword, ciLabeled as 1, other non-hot vocabulary are labeled as 0, and hot spot vocabulary can have multiple;If o(n)=[d1,d2,…,di,…,dm], So y(n)logo(n)It is calculated with following formula:
y(n)logo(n)=c1logd1+c2logd2+…+cilogdi+…+cmlogdm
In 4) model training module,
D4 the error term of each layer or each cycle period) is indicated with δ by error back propagation direction again, each error term Calculation method is as follows:
Wherein, diag [x] indicates to create a diagonal matrix according to vector x,
I) error function E is calculated to any period k weight matrix WkGradient formula it is as follows:
WhereinIt indicatesI-th of component of error item vector, sk-1,jIndicate Sk-1J-th of neuron output valve,
The gradient of circulation layer weight matrix W is the sum of the gradient at each moment, and formula is as follows:
II) weight matrix U gradient calculation formula it is as follows:
Wherein xk,jIndicate XkJ-th of component value,
III) it setsFor the value of l layers of BP neural network layer of t moment, then:
Weight matrix VlGradient calculation formula it is as follows:
Wherein hk-1,jIndicate Hk-1J-th of neuron output valve.
It extracts in power grid article TF-IDF vector, a kind of power grid scientific and technological information and technical paper Text Representation side is provided Method, and character representation is carried out to the text of a cycle, it is characterized abstraction module and input is provided, obtained based on weight TF-IDF algorithm Obtain TtCrucial term vector, be denoted asWherein, Q is the quantity of word in power grid science and technology dictionary, biFor corresponding dictionary Weight TF-IDF value obtained by middle vocabulary component, biCalculate specific steps are as follows:
A1 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set asWithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B1 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β is closed The weight of keyword,WithAll article n in the period respectivelyjWith mjAverage value.
In above-mentioned 4) model training module, error back propagation formula and error term formula based on D4) simultaneously pass through reversed Propagation Neural Network training algorithm training pattern provides accurate model for power grid science and technology hot spot and prediction.
Although above-mentioned be described and verify to a specific embodiment of the invention and validity in conjunction with attached drawing, not Limiting the scope of the invention, those skilled in the art should understand that, based on the technical solutions of the present invention, this Field technical staff does not need to make the creative labor the various modifications or changes that can be made still in protection scope of the present invention Within.

Claims (4)

1. a kind of power grid science and technology hotspot prediction and method for pushing based on RNN, it is characterised in that the following steps are included:
One, hotspot prediction model
Using occur in the past period science and technology news website, bibliographic data base, and crawl module, mark sheet by data Show that module, the feature extraction module based on depth Boltzmann machine and the model training module based on RNN form hotspot prediction mould Type;
Two, power grid science and technology hotspot predictions and push
A t period power grid scientific and technical article) is acquired, by extracting power grid article TF-IDF vector, applying above-mentioned hotspot prediction mould Type obtains scientific research hot keyword;
B) the power grid scientific and technical terms in database for natural language are clustered, and generate the association of scientific and technical terms in cluster process Degree;
C the power grid science and technology hot keyword and scientific and technical terms degree of association degree synthetic time series power grid science and technology hot spot that) basis predicts, And it is pushed to user.
2. the power grid science and technology hotspot prediction and method for pushing according to claim 1 based on RNN, which is characterized in that hot spot Prediction model the following steps are included:
1) data crawl module, crawl power grid scientific and technological information article in science and technology news website, bibliographic data base using crawler technology, The article textual that will be crawled, if the scientific and technological information article collection grabbed in a period of time is combined into Tt, wherein t indicates period serial number;
2) character representation module, this module provide a kind of scientific and technological information and technical paper Text Representation method, and to one The text in period carries out character representation, is characterized abstraction module and provides input, obtains T based on weight TF-IDF algorithmtKey Term vector is denoted asWherein, Q is the quantity of word in scientific and technological dictionary, biFor obtained by vocabulary component in corresponding dictionary Weight TF-IDF value, biCalculate specific steps are as follows:
A2 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set as WithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B2 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β keyword Weight,WithAll article n in the period respectivelyjWith mjAverage value;
3) feature extraction module, this module, to a cycle Text character extraction, are instructed based on depth Boltzmann machine for model Practice module, prediction and recommending module and data input is provided, depth Boltzmann machine model structure and parameters are provided that
A3) depth Boltzmann machine uses three layers of limitation Boltzmann machine;
B3) first layer is visible element layer, it is seen that elementary layer is Q × B two values matrix.WhereinT is rhythmic set, and [] is to be rounded, and η is a positive integer, and B is all PeriodMiddle largest component is multiplied by a coefficient;
C3 v) is seti,jFor the unit of a visual layers, within the t period, forIf j=bi* η, then vi,j=1,
D3) second layer is hidden layer, and the second layer isTwo values matrix, βQWith βBFor the coefficient greater than 1.Third layer is Hidden layer, third layer areTwo values matrix;
E3) depth Boltzmann machine is successively trained using limitation Boltzmann machine training method;
F3 it) is located at period t, the output based on depth Boltzmann machine is Xt
4) model training module, after hotspot prediction refers to prediction following a period of time e, the research hotspot of appearance, this module is pre- Survey with recommending module provide prediction model, the hotspot prediction model of this module is improved based on RNN structure, structure with Training method is as follows:
A4) hotspot prediction model structure t loop cycle layer and 3 layers of BP neural network as shown in figure 3, be made of, circulation layer input Cycle data Xt, XtFor the output of t period depth Boltzmann machine,
U is input weight, and W is circulation weight, VlFor the weight of BP neural network l respective layer.VectorIndicate circulation neuron It is inputted in the weighting of t moment, calculation method is as follows:
Wherein St-1For the value of t-1 moment circulation layer, andWherein g () is activation primitive;
VectorIndicate that the BP neural network l layers of weighting in t moment inputs, calculation method is
otFor final output, calculation method is
B4 o) is calculatedtDuring, it is iterated to calculate based on following formula forward:
C4) indicate that model error, error function are with E
Wherein, N indicates sample size, y(n)Indicate the actual value of sample n, o(n)Indicate the output valve of sample n.y(n)With o(n)It is word Remittance vector, if y(n)=[c1,c2,…,ci,…,cm], m is vocabulary total quantity, if i-th of vocabulary is hot keyword, ciMark It is denoted as 1, other non-hot vocabulary are labeled as 0, and hot spot vocabulary can have multiple;If o(n)=[d1,d2,…,di,…,dm], then y(n)logo(n)It is calculated with following formula:
y(n)logo(n)=c1logd1+c2logd2+…+cilogdi+…+cmlogdm
3. the power grid science and technology hotspot prediction and method for pushing according to claim 2 based on RNN, which is characterized in that 4) mould In type training module,
D4 the error term of each layer or each cycle period, the calculating of each error term) are indicated with δ by error back propagation direction again Method is as follows:
Wherein, diag [x] indicates to create a diagonal matrix according to vector x,
I) error function E is calculated to any period k weight matrix WkGradient formula it is as follows:
WhereinIt indicatesI-th of component of error item vector, sk-1,jIndicate Sk-1J-th of neuron output valve,
The gradient of circulation layer weight matrix W is the sum of the gradient at each moment, and formula is as follows:
II) weight matrix U gradient calculation formula it is as follows:
Wherein xk,jIndicate XkJ-th of component value,
III) it setsFor the value of l layers of BP neural network layer of t moment, then:
Weight matrix VlGradient calculation formula it is as follows:
Wherein hk-1,jIndicate Hk-1J-th of neuron output valve.
4. the power grid science and technology hotspot prediction and method for pushing according to any one of claim 1-3 based on RNN, feature It is, extracts in power grid article TF-IDF vector, a kind of power grid scientific and technological information and technical paper Text Representation method is provided, And character representation is carried out to the text of a cycle, it is characterized abstraction module and input is provided, obtained based on weight TF-IDF algorithm TtCrucial term vector, be denoted asWherein, Q is the quantity of word in power grid science and technology dictionary, biFor in corresponding dictionary Weight TF-IDF value obtained by vocabulary component, biCalculate specific steps are as follows:
A1 t) is setjIt is TtA this paper, based on standard TF-IDF algorithm obtain tjThe TF-IDF value of i-th of vocabulary, is set as WithIndicate tjIn a keyword, wherein i beThe mark of vocabulary component in corresponding dictionary;
B1 t) is setjDownload or amount of reading be nj, reference amount is mj.SoWherein β keyword Weight,WithAll article n in the period respectivelyjWith mjAverage value.
CN201810970350.9A 2018-08-24 2018-08-24 A kind of power grid scientific research hotspot prediction and method for pushing based on RNN Pending CN109214562A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810970350.9A CN109214562A (en) 2018-08-24 2018-08-24 A kind of power grid scientific research hotspot prediction and method for pushing based on RNN

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810970350.9A CN109214562A (en) 2018-08-24 2018-08-24 A kind of power grid scientific research hotspot prediction and method for pushing based on RNN

Publications (1)

Publication Number Publication Date
CN109214562A true CN109214562A (en) 2019-01-15

Family

ID=64989643

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810970350.9A Pending CN109214562A (en) 2018-08-24 2018-08-24 A kind of power grid scientific research hotspot prediction and method for pushing based on RNN

Country Status (1)

Country Link
CN (1) CN109214562A (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110188263A (en) * 2019-05-29 2019-08-30 国网山东省电力公司电力科学研究院 It is a kind of towards isomery when away from scientific research hotspot prediction method and system
CN110378532A (en) * 2019-07-19 2019-10-25 中南大学 A kind of scientific research theme trend prediction method based on random tree
CN110472004A (en) * 2019-08-23 2019-11-19 国网山东省电力公司电力科学研究院 A kind of method and system of scientific and technological information data multilevel cache management
CN110688477A (en) * 2019-10-10 2020-01-14 华夏幸福产业投资有限公司 Prediction method, device, equipment and storage medium in technical hotspot field
CN110705821A (en) * 2019-08-23 2020-01-17 上海科技发展有限公司 Hotspot subject prediction method, device, terminal and medium based on multiple evaluation dimensions
CN111832815A (en) * 2020-07-02 2020-10-27 山东电力研究院 Scientific research hotspot prediction method and system
CN112187890A (en) * 2020-09-15 2021-01-05 卢霞浩 Information distribution method based on cloud computing and big data and block chain financial cloud center
CN112734126A (en) * 2021-01-18 2021-04-30 武汉烽火技术服务有限公司 Hot spot prediction method, device and equipment and readable storage medium
CN113449175A (en) * 2020-03-24 2021-09-28 北京沃东天骏信息技术有限公司 Hot data recommendation method and device
CN113722424A (en) * 2021-07-20 2021-11-30 山东电力研究院 Scientific research direction recommendation method and system based on news events

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102012929A (en) * 2010-11-26 2011-04-13 北京交通大学 Network consensus prediction method and system
JP2013257765A (en) * 2012-06-13 2013-12-26 Ntt Data Corp Term extraction device, term extraction method, and program
CN105224608A (en) * 2015-09-06 2016-01-06 华南理工大学 The hot news Forecasting Methodology analyzed based on microblog data and system
CN106651030A (en) * 2016-12-21 2017-05-10 重庆邮电大学 Method for predicting user participation behavior of hot topic by improved RBF neural network
CN106682172A (en) * 2016-12-28 2017-05-17 江苏大学 Keyword-based document research hotspot recommending method
CN107038156A (en) * 2017-04-28 2017-08-11 北京清博大数据科技有限公司 A kind of hot spot of public opinions Forecasting Methodology based on big data
CN107992976A (en) * 2017-12-15 2018-05-04 中国传媒大学 Much-talked-about topic early-stage development trend predicting system and Forecasting Methodology

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102012929A (en) * 2010-11-26 2011-04-13 北京交通大学 Network consensus prediction method and system
JP2013257765A (en) * 2012-06-13 2013-12-26 Ntt Data Corp Term extraction device, term extraction method, and program
CN105224608A (en) * 2015-09-06 2016-01-06 华南理工大学 The hot news Forecasting Methodology analyzed based on microblog data and system
CN106651030A (en) * 2016-12-21 2017-05-10 重庆邮电大学 Method for predicting user participation behavior of hot topic by improved RBF neural network
CN106682172A (en) * 2016-12-28 2017-05-17 江苏大学 Keyword-based document research hotspot recommending method
CN107038156A (en) * 2017-04-28 2017-08-11 北京清博大数据科技有限公司 A kind of hot spot of public opinions Forecasting Methodology based on big data
CN107992976A (en) * 2017-12-15 2018-05-04 中国传媒大学 Much-talked-about topic early-stage development trend predicting system and Forecasting Methodology

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
张祎萍: ""基于聚类分析的热点图书排序推荐方法研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *
胡悦: ""微博舆情热点发现及趋势预测研究"", 《中国优秀硕士学位论文全文数据库 信息科技辑》 *

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110188263A (en) * 2019-05-29 2019-08-30 国网山东省电力公司电力科学研究院 It is a kind of towards isomery when away from scientific research hotspot prediction method and system
CN110188263B (en) * 2019-05-29 2021-11-30 国网山东省电力公司电力科学研究院 Heterogeneous time interval-oriented scientific research hotspot prediction method and system
CN110378532A (en) * 2019-07-19 2019-10-25 中南大学 A kind of scientific research theme trend prediction method based on random tree
CN110378532B (en) * 2019-07-19 2021-12-14 中南大学 Scientific research topic state prediction method based on random tree
CN110472004A (en) * 2019-08-23 2019-11-19 国网山东省电力公司电力科学研究院 A kind of method and system of scientific and technological information data multilevel cache management
CN110705821A (en) * 2019-08-23 2020-01-17 上海科技发展有限公司 Hotspot subject prediction method, device, terminal and medium based on multiple evaluation dimensions
WO2021035975A1 (en) * 2019-08-23 2021-03-04 上海科技发展有限公司 Method and apparatus for predicting hot-topic subject on basis of multiple evaluation dimensions, terminal, and medium
CN110688477A (en) * 2019-10-10 2020-01-14 华夏幸福产业投资有限公司 Prediction method, device, equipment and storage medium in technical hotspot field
CN110688477B (en) * 2019-10-10 2022-11-15 华夏幸福产业投资有限公司 Prediction method, device, equipment and storage medium in technical hotspot field
CN113449175A (en) * 2020-03-24 2021-09-28 北京沃东天骏信息技术有限公司 Hot data recommendation method and device
CN111832815B (en) * 2020-07-02 2023-12-05 国网山东省电力公司电力科学研究院 Scientific research hot spot prediction method and system
CN111832815A (en) * 2020-07-02 2020-10-27 山东电力研究院 Scientific research hotspot prediction method and system
CN112187890A (en) * 2020-09-15 2021-01-05 卢霞浩 Information distribution method based on cloud computing and big data and block chain financial cloud center
CN112187890B (en) * 2020-09-15 2021-05-07 北京联银通科技有限公司 Information distribution method based on cloud computing and big data and block chain financial cloud center
CN112734126A (en) * 2021-01-18 2021-04-30 武汉烽火技术服务有限公司 Hot spot prediction method, device and equipment and readable storage medium
CN113722424A (en) * 2021-07-20 2021-11-30 山东电力研究院 Scientific research direction recommendation method and system based on news events
CN113722424B (en) * 2021-07-20 2024-02-02 国网山东省电力公司电力科学研究院 Scientific research direction recommendation method and system based on news event

Similar Documents

Publication Publication Date Title
CN109214562A (en) A kind of power grid scientific research hotspot prediction and method for pushing based on RNN
CN109299396A (en) Merge the convolutional neural networks collaborative filtering recommending method and system of attention model
CN108733742A (en) Global normalization's reader system and method
CN104598611B (en) The method and system being ranked up to search entry
CN113761218B (en) Method, device, equipment and storage medium for entity linking
Cohen et al. End to end long short term memory networks for non-factoid question answering
Anupriya et al. LDA based topic modeling of journal abstracts
Wang et al. Outfit compatibility prediction and diagnosis with multi-layered comparison network
CN109522553A (en) Name recognition methods and the device of entity
CN110796190A (en) Exponential modeling with deep learning features
CN109299976A (en) Clicking rate prediction technique, electronic device and computer readable storage medium
CN110222163A (en) A kind of intelligent answer method and system merging CNN and two-way LSTM
CN107391485A (en) Entity recognition method is named based on the Korean of maximum entropy and neural network model
CN104615767A (en) Searching-ranking model training method and device and search processing method
CN111222318B (en) Trigger word recognition method based on double-channel bidirectional LSTM-CRF network
CN110413769A (en) Scene classification method, device, storage medium and its electronic equipment
CN106970981B (en) Method for constructing relation extraction model based on transfer matrix
CN111710428B (en) Biomedical text representation method for modeling global and local context interaction
CN106776808A (en) Information data offering method and device based on artificial intelligence
CN104182421A (en) Video clustering method and detecting method
CN109815478A (en) Medicine entity recognition method and system based on convolutional neural networks
CN111191461B (en) Remote supervision relation extraction method based on course learning
Munshi et al. Towards an automated islamic fatwa system: Survey, dataset and benchmarks
CN103593334B (en) A kind of method and system for being used to judge emotional degree of text
Matsuda et al. Benchmark for Personalized Federated Learning

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190115

WD01 Invention patent application deemed withdrawn after publication