US11868431B2 - Information processing apparatus, information processing method, and non-transitory computer readable medium - Google Patents
Information processing apparatus, information processing method, and non-transitory computer readable medium Download PDFInfo
- Publication number
- US11868431B2 US11868431B2 US17/197,377 US202117197377A US11868431B2 US 11868431 B2 US11868431 B2 US 11868431B2 US 202117197377 A US202117197377 A US 202117197377A US 11868431 B2 US11868431 B2 US 11868431B2
- Authority
- US
- United States
- Prior art keywords
- prediction
- value
- objective variable
- frequency data
- values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active, expires
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING OR CALCULATING; COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F18/00—Pattern recognition
- G06F18/20—Analysing
- G06F18/21—Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
- G06F18/213—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods
- G06F18/2132—Feature extraction, e.g. by transforming the feature space; Summarisation; Mappings, e.g. subspace methods based on discrimination criteria, e.g. discriminant analysis
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01W—METEOROLOGY
- G01W1/00—Meteorology
- G01W1/10—Devices for predicting weather conditions
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/042—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators in which a parameter or coefficient is automatically adjusted to optimise the performance
-
- G—PHYSICS
- G05—CONTROLLING; REGULATING
- G05B—CONTROL OR REGULATING SYSTEMS IN GENERAL; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS OR ELEMENTS
- G05B13/00—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion
- G05B13/02—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric
- G05B13/04—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators
- G05B13/048—Adaptive control systems, i.e. systems automatically adjusting themselves to have a performance which is optimum according to some preassigned criterion electric involving the use of models or simulators using a predictor
Definitions
- Embodiments described herein relate to an information processing apparatus, an information processing method, and a computer program.
- a model for prediction is created.
- prediction is performed based on search for past similar examples in some cases. For example, using a given current state, a plurality of similar cases (similar examples) are selected from a database in which past cases are stored. Each case includes a state and a value of the objective variable, and a plurality of similar examples are selected that rank highly in terms of closeness of distance to the current state.
- a distribution of the values of the objective variable included in the selected similar examples is outputted as probabilistic predictions.
- the outputted distribution significantly deviates from an intended distribution of prediction values, and prediction performance is reduced.
- FIG. 1 is a block diagram of a prediction apparatus that is an information processing apparatus according to a first embodiment
- FIG. 2 shows an example of a past state DB
- FIG. 3 shows an example of selection of similar examples in a feature space
- FIG. 4 is a block diagram of a prediction corrector
- FIG. 5 is a block diagram of a member corrector
- FIG. 6 is a diagram for describing correction performed by a frequency distribution corrector
- FIG. 7 is a flowchart of an example of operation of the prediction apparatus according to the first embodiment.
- FIG. 8 is a block diagram of a prediction corrector in the prediction apparatus, according to a second embodiment
- FIG. 9 is a block diagram of a member corrector according to the second embodiment.
- FIG. 10 is a diagram for describing correction performed by a constraint corrector 34 ;
- FIG. 11 is a block diagram of a prediction apparatus according to a third embodiment.
- FIG. 12 is a block diagram of a periodicity extractor
- FIG. 13 is a block diagram of a prediction apparatus according to a fourth embodiment.
- FIG. 14 is a flowchart of an example of operation of the prediction apparatus according to the fourth embodiment.
- FIG. 15 is a block diagram of a prediction apparatus according to a fifth embodiment.
- FIG. 16 shows an example of a past numerically calculated feature DB
- FIG. 17 shows a hardware configuration of any of the prediction apparatuses (information processing apparatuses) according to the embodiments.
- an information processing apparatus includes: a processor configured to select a first case based on subject data including at least one feature, and acquire a first prediction value that is a value of an objective variable included in the first case; a first estimator configured to estimate frequency data indicating frequencies of observation values of the objective variable, based on a history of observation values of the objective variable; a second estimator configured to estimate first frequency data indicating frequencies of first prediction values, based on a history of first prediction values acquired before the first prediction value is acquired; and a corrector configured to correct the first prediction value acquired by the processor, based on the frequency data and the first frequency data.
- Prediction may be prediction of anything, such as prediction of demand for electric power, prediction of share prices on a stock market, prediction of prices of electricity on an electricity transaction market, or prediction of a meteorological variable other than the solar irradiance.
- FIG. 1 is a block diagram of a prediction apparatus 101 that is an information processing apparatus according to a first embodiment.
- the prediction apparatus 101 in FIG. 1 includes a past state database (DB) 11 , a similar example selector (processor) 12 , a state acquirer 13 , an objective variable acquirer 14 , a prediction corrector 15 , and an output device 16 .
- the state acquirer 13 is communicably connected to a state observation device 201 .
- the objective variable acquirer 14 is communicably connected to an objective variable observation device 202 .
- the past state DB 11 stores a plurality of cases, each of which includes state data (first data) on a subject system and a value of an objective variable, each in association with a time.
- a set of state data, a value of the objective variable, and a time corresponds to one case.
- the value of the objective variable is a past observation value corresponding to a state indicated by the state data, and is used for a future prediction value corresponding to the state.
- the state data (first data) on the subject system includes one or more features that characterize a state of the subject system.
- the state data includes meteorological variables such as a temperature, a humidity, and a wind speed at the point A at a certain time, as a past weather state (past features).
- the state data may include other meteorological variables and the like.
- the objective variable is a feature related to a past weather state, and is a feature to be predicted in the present embodiment.
- the objective variable is assumed to be solar irradiance at the point A on a next day, that is, 24 hours later. Assuming that a time of state data corresponding to a value of the objective variable is “t”, the value of the objective variable corresponding to the state data (the value of the objective variable included in the same case that includes the state data) is not solar irradiance at the point A at the time “t”, but solar irradiance at the point A 24 hours after “t”.
- features at points around the point A can also be added, on a supposition that the meteorological variables at such nearby points also affect the solar irradiance.
- meteorological variables before the time of prediction before the time “t” can also be added as features.
- FIG. 2 shows an example of the past state DB 11 .
- the past state DB 11 a plurality of past several years of cases, each of which is a set of state data (one or more features) and a value of the objective variable as described above, are stored for each time.
- the state acquirer 13 acquires, from the state observation device 201 , state data including one or more features of the same types as the features in the past state DB 11 .
- the state acquirer 13 acquires state data including a temperature, a humidity, an atmospheric pressure, a wind speed, and the like at each fixed interval.
- the state observation device 201 is installed at the point A, and observes a temperature, a humidity, an atmospheric pressure, a wind speed, and the like at the point A.
- the state acquirer 13 stores the acquired state data as a history in an internal storage.
- the state acquirer 13 provides, to the similar example selector 12 , current state data (subject data) that is state data at a time the similar example selector 12 uses the state data for prediction.
- the current state data may be provided in response to a request from the similar example selector 12 .
- the current state data includes one or more features at the current time, that is, the time at which prediction is intended to be performed.
- the current state data is a feature or features of the same types as the features stored in the past state DB 11 .
- the current state data includes a temperature, a humidity, an atmospheric pressure, a wind speed, and the like at the current time at the point A.
- the similar example selector 12 determines that prediction of the objective variable is performed at each predetermined time, and receives current state data to be used for prediction from the state acquirer 13 .
- the similar example selector 12 calculates a degree of similarity between the state data (one or more features) stored in the past state DB 11 and the current state data, in a feature space that is a space with each feature serving as a coordinate of a coordinate system.
- the degree of similarity is calculated based on a predetermined distance (metric) that indicates a degree of closeness in the feature space.
- metric a predetermined distance that indicates a degree of closeness in the feature space.
- the degree of similarity is measured as a Euclidean distance in the feature space. In such a case, a shorter distance indicates more similarity.
- the features include features of different dimensions and features of different scales, such as temperature, wind speed, and atmospheric pressure, it is preferable that appropriate standardization be made beforehand.
- the similar example selector 12 selects a predetermined number (M) of cases in descending order of degree of similarity (ascending order of distance, or ascending order of value as the degree of similarity).
- M a predetermined number of cases in descending order of degree of similarity (ascending order of distance, or ascending order of value as the degree of similarity).
- the selected cases are referred to as similar examples or similar cases.
- FIG. 3 shows an example of selection of similar examples in a feature space in which a plurality of the features are assumed to be a feature 1 and a feature 2 .
- a point C in the center corresponds to current state data, and points 1, 2, 3, . . . , M correspond to similar examples.
- a whole of the selected similar examples is referred to as an ensemble, and “M” is referred to as a size of the ensemble.
- Each case in the ensemble is referred to as a member.
- the members have member rankings that indicate what place a member ranks in descending order of degree of similarity. In the example in FIG. 3 , since the points 1, 2, 3, . . . , M, in this order, are closer to the current state data, the cases 1 to M, in this order, have higher rankings.
- the similar example selector 12 acquires respective values of the objective variable in the selected similar examples, as prediction values of the objective variable corresponding to the current state data.
- a set of the prediction values is referred to as ensemble prediction data.
- Each prediction value in the ensemble prediction data is also referred to as an ensemble member.
- Each member value is a prediction value of the objective variable. Dispersion of such values can be regarded as uncertainty of the prediction, and the ensemble prediction data that is a whole of the prediction values can be regarded as probabilistic predictions.
- the ensemble prediction data is probabilistic predictions based on a set of a plurality of the prediction values.
- a prediction value of a member with a ranking “k” at a time “t” is denoted by “p k (t) ”.
- the ensemble prediction data can be denoted by ⁇ p 1 (t) , p 2 (t) , . . . , p M (t) ⁇ .
- the similar example selector 12 corresponds to a processor that selects a similar example from among a plurality of cases, based on current state data, and acquires a prediction value that is a value of the objective variable included in the selected similar example.
- the objective variable acquirer 14 acquires, from the objective variable observation device 202 , an observation value of the objective variable that is to be predicted, at each fixed time interval. In other words, the objective variable acquirer 14 collects observation values of the objective variable that is to be predicted, from the objective variable observation device 202 .
- the solar irradiance at the point A is the objective variable.
- the objective variable observation device 202 such as an instrument for measuring solar irradiance is installed at the point A, and the objective variable acquirer 14 collects values of the solar irradiance from the objective variable observation device 202 .
- the objective variable acquirer 14 may store the collected observation values of the objective variable as a history in an internal storage.
- the objective variable acquirer 14 provides the collected observation values of the objective variable to the prediction corrector 15 .
- the observation values of the objective variable may be provided in response to a request from the prediction corrector 15 .
- the objective variable is an objective variable of the same type as a feature stored in past state DB 11 .
- the observation values of the objective variable collected by the objective variable acquirer 14 and the state data acquired by the state acquirer 13 may be accumulated in the past state DB 11 .
- the similar example selector 12 may acquire current state data from the past state DB 11
- the prediction corrector 15 may acquire an observation value of the objective variable from the past state DB 11 .
- the prediction corrector 15 corrects the ensemble prediction data provided from the similar example selector 12 . More specifically, the prediction corrector 15 corrects the individual prediction values “p 1 (t) ”, “p 2 (t) ”, . . . , “p M (t) ” included in the ensemble prediction data, and provides corrected prediction values “q 1 (t) ”, “q 2 (t) ”, . . . , “q M (t) ” to the output device 16 .
- FIG. 4 is a block diagram of the prediction corrector 15 .
- the prediction corrector 15 includes an objective variable cumulative distribution function estimator 21 (first estimator) 21 and member correctors 1 to M.
- the objective variable cumulative distribution function estimator 21 estimates data (frequency data) related to frequency of the observation values of the objective variable, based on the observation values of the objective variable provided from the objective variable acquirer 14 . Specifically, a cumulative distribution function “F (t) ” for the observation values of the objective variable is estimated as the frequency data.
- ⁇ ⁇ ( x ) ⁇ 1 x ⁇ 0 0 x ⁇ 0 ( 3 )
- estimation of the cumulative distribution function is performed by using data (observation values of the objective variable) collected during a predetermined time period “L”, that is, between “t ⁇ L” and “t ⁇ 1” assuming that a current time is “t”.
- a length of “L” is, as an example, approximately three months. Such a time period corresponds to a length of one season in Japan.
- a cumulative distribution function is estimated as the frequency data, a probability density distribution, a histogram, or the like may also be estimated.
- FIG. 5 is a block diagram of a member corrector k.
- the member corrector k includes a prediction value cumulative distribution function estimator 31 (second estimator) and a frequency distribution corrector 32 (corrector).
- the prediction value cumulative distribution function estimator 31 collects the prediction values “p k (t) ” for an ordinal number “k” for a fixed time period, and generates frequency data on the prediction values for the ordinal number “k”. Specifically, a cumulative distribution function for the prediction values for the ordinal number “k” is estimated as the frequency data. An estimation method and a data collection period are the same as in the case of the objective variable cumulative distribution function. Although a cumulative distribution function is estimated as the frequency data in the present example, a probability density distribution, a histogram, or the like may also be estimated.
- the frequency distribution corrector 32 calculates the corrected prediction value “q k (t) ” for the ordinal number “k” from the prediction value “p k (t) ” for the ordinal number “k”, by using the objective variable cumulative distribution function “F (t) ” and the prediction value cumulative distribution function “G k (t) ” for the ordinal number “k”.
- the frequency distribution corrector 32 calculates a cumulative probability (frequency) corresponding to the prediction value “p k (t) ” (first prediction value), based on the prediction value cumulative distribution function “G k (t) ” (frequency data).
- the value “q k (t) ” corresponding to the calculated cumulative probability (frequency) is calculated.
- the first prediction value is corrected based on the calculated value.
- the calculated value “q k (t) ” itself is used for the corrected prediction value.
- FIG. 6 is a diagram for describing the correction performed by the frequency distribution corrector 32 .
- a horizontal axis shows values of the objective variable, and a vertical axis shows values of cumulative distribution function.
- Two graphs represent the objective variable cumulative distribution function “F (t) ” and the prediction value cumulative distribution function “G k (t) ”. If prediction values are correct, the two cumulative distribution functions are expected to match each other. However, the two graphs may possibly differ from each other in actuality, and such difference between the two graphs leads to a decrease in prediction accuracy.
- Correction can be performed similarly when a probability density distribution, a histogram, or the like is used for the frequency data on the prediction values and the frequency data on the observation values. For example, a frequency or a probability corresponding to the prediction value “p k (t) ” is identified from the frequency data on the prediction values, and a value of the objective variable corresponding to the identified frequency or probability is identified from the frequency data on the observation values. The identified value of the objective variable is used for the corrected prediction value “q k (t) ”.
- FIG. 7 is a flowchart of an example of prediction processing performed by the prediction apparatus 101 according to the first embodiment.
- “t” is set to zero (S 101 )
- the similar example selector 12 calculates the prediction values ⁇ p 1 (t) , p 2 (t) , . . . , p M (t) ⁇ before correction (S 102 ).
- it is required to accumulate observation values and prediction values of the objective variable during the time period “L”. It is determined whether or not t ⁇ L (S 103 ).
- the prediction corrector 15 outputs ⁇ p 1 (t) , p 2 (t) , . . . , p M (t) ⁇ as prediction values without correction (S 104 ), on a supposition that sufficient data is not collected.
- the frequency distribution corrector 32 corrects the prediction value “p k (t) ” by using “F (t) ” and “G k (t) ”, and obtains the corrected prediction value “q k (t) ” (S 114 ). One is added to “k” (S 115 ), and while “k” is not larger than “M” (No in S 116 ), the processing returns to step S 112 .
- a set of the corrected prediction values from the member correctors 1 to M are transmitted as ensemble prediction data to the output device 16 (S 105 ).
- the output device 16 performs output processing such as displaying the ensemble prediction data on a screen or transmitting the ensemble prediction data to another device.
- the objective variable acquirer 14 acquires an observation value “o (t) ”. Specifically, at a time “t+1”, the objective variable acquirer 14 acquires an observation value (S 106 , S 107 ). In the example of predicting the solar irradiance at the point A on a next day (24 hours later), the time “t+1” corresponds to 24 hours later, and solar irradiance observed by the objective variable observation device 202 at the time 24 hours later is acquired. The prediction processing is repeated until a termination condition is fulfilled (S 108 ). Examples of the termination condition include a case where “t” reaches a predetermined value, a case where an instruction about termination is inputted by an operator of the present apparatus, and the like.
- a set of prediction values that are values included in a plurality of similar examples are corrected based on the cumulative distribution function for observation values of the objective variable, and a set of the corrected prediction values are used for ensemble prediction data, whereby prediction performance can be enhanced.
- a distribution of the prediction values differs from an intended distribution of prediction values. What is desired to be acquired as a distribution (dispersion) of prediction values is a dispersion of prediction values from a point of the current state data (the point C in FIG. 3 ) in the feature space.
- each solar irradiance 24 hours later should vary, and what is desired to be known is how the solar irradiance vary.
- a location of each prediction value is apart from the point of the current state data in the feature space, a distribution of the prediction values acquired from the feature space is different from an intended distribution of prediction values.
- each prediction value is corrected by using the cumulative distribution function for the observation values of the objective variable, whereby the above-described problem can be solved, and prediction performance can be enhanced.
- the time period “L” is short, an error included in the estimated cumulative distribution functions may be great. In such a case, an error in the correction amount based on the cumulative distribution functions may be great, and may cause a decline in prediction performance.
- a second embodiment solves such a problem.
- FIG. 8 is a block diagram of a prediction corrector 15 in the prediction apparatus 101 , according to a second embodiment.
- a difference from the first embodiment shown in FIG. 4 is that an observation value “o (t) ” of the objective variable acquired by the objective variable acquirer 14 is also given to the member correctors 1 to M.
- the constraint coefficient calculator 33 calculates a cumulative probability corresponding to the prediction value “p k (t) ” (first prediction value), based on the prediction value cumulative distribution function “G k (t) ” (frequency data). Then, on the objective variable cumulative distribution function “F (t) ” (the frequency data on the observation values of the objective variable), the value “q k (t) ” corresponding to the calculated cumulative probability is calculated. Based on a difference between the prediction value “p k (t) ” and the value “q k (t) ”, a coefficient (constraint coefficient) is calculated. As an example, a performance evaluation index is calculated based on the difference, and the constraint coefficient is determined such that the performance evaluation index is optimized or quasi-optimized.
- the constraint corrector 34 obtains a corrected prediction value “r k (t) ”, by multiplying the difference between the prediction value “p k (t) ” and the value “q k (t) ” by the constraint coefficient, and adding a resultant value of the multiplication to the prediction value “p k (t) ”, as will be described later.
- the constraint coefficient calculator 33 calculates the constraint coefficient by using a method called cross-validation such that the predetermined performance evaluation index is optimized.
- the constraint coefficient calculator 33 acquires data ⁇ (o (t ⁇ L) , p k (t ⁇ L) ), . . . , (o (t ⁇ 1) , p k (t ⁇ 1) ) ⁇ that is formed by pairing observation values ⁇ o (t ⁇ 1) , . . . , o (t ⁇ 1) ⁇ and prediction values ⁇ p k (t ⁇ L) , . . . , p k (t ⁇ 1) ⁇ of the objective variable during the past time period “L” from the time point “t”, at which prediction is performed, such that an observation value and a prediction value corresponding to the same time make a pair.
- data is divided into two sets, namely data for learning and data for validation.
- a leave-one-out method which is relatively commonly used, is used here.
- the data for validation is only one pair of an observation value and a prediction value, and this one pair is assumed to be (o v , p v ).
- the remaining “L ⁇ 1” pairs are the data for learning, and the data for learning are assumed to be ⁇ (o l (1) , p l (1) ), . . . , (o l (L ⁇ 1) , p l (L ⁇ 1) ⁇ .
- the objective variable cumulative distribution function estimator 21 calculates an objective variable cumulative distribution function “F l ” from the observation values in the data for learning.
- the prediction value cumulative distribution function estimator 31 calculates a prediction value cumulative distribution function “G l ” from the prediction values in the data for learning.
- a corrected prediction value “q v ” is calculated from a prediction value “p v ”, through a method similar to the method used by the frequency distribution corrector 32 in the first embodiment.
- the leave-one-out method when there are L pairs of data, division into the data for learning and the data for validation can be made in L different combinations. Accordingly, L errors can be obtained.
- the performance evaluation index to be optimized is, for example, RMSE (Root Mean Squared Error).
- RMSE Root Mean Squared Error
- e i represents an error calculated based on an i-th combination for division into the data for learning and the data for validation.
- the RMSE depends on the assumed value “ ⁇ ” of the constraint coefficient. In other words, the RMSE is a function for “ ⁇ ”. Accordingly, “ ⁇ ” that minimizes or quasi-minimizes the RMSE can be selected by repeating calculation while variously changing the value of “ ⁇ ”. Quasi-minimization is, for example, to make the RMSE equal to or smaller than a threshold value.
- the value of the constraint coefficient calculated by the constraint coefficient calculator 33 in the member corrector k with respect to the time “t” is denoted by “ ⁇ k (t) ”.
- the constraint corrector 34 performs correction as described below, by using the constraint coefficient “ ⁇ k (t) ” (0 ⁇ k (t) ⁇ 1) calculated by the constraint coefficient calculator 33 .
- FIG. 10 is a diagram for describing the correction performed by the constraint corrector 34 .
- the prediction corrector 15 collects observation values of the objective variable and prediction values of the objective variable, and estimates the objective variable cumulative distribution function and the prediction value cumulative distribution function (see FIGS. 4 and 5 ).
- periodicity exists in values of the objective variable, depending on a subject that is predicted, and collection of observation values of the objective variable and prediction values of the objective variable can be omitted by utilizing such periodicity.
- obvious periodicities exist, namely daily periodicity and annual periodicity.
- FIG. 11 is a block diagram of a prediction apparatus 101 according to a third embodiment.
- a periodicity extractor 41 an objective variable cumulative distribution function estimator 42 , and a prediction value cumulative distribution function estimator 43 are added.
- the objective variable cumulative distribution function estimator 21 and the prediction value cumulative distribution function estimator 31 included in the prediction corrector 15 in the first embodiment are not required.
- the periodicity extractor 41 identifies a time period associated with a current time (a time of the subject data) as a time period with periodicity of the current time. For example, the periodicity extractor 41 obtains a time period with similarity to the current time, based on a periodicity given beforehand. For example, it is assumed that the current time is day “d” of year “Y” (“d” is a day of the year). It is assumed that a width of the time period (assumed to be h days) and the number of years over which data is traced back (assumed to be n years) are predetermined. In such a case, the periodicity extractor 41 extracts similar segments as follows. Here, an example is shown where annual periodicity is utilized.
- the periodicity extractor 41 provides information on the extracted similar segments (time periods) to the objective variable cumulative distribution function estimator 42 and the prediction value cumulative distribution function estimator 43 .
- the prediction value cumulative distribution function estimator 43 calculates a prediction value for each time included in the similar segments provided by the periodicity extractor 41 , individually for each member ranking as in the first embodiment, and obtains histories of the prediction values (histories of selection of members (cases) with the same ranking). Based on the histories of the prediction values, the prediction value cumulative distribution function estimator 43 estimates prediction value cumulative distribution functions.
- the prediction value cumulative distribution functions that are estimated for the member rankings, respectively, are provided to the frequency distribution correctors 32 of the corresponding member correctors k in the prediction corrector 15 .
- the periodicity extractor 41 analyzes the past state DB 11 and detects a periodicity. The periodicity extractor 41 extracts a similar segment by utilizing the detected periodicity.
- FIG. 12 is a block diagram of the periodicity extractor 41 .
- the periodicity extractor 41 includes a power spectrum calculator and a peak detector 46 .
- the power spectrum calculator 45 reads past values of the objective variable as time-series data from the past state DB 11 , and calculates a power spectrum based on the read time-series data. In other words, the power spectrum calculator 45 calculates a power spectrum, based on a history of observation values of the objective variable.
- the power spectrum represents absolute values of amplitude (spectrum component) of frequency components corresponding to the values that change like time series. If the power spectrum has a large value at some frequency “ ⁇ ”, such a fact means that a large number of components of that frequency are included in the objective variable.
- the peak detector 46 performs peak detection based on the power spectrum, identifies a frequency component “ ⁇ ” among frequency components included in the peak detected through the peak detection, and identifies a peak width “ ⁇ ”. Based on the identified frequency component to and peak width “ ⁇ ”, a similar segment (time period) is determined.
- the frequency component “ ⁇ ” is a frequency component selected from among the frequency components included in the peak. As an example, the frequency component “ ⁇ ” is a frequency component with the largest spectrum component among the frequency components included in the peak. As another example, a median value or the like of the frequency components included in the peak may be selected.
- the cumulative distribution functions are estimated based on periodicity of the objective variable by using the past state DB 11 , whereby collection of observation values of the objective variable and prediction values of the objective variable can be omitted. Accordingly, after the prediction apparatus starts operation, correction of the prediction values (see S 105 of the flowchart in FIG. 7 ) can be performed earlier.
- the frequency distribution corrector 32 of each member corrector corrects the prediction value “p k (t) ” to “q k (t) ”.
- the correction amount (p k (t) ⁇ q k (t) ) depends on a system (metric) of measuring a distance in the feature space. There are various metrics other than the Euclidean distance used in the first embodiment. By appropriately selecting a metric, there is a possibility that prediction performance can be enhanced.
- a plurality of metrics are preset, and correction of prediction values is performed for each metric.
- a summed value of correction amounts is calculated, and corrected prediction values based on a metric for which the smallest summed value is obtained are adopted.
- FIG. 13 is a block diagram of a prediction apparatus 101 in the fourth embodiment.
- a metric setter 51 and a prediction selector 53 set selector
- N pairs of similar example selectors 12 and prediction correctors 15 are included.
- a pair of a similar example selector 12 _ 1 and a prediction corrector 15 _ 1 to a pair of a similar example selector 12 _N and a prediction corrector 15 _N are included.
- correction amount totalizers 52 _ 1 to 52 _N are provided, respectively.
- the metric setter 51 sets metrics to be used by the similar example selectors 12 _ 1 to 12 _N (presented as metrics 1 to N, respectively).
- a weighted distance is used for a metric.
- a weight for each feature is inputted.
- w i is a weight for an i-th feature, and a feature with a larger value of “w i ” is deemed to be of greater importance in calculation.
- the various metrics 1 to N are configured by variously changing the value of “w i ”.
- the similar example selectors 12 _ 1 to 12 _N and the prediction correctors 15 _ 1 to 15 _N operate as in the first embodiment, except that metrics used by the similar example selectors 12 _ 1 to 12 _N are different. In other words, the similar example selectors 12 _ 1 to 12 _N select similar examples by using mutually different metrics.
- the prediction correctors 15 _ 1 to 15 _N correct prediction values that are values of the objective variable included in the similar examples selected by the similar example selectors 12 _ 1 to 12 _N.
- the correction amount totalizers 52 _ 1 to 52 _N sum (or total) the correction amounts ⁇ p k (t) for correction performed by the frequency distribution correctors 32 _ 1 to 32 _N in the prediction corrector 15 _ 1 to 15 _N. For example, assuming that a predetermined time period is “R” and a time at which prediction is performed is “t”, a sum “S” of absolute values of the correction amounts in a segment [t ⁇ R, t ⁇ 1] is calculated as follows:
- S is a scale that represents a magnitude of the correction.
- this “S” will be referred to as a correction amount summed value.
- a correction amount summed value corresponding to a j-th metric is denoted by “S j ”.
- absolute values of the correction amounts are used when a correction amount summed value is calculated.
- an amount that can be a scale of a magnitude of the correction amount for example, a square of the correction amount can also be used.
- the prediction selector 53 selects a metric (or a pair of a similar example selector and a prediction corrector) with which the smallest correction amount summed value is obtained.
- a metric or a pair of a similar example selector and a prediction corrector
- S N the correction amount summed values
- a number denoting the selected metric (or pair) is assumed to be “j min ”.
- the prediction selector 53 generates instructional data to instruct that ensemble prediction data from a j min -th prediction corrector 15 (prediction corrector 15 _ j min ) be selected, and provides the instructional data to the output device 16 .
- the output device 16 in accordance with the instructional data from the prediction selector 53 , outputs the ensemble prediction data from the j min -th prediction corrector 15 (prediction corrector 15 _ j min ) among outputs from the N prediction correctors 15 .
- FIG. 14 is a flowchart of an example of prediction processing performed the prediction apparatus 101 according to the fourth embodiment.
- a similar example selector j represents a similar example selector 12 _ j
- a prediction corrector j represents a prediction corrector 15 _ j
- a correction amount totalizer j represents a correction amount totalizer 52 _ j.
- the similar example selector j acquires a plurality of prediction values, based on similar cases (S 122 ), and the prediction corrector j corrects the plurality of prediction values (S 123 ).
- the correction amount totalizer j calculates a correction amount summed value S j (S 124 ). One is added to “j” (S 125 ), and steps S 122 to S 125 are repeated until “j” reaches “N” (S 126 ).
- the prediction selector 53 selects a number j min of a metric (or a pair of a similar example selector and a prediction corrector) with which the smallest correction amount summed value S j is obtained (S 127 ).
- the output device 16 outputs a set of corrected prediction values (ensemble prediction data) from the j min -th prediction corrector 15 (S 128 ).
- prediction values are acquired based on a plurality of metrics, the prediction values are corrected, and corrected prediction values with which the smallest summed value of correction amounts is obtained are selected, whereby prediction performance can be enhanced.
- one or more observation values of the subject system at a certain time point are used.
- a temperature, an atmospheric pressure, and the like at a point of interest correspond to a state of the subject system.
- an actual solar irradiance on the next day does not always well agree with solar irradiance in a selected case even if a past state that is similar to current meteorological variables is selected.
- analog ensemble In view of such circumstances, in a field of weather prediction, a technique called analog ensemble is known. This technique is based on a fact that numerical weather calculation with high prediction performance is available for weather prediction. According to a basic concept of the analog ensemble, if numerical weather calculation with high prediction performance is possible, more desirable similarity can be obtained by selecting similar meteorological variables to meteorological variables at a target time point of prediction, which are derived by calculation from current meteorological variables, than by comparison with the current meteorological variables.
- a fifth embodiment embodies a configuration that adopts a concept of using numerical calculation.
- FIG. 15 is a block diagram of a prediction apparatus 101 according to the fifth embodiment.
- a numerical calculator 61 In comparison with the first embodiment, a numerical calculator 61 , a feature selector 62 , and a past numerically calculated feature DB 63 are added.
- the numerical calculator 61 performs numerical calculation of a state of the subject system at a target time point of prediction from data (current state data) indicating a current state of the subject system, based on a numerical calculation model. A plurality of features are obtained through the numerical calculation.
- the numerical calculator 61 performs numerical calculation from a past state (features) of the subject system stored in the past state DB 11 , based on the numerical calculation model. A plurality of features are obtained through the numerical calculation.
- the feature selector 62 selects a feature (numerically calculated feature) to be used to acquire a similar example, among the plurality of features obtained through the numerical calculation from the past state DB 11 , and stores the selected numerically calculated feature in the past numerically calculated feature DB 63 .
- the numerically calculated feature may be a feature of the same type as the objective variable. For example, when prediction of the solar irradiance is taken as an example, the numerically calculated feature may be a value of the solar irradiance.
- a numerically calculated feature deemed to be useful for determination of similarity may be appropriately selected and added to the past numerically calculated feature DB 63 .
- the feature selector 62 selects a feature (numerically calculated feature) to be used to acquire a similar example, among the plurality of features obtained through the numerical calculation with respect to the current state data.
- the selected numerically calculated feature (current numerically calculated feature) is provided to the similar example selector 12 .
- the selected feature may be a feature of the same type as the feature stored in the past numerically calculated feature DB 63 .
- a value of the solar irradiance at the point A on a next day is selected as a numerically calculated feature.
- another numerically calculated feature such as a temperature, a humidity, or the like at the point A on the next day may be selected, depending on contents of the past numerically calculated feature DB 63 .
- the past numerically calculated feature DB 63 stores, in a set with a value of the objective variable, the one or more features (numerically calculated features) selected by the feature selector 62 among the plurality of features obtained through the numerical calculation from the past state DB 11 .
- FIG. 16 shows an example of the past numerically calculated feature DB 63 .
- Numerically calculated features 1 to N are stored in association with a time and a value of the objective variable.
- the similar example selector 12 operates as in the first embodiment, except that the current state data and the past state DB 11 in the first embodiment are replaced with the current numerically calculated feature and the past numerically calculated feature DB 63 , respectively.
- the prediction corrector 15 , the objective variable acquirer 14 , and the output device 16 also operate as in the first embodiment.
- a feature (numerically calculated feature) is calculated through numerical calculation, whereby prediction performance can be enhanced.
- FIG. 17 illustrates a hardware configuration of the prediction apparatus (information processing apparatus) 101 according to the present embodiment.
- the information processing apparatus 101 according to the present embodiment is configured with a computer device 300 .
- the computer device 300 includes a CPU 301 , an input interface 302 , a display device 303 , a communication device 304 , a main storage device 305 and an external storage device 306 , and these are connected to each other with a bus 307 .
- the CPU (Central Processing Unit) 301 executes a computer program (prediction program) which realizes the above-described respective functional configurations of the information processing apparatus 101 on the main storage device 305 .
- the computer program may not be a single program but a plurality of programs or a combination of scripts.
- the input interface 302 is a circuit for inputting an operation signal from the input device such as a keyboard, a mouse and a touch panel, to the information processing apparatus 101 .
- the input function of the information processing apparatus 101 can be constructed on the input interface 302 .
- the display device 303 displays data or information output from the information processing apparatus 101 . While the display device 303 is, for example, an LCD (Liquid Crystal Display), a CRT (Cathode-Ray Tube), and a PDP (Plasma Display Panel), the display device 303 is not limited to this.
- the data or the information output from the computer device 300 can be displayed by this display device 303 .
- the output device of the information processing apparatus 101 can be constructed on the display device 303 .
- the communication device 304 is a circuit for the information processing apparatus 101 to communicate with an external device in a wireless or wired manner. Information can be input from the external device via the communication device 304 . Information input from the external device can be stored in a DB.
- the main storage device 305 stores a program (prediction program) which realizes processing of the present embodiment, data required for execution of the program, data generated by execution of the program, and the like.
- the program is developed and executed on the main storage device 305 .
- the main storage device 305 is, for example, a RAM, a DRAM and an SRAM, the main storage device 305 is not limited to this.
- the storage in each embodiment may be constructed on the main storage device 305 .
- the external storage device 306 stores the above-described program, data required for execution of the program, data generated by execution of the program, and the like. These kinds of program and data are read out to the main storage device 305 upon processing of the present embodiment. While the external storage device 306 is, for example, a hard disk, an optical disk, a flash memory and a magnetic tape, the external storage device 306 is not limited to this. The storage in each embodiment may be constructed on the external storage device 306 .
- the above-described program may be installed in the computer device 300 in advance or may be stored in a storage medium such as a CD-ROM. Further, the program may be uploaded on the Internet.
- the computer device 300 may include one or a plurality of the processors 301 , the input interfaces 302 , the display devices 303 , the communication devices 304 and the main storage devices 305 , or peripheral equipment such as a printer and a scanner may be connected to the computer device 300 .
- the information processing apparatus 101 may be configured with a single computer device 300 or may be configured as a system including a plurality of computer devices 300 which are connected to each other.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Environmental & Geological Engineering (AREA)
- Evolutionary Computation (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Artificial Intelligence (AREA)
- Life Sciences & Earth Sciences (AREA)
- Medical Informatics (AREA)
- Health & Medical Sciences (AREA)
- Software Systems (AREA)
- Automation & Control Theory (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Atmospheric Sciences (AREA)
- Biodiversity & Conservation Biology (AREA)
- Ecology (AREA)
- Environmental Sciences (AREA)
- Bioinformatics & Cheminformatics (AREA)
- General Engineering & Computer Science (AREA)
- Evolutionary Biology (AREA)
- Bioinformatics & Computational Biology (AREA)
- Management, Administration, Business Operations System, And Electronic Commerce (AREA)
Abstract
Description
where θ is a function referred to as Heaviside step function, and is defined by a following expression:
[Expression 3]
F (t)(q k (t))=G k (t)(p k (t)) (4)
In such a case, “qk (t)−pk (t)” corresponds to a correction amount.
e=r v −o v (5)
δp k (t) =q k (t) −p k (t) (7)
r k (t) =p k (t)+αk (t) δp k (t) (8)
-
- [day “d−h” of year “Y−1” to day “d+h” of year “Y−1”]
- [day “d−h” of year “Y−2” to day “d+h” of year “Y−2”]
- [day “d−h” of year “Y−n” to day “d+h” of year “Y−n”]
[t−τ−Δτtot−T+Δτ]
[t−2τ−Δτtot−2τ+Δτ]
[t−nτ−Δτtot−nτ+Δτ]
Claims (16)
Applications Claiming Priority (2)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| JP2020-150029 | 2020-09-07 | ||
| JP2020150029A JP7332554B2 (en) | 2020-09-07 | 2020-09-07 | Information processing device, information processing method, and computer program |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| US20220076060A1 US20220076060A1 (en) | 2022-03-10 |
| US11868431B2 true US11868431B2 (en) | 2024-01-09 |
Family
ID=80470735
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| US17/197,377 Active 2041-07-23 US11868431B2 (en) | 2020-09-07 | 2021-03-10 | Information processing apparatus, information processing method, and non-transitory computer readable medium |
Country Status (2)
| Country | Link |
|---|---|
| US (1) | US11868431B2 (en) |
| JP (1) | JP7332554B2 (en) |
Families Citing this family (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP7457752B2 (en) | 2022-06-15 | 2024-03-28 | 株式会社安川電機 | Data analysis system, data analysis method, and program |
| JP7540617B1 (en) * | 2024-06-13 | 2024-08-27 | 富士電機株式会社 | Weather forecasting device, weather forecasting method and program |
Citations (6)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2009211693A (en) * | 2008-02-29 | 2009-09-17 | Fujitsu Ltd | Pattern identification device and pattern identification method |
| JP2015210222A (en) | 2014-04-28 | 2015-11-24 | 株式会社東芝 | Weather prediction correction apparatus and weather prediction correction method |
| US20160078112A1 (en) * | 2014-09-13 | 2016-03-17 | International Business Machines Corporation | Aggregation and Analytics for Application-Specific Optimization Based on Multiple Data Sources |
| JP2016045799A (en) | 2014-08-25 | 2016-04-04 | 富士電機株式会社 | Prediction model generation device, prediction model generation method and program |
| US20170075035A1 (en) * | 2015-09-11 | 2017-03-16 | Kabushiki Kaisha Toshiba | Probabilistic weather forecasting device, probabilistic weather forecasting method, and non-transitory computer readable medium |
| JP2018105893A (en) | 2018-04-09 | 2018-07-05 | 株式会社東芝 | Weather prediction correction device, weather prediction correction method, and program |
Family Cites Families (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JPH09233700A (en) * | 1996-02-28 | 1997-09-05 | Fuji Electric Co Ltd | Reliability evaluation method for daily maximum power demand forecast |
| JP6750494B2 (en) * | 2016-06-29 | 2020-09-02 | 富士通株式会社 | Power demand forecasting program, power demand forecasting apparatus, and power demand forecasting method |
-
2020
- 2020-09-07 JP JP2020150029A patent/JP7332554B2/en active Active
-
2021
- 2021-03-10 US US17/197,377 patent/US11868431B2/en active Active
Patent Citations (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2009211693A (en) * | 2008-02-29 | 2009-09-17 | Fujitsu Ltd | Pattern identification device and pattern identification method |
| JP5115493B2 (en) | 2008-02-29 | 2013-01-09 | 富士通株式会社 | Pattern identification device and pattern identification method |
| JP2015210222A (en) | 2014-04-28 | 2015-11-24 | 株式会社東芝 | Weather prediction correction apparatus and weather prediction correction method |
| JP2016045799A (en) | 2014-08-25 | 2016-04-04 | 富士電機株式会社 | Prediction model generation device, prediction model generation method and program |
| US20160078112A1 (en) * | 2014-09-13 | 2016-03-17 | International Business Machines Corporation | Aggregation and Analytics for Application-Specific Optimization Based on Multiple Data Sources |
| US20170075035A1 (en) * | 2015-09-11 | 2017-03-16 | Kabushiki Kaisha Toshiba | Probabilistic weather forecasting device, probabilistic weather forecasting method, and non-transitory computer readable medium |
| JP2017053804A (en) | 2015-09-11 | 2017-03-16 | 株式会社東芝 | Stochastic weather forecasting apparatus, stochastic weather forecasting method and program |
| JP2018105893A (en) | 2018-04-09 | 2018-07-05 | 株式会社東芝 | Weather prediction correction device, weather prediction correction method, and program |
Non-Patent Citations (4)
| Title |
|---|
| B. Efron et al., "Chapter 4: The empirical distribution function and the plug-in principle," in An Introduction to the Bootstrap, Chapman & Hall/CRC, pp. 31-38 (1993). |
| H. Hersbach, "Decomposition of the Continuous Ranked Probability Score for Ensemble Prediction Systems," Am. Meteorological Soc., Weather and Forecasting, vol. 15, No. 5, pp. 559-570 (2000). |
| L.D. Monache et al., "Probabilistic Weather Prediction with an Analog Ensemble," Am. Meteorological Soc., Monthly Weather Review, vol. 141, No. 10, pp. 3498-3516 (2013). |
| Monache et al., "Probabilistic Weather Prediction with an Analog Ensemble", Am. Meterologicial Soc., Monthly Weather Review, vol. 141, No. 10, pp. 3498-3516 (2013) (hereinafter "Monache"). (Year: 2013). * |
Also Published As
| Publication number | Publication date |
|---|---|
| US20220076060A1 (en) | 2022-03-10 |
| JP2022044416A (en) | 2022-03-17 |
| JP7332554B2 (en) | 2023-08-23 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US11169911B2 (en) | Method and apparatus for performing a fitting calculation on test data and generating data fluctuation values | |
| Ezzat et al. | Spatio-temporal short-term wind forecast: A calibrated regime-switching method | |
| EP3757900A1 (en) | Time series prediction with confidence estimates using sparse recurrent mixture density networks | |
| US20170243121A1 (en) | Traffic forecasting system, traffic forecasting method and traffic model establishing method | |
| US20180038994A1 (en) | Techniques to Improve Global Weather Forecasting Using Model Blending and Historical GPS-RO Dataset | |
| CN110705772A (en) | Regional power grid wind power generation power prediction optimization method and device | |
| CN105868853A (en) | Method for predicting short-term wind power combination probability | |
| US11868431B2 (en) | Information processing apparatus, information processing method, and non-transitory computer readable medium | |
| CN115186907A (en) | Method, system, equipment and medium for predicting long-term power generation amount in wind power plant | |
| JP2020134300A (en) | Prediction method, prediction program and information processing apparatus | |
| Smith et al. | Forecasting flash floods using data-based mechanistic models and NORA radar rainfall forecasts | |
| CN117371303A (en) | A method for predicting effective wave height under ocean waves | |
| Zhang et al. | Improving the CPC’s ENSO forecasts using Bayesian model averaging | |
| CN117521907A (en) | Photovoltaic power generation interval prediction method considering photovoltaic output and meteorological factors | |
| US8732528B1 (en) | Measuring test effects using adjusted outlier data | |
| Killick et al. | Automatic locally stationary time series forecasting with application to predicting UK gross value added time series | |
| CN115908051A (en) | A method for determining the energy storage capacity of a power system | |
| KR20180129496A (en) | Method for predicting electric power demand and apparatus for the same | |
| WO2023134188A1 (en) | Index determination method and apparatus, and electronic device and computer-readable medium | |
| Hu et al. | On methods for assessment of the value of observations in convection‐permitting data assimilation and numerical weather forecasting | |
| CN120559566A (en) | A remote monitoring and control system for electric energy meters | |
| CN115983121B (en) | Coastal NWP data grid refinement method and device based on double mapping | |
| CN118825998A (en) | Photovoltaic power range prediction method and device | |
| CN113723006B (en) | LS-SVM (least squares-support vector machine) -based single-station earth change magnetic field modeling prediction method and system | |
| CN117154721A (en) | A method and equipment for regional new energy power prediction based on agent scaling mechanism |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| FEPP | Fee payment procedure |
Free format text: ENTITY STATUS SET TO UNDISCOUNTED (ORIGINAL EVENT CODE: BIG.); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY |
|
| AS | Assignment |
Owner name: KABUSHIKI KAISHA TOSHIBA, JAPAN Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KAKIMOTO, MITSURU;SHIN, HIROMASA;SHIGA, YOSHIAKI;AND OTHERS;SIGNING DATES FROM 20210308 TO 20210325;REEL/FRAME:056387/0677 |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: DOCKETED NEW CASE - READY FOR EXAMINATION |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NON FINAL ACTION MAILED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: RESPONSE TO NON-FINAL OFFICE ACTION ENTERED AND FORWARDED TO EXAMINER |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: NOTICE OF ALLOWANCE MAILED -- APPLICATION RECEIVED IN OFFICE OF PUBLICATIONS |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT RECEIVED |
|
| STPP | Information on status: patent application and granting procedure in general |
Free format text: PUBLICATIONS -- ISSUE FEE PAYMENT VERIFIED |
|
| STCF | Information on status: patent grant |
Free format text: PATENTED CASE |
|
| CC | Certificate of correction |