KR20210156654A

KR20210156654A - Stacking Ensemble Type Short-term Power Demand Prediction Method and Apparatus

Info

Publication number: KR20210156654A
Application number: KR1020200074505A
Authority: KR
Inventors: 황인준; 문지훈; 유제혁; 정승원
Original assignee: 한국전력공사; 고려대학교 산학협력단
Priority date: 2020-06-18
Filing date: 2020-06-18
Publication date: 2021-12-27

Abstract

According to the present invention, a stacking ensemble type short-term power demand predicting device comprises: a data collection unit for collecting date/time information, weather information, and power usage data of a predetermined period; a data pre-processing unit for extracting features of the collected data and performing pre-processing including maximum or minimum normalization; and a predictive modeling unit that applies preprocessed input data to unit prediction models having a different number of layers, which is two or more, based on a deep neural network and calculates a final prediction value from prediction values derived from each of the unit prediction models.

Description

Stacking Ensemble Type Short-term Power Demand Prediction Method and Apparatus

본 발명은 스태킹 앙상블 기법을 적용한 단기 전력수요 예측 방법 및 장치에 관한 것으로, 비교적 낮은 사양의 하드웨어를 이용하여 하루 이하의 단기 또는 초단기의 전력수요를 정확하게 예측하는 단기 전력수요 예측 방법 및 장치에 관한 것이다. The present invention relates to a short-term power demand prediction method and apparatus to which a stacking ensemble technique is applied, and to a short-term power demand prediction method and apparatus for accurately predicting short-term or ultra-short-term power demand of less than a day by using relatively low-spec hardware. .

전력수요 예측(Electric Load Forecasting)은 전력 계통의 효율적 운영을 위해 사전에 필요한 전력을 확보하는 근거이며, 전력 시스템 장비의 신뢰성을 보장하고 능동적인 전력예비율을 유지함으로써 정전 및 과부하로 인한 손실을 대비하기 위해 사용된다.Electric Load Forecasting is the basis for securing necessary power in advance for efficient operation of the power system. used for

전력수요 예측은 예측 기간에 따라 크게 단기, 중기, 장기 예측으로 종류가 나누어진다. 그중 단기 전력수요 예측(Short-Term Load Forecasting, STLF)은 일간 전력수요 예측, 당일 최대전력수요 예측, 하루 1시간 이하의 단위를 예측하는 초단기 전력수요 예측을 포함한다.Electricity demand forecasting is largely divided into short-term, medium-term, and long-term forecasts according to the forecast period. Among them, Short-Term Load Forecasting (STLF) includes daily power demand forecasting, daily peak power demand forecasting, and ultra-short-term power demand forecasting that predicts units of less than an hour per day.

스마트 그리드는 CCHP(Combined Cooling, Heat, and Power), ESS(Energy Storage System), 신재생 에너지 등과 관련된 단기 전력수요 예측을 기반으로 운영계획을 수립하며, 수요와 공급에 대한 양방향 상호작용을 통해 에너지 효율성을 최적화하는 데 목적을 갖는 스마트 그리드 시스템을 구축하려면 정확한 단기 전력수요 예측이 필수적으로 요구된다.The smart grid establishes an operation plan based on short-term power demand forecasts related to CCHP (Combined Cooling, Heat, and Power), ESS (Energy Storage System), and new and renewable energy. Accurate short-term electricity demand forecasting is essential to build a smart grid system aimed at optimizing efficiency.

기존의 전력수요 예측 방식들은 통계적 접근 방법, 기계학습(Machine Learning), 심층학습(Deep Learning) 알고리즘들을 활용하여 예측 모델들을 구성하였으며, 최근 심층학습은 비선형적인 외부 요인들을 효과적으로 학습할 수 있어, 통계적 접근 방법 및 기계학습보다 우수한 예측 성능을 보인다.Existing power demand forecasting methods used statistical approaches, machine learning, and deep learning algorithms to construct predictive models, and recent deep learning can effectively learn nonlinear external factors, It shows better prediction performance than approaches and machine learning.

하지만 심층학습 알고리즘들은 모델의 성능을 결정하는 여러 초매개변수(Hyper-parameter)를 고려해야 하므로 많은 시간을 필요로 하며, 실시간으로 모델을 재학습하기 위해서는 고성능의 컴퓨터 사양을 요구하는 단점을 가지고 있다. However, deep learning algorithms require a lot of time because they need to consider several hyper-parameters that determine the performance of the model, and have disadvantages in that they require high-performance computer specifications to retrain the model in real time.

또한, 기존의 예측 방식은 점 예측으로 현재 시점에서 하루 뒤의 시점을 예측하는 방식을 수행하였으나, 이는 불확실성을 대비하지 못하는 문제점을 가지고 있으며, 15분 단위의 초단기 전력수요 예측에 관한 연구 사례도 미미하다. In addition, the existing prediction method performed a method of predicting a point one day later from the present time using point prediction, but this has a problem in that it cannot prepare for uncertainty, and there are few research cases on ultra-short-term power demand prediction in units of 15 minutes. do.

대한민국 공개공보 10-2013-0035577호Republic of Korea Publication No. 10-2013-0035577

Lab 11 - PCR and PLS Regression in Python, March 23, 2016Lab 11 - PCR and PLS Regression in Python, March 23, 2016

본 발명은 계산 비용이 낮으면서도 초단기 전력수요를 정확하게 예측할 수 있는 단기 전력수요 예측 장치 및 방법을 제공하고자 한다.An object of the present invention is to provide a short-term power demand prediction apparatus and method capable of accurately predicting an ultra-short-term power demand while having a low calculation cost.

본 발명의 일 측면에 따른 스태킹 앙상블 타입 단기 전력수요 예측 장치는, 소정 기간의 날짜/시간 정보, 날씨 정보, 전력 사용량 데이터를 수집하는 데이터 수집부; 수집된 데이터들을 특징 추출하고 및 최대 최소 정규화를 포함하는 전처리를 수행하는 데이터 전처리부; 및 상기 전처리된 입력 데이터들을 심층 신경망 기반으로 2개 이상의 서로 다른 층수를 가지는 단위 예측 모델들에 적용하고, 상기 각 단위 예측 모델들이 도출한 예측값들로부터 최종 예측값을 산출하는 예측 모델링부를 포함할 수 있다.A stacking ensemble type short-term power demand prediction apparatus according to an aspect of the present invention includes: a data collecting unit for collecting date/time information, weather information, and power usage data of a predetermined period; a data pre-processing unit performing pre-processing including feature extraction of the collected data and maximum/minimum normalization; and a predictive modeling unit that applies the pre-processed input data to unit prediction models having two or more different number of layers based on a deep neural network, and calculates a final prediction value from the prediction values derived by each of the unit prediction models. .

여기서, 상기 예측 모델링부는, 최소 은닉 계층을 가지는 기본 단위 예측모델에 대한 학습을 수행하는 기본 단위 예측모델 학습부; 학습이 완료된 기본 단위 예측모델로부터, 은닉 계층이 증가된 하나 이상의 부가 단위 예측모델들을 도출하여, 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들로 구성된 다중 예측모델을 생성하는 다중 예측모델 구성부; 및 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들의 각 개별 예측값들을 스태킹 앙상블 처리하여 최종 예측값을 산출하는 앙상블 예측부를 포함할 수 있다.Here, the predictive modeling unit may include: a basic unit prediction model learning unit configured to learn a basic unit prediction model having a minimum hidden layer; a multi-prediction model constructing unit that derives one or more additional unit prediction models with an increased hidden layer from the basic unit prediction model that has been trained, and generates a multi-prediction model composed of the basic unit prediction model and the additional unit prediction models; and an ensemble prediction unit that calculates a final prediction value by performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models.

여기서, 상기 기본 단위 예측모델 및 상기 부가 단위 예측모델들은 심층 신경망(Deep Neural Network) 모델일 수 있다.Here, the basic unit prediction model and the additional unit prediction models may be a deep neural network model.

여기서, 상기 앙상블 예측부는, 슬라이딩 윈도우 기반으로, 상기 부가 단위 예측모델들의 각 개별 예측값들에 대하여, 주성분(Principal Component)들을 추출한 후 이 주성분들을 이용해서 회귀 모델을 만드는 주성분 회귀분석을 수행할 수 있다.Here, the ensemble prediction unit may perform principal component regression analysis, which extracts principal components for each individual prediction values of the additional unit prediction models, and then creates a regression model using the principal components based on a sliding window. .

여기서, 상기 앙상블 예측부는, 상기 기본 단위 예측모델 및 상기 부가 단위 예측모델들에 대한 가중치들이 전력수요 예측 운영시에 계속 학습된 값으로 보정되도록 슬라이딩 윈도우를 적용하여 스태킹 앙상블 처리할 수 있다.Here, the ensemble prediction unit may perform stacking ensemble processing by applying a sliding window so that the weights for the basic unit prediction model and the additional unit prediction models are corrected to the values learned continuously during the power demand prediction operation.

여기서, 상기 데이터 전처리부는, 1차원의 시간 정보를 2차원 회전 좌표계로 증강하되, 요일과 공휴일의 정보는, 공휴일에 해당하는 날에는 1, 그렇지 않을 때는 0으로 표기할 수 있다.Here, the data preprocessor may augment the one-dimensional time information into a two-dimensional rotational coordinate system, but the day and holiday information may be expressed as 1 on a day corresponding to a public holiday, and 0 otherwise.

여기서, 상기 데이터 전처리부는, 예측 시점의 전력 사용량과 예측 시점의 과거 시점 전력 사용량 간의 연관성을 파악하기 위해 피어슨 상관계수를 적용할 수 있다.Here, the data preprocessor may apply a Pearson correlation coefficient to determine a correlation between the power usage at the prediction time and the power usage at the past time at the prediction time.

여기서, 수집된 데이터 또는 전처리된 데이터와 학습 결과값들이 저장되는 저장부를 더 포함할 수 있다.Here, it may further include a storage unit in which the collected data or pre-processed data and learning result values are stored.

여기서, 상기 기본 단위 예측모델 및 부가 단위 예측모델들에 대한 가중치들은, 대상 건물에 대한 전력수요 예측 운영시에 계속 학습된 값으로 보정될 수 있다.Here, the weights for the basic unit prediction model and the additional unit prediction models may be corrected to values continuously learned during the operation of predicting power demand for the target building.

본 발명의 다른 측면에 따른 스태킹 앙상블 타입 단기 전력수요 예측 방법은, 소정 기간의 기상 정보 및 전력 사용량 데이터를 수집하는 단계; 수집된 데이터들을 특징 추출하고 및 최대 최소 정규화를 포함하는 전처리를 수행하는 단계; 최소 은닉 계층을 가지는 기본 단위 예측모델에 대한 학습을 수행하는 단계; 학습이 완료된 기본 단위 예측모델로부터, 은닉 계층이 증가된 하나 이상의 부가 단위 예측모델들을 도출하여, 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들로 구성된 다중 예측모델을 구성하는 단계; 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들의 각 개별 예측값들을 스태킹 앙상블 처리하여 최종 예측값을 산출하는 단계를 포함할 수 있다.A stacking ensemble type short-term power demand forecasting method according to another aspect of the present invention comprises the steps of: collecting weather information and power usage data for a predetermined period; performing preprocessing including feature extraction of the collected data and maximum minimum normalization; performing learning on a basic unit prediction model having a minimum hidden layer; deriving one or more additional unit prediction models with an increased hidden layer from the basic unit prediction model for which learning is completed, and constructing a multi-prediction model composed of the basic unit prediction model and the additional unit prediction models; The method may include calculating a final predicted value by performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models.

여기서, 상기 최종 예측값을 산출하는 단계에서는, 슬라이딩 윈도우 기반으로, 상기 부가 단위 예측모델들의 각 개별 예측값들에 대하여, 주성분(Principal Component)들을 추출한 후 이 주성분들을 이용해서 회귀 모델을 만드는 주성분 회귀분석을 수행할 수 있다.Here, in the step of calculating the final predicted value, principal component regression analysis is performed to create a regression model using the principal components after extracting principal components from each of the individual prediction values of the additional unit prediction models based on a sliding window. can be done

여기서, 상기 최종 예측값을 산출하는 단계에서는, 상기 기본 단위 예측모델 및 상기 부가 단위 예측모델들에 대한 가중치들이 전력수요 예측 운영시에 계속 학습된 값으로 보정되도록 슬라이딩 윈도우를 적용할 수 있다.Here, in the step of calculating the final predicted value, a sliding window may be applied so that the weights for the basic unit prediction model and the additional unit prediction models are corrected to the values learned continuously during the power demand prediction operation.

여기서, 상기 전처리를 수행하는 단계에서는, 1차원의 시간 정보를 2차원 회전 좌표계로 증강하되, 일요일과 공휴일의 정보는, 공휴일에 해당하는 날에는 1, 그렇지 않을 때는 0으로 표기할 수 있다.Here, in the step of performing the pre-processing, one-dimensional time information is augmented with a two-dimensional rotational coordinate system, but Sunday and public holidays information may be expressed as 1 on a day corresponding to a public holiday, and 0 otherwise.

여기서, 상기 전처리를 수행하는 단계에서는, 예측 시점의 전력 사용량과 예측 시점의 과거 시점 전력 사용량 간의 연관성을 파악하기 위해 피어슨 상관계수를 적용할 수 있다.Here, in the step of performing the preprocessing, a Pearson correlation coefficient may be applied to determine a correlation between the power usage at the prediction time and the power usage at the past time at the prediction time.

상술한 구성의 본 발명의 사상에 따른 단기 전력수요 예측 장치 및 방법을 실시하면, 계산 비용이 낮으면서도 초단기 전력수요를 정확하게 예측할 수 있는 이점이 있다.When the short-term power demand prediction apparatus and method according to the spirit of the present invention having the above configuration are implemented, there is an advantage in that the calculation cost is low and the ultra-short-term power demand can be accurately predicted.

본 발명의 단기 전력수요 예측 장치 및 방법은, 고성능의 컴퓨터 사양을 요구하지 않는 Python의 scikit-learn 환경에서 심층 신경망 모델을 구성할 수 있는 이점이 있다.The short-term power demand prediction apparatus and method of the present invention have the advantage of being able to construct a deep neural network model in a scikit-learn environment of Python that does not require high-performance computer specifications.

본 발명의 단기 전력수요 예측 장치 및 방법은, 단일 심층 신경망 모델뿐만 아니라 다양한 기계학습 알고리즘보다 학습 시간과 평가 시간이 매우 적게 소요되고, 더욱 우수한 예측 성능을 보인다는 이점이 있다.The short-term power demand prediction apparatus and method of the present invention has the advantage that it takes very little learning time and evaluation time and shows better prediction performance than a single deep neural network model as well as various machine learning algorithms.

도 1은 본 발명의 사상에 따른 단기 전력수요 예측 장치의 일 실시예를 도시한 블록도.
도 2는 도 1의 단기 전력수요 예측 장치에서 예측 모델을 위한 데이터 수집부 및 입력 변수 구성부를 도시한 블록도.
도 3은 도 1의 단기 전력수요 예측 장치에서 스태킹 앙상블 타입의 다중 전력수요 예측 방법을 수행하는 예측 모델 구성부를 도시한 블록도.
도 4는 본 발명의 사상에 따른 단기 전력수요 예측 방법의 일 실시예를 도시한 흐름도.1 is a block diagram illustrating an embodiment of an apparatus for predicting short-term power demand according to the spirit of the present invention.
FIG. 2 is a block diagram illustrating a data collection unit and an input variable configuration unit for a prediction model in the short-term power demand prediction apparatus of FIG. 1 .
3 is a block diagram illustrating a prediction model constructing unit for performing a stacking ensemble type multiple power demand prediction method in the short-term power demand prediction apparatus of FIG. 1 .
4 is a flowchart illustrating an embodiment of a method for predicting short-term power demand according to the spirit of the present invention.

본 발명을 설명함에 있어서 제 1, 제 2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 구성요소들은 용어들에 의해 한정되지 않을 수 있다. 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제 1 구성요소는 제 2 구성요소로 명명될 수 있고, 유사하게 제 2 구성요소도 제 1 구성요소로 명명될 수 있다. In describing the present invention, terms such as first, second, etc. may be used to describe various components, but the components may not be limited by the terms. The terms are only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may also be referred to as a first component.

어떤 구성요소가 다른 구성요소에 연결되어 있다거나 접속되어 있다고 언급되는 경우는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해될 수 있다.When a component is referred to as being connected or connected to another component, it may be directly connected or connected to the other component, but it can be understood that other components may exist in between. .

본 명세서에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함할 수 있다. The terminology used herein is used only to describe specific embodiments, and is not intended to limit the present invention. The singular expression may include the plural expression unless the context clearly dictates otherwise.

본 명세서에서, 포함하다 또는 구비하다 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것으로서, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해될 수 있다. In this specification, the terms include or include are intended to designate that a feature, number, step, operation, component, part, or combination thereof described in the specification exists, and includes one or more other features or numbers, It may be understood that the existence or addition of steps, operations, components, parts, or combinations thereof is not precluded in advance.

또한, 도면에서의 요소들의 형상 및 크기 등은 보다 명확한 설명을 위해 과장될 수 있다.In addition, shapes and sizes of elements in the drawings may be exaggerated for clearer description.

본 발명은 지역/용도와 관계없이 어떠한 건물에서도 적용 가능한 전력수요 예측을 위해, 어디에서도 수집 가능한, 즉, 수집이 용이한 날짜/시간 정보, 날씨 정보, 과거 전력 사용량만을 고려하여 예측 모델의 입력 변수를 구성한다. The present invention provides a predictive model input variable in consideration of only date/time information, weather information, and past power usage that can be collected anywhere, that is, easy to collect, for power demand prediction applicable to any building regardless of region/use. make up

다음, 수집된 입력 변수에 대한 전처리를 수행함에 있어서, 데이터 보간 및 연속 시간 정보를 회전 좌표계로 변환을 수행한다.Next, in performing preprocessing on the collected input variables, data interpolation and transformation of continuous time information into a rotational coordinate system are performed.

다음으로, 심층학습 모델 중 하나인 심층 신경망(Deep Neural Network)을 기반으로 은닉층 개수에 따른 여러 예측 모델들을 구성한 다음, 예측 모델에서 도출한 예측값을 이용하여 슬라이딩 윈도(Sliding Window) 방식을 통한 주성분 회귀분석 기반의 스태킹 앙상블 모델을 구성한다. Next, based on the deep neural network, which is one of the deep learning models, several prediction models are constructed according to the number of hidden layers, and then principal component regression through the sliding window method using the prediction values derived from the prediction model. Construct an analysis-based stacking ensemble model.

개발한 스태킹 앙상블 모델은 최근 전력수요 패턴을 효과적으로 반영할 수 있으므로, 기존에 제안된 다양한 예측 모델보다 정교한 예측 결과를 도출한다. 또한, 이는 다중 초단기 전력수요 예측(15분 뒤부터 하루 뒤까지 총 96 시점)에서도 정확한 전력수요 예측을 수행할 수 있으므로, 향후 스마트 그리드 운영에서 불확실성을 효과적으로 대비할 수 있는 근거를 제공한다. Since the developed stacking ensemble model can effectively reflect the recent power demand pattern, it derives more sophisticated forecasting results than various previously proposed forecasting models. In addition, this provides a basis for effectively preparing for uncertainties in smart grid operation in the future, as it can perform accurate power demand forecasting even in multiple ultra-short-term power demand forecasts (a total of 96 time points from 15 minutes later to one day later).

도 1은 본 발명의 사상에 따른 단기 전력수요 예측 장치의 일 실시예를 도시한 블록도이다.1 is a block diagram illustrating an embodiment of an apparatus for predicting short-term power demand according to the spirit of the present invention.

도시한 본 발명의 일 실시예에 따른 단기 전력수요 예측 장치(100)는, 소정 기간의 날짜/시간 정보, 날씨 정보, 전력 사용량 데이터를 수집하는 데이터 수집부(110); 입력 데이터들을 특징 추출 및 최대 최소 정규화를 수행하는 데이터 전처리부(120); 및 상기 전처리된 입력 데이터들을 심층 신경망 기반으로 2개 이상의 단위 예측 모델들에 적용하고, 상기 각 단위 예측 모델들이 도출한 예측값들을 스태킹 앙상블 처리하여 최종 예측값을 산출하는 예측 모델 구성부(160)를 포함할 수 있다.The apparatus 100 for predicting short-term power demand according to an embodiment of the present invention as shown includes a data collection unit 110 for collecting date/time information, weather information, and power usage data of a predetermined period; a data pre-processing unit 120 performing feature extraction and maximum/minimum normalization on input data; and a prediction model constructing unit 160 that applies the preprocessed input data to two or more unit prediction models based on a deep neural network, and calculates a final prediction value by stacking ensemble processing the prediction values derived from each of the unit prediction models. can do.

도시한 상기 단기 전력수요 예측 장치(100)는 크게 상기 데이터 수집부(110)와 데이터 전처리부(120)를 포함하는 입력 변수 구성부와; 상기 예측 모델 구성부로 2부분으로 구분할 수 있다. The illustrated short-term power demand prediction apparatus 100 includes an input variable configuration unit including the data collection unit 110 and the data pre-processing unit 120 ; The predictive model component can be divided into two parts.

도 2는 도 1의 단기 전력수요 예측 장치에서 예측 모델을 위한 데이터 수집부(110) 및 입력 변수 구성부를 도시한 블록도이다.FIG. 2 is a block diagram illustrating a data collection unit 110 and an input variable configuration unit for a prediction model in the short-term power demand prediction apparatus of FIG. 1 .

도 3은 도 1의 단기 전력수요 예측 장치에서 예측 모델 구성부(160)를 스태킹 앙상블 타입의 다중 전력수요 예측 방법을 수행하는 세부 구성들로 도시한 블록도이다.FIG. 3 is a block diagram illustrating detailed configurations of the prediction model configuration unit 160 for performing the stacking ensemble type multiple power demand prediction method in the short-term power demand prediction apparatus of FIG. 1 .

도 2 및 도 3에서는 도 1의 블록도 대비 저장부(140)를 더 표현하고 있음을 알 수 있다.It can be seen that the storage unit 140 is further expressed in comparison with the block diagram of FIG. 1 in FIGS. 2 and 3 .

도 2에 도시한 입력 변수 구성부, 구체적으로 데이터 수집부(110)에서는 소정 기간의 건물 전력 사용량, 기상 정보 및 이에 속하는 날짜/시간 데이터를 수집한다. The input variable configuration unit shown in FIG. 2 , specifically, the data collection unit 110 collects building power usage, weather information, and date/time data pertaining to a predetermined period of time.

전력수요 예측 모델을 구성하기 위해, 먼저 시간 정보(수집된 전력 사용량에 해당하는 타임 스탬프), 날씨 정보, 과거 전력 사용량으로 총 3가지의 특성을 갖는 데이터를 수집할 것을 제안한다. 이 데이터만 고려한 이유는 실제 스마트 그리드, 마이크로그리드가 설치된 환경에서 쉽게 수집할 수 있는 정보이기 때문이다. In order to construct the power demand prediction model, it is proposed to first collect data having three characteristics: time information (time stamp corresponding to the collected power usage), weather information, and past power usage. The reason only this data is considered is that it is information that can be easily collected in the environment where the actual smart grid and microgrid are installed.

상기 데이터 전처리부(120)가 수행하는 전력 사용량 정보 및 날짜/시간 데이터에 대한 데이터 전처리 과정(126)을 예시하면 다음과 같다.The data preprocessing process 126 for power usage information and date/time data performed by the data preprocessor 120 is exemplified as follows.

전력수요 예측은 예측 모델이 과거 정보를 학습하고 예측 시점에 대한 미래 정보를 입력하면, 그 정보에 맞추어 예측값을 도출하는 과정으로 되어 있으므로, 실제 신뢰성이 높은 미래 정보를 아는 것이 매우 중요하다. Power demand forecasting is a process of deriving a forecast value according to the information when the forecast model learns past information and inputs future information about the forecast time, so it is very important to know future information with high actual reliability.

시간 정보는 월, 일, 시, 분, 요일, 공휴일로 구성되어 있으며, 이는 미래 시점에서도 충분히 파악할 수 있는 정보이다. 월, 일, 시, 분은 주기성을 갖으나, 수집된 값을 그대로 적용하면 주기성을 반영할 수 없는 문제가 발생한다. 예컨대 11시 45분과 다음 날의 12시(자정)는 인접한 시간이지만, 수치를 그대로 표기하면 95라는 차이가 발생하게 된다. Time information consists of month, day, hour, minute, day of the week, and public holidays, which is information that can be fully understood even in the future. Month, day, hour, and minute have periodicity, but if the collected values are applied as they are, there is a problem that periodicity cannot be reflected. For example, 11:45 and 12:00 (midnight) of the next day are adjacent times, but if the numerical value is expressed as it is, a difference of 95 will occur.

이러한 문제를 해결하고자, 1차원의 시간 정보를 2차원 회전 좌표계로 증강하여 공간적인 특성을 나타내었다. 이를 통해, 1차원 데이터는 시계의 특징을 가질 수 있는 데이터로 변환하게 된다. 즉, 날짜/시간 데이터에서 하기 수학식 (1)~(8)을 적용하여 월, 일, 시, 분 데이터의 주기성을 반영한다. 이를 통해 오후 11시와 다음 날 자정의 간격인 23 대신에 근접한 숫자로 반영하여 입력 변수를 구성하므로, 예측 모델은 더욱 효과적으로 날짜/시간 정보를 학습할 수 있다.To solve this problem, spatial characteristics were expressed by augmenting one-dimensional temporal information with a two-dimensional rotational coordinate system. Through this, the one-dimensional data is converted into data having the characteristics of a watch. That is, the periodicity of month, day, hour, and minute data is reflected by applying Equations (1) to (8) below in date/time data. Through this, the predictive model can learn date/time information more effectively because input variables are constructed by reflecting close numbers instead of 23, which is the interval between 11 pm and midnight the next day.

상기 수학식 3 및 4에서

은 해당 월의 마지막 일을 나타낸다. 예를 들어

은 31이며,

은 30이다. In

Equations

3 and 4 above

represents the last day of the month. For example

is 31,

is 30.

요일과 공휴일의 정보는 명목 척도의 특성을 가지므로, 해당하는 날에는 1, 그렇지 않을 때는 0으로 표기하여 데이터를 표기할 수 있다. 예를 들어, 예측 시점의 날짜가 월요일과 평일일 때, 월요일에 해당하는 변수의 값은 1이며, 나머지 변수인 화요일, 수요일, 목요일, 금요일, 토요일, 일요일, 공휴일의 값은 0으로 표기한다. Since the information on the day of the week and public holidays has the characteristic of a nominal scale, data can be expressed by indicating 1 on the corresponding day and 0 otherwise. For example, when the date of the prediction time is Monday and weekdays, the value of the variable corresponding to Monday is 1, and the values of the remaining variables, Tuesday, Wednesday, Thursday, Friday, Saturday, Sunday, and holidays, are expressed as 0.

일반적으로 근무 시간에는 높은 전력 사용량을 보이므로, 이를 반영하는 것은 정확한 전력수요 예측을 도출할 수 있다. 요일과 공휴일 정보는 근무 시간의 여부를 나타낼 수 있는 지표이므로, 이를 총 8개의 더미변수(Dummy Variable)로 나타낸다. 여기서 공휴일은 토요일, 일요일, 공휴일로 구성된다. 그리하여 총 16개의 날짜/시간 정보를 예측 모델의 입력 변수로 구성한다. In general, high power consumption is shown during working hours, so reflecting this can lead to an accurate power demand forecast. Since the information on the day of the week and holidays is an indicator that can indicate working hours, it is represented by a total of eight dummy variables. Here, public holidays consist of Saturdays, Sundays, and public holidays. Thus, a total of 16 date/time information is configured as input variables of the predictive model.

상기 데이터 전처리부(120)가 수행하는 기상 정보에 대한 전처리 과정(122)을 다음과 같이 예시한다.The pre-processing process 122 for the weather information performed by the data pre-processing unit 120 is exemplified as follows.

본 발명에서는 실제 스마트 그리드 환경에서 효과적으로 적용할 수 있도록 기상청 동네예보의 기상 정보를 수집한다. 이는 통상적인 예측 모델은 기상청 동네예보의 기상 예측값을 토대로 전력수요 예측을 수행하므로, 실제 기상정보를 예측 모델이 학습한다면 기상 예측값과 실제값의 오차로 인해 정확한 전력수요 예측이 어려워질 수 있기 때문이다. 수집된 기상 정보는 전력 사용량과 높은 연관성을 가지는 온도, 습도, 풍속이며, 3시간 단위로 수집되므로 선형 보간법을 통해 3시간 단위를 15분 단위로 세분화한다.In the present invention, weather information of the Meteorological Administration's neighborhood forecast is collected so that it can be effectively applied in an actual smart grid environment. This is because a typical forecasting model predicts power demand based on the weather forecast of the Meteorological Administration's neighborhood forecast, so if the predictive model learns the actual weather information, it may be difficult to accurately predict the electric power demand due to the error between the forecasted value and the actual value. . The collected meteorological information is temperature, humidity, and wind speed, which are highly correlated with power consumption. Since they are collected every 3 hours, the 3-hour unit is subdivided into 15-minute units through linear interpolation.

이는, 날씨 데이터는 3 시간 해상도로 수집되었으므로 직접 날씨 데이터를 적용하는 것은 적절하지 않음을 감안한 것이며, 예측 기간과의 차이로 인해 예측 모델. 같은 날씨 데이터를 얻기 위해 전기 에너지 소비의 시간 분해능으로 선형 보간법을 사용하여 15분 해상도로 날씨 데이터를 추정한 것이다.This takes into account that the weather data was collected with a 3-hour resolution, so it is not appropriate to apply the weather data directly, and due to the difference from the forecast period, the forecast model. To obtain the same weather data, we estimate the weather data at 15-minute resolution using linear interpolation with the temporal resolution of electrical energy consumption.

날씨 정보는 기상청의 동네예보에서 예측한 기온, 습도, 풍속으로 구성되어 있습니다. 이 세 가지 정보만 반영하는 것은 실제 전력 사용량에 큰 영향을 주는 요인이며, 기상청 동네예보에서 3시간 단위로 구성된 숫자로 제공되는 정보임을 감안한 것이다. 그러나, 동네예보는 3시간 단위로 예보를 하므로, 실제 15분 단위의 전력수요를 예측할 때 단위가 맞지 않아 정확한 예측이 어렵다는 단점이 존재하는 바, 하기 수학식 9에 따른 선형 보간법을 사용해 3시간 단위의 기상 정보를 15분 단위로 조정/반영할 수 있다.Weather information consists of temperature, humidity, and wind speed predicted by the local weather forecast of the Korea Meteorological Administration. Reflecting only these three pieces of information is a factor that has a big impact on actual power consumption, considering that the information provided in 3-hour units from the Meteorological Administration's neighborhood forecast. However, since the neighborhood forecast forecasts in units of 3 hours, there is a disadvantage in that it is difficult to accurately predict because the units do not match when actually predicting the power demand in units of 15 minutes. weather information can be adjusted/reflected in 15-minute increments.

일반적으로 두 지점

,

에서의 데이터 값이 각각

,

일 때,

,

사이의 임의의 지점

에서의 데이터 값

를 계산할 수 있다.

은

에서

까지의 거리,

는

에서

까지의 거리를 말합니다. 즉, 오후 3시의 기온이 15도이고, 오후 6시의 기온이 18도라면, 오후 4시와 5시의 기온은 16도와 17도라는 것으로 유추할 수 있다. usually two points

,

Each data value in

,

when,

,

any point between

data value in

can be calculated.

silver

at

distance to,

Is

at

refers to the distance to That is, if the temperature at 3 PM is 15 degrees and the temperature at 6 PM is 18 degrees, it can be inferred that the temperatures at 4 PM and 5 PM are 16 degrees and 17 degrees.

상기 데이터 전처리부(120)가 수행하는 전력 사용량 정보에 대한 전처리 과정(124)을 다음과 같이 예시한다.The pre-processing process 124 for the power usage information performed by the data pre-processing unit 120 is exemplified as follows.

과거 전력 사용량은 예측 시점의 하루 전과 일주일 전 시점의 실제 전력 사용량을 사용하는 것이 유리하다. 다른 과거 전력 사용량, 즉 2일 전부터 6일 전까지의 전력 사용량도 반영할 수 있겠지만, 이는 예측 시점과의 전력 사용량과 높은 연관성을 보이지 않으므로, 심층 학습 기법이 예측 모델을 학습할 때 잘못 학습(과적합 문제)할 수 있기 때문이다.For the past power usage, it is advantageous to use the actual power usage one day before and one week before the forecast time. It can also reflect other historical power usage, that is, power usage from 2 days to 6 days ago, but it does not show a high correlation with the power usage at the time of prediction, so the deep learning technique learns incorrectly (overfitting) when training the prediction model. problem) because

이를 입증하기 위해, 예측 시점의 전력 사용량과 예측 시점의 과거 시점 전력 사용량 간의 연관성을 파악하기 위해 피어슨 상관계수를 사용할 수 있으며, 구체적으로는 하기 수학식 10과 같다.To prove this, the Pearson correlation coefficient may be used to determine the correlation between the power usage at the prediction time and the power usage at the past time at the prediction time, and specifically, Equation 10 below.

두 변수 x와 y 사이의 선형 상관관계를 측정하는 피어슨 상관계수는 이러한 변수의 공분산을 동일 또는 비례척도 데이터의 표준 편차 곱으로 나눈 값이며, Cauchy-Schwarz 부등식에 따라 +1과 -1 사이의 값을 가질 수 있다. 여기서 +1은 완전한 양의 선형 상관관계이고 0은 선형의 상관관계가 없음을 나타내며, -1은 완벽한 음의 선형 상관관계이다. 일반적으로 피어슨 상관계수의 수치는 아래와 같이 해석할 수 있다.The Pearson correlation coefficient, which measures the linear correlation between two variables x and y, is the value of the covariance of these variables divided by the product of the standard deviation of equal or proportional scale data, and is a value between +1 and -1 according to the Cauchy-Schwarz inequality. can have where +1 is a perfect positive linear correlation, 0 indicates no linear correlation, and -1 is a perfect negative linear correlation. In general, the numerical value of the Pearson correlation coefficient can be interpreted as follows.

r이 -1.0과 -0.7 사이이면, 강한 음적 선형관계If r is between -1.0 and -0.7, then there is a strong negative linear relationship

r이 -0.7과 -0.3 사이이면, 뚜렷한 음적 선형관계If r is between -0.7 and -0.3, then there is a pronounced negative linear relationship

r이 -0.3과 -0.1 사이이면, 약한 음적 선형관계If r is between -0.3 and -0.1, a weakly negative linear relationship

r이 -0.1과 +0.1 사이이면, 거의 무시될 수 있는 선형관계If r is between -0.1 and +0.1, then a linear relationship that can be almost neglected

r이 +0.1과 +0.3 사이이면, 약한 양적 선형관계If r is between +0.1 and +0.3, then there is a weak quantitative linear relationship

r이 +0.3과 +0.7 사이이면, 뚜렷한 양적 선형관계If r is between +0.3 and +0.7, then there is a clear quantitative linear relationship

r이 +0.7과 +1.0 사이이면, 강한 양적 선형관계If r is between +0.7 and +1.0, then there is a strong quantitative linear relationship

상기 수학식 10에서 n쌍으로 구성된 쌍을 이룬 데이터{(

),...,(

)}에 대해 표본을 기반으로 공분산과 분산을 대체하는 추정값인

를 산출할 수 있다. 여기서,

은 샘플 크기를 나타내며,

와

는 시간

로 인덱스된 개별 샘플 시점이다.

와

는 각각

와

의 표본 평균이다. In Equation 10, paired data consisting of n pairs {(

),...,(

), which is an estimate that substitutes for covariance and variance based on the sample for

can be calculated. here,

represents the sample size,

Wow

is the time

Individual sample time points indexed by .

Wow

is each

Wow

is the sample mean of

보유하고 있는 데이터 셋에서 예측 시점의 전력 사용량과 예측 시점의 과거 1일부터 7일까지의 전력 사용량의 피어슨 상관계수를 계산하였을 때, 예측 시점의 전력 사용량과 예측 시점의 하루 전과 일주일 전 전력 사용량과의 피어슨 상관계수가 0.7 이상으로 산출되어 높은 연관성을 가진 변수라는 것을 알 수 있었며, 본 발명에서도 이를 고려하여 입력 변수로 구성하였다.When the Pearson correlation coefficient is calculated between the power consumption at the prediction time and the power usage from 1 to 7 days in the past from the data set you have, the power usage at the prediction time and the power usage one day and one week before the prediction time The Pearson's correlation coefficient was calculated to be 0.7 or higher, indicating that it was a variable with high correlation, and the present invention also considered this as an input variable.

정리하면, 과거 전력 사용량(Historical Electric Load)은 최근 전력 사용량 추세를 반영할 수 있는 변수지만, 이를 모두 반영하는 것은 예측 모델의 과적합(Overfitting) 문제를 발생시킬 수 있는 바, 예측 시점과 예측 시점의 하루 전 동일 시점부터 일주일 전 동일 시점까지의 피어슨 상관계수를 계산하였으며, 예측 시점의 하루 전과 일주일 전 시점이 높은 상관관계를 보인다는 것을 확인하였다. 그래서, 이를 예측 모델의 입력 변수로 반영하였다. In summary, historical electric load is a variable that can reflect the recent electricity usage trend, but reflecting all of them can cause overfitting of the forecasting model. The Pearson correlation coefficient was calculated from the same time point one day before to the same time point one week ago, and it was confirmed that the day before and one week before the prediction time showed a high correlation. So, this was reflected as an input variable of the predictive model.

도 3에 도시한 예측 모델 구성부(162, 164, 166)에서는 효과적인 심층 신경망 모델 학습을 위해, 입력 변수 구성부에서 구성된 입력 변수들을 최대-최소 정규화(Min-Max normalization)를 통해 0~1사이의 값들로 변경한다. 다음으로 데이터 수집 기간을 일정한 비율로 훈련 집합, 검증 집합, 평가 집합으로 분할한다. In the predictive model constructing units 162, 164, and 166 shown in FIG. 3, for effective deep neural network model learning, the input variables configured in the input variable constructing unit are subjected to Min-Max normalization between 0 and 1 change to the values of Next, the data collection period is divided into a training set, a validation set, and an evaluation set at a constant rate.

도 3에서는 저장부(140)에 기록된 입력 정보를 최소-최대 정규화를 수행하는 정규화부(146); 및 기본 단위 예측모델 학습용 데이터 세트, 확장된 예측모델 구성용 데이터 세트, 실제 예측용 데이터 세트 등을 구성하는 데이터 세트 구성부(148)를 더 구비하는 것으로 표현하였다.In FIG. 3, a normalization unit 146 that performs min-max normalization on the input information recorded in the storage unit 140; And it is expressed as further comprising a data set configuration unit 148 constituting a basic unit prediction model training data set, an extended prediction model configuration data set, an actual prediction data set, and the like.

또한, 도 3에서는 도 1의 상기 예측 모델링부(160)를, 최소 은닉 계층을 가지는 심층 신경망(DNN) 모델로서 기본 단위 예측모델에 대한 학습(각 노드별 및 각 분기별 가중치들 결정)을 수행하는 기본 단위 예측모델 학습부(162); 학습이 완료된 기본 단위 예측모델로부터, 은닉 계층이 증가된 심층 신경망(DNN) 모델로서 하나 이상의 부가 단위 예측모델들을 도출하여, 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들로 구성된 다중 예측모델을 생성하는 다중 예측모델 구성부(169); 및 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들의 각 개별 예측값들을 스태킹 앙상블 처리하여 최종 예측값을 산출하는 앙상블 예측부(166)로 구분하여 도시하였다.In addition, in FIG. 3, the prediction modeling unit 160 of FIG. 1 is a deep neural network (DNN) model having a minimum hidden layer, and learning (determining weights for each node and each branch) is performed on a basic unit prediction model. a basic unit prediction model learning unit 162; From the basic unit prediction model that has been trained, one or more additional unit prediction models are derived as a deep neural network (DNN) model with an increased hidden layer, and a multiple prediction model composed of the basic unit prediction model and the additional unit prediction models is generated. a multi-prediction model configuration unit 169; and an ensemble prediction unit 166 that calculates a final prediction value by performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models.

하기 표 1은 각 학습모델의 입력 데이터 세트를 구성할 수 있는 입력 변수들을 예시한다.Table 1 below illustrates input variables that may constitute the input data set of each learning model.

상기 표 1에서 총 21개의 입력 변수를 구성하였으며, 심층 신경망 모델의 효과적인 학습을 위해 이 변수들은 상기 정규화부(146)에 의해 하기 수학식 11과 같은 최대-최소 정규화를 거칠 수 있다.A total of 21 input variables are configured in Table 1, and for effective learning of the deep neural network model, these variables may be subjected to maximum-minimum normalization as shown in Equation 11 below by the regularizer 146 .

상기 수학식에서

는 변수

에 속해 있는

라는 변숫값의 정규화를 수행한 값이며,

와

은 데이터 셋

에서 가장 큰 값과 작은 값을 나타낸다. 그에 따라, 데이터 셋의 전체 변수는 0에서 1 사이의 값을 가지게 되며, 이를 심층 신경망 모델에 적용한다.in the above formula

is a variable

belonging to

It is a value obtained by normalizing the variable value of

Wow

silver data set

represents the largest and smallest values in Accordingly, all variables in the data set have a value between 0 and 1, and this is applied to the deep neural network model.

일반적인 인공 신경망은 하기 수학식 12와 같은 과정으로 예측값을 도출한다.A general artificial neural network derives a predicted value through a process as in Equation 12 below.

상기 수학식에서

는 비선형 함수이고,

는 연속형 또는 이산형 변수들의 벡터로 입력 변수이며,

는 연속형 또는 이산형 변수들의 벡터로 출력 노드이다.

는

의 함수로

는 은닉 노드를 가리키며,

는 활성화 함수이며,

는 가중치이다. 여기서 활성화 함수는 임계치를 적용해 의미 없는 데이터를 사전에 필터링하고 미분을 편하게 하기 위해서 사용하는 함수이다. in the above formula

is a nonlinear function,

is the input variable as a vector of continuous or discrete variables,

is the output node as a vector of continuous or discrete variables.

Is

as a function of

points to the hidden node,

is the activation function,

is the weight. Here, the activation function is a function used to filter meaningless data in advance by applying a threshold value and to facilitate differentiation.

신경망에서 중요한 것은 바로 가중치를 조절하는 것이며, 가중치 조절을 어떻게 수행하느냐에 따라 신경망 모델의 정확도가 달라지는 바, 학습 결과 또한 가중치로서 반영될 수 있다. 피드포워드(Feedforward)는 입력층, 은닉층, 출력층까지 순서대로 계산하는 방법이고, 역전파(Backpropagation)은 레이블 된 학습 데이터를 가지고 여러 개의 은닉층을 가지는 피드포워드 신경망을 학습시킬 때 사용되는 대표적인 지도학습 알고리즘이다. 즉 forward phase + backward phase이며, 입력층에서 은닉층, 출력층까지 순서대로 갔다가, 다시 출력층에서 은닉층, 입력층으로 돌아가면서 가중치를 수정하고 위 과정을 계속 반복하며 가장 좋은 결과를 도출할 수 있다.The important thing in a neural network is to adjust the weight, and the accuracy of the neural network model varies depending on how the weight is adjusted, so the learning result can also be reflected as a weight. Feedforward is a method of calculating the input layer, hidden layer, and output layer in order. Backpropagation is a representative supervised learning algorithm used when training a feedforward neural network with multiple hidden layers with labeled training data. to be. That is, it is a forward phase + backward phase, and it goes from the input layer to the hidden layer and the output layer in order, then goes back from the output layer to the hidden layer to the input layer, correcting the weights and repeating the above process to get the best result.

심층 신경망(DNN)은 입력층과 출력층 사이에 여러 개의 은닉층으로 이루어진 인공 신경망이다. 심층 신경망은 일반적인 인공신 경망과 마찬가지로 복잡한 비선형 관계(non-linear relationship)들을 모델링할 수 있다. 심층 신경망의 장점으로는 연속형, 범주형 변수에 상관없이 모두 분석이 가능하며, 입력 변수 간의 비선형 조합(입력 변수가 실제로 전력 사용량과 높은 연관성을 가지나, 피어슨 상관계수는 낮은 경우)이 가능하다. 또한, 예측 성능이 다른 기계학습 기법들보다 상대적으로 우수한 경우가 많으며, 특징 추출이 자동으로 수행되므로, 변수 선택의 번거로움을 줄일 수 있고, 데이터의 양이 많아지면 성능이 계속 향상된다. A deep neural network (DNN) is an artificial neural network composed of several hidden layers between an input layer and an output layer. Deep neural networks can model complex non-linear relationships like general artificial neural networks. The advantage of deep neural networks is that analysis is possible regardless of continuous and categorical variables, and nonlinear combinations between input variables (when input variables actually have high correlation with power consumption but low Pearson correlation coefficient) are possible. In addition, prediction performance is often relatively superior to other machine learning techniques, and since feature extraction is performed automatically, the hassle of variable selection can be reduced, and performance continues to improve as the amount of data increases.

하지만, 신경망이 복잡할 때에는 작동하는 데 오랜 시간이 요구되어, GPU가 장착된 컴퓨터 및 고사양의 컴퓨터가 필요하며, 다양한 초매개변수들을 고려해야 하기 때문에 최적의 모델을 검출하는 데 오랜 시간이 걸리는 반면, 시간 및 계산 비용이 높은 조합이 다른 데이터 셋에서 정확성을 담보하지는 않는다.However, when the neural network is complex, it takes a long time to operate, requiring a computer equipped with a GPU and a high-end computer, and it takes a long time to detect the optimal model because various hyperparameters must be considered. Combinations that are time- and computationally expensive do not guarantee accuracy in other data sets.

신경망의 초매개변수로는 은닉층의 수, 은닉노드의 수, 활성화 함수, Epoch, Learning Rate, Optimizer, Batch Size 등의 다양한 조합을 고려해야 하는데 여기서 은닉층의 수는 인공 신경망과 심층 신경망을 나누는 기준이 되고 모델 성능에 큰 영향을 주기 때문에 최적의 은닉층의 수를 결정하는 것은 매우 중요하다.As hyperparameters of neural networks, various combinations such as number of hidden layers, number of hidden nodes, activation function, epoch, learning rate, optimizer, and batch size should be considered. It is very important to determine the optimal number of hidden layers because it greatly affects the model performance.

일반적으로 Learning Rate나 Optimizer 등은 다른 연구 사례에 참고하여 최적의 값을 대략 파악할 수 있지만, 은닉층의 수는 그렇지 않다. 본 발명은 은닉층의 수와 상관없이 여러 심층 신경망 모델을 만들고 그 심층 신경망들에서 1차로 예측한 전력수요 예측값을 이용하여 스태킹 앙상블 기법을 통해 더 정확한 전력수요를 예측하는 것을 지향한다.In general, learning rate or optimizer can roughly figure out the optimal value by referring to other research cases, but the number of hidden layers is not. The present invention creates several deep neural network models irrespective of the number of hidden layers, and uses the power demand forecast value primarily predicted from the deep neural networks to more accurately predict the power demand through the stacking ensemble technique.

스태킹 앙상블 기법은 다양한 예측 모델의 여러 결괏값을 효과적으로 결합하여 최종 예측을 도출하는 기법이다. 이는 각 예측 모델은 사전에 이미 충분한 정보들을 학습하고 예측하였기 때문에, 예측값에 사전 정보도 충분히 반영되었다는 가정하에 수행되는 기법이다.The stacking ensemble technique is a technique for deriving a final prediction by effectively combining multiple results of various prediction models. This is a technique performed under the assumption that the prior information is sufficiently reflected in the prediction value because each prediction model has already learned and predicted sufficient information in advance.

이를 위해 주성분 회귀 분석이 사용될 수 있는데, 주성분 회귀분석은 말 그대로 입력 변수들의 주성분(Principal Component)들을 추출한 후 이 주성분들을 이용해서 회귀 모델을 만드는 기법이다. 즉, 주성분 분석 + 다중선형회귀분석인데, 주성분들은 서로 직교하기 때문에 다중공선성(입력 변수가 매우 유사하여 모델 학습 시 가중치 선정이 어려움)이 발생하지 않는다.For this purpose, principal component regression analysis can be used. Principal component regression analysis is a technique for creating a regression model using the principal components after extracting the principal components of the input variables. That is, in principal component analysis + multiple linear regression analysis, since the principal components are orthogonal to each other, multicollinearity (difficulty in selecting weights during model training because input variables are very similar) does not occur.

따라서 안심하고 다중선형회귀모델을 만들 수 있다. 게다가 주성분 분석을 이용해서 변환한 주성분 변수 중에서 상위 몇 개의 변수만 이용할 경우 표준화 효과를 줄 수 있어 모델의 과적합 문제(잘못 학습)도 완화할 수 있다. Therefore, multiple linear regression models can be created with confidence. In addition, if only the top few variables among the principal component variables transformed using principal component analysis are used, a standardization effect can be given, and the problem of overfitting the model (mislearning) can also be alleviated.

하지만, 일반적인 예측 모델을 구성할 때, 학습 데이터 셋과 평가 데이터 셋을 분류하여, 학습 데이터 셋에서 예측 모델을 구성하고, 이를 그대로 평가 데이터 셋에 적용합니다. 그리하면, 모델을 구성하는 시점과 예측 시점의 간격이 커질 때(평가 데이터 셋의 끝 시점), 예측 성능이 저하된다는 단점이 존재합니다.However, when constructing a general prediction model, the training data set and the evaluation data set are classified, the prediction model is constructed from the training data set, and it is applied to the evaluation data set as it is. In doing so, when the interval between the time of constructing the model and the time of prediction becomes large (the end of the evaluation data set), there is a disadvantage that the prediction performance deteriorates.

이를 보완하기 위해서는 예측 모델은 최근 트렌드와 패턴을 실시간으로 반영해야 합니다. 그리하여 여러 심층 신경망 모델(각 4층부터 7층으로 구성된 심층 신경망)이 평가 데이터 셋에서 예측한 값을 이용합니다. 여기서 모든 예측값을 적용하지 않고, 과거 일주일 치의 예측값만을 적용합니다.To compensate, predictive models must reflect recent trends and patterns in real time. Thus, several deep neural network models (deep neural networks with 4 to 7 layers each) use the predicted values from the evaluation data set. Here, we do not apply all forecasts, only the forecasts for the past week.

예를 들어, 4월 1일 오후 1시부터 4월 2일 자정 45분의 전력 사용량(15분 단위 96 시점)을 예측한다고 할 때, 3월 25일 오후 1시 15분부터 4월 1일 오후 12시 45분까지 총 7일의 시점을 고려합니다. 각 예측 모델(4개의 심층 신경망 모델)에서 예측한 값을 이용하여 주성분 회귀분석 기반의 예측 모델을 구성하고, 이를 4개 심층 신경망 모델의 4월 1일 오후 1시부터 4월 2일 자정 45분의 전력 사용량 예측값을 주성분 회귀분석 모델에 입력하면 주성분 회귀분석 모델은 최종적으로 96 시점 전력 사용량을 예측하는 방식으로 구성되어 있습니다. For example, assuming that the electricity usage (96 points in 15-minute increments) is predicted for 45 minutes from 1:00 PM on April 1 to midnight on April 2, from 1:15 PM on March 25th to April 1 PM A total of 7 days up to 12:45 are considered. A prediction model based on principal component regression analysis is constructed using the predicted values of each prediction model (4 deep neural network models), and the prediction model based on principal component regression is constructed from 1:00 PM on April 1st to midnight 45 minutes on April 2nd of the four deep neural network models. When the predicted power usage of , is input to the principal component regression model, the principal component regression model is finally constructed in such a way that it predicts the power usage at 96 points in time.

일반적으로 신경망의 초매개변수들, 특히 epoch 등은, 특정 건물(사이트)에 대한 전력수요 예측 장치의 설계시에 상기 다중 예측모델 구성부에 의해 최적값이 학습되어 결정되면, 상기 특정 건물에 대한 전력수요 예측 운영시에는, 상술한 파라미터에 대한 최적값들은 변경되지 않는다. In general, the hyperparameters of the neural network, particularly epoch, are determined by learning and determining optimal values by the multi-prediction model configuration unit at the time of designing an electric power demand prediction device for a specific building (site), for the specific building (site). During the power demand forecasting operation, the optimal values for the above-described parameters are not changed.

반면, 다중 예측모델을 구성하는 기본 단위 예측모델 및 부가 단위 예측모델에 대한 가중치들은 상기 특정 건물에 대한 전력수요 예측 운영시에 계속 학습된 값으로 보정된다. 이를 위해, 도 3의 앙상블 예측부(166) 내부에 도시한 바와 같은 슬라이딩 윈도우를 적용할 수 있다.On the other hand, the weights for the basic unit prediction model and the additional unit prediction model constituting the multi-prediction model are corrected with values continuously learned during the operation of predicting the power demand for the specific building. To this end, a sliding window as shown inside the ensemble prediction unit 166 of FIG. 3 may be applied.

상기 가중치 외에도 초매개변수들 중 일부는 상기 특정 건물에 대한 전력수요 예측 운영시에 계속 학습된 값으로 보정될 수 있다.In addition to the weight, some of the hyperparameters may be corrected to values that are continuously learned during the operation of predicting power demand for the specific building.

도 4는 본 발명의 사상에 따른 단기 전력수요 예측 방법의 일 실시예를 도시한 흐름도이다.4 is a flowchart illustrating an embodiment of a method for predicting short-term power demand according to the spirit of the present invention.

도시한 단기 전력수요 예측 방법은, 소정 기간의 기상 정보 및 전력 사용량 데이터를 수집하는 단계(S110); 수집된 데이터들을 특징 추출하고 및 최대 최소 정규화를 포함하는 전처리를 수행하는 단계(S120); 최소 은닉 계층을 가지는 기본 단위 예측모델에 대한 학습(각 노드별 및 각 분기별 가중치들 결정)을 수행하는 단계(S160); 학습이 완료된 기본 단위 예측모델로부터, 은닉 계층이 증가된 하나 이상의 부가 단위 예측모델들을 도출하여, 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들로 구성된 다중 예측모델을 구성하는 단계(S170); 및 상기 기본 단위 예측모델과 상기 부가 단위 예측모델들의 각 개별 예측값들을 스태킹 앙상블 처리하여 최종 예측값을 산출하는 단계(180)를 포함할 수 있다.The illustrated short-term power demand prediction method includes the steps of collecting weather information and power usage data for a predetermined period (S110); performing pre-processing including feature extraction of the collected data and maximum minimum normalization (S120); performing learning (determining weights for each node and each branch) on a basic unit prediction model having a minimum hidden layer (S160); deriving one or more additional unit prediction models with an increased hidden layer from the basic unit prediction model for which learning is completed, and constructing a multi-prediction model including the basic unit prediction model and the additional unit prediction models (S170); and performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models to calculate a final prediction value ( 180 ).

도시한 흐름도에 따른 스태킹 앙상블 기법을 적용한 다중 예측모델 예측 방법은 예측 모델의 과적합 문제를 해결하기 위해 훈련 집합의 데이터 셋을 이용하여 은닉층 2개를 갖는 심층 신경망 모델을 구성하고 검증 집합에서 최적의 Epoch를 검출한다. 다음으로, 검출된 Epoch를 적용하여 훈련 집합과 검증 집합의 데이터 셋을 이용하여 각기 다른 은닉층 개수를 갖는 4개의 심층 신경망 모델을 개발한 다음, 이를 평가 집합에 적용하여 예측값들을 도출한다.The multi-prediction model prediction method applying the stacking ensemble technique according to the flowchart shown is to construct a deep neural network model with two hidden layers using the data set of the training set to solve the overfitting problem of the prediction model, and optimize the model in the validation set. Epoch is detected. Next, four deep neural network models with different number of hidden layers are developed using the data sets of the training set and the validation set by applying the detected Epoch, and then applied to the evaluation set to derive predicted values.

예컨대, 하기 수학식 13과 같이, 4개의 심층 신경망 모델의 예측값을 통해 슬라이딩 윈도(Sliding Window) 기반의 주성분 회귀분석(Principal Component Regression)을 이용한 스태킹 앙상블 모델을 구성할 수 있다. 주성분 회귀분석은 소수의 주성분만으로 대부분의 데이터 변동성을 설명할 수 있으며, 입력 변수 간의 다중공성선 및 과적합 방지 등과 같은 장점이 있다. For example, as shown in Equation 13 below, a stacking ensemble model using sliding window-based principal component regression may be configured through the predicted values of four deep neural network models. Principal component regression analysis can account for most data variability with only a few principal components, and has advantages such as multicollinearity between input variables and prevention of overfitting.

예컨대, 보다 구체적으로, 슬라이딩 윈도 크기(Size)는 일주일, 주성분은 1개인 스태킹 앙상블 모델을 구성하였으며, 이를 이용하여 현재 시점에서 15분 뒤부터 하루 뒤까지의 15분 단위 초단기 전력수요 예측을 수행한다. 본 발명은 15분 단위뿐만 아니라 30분 단위, 1시간 단위에서도 적용할 수 있다.For example, more specifically, a stacking ensemble model with a sliding window size of one week and one main component is constructed, and using this, the ultra-short-term power demand prediction is performed in 15-minute units from 15 minutes after the present time to one day later. . The present invention can be applied not only in units of 15 minutes, but also in units of 30 minutes and 1 hour.

본 발명이 속하는 기술 분야의 당업자는 본 발명이 그 기술적 사상이나 필수적 특징을 변경하지 않고서 다른 구체적인 형태로 실시될 수 있으므로, 이상에서 기술한 실시 예들은 모든 면에서 예시적인 것이며 한정적인 것이 아닌 것으로서 이해해야만 한다. 본 발명의 범위는 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 등가개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다.Those skilled in the art to which the present invention pertains should understand that the present invention can be embodied in other specific forms without changing the technical spirit or essential characteristics thereof, so the embodiments described above are illustrative in all respects and not restrictive. only do The scope of the present invention is indicated by the following claims rather than the detailed description, and all changes or modifications derived from the meaning and scope of the claims and their equivalent concepts should be construed as being included in the scope of the present invention. .

110 : 데이터 수집부
120 : 전처리부
160 : 예측 모델링부110: data collection unit
120: preprocessor
160: predictive modeling unit

Claims

a data collection unit for collecting date/time information, weather information, and power usage data of a predetermined period;
a data pre-processing unit performing pre-processing including feature extraction of the collected data and maximum/minimum normalization; and
A predictive modeling unit that applies the preprocessed input data to unit prediction models having two or more different number of layers based on a deep neural network, and calculates a final prediction value from the prediction values derived by each of the unit prediction models
A stacking ensemble type short-term power demand forecasting device comprising a.

According to claim 1,
The predictive modeling unit,
a basic unit prediction model learning unit that performs learning on a basic unit prediction model having a minimum hidden layer;
a multi-prediction model construction unit that derives one or more additional unit prediction models with an increased hidden layer from the basic unit prediction model that has been trained, and generates a multi-prediction model composed of the basic unit prediction model and the additional unit prediction models; and
An ensemble prediction unit that calculates a final prediction value by performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models
Short-term power demand forecasting device comprising a.

3. The method of claim 2,
The basic unit prediction model and the additional unit prediction models are a short-term power demand prediction device that is a deep neural network model.

4. The method of claim 3,
The ensemble prediction unit,
A short-term power demand prediction apparatus for performing principal component regression analysis to create a regression model using the principal components after extracting principal components from each of the individual prediction values of the additional unit prediction models based on a sliding window.

3. The method of claim 2,
The ensemble prediction unit,
A short-term power demand prediction apparatus for stacking ensemble processing by applying a sliding window so that the weights for the basic unit prediction model and the additional unit prediction models are corrected to the values learned continuously during the power demand prediction operation.

According to claim 1,
The data preprocessor,
Augmenting one-dimensional time information into a two-dimensional rotational coordinate system,
A short-term power demand forecasting device that indicates the day of the week and public holidays as 1 on the day corresponding to the holiday, and 0 otherwise.

According to claim 1,
The data preprocessor,
A short-term power demand forecasting device that applies the Pearson correlation coefficient to determine the correlation between the power usage at the time of prediction and the power usage at the time of the past at the time of prediction.

According to claim 1,
A storage unit in which collected data or pre-processed data and learning results are stored
Short-term power demand forecasting device further comprising a.

According to claim 1,
The short-term power demand prediction device in which the weights for the basic unit prediction model and the additional unit prediction models are corrected to values that are continuously learned during the operation of predicting power demand for a target building.

collecting weather information and power usage data for a predetermined period;
performing preprocessing including feature extraction of the collected data and maximum minimum normalization;
performing learning on a basic unit prediction model having a minimum hidden layer;
deriving one or more additional unit prediction models with an increased hidden layer from the basic unit prediction model for which learning is completed, and constructing a multi-prediction model composed of the basic unit prediction model and the additional unit prediction models;
Calculating a final predicted value by performing stacking ensemble processing on individual prediction values of the basic unit prediction model and the additional unit prediction models
Stacking ensemble type short-term power demand forecasting method comprising a.

11. The method of claim 10,
The basic unit prediction model and the additional unit prediction models are short-term power demand prediction methods that are deep neural network models.

11. The method of claim 10,
In the step of calculating the final predicted value,
A short-term power demand prediction method for performing principal component regression analysis to create a regression model using the principal components after extracting principal components from each of the individual prediction values of the additional unit prediction models based on a sliding window.

11. The method of claim 10,
In the step of calculating the final predicted value,
A short-term power demand prediction method for stacking ensemble processing by applying a sliding window so that the weights for the basic unit prediction model and the additional unit prediction models are corrected to the values learned continuously during the power demand prediction operation.

11. The method of claim 10,
In the step of performing the pre-processing,
Augmenting one-dimensional time information into a two-dimensional rotational coordinate system,
Short-term electricity demand forecasting method in which information on days of the week and holidays is marked with 1 on the day corresponding to a public holiday and 0 otherwise.

11. The method of claim 10,
In the step of performing the pre-processing,
A short-term power demand forecasting method that applies the Pearson correlation coefficient to determine the correlation between the power consumption at the time of prediction and the power usage at the time of the past at the time of prediction.

11. The method of claim 10,
The short-term power demand prediction method in which the weights for the basic unit prediction model and the additional unit prediction models are corrected to values that are continuously learned during the operation of predicting power demand for a target building.