KR20220107794A

KR20220107794A - Power information prediction method and system based on transfer learning

Info

Publication number: KR20220107794A
Application number: KR1020210010910A
Authority: KR
Inventors: 황인준; 정승민; 박성우; 정승원
Original assignee: 한국전력공사; 고려대학교 산학협력단
Priority date: 2021-01-26
Filing date: 2021-01-26
Publication date: 2022-08-02

Abstract

A transfer learning-based power information prediction system of the present invention comprises: a data collection unit that collects past power data and relevant environmental information obtainable by region or group; a data preprocessing unit that preprocesses the data collected by the data collection unit; a correlation analysis unit that selects a regional dataset with the highest Pearson correlation coefficient based on a target dataset for a specific region and other datasets for different unspecified regions; a model pretraining unit that trains a learning model in advance using the regional dataset selected by the correlation analysis unit; a model fine-tuning unit that fine-tunes the learning model using the target dataset; and a prediction unit that predicts items of the target dataset in the future using the learning model. The present invention aims to provide a transfer learning-based power demand prediction method and system capable of performing effective learning based on transfer learning when there is not sufficient leading data for learning.

Description

POWER INFORMATION PREDICTION METHOD AND SYSTEM BASED ON TRANSFER LEARNING

본 발명은 학습을 위한 선행 데이터가 충분치 않은 상태에서 동종의 다른 지역 및/또는 그룹의 선행 데이터를 이용하여 전이 학습 기반으로 학습을 수행하는 전력 수요 예측 방법에 관한 것으로, 특히, 스마트 시티를 위한 전이 학습 기반의 월간 전력 수요 예측 프로그램 형태로 구현될 수 있는 전이 학습 기반의 전력 수요 예측 방법 및 시스템에 관한 것이다.The present invention relates to a power demand prediction method for performing learning based on transfer learning using precedent data of other regions and/or groups of the same type in a state where precedent data for learning is insufficient It relates to a transfer learning-based power demand prediction method and system that can be implemented in the form of a learning-based monthly power demand prediction program.

전력수요 예측(Electric Load Forecasting)은 전력 계통의 효율적 운영을 위해 사전에 필요한 전력을 확보하는 근거이며, 전력 시스템 장비의 신뢰성을 보장하고 능동적인 전력예비율을 유지함으로써 정전 및 과부하로 인한 손실을 대비하기 위해 사용된다.Electric Load Forecasting is the basis for securing necessary power in advance for the efficient operation of the power system, and it ensures the reliability of power system equipment and maintains an active power reserve to prepare for losses due to power outages and overloads. used for

딥러닝 기법의 발달로, 높은 정확도를 가진 예측 모델을 만들 수 있지만 적은 데이터를 사용하여 모델을 구성하면 정확도를 보장할 수 없다는 단점이 있다. With the development of deep learning techniques, it is possible to create a predictive model with high accuracy, but there is a disadvantage that accuracy cannot be guaranteed if the model is constructed using a small amount of data.

예컨대, 자치구별 월간 전력 수요 예측은 연간 전력 스케줄링을 위해 꼭 필요하다. 하지만, 월간 전력 수요 데이터는 한 달에 한 포인트만 생성되기 때문에 데이터가 부족하다. For example, monthly electricity demand forecast for each autonomous district is essential for annual electricity scheduling. However, the monthly electricity demand data is insufficient because only one point is generated per month.

한편, IOT 기술 및 이를 지원하는 정보 제공 기술의 발전으로, 전력 분야에서 다양한 개소에서 다양한 형태의 정보들을 수집할 수 있으며, 단위 지자체 별로도 전력 분야 데이터를 이용하여 다양한 응용 서비스들을 개발하여 수행할 수 있는데, 각 응용 서비스가 개발될 때 마다 상술한 선행 데이터의 부족 문제가 대두될 것이다.On the other hand, with the development of IOT technology and the information providing technology that supports it, various types of information can be collected from various places in the electric power field, and various application services can be developed and performed by each local government using electric power field data. However, whenever each application service is developed, the aforementioned lack of preceding data will arise.

대한민국 공개공보 10-2019-0062741호Republic of Korea Publication No. 10-2019-0062741

본 발명은 학습을 위한 선행 데이터가 충분치 않은 상태에서 전이 학습 기반으로 유효한 학습을 수행할 수 있는 전이 학습 기반의 전력 수요 예측 방법 및 시스템을 제공하고자 한다.An object of the present invention is to provide a transfer learning-based power demand prediction method and system that can perform effective learning based on transfer learning in a state where there is not enough preceding data for learning.

보다 구체적으로, 본 발명은 데이터 부족 문제를 해결하기 위해 피어슨 상관계수 분석을 사용하여 예측 목표 데이터셋에 연관된 동종의 지역 및/또는 그룹으로 구분된 데이터셋을 찾아 모델을 사전학습하고 목표 데이터셋을 사용하여 DNN 모델을 파인튜닝 하여 예측 모델의 정확도를 향상시키는 방안을 제시한다.More specifically, the present invention uses Pearson correlation coefficient analysis to solve the problem of data shortage, finds a dataset divided into homogeneous regions and/or groups related to the prediction target dataset, pre-trains the model, and selects the target dataset. We present a method to improve the accuracy of the prediction model by fine-tuning the DNN model using

본 발명의 일 측면에 따른 전이 학습 기반의 전력 정보 예측 시스템은, 지역별 또는 그룹별로 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 데이터 수집부; 상기 데이터 수집부가 수집한 데이터를 전처리하는 데이터 전처리부; 특정된 지역에 대한 목표 데이터셋과, 특정되지 않은 다른 지역별 데이터셋들을 기반으로 가장 피어슨 상관계수가 높은 지역별 데이터셋을 선별하는 상관도 분석부; 상기 상관도 분석부에서 선별된 지역별 데이터셋을 이용하여 학습 모델을 사전 학습하는 모델 사전학습부; 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인튜닝하는 모델 파인튜닝부; 및 상기 학습 모델를 이용하여 미래의 상기 목표 데이터셋의 항목들을 예측하는 예측부를 포함할 수 있다.A transfer learning-based power information prediction system according to an aspect of the present invention includes: a data collecting unit for collecting past power data and related environment information obtainable by region or group; a data pre-processing unit for pre-processing the data collected by the data collecting unit; a correlation analysis unit that selects a regional dataset having the highest Pearson correlation coefficient based on a target dataset for a specified region and other unspecified regional datasets; a model pre-training unit for pre-learning a learning model using the regional dataset selected by the correlation analysis unit; a model fine-tuning unit for fine-tuning the learning model using the target dataset; and a predictor for predicting items of the target dataset in the future by using the learning model.

여기서, 상기 데이터 수집부가 수집한 데이터 및 정보를 저장하고, 상기 학습 모델을 저장하는 저장부를 더 포함할 수 있다.Here, the data collection unit may further include a storage unit for storing the collected data and information, and for storing the learning model.

여기서, 상기 상관 계수는 피어슨 상관 계수이며, 상기 학습 모델은 DNN 모델일 수 있다.Here, the correlation coefficient may be a Pearson correlation coefficient, and the learning model may be a DNN model.

여기서, 상기 상관도 분석부는, 상관 계수의 평균값이 0.5 이하이면 상관 계수가 높은 상위 5개를, 평균값 0.5 이상 0.7 미만이면 상위 10개를, 평균값 0.7 이상 0.8 미만이면 상위 20개를, 평균값 0.8 이상이면 상위 30개의 지역별 데이터셋들을 선별할 수 있다.Here, the correlation analysis unit, if the average value of the correlation coefficient is 0.5 or less, the top 5 high correlation coefficient, if the average value 0.5 or more and less than 0.7, the top 10, if the average value 0.7 or more and less than 0.8, the top 20, the average value 0.8 or more In this case, the top 30 regional datasets can be selected.

여기서, 상기 상관도 분석부는, 하기 수학식에 따라 상관 분석을 수행할 수 있다.Here, the correlation analysis unit may perform correlation analysis according to the following equation.

(여기서, R_XY는 X와 Y 사이의 PCC, n은 관측치 수, X_i 및 Y_i는 시간 i의 값,

및

는 각각 X 및 Y의 평균값임)(where R _XY is the PCC between X and Y, n is the number of observations, X _i and Y _i are the values of time i,

and

is the average value of X and Y, respectively)

여기서, 상기 데이터 수집부는, 수집하려는 데이터의 원천 정보를 가지고 있는 외부 기관 서버에 접속 및 해당 원천 정보에 엑세스하여 전송받는 데이터 통신 모듈을 포함할 수 있다.Here, the data collection unit may include a data communication module for accessing an external agency server having source information of the data to be collected and accessing and receiving the source information.

본 발명의 다른 측면에 따른 전이 학습 기반의 전력 정보 예측 방법은, 전력 데이터들을 지역별 또는 그룹별로 구분하는 단계; 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 단계; 수집한 과거 전력 데이터를 전처리하고, 지역별 또는 그룹별 구분에 따라, 지역별 또는 그룹별 데이터셋들을 생성하는 단계; 상기 지역별 또는 그룹별 데이터셋들 중 목표 데이터셋을 지정하고, 선택되지 않은 다른 지역별 또는 그룹별 데이터셋들 중에서 상기 목표 데이터셋과 상관 계수가 높은 것을 선별하는 단계; 선별된 상기 목표 데이터셋과 상관 계수가 높은 지역별 또는 그룹별 데이터셋을 이용하여 학습 모델을 사전 학습시키는 단계; 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인 튜닝하는 단계; 및 상기 학습 모델로 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계를 포함할 수 있다.According to another aspect of the present invention, a method for predicting power information based on transfer learning includes dividing power data by region or group; Collecting obtainable past power data and related environmental information; pre-processing the collected past power data and generating data sets for each region or group according to the division by region or group; designating a target dataset among the datasets for each region or group, and selecting a data set having a high correlation coefficient with the target dataset from among other unselected datasets for each region or group; pre-training a learning model using the selected target data set and a data set for each region or group having a high correlation coefficient; fine-tuning the learning model using the target dataset; and predicting items of the target dataset in the future with the learning model.

여기서, 상기 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계에서는, 미래의 관련 환경 정보에 대한 타 기관의 예측값을 수집하여 이용할 수 있다.Here, in the step of predicting the items of the target dataset in the future, it is possible to collect and use prediction values of other organizations for future related environmental information.

여기서, 모든 각 지역별 데이터셋에 대하여, 상기 목표 데이터셋을 지정하고, 상관 계수가 높은 것을 선별하는 단계 내지 예측하는 단계를 수행할 수 있다.Here, for all regional datasets, the target dataset may be designated, and the step of selecting or predicting the one having a high correlation coefficient may be performed.

본 발명의 또 다른 측면에 따른 전이 학습 기반의 전력 정보 예측 방법은, 특정 지역 또는 그룹의 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 단계; 수집한 과거 전력 데이터를 전처리하여 목표 데이터셋을 생성하는 단계; 기 획득된 다른 지역들의 지역별 데이터셋들 중에서 상기 목표 데이터셋과 상관 계수가 높은 것을 선별하는 단계; 선별된 상기 목표 데이터셋과 상관 계수가 높은 지역별 데이터셋을 이용하여 학습 모델을 사전 학습시키는 단계; 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인 튜닝하는 단계; 및 상기 학습 모델로 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계를 포함할 수 있다.A method for predicting power information based on transfer learning according to another aspect of the present invention includes: collecting historical power data and related environment information obtainable in a specific region or group; generating a target data set by pre-processing the collected past power data; selecting one having a high correlation coefficient with the target dataset from among previously acquired regional datasets of other regions; pre-training a learning model using the selected target dataset and regional datasets having a high correlation coefficient; fine tuning the learning model using the target dataset; and predicting items of the target dataset in the future with the learning model.

상술한 구성의 본 발명의 사상에 따른 전이 학습 기반의 전력 수요 예측 방법 및 시스템을 실시하면, 학습을 위한 선행 데이터가 충분치 않은 상태에서 전이 학습 기반으로 유효한 학습을 수행할 수 있는 이점이 있다.When the method and system for predicting power demand based on transfer learning according to the spirit of the present invention having the above configuration are implemented, there is an advantage in that effective learning can be performed based on transfer learning in a state where the preceding data for learning is insufficient.

도 1은 본 발명의 사상에 따른 전력 수요 예측 시스템을 도시한 블록도.
도 2는 서울시의 자치구별 전력 사용량으로서 서울시 월간 전력 데이터 수집 구조를 나타낸 데이터 구성도.
도 3은 도 2의 서울시 월간 전력 데이터 수집 구조에 대하여 피어슨 상관계수를 계산한 예시를 나타낸 개념도.
도 4a는 대상 도메인 데이터셋와 유사한 대규모 데이터셋을 사용하여 학습-훈련된 신경망에서 생성된 가중치를 사용하는 전이학습의 전체적인 구조를 나타낸 개념도.
도 4b는 일반적인 심층 신경망 구조로서 MTLF의 일반적인 DNN 구조를 나타낸 개념도.
도 4c는 전이 학습 기반 DNN 모델을 구성하는 예를 나타내는 개념도.
도 4d는 월간 전력 수요 예측을 위한 전체적인 데이터 구성 및 과정들을 도시한 개념도.
도 5는 본 발명의 사상에 따른 전력 정보 예측 방법의 일 실시예를 도시한 흐름도.
도 6는 본 발명의 사상에 따른 전력 정보 예측 방법의 다른 실시예를 도시한 흐름도.
도 7a 및 7b는 예측 모델의 MAPE 및 NRMSE에 대한 박스형 그래프.
도 8은 평균 절대 백분율 오차(MAPE) 감소율 대 피어슨 상관 계수의 관계를 나타낸 그래프.1 is a block diagram illustrating a power demand prediction system according to the spirit of the present invention.
2 is a data configuration diagram showing the monthly electricity data collection structure of Seoul as the amount of electricity used by each autonomous district in Seoul.
3 is a conceptual diagram illustrating an example of calculating a Pearson correlation coefficient with respect to the monthly power data collection structure of Seoul of FIG. 2 .
4A is a conceptual diagram showing the overall structure of transfer learning using weights generated from a neural network trained and trained using a large-scale dataset similar to the target domain dataset.
Figure 4b is a conceptual diagram showing a general DNN structure of MTLF as a general deep neural network structure.
4C is a conceptual diagram illustrating an example of configuring a transfer learning-based DNN model.
Figure 4d is a conceptual diagram showing the overall data configuration and processes for monthly power demand forecasting.
5 is a flowchart illustrating an embodiment of a method for predicting power information according to the spirit of the present invention.
6 is a flowchart illustrating another embodiment of a method for predicting power information according to the spirit of the present invention.
7A and 7B are boxed graphs for MAPE and NRMSE of predictive models.
8 is a graph showing the relationship of mean absolute percentage error (MAPE) reduction versus Pearson's correlation coefficient.

본 발명을 설명함에 있어서 제 1, 제 2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 구성요소들은 용어들에 의해 한정되지 않을 수 있다. 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 된다. 예를 들어, 본 발명의 권리 범위를 벗어나지 않으면서 제 1 구성요소는 제 2 구성요소로 명명될 수 있고, 유사하게 제 2 구성요소도 제 1 구성요소로 명명될 수 있다. In describing the present invention, terms such as first, second, etc. may be used to describe various components, but the components may not be limited by the terms. The terms are only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, a first component may be referred to as a second component, and similarly, a second component may also be referred to as a first component.

어떤 구성요소가 다른 구성요소에 연결되어 있다거나 접속되어 있다고 언급되는 경우는, 그 다른 구성요소에 직접적으로 연결되어 있거나 또는 접속되어 있을 수도 있지만, 중간에 다른 구성요소가 존재할 수도 있다고 이해될 수 있다.When a component is referred to as being connected or connected to another component, it may be directly connected or connected to the other component, but it can be understood that other components may exist in between. .

본 명세서에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함할 수 있다. The terms used herein are used only to describe specific embodiments, and are not intended to limit the present invention. The singular expression may include the plural expression unless the context clearly dictates otherwise.

본 명세서에서, 포함하다 또는 구비하다 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것으로서, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해될 수 있다. In this specification, the terms include or include are intended to designate that a feature, number, step, operation, component, part, or combination thereof described in the specification exists, and includes one or more other features or numbers, It may be understood that the possibility of the presence or addition of steps, operations, components, parts, or combinations thereof is not precluded in advance.

또한, 도면에서의 요소들의 형상 및 크기 등은 보다 명확한 설명을 위해 과장될 수 있다.In addition, shapes and sizes of elements in the drawings may be exaggerated for clearer description.

전이학습은 외부데이터를 사용하여 모델을 사전학습한 후 이를 예측에 적용하는 작업으로, 데이터가 부족한 월간 전력 수요 예측에 큰 이점을 가진다. 본 발명에서는 사전학습을 위한 외부데이터의 선택을 위해 피어슨 상관계수를 사용한다.Transfer learning is a task of pre-training a model using external data and then applying it to prediction, which has a great advantage in predicting monthly electricity demand with insufficient data. In the present invention, a Pearson correlation coefficient is used to select external data for prior learning.

도 1은 본 발명의 사상에 따른 전력 수요 예측 시스템을 도시한 블록도이다.1 is a block diagram illustrating a power demand prediction system according to the spirit of the present invention.

도시한 전력 수요 예측 시스템은, 지역별 또는 그룹별로 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 데이터 수집부(110); 상기 데이터 수집부(110)가 수집한 데이터를 전처리하는 데이터 전처리부(120); 특정된 지역 또는 그룹에 대한 목표 데이터셋과, 특정되지 않은 다른 지역별 또는 그룹별 데이터셋들을 기반으로 상관계수가 높은 지역별 또는 그룹별 데이터셋을 선별하는 상관도 분석부(140); 상기 상관도 분석부(140)에서 선별된 지역별 또는 그룹별 데이터셋을 이용하여 학습 모델을 사전 학습하는 모델 사전학습부(150); 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인튜닝하는 모델 파인튜닝부(160); 상기 학습 모델를 이용하여 미래의 상기 특정된 지역의 상기 목표 데이터셋의 항목들을 예측하는 예측부(170)를 포함할 수 있다. The illustrated power demand prediction system includes: a data collection unit 110 for collecting historical power data and related environmental information obtainable for each region or group; a data pre-processing unit 120 for pre-processing the data collected by the data collecting unit 110; a correlation analysis unit 140 for selecting a data set for each region or group with a high correlation coefficient based on the target dataset for a specified region or group and other unspecified region or group-specific datasets; a model pre-training unit 150 for pre-learning a learning model using the data set for each region or group selected by the correlation analysis unit 140; a model fine-tuning unit 160 for fine-tuning the learning model using the target dataset; and a prediction unit 170 for predicting items of the target dataset in the specified region in the future by using the learning model.

도시한 바와 같이, 상기 데이터 수집부(110)가 수집한 데이터 및 정보를 저장하고, 상기 모델 사전학습부(150) 및 모델 파인튜닝부(160), 예측부(170)가 이용하는 상기 학습 모델을 저장하는 저장부(180)를 더 포함하는 것으로 규정할 수 있다.As shown, the data and information collected by the data collection unit 110 are stored, and the learning model used by the model pre-training unit 150 , the model fine-tuning unit 160 , and the prediction unit 170 is used. It can be defined as further comprising a storage unit 180 for storing.

본 발명의 사상은 학습을 위한 선행 데이터가 부족할 때 전이학습을 위한 데이터를 동종의 다른 분류 그룹들으로부터 확보하는 것을 제안하고 있다. The idea of the present invention proposes to secure data for transfer learning from other classification groups of the same kind when the preceding data for learning is insufficient.

분류 그룹은 다양한 기준으로 정해질 수 있으며, 분류 기준의 달라져도 본 발명의 사상을 그대로 적용하는데에 무리가 없는 바, 이 또한 본 발명의 권리범위에 속함은 물론이다. 다만, 설명 및 이해의 편의를 위해 가장 널리 사용되는 분류 그룹인 지역(행정 지역)에 따라 분류 그룹을 나눈 경우, 지역별이라 칭하고, 다른 분류 그룹들에 대한 경우는 그룹별이라 칭하였음을 밝혀둔다. 이하의 실시예에서는 자치구 및 업종 카테고리를 적용한 분류를 지역별로 칭하고, 상기 지역별로 분류(구분)되는 경우로 구체화하여 예시하겠다. The classification group may be determined by various criteria, and even if the classification criteria are changed, there is no difficulty in applying the spirit of the present invention as it is, which also falls within the scope of the present invention. However, for the convenience of explanation and understanding, if the classification group is divided according to the region (administrative region), which is the most widely used classification group, it is called region, and in the case of other classification groups, it is called group. In the following embodiment, the classification to which the autonomous district and industry category are applied is referred to as a region, and will be concretely illustrated as a case of being classified (divided) by region.

상기 학습 모델은 DNN 모델로 구체화하여 예시하며, DNN 모델로 선택한 경우의 이점은 후술하겠다.The learning model is exemplified by specifying the DNN model, and the advantages of selecting the DNN model will be described later.

또한, 상기 지역별 구분은 서울 전체에 대하여 자치구로 분류한 경우를 예시할 것이며, 상관 계수는 적용이 용이하고 상관성 도출이 우수한 피어슨 상관 계수로 예시하겠다.In addition, the division by region will exemplify the case of classifying Seoul into autonomous districts, and the correlation coefficient will be exemplified by the Pearson correlation coefficient, which is easy to apply and has excellent correlation derivation.

도시한 데이터 수집부(110)는 지역 및 날짜 별로 수집된 날씨, 인구, 전력 사용량에 대한 데이터를 수집하는 것을 목적으로 한다. 수집하는 데이터의 원천으로는 날씨 정보로는 기상자료개방포털을 예로 들 수 있다. 이를 위해 상기 데이터 수집부(110)는 수집하려는 데이터의 원천 정보를 가지고 있는 외부 기관 서버에 접속 및 해당 원천 정보에 엑세스하여 전송받는 데이터 통신 모듈 등을 구비할 수 있다. The illustrated data collection unit 110 aims to collect data on weather, population, and power usage collected by region and date. As a source of data to be collected, a weather data open portal is an example of weather information. To this end, the data collection unit 110 may include a data communication module that accesses and receives the source information transmitted by accessing an external institution server having source information of the data to be collected.

도시한 데이터 수집부(110)는 관련 환경 정보로서 월간 기상 정보를 수집하고, 전력 데이터로서 과거 전력 사용 데이터를 수집할 수 있다. 수집된 데이터 중 날씨, 인구, 일주일 전 전력 사용량 데이터는 입력 변수로 사용될 수 있다. The illustrated data collection unit 110 may collect monthly weather information as related environment information and collect past power usage data as power data. Among the collected data, weather, population, and electricity usage data a week ago can be used as input variables.

도시한 데이터 전처리부(120)는 상기 데이터 수집부(110)에 의해 수집된 데이터의 스케일링을(min-max scaling)를 수행하고, 모델에 사용할 수 있는 형태(목표 데이터셋 형태)로 만들어주고, 스케일링된 데이터를 상관도 분석부까지 보내주는 것을 목적으로 한다. The illustrated data preprocessing unit 120 performs min-max scaling of the data collected by the data collection unit 110 and makes it into a form that can be used in the model (target data set form), The purpose is to send the scaled data to the correlation analysis unit.

도시한 상관도 분석부(140)는 시간, 인구, 날씨, 1주일 전 전력 측면에서 유사한 월간 전력 사용 패턴을 가진 요소를 분석하고, 그 결과 상관 계수가 높은 지역별 데이터셋을 선별할 수 있다. 이때, 구현에 따라 관련 환경 정보의 유사성도 함께 분석할 수도 있다. The illustrated correlation analysis unit 140 may analyze elements having a similar monthly power usage pattern in terms of time, population, weather, and power one week ago, and as a result, select a regional dataset having a high correlation coefficient. In this case, the similarity of related environmental information may also be analyzed according to implementation.

도시한 상관도 분석부(140)는 상기 목표 데이터셋과 이와 대응될 수 있는 다른 자치구의 데이터셋과의 피어슨 상관계수를 계산하여, 가장 피어슨 상관계수가 높은 10개, 20개, 30개(T10, T20, T30)의 지역별 데이터셋을 선별하는 작업을 수행하기 위한 것이다. The illustrated correlation analysis unit 140 calculates a Pearson correlation coefficient between the target dataset and a dataset of another autonomous region that may correspond thereto, and calculates the Pearson correlation coefficient of 10, 20, 30 (T10) having the highest Pearson correlation coefficient. , T20, and T30) for selecting regional datasets.

도 2는 서울시의 자치구별 전력 사용량의 예시로, 자치구마다 "Household", "Public" 등과 같은 5개의 전력 사용 용도별 카테고리로 구분하여 수집/관리하고 있음을 확인할 수 있다.FIG. 2 is an example of electric power consumption by autonomous district of Seoul, and it can be seen that each district is divided into five categories for each use of electricity, such as "Household" and "Public", and collected/managed.

도 3은 피어슨 상관계수를 계산한 예시를 나타낸다. 도 3에서 T는 도 2의 Total이며, H는 Household이며, P는 Public이며, S는 Service이며, I는 Industrial이다.3 shows an example of calculating the Pearson correlation coefficient. In FIG. 3, T is the Total of FIG. 2, H is Household, P is Public, S is Service, and I is Industrial.

상관 계수가 높다는 것은 예컨대, 도 3에서 각 칸의 값이 1에 가까운 것을 의미한다. 상관 계수의 높고 낮음은 상대적일 수 밖에 없는 바, 상관 계수가 높은 것을 선별하는 것의 실제 구현은, 1에 가까운 상위의 소정 개수를 선별하는 것으로 수행될 수 있다. A high correlation coefficient means, for example, that the value of each cell in FIG. 3 is close to 1. Since high and low correlation coefficients are inevitably relative, the actual implementation of selecting a high correlation coefficient may be performed by selecting a predetermined number of higher values close to 1.

상기 지역별 데이터셋은 소정 개수로 선별되어 예측 학습 모델에 후속되는 사전학습에 사용되는데, 모든 지역별 데이터셋들이 목표 데이터셋과 상관 계수가 낮은 경우에, 많은 개수의 지역별 데이터셋을 적용하여 사전학습을 수행하면, 예측 학습 모델이 오히려 사전학습 과정에서 목표 도메인과 이질적이 될 수 있다. The regional datasets are selected in a predetermined number and used for pre-learning following the predictive learning model. If performed, the predictive learning model may rather become heterogeneous from the target domain in the pre-learning process.

후술하는 실험에 따라, 모든 지역별 데이터셋들의 목표 데이터셋과의 상관 계수 정도와, 선별되는 지역별 데이터셋의 개수에 대하여 다음과 같은 최적의 관계를 도출하였다.According to the experiment described below, the following optimal relationship was derived with respect to the degree of correlation coefficient of all regional datasets with the target dataset and the number of selected regional datasets.

즉, 상관 계수의 평균값이 0.5 이하이면 상관 계수가 높은 상위 5개를, 평균값 0.5 이상 0.7 미만이면 상위 10개를, 평균값 0.7 이상 0.8 미만이면 상위 20개를, 평균값 0.8 이상이면 상위 30개의 지역별 데이터셋들을 선별하여 사전학습에 적용하는 것이 최적의 성능을 가져온다.That is, if the average value of the correlation coefficient is 0.5 or less, the top 5 regions with high correlation coefficients are selected, if the average value is 0.5 or more and less than 0.7, the top 10 regions, if the average value is 0.7 or more and less than 0.8, the top 20 regions, and if the average value is 0.8 or more, the top 30 regional data Selecting sets and applying them to pre-learning results in optimal performance.

도시한 모델 사전학습부(150)는 상기 상관도 분석부(140)에서 선택된 자치구의 데이터셋을 이용하여 학습 모델로서 DNN 모델을 사전학습 하고 사전학습된 모델을 모델 파인튜닝부(160)에 제공하는 것을 목적으로 한다. The illustrated model pre-training unit 150 pre-trains the DNN model as a learning model using the dataset of the autonomous region selected by the correlation analysis unit 140 and provides the pre-trained model to the model fine-tuning unit 160 . aim to do

학습-훈련을 위해 대상 도메인에서 제한된 데이터를 사용할 수 있는 경우 전이학습과 같은 적응 기술을 사용하여 예측 모델의 성능을 향상시킬 수 있다. When limited data is available in the target domain for learning-training, adaptive techniques such as transfer learning can be used to improve the performance of predictive models.

도 4a는 대상 도메인 데이터셋와 유사한 대규모 데이터셋을 사용하여 학습-훈련된 신경망에서 생성 된 가중치를 사용하는 전이학습의 일반적인 구조를 나타낸다. 도시한 과정을 사전학습이라고 한다.Figure 4a shows the general structure of transfer learning using weights generated in a neural network trained-trained using a large-scale dataset similar to the target domain dataset. The illustrated process is called pre-learning.

상기 DNN 모델에 따른 심층 신경망에 대하여 알아보겠다.A deep neural network according to the DNN model will be described.

인공 신경망은 인간의 뇌를 모방 한 기계 학습 알고리즘으로 입력 계층, 하나 이상의 숨겨진 계층, 출력 계층의 세 계층으로 구성되고, 각 계층은 퍼셉트론이라는 여러 노드로 구성된다.An artificial neural network is a machine learning algorithm that mimics the human brain and consists of three layers: an input layer, one or more hidden layers, and an output layer, and each layer consists of several nodes called perceptrons.

각 노드는 이전 계층의 노드에서 값을 가져와 시그 모이드 함수, 쌍곡선 탄젠트 함수, ReLU(정류 선형 단위), ELU(지수 선형 단위), 스케일링 된 노드 활성화 함수를 통해 다음 노드의 활성화를 결정한다. 지수 선형 단위(SELU) 등 프로세스가 반복되고 최종 레이어(출력 레이어)에서 노드가 활성화되어 원하는 출력이 생성된다. ANN의 히든 레이어 수가 2개 이상이면 네트워크를 DNN이라고 칭한다.Each node takes a value from a node in the previous layer and determines the activation of the next node through a sigmoid function, a hyperbolic tangent function, a rectified linear unit (ReLU), an exponential linear unit (ELU), and a scaled node activation function. The process is repeated with exponential linear units (SELU), etc. and nodes are activated in the final layer (the output layer) to produce the desired output. If the number of hidden layers of an ANN is two or more, the network is called a DNN.

도 4b는 월별 전기 부하 예측을 위한 일반적인 심층 신경망 구조로서 MTLF의 일반적인 DNN 구조를 보여준다. Figure 4b shows a typical DNN structure of MTLF as a general deep neural network structure for monthly electrical load prediction.

DNN에 숨겨진 레이어가 많을수록 네트워크가 더 복잡해지며, 네트워크 복잡성이 증가하면 네트워크 성능이 향상 될 수 있다. 그러나 DNN의 구조가 필요 이상으로 복잡한 경우 모델은 보이지 않는 데이터에 대해 성능이 저하될 수 있으며 이를 과적합이라고 한다. 따라서, 적절한 수의 히든 레이어를 선택하는 것이 필수적이다.The more layers hidden in the DNN, the more complex the network, and increasing network complexity can improve network performance. However, if the structure of the DNN is more complex than necessary, the model may perform poorly on unseen data, which is called overfitting. Therefore, it is essential to select an appropriate number of hidden layers.

DNN 기반 MTLF 모델을 구성하기 위해서는 은닉층 수, 은닉층의 노드 수, 활성화 함수 등 다양한 하이퍼 파라미터를 고려해야 한다.In order to construct a DNN-based MTLF model, various hyperparameters such as the number of hidden layers, the number of nodes in the hidden layer, and the activation function must be considered.

도시한 모델 파인튜닝부(160)는 상기 모델 사전학습부(150)에서 사전학습된 DNN 모델을 목표 데이터셋을 사용하여 모델을 재학습하는 것을 목적으로 한다. 도 4c는 모델 사전학습부와 모델 파인튜닝부를 도식적으로 나타낸 것으로, 전이 학습 기반 DNN 모델을 구성하는 예를 나타내는 개념도이다. 도 4d는 월간 전력 수요 예측을 위한 전체적인 데이터 구성 및 과정들을 도시한 개념도이다.The illustrated model fine-tuning unit 160 aims to retrain the model using the target dataset of the DNN model pre-trained in the model pre-training unit 150 . 4C is a schematic diagram of a model pre-training unit and a model fine-tuning unit, and is a conceptual diagram illustrating an example of configuring a transfer learning-based DNN model. 4D is a conceptual diagram illustrating overall data configuration and processes for monthly power demand forecasting.

사전학습을 수행한 다음 더 작은 대상 도메인에 해당하는 데이터셋을 사용하여 상기 사전학습된 학습모델 네트워크가 다시 학습되는데, 프로세스를 파인튜닝이라 칭한다.After performing the pre-training, the pre-trained learning model network is trained again using a dataset corresponding to a smaller target domain, a process called fine tuning.

대상 도메인과 원본 도메인의 데이터가 유사하면 결과 모델이 만족스러운 성능을 보일 수 있다. 따라서 유사한 소스 도메인 데이터를 찾기 위해 PCC(Pearson Correlation Coefficient) 분석과 같은 상관 분석 방법을 사용하여 두 도메인 간의 유사성을 결정할 수 있다.If the data of the target domain and the source domain are similar, the resulting model can show satisfactory performance. Therefore, in order to find similar source domain data, a correlation analysis method such as Pearson Correlation Coefficient (PCC) analysis can be used to determine the similarity between two domains.

PCC 분석은 하기 수학식 1에 의해 정의되며, 여기서, R_XY는 X와 Y 사이의 PCC를 나타내고, n은 관측치 수, X_i 및 Y_i는 시간 i의 값이다. 또한,

및

는 각각 X 및 Y의 평균값이다.The PCC analysis is defined by Equation 1 below, where R _XY represents the PCC between X and Y, n is the number of observations, and X _i and Y _i are the values of time i. In addition,

and

are the average values of X and Y, respectively.

[수학식 1][Equation 1]

PCC는 두 변수 X와 Y 사이의 선형 상관 관계를 측정하는데 사용된다(값은 +1에서 1 사이임). +1 값(-1 값)은 이들 간의 총 양(음) 선형 상관 관계를 나타내는 반면, 0은 선형 상관 관계가 없음을 나타낸다.PCC is used to measure the linear correlation between two variables X and Y (values range from +1 to 1). A value of +1 (a value of -1) indicates a total positive (negative) linear correlation between them, whereas a value of 0 indicates no linear correlation.

앞서 살펴본 도 4c는 전이 학습 기반 DNN 모델을 구성하는 예를 보여준다. 여기서 X_i,S 및 X_i,T는 각각 시간 i에서 소스 도메인과 대상 도메인의 입력이고, Y_i,S 및 Y_i,T는 각각 시간 i에서 소스 도메인과 대상 도메인의 출력이다.4c previously looked at shows an example of configuring a transfer learning-based DNN model. where X _i,S and X _i,T are the inputs of the source and destination domains at time i, respectively, and Y _i,S and Y _i,T are the outputs of the source and destination domains at time i, respectively.

소스 도메인을 선택할 때 가능한 모든 긍정적 및 부정적 추세를 고려하기 위해 PCC의 절대 값이 1에 가까운 도메인을 선택하는 것이 유리하다.When selecting a source domain, it is advantageous to select a domain with an absolute value of PCC close to 1 in order to consider all possible positive and negative trends.

일반적으로는 파인튜닝을 위한 대상 도메인에 해당하는 데이터셋은 상기 목표 데이터셋이 된다.In general, a dataset corresponding to a target domain for fine tuning becomes the target dataset.

도시한 전력 예측부(170)는 상기 모델 파인튜닝부(160)에서 재학습된 모델을 사용하여 실제 전력수요량을 예측하고 reverse normalization을 통하여 실제 예측 전력 값을 도출해내는 것을 목적으로 한다.The illustrated power prediction unit 170 aims to predict the actual power demand by using the retrained model in the model fine tuning unit 160 and derive the actual predicted power value through reverse normalization.

도 5는 본 발명의 사상에 따른 전력 정보 예측 방법의 일 실시예를 도시한 흐름도이다. 5 is a flowchart illustrating an embodiment of a method for predicting power information according to the spirit of the present invention.

도시한 전력 정보 예측 방법은, 전력 데이터들을 지역별로 구분하는 단계(S10); 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 단계(S110); 수집한 과거 전력 데이터를 전처리하고, 지역별 구분에 따라, 지역별 데이터셋들을 생성하는 단계(S120); 상기 지역별 데이터셋들 중 목표 데이터셋을 지정하고(S130), 선택되지 않은 다른 지역별 데이터셋들 중에서 상기 목표 데이터셋과 상관 계수가 높은 것을 선별하는 단계(S140); 선별된 상기 목표 데이터셋과 상관 계수가 높은 지역별 데이터셋을 이용하여 학습 모델을 사전 학습시키는 단계(S150); 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인 튜닝하는 단계(S160); 및 상기 학습 모델로 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계(S170)를 포함할 수 있다.The illustrated power information prediction method includes the steps of dividing power data into regions (S10); Collecting obtainable past power data and related environmental information (S110); pre-processing the collected past power data and generating regional datasets according to regional classification (S120); designating a target dataset among the regional datasets (S130), and selecting a target dataset with a high correlation coefficient from among other unselected regional datasets (S140); pre-training a learning model using the selected target dataset and regional datasets having a high correlation coefficient (S150); fine tuning the learning model using the target dataset (S160); and predicting items of the target dataset in the future with the learning model (S170).

도시한 흐름도에 따른 전력 정보 예측 방법은 지역별로 구분하는 경우로 구체화한 것이며, 지역별로 구분된 전체 지역들의 전부 또는 대상 지역들로 지정된 복수개에 대하여 전력 정보 예측을 수행한다. The power information prediction method according to the illustrated flowchart is embodied as a case of division by region, and power information prediction is performed for all of the entire regions divided by region or a plurality of designated target regions.

예컨대, 서울 전체 자치구에 대하여 특정 전력 정보 예측 프로그램을 도입하는데, 과거 1달의 서울 전체 자치구의 전력 데이터만을 이용하여 학습을 수행하여, 서울 전체 자치구의 전력 수요를 예측하는 경우로 살펴볼 수 있다. 즉, 충분치 않은 기간의 선행 데이터에 문제에 대하여, 마찬가지로 충분치 않은 기간의 다른 지역들의 선행 데이터를 이용하여 전이학습을 수행하는 것으로 대응한다. For example, a specific electric power information prediction program is introduced for the entire autonomous district of Seoul, and it can be viewed as a case of predicting the electric power demand of the entire autonomous district of Seoul by performing learning using only electric power data of the entire autonomous district of Seoul for the past one month. That is, it responds to the problem of an insufficient period of precedent data by performing transfer learning using precedent data of other regions with an insufficient period as well.

예컨대, 전체 지역들이 모두 대상 지역들로 지정된 경우, 모든 각 지역별 데이터셋에 대하여, 상기 목표 데이터셋으로 지정하고(S130, S184), 상관 계수가 높은 것을 선별하는 단계(S140) 내지 예측하는 단계(S170)를 반복하여 또는 병행하여 수행하게 된다.For example, when all regions are designated as target regions, all regional datasets are designated as the target dataset (S130 and S184), and the step (S140) to predicting (S140) to predicting (S130, S184) S170) is repeated or performed in parallel.

도시한 흐름도에서는, 하나의 특정 지역의 목표 데이터셋에 대하여 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계(S170)를 수행한 이후, 대상 지역들 중 예측을 수행하지 않고 남은 지역들이 있는지 확인하여(S180), 남은 지역들 중 하나에 대하여 목표 데이터셋을 변경하여(S184), 상관 계수가 높은 것을 선별하는 단계(S140)로 복귀한다.In the illustrated flowchart, after performing the step (S170) of predicting items of the target dataset in the future with respect to the target dataset of a specific region, check whether there are remaining regions without performing prediction among the target regions. (S180), the target data set is changed for one of the remaining regions (S184), and the process returns to the step (S140) of selecting the one having a high correlation coefficient.

상기 S10 단계의 전력 데이터들에 대한 지역별 구분은, 미리 다른 기관(서울시 등)에서 수행될 수 있으며, 본 발명의 사상에 따른 전력 정보 예측 방법 수행시, 상기 S10 단계는 다른 기관에서 수행된 지역별 구분 정보를 수취할 수 있다. The regional division of the power data in step S10 may be performed in advance by another institution (such as Seoul), and when the method for predicting power information according to the spirit of the present invention is performed, the region classification performed by another organization in step S10 is performed in advance. information can be received.

상기 S110 단계는, 관련 기관의 서버로부터 필요한 정보 및 데이터를 전송받을 수 있다. 예컨대, 과거 전력 소비 데이터는 전력 회사 서버로부터, 날씨 정보는 기상청 서버로부터, 인구 정보는 지자체 서버로부터 전송받을 수 있다.In step S110, necessary information and data may be transmitted from a server of a related institution. For example, past power consumption data may be transmitted from a power company server, weather information may be transmitted from a weather service server, and population information may be transmitted from a local government server.

상기 S120 단계에서는 S110 단계에서 수집한 데이터/정보를 스케일링을(min-max scaling)를 수행하고, 상기 학습 모델에 사용할 수 있는 포맷으로 변경할 수 있다. 예컨대, 도 2 및 3의 경우, 서울의 모든 자치구에 대하여 지역별 데이터셋을 생성할 수 있다. 각 지역별 데이터셋은 해당 자치구의 단위 기간 당 전체(Total) 전력 소비, 가정용(Household) 전력 소비, 공공용(Public) 전력 소비, 서비스업용(Service) 전력 소비, 산업용(Industrial) 전력 소비의 항목들로 이루어질 수 있다.In step S120, min-max scaling is performed on the data/information collected in step S110, and it can be changed into a format usable for the learning model. For example, in the case of FIGS. 2 and 3 , a regional dataset may be generated for all autonomous districts of Seoul. The data set for each region consists of items of total power consumption, household power consumption, public power consumption, service power consumption, and industrial power consumption per unit period of each autonomous district. can be done

상기 S130 단계에서는 미래의 상술한 전력 소비의 항목들을 구하는 대상이 되는 자치구의 지역별 데이터셋을 목표 데이터셋으로 지정할 수 있다.In the step S130, a regional data set of an autonomous district, which is a target for obtaining the above-described items of future power consumption, may be designated as a target data set.

상기 관련 환경 정보로서, 각 자치구의 거주 인구, 날씨 등이 적용될 수 있으며, 상기 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계(S170)에서, 각 지역(즉, 자치구)의 예측 시점인 미래의 거주 인구, 날씨 정보를 적용하는 것이 유리하다. As the related environmental information, the resident population of each autonomous district, weather, etc. may be applied, and in the step (S170) of predicting the items of the future target dataset, the prediction time of each region (ie, autonomous district) of the future It is advantageous to apply the resident population and weather information.

이를 위해, 상기 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계(S170)에서는, 미래의 관련 환경 정보에 대한 타 기관(예 : 기상청)의 예측값을 수집하여 이용할 수 있다. To this end, in the step of predicting the items of the target dataset in the future ( S170 ), prediction values of other organizations (eg, the Meteorological Administration) for future related environmental information may be collected and used.

도 6는 본 발명의 사상에 따른 전력 정보 예측 방법의 다른 실시예를 도시한 흐름도이다.6 is a flowchart illustrating another embodiment of a method for predicting power information according to the spirit of the present invention.

도시한 전력 정보 예측 방법은, 특정 지역의 취득가능한 과거 전력 데이터 및 관련 환경 정보를 수집하는 단계(S210); 수집한 과거 전력 데이터를 전처리하여 목표 데이터셋을 생성하는 단계(S220); 기 획득된 다른 지역들의 지역별 데이터셋들 중에서 상기 목표 데이터셋과 상관 계수가 높은 것을 선별하는 단계(S240); 선별된 상기 목표 데이터셋과 상관 계수가 높은 지역별 데이터셋을 이용하여 학습 모델을 사전 학습시키는 단계(S250); 상기 목표 데이터셋을 이용하여 상기 학습 모델을 파인 튜닝하는 단계(S260); 및 상기 학습 모델로 미래의 상기 목표 데이터셋의 항목들을 예측하는 단계(S270)를 포함할 수 있다.The illustrated method for predicting power information includes: collecting historical power data and related environmental information that can be acquired in a specific area (S210); generating a target data set by pre-processing the collected past power data (S220); selecting one having a high correlation coefficient with the target dataset from among previously acquired regional datasets of other regions (S240); pre-training a learning model using the selected target dataset and regional datasets having a high correlation coefficient (S250); fine tuning the learning model using the target dataset (S260); and predicting items of the target dataset in the future with the learning model (S270).

도시한 흐름도에 따른 전력 정보 예측 방법은 지역별로 구분하는 경우로 구체화한 것이며, 하나의 특정 지역의 전력 정보 예측을 수행하는 과정을 나타낸 것이다.The power information prediction method according to the illustrated flowchart is embodied as a case of division by region, and shows a process of performing power information prediction of one specific region.

예컨대, 서울의 특정 자치구에 대하여 특정 전력 정보 예측 프로그램을 도입하는데, 과거 1달의 상기 특정 자치구의 전력 데이터만을 이용하여 학습을 수행하되, 서울의 다른 자치구들의 과거 전력 데이터들을 이용하여 예측하는 경우로 살펴볼 수 있다. 이때, 다른 자치구들 중 하나 이상은 과거 2달 이상의 비교적 긴 과거 전력 데이터를 가질 수도 있다.For example, a specific electric power information prediction program is introduced for a specific autonomous district of Seoul. Learning is performed using only the electric power data of the specific autonomous district of the past month, but the prediction is made using the past electric power data of other autonomous districts in Seoul. can take a look At this time, one or more of the other autonomous regions may have relatively long past power data of more than two months in the past.

다음, 본 발명의 사상에 따른 높은 상관 관계를 가지는 동종의 다른 지역별 데이터셋을 이용한 전이학습의 효과에 대하여 살펴보겠다. Next, we will look at the effect of transfer learning using different regional datasets of the same type having high correlation according to the spirit of the present invention.

효과 검증을 위한 제안 모델은 월별 부하 예측 모델이며, 이를 구성하는 데 사용한 데이터셋을 설명한다. 서울시에서 제공하는 데이터셋에는 2005년 1월부터 2018년 12월까지 서울 25개 구의 월별 전기 부하가 포함되어 있다. 상기 데이터셋은 도 2 및 도 3과 같이 가정, 공공, 산업, 서비스, 총 5 가지 범주로 나눌 수 있다. The proposed model for effect verification is a monthly load prediction model, and the dataset used to construct it is described. The dataset provided by the Seoul Metropolitan Government includes the monthly electrical loads of 25 districts in Seoul from January 2005 to December 2018. The data set can be divided into a total of five categories, such as household, public, industrial, and service, as shown in FIGS. 2 and 3 .

가정용 데이터셋은 순수 주거 고객의 월별 전기 소비량으로 구성되고, 공개 데이터셋에는 사회 기반 시설 및 정부 기관과 같은 공공 목적을 위한 월별 전기 소비량이 포함되며, 마찬가지로 산업 데이터셋은 광업, 제조, 건설 등에 대한 월별 전력 소비량을 보여주고, 서비스 데이터셋에는 물/폐수 처리, 고속 운송 시스템, 상업용 사무실 및 소매 건물에 대한 월별 전기 소비량이 포함되며, 마지막으로 총 데이터셋은이 상기 네 가지 범주 전력 소비량의 합계를 나타낸다.The residential dataset consists of monthly electricity consumption by pure residential customers, the public dataset includes monthly electricity consumption for public purposes such as infrastructure and government agencies, and the industrial dataset consists of the monthly electricity consumption for mining, manufacturing, construction, etc. It shows monthly electricity consumption, the service dataset includes monthly electricity consumption for water/wastewater treatment, high-speed transportation systems, commercial office and retail buildings, and finally the total dataset is the sum of these four categories of electricity consumption. indicates.

결과적으로 데이터셋은 5개 카테고리와 25개 자치구를 포함하는 14년 동안 125개의 시퀀스 데이터로 구성되며, 전이 학습을 위한 소스 도메인으로 사용된다. 여기서, 각 자치구별로 지역별 데이터셋을 구성한다.As a result, the dataset consists of 125 sequence data over 14 years covering 5 categories and 25 boroughs, and is used as a source domain for transfer learning. Here, a regional data set is configured for each autonomous district.

입력 데이터(정보) 구성에 대하여 살펴보면, 먼저, 관련 환경 정보로서 달력 데이터가 있다. 일반적으로 전력 소비량은 다양한 요인에 의해 결정될 수 있다. 그 중 세 가지 주요 요소를 고려할 수 있다. 첫 번째 주요 요소는 연도, 월, 계절, 일 수, 주말 수 및 공휴일 수와 같은 달력 데이터이다.Looking at the input data (information) configuration, first, there is calendar data as related environment information. In general, power consumption can be determined by various factors. Of these, three main factors can be considered. The first major element is calendar data, such as the number of years, months, seasons, days, weekends, and holidays.

월에는 주기적인 속성이 있다. 예컨대, 12월과 1월은 시간적으로 인접해 있기 때문에 기온과 계절면에서 비슷한 특성을 가지고 있다. 이러한 주기성을 반영하기 위해 월 데이터를 정현파를 반영한 연속 데이터로 변환할 수 있다.Months have periodic properties. For example, since December and January are temporally adjacent, they have similar characteristics in terms of temperature and season. In order to reflect this periodicity, the monthly data can be converted into continuous data reflecting a sine wave.

또한 공휴일에는 일반적으로 전력 소비가 감소한다. 따라서 한 달의 일수와 공휴일 수는 그 달의 전력 소비량과 관련이 있다. 특히 주중 공휴일 수는 모델에 큰 영향을 미칠 수 있으므로 입력 변수에 대한 주중 공휴일 수도 고려하는 것이 유리하다. 또한 연휴 기간이 연장되면 평소와 다른 전력 수요 패턴이 발생할 가능성이 있는 바, 해당 월의 최대 휴일 길이도 고려한다. 예컨대, 해당 웹 사이트(https://www.timeanddate.com) 시간 및 날짜에서 휴일 데이터를 수집할 수 있다. 결과적으로 달력 데이터에서 24개의 입력 변수를 선택할 수 있다.Also, power consumption is generally reduced during public holidays. Therefore, the number of days and holidays in a month are related to the electricity consumption in that month. In particular, it is advantageous to consider the number of weekday holidays for the input variable, as the number of weekday holidays can have a large impact on the model. In addition, if the holiday period is extended, there is a possibility that an unusual power demand pattern may occur, so the maximum length of holiday for the month is also taken into consideration. For example, the website (https://www.timeanddate.com) may collect holiday data from the time and date. As a result, 24 input variables can be selected from the calendar data.

모델 구축을 위해 고려한 다른 환경 정보로서 지역의 인구 데이터가 있다. 서울 지역 인구 정보는 서울 오픈 데이터 플라자(http://data.seoul.go.kr/)에서 확인할 수 있다. 여기서, 확인가능한 인구 데이터에는 인구, 구역의 면적, 인구 밀도, 이주 중, 이주 외부 및 순 이주가 포함된다. 상기 데이터는 ID 카드 정보를 기반으로 매년 3 월에 결정되며, 두 가지 유형의 인구 데이터는 연간 및 월간이다.Another environmental information considered for model building is local population data. Information on the population of Seoul can be found at the Seoul Open Data Plaza (http://data.seoul.go.kr/). Here, identifiable population data includes population, area of district, population density, in-migration, out-of-migration and net migration. The data is determined in March of each year based on ID card information, and the two types of population data are annual and monthly.

연간 인구 데이터에는 인구, 지구 면적 및 인구 밀도가 포함되고, 월간 인구 데이터에는 마이그레이션 중, 외부 마이그레이션 및 순 마이그레이션이 포함된다. 한 구역(즉, 자치구)의 인구는 해당 구역의 전력 소비와 높은 상관 관계가 있다. 더 자세한 분석을 위해 월간 인구 데이터를 고려할 수 있다.Annual population data includes population, global area and population density, and monthly population data includes during migration, outward migration and net migration. The population of a district (ie, a borough) is highly correlated with electricity consumption in that district. Monthly population data can be considered for further analysis.

마지막 관련 환경 정보로서 온도, 습도 및 강수량과 같은 날씨 데이터를 적용한다. 기상청(KMA)(3일전, 중기, 1개월/3개월 전망으로 구성되는 장기 예측 등 다양한 유형의 예측을 제공)에서 월간 기상 데이터를 수집하였다. MTLF의 1개월 전망 예측을 사용하여 월간 일기 예보를 제공할 수 있다. 1개월 전망에는 평균 기온, 평균 최고 기온, 평균 최저 기온, 1개월 총 강수량이 포함된다.As the last relevant environmental information, we apply weather data such as temperature, humidity and precipitation. Monthly weather data were collected from the Korea Meteorological Administration (KMA), which provides various types of forecasts, including three-day, medium-term, and long-term forecasts consisting of one-month/three-month forecasts. Monthly weather forecasts can be provided using MTLF's one-month forecasts. One-month projections include average temperature, average maximum temperature, average minimum temperature, and total precipitation in one month.

대상 학습 모델은 특정 자치구에 대한 전력 사용 예측 모델을 제공하는 것을 목적으로 한다. 따라서, 대상 도메인, 즉, 목표 데이터셋은 자치구별 전력량으로 표시되고, 다른 자치구의 자치구별 전력량이 소스 도메인, 즉, 지역별 데이터셋이 된다. 앞서 언급했듯이 25개 자치구과 5개 범주에서 총 125개의 조합들이 존재한다.The target learning model aims to provide a power usage prediction model for a specific autonomous district. Accordingly, the target domain, that is, the target dataset, is expressed as the amount of electricity for each autonomous district, and the amount of electricity for each autonomous district of another autonomous region becomes the source domain, that is, the data set for each region. As mentioned earlier, there are a total of 125 unions in 25 boroughs and 5 categories.

먼저 소스 도메인 인 나머지 124 개의 전기 부하 데이터에서 유사한 전기 부하 데이터를 사용하여 월별 전기 부하 예측 모델을 구성한다. 이를 위해 먼저 각 소스 데이터와 대상 데이터를 학습 및 테스트 셋으로 나누었다. 소스 도메인에서 유사한로드 데이터를 찾기 위해 소스 데이터와 대상 데이터의 학습 세트 사이의 PCC를 계산하고, 그런 다음 선택한 소스 데이터를 사용하여 DNN 기반 예측 학습 모델을 구성하였다.First, a monthly electrical load prediction model is constructed using similar electrical load data from the remaining 124 electrical load data as the source domain. To this end, we first divided each source data and target data into training and test sets. To find similar load data in the source domain, we computed the PCC between the training sets of the source and target data, and then constructed a DNN-based predictive learning model using the selected source data.

마지막으로 목표 데이터의 학습 세트(목표 데이터셋)를 사용하여 예측 학습 모델을 미세 조정하고 목표 데이터의 테스트 셋을 사용하여 모델을 평가하였다. 예측 학습 모델을 구성하는 전체 단계는 앞서 도 4b 내지 4d에 나타낸 바와 같다.Finally, the predictive learning model was fine-tuned using the training set of the target data (target dataset) and the model was evaluated using the test set of the target data. The overall steps of constructing the predictive learning model are as shown in FIGS. 4B to 4D above.

상기 예측 학습 모델의 효과를 평가하기 위해 매우 광범위한 실험을 수행했다. 실험을 위해 2005년 1월부터 2018년 12월까지 서울시 25개구의 월별 전기 부하 데이터 5개 항목을 수집했다. 그중 2005년 1월부터 2016년 12월까지의 데이터를 학습 셋으로, 2017년 1월부터 2018년 12월까지의 데이터를 테스트 셋으로 사용했습니다. 실험에서 우리는 지구와 범주의 모든 조합을 대상 도메인으로, 다른 데이터를 소스 도메인으로 고려했다. 모든 실험은 Python 3.7.6에서 수행되었으며 모델은 TensorFlow 1.13.1을 사용하여 구성되었다.A very extensive experiment was performed to evaluate the effectiveness of the predictive learning model. For the experiment, five items of monthly electrical load data were collected from January 2005 to December 2018 in 25 districts of Seoul. Among them, data from January 2005 to December 2016 were used as the training set, and data from January 2017 to December 2018 were used as the test set. In our experiments, we considered all combinations of districts and categories as target domains and other data as source domains. All experiments were performed in Python 3.7.6 and the model was constructed using TensorFlow 1.13.1.

125개 도메인에 대해 계산된 PCC값이 포함된 표의 일부는 앞서 도 3에 나타낸 바와 같다. 여기서, 각 셀은 PCC값에 따라 고유한 색상으로 표시되었다.A part of the table including the PCC values calculated for 125 domains is as shown in FIG. 3 above. Here, each cell is marked with a unique color according to the PCC value.

도 3은 자치구별, 카테고리별 데이터 특성을 개략적으로 나타낸 것으로서, 예를 들어 종로의 총계는 PCC값이 높기 때문에 대부분의 다른 자치구의 경우와 매우 유사하다. 반면 용산의 산업은 PCC값이 낮기 때문에 다른 지역의 대부분의 경우와 매우 상이하다. 이는 종로의 전체 데이터가 전이 학습에 적합한 반면 용산의 산업 데이터는 그렇지 않다는 것으로 해석될 수 있다. 결과적으로 데이터 간의 유사성을 잘 고려하면 다양한 영역의 전기 데이터를 전이 학습에 효과적으로 사용할 수 있다.3 schematically shows data characteristics for each autonomous district and each category. For example, the total of Jongno is very similar to that of most other autonomous districts because the PCC value is high. On the other hand, the industry in Yongsan is very different from most of the other regions because the PCC value is low. This can be interpreted that the entire data of Jongno is suitable for transfer learning, whereas the industrial data of Yongsan is not. As a result, if similarity between data is well considered, electrical data from various areas can be effectively used for transfer learning.

또한, 예측 모델 학습을 위한 데이터 수의 영향을 관찰하기 위해 PCC 측면에서 가장 유사한 상위 10개(DNN_T10), 20개(DNN_T20), 30개(DNN_T30) 도메인을 선택하여 세 가지 DNN 모델을 구성하였다.In addition, three DNN models were constructed by selecting the top 10 (DNN_T10), 20 (DNN_T20), and 30 (DNN_T30) domains that were most similar in terms of PCC to observe the effect of the number of data for training the predictive model.

하기 표 1는 각 자치구별 5개 범주에서 가장 높은 10, 20 및 30 PCC 상관 계수값의 평균을 나열한다. 여기서, 성능 비교를 위해 평균 절대 백분율 오차(MAPE)와 정규화된 평균 제곱근 오차(NRMSE)를 고려했다.Table 1 below lists the averages of the highest 10, 20, and 30 PCC correlation coefficient values in the five categories for each borough. Here, the mean absolute percentage error (MAPE) and the normalized root mean square error (NRMSE) were considered for performance comparison.

[표 1][Table 1]

MAPE 및 NRMSE는 하기 수학식 1 및 2에 따라 정의될 수 있다.MAPE and NRMSE may be defined according to Equations 1 and 2 below.

[수학식 1][Equation 1]

[수학식 2][Equation 2]

상기 수학식들에서, n은 관측치 수,

및

는 각각 시간 i에서의 실제 값과 예측 값,

는 실제 값의 평균이다.In the above equations, n is the number of observations,

and

are the actual and predicted values at time i, respectively,

is the average of the actual values.

상기 세 가지 모델 외에도 비교를 위해 다음 4개의 예측 모델들 - 다중 선형 회귀(MLR), RF, 극한 기울기 부스트(XGB) 및 기준 DNN - 을 고려했다. 모든 분류에 대한 MAPE 및 NRMSE 값을 계산한 다음 각 학군의 평균 값을 계산했다. In addition to the above three models, the following four predictive models were considered for comparison - multiple linear regression (MLR), RF, extreme gradient boost (XGB) and baseline DNN. MAPE and NRMSE values for all classifications were calculated and then the average value for each school district was calculated.

하기 표 2와 표 3은 각 자치구에 대한 5개 데이터셋의 평균 MAPE 및 NRMSE를 나타낸다.Tables 2 and 3 below show the average MAPE and NRMSE of the five datasets for each autonomous district.

[표 2][Table 2]

(DNN : 심층 신경망; DNN_T10 : 상위 10개의 유사한 데이터를 사용하여 사전학습된 DNN; DNN_T20 : 상위 20개의 유사한 데이터를 사용하여 사전학습된 DNN; DNN_T30 : 상위 30개의 유사한 데이터를 사용하여 사전학습된 DNN; MLR : 다중 선형 회귀, RF : 랜덤 포레스트, XGB : 극도의 그라디언트 부스팅)(DNN: deep neural network; DNN_T10: DNN pre-trained using top 10 similar data; DNN_T20: DNN pre-trained using top 20 similar data; DNN_T30: DNN pre-trained using top 30 similar data ;MLR: Multiple Linear Regression, RF: Random Forest, XGB: Extreme Gradient Boosting)

[표 3][Table 3]

(DNN : 심층 신경망; DNN_T10 : 상위 10개의 유사한 데이터를 사용하여 사전학습된 DNN; DNN_T20 : 상위 20개 유사한 데이터를 사용하여 사전학습된 DNN; DNN_T30 : 상위 30개의 유사한 데이터를 사용하여 사전학습된 DNN; MLR : 다중 선형 회귀, RF : 랜덤 포레스트, XGB : 극도의 그라디언트 부스팅.)(DNN: deep neural network; DNN_T10: DNN pre-trained using top 10 similar data; DNN_T20: DNN pre-trained using top 20 similar data; DNN_T30: DNN pre-trained using top 30 similar data ;MLR: Multiple Linear Regression, RF: Random Forest, XGB: Extreme Gradient Boosting.)

하이퍼 매개 변수를 조정하는데 가장 일반적으로 사용되는 방법은 관심있는 하이퍼 매개 변수의 가능한 모든 조합을 시도하는 그리드 검색이다. 따라서, 교차 검증이 있는 그리드 검색을 사용하여 RF 및 XGB 모델에 대한 최적의 하이퍼 매개 변수를 선택했다.The most commonly used method for tuning hyperparameters is a grid search that tries all possible combinations of hyperparameters of interest. Therefore, we used grid search with cross-validation to select optimal hyperparameters for RF and XGB models.

그런데, 데이터를 학습, 검증 및 테스트 세트로 나누는 하이퍼 매개 변수 조정 방법은 검증 세트를 과적 합하는 문제를 일으킬 수 있는 바, 이를 방지하기 위해 데이터를 훈련 및 테스트 세트로 분할 한 다음 학습 데이터에 대해 5회 교차 검증을 수행했다.By the way, the hyperparameter tuning method that divides the data into training, validation, and test sets can cause the problem of overfitting the validation set. Cross-validation was performed.

도 7a와 7b는 예측 모델의 MAPE 및 NRMSE에 대한 박스형 그래프이다.7A and 7B are box graphs for MAPE and NRMSE of predictive models.

세 가지 전이 학습 기반 MTLF 모델은 기준 DNN 및 기타 기계 학습 기반 모델보다 더 나은 예측 성능을 가지고 있음을 알 수 있다.It can be seen that the three transfer learning-based MTLF models have better predictive performance than the baseline DNN and other machine learning-based models.

도 8은 평균 절대 백분율 오차(MAPE) 감소율 대 피어슨 상관 계수의 관계를 나타낸 그래프이다.8 is a graph showing the relationship between the mean absolute percentage error (MAPE) reduction rate versus the Pearson correlation coefficient.

도 8의 그래프는 DNN_T10, 20, 30에 대한 PCC 값에 대한 MAPE 감소율을 제시하여 직관적으로 추세를 보여준다. x축은 PCC를 나타내고 y축은 MAPE 감소율을 나타내며 각 점은 제안 된 모델의 각 PCC에 대한 MAPE 감소율을 나타낸다. 중앙의 빨간색 선은 MAPE 값의 추세선을 나타낸다. 도시한 그래프는 양의 추세선 기울기를 나타내므로 전이학습에 해당하는 PCC 값의 증가는 성능 향상과 밀접한 관련이 있음을 알 수 있다.The graph of FIG. 8 shows the trend intuitively by presenting the MAPE reduction rate for the PCC values for DNN_T10, 20, and 30. The x-axis represents the PCC, the y-axis represents the MAPE reduction rate, and each dot represents the MAPE reduction rate for each PCC in the proposed model. The central red line represents the trend line of MAPE values. Since the graph shown shows a positive trend line slope, it can be seen that an increase in the PCC value corresponding to transfer learning is closely related to the performance improvement.

정리하면, 본 발명의 효과를 알아보기 위한 구체적인 구성에서는, 피어슨 상관계수 분석과 전이학습을 이용하여 시간, 인구, 날씨, 1주일 전 전력 측면에서 유사한 월간 전력 사용 패턴을 가진 요소를 분석하고 이를 모델 학습에 이용한다. 모델을 연관된 데이터로 미리 사전학습 함으로써, 모델의 정확도를 높일 수 있다.In summary, in a specific configuration for examining the effects of the present invention, elements with similar monthly power usage patterns in terms of time, population, weather, and power one week ago are analyzed and modeled using Pearson correlation coefficient analysis and transfer learning. use for learning By pre-training the model with the associated data, the accuracy of the model can be increased.

특히, 도 7a는 여러 예측 모델과 본 모델(DNN_T10, DNN_T20, DNN_T30)의 예측 성능을 비교한 그래프이다. 시간, 인구, 날씨, 1주일전 전력 측면뿐만 아니라 성별, 연령대 등 다양한 정보가 추가되어도 동일한 프로세스로 분석 가능하다. 히트맵을 통해 요소와 요소 사이의 연관도를 시각화할 수 있다.In particular, FIG. 7A is a graph comparing prediction performance of several prediction models and the present models (DNN_T10, DNN_T20, DNN_T30). Even if various information such as time, population, weather, and electricity a week ago, as well as gender and age group are added, it can be analyzed in the same process. A heat map allows you to visualize elements and the degree of association between elements.

일반 데이터를 사용한 경우보다 정확한 월간 전력 수요 예측을 통해 연간 전력 스케줄링을 실제 사용량에 맞게 적용하여 손실되는 전력을 줄일 수 있음을 알 수 있다.It can be seen that the annual power scheduling can be applied to the actual usage through more accurate monthly power demand prediction than when general data is used, thereby reducing power loss.

이상의 설명은 본 발명의 기술 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 다양한 수정 및 변형이 가능할 것이다. 따라서, 본 발명에 개시된 실시예들은 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술 사상은 본 발명의 권리범위에 포함되는 것으로 해석되어야 할 것이다.The above description is merely illustrative of the technical spirit of the present invention, and various modifications and variations will be possible without departing from the essential characteristics of the present invention by those skilled in the art to which the present invention pertains. Therefore, the embodiments disclosed in the present invention are not intended to limit the technical spirit of the present invention, but to illustrate, and the scope of the technical spirit of the present invention is not limited by these embodiments. The protection scope of the present invention should be interpreted by the following claims, and all technical ideas within the scope equivalent thereto should be construed as being included in the scope of the present invention.

110 : 데이터 수집부
120 : 데이터 전처리부
140 : 상관도 분석부
150 : 모델 사전학습부
160 : 모델 파인튜닝부
170 : 예측부
180 : 저장부110: data collection unit
120: data preprocessor
140: correlation analysis unit
150: model pre-learning unit
160: model fine tuning unit
170: prediction unit
180: storage

Claims

a data collection unit for collecting historical power data and related environmental information obtainable by region or group;
a data pre-processing unit for pre-processing the data collected by the data collecting unit;
a correlation analysis unit for selecting a regional dataset having the highest Pearson correlation coefficient based on a target dataset for a specified region and other unspecified regional datasets;
a model pre-training unit for pre-learning a learning model using the regional dataset selected by the correlation analysis unit;
a model fine-tuning unit for fine-tuning the learning model using the target dataset; and
A prediction unit that predicts items of the target dataset in the future by using the learning model
Power information prediction system comprising a.

According to claim 1,
A storage unit that stores the data and information collected by the data collection unit, and stores the learning model
Power information prediction system further comprising a.

According to claim 1,
The correlation coefficient is a Pearson correlation coefficient,
The learning model is a power information prediction system that is a DNN model.

4. The method of claim 3,
The correlation analysis unit,
If the average value of the correlation coefficient is 0.5 or less, the top 5 high correlation coefficients are selected, if the average value is 0.5 to less than 0.7, the top 10 are selected, if the average value is 0.7 to less than 0.8, the top 20 data sets, and if the average value is 0.8 or more, the top 30 regional datasets are selected. A power information prediction system that selects.

4. The method of claim 3,
The correlation analysis unit,
Power information prediction system for performing correlation analysis according to the following equation.

(where R _XY is the PCC between X and Y, n is the number of observations, X _i and Y _i are the values of time i,

and

is the average value of X and Y, respectively)

According to claim 1,
The data collection unit,
A data communication module that connects to an external agency server that has the source information of the data to be collected and accesses and receives the source information
Power information prediction system comprising a.

classifying the power data by region or group;
Collecting obtainable past power data and related environmental information;
preprocessing the collected past power data, and generating regional or group-specific datasets according to region or group classification;
designating a target dataset from among the datasets for each region or group, and selecting a data set having a high correlation coefficient with the target dataset from among other unselected datasets for each region or group;
pre-training a learning model using the selected target data set and a data set for each region or group having a high correlation coefficient;
fine-tuning the learning model using the target dataset; and
predicting items of the target dataset in the future with the learning model
Power information prediction method comprising a.

8. The method of claim 7,
In the step of predicting the items of the target dataset in the future, the power information prediction method for collecting and using prediction values of other organizations for the future related environmental information.

8. The method of claim 7,
Power information prediction method, characterized in that for all regional datasets, specifying the target dataset, and selecting or predicting the one having a high correlation coefficient.

Collecting obtainable past power data and related environmental information of a specific region or group;
generating a target data set by pre-processing the collected past power data;
selecting one having a high correlation coefficient with the target dataset from among previously acquired regional datasets of other regions;
pre-training a learning model using the selected target dataset and regional datasets having a high correlation coefficient;
fine-tuning the learning model using the target dataset; and
predicting items of the target dataset in the future with the learning model
Power information prediction method comprising a.