KR20210063775A

KR20210063775A - Device for predicting price of asset use of convolution neural network

Info

Publication number: KR20210063775A
Application number: KR1020190152284A
Authority: KR
Inventors: 정현규; 윤재훈; 이유진; 김태욱
Original assignee: (주)크래프트테크놀로지스
Priority date: 2019-11-25
Filing date: 2019-11-25
Publication date: 2021-06-02

Abstract

본 발명에 따른 CNN을 이용한 자산 가격 예측 장치는, 예측하고자 하는 자산에 대한 입력 데이터와 상기 자산의 과거 가격 데이터 간의 기준일(T)과 예측운용기간(K)의 시계열 길이만큼의 차이를 두고 피팅한 데이터를 입력받는 트리 모듈과, 상기 트리 모듈을 통해 예측된 예측 가격 데이터 및 예측운용기간(K)에 해당하는 실제 가격 데이터 간의 비교값과 이미지 맵의 변수들에 기초하여 생성된 이미지 맵을 입력받는 CNN 모듈을 포함하며, 상기 CNN모듈을 통해 상기 예측 가격 데이터에 대해 학습시키고, 상기 CNN 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측하고, 상기 예측된 이미지 맵에 기초하여 상기 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측한다. The asset price prediction apparatus using CNN according to the present invention is fitted with a difference as much as the time series length of the reference date (T) and the predicted operating period (K) between the input data for the asset to be predicted and the past price data of the asset. A tree module that receives data, a comparison value between the predicted price data predicted through the tree module, and the actual price data corresponding to the forecast operating period (K) and an image map generated based on the variables of the image map are input Containing a CNN module, learning about the predicted price data through the CNN module, predicting the image map corresponding to the previous of a specific time to be predicted through the CNN module, based on the predicted image map Predict the final predicted price data for a specific point in time to be predicted.

Description

Asset price prediction device using CNN (Convolution Neural Network) {DEVICE FOR PREDICTING PRICE OF ASSET USE OF CONVOLUTION NEURAL NETWORK}

본 발명은 CNN(Convolution Neural Network)을 이용하여 자산의 가격을 예측하는 장치에 관한 것이다. The present invention relates to an apparatus for predicting the price of an asset using a Convolution Neural Network (CNN).

채권딜러제는 딜러로 지정된 증권사가 자체 보유자금으로 투자자들과 직접 매매거래를 행하는 제도를 의미한다. 증권사는 향후 채권수익률을 분석 및 전망하고, 주식 매매의 시기와 비슷하게 매수 및 매도 호가 수익률을 제시한다. 투자자는 증권사가 제시한 수익률이 합당하다고 판단될 경우, 채권을 매수 또는 매도할 수 있다. The bond dealer system refers to a system in which securities companies designated as dealers directly trade with investors with their own funds. Securities companies analyze and forecast future bond yields, and present bid and ask price yields similar to the timing of stock trading. Investors may buy or sell bonds if they determine that the rate of return offered by the securities company is reasonable.

증권사는 향후 채권수익률을 분석 및 전망하기 위해서는 경제적 지표, 기술적 지표 등과 같이 가격을 예측하기 위한 계량적 지표를 이용하여 예측하며, 이러한 자산의 예측 기술과 관련하여, 선행기술인 한국등록특허 제10-0410714호는 인터넷을 이용한 금융자산 가격예측 정보제공시스템을 개시하고 있다. In order to analyze and forecast future bond yields, securities companies make predictions using quantitative indicators to predict prices, such as economic indicators and technical indicators. Ho discloses a financial asset price prediction information providing system using the Internet.

자산의 가격을 예측하기 위해서는 자기 상관을 학습할 수 있는 인공지능 알고리즘이 주로 이용된다. 자기 상관을 학습하는 알고리즘은 대표적으로 RNN, LSTM 등이 존재하며, 이전의 학습파라미터가 다음 학습에 영향을 주는 구조로 구성된다. In order to predict the price of an asset, an artificial intelligence algorithm that can learn autocorrelation is mainly used. Algorithms for learning autocorrelation typically include RNN and LSTM, and it is composed of a structure in which previous learning parameters affect next learning.

그러나 자산은 고용지수, 물가지수, 산업생산지수, 금리, 상대강도지표 등과 같이 무수히 많은 계량적 지표들에 의해 결정된다. 이 무수히 많은 계량적 지표들을 RNN, LSTM 등에 입력하는 경우, 입력 데이터의 차원이 많아짐에 따라 차원의 저주가 발생되어 자산을 예측하는데 적합하지 않다는 단점을 가지고 있다. However, assets are determined by a myriad of quantitative indicators such as employment index, price index, industrial production index, interest rate, and relative strength index. When these innumerable quantitative indicators are input to RNN, LSTM, etc., as the dimension of the input data increases, a curse of dimension occurs, which is not suitable for predicting assets.

예측하고자 하는 자산에 대한 입력 데이터와 상기 자산의 과거 가격 데이터 간의 기준일(T)과 예측운용기간(K)의 시계열 길이만큼의 차이를 두고 피팅한 데이터를 입력받는 트리 모듈과, 상기 트리 모듈을 통해 예측된 예측 가격 데이터 및 예측운용기간(K)에 해당하는 실제 가격 데이터 간의 비교값과 이미지 맵의 변수들에 기초하여 생성된 이미지 맵을 입력받는 CNN 모듈을 구비하는 장치를 제공하고자 한다.A tree module that receives data fitted with a difference as much as the time series length of the reference date (T) and the forecast operating period (K) between the input data for the asset to be predicted and the past price data of the asset, and through the tree module An object of the present invention is to provide an apparatus including a CNN module that receives an image map generated based on a comparison value between the predicted predicted price data and the actual price data corresponding to the predicted operating period (K) and the variables of the image map.

또한, 상기 CNN모듈을 통해 상기 예측 가격 데이터에 대해 학습시키고, 상기 CNN 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측하고, 상기 예측된 이미지 맵에 기초하여 상기 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측하는 장치를 제공하고자 한다. In addition, learning about the predicted price data through the CNN module, predicting an image map prior to a specific time to be predicted through the CNN module, and predicting the specific to be predicted based on the predicted image map It is intended to provide a device for predicting the final predicted price data for a point in time.

다만, 본 실시예가 이루고자 하는 기술적 과제는 상기된 바와 같은 기술적 과제들로 한정되지 않으며, 또 다른 기술적 과제들이 존재할 수 있다. However, the technical problem to be achieved by the present embodiment is not limited to the technical problems as described above, and other technical problems may exist.

상술한 기술적 과제를 달성하기 위한 수단으로서, 본 발명의 일 실시예는, 예측하고자 하는 자산에 대한 입력 데이터와 상기 자산의 과거 가격 데이터 간의 기준일(T)과 예측운용기간(K)의 시계열 길이만큼의 차이를 두고 피팅한 데이터를 입력받는 트리 모듈과, 상기 트리 모듈을 통해 예측된 예측 가격 데이터 및 예측운용기간(K)에 해당하는 실제 가격 데이터 간의 비교값과 이미지 맵의 변수들에 기초하여 생성된 이미지 맵을 입력받는 CNN 모듈을 포함하며, 상기 CNN모듈을 통해 상기 예측 가격 데이터에 대해 학습시키고, 상기 CNN 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측하고, 상기 예측된 이미지 맵에 기초하여 상기 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측하는 자산 가격 예측 장치를 제공할 수 있다. As a means for achieving the above-described technical problem, an embodiment of the present invention provides as much as the time series length of the reference date (T) and the forecast operation period (K) between the input data for the asset to be predicted and the past price data of the asset. Created based on a comparison value between a tree module that receives data fitted with a difference of , and a comparison value between the predicted price data predicted through the tree module and the actual price data corresponding to the predicted operating period (K) and the variables of the image map It includes a CNN module that receives the image map that has been made, learns about the predicted price data through the CNN module, predicts the image map corresponding to the previous of a specific time to be predicted through the CNN module, and the predicted It is possible to provide an asset price prediction device that predicts the final predicted price data for the specific time point to be predicted based on the image map.

본 발명의 다른 실시예에 따르면, 상기 트리 모듈은, 상기 기준일 및 상기 기준일에 상기 예측운용기간을 더한 시점 간의 기간에 해당하는 예측 가격 데이터를 예측할 수 있다. According to another embodiment of the present invention, the tree module may predict the predicted price data corresponding to the period between the reference date and the time between the reference date and the predicted operating period.

상술한 과제 해결 수단은 단지 예시적인 것으로서, 본 발명을 제한하려는 의도로 해석되지 않아야 한다. 상술한 예시적인 실시예 외에도, 도면 및 발명의 상세한 설명에 기재된 추가적인 실시예가 존재할 수 있다.The above-described problem solving means are merely exemplary and should not be construed as limiting the present invention. In addition to the above-described exemplary embodiments, there may be additional embodiments described in the drawings and detailed description of the invention.

전술한 본 발명의 과제 해결 수단 중 어느 하나에 의하면, 인공지능 기계학습 모델이 가지는 한계점을 극복하고, 기존의 일반적인 가격 예측 모형(예를 들어, LSTM) 보다 미래의 자산 가격을 정확하게 예측할 수 있는 CNN을 이용한 자산 가격 예측 장치를 제공할 수 있다. According to any one of the above-described problem solving means of the present invention, a CNN that can overcome the limitations of the artificial intelligence machine learning model and predict the future asset price more accurately than the existing general price prediction model (eg, LSTM) It is possible to provide an asset price prediction device using

수많은 차원의 계량적 지표를 TREE 모듈을 이용하여 분류하고, 자산의 가격 예측에 영향을 미치는 입력 데이터의 차원별 기여도를 제공하는 자산 가격 예측 장치를 제공할 수 있다.It is possible to provide an asset price prediction device that classifies quantitative indicators of many dimensions using the TREE module and provides the contribution of input data by dimension that affects the price prediction of assets.

계량적 입력 데이터를 이미지 입력 데이터로 가공하여 차원의 저주, 미래를 예측하기 위한 데이터와 인공지능 학습시 사용한 데이터들의 서로 다른 확률 분포, 인공지능 학습시에 발생되는 오버피팅(overfitting) 문제들을 해결하여 높은 정확도로 자산의 가격을 예측하는 자산 가격 예측 장치를 제공할 수 있다. By processing quantitative input data into image input data, it solves the curse of dimensionality, different probability distributions between data for predicting the future and data used for AI learning, and overfitting problems that occur during AI learning. It is possible to provide an asset price prediction device that predicts the price of an asset with high accuracy.

채권, 주식 등 성격이 다른 자산군에도 동일한 방법으로 자산의 예측이 가능함에 따라 적용 범위가 넓고, 가격 예측, 예측된 기간의 예측 추이, 가격 하락 및 오름 정확도 등을 응용하여 다양한 포트폴리오 전략에 응용할 수 있도록 하는 자산 가격 예측 장치를 제공할 수 있다. As assets can be predicted in the same way for asset classes with different characteristics, such as bonds and stocks, the scope of application is wide, and it can be applied to various portfolio strategies by applying price forecasting, forecasting trends over the forecast period, and price decline and rise accuracy. It is possible to provide an asset price prediction device that allows

도 1은 본 발명의 일 실시예에 따른 가격 예측 장치의 구성도이다.
도 2a 및 도 2b는 본 발명의 일 실시예에 따른 자산의 가격 예측을 하기 위한 시뮬레이션 조건 및 시뮬레이션 조건에 따라 수집된 과거 가격 데이터 및 입력 데이터를 도시한 예시적인 도면이다.
도 3a 내지 도 3c는 본 발명의 일 실시예에 따른 제 1 예측 모듈을 이용하여예측 가격 데이터를 예측하는 과정을 설명하기 위한 예시적인 도면이다.
도 4는 본 발명의 일 실시예에 따른 예측 가격 데이터 및 실제 가격 데이터 간의 비교값을 도출하는 과정을 설명하기 위한 예시적인 도면이다.
도 5는 본 발명의 일 실시예에 따른 생성된 이미지 맵을 이용하여 특정시점의 이전에 해당하는 이미지 맵을 예측하는 과정을 설명하기 위한 예시적인 도면이다.
도 6은 본 발명의 일 실시예에 따른 예측된 최종 예측 가격 데이터를 도시한 예시적인 도면이다.
도 7은 본 발명의 일 실시예에 따른 가격 예측 장치에서 자산의 가격을 예측하는 방법의 순서도이다.1 is a block diagram of a price prediction apparatus according to an embodiment of the present invention.
2A and 2B are exemplary views illustrating simulation conditions for price prediction of assets and historical price data and input data collected according to simulation conditions according to an embodiment of the present invention.
3A to 3C are exemplary diagrams for explaining a process of predicting predicted price data using a first prediction module according to an embodiment of the present invention.
4 is an exemplary view for explaining a process of deriving a comparison value between predicted price data and actual price data according to an embodiment of the present invention.
5 is an exemplary diagram for explaining a process of predicting an image map corresponding to a previous point in time using a generated image map according to an embodiment of the present invention.
6 is an exemplary diagram illustrating predicted final predicted price data according to an embodiment of the present invention.
7 is a flowchart of a method for predicting the price of an asset in the price prediction apparatus according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다. Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art can easily implement the present invention. However, the present invention may be implemented in various different forms and is not limited to the embodiments described herein. In the drawings, parts irrelevant to the description are omitted in order to clearly describe the present invention, and similar reference numerals are attached to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미하며, 하나 또는 그 이상의 다른 특징이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다. Throughout the specification, when a part is said to be "connected" with another part, this includes not only "directly connected" but also "electrically connected" with another element interposed therebetween. . In addition, when a part "includes" a certain component, it means that other components may be further included, and one or more other features, not excluding other components, unless specifically stated to the contrary. It is to be understood that it does not preclude the presence or addition of any number, step, action, component, part, or combination thereof.

본 명세서에 있어서 '부(部)'란, 하드웨어에 의해 실현되는 유닛(unit), 소프트웨어에 의해 실현되는 유닛, 양방을 이용하여 실현되는 유닛을 포함한다. 또한, 1 개의 유닛이 2 개 이상의 하드웨어를 이용하여 실현되어도 되고, 2 개 이상의 유닛이 1 개의 하드웨어에 의해 실현되어도 된다.In the present specification, the term "unit" includes a unit realized by hardware, a unit realized by software, and a unit realized using both. Further, one unit may be realized by using two or more hardware, or two or more units may be realized by one piece of hardware.

본 명세서에 있어서 단말 또는 디바이스가 수행하는 것으로 기술된 동작이나 기능 중 일부는 해당 단말 또는 디바이스와 연결된 서버에서 대신 수행될 수도 있다. 이와 마찬가지로, 서버가 수행하는 것으로 기술된 동작이나 기능 중 일부도 해당 서버와 연결된 단말 또는 디바이스에서 수행될 수도 있다.In this specification, some of the operations or functions described as being performed by the terminal or device may be performed instead in a server connected to the terminal or device. Likewise, some of the operations or functions described as being performed by the server may also be performed by a terminal or device connected to the server.

이하 첨부된 도면을 참고하여 본 발명의 일 실시예를 상세히 설명하기로 한다. Hereinafter, an embodiment of the present invention will be described in detail with reference to the accompanying drawings.

도 1은 본 발명의 일 실시예에 따른 가격 예측 장치의 구성도이다. 도 1을 참조하면, 가격 예측 장치(100)는 수집부(110), 제 1 예측부(120), 도출부(130), 생성부(140) 및 제 2 예측부(150)를 포함할 수 있다. 1 is a block diagram of a price prediction apparatus according to an embodiment of the present invention. Referring to FIG. 1 , the price prediction apparatus 100 may include a collection unit 110 , a first prediction unit 120 , a derivation unit 130 , a generation unit 140 , and a second prediction unit 150 . have.

수집부(110)는 예측하고자 하는 자산의 과거 가격 데이터 및 자산과 연관된 입력 데이터를 수집할 수 있다. 예를 들어, 수집부(110)는 예측하고자 하는 자산의 과거 가격 데이터와 함께 예측하고자 하는 자산이 속한 시장의 경제 지표, 기술적 지표 및 세계공통경제지표에 대한 시계열 데이터를 입력 데이터로 수집할 수 있다. 예측하고자 하는 자산의 과거 가격 데이터 및 자산과 연관된 입력 데이터는 차원(데이터 종류)으로 구성되고, 입력 데이터의 개수는 시계열 데이터의 길이를 나타낼 수 있다. The collection unit 110 may collect past price data of an asset to be predicted and input data associated with the asset. For example, the collection unit 110 may collect, as input data, time series data for economic indicators, technical indicators, and global common economic indicators of the market to which the asset to be predicted belongs together with historical price data of the asset to be predicted. . The historical price data of the asset to be predicted and the input data associated with the asset are composed of a dimension (data type), and the number of input data may indicate the length of time series data.

여기서, 입력 데이터는 예측하고자 하는 여러 자산들의 가격을 특정한 방법론에 따라 재산정한 인덱스 프라이스(Index Price)로 한정하며, 종류는 방법론 및 자산들의 종류, 특성에 따라 매우 다양하지만, 대표적인 인덱스인 KOSPI(국내), S&P500(미국), EMBI(각 국가)로 한정될 수 있다. 대표적인 인덱스들은 많은 자산운용사 및 자금운용기관들이 운용을 할 때, 참고하는 지표로 영향력이 상당하여 예측하는데 큰 효용이 있다. 예를 들어, 인덱스를 추종하는 상품인 ETF의 경우, 가격을 직접 예측하여 투자하는 전략 및 인덱스 관련 선물 옵션 투자 전략 등에 응용될 수 있다. Here, the input data is limited to the Index Price, which recalculates the prices of various assets to be predicted according to a specific methodology, and the types vary greatly depending on the methodology and the types and characteristics of the assets, but the representative index KOSPI (domestic ), S&P 500 (USA), and EMBI (each country). Representative indices are a reference index for many asset management companies and money management institutions, and they are very useful in forecasting because their influence is significant. For example, ETFs, which are index-following products, can be applied to strategies for directly predicting prices and investment strategies for index-related futures options.

예를 들어, 자산이 채권인 경우, 수집부(110)는 채권발행국가의 경제 상황을 파악할 수 있는 지표들 및 전세계 경제에 영향을 미치는 미국의 경제지표를 입력 데이터로 수집할 수 있다. 입력 데이터는 예를 들어, 소매 판매, 산업 생산, MoneySupplyM3, CSIICMEA Index, Moody's Seasoned Baa Corporate Bond Yield, Wholesale Trade Sales, Unemployment Rate, Retail Sales Advance 등을 포함할 수 있으며, 채권발행국가를 고려하여 전산상 얻을 수 있는 모든 데이터들을 포함할 수 있다. For example, when the asset is a bond, the collection unit 110 may collect as input data indicators for understanding the economic situation of the bond issuer and the US economic indicators affecting the global economy. Input data may include, for example, retail sales, industrial production, MoneySupplyM3, CSIICMEA Index, Moody's Seasoned Baa Corporate Bond Yield, Wholesale Trade Sales, Unemployment Rate, Retail Sales Advance, etc. It can contain any data available.

다른 예를 들어, 자산이 주식인 경우, 수집부(110)는 해당주식발행 국가의 경제상황을 알 수 있는 지표들과 미국경제지표 및 주식에 대한 지표(SMB, HML, RMW, CMA, RF), 투자지표(PE Ratio, dividend_yield, market_capital ratio) 등을 입력 데이터로 수집할 수 있다. For another example, if the asset is stock, the collection unit 110 may determine the economic situation of the country issuing the stock, and the indicators for US economic indicators and stocks (SMB, HML, RMW, CMA, RF) , investment indicators (PE Ratio, dividend_yield, market_capital ratio), etc. can be collected as input data.

이러한 자산과 관련하여 수집된 입력 데이터에 대해서는 도 2a 및 도 2b를 통해 상세히 설명하도록 한다. Input data collected in relation to these assets will be described in detail with reference to FIGS. 2A and 2B .

도 2a 및 도 2b는 본 발명의 일 실시예에 따른 자산의 가격 예측을 하기 위한 시뮬레이션 조건 및 조건에 따라 수집된 과거 가격 데이터 및 입력 데이터를 도시한 예시적인 도면이다. 2A and 2B are exemplary views illustrating historical price data and input data collected according to simulation conditions and conditions for price prediction of an asset according to an embodiment of the present invention.

도 2a는 본 발명의 일 실시예에 따른 가격 예측을 하기 위한 시뮬레이션 조건을 도시한 예시적인 도면이다. 도 2a를 참조하면, 시뮬레이션 조건은 대상국가(200), 예측 일자(201), 입력 데이터 수(202), 기준일(203), 예측운용기간(204), MAP SPEC(205), 변수(206), 최소값(207), 간격(208), 경우의 수(209), 최대값(210) 등을 포함할 수 있다. 2A is an exemplary diagram illustrating simulation conditions for price prediction according to an embodiment of the present invention. Referring to Figure 2a, the simulation conditions are target country 200, forecast date 201, number of input data 202, reference date 203, forecast operating period 204, MAP SPEC 205, variable 206 , a minimum value 207 , an interval 208 , a number of instances 209 , a maximum value 210 , and the like.

예를 들어, 시뮬레이션 조건은 관리자에 의해 채권을 발행하는 대상국가(200): '남아공', 예측 일자(201): '2017/06/01~30(영업일: 22일)', 입력 데이터 수(202): '239개', 기준일(203, 이하 상세한 설명 내에서 'T'로 표현): '2017/04/27', 예측운용기간(204, 이하 상세한 설명 내에서 'K'로 표현): '24일'로 설정받을 수 있다. 여기서, 기준일(203)은 과거의 특정 시점을 의미할 수 있고, 예측운용기간(204)은 자산의 예측 가격 데이터에 대해 기준일(203)을 기준으로 하여 예측하려는 기간을 의미할 수 있다. 예를 들어, 기준일(203)이 2017/04/27이고, 예측운용기간(204)이 '24일'로 설정된 경우, 2017/04/27로부터 24일이 경과한 일자에 해당하는 자산의 가격이 예측될 수 있다. For example, the simulation conditions are: 'South Africa', '2017/06/01~30 (business days: 22 days)', number of input data ( 202): '239', base date (203, expressed as 'T' in the detailed description below): '2017/04/27', forecast operating period (204, expressed as 'K' in the detailed description below): It can be set to '24 days'. Here, the reference date 203 may mean a specific point in time in the past, and the forecast operating period 204 may mean a period to be predicted with respect to the predicted price data of an asset based on the reference date 203 . For example, if the base date 203 is 2017/04/27 and the forecast operating period 204 is set to '24 days', the price of the asset corresponding to the date 24 days have elapsed from 2017/04/27 is can be predicted.

또한, 시뮬레이션 조건은 관리자에 의해 채권의 가격을 예측하기 위해 필요한 이미지 맵에 대한 조건을 설정받을 수 있다. 예를 들어, 시뮬레이션 조건은 이미지 맵에 대해 변화를 줄 수 있도록 하는 조합인 MAP SPEC(205): '8*40', 이미지 맵의 변수(206): 'M과 N', 최소값(207): 'M-40일, N-50일', 간격(208): 'M-25일, N-13일', 경우의 수(209): 'M-8, N-40', 최대값(210): 'M-215일, N-557일'로 설정받을 수 있다. In addition, the simulation condition may be set by the manager for the image map necessary to predict the price of the bond. For example, the simulation conditions are combinations that allow changes to the image map: MAP SPEC (205): '8*40', variables in the image map (206): 'M and N', minimum value (207): 'M-40 days, N-50 days', interval (208): 'M-25 days, N-13 days', number of cases (209): 'M-8, N-40', maximum value (210) ): 'M-215 days, N-557 days' can be set.

여기서, 'M'은 추후에 이용될 제 1 예측 모듈에 해당하는 TREE 모듈에 피팅할 시계열 길이로, 과거의 얼마 동안의 정보를 트리에 피팅할 것인가를 의미하는 제 1 기간값일 수 있다. 또한, 'N'은 특정일을 기준으로, TREE 모듈에 피팅하는 경우, 과거 가격 데이터를 표현할 기간변화율의 기간으로, 과거의 얼마 동안의 기간변화율로 과거일로부터 특정일에 해당하는 시점까지의 변화율을 표현할 것인가를 의미하는 제 2 기간값일 수 있다. Here, 'M' is a time series length to be fitted to a TREE module corresponding to a first prediction module to be used later, and may be a first period value indicating how much past information is to be fitted to the tree. In addition, 'N' is the period of period change rate to express past price data when fitting to the TREE module based on a specific date, and the rate of change from the past date to the time point corresponding to a specific date as a period change rate for some past period may be a second period value indicating whether to express

도 2b는 본 발명의 일 실시에에 따라 수집된 입력 데이터를 도시한 예시적인 도면이다. 도 2b를 참조하면, 수집부(110)는 17/01/27~17/03/24까지의 예측하고자 하는 자산(예를 들어, 채권)과 관련된 입력 데이터(220)를 수집하고, 17/02/15~17/04/27까지의 예측하고자 하는 자산(예를 들어, 채권)의 과거 가격 데이터(221)를 수집할 수 있다. 2B is an exemplary diagram illustrating input data collected according to an embodiment of the present invention. Referring to FIG. 2B , the collection unit 110 collects input data 220 related to assets (eg, bonds) to be predicted from 17/01/27 to 17/03/24, and 17/02 It is possible to collect historical price data 221 of assets (eg bonds) to be predicted from /15 to 17/04/27.

다시 도 1로 돌아와서, 제 1 예측부(120)는 제 1 예측 모듈을 통해 예측 가격 데이터를 예측할 수 있다. 제 1 예측 모듈을 통해 예측 가격 데이터를 예측하는 과정에 대해서는 도 3a 내지 3c를 통해 상세히 설명하도록 한다. Returning to FIG. 1 again, the first prediction unit 120 may predict the predicted price data through the first prediction module. A process of predicting the predicted price data through the first prediction module will be described in detail with reference to FIGS. 3A to 3C .

도 3a 내지 도 3c는 본 발명의 일 실시예에 따른 제 1 예측 모듈을 이용하여예측 가격 데이터를 예측하는 과정을 설명하기 위한 예시적인 도면이다. 3A to 3C are exemplary diagrams for explaining a process of predicting predicted price data using a first prediction module according to an embodiment of the present invention.

도 3a는 본 발명의 일 실시예에 따른 기준일 및 예측운용기간에 기초하여 제 1 기간값 및 제 2 기간값이 설정되는 과정을 설명하기 위한 예시적인 도면이다. 도 3a를 참조하면, 제 1 예측부(120)는 기준일(T, 300) 및 예측운용기간(K=24)을 설정받을 수 있다. 여기서, 기준일(T, 300)은 과거의 특정 시점으로 설정되고, 제 1 기간값(302)은 기준일(T, 300)로부터 예측운용기간(K=24)을 뺀 시점(T-24)을 기준으로 하여 설정되고, 제 2 기간값(303)은 기준일(T, 300)을 기준으로 하여 설정될 수 있다. 3A is an exemplary view for explaining a process in which a first period value and a second period value are set based on a reference date and a predicted operating period according to an embodiment of the present invention. Referring to FIG. 3A , the first prediction unit 120 may receive a reference date (T, 300) and a prediction operation period (K=24) set. Here, the reference date (T, 300) is set to a specific time in the past, and the first period value 302 is based on the time (T-24) obtained by subtracting the predicted operating period (K=24) from the reference date (T, 300). , and the second period value 303 may be set based on the reference date (T, 300).

도 3a를 참조하면, 제 1 기간값(302)은 기준일(T, 2017/04/27, 300)로부터 예측운용기간(K=24)을 뺀 시점(T-24, 2017/03/24)과 기준일(T, 2017/04/27, 300)에서 예측운용기간(K=24)이 빠진 시점(T-24, 2017/03/24)으로부터 기설정된 최소값(40일)이 반영된 시점(T-64, 2017/01/27) 사이의 기간을 의미하고, 제 2 기간값(303)은 기준일(T, 2017/04/27, 300)과 기설정된 최소값(50일)에 대응하는 시점(T-50, 2017/02/16) 사이의 기간을 의미할 수 있다. Referring to FIG. 3A , the first period value 302 is the time point (T-24, 2017/03/24) minus the predicted operating period (K=24) from the reference date (T, 2017/04/27, 300) and From the base date (T, 2017/04/27, 300) when the forecast operating period (K=24) is omitted (T-24, 2017/03/24), the preset minimum value (40 days) is reflected (T-64) , 2017/01/27), and the second period value 303 is a time point (T-50) corresponding to the reference date (T, 2017/04/27, 300) and the preset minimum value (50 days) , 2017/02/16).

도 3b는 본 발명의 일 실시예에 따른 제 1 예측 모듈을 도시한 예시적인 도면이다. 도 3b를 참조하면, 제 1 예측 모듈은 TREE 모듈일 수 있다. TREE 모듈은 많은 입력 데이터들을 빠르고 정확하게 분류할 수 있는 모델로, 과거 데이터를 분류 및 피팅하는데 강점이 있다. 여기서, TREE 모듈을 이용하여 미래를 예측하는 경우 별도의 변수 조정이 필요하다.3B is an exemplary diagram illustrating a first prediction module according to an embodiment of the present invention. Referring to FIG. 3B , the first prediction module may be a TREE module. The TREE module is a model that can quickly and accurately classify many input data, and has strengths in classifying and fitting past data. Here, when predicting the future using the TREE module, a separate variable adjustment is required.

제 1 예측 모듈로 TREE 모듈을 사용하는 이유는, TREE 모듈은 수 많은 차원의 입력 데이터의 차원(데이터의 특징)들을 빠르게 분류할 수 있고, TREE 모듈의 각 노드들의 임의의 임계값으로 분류하는데 있어 각 차원들의 중요도를 측정할 수 있으며, 이를 통해, 입력 데이터의 차원들에 대한 설명력을 확보할 수 있고, 적절한 입력 데이터와 과거 가격 데이터를 사용할 경우, 정확도를 높일 수 있기 때문이다. The reason for using the TREE module as the first prediction module is that the TREE module can quickly classify the dimensions (features of data) of input data of many dimensions, and in classifying it with an arbitrary threshold value of each node of the TREE module. This is because the importance of each dimension can be measured, and through this, explanatory power for the dimensions of the input data can be secured, and accuracy can be increased when appropriate input data and past price data are used.

도 3a 및 도 3b를 참조하면, 제 1 예측부(120)는 수집된 입력 데이터 중 제 1 기간값(302)에 해당하는 입력 데이터 및 수집된 과거 가격 데이터 중 제 2 기간값(303)에 해당하는 과거 가격 데이터를 제 1 예측 모듈에 입력하고, 제 1 예측 모듈을 통해 예측 가격 데이터를 예측할 수 있다. 또한, 제 1 예측부(120)는 제 2 기간값(303)에 해당하는 과거 가격 데이터의 기간변화율값을 제 1 예측 모듈에 입력할 수 있다.3A and 3B , the first prediction unit 120 corresponds to the input data corresponding to the first period value 302 among the collected input data and the second period value 303 among the collected past price data. past price data may be input to the first prediction module, and the predicted price data may be predicted through the first prediction module. Also, the first prediction unit 120 may input a period change rate value of the past price data corresponding to the second period value 303 to the first prediction module.

여기서, 제 1 예측부(120)는 입력 데이터와 과거 가격 데이터 간의 예측운용기간(K=24)의 시계열 길이만큼의 차이를 두어 피팅할 수 있다. 예를 들어, 제 1 예측부(120)는 기준일(T, 300)에서 예측운용기간(K=24)을 뺀 시점(T-24)으로부터 제 1 기간(302)값이 반영된 시점(T-64)까지의 입력 데이터 및 제 2 기간값(303)이 반영된 시점(T-50)부터 기준일(T, 300)까지의 과거 가격 데이터를 피팅하여 TREE 모듈에 입력함으로써, T+1부터 T+24(301)까지의 예측 가격 데이터를 예측할 수 있다. Here, the first prediction unit 120 may fit by placing a difference as much as the time series length of the prediction operating period (K=24) between the input data and the past price data. For example, the first prediction unit 120 may calculate a time point (T-64) at which the value of the first period (302) is reflected from a time point (T-24) obtained by subtracting the prediction operation period (K=24) from the reference date (T, 300). ) and past price data from the time point (T-50) to the reference date (T, 300) at which the second period value 303 is reflected and input into the TREE module, T+1 to T+24 ( 301) can be predicted.

제 1 예측부(120)는 기준일(T, 300) 이후 일자(T+1) 및 기준일(T, 300)에 예측운용기간(K=24)을 더한 시점(T+24, 301) 간의 기간에 해당하는 예측 가격 데이터를 예측할 수 있다. 예를 들어, 기준일(T, 300)에 대해 예측운용기간(K)이 24일로 설정된 경우, 기준일(T, 300)에 예측운용기간(K=24)을 더한 시점(T+24, 301)까지의 인덱스 프라이스에 해당하는 예측 가격 데이터를 예측할 수 있도록, 기준일(T, 300)까지의 입력 데이터에 기초하여 기준일(T, 300)에 예측운용기간(K=24)을 더한 시점(T+24, 301)에 해당하는 예측 가격 데이터가 예측되도록 이를 목표로 하여 설명력을 갖도록 할 수 있다. 여기서, 제 1 예측 모듈(TREE 모듈)로 입력 데이터를 분류하는 각 노드의 임계값들은 관리자에 의해 설정될 수 있다. The first prediction unit 120 is a period between the date (T+1) and the reference date (T, 300) after the reference date (T, 300) plus the predicted operating period (K=24) (T+24, 301). The corresponding predicted price data can be predicted. For example, if the forecast operating period (K) is set to 24 days for the reference date (T, 300), up to the point in time (T+24, 301) when the forecast operation period (K=24) is added to the reference date (T, 300) In order to predict the predicted price data corresponding to the index price of , based on the input data up to the reference date (T, 300), the time point (T+24, 301), it is possible to have explanatory power by aiming at this to predict the predicted price data. Here, threshold values of each node for classifying input data with the first prediction module (TREE module) may be set by an administrator.

또한, 기준일(T, 300)과 예측운용기간(K=24)은 관리자에 의해 임의로 설정될 수 있다. 예측운용기간이 24일로 주로 설정되는 이유는 1개월에서 주말을 제외한 최대 영업일수가 24일이며, 보통 1개월 단위의 운용기간변경으로 전략을 운용하는 경우가 많기 때문이다. In addition, the reference date (T, 300) and the predicted operating period (K=24) may be arbitrarily set by the administrator. The reason that the forecast operating period is mainly set to 24 days is that the maximum number of business days excluding weekends is 24 days in one month, and the strategy is often operated by changing the operating period in units of one month.

다른 예를 들어, 제 1 예측부(120)는 과거 가격 데이터 대신 가격변화율 값을 제 1 예측 모듈에 입력하여 예측 가격변화율값을 예측할 수도 있다. As another example, the first prediction unit 120 may predict the predicted price change rate value by inputting the price change rate value instead of the past price data into the first prediction module.

도 3c를 참조하면, 제 1 예측부(120)는 ‘기준일(T, 2017/04/27, 322)’로부터 ‘기준일(T, 2017/04/27, 322)에 예측운용기간(K=24)을 더한 시점(T+24, 2017/05/31, 331)’ 간의 기간에 해당하는 예측 가격 데이터(311)를 예측할 수 있다. 예를 들어, 제 1 예측부(120)는 'T-24(321)~T(322)'시점에 해당하는 입력 데이터(310)를 제 1 예측 모듈인 TREE 모듈에 입력하여, 'T+1~T+24'에 해당하는 '2017/04/28~2017/05/31'(331) 기간에 해당하는 예측 가격 데이터(311)를 예측할 수 있다. Referring to FIG. 3C , the first prediction unit 120 predicts the operating period (K=24) from the 'base date (T, 2017/04/27, 322)' to the 'base date (T, 2017/04/27, 322)' ) plus the time point (T+24, 2017/05/31, 331)', the predicted price data 311 corresponding to the period can be predicted. For example, the first prediction unit 120 inputs the input data 310 corresponding to the time points 'T-24(321) to T(322)' into the TREE module, which is the first prediction module, and 'T+1 It is possible to predict the predicted price data 311 corresponding to the period '2017/04/28 ~ 2017/05/31' (331) corresponding to ~T+24'.

도출부(130)는 제 1 예측부(120)를 통해 예측된 예측 가격 데이터 및 실제 가격 데이터 간의 비교값을 도출할 수 있다. 예를 들어, 도출부(130)는 예측 가격 데이터 및 예측운용기간에 해당하는 실제 가격 데이터 간의 차이값을 도출하고, 도출된 차이값에 대한 역수를 비교값으로 도출할 수 있다. 여기서, 비교값은 예측운용기간에 대한 평균값으로 도출될 수 있다. 또한, 도출부(130)는 제 1 예측부(120)를 통해 예측된 예측 가격 데이터의 기간변화율값 및 실제 가격 데이터의 기간변화율값 간의 비교값을 도출할 수 있다. 비교값을 도출하는 과정에 대해서는 도 4를 통해 상세히 설명하도록 한다. The derivation unit 130 may derive a comparison value between the predicted price data predicted through the first prediction unit 120 and the actual price data. For example, the derivation unit 130 may derive a difference value between the predicted price data and the actual price data corresponding to the predicted operating period, and derive a reciprocal of the derived difference value as a comparison value. Here, the comparison value may be derived as an average value for the predicted operating period. Also, the derivation unit 130 may derive a comparison value between the period change rate value of the predicted price data predicted through the first predictor 120 and the period change rate value of the actual price data. A process of deriving a comparison value will be described in detail with reference to FIG. 4 .

도 4는 본 발명의 일 실시예에 따른 예측 가격 데이터의 기간변화율값 및 실제 가격 데이터의 기간변화율값 간의 비교값을 도출하는 과정을 설명하기 위한 예시적인 도면이다. 도 4를 참조하면, 도출부(130)는 예측 가격 데이터(410)의 기간변화율값 및 예측운용기간에 해당하는 실제 가격 데이터(400) 의 기간변화율값 간의 차이값을 도출하고, 도출된 차이값에 대한 역수를 비교값(420)으로서 도출할 수 있다. 여기서, 비교값(420)은 예측운용기간에 대한 평균값으로 도출될 수 있으며, 다음의 수학식 1과 같이 구성될 수 있다. 4 is an exemplary view for explaining a process of deriving a comparison value between a period change rate value of predicted price data and a period change rate value of actual price data according to an embodiment of the present invention. Referring to FIG. 4 , the derivation unit 130 derives a difference value between the period change rate value of the predicted price data 410 and the period change rate value of the actual price data 400 corresponding to the predicted operating period, and the derived difference value A reciprocal of ? may be derived as the comparison value 420 . Here, the comparison value 420 may be derived as an average value for the predicted operating period, and may be configured as in Equation 1 below.

[수학식 1][Equation 1]

수학식 1을 참조하면, K는 예측운용기간을 의미하고, T는 과거의 특정 시점을 나타내는 기준일을 의미하고, 은 해당 기간의 예측 가격 데이터와 실제 가격 데이터 간의 차이값을 의미한다. Referring to Equation 1, K denotes a forecast operating period, T denotes a reference date representing a specific time in the past, and denotes a difference between predicted price data and actual price data for the corresponding period.

이에 따르면, 기준일에서의 비교값(ZT, 420)은 T+1 ~ T+K 기간(예측운용기간)에서 비교값들의 평균값일 수 있다. 즉, 기준일에서의 비교값(ZT, 420)은 T+1 ~ T+K 기간에서의 예측 가격 데이터(410) 및 실제 가격 데이터(400)를 비교하여 도출할 수 있다. 따라서, T+K까지만 실제 가격 데이터(400)가 존재하는 경우(예를 들어, T+24가 현재시점인 경우) T+1 ~ T+K 기간(예측운용기간)에서 비교값들의 평균값으로 도출되는 비교값(Z, 420)은 T(기준일)까지 도출될 수 있다. Accordingly, the comparison value ZT, 420 at the reference date may be an average value of comparison values in the period T+1 to T+K (prediction operation period). That is, the comparison value (ZT, 420) in the reference date may be derived by comparing the predicted price data 410 and the actual price data 400 in the period T+1 to T+K. Therefore, when the actual price data 400 exists only up to T+K (for example, when T+24 is the current time), it is derived as the average value of the comparison values in the period T+1 to T+K (prediction operation period). The comparison values Z and 420 can be derived up to T (base date).

여기서, 비교값(Z, 420)는 'M(제 1 기간값)*N(제 2 기간값)'의 조합으로 제 1 예측 모듈(TREE 모듈)에 적용한 예측 가격 데이터(410)와 실제 가격 데이터(400) 간의 차이를 나타내는 Cost(Z)의 역수로, 상위 비교값(Z)은 제 1 예측 모듈(TREE 모듈)의 성능이 좋았던 제 1 기간값, 제 2 기간값의 [M, N]의 좌표를 나타내며, 특정 시점에 해당 [M, N]의 조합을 제 1 예측 모듈(TREE 모듈)에 적용하면, 정확도 높은 예측 가격 데이터를 얻을 수 있다는 장점을 갖는다. Here, the comparison value (Z, 420) is the combination of 'M (first period value)*N (second period value)', and the predicted price data 410 and actual price data applied to the first prediction module (TREE module). (400) is the reciprocal of Cost (Z) representing the difference between the values, and the upper comparison value (Z) is the value of the first period value and the second period value [M, N] in which the performance of the first prediction module (TREE module) was good. If the coordinates are indicated and the combination of [M, N] is applied to the first prediction module (TREE module) at a specific time point, it has the advantage of obtaining high-accuracy prediction price data.

생성부(140)는 제 1 기간값, 상기 제 2 기간값 및 비교값에 기초하여 이미지 맵을 생성할 수 있다. 만약, 생성부(140)가 T+1에 해당하는 이미지 맵을 생성하기 위해서는 T+25까지에 해당하는 과거 가격 데이터를 알아야하지만, 예측운용기간을 24일로 설정한 경우 T+24까지만의 과거 가격 데이터를 알고 있으므로, T+1의 에 해당하는 이미지 맵을 생성할 수 없다. 따라서, 이미지 맵은 기준일(T)까지 생성될 수 있다.The generator 140 may generate an image map based on the first period value, the second period value, and the comparison value. If, in order for the generator 140 to generate the image map corresponding to T+1, it needs to know past price data corresponding to T+25, but when the forecasting period is set to 24 days, the past price up to T+24 Since we know the data, we cannot create an image map corresponding to T+1. Accordingly, the image map may be generated by the reference date T.

생성부(140)는 제 1 기간값, 제 2 기간값 및 비교값에 기초하여 적어도 하나의 제 1 차 이미지 맵을 생성하고, 생성된 제 1 차 이미지 맵을 시간 순으로 쌓도록 가공하여 제 2 차 이미지 맵을 생성할 수 있다. 예를 들어, 이미지 맵은 다차원의 공간 데이터를 포함할 수 있으며, 구체적으로, 제 1 차 이미지 맵은 2차원 이미지이고, 제 2 차 이미지 맵은 3차원 이미지일 수 있다. The generating unit 140 generates at least one primary image map based on the first period value, the second period value, and the comparison value, and processes the generated primary image maps to be stacked in chronological order to obtain a second second image map. You can create a car image map. For example, the image map may include multidimensional spatial data. Specifically, the first image map may be a two-dimensional image, and the second image map may be a three-dimensional image.

예를 들어, 생성부(140)는 제 1 기간값(M), 제 2 기간값(N), 비교값(Z)에 기초하여 제 1 차 이미지 맵을 생성할 수 있다. 이 때, 생성부(140)는 이를 반복적으로 수행하여 획득한 비교값(Z)의 정보를 포함하는 제 1 차 이미지 맵을 복수개로 생성하고, 생성된 복수개의 제 1 차 이미지 맵을 시간 순으로 쌓도록 가공하여 제 2 차 이미지 맵을 생성할 수 있다. For example, the generator 140 may generate the first image map based on the first period value M, the second period value N, and the comparison value Z. In this case, the generating unit 140 repeatedly performs this to generate a plurality of primary image maps including information on the obtained comparison value Z, and sequentially arranges the plurality of generated primary image maps in chronological order. It is possible to create a second image map by processing it so that it is stacked.

제 2 예측부(150)는 생성된 이미지 맵을 제 2 예측 모듈에 입력하여 예측 이미지 가격 데이터에 대해 학습시키고, 제 2 예측 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측할 수 있다. 맵을 생성하여 특정 시점의 이전에 해당하는 이미지 맵을 예측하는 과정에 대해서는 도 5를 통해 상세히 설명하도록 한다. The second prediction unit 150 inputs the generated image map into the second prediction module to learn about the prediction image price data, and predicts the image map corresponding to the previous specific time point to be predicted through the second prediction module. have. A process of generating a map and predicting an image map corresponding to a previous point in time will be described in detail with reference to FIG. 5 .

도 5는 본 발명의 일 실시예에 따른 생성된 이미지 맵을 이용하여 특정시점의 이전에 해당하는 이미지 맵을 예측하는 과정을 설명하기 위한 예시적인 도면이다. 5 is an exemplary diagram for explaining a process of predicting an image map corresponding to a previous point in time using a generated image map according to an embodiment of the present invention.

제 2 예측부(150)는 생성된 이미지 맵(503)을 제 2 예측 모듈에 입력하여 예측 가격 데이터에 대해 학습시킬 수 있다. 여기서, 제 2 예측 모듈은 학습 정확도가 높이 이미지 학습에 주로 이용되는 CNN(Convolution Neural Network, 501)가 이용될 수 있다. The second prediction unit 150 may input the generated image map 503 into the second prediction module to learn about the predicted price data. Here, as the second prediction module, a Convolution Neural Network (CNN) 501, which is mainly used for image learning with high learning accuracy, may be used.

제 2 예측 모듈로 CNN(501)을 사용하는 이유는, CNN(501)은 이미지 특징 추출 및 분류 문제에 탁월한 정확도를 보이는 네트워크로, 이 네트워크 원리를 이용하여 제 1 기간값(M)과 제 2 기간값(N)의 조합을 통해 나온 숫자인 비교값(Z) 중 높은 비교값(Z, 특징)을 추출할 수 있다. 이를 통해, 추출된 높은 비교값(Z)에 해당하는 제 1 기간값(M)과 제 2 기간값(N)의 조합을 TREE 모듈을 이용하는 제 1 예측 모듈에 적용함으로써, TREE 모듈의 예측 정확도가 높아지도록 할 수 있다. The reason for using the CNN 501 as the second prediction module is that the CNN 501 is a network showing excellent accuracy in image feature extraction and classification problems, and using this network principle, the first period value (M) and the second A high comparison value (Z, feature) among the comparison values (Z), which is a number obtained through the combination of period values (N), can be extracted. Through this, by applying the combination of the first period value (M) and the second period value (N) corresponding to the extracted high comparison value (Z) to the first prediction module using the TREE module, the prediction accuracy of the TREE module is improved. can make it higher.

CNN(501)이 아닌 다른 딥러닝 모델을 사용하는 경우, 수많은 차원(거시경제지표인 입력 데이터의 약 300개의 차원)을 입력으로 자산을 예측하려면, 300개의 차원의 기하급수배의 데이터양이 필요하지만, 금융시계열 데이터는 짧다(1년 250일->250개의 길이)는 한계를 가지고 있다. 또한 수 많은 입력데이터 차원을 각 신경망 노드에 입력시켜 학습을 하는 경우, 비선형 관계를 찾기 쉽지 않다. When using a deep learning model other than CNN (501), to predict assets with numerous dimensions (about 300 dimensions of the input data, which are macroeconomic indicators), an exponential data amount of 300 dimensions is required However, financial time series data has a limitation that it is short (250 days per year -> 250 lengths). In addition, when learning by inputting numerous input data dimensions to each neural network node, it is not easy to find a non-linear relationship.

그러나 CNN(501)을 이용하는 경우, 컨볼루션(convolution)의 커널 사이즈(kernel size)와 필터(filter) 수만큼의 파라미터만 학습시키면 되므로, 학습시에 유리하며, TREE 모듈인 제 1 예측 모듈의 주요 변수를 찾는데 도움을 주는 간접적인 역할을 하기 때문에 예측이 실패할 경우에도 타격을 적게 받을 수 있기 때문이다. However, in the case of using the CNN 501, it is advantageous in learning because only as many parameters as the kernel size and the number of filters of the convolution are learned, and the TREE module is the main component of the first prediction module. Because it plays an indirect role to help find the variable, it can be less affected if the prediction fails.

제 2 예측부(150)는 제 2 예측 모듈을 통해 기준일로부터 예측운용기간이 지난 시점에 해당하는 이미지 맵(502)을 예측할 수 있다. 예를 들어, 기준일이 'T'이고, 예측운용기간이 '24일'로 설정된 경우, 제 2 예측부(150)는 기준일(T) 이전에 생성된 제 2 차 이미지 맵(503)을 이용하여 예측운용기간(K=24) 이후를 예측하기 위해 제 2 예측 모듈(CNN, 501)을 이용하여 이미지 학습을 수행하고, 제 2 예측 모듈(CNN, 501)의 이미지 학습을 통해 T+K 시점(T+24)의 이미지 맵(502)을 예측할 수 있다.The second prediction unit 150 may predict the image map 502 corresponding to the time point when the prediction operation period has elapsed from the reference date through the second prediction module. For example, when the reference date is 'T' and the prediction operation period is set to '24 days', the second prediction unit 150 uses the secondary image map 503 generated before the reference date (T). To predict after the prediction operation period (K = 24), image learning is performed using the second prediction module (CNN, 501), and T + K time point ( T+24) of the image map 502 can be predicted.

제 2 예측부(150)는 자산의 가격 예측을 위해 가격 예측 시작일 직전의 이미지 맵(503)을 이용하여 예측 가격 데이터에 대해 학습시킬 수 있다. 이미지 맵(503)은 제 1 기간값(M)과 제 2 기간값(N)의 좌표쌍 및 해당 좌표쌍에 대응하는 비교값(Z값)으로 구성될 수 있다. 이미지 맵(503)은 어떤 제 1 기간값(M)과 제 2 기간값(N)의 좌표가 좋은 비교값(Z)을 가지고 있는지를 파악할 수 있도록 하며, 가격예측시작일(T+24)을 기준으로 예측운용기간의 시계열 길이만큼을 뺀 일자(T+24-예측운용기간)까지만 생성될 수 있다. The second prediction unit 150 may learn the predicted price data by using the image map 503 immediately before the price prediction start date in order to predict the price of the asset. The image map 503 may include a coordinate pair of the first period value M and the second period value N, and a comparison value (Z value) corresponding to the coordinate pair. The image map 503 makes it possible to determine which coordinates of the first period value (M) and the second period value (N) have a good comparison value (Z), and based on the price prediction start date (T+24) As such, it can be created only up to the date (T+24-predicted operating period) minus the time series length of the forecast operating period.

이 때, 실제의 (T+24) 시점의 이미지 맵(502)은 파악할 수 없으므로, 제 2 예측 모듈(CNN, 501)을 통해 시간적/공간적(SPATIO-TEMPORAL) 요소를 고려하여 (T+24- 예측운용기간)일자까지의 이미지 맵(503)으로 학습된 뒤, T+24 시점의 이미지 맵(502)을 예측할 수 있도록 학습시킬 수 있다. At this time, since the image map 502 of the actual (T+24) time point cannot be grasped, the (T+24-) factor is considered through the second prediction module (CNN, 501) in terms of the temporal/spatial (SPATIO-TEMPORAL) After the image map 503 is learned up to the date of the prediction operation period), the image map 502 at the time T+24 can be predicted to be learned.

예를 들어, 오늘이 T+24(2017.05.31)인 경우, 운용기간 24일을 기준으로 미래 기간에 해당하는 T+25~5+48(2017.0601~2017.06.30)까지의 가격을 예측하기 위해서는 T+24 시점에서의 이미지 맵(502)이 필요하며, 해당 시점의 이미지 맵(502)은 기준일(T)까지 생성된 이미지 맵들(503)로 학습된 제 2 예측 모듈(CNN, 501)을 통해 예측할 수 있다. For example, if today is T+24 (2017.05.31), to predict the price from T+25 to 5+48 (2017.0601 to 2017.06.30), which corresponds to the future period based on 24 days of the operating period, The image map 502 at the time T+24 is required, and the image map 502 at the time point is obtained through the second prediction module (CNN, 501) trained with the image maps 503 generated up to the reference date (T). predictable.

이후, 제 1 예측부(120)는 예측된 이미지 맵(502)에 기초하여 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측할 수 있다. 예를 들어, 제 1 예측부(120)는 예측된 이미지 맵(502)으로부터 기설정된 임계값 이상에 해당하는 적어도 하나의 비교값을 추출하고, 추출된 비교값에 대응하는 하나의 제 1 기간값 및 제 2 기간값을 도출할 수 있다. Thereafter, the first prediction unit 120 may predict the final predicted price data for a specific time point to be predicted based on the predicted image map 502 . For example, the first prediction unit 120 extracts at least one comparison value corresponding to a predetermined threshold value or more from the predicted image map 502 , and one first period value corresponding to the extracted comparison value. and a second period value.

다시 말해, 본원 발명은 제 1 예측부 및 제 2 예측부를 통해 예측 가격 데이터를 도출하기 위한 최적의 (M, N, Z) 파라미터를 추출할 수 있다.In other words, the present invention can extract optimal (M, N, Z) parameters for deriving predicted price data through the first prediction unit and the second prediction unit.

제 1 예측부(120)는 도출된 제 1 기간값(M) 및 제 2 기간값(N)을 제 1 예측 모듈에 입력하여 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측할 수 있다. 또는, 제 1 예측부(120)는 제 2 기간값에 기초하여 도출된 과거 가격 데이터의 기간변화율값을 제 1 예측 모듈에 피팅하여 입력하고, 예측하고자 하는 특정 시점에 대한 예측 기간변화율값 또는 최종 예측기간변화율값을 예측할 수 있다. 예측된 특정 시점에 대한 최종 예측 가격 데이터에 대해서는 도 6을 통해 상세히 설명하도록 한다. The first prediction unit 120 may input the derived first period value (M) and the second period value (N) into the first prediction module to predict the final predicted price data for a specific time point to be predicted. Alternatively, the first prediction unit 120 inputs the period change rate value of the past price data derived based on the second period value by fitting it into the first prediction module, and the forecast period change rate value for a specific time point to be predicted or the final The rate of change over the forecast period can be predicted The final predicted price data for the predicted specific time will be described in detail with reference to FIG. 6 .

도 6은 본 발명의 일 실시예에 따른 예측된 최종 예측 가격 데이터를 도시한 예시적인 도면이다. 도 6을 참조하면, 제 1 예측부(120)는 제 2 예측 모듈을 통해 예측된 이미지 맵을 이용하여 상위 20%에 해당하는 비교값(Z)을 추출하고, 추출된 비교값(Z)에 해당하는 (M.N) 조합을 제 1 예측 모듈(TREE 모듈)에 다시 계량적 입력 데이터로 피팅하고, 최종적으로 예측된 가격들을 하드보팅(hardvoting) 방법으로 앙상블(ensemble)된 값을 이용할 수 있다. 6 is an exemplary diagram illustrating predicted final predicted price data according to an embodiment of the present invention. Referring to FIG. 6 , the first prediction unit 120 extracts a comparison value (Z) corresponding to the top 20% by using the image map predicted through the second prediction module, and adds the extracted comparison value (Z). The corresponding (MN) combination may be fitted back to the first prediction module (TREE module) as quantitative input data, and the ensembled value may be used by hardvoting the finally predicted prices.

예를 들어, 제 2 예측부(150)에서 기준일(T) 이전에 생성된 이미지 맵을 예측 가격 데이터에 대해 학습을 시킨 뒤, 기준일(T)에 해당하는 이미지 맵을 입력으로 하여 기준일(T)로부터 예측운용기간(K=24)이 지난 시점(T+24)에 해당하는 이미지 맵을 예측할 수 있다. For example, after learning the image map generated before the reference date (T) in the second prediction unit 150 on the predicted price data, the image map corresponding to the reference date (T) is input to the reference date (T) It is possible to predict an image map corresponding to the time point (T+24) after the prediction operation period (K=24) has elapsed.

여기서, 제 1 예측부(120)는 예측된 이미지 맵의 상위 q%(q는 관리자에 의해 지정됨. 예를 들어, 5%)에 해당하는 비교값(Z)을 추출하고, 추출된 비교값(Z)에 해당되는 제 1 기간값(M)과 제 2 기간값(N)의 조합을 도출할 수 있다. 이 때, 추출된 제 1 기간값(M)과 제 2 기간값(N)의 조합에 따라 제 1 예측 모듈(TREE 모듈)로 피팅하여 입력하고, 최종적으로 제 1 기간값(M)과 제 2 기간값(N)의 조합의 경우의 수만큼 예측된 예측치의 평균값을 이용하여 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측할 수 있다. 예를 들어, 예측치의 평균값은 MAP (M=8, N=40, 총 M과 N의 조합의 수=320(8*40), 상위 5%=16개(320*0.05)와 같이 구성될 수 있다. Here, the first prediction unit 120 extracts the comparison value Z corresponding to the top q% (q is designated by the administrator, for example, 5%) of the predicted image map, and the extracted comparison value ( A combination of the first period value M and the second period value N corresponding to Z) may be derived. At this time, according to the extracted combination of the first period value (M) and the second period value (N), the first prediction module (TREE module) is fitted and inputted, and finally the first period value (M) and the second period value (N) are inputted. The final predicted price data for a specific time point to be predicted can be predicted using the average value of the predicted values as many as the number of combinations of period values (N). For example, the average value of the predicted values can be configured as MAP (M = 8, N = 40, the total number of combinations of M and N = 320 (8 * 40), the top 5% = 16 (320 * 0.05)) have.

도 6을 참조하면, 제 1 예측부(120)는 총 16개의 제 1 기간값(M)과 제 2 기간값(N)의 조합으로 제 1 예측 모듈에 피팅하여 입력하여 최종 예측 가격 데이터를 예측할 수 있다. 예를 들어, 제 1 예측부(120)는 제 1 예측 모듈(TREE 모듈)에 피팅하여 입력하고, 예측(=16개 TREE 모듈을 통해 도출된 앙상블)한 것들의 평균값(hardvoting)을 이용한 결과를 통해 최종 예측 가격 데이터를 예측할 수 있다. Referring to FIG. 6 , the first prediction unit 120 predicts the final predicted price data by fitting and inputting a total of 16 combinations of the first period value (M) and the second period value (N) to the first prediction module. can For example, the first prediction unit 120 fits and inputs the first prediction module (TREE module), and predicts the result using the average value (hardvoting) of predictions (= ensemble derived through 16 TREE modules). It is possible to predict the final predicted price data through

이러한 가격 예측 장치(100)는 가격을 예측하는 명령어들의 시퀀스를 포함하는 매체에 저장된 컴퓨터 프로그램에 의해 실행될 수 있다. 컴퓨터 프로그램은 컴퓨팅 장치에 의해 실행될 경우, 예측하고자 하는 자산의 과거 가격 데이터 및 자산과 연관된 입력 데이터를 수집하고, 수집된 입력 데이터 중 제 1 기간값에 해당하는 입력 데이터 및 수집된 과거 가격 데이터 중 제 2 기간값에 해당하는 과거 가격 데이터를 제 1 예측 모듈에 입력하고, 제 1 예측 모듈을 통해 예측 가격 데이터를 예측하고, 예측된 예측 가격 데이터 및 실제 가격 데이터 간의 비교값을 도출하고, 제 1 기간값, 상기 제 2 기간값 및 비교값에 기초하여 이미지 맵을 생성하고, 생성된 이미지 맵을 제 2 예측 모듈에 입력하여 예측 가격 데이터에 대해 학습시키고, 제 2 예측 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측하고, 예측된 이미지 맵에 기초하여 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측하도록 하는 명령어들의 시퀀스를 포함하는 매체에 저장된 컴퓨터 프로그램을 제공할 수 있다. The price prediction apparatus 100 may be executed by a computer program stored in a medium including a sequence of instructions for predicting a price. When the computer program is executed by the computing device, it collects historical price data of the asset to be predicted and input data associated with the asset, and the input data corresponding to the first period value among the collected input data and the second of the collected historical price data. Input the past price data corresponding to the 2 period values into the first prediction module, predict the predicted price data through the first prediction module, derive a comparison value between the predicted predicted price data and the actual price data, and the first period An image map is generated based on the value, the second period value and the comparison value, the generated image map is input to the second prediction module to learn about the predicted price data, and a specific point in time to be predicted through the second prediction module It is possible to provide a computer program stored in a medium including a sequence of instructions for predicting an image map corresponding to the previous of , and predicting the final predicted price data for a specific point in time to be predicted based on the predicted image map.

도 7은 본 발명의 일 실시예에 따른 가격 예측 장치에서 자산의 가격을 예측하는 방법의 순서도이다. 도 7에 도시된 가격 예측 장치(100)에서 자산의 가격을 예측하는 방법은 도 1 내지 도 6에 도시된 실시예에 따라 시계열적으로 처리되는 단계들을 포함한다. 따라서, 이하 생략된 내용이라고 하더라도 도 1 내지 도 6에 도시된 실시에에 따른 가격 예측 장치(100)에서 자산의 가격을 예측하는 방법에도 적용된다. 7 is a flowchart of a method for predicting the price of an asset in the price prediction apparatus according to an embodiment of the present invention. The method of predicting the price of an asset in the price prediction apparatus 100 illustrated in FIG. 7 includes steps of time-series processing according to the embodiments illustrated in FIGS. 1 to 6 . Therefore, even if omitted below, it is also applied to the method of predicting the price of an asset in the price prediction apparatus 100 according to the embodiment shown in FIGS. 1 to 6 .

단계 S710에서 가격 예측 장치(100)는 예측하고자 하는 자산의 과거 가격 데이터 및 자산과 연관된 입력 데이터를 수집할 수 있다. In step S710 , the price prediction apparatus 100 may collect past price data of an asset to be predicted and input data associated with the asset.

단계 S720에서 가격 예측 장치(100)는 수집된 입력 데이터 중 제 1 기간값에 해당하는 입력 데이터 및 수집된 과거 가격 데이터 중 제 2 기간값에 해당하는 과거 가격 데이터를 제 1 예측 모듈에 입력할 수 있다. In step S720, the price prediction apparatus 100 may input the input data corresponding to the first period value among the collected input data and the past price data corresponding to the second period value among the collected past price data to the first prediction module. have.

단계 S730에서 가격 예측 장치(100)는 제 1 예측 모듈을 통해 예측 가격 데이터를 예측할 수 있다. In step S730, the price prediction apparatus 100 may predict the predicted price data through the first prediction module.

단계 S740에서 가격 예측 장치(100)는 예측된 예측 가격 데이터 및 실제 가격 데이터 간의 비교값을 도출할 수 있다. In step S740, the price prediction apparatus 100 may derive a comparison value between the predicted predicted price data and the actual price data.

단계 S750에서 가격 예측 장치(100)는 제 1 기간값, 상기 제 2 기간값 및 비교값에 기초하여 이미지 맵을 생성할 수 있다. In operation S750, the price prediction apparatus 100 may generate an image map based on the first period value, the second period value, and the comparison value.

단계 S760에서 가격 예측 장치(100)는 생성된 이미지 맵을 제 2 예측 모듈에 입력하여 예측 가격 데이터에 대해 학습시킬 수 있다. In step S760, the price prediction apparatus 100 may input the generated image map into the second prediction module to learn about the predicted price data.

단계 S770에서 가격 예측 장치(100)는 제 2 예측 모듈을 통해 예측하고자 하는 특정 시점의 이전에 해당하는 이미지 맵을 예측할 수 있다. In step S770 , the price prediction apparatus 100 may predict an image map corresponding to a previous specific time point to be predicted through the second prediction module.

단계 S780에서 가격 예측 장치(100)는 예측된 이미지 맵에 기초하여 예측하고자 하는 특정 시점에 대한 최종 예측 가격 데이터를 예측할 수 있다. In step S780, the price prediction apparatus 100 may predict the final predicted price data for a specific time point to be predicted based on the predicted image map.

상술한 설명에서, 단계 S710 내지 S780은 본 발명의 구현예에 따라서, 추가적인 단계들로 더 분할되거나, 더 적은 단계들로 조합될 수 있다. 또한, 일부 단계는 필요에 따라 생략될 수도 있고, 단계 간의 순서가 전환될 수도 있다.In the above description, steps S710 to S780 may be further divided into additional steps or combined into fewer steps, according to an embodiment of the present invention. In addition, some steps may be omitted as necessary, and the order between steps may be switched.

도 1 내지 도 7을 통해 설명된 가격 예측 장치에서 자산의 가격을 예측하는 방법은 컴퓨터에 의해 실행되는 매체에 저장된 컴퓨터 프로그램 또는 컴퓨터에 의해 실행 가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 또한, 도 1 내지 도 7을 통해 설명된 가격 예측 장치에서 자산의 가격을 예측하는 방법은 컴퓨터에 의해 실행되는 매체에 저장된 컴퓨터 프로그램의 형태로도 구현될 수 있다. The method of predicting the price of an asset in the price prediction apparatus described through FIGS. 1 to 7 may be implemented in the form of a computer program stored in a medium executed by a computer or a recording medium including instructions executable by a computer. have. In addition, the method of predicting the price of an asset in the price prediction apparatus described with reference to FIGS. 1 to 7 may be implemented in the form of a computer program stored in a medium executed by a computer.

컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체를 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. Computer-readable media can be any available media that can be accessed by a computer, and includes both volatile and nonvolatile media, removable and non-removable media. Further, the computer-readable medium may include a computer storage medium. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data.

전술한 본 발명의 설명은 예시를 위한 것이며, 본 발명이 속하는 기술분야의 통상의 지식을 가진 자는 본 발명의 기술적 사상이나 필수적인 특징을 변경하지 않고서 다른 구체적인 형태로 쉽게 변형이 가능하다는 것을 이해할 수 있을 것이다. 그러므로 이상에서 기술한 실시예들은 모든 면에서 예시적인 것이며 한정적이 아닌 것으로 이해해야만 한다. 예를 들어, 단일형으로 설명되어 있는 각 구성 요소는 분산되어 실시될 수도 있으며, 마찬가지로 분산된 것으로 설명되어 있는 구성 요소들도 결합된 형태로 실시될 수 있다. The above description of the present invention is for illustrative purposes only, and those of ordinary skill in the art to which the present invention pertains will be able to understand that other specific forms can be easily modified without changing the technical spirit or essential features of the present invention. will be. Therefore, it should be understood that the embodiments described above are illustrative and non-limiting in all respects. For example, each component described as a single type may be implemented in a distributed manner, and similarly, components described as being distributed may also be implemented in a combined form.

본 발명의 범위는 상기 상세한 설명보다는 후술하는 특허청구범위에 의하여 나타내어지며, 특허청구범위의 의미 및 범위 그리고 그 균등 개념으로부터 도출되는 모든 변경 또는 변형된 형태가 본 발명의 범위에 포함되는 것으로 해석되어야 한다. The scope of the present invention is indicated by the claims to be described later rather than the detailed description, and all changes or modified forms derived from the meaning and scope of the claims and their equivalent concepts should be construed as being included in the scope of the present invention. do.

100: 가격 예측 장치
110: 수집부
120: 제 1 예측부
130: 도출부
140: 생성부
150: 제 2 예측부100: price prediction device
110: collection unit
120: first prediction unit
130: derivation part
140: generator
150: second prediction unit

Claims

In the asset price prediction device using C&N,
a tree module that receives data fitted with a difference as much as the time series length of the reference date (T) and the forecast operating period (K) between the input data of the asset to be predicted and the past price data of the asset;
A CNN module that receives an image map generated based on the comparison value between the predicted price data predicted through the tree module and the actual price data corresponding to the prediction operating period (K) and the variables of the image map;
Learning about the predicted price data through the CNN module, predicting an image map corresponding to a previous point of time to be predicted through the CNN module, and based on the predicted image map at a specific time point to be predicted To predict the final forecast price data for
Asset price prediction device using CNN.

According to claim 1, wherein the tree module,
Predicting the forecast price data corresponding to the period between the reference date and the time point of adding the forecast operating period to the reference date,
Asset price prediction device using CNN.

The method of claim 1,
The input data is characterized in that it is an index price that recalculates the prices of the assets to be predicted,
The index includes one or more of KOSPI (domestic), S&P 500 (USA), EMBI (each country),
Asset price prediction device using CNN.

According to claim 1, wherein the comparison value,
Derived from the average value for the forecast operating period,
Asset price prediction device using CNN.

The method of claim 1, wherein the parameters of the image map are:
A first period value set on the basis of the time when the predicted operating period (K) is subtracted from the reference date (T);
including a second period value set based on the time point of subtracting the predicted operating period (K) from the reference date (T),
Asset price prediction device using CNN.

The method of claim 5, wherein the image map,
At least one first image map generated based on the first period value, the second period value, and the comparison value, and a second generated by processing the generated first image maps to be stacked in chronological order containing the car image map,
Asset price prediction device using CNN.

The method of claim 6,
The first image map is a two-dimensional image, and the second image map is a three-dimensional image,
Asset price prediction device using CNN.