KR20190130535A

KR20190130535A - Pair Trading System based on the Stock bigdata and Controlling Method for the Same

Info

Publication number: KR20190130535A
Application number: KR1020190145965A
Authority: KR
Inventors: 고경훈
Original assignee: 주식회사 코스콤
Priority date: 2019-11-14
Filing date: 2019-11-14
Publication date: 2019-11-22

Abstract

The present invention provides a pair trading system based on stock big data and a control method therefor. The pair trading system comprises: a collection engine module crawling securities-related big data provided from internet sites of various information collection media, classifying the crawled data, and storing the same in a DB; a control module including the collection engine module, controlling the entire process of big data-based pair trading, and performing control such that the pair trading information is processed to calculate a sell/buy time point of a set item and produce an expected scenario; an analysis engine module filtering a big data-based securities item pair trading list by processing and analyzing the big data crawled under the control of functions by the control module; and a result calculation module calculating an actual buy/sell entry timepoint for the securities item pair trading list filtered by the analysis engine module under the control of functions by the control module, calculating the profits, producing a final expected scenario, and transmitting the same to a set customer terminal or electronic address. The present invention crawls the securities-related big data, processes and analyzes the same in multi-stages, produces the securities item pair trading list, calculates the actual buy/sell entry timepoint and produces the best expected scenario, and transmits the same to the set customer, thereby extracting pair trading item groups from all the items regardless of item association and providing a service. As a result, a high-quality service can be provided to securities investors.

Description

Fair Trading System based on the Stock bigdata and Controlling Method for the Same}

본 발명은 증권 빅데이터 기반 페어 트레이딩시스템 및 그 제어방법에 관한 것으로, 특히 증권관련 빅데이터들을 크롤링하여 다단계로 가공 및 분석하고 증권종목 페어 리스트를 산출한 후 실제 매수/매도 진입시점 및 최상 예상 시나리오를 산출하여 설정된 고객에게 전송함으로써, 증권투자자에 대한 최상의 고수익서비스를 제공하는 증권 빅데이터기반 페어 트레이딩시스템 및 그 제어방법에 관한 것이다. The present invention relates to a securities big data-based pair trading system and a control method thereof, in particular, securities-related big data is processed in multiple stages by crawling and analyzing a securities item pair list, and then buying / selling entry points and best expected scenarios. The present invention relates to a securities big data-based pair trading system and a method of controlling the same, by providing a high yield service for a securities investor by calculating and transmitting the same to a set customer.

일반적으로 통신 및 산업의 발달과 함께 주가지수의 양상도 급변하고 있다. 즉, 지수의 업종별 세분화, 특정주식의 비중증가로 인해 종합지수는 내려가나 특정지수는 오히려 폭등하는 지수 간의 무상관성 심화, 사이버 매매 전산 프로그램의 다양한 지수 제공 및 제공시간의 단축(업종별 지수, 30초 지수 등), 사이버 매매자의 급격한 증가 및 초단타 매매, 특정지수에 속하는 전체종목이 전부상승(상한) 또는 전부 하락(하한)하는 집단화의 심화, 세계증시의 동조화로 인한 미국 다우, 나스닥의 등락에 따른 다음날 국내 주가의 급등락 등으로 인해 주가지수의 변동폭이 확대될 뿐만 아니라 변동시간도 초 스피드화 하고 있다. 향후 주가 등락폭 확대 또는 상하한선 폐지 등이 실행될 경우 지수변화는 더욱 심화될 것으로 예측된다. 따라서, 상기와 같이 지수의 급등락이 상존하므로 주식투자자에게 있어 매매(매수/매도) 의사결정하기가 매우 어렵고 동시에 그러한 의사결정은 전체손익에 상당한 영향을 미치게 된다. 그러므로 최근에는 그러한 변동 위험성이나 수익률 극대화를 위해 상관성이 있는 종목을 추적하는 종목 페어 트레이딩 방법이 사용되고 있다.Generally, with the development of telecommunications and industry, the pattern of stock index is also changing rapidly. In other words, due to the segmentation of the index by industry, the increase in the weight of specific stocks, the composite index declines, but the correlation index increases rather than the specific index soaring. Indices, etc.), the rapid rise and fall of cyber traders, the deepening of groupings in which all stocks belonging to a particular index rise (upper) or all down (lower). The next day, due to the sharp rise in domestic stock prices, not only the fluctuation of the stock index has widened but also the change time has been speeding up. If the share price fluctuates sharply or the upper and lower limits are abolished, the index change will intensify further. Therefore, as the index fluctuates as mentioned above, it is very difficult for a stock investor to make a trading (buy / sell) decision, and at the same time, the decision has a significant impact on the overall profit and loss. Therefore, in recent years, a pair trading method has been used to track correlated stocks in order to maximize such risks and returns.

그러면, 상기와 같은 종래 종목 페어 트레이딩방법을 도 1을 참고로 살펴보면, 유사업종 내에서 연관성이 있는 종목군을 크롤링하는 제1 단계(S101)와;Then, referring to FIG. 1 for the conventional stock pair trading method as described above, a first step (S101) of crawling related stock groups in the dairy business;

상기 제1 단계(S101)후에 크롤링된 연관성이 있는 종목군내의 종목들을 분석하는 제2 단계(S102)와; A second step (S102) of analyzing items in the related item group crawled after the first step (S101);

상기 제2 단계(S102)후에 분석된 종목 중에서 페어 트레이딩이 가능한 종목들을 산출하여 고객에게 제공하는 제4 단계(S103)를 포함하여 구성된다. Comprising a fourth step (S103) of calculating the items available for pair trading among the items analyzed after the second step (S102) to provide to the customer.

한편, 상기와 같은 종래 종목 페어 트레이딩방법을 좀 더 구체적으로 살펴보면, 예컨대, 한국의 증권거래소의 코스피나 코스닥에 상장된 유사업종 내에서 연관성이 있는 종목군을 먼저 크롤링한다. 그리고 상기와 같이 크롤링된 연관성이 있는 종목군내의 종목 예컨대, 철강을 생산하는 포스코와 현대제철 혹은 전자제품을 생산하는 삼성전자와 LG전자 등 서로 연관성이 있는 종목군내의 종목들을 분석한다. 또한 상기 분석 과정 중에 그 분석된 종목 중에서 예컨대, 전자제품을 생산하는 삼성전자와 LG전자 등 서로 연관성이 있는 종목군내의 종목 중에서 페이 트레이딩이 가능한 종목이 있을 경우 이들을 취합한 다음 매도/매수 시점 등의 정보를 산출하여 고객의 단말기로 제공한다. On the other hand, when looking at the conventional stock trading method more specifically, for example, the relevant stock group in the KOSPI or KOSDAQ listed on the stock exchange of Korea first crawls. In addition, we will analyze the stocks in the related stock groups, such as POSCO, which produces steel, and Hyundai Steel, or Samsung and LG, which produce electronic products. In addition, if there are stocks among the analyzed stocks among the related stocks, such as Samsung Electronics and LG Electronics, which produce electronic products during the analysis process, and they are available for pay trading, they may be collected and then sold or bought. The information is calculated and provided to the customer's terminal.

그러나, 상기와 같은 종래 종목 페어 트레이딩방법은 페어 트레이딩을 위한 종목들이 유사업종 내에서만 산출이 가능하기 때문에 서비스의 폭이 매우 제한적이어서 투자자에게 제공되는 고품위서비스가 매우 한정적이었으며, 또한 페어 트레이딩 종목의 발굴 시에 단순 가공 처리하여 페어트레이딩 종목을 산출하므로 그에 따라 매수/매도 진입시점에 대한 승률 정밀도가 저조하여 정보의 활용성도 상당히 저하시키는 문제점이 발생되었다.However, the conventional pair trading method as described above, since the trading for the pair trading items can be calculated only in the dairy business, the service is very limited, and the high quality service provided to the investor is very limited. As the fair trading item is calculated by simple processing at the time, the accuracy of the odds for the buy / sell entry point is low, resulting in a significant decrease in the utilization of information.

이에 본 발명은 상기와 같은 종래기술의 문제점을 해결하기 위해 발명된 것으로, 증권관련 빅데이터들을 크롤링하여 다단계로 가공 및 분석하고 증권종목 페어 리스트를 산출한 후 실제 매수/매도 진입시점 및 최상 예상 시나리오를 산출하여 설정된 고객에게 전송함으로써, 종목 연관성에 관계없이 전 종목을 대상으로 산출한 페어 트레이딩종목군을 발굴하여 최상의 고수익서비스를 제공할 수 있는 증권 빅데이터 기반 페어 트레이딩시스템 및 그 제어방법을 제공함에 그 목적이 있다.Therefore, the present invention has been invented to solve the problems of the prior art as described above, after crawling and analyzing securities related big data in multiple stages and calculating the stock list of stock items, the actual buying / selling entry point and the best expected scenario. By calculating and sending to the set customer, The purpose of this study is to provide a pair trading system based on securities big data that can provide the best high-profit service by discovering the pair trading stocks calculated for all stocks regardless of the relatedness of stocks.

본 발명의 또 다른 목적은 주식투자자에게 증권 빅데이터를 기반으로 하여 산출된 페어 트레이딩 종목들에 대한 매수/매도 진입시점 및 최상 예상 시나리오를 제공하여 최적화된 매매결정을 내리도록 하므로 그에 따라 주식투자에 대한 손실을 최소화할 수 있는 증권 빅데이터 기반 페어 트레이딩시스템 및 그 제어방법을 제공하는데 있다.Another object of the present invention is to provide stock investors with the best buy and sell entry point and best forecast scenario for the pair trading items calculated on the basis of securities big data to make an optimized trading decision accordingly To provide a securities big data-based pair trading system and a control method thereof that can minimize losses.

상기와 같은 목적을 달성하기 위한 본 발명은 각종 정보수집매체의 인터넷 사이트로부터 제공되는 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 구분하여 DB에 저장시키는 수집엔진모듈과;The present invention for achieving the above object is a collection engine module for crawling the securities-related big data provided from the Internet site of the various information collection media and classify the crawled data in a DB;

상기 수집엔진모듈을 포함하여 빅데이터 기반 페어 트레이딩 전 과정을 제어하고 그 페어 트레이딩정보를 가공하여 설정된 종목의 매도/매수 시점 및 예상 시나리오가 산출되도록 제어하는 제어모듈과;A control module for controlling the entire process of big data-based pair trading including the collection engine module and processing the pair trading information to calculate a sell / buy time point and an expected scenario of a set item;

상기 제어모듈의 기능 제어 하에 크롤딩된 빅데이터들을 다단계로 가공 및 분석하여 빅데이터 기반 증권종목 페어 트레이딩 리스트를 필터링하는 분석엔진모듈과;An analysis engine module for filtering and analyzing big data crawled under a function control of the control module in multiple stages to filter a big data-based stock trading pair trading list;

상기 제어모듈의 기능 제어 하에 분석엔진모듈에 의해 필터링된 상기 증권종목 페어 트레이딩 리스트를 대상으로 실제 매수/매도 진입시점을 산출한 다음 수익을 계산하여 최종 예상 시나리오를 산출하여 설정된 고객단말기나 전자주소로 전송시키는 결과산출모듈을 포함하는 증권 빅데이터기반 페어 트레이딩시스템을 제공한다.Under the control of the control module, the actual buying / selling entry point is calculated on the stock trading pair trading list filtered by the analysis engine module, and then the profit is calculated to calculate the final expected scenario to the set customer terminal or electronic address. It provides a securities big data-based pair trading system including a result calculation module for transmitting.

본 발명의 또 다른 특징은 수집엔진모듈이 제어모듈의 기능 제어 하에 각종 정보수집매체의 인터넷 사이트로부터 제공되는 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 구분하여 DB에 저장시키는 제1 과정과;Still another aspect of the present invention provides a method comprising: a first process of a collecting engine module crawling securities related big data provided from internet sites of various information collecting media under function control of a control module, and classifying the crawled data in a DB;

상기 제1 과정 후에 분석엔진모듈이 제어모듈의 기능 제어 하에 상기 크롤딩된 빅데이터들을 다단계로 가공 및 분석하여 빅데이터 기반 증권종목 페어 리스트를 필터링하는 제2 과정과;A second step of analyzing, by the analysis engine module, the crawled big data in a multi-step process and analyzing the big data-based securities pair list after the first process by controlling a function of a control module;

상기 제2 과정 후에 결과산출모듈이 제어모듈의 기능 제어 하에 분석엔진모듈에 의해 필터링된 상기 증권종목 페어 리스트를 대상으로 실제 매수/매도 진입시점을 산출한 다음 수익을 계산하여 최종 예상 시나리오를 산출하여 설정된 고객단말기나 전자주소로 전송시키는 제3 과정을 포함하는 증권 빅데이터 기반 페어 트레이딩시스템의 제어방법을 제공한다.After the second process, the result calculation module calculates the actual buy / sell entry point for the securities item pair list filtered by the analysis engine module under the control of the control module, calculates the profit and calculates the final expected scenario. It provides a control method of a securities big data-based pair trading system including a third process of transmitting to a set customer terminal or an electronic address.

상기와 같은 본 발명에 의하면, 증권관련 빅데이터들을 크롤링하여 다단계로 가공 및 분석하고 증권종목 페어 리스트를 산출한 후 실제 매수/매도 진입시점 및 최상 예상 시나리오를 산출하여 설정된 고객에게 전송함으로써, 종목 연관성에 관계없이 전 종목을 대상으로 페어 트레이딩종목군을 발굴하여 서비스를 제공하므로 그에 따라 증권투자자에 대한 고품위서비스를 제공할 수 있는 효과가 있다.According to the present invention as described above, by crawling securities-related big data processing and analysis in a multi-step, calculate the securities item pair list, calculate the actual buy / sell entry point and the best expected scenario and send to the set customer, the item relevance Irrespective of whether or not the fair trading group is discovered and provided to all stocks, high-quality services can be provided to securities investors.

상기와 같은 본 발명은 또한 주식투자자에게 증권 빅데이터를 기반으로 하여 산출된 페어 트레이딩 종목들에 대한 매수/매도 진입시점 및 최상 예상 시나리오를 제공하여 최적화된 매매결정을 내리도록 하므로 그에 따라 주식투자에 대한 손실을 최소화함과 더불어 수익률을 극대화시키는 효과도 있다.The present invention as described above also provides the stock investor with the buy / sell entry point and the best forecast scenario for the pair trading items calculated on the basis of the securities big data to make an optimized trading decision accordingly In addition to minimizing losses, it also maximizes returns.

도 1은 종래 종목 페어 트레이딩방법의 이례를 설명하는 플로우차트.
도 2는 본 발명에 따른 증권 빅데이터 기반 페어 트레이딩시스템을 설명하는 설명도.
도 3은 본 발명의 플로우차트.
도 4는 본 발명 시스템의 분석엔진모듈에 의해 실행되는 종목 페어링 과정의 일례를 설명하는 설명도.
도 5는 본 발명 시스템의 결과산출모듈에 의해 실행되는 결과산출과정의 일례를 설명하는 설명도. 1 is a flowchart illustrating an example of a conventional item pair trading method.
2 is an explanatory diagram illustrating a securities big data based pair trading system according to the present invention.
3 is a flowchart of the present invention.
4 is an explanatory diagram for explaining an example of an item pairing process executed by the analysis engine module of the present invention system.
5 is an explanatory diagram for explaining an example of a result calculation process executed by the result calculation module of the system of the present invention;

본 발명은 다양한 변환을 가할 수 있고 여러 가지 실시예를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 상세한 설명에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 실시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변환, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.As the invention allows for various changes and numerous embodiments, particular embodiments will be illustrated in the drawings and described in detail in the written description. However, this is not intended to limit the present invention to specific embodiments, it should be understood to include all transformations, equivalents, and substitutes included in the spirit and scope of the present invention. In the following description of the present invention, if it is determined that the detailed description of the related known technology may obscure the gist of the present invention, the detailed description thereof will be omitted.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다.Terms such as first and second may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another.

본 명세서에서 사용한 용어는 단지 특정한 실시예를 설명하기 위해 사용된 것으로, 본 발명을 한정하려는 의도가 아니다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. 본 명세서에서, "포함하다" 또는 "가지다" 등의 용어는 명세서상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다.The terminology used herein is for the purpose of describing particular embodiments only and is not intended to be limiting of the invention. Singular expressions include plural expressions unless the context clearly indicates otherwise. As used herein, the terms "comprise" or "have" are intended to indicate that there is a feature, number, step, action, component, part, or combination thereof described on the specification, and one or more other features. It is to be understood that the present invention does not exclude the possibility of the presence or the addition of numbers, steps, operations, components, components, or a combination thereof.

또한, 본 발명을 설명함에 있어서 관련된 공지 기술에 대한 구체적인 설명이 본 발명의 요지를 불필요하게 흐릴 수 있다고 판단되는 경우 그 상세한 설명을 생략한다.In addition, in describing the present invention, when it is determined that the detailed description of the related known technology may unnecessarily obscure the gist of the present invention, the detailed description thereof will be omitted.

이하, 본 발명의 실시예에 대해 관련 도면들을 참조하여 상세히 설명하기로 한다.Hereinafter, embodiments of the present invention will be described in detail with reference to the accompanying drawings.

도 2는 본 발명에 따른 증권 빅데이터 기반 페어 트레이딩시스템을 설명하는 설명도이고, 도 3은 본 발명의 플로우차트이며, 도 4는 본 발명 시스템의 분석엔진모듈에 의해 실행되는 종목 페어링 과정의 일례를 설명하는 설명도이고, 도 5는 본 발명 시스템의 결과산출모듈에 의해 실행되는 결과산출과정의 일례를 설명하는 설명도이다.2 is an explanatory diagram illustrating a securities big data based pair trading system according to the present invention, FIG. 3 is a flowchart of the present invention, and FIG. 4 is an example of an item pairing process executed by the analysis engine module of the present invention system. 5 is an explanatory diagram for explaining an example of a result calculation process executed by the result calculation module of the system of the present invention.

상기 도 2를 참조하면, 각종 정보수집매체(1) 예컨대, 각종 포털사이트, 증권거래소 사이트 및 증권정보제공 사이트로부터 제공되는 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 구분하여 DB(2)에 저장시키는 수집엔진모듈(3)과;Referring to FIG. 2, various information collecting media 1 crawls securities related big data provided from various portal sites, stock exchange sites, and stock information providing sites, and classifies the crawled data into DB 2. A collection engine module 3 for storing;

상기 수집엔진모듈(3)을 포함하여 빅데이터 기반 페어 트레이딩 전 과정을 제어하고 그 페어 트레이딩정보를 가공하여 설정된 종목의 매도/매수 시점 및 예상 시나리오가 산출되도록 제어하는 제어모듈(4)과;A control module (4) including the collection engine module (3) to control the entire process of big data-based pair trading and to process the pair trading information to calculate a sell / buy time point and an expected scenario of a set item;

상기 제어모듈(4)의 기능 제어 하에 크롤딩된 빅데이터들을 다단계로 가공 및 분석하여 빅데이터 기반 증권종목 페어 트레이딩 리스트를 필터링하는 분석엔진모듈(5)과;An analysis engine module (5) for filtering and analyzing big data crawled under a function control of the control module (4) in multiple stages to filter the big data-based securities stock trading list;

상기 제어모듈(4)의 기능 제어 하에 분석엔진모듈(5)에 의해 필터링된 상기 증권종목 페어 트레이딩 리스트를 대상으로 실제 매수/매도 진입시점을 산출한 다음 수익을 계산하여 최종 예상 시나리오를 산출하여 설정된 고객단말기나 전자주소로 전송시키는 결과산출모듈(6)을 포함하여 구성된다. Under the control of the control module 4, the actual buying / selling entry point is calculated on the stock trading pair trading list filtered by the analysis engine module 5, and then the profit is calculated to calculate the final expected scenario. And a result calculation module 6 for transmitting to a customer terminal or an electronic address.

그리고, 상기 수집엔진모듈(3)은 제어모듈(4)의 기능 제어 하에 인터넷을 통해 각종 정보수집매체(1) 예컨대, 각종 포털사이트, 코스피, 코스닥 및 각종 선물정보를 제공하는 증권거래소 사이트 및 증권정보제공 사이트로부터 제공되는 각종 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 설정된 분류절차에 따라 구분하여 DB(2)에 저장시킨다. 예컨대, 상기 수집엔진모듈(3)은 코스피 전종목데이터 및 코스닥 전종목데이터를 포함하는 시세데이터와, 코스피 200 및 코스닥 50 을 포함하는 지수데이터 및, 코스피업종 및 코스닥업종을 포함하는 업종데이터를 기간을 정하여 크롤링한다.In addition, the collection engine module 3 is a stock exchange site and securities providing various information collecting media 1, for example, various portal sites, KOSPI, KOSDAQ, and various futures information through the Internet under the control of the function of the control module 4. It crawls various securities-related big data provided from the information providing site and stores the crawled data in the DB 2 according to the set classification procedure. For example, the collection engine module 3 may be configured by setting quote period data including all KOSPI stock data and all KOSDAQ stock data, index data including KOSPI 200 and KOSDAQ 50, and industry data including KOSPI and KOSDAQ industries. Crawl

또한 상기 분석엔진모듈(5)은 제어모듈(4)의 기능제어하에 상기 수집엔진모듈(3)에 의해 크롤링된 빅데이터중 전종목을 대상으로 상관관계가 있는 지수 및 종목만을 먼저 추출하고, 그 추출된 지수 및 종목간의 스프레드(종목간의 차이)를 산출한다. 더 나아가, 상기 분석엔진모듈(5)은 상기와 같이 가공해서 추출된 결과물을 가지고 1차로 [수학식1]의 상관 관계식을 이용하여 상관 관계가 있는지를 분석한 후 예측을 할 수 없는 랜덤한 종목이 정상성(안정성)이 있는(일정범위의 위치와 설정된 일정기간 내에서의 특정종목의 가격변동성이 적은 경우를 말함)지를 2차로 분석하며, 상기 2차 분석후에 추출된 결과물이 3차로 정규성(정규분포-가격변동의 예측이 쉬운 종목)이 있는지를 분석한다. 예를 들어, 상기 분석엔진모듈(5)은 임의 두 종목을 선택하고 해당 종목들의 상관관계를 수학식 1에 넣어 산출하게 되는데, 이때, 로그차분을 통해 가격 변화량을 산출한 다음 수학식 2를 통해 가격변화량을 구하여 두종목 상관계수를 산출한다. In addition, the analysis engine module 5 first extracts only indexes and items that are correlated to all items among the big data crawled by the collection engine module 3 under the function control of the control module 4, and then extracts the items. Calculated indexes and spreads among stocks (differences between stocks). Furthermore, the analysis engine module 5 has a result obtained by processing as described above, and firstly analyzes whether there is a correlation using the correlation formula of [Equation 1] and then cannot predict a random item. Secondly, this normality (stability) (where the position of a certain range and the price volatility of a specific item within a set period) is analyzed secondly, and the result extracted after the second analysis is the third normality ( Analyze whether there is a normal distribution-an item that can predict price fluctuations easily. For example, the analysis engine module 5 selects two random items and calculates correlations between the corresponding items in Equation 1. At this time, the price change is calculated through logarithmic difference, and then Equation 2 is used. The correlation between the two stocks is calculated by obtaining the price change.

[수학식 1][Equation 1]

[수학식 2][Equation 2]

그리고, 상기 분석엔진모듈(5)은 제어모듈(4)의 기능 제어 하에 상기 수학식 1, 2에 의해 얻어진 두 종목들의 상관관계수를 이용하여 정상성(안정성)을 검증하게 되는데, 이 과정에서 먼저 수학식 3의 선형회귀를 통해 변수들을 산출한다. In addition, the analysis engine module 5 verifies normality (stability) by using correlation coefficients of two items obtained by Equations 1 and 2 under the control of the control module 4. First, variables are calculated through linear regression of Equation 3.

[수학식 3][Equation 3]

또한 상기 분석엔진모듈(5)은 상기와 같이 수학식 3의 선형회귀를 통해 변수들을 산출한 후 다시 수학식 4를 통해 상기 수학식 3에 의해 산출된 회귀 변수를 통해 정상성을 검증하게 된다(검증조건은 수학식 4에 기반한 회귀모형의 p-value<0.05). In addition, the analysis engine module 5 calculates the variables through the linear regression of Equation 3 as described above, and then verifies normality through the regression variables calculated by Equation 3 through Equation 4 ( The verification condition is p-value <0.05 of the regression model based on equation (4).

[수학식 4][Equation 4]

따라서, 상기 분석엔진모듈(5)이 상기와 같은 단계를 거쳐 실행하면, 데이터 사이즈를 줄이고, 페어 트레이딩을 할 수 있는 최적의 종목1 과 종목 2를 산출하게 된다. Therefore, when the analysis engine module 5 executes through the above steps, it reduces the data size and calculates the optimal item 1 and item 2 for pair trading.

더 나아가, 상기 결과산출모듈(6)에서는 상기 분석엔진모듈(5)에 의해 산출된 페어 트레이딩 종목들에 대한 매도/매수 진입시점을 각각 산출하고 예상수익에 대한 시나리오를 만들어서 고객에게 제공하게 된다. 이때, 상기 결과산출모듈(6)을 좀 더 구체적으로 살펴보면, 상기 결과산출모듈(6)은 상기 분석엔진모듈(5)에 의해 산출된 스프레드에서 제트스코어(표준점수)를 구해서 설정된 파라미터 임계치 예컨대, 2(파라미터 임계치)보다 크거나 작은것을 구해서 스프레드가 가장 크게 벌어질 때는 매도/매수 진입시점을 판단하고 그 판단된 매도/매도 진입시점의 가격들을 그래프나 리포트에 날짜별 예상수익에 대한 시나리오를 만들어서 고객에게 제공한다. 즉, 상기 결과산출모듈(6)은 수학식 5을 이용하여 정상성(안정성)을 검증하게 되는데, 이때 상기 결과산출모듈(6)은 먼저 진입시점 전략모델을 산출하고 그 산출결과을 이용하여 최적의 페어트레이딩 종목쌍을 산출하게 된다.In addition, the result calculation module 6 calculates selling / buying entry points for the pair trading items calculated by the analysis engine module 5 and creates a scenario for the expected profit and provides it to the customer. In this case, the result calculation module 6 will be described in more detail. The result calculation module 6 obtains a jet score (standard score) from a spread calculated by the analysis engine module 5, and sets a parameter threshold value, for example, When the spread is the biggest and the spread is the largest or smaller than 2 (parameter threshold), the sell / buy entry point is determined. Provide to customers. That is, the result calculation module 6 verifies normality (stability) using Equation 5, wherein the result calculation module 6 first calculates an entry point strategic model and uses the result of calculation A pair trading item pair is calculated.

[수학식 5][Equation 5]

다음에는 상기와 같은 구성으로 된 본 발명의 제어방법을 설명한다.Next, the control method of the present invention having the above configuration will be described.

본 발명의 방법은 도 3에 도시된 바와같이 수집엔진모듈이 제어모듈의 기능제어하에 각종 정보수집매체의 인터넷 사이트로부터 제공되는 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 구분하여 DB에 저장시키는 제1 과정(S1)과;According to the method of the present invention, as shown in FIG. 3, the collection engine module crawls the securities related big data provided from the Internet site of various information collecting media under the function control of the control module, and classifies the crawled data and stores them in the DB A first process (S1);

상기 제1 과정(S1)후에 분석엔진모듈이 제어모듈의 기능 제어 하에 상기 크롤딩된 빅데이터들을 다단계로 가공 및 분석하여 빅데이터 기반 증권종목 페어 리스트를 필터링하는 제2 과정(S2)과;A second step (S2) of, after the first step (S1), an analysis engine module filtering and analyzing the big data-based securities pair list by processing and analyzing the crawled big data in multiple stages under the function control of a control module;

상기 제2 과정(S2)후에 결과산출모듈이 제어모듈의 기능 제어 하에 분석엔진모듈에 의해 필터링된 상기 증권종목 페어 리스트를 대상으로 실제 매수/매도 진입시점을 산출한 다음 수익을 계산하여 최종 예상 시나리오를 산출하여 설정된 고객단말기나 전자주소로 전송시키는 제3 과정(S3)을 포함하여 구성된다. After the second process (S2), the result calculation module calculates the actual buying / selling entry point for the securities item pair list filtered by the analysis engine module under the function control of the control module, and then calculates the profit to be the final expected scenario. It is configured to include a third process (S3) to calculate and transmit to the set customer terminal or electronic address.

그리고, 상기 제1 과정(S1)에는 수집엔진모듈이 제어모듈의 기능 제어 하에 인터넷을 통해 각종 정보수집매체 예컨대, 각종 포털사이트, 코스피, 코스닥 및 각종 선물정보를 제공하는 증권거래소 사이트 및 증권정보제공 사이트로부터 제공되는 각종 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 설정된 분류절차에 따라 구분하여 DB에 저장시키는 단계를 더 포함한다. In the first step (S1), the collection engine module provides various information collecting media such as various portal sites, KOSPI, KOSDAQ, and various futures information through the Internet under the control of the control module. The method may further include crawling various securities related big data provided from the site and classifying the crawled data according to a set classification procedure and storing the crawled data in a DB.

예컨대, 상기 제1 과정(S1)에서는 수집엔진모듈이 코스피 전 종목 데이터 및 코스닥 전 종목 데이터를 포함하는 시세데이터와, 코스피 200 및 코스닥 50 을 포함하는 지수데이터 및, 코스피업종 및 코스닥업종을 포함하는 업종데이터를 크롤링하는 정밀 크롤링단계를 더 포함한다. For example, in the first process (S1), the collection engine module includes quote data including KOSPI stock data and KOSDAQ stock data, index data including KOSPI 200 and KOSDAQ 50, and KOSPI and KOSDAQ stocks. The method further includes a precise crawling step for crawling industry data.

또한 상기 제2 과정(S2)에는 분석엔진모듈이 제어모듈의 기능 제어 하에 수집엔진모듈에 의해 크롤링된 빅데이터중 전 종목을 대상으로 상관관계가 있는 지수 및 종목만을 먼저 추출하는 추출단계와, In addition, in the second process (S2), the analysis engine module first extracts only indexes and items that are correlated to all items among the big data crawled by the collection engine module under the control of the control module;

상기 추출단계후에 그 추출된 지수 및 종목간의 스프레드(종목간의 차이)를 산출하는 데이터 가공단계와, A data processing step of calculating a spread (difference between items) between the extracted indexes and items after the extraction step;

상기 데이터 가공단계후에 그 다음에서 가공해서 추출된 결과물을 가지고 1차로 상관 관계식을 이용하여 상관 관계가 있는지를 분석하는 1차 분석단계와, A first analysis step of analyzing whether or not there is a correlation by using a correlation equation first with the result extracted after the data processing step;

상기 1차 분석단계후에 예측을 할 수 없는 램덤한 종목이 정상성(안정성)이 있는 지를(레인지 범위 안에서 위치 혹은 설정된 일정기간내에 가격변동성이 적게 나타나는 지의 여부를 2차로 분석하는 2차 분석단계와, A second analysis step of analyzing whether random items that cannot be predicted after the first analysis step have normality (stability) (whether there is less price volatility within a range range or within a predetermined period of time); ,

상기 2차 분석단계후에 그다음에는 추출된 결과물이 3차로 정규성(정규분포-가격변동의 예측이 쉬운 종목인지의 여부)이 있는지를 분석하는 3차 분석단계를 더 포함한다. After the second analysis step, the third step further includes a third analysis step of analyzing whether the extracted result is third-order normality (regular distribution-whether the item is easy to predict price fluctuations).

따라서, 상기와 같은 제2 과정(S2)의 세부단계를 거쳐 실행하면, 상기 분석엔진모듈에 의해 데이터 사이즈를 줄일 수 있게 되고, 페어 트레이딩을 할 수 있는 최적의 종목1 과 종목 2를 산출하게 된다. Therefore, if the data is executed through the detailed steps of the second process S2 as described above, the data size can be reduced by the analysis engine module, and the optimal item 1 and item 2 for pair trading can be calculated. .

더 나아가, 상기 제3 과정(S3)에는 결과산출모듈이 분석엔진모듈에 의해 산출된 페어 트레이딩 종목들에 대한 매도/매수 진입시점을 각각 산출하고 예상수익에 대한 시나리오를 만들어서 고객에게 제공하는 단계를 더 포함한다. 이때, 상기 제3 과정(S3)에서, 결과산출모듈은 상기 분석엔진모듈에 의해 산출된 스프레드에서 제트스코어(표준점수)를 구해서 파라미터 임계치 예컨대, 2 보다 크거나 작은것을 구해서 스프레드가 가장 크게 벌어질 때는 매도/매수 진입시점을 판단하는 진입시점 판단단계와, 상기 진입시점 판단단계후에 그 판단된 매도/매도 진입시점의 가격들을 그래프나 리포트에 날짜별 예상수익에 대한 시나리오를 만들어서 고객에게 제공하는 예상 시나리오 산출단계를 더 포함한다.Furthermore, in the third process S3, the result calculation module calculates selling / buying entry points for the pair trading items calculated by the analysis engine module, respectively, and provides a scenario for the expected profit to the customer. It includes more. At this time, in the third process (S3), the result calculation module obtains a jet score (standard score) from the spread calculated by the analysis engine module, and obtains a parameter threshold, for example, larger or smaller than 2, so that the spread will be greatest. In this case, an entry point judgment step of judging the point of entry of a sell / buy and an anticipation of providing prices to the customer by creating scenarios for expected profits by date on a graph or report after the entry point judgment step are made. Further comprising the scenario calculation step.

환언하면, 본발명에 따른 증권 빅데이터기반 페어 트레이딩시스템(7)을 사용하려면, 먼저, 제어모듈(4)은 수집엔진모듈(3)을 통해 각종 정보수집매체(1) 예컨대, 각종 포털사이트, 증권거래소 사이트 및 증권정보제공 사이트로부터 제공되는 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 구분하여 DB(2)에 저장시킨다. 그리고, 상기와 같이 크롤링 과정이 완료되면, 상기 제어모듈(4)은 분석엔진모듈(5)을 통해 상기 크롤딩된 빅데이터들을 다단계로 가공 및 분석하여 빅데이터 기반 증권종목 페어 리스트를 필터링한다. 또한 상기와 같은 분석과정후에 제어모듈(4)은 결과산출모듈(6)을 통해 분석엔진모듈(5)에 의해 필터링된 상기 증권종목 페어 리스트를 대상으로 실제 매수/매도 진입시점을 산출한 다음 수익을 계산하여 최종 예상 시나리오를 산출하여 설정된 고객단말기나 전자주소로 전송시키므로, 증권 빅데이터를 기반으로 하는 페어 트레이딩을 실행하게 된다.In other words, in order to use the securities big data-based pair trading system 7 according to the present invention, first, the control module 4 uses the collection engine module 3 to collect various information collecting media 1, for example, various portal sites, It crawls the stock-related big data provided from the stock exchange site and the stock information providing site and stores the crawled data in the DB (2). When the crawling process is completed as described above, the control module 4 processes and analyzes the crawled big data in multiple stages through the analysis engine module 5 to filter the big data-based stock item pair list. In addition, after the analysis process as described above, the control module 4 calculates the actual buy / sell entry point for the securities item pair list filtered by the analysis engine module 5 through the result calculation module 6, and then generates profits. The final expected scenario is calculated and then sent to the set customer terminal or electronic address, so that pair trading based on securities big data is executed.

이때, 상기 데이터 크롤링 과정을 좀 더 구체적으로 살펴보면, 상기 수집엔진모듈(3)은 제어모듈(4)의 기능 제어 하에 인터넷을 통해 각종 정보수집매체(1) 예컨대, 각종 포털사이트, 코스피, 코스닥 및 각종 선물정보를 제공하는 증권거래소 사이트 및 증권정보제공 사이트로부터 제공되는 각종 증권관련 빅데이터들을 크롤링하고 그 크롤링된 데이터들을 설정된 분류절차에 따라 구분하여 DB(2)에 저장시키게 된다. 상기 데이터 크롤링과정에서, 상기 수집엔진모듈(3)은 빅데이터 수집대상을 예컨대, 코스피 전 종목 데이터 및 코스닥 전 종목 데이터를 포함하는 시세데이터와, 코스피 200 및 코스닥 50 을 포함하는 지수데이터 및, 코스피업종 및 코스닥업종을 포함하는 업종데이터를 구분하여 정밀하게 크롤링하게 된다. At this time, looking at the data crawling process in more detail, the collection engine module (3) is a variety of information gathering medium (1), for example, various portal sites, KOSPI, KOSDAQ and the Internet through the function control of the control module (4) It crawls various stock-related big data provided from the stock exchange site and the stock information providing site that provide various futures information, and stores the crawled data in the DB 2 according to the set classification procedure. In the data crawling process, the collection engine module 3 may include big data collection targets such as quote data including all KOSPI stock data and all KOSDAQ stock data, index data including KOSPI 200 and KOSDAQ 50, and KOSPI. Industry data, including industry and KOSDAQ, will be classified and crawled precisely.

또한 상기 분석과정에서 상기 분석엔진모듈(5)은 제어모듈(4)의 기능 제어 하에 수집엔진모듈(3)에 의해 크롤링된 빅데이터중 전종목을 대상으로 상관관계가 있는 지수 및 종목만을 먼저 추출하고, 그 추출된 지수 및 종목간의 스프레드(종목간의 차이)를 산출한다. 이때 상기 분석엔진모듈(5)은 상기와 스프레드 산출과정을 통해 산출된 그 추출된 결과물을 가지고 1차로 상관 관계식을 이용하여 상관 관계가 있는지를 분석하게 되고, 그 후 그 추출된 동일한 결과물을 가지고 예측을 할 수 없는 램덤한 종목이 정상성(안정성)이 있는 지를 2차로 분석하며, 그다음 다시한번 그 추출된 결과물이 정규성(정규분포-가격변동의 예측이 쉬운 종목)이 있는지를 3차로 분석한다. 따라서, 상기와 같은 세부분석과정을 거쳐 실행하면, 상기 분석엔진모듈(5)에 의해 크롤링시 대량으로 많았던 빅데이터의 데이터 사이즈가 상당히 줄어들게 되고, 페어 트레이딩을 할 수 있는 최적의 종목1 과 종목 2를 산출하게 된다. Further, in the analysis process, the analysis engine module 5 first extracts only indexes and items that are correlated to all items among the big data crawled by the collection engine module 3 under the control of the control module 4. Then, the extracted index and the spread between the stocks (differences between stocks) are calculated. At this time, the analysis engine module 5 analyzes whether there is a correlation using the correlation expression with the extracted result calculated through the process of calculating the spread and the first, and then predicts the extracted result with the same result. Secondly, we analyze whether the random item which cannot be done is normal (stability), and then again, if the extracted result is normal (regular distribution – item that is easy to predict price fluctuation). Therefore, if the analysis is carried out through the detailed analysis process as described above, the data size of the big data, which was largely crawled by the analysis engine module 5, is considerably reduced, and the best stocks 1 and 2 can be traded. Will yield.

상기 분석과정을 좀 더 구체적으로 설명하면, 상기 분석엔진모듈(5)은 상기와 같이 가공해서 추출된 결과물을 가지고 1차로 [수학식1]의 상관 관계식을 이용하여 상관 관계가 있는지를 분석한 후 예측을 할 수 없는 랜덤한 종목이 정상성(안정성)이 있는(일정범위의 위치와 설정된 일정기간 내에서의 특정종목의 가격변동성이 적은 경우를 말함)지를 2차로 분석하며, 상기 2차 분석 후에 추출된 결과물이 3차로 정규성(정규분포-가격변동의 예측이 쉬운 종목)이 있는지를 분석한다. 예를들어, 상기 분석엔진모듈(5)은 임의 두 종목을 선택하고 해당 종목들의 상관관계를 수학식 1에 넣어 산출하게 되는데, 이때, 로그차분을 통해 가격 변화량을 산출한 다음 수학식 2를 통해 가격변화량을 구하여 두 종목 상관계수를 산출한다. In more detail, the analysis engine module 5 analyzes whether or not there is a correlation using the correlation equation [Equation 1] with the result extracted by processing as described above. Secondly, we analyze whether random stocks that cannot be predicted have normality (stability) (that is, when the price range of certain stocks are small within a certain period of time) and after the second analysis. Analyze whether the extracted results are third-order normality (regular distribution-stocks with easy prediction of price changes). For example, the analysis engine module 5 selects two random items and calculates correlations between the corresponding items in Equation 1. At this time, the price change is calculated through logarithmic difference, and then Equation 2 is used. The correlation between the two stocks is calculated by obtaining the price change.

[수학식 1][Equation 1]

[수학식 2][Equation 2]

[수학식 3][Equation 3]

또한 상기 분석엔진모듈(5)은 상기와 같이 수학식 3의 선형회귀를 통해 변수들을 산출한 후 다시 수학식 4를 통해 상기 수학식 3에 의해 산출된 회귀 변수를 통해 정상성을 검중하게 된다(검증조건은 수학식 4에 기반한 회귀모형의 p-value<0.05). In addition, the analysis engine module 5 checks the normality through the regression variable calculated by Equation 3 through Equation 4 after calculating the variables through linear regression of Equation 3 as described above ( The verification condition is p-value <0.05 of the regression model based on equation (4).

[수학식 4][Equation 4]

예를 들어, 도 4의 그래프를 참고로 설명하면, 종목 대상을 동일업종이 아닌 비동일 업종으로서 대우증권과 광주신세계를 얻었고 기간은 약 1년(2015년 6월-2016년 6월)으로 설정하여서 상기 수학식 1 내지 4에 넣어 산출한 상관계수는 0.95를 얻게되는데, 상기 대우증권(종목1)과 광주신세계(종목2)은 영업일 기준 180일 동안 0.95의 상관도를 나타내었다.For example, referring to the graph of FIG. 4, the targets of the stocks were acquired by Daewoo Securities and Gwangju Shinsegae as non-identical sectors, and the period was set to about one year (June 2015-June 2016). The correlation coefficient calculated in the equations 1 to 4 is 0.95, and the Daewoo Securities (Item 1) and Gwangju Shinsegae (Item 2) showed a correlation of 0.95 for 180 business days.

더 나아가, 상기 결과산출과정에서, 상기 결과산출모듈(6)은 분석엔진모듈(5)에 의해 산출된 페어 트레이딩 종목들에 대한 매도/매수 진입시점을 각각 산출하고 예상수익에 대한 시나리오를 만들어서 고객에게 제공하게되는데, 이때, 상기 결과산출모듈(6)은 상기 분석엔진모듈(5)에 의해 산출된 스프레드에서 제트스코어(표준점수)를 구해서 설정된 파라미터 임계치 예컨대, 2 보다 크거나 작은것을 구해서 스프레드가 가장 크게 벌어질 때는 매도/매수 진입시점을 판단한 다음 그 판단된 매도/매도 진입시점의 가격들을 그래프나 리포트에 날짜별 예상수익에 대한 예상시나리오를 만들어서 고객에게 제공한다.Furthermore, in the result calculation process, the result calculation module 6 calculates selling / buying entry points for the pair trading items calculated by the analysis engine module 5 and creates a scenario for the expected profit. In this case, the result calculation module 6 obtains a jet score (standard score) from the spread calculated by the analysis engine module 5 to obtain a parameter threshold value, for example, greater than or less than 2, and the spread is calculated. At the highest level, we determine when to enter a sell / buy entry, and then provide prices to the customer by creating forecast scenarios for expected earnings on a date or graph in a report.

여기서, 상기 결과산출과정을 좀 더 구체적으로 설명하면, 더 나아가, 상기 결과산출모듈(6)에서는 상기 분석엔진모듈(5)에 의해 산출된 페어 트레이딩 종목들에 대한 매도/매수 진입시점을 각각 산출하고 예상수익에 대한 시나리오를 만들어서 고객에게 제공하게 된다. 이때, 상기 결과산출모듈(6)을 좀 더 구체적으로 살펴보면, 상기 결과산출모듈(6)은 상기 분석엔진모듈(5)에 의해 산출된 스프레드에서 제트스코어(표준점수)를 구해서 설정된 파라미터 임계치 예컨대, 2(파라미터 임계치)보다 크거나 작은것을 구해서 스프레드가 가장 크게 벌어질 때는 매도/매수 진입시점을 판단하고 그 판단된 매도/매도 진입시점의 가격들을 그래프나 리포트에 날짜별 예상수익에 대한 시나리오를 만들어서 고객에게 제공한다. Herein, the result calculation process will be described in more detail. Furthermore, the result calculation module 6 calculates selling / buying entry points for the pair trading items calculated by the analysis engine module 5, respectively. Create a scenario for the expected revenue and provide it to the customer. In this case, the result calculation module 6 will be described in more detail. The result calculation module 6 obtains a jet score (standard score) from a spread calculated by the analysis engine module 5, and sets a parameter threshold value, for example, When the spread is the biggest and the spread is determined to be greater than or less than 2 (parameter threshold), the sell / buy entry point is determined, and the price of the determined sell / sell entry point is created in a graph or report for scenarios of estimated earnings by date. Provide to customers.

즉, 상기 결과산출모듈(6)은 수학식 5을 이용하여 정상성(안정성)을 검증하게 되는데, 이때 상기 결과산출모듈(6)은 먼저 진입시점 전략모델을 산출하고 그 산출결과을 이용하여 최적의 페어트레이딩 종목쌍을 산출하게 된다.That is, the result calculation module 6 verifies normality (stability) using Equation 5, wherein the result calculation module 6 first calculates an entry point strategic model and uses the result of calculation A pair trading item pair is calculated.

[수학식 5][Equation 5]

예컨대, 상기 결과산출모듈(6)은 상기 대우증권(종목1)과 광주신세계(종목2)의 표준점수(Z-Score)를 ± 2를 기준으로 가정할 경우 스프레드 부호의 변동 횟수가 영업일 기준 180일 동안 18회로, 해당 기간 동안 거래 가능한 충분한 날짜를 확보한다. 이후 투자자들에게 두 종목 조합을 대상으로 하는 수익금 시뮬레이션을 실행하면되는데, 결과적으로 도 5에 도시된 바와같이 두 종목은 해당 기간동안 안정적으로 가격 변동이 있는 종목 쌍이라 할 수 있다. 또한 상기 해당 종목쌍이 그리는 시세 그래프는 안전성 검정에서 P-value <= 0.5를 갖는 유의미한 회귀모형이 됨을 알 수 있다.For example, when the result calculation module 6 assumes Z-Scores of the Daewoo Securities (item 1) and Gwangju Shinsegae (item 2) as ± 2, the number of changes of the spread sign is 180 based on the business day. 18 times during the day, ensure that there are enough days available for trading during that period. After that, investors can run a profit simulation targeting the two stock combinations. As a result, as shown in FIG. 5, the two stocks can be regarded as a stable stock price fluctuation during the period. In addition, it can be seen that the quote graph drawn by the corresponding pair of products becomes a significant regression model having P-value <= 0.5 in the safety test.

1 : 각종 정보수집매체 2 : DB
3 : 수집엔진모듈 4 ; 제어모듈
5 : 분석엔진모듈 6 : 결과산출모듈
7 : 페어 트레이딩시스템1: Various information collecting media 2: DB
3: collection engine module 4; Control module
5: Analysis engine module 6: Result calculation module
7: Fair Trading System

Claims

A collection engine module for crawling securities related big data provided from internet sites of various information collecting media and classifying the crawled data in a DB;
A control module for controlling the big data-based pair trading process including the collection engine module and processing the pair trading information to calculate a sell / buy time point and an expected scenario of a set item;
An analysis engine module for filtering the big data-based securities stock trading list by processing and analyzing the big data crawled in multiple stages under the control of the control module;
Under the control of the control module, the actual buying / selling entry point is calculated on the stock trading pair trading list filtered by the analysis engine module, and then the profit is calculated to calculate the final expected scenario to the set customer terminal or electronic address. It further comprises a result calculation module for transmitting;
The collection engine module further performs a function of crawling quote data including all KOSPI stock data and KOSDAQ stock data, index data including KOSPI 200 and KOSDAQ 50, and industry data including KOSPI and KOSDAQ stocks. ;
The analysis engine module extracts only indexes and stocks that are correlated to all stocks among the big data crawled by the collection engine module under the control of the control module, and calculates a spread between the extracted indexes and stocks. Including more;
The analysis engine module analyzes whether the correlation is a first order using a correlation expression with the processed extracted result, and then analyzes whether the random items that cannot be predicted are normal (stability) in the second order. Further including a function of analyzing whether the extracted result is third-order normal after the second analysis;
The result calculation module obtains a jet score from the spread calculated by the analysis engine module, obtains a value larger or smaller than the set parameter threshold, and judges the time when the spread is the largest, when the sell / buy entry point is made. Further comprising providing prices to entry points to the customer by creating scenarios for expected revenue per day in graphs or reports;
The analysis engine module selects two random stocks in order to calculate the pair trading stocks on the stock trading pair trading list, calculates a change in price through a log difference, and calculates a correlation between the stocks in Equation 1 below. After calculating the correlation coefficient of the two items, the variables are calculated through the linear regression of Equation 3 below using the correlation coefficient of the two items obtained under the control function of the control module to verify the normality (stability) Securities big data-based pair trading system.
[Equation 1]

[Equation 3]

A first process of the collecting engine module crawling securities related big data provided from internet sites of various information collecting media under function control of the control module, and classifying the crawled data in a DB;
A second process of analyzing and analyzing the crawled big data in a multi-step process by the analysis engine module under the function control of a control module after the first process to filter the big data-based stock item pair list;
After the second process, the result calculation module calculates the actual buy / sell entry point for the securities item pair list filtered by the analysis engine module under the control of the control module, calculates the profit and calculates the final expected scenario. Including a third process of transmitting to a set customer terminal or an electronic address;
In the first process, the collection engine module crawls quote data including all KOSPI and KOSDAQ stock data, index data including KOSPI 200 and KOSDAQ 50, and industry data including KOSPI and KOSDAQ industries. Further comprising steps;
In the second process, an analysis engine module extracts an index and an item that are correlated to all items among the big data crawled by the collection engine module under the control of the control module; A data processing step of calculating the indexes and spreads between the items (differences between the items);
In the third process, the result calculation module obtains a jet score (standard score) from the spread calculated by the analysis engine module, and obtains a value larger or smaller than a set parameter threshold to determine when to enter a sell / buy point when the spread is most widespread. An entry scenario determination step, and an expected scenario calculation step of providing prices to the customer by creating scenarios for expected revenue by date on a graph or report after the entry point determination step;
In the data processing step, a first analysis step of analyzing whether the analysis engine module has a result extracted by processing through the data processing step and correlation using a correlation equation firstly, and after the first analysis step A second analysis step of analyzing whether the random item that the engine module cannot predict is normal (stability) in a second step, and whether the extracted result is third-order normality after the second analysis step; The module further includes a third analysis step;
In the second process, the analysis engine module selects two random stocks to calculate the pair trading stocks on the stock trading list and calculates the correlation between the stocks in Equation 1 below to calculate the price change through the log difference. Calculating the correlation coefficients of the two items, and calculating the variables through the linear regression of Equation 3 below using the correlation coefficient of the two items obtained by the analysis engine module under the control of the control module. Stability) further comprising the step of controlling the securities big data-based pair trading system control method.
[Equation 1]

[Equation 3]