KR20200112495A

KR20200112495A - Server and method for providing commercial analysis services

Info

Publication number: KR20200112495A
Application number: KR1020190033082A
Authority: KR
Inventors: 김장현
Original assignee: 김장현
Priority date: 2019-03-22
Filing date: 2019-03-22
Publication date: 2020-10-05

Abstract

An objective of the present invention is to efficiently improve sales of a business. According to one embodiment of the present invention, a server for providing a business district analysis service comprises: a memory storing a program for providing a business district analysis service; and a processor to execute the program stored in the memory. The processor receives sales information including a business type, a location, a date, and sales from a business terminal, and calculates predicted sales of the business by using a fuzzy system based on the sales information, as the program is executed. The fuzzy system is modeled based on a cluster center wherein previously stored past sales information including a business type, a location, a date, and sales is clustered by a subtractive clustering algorithm. The subtractive clustering algorithm normalizes the past sales information to calculate the distance between data, and calculates the cluster center based on the size of the distance potential value between the normalized data.

Description

Commercial area analysis service providing server and its method {SERVER AND METHOD FOR PROVIDING COMMERCIAL ANALYSIS SERVICES}

본 발명은 상권 분석 서비스 제공 서버 및 그 방법에 관한 것으로서, 영업장의 위치, 매출, 날짜 등의 영업정보에 기초하여 예상 매출 등의 상권 관련 정보를 정확하게 파악하여 영업자에게 제공하기 위한 것이다.The present invention relates to a commercial area analysis service providing server and method thereof, and provides it to a business operator by accurately grasping commercial area related information such as expected sales based on business information such as location, sales, and date of a business place.

최근 최저임금 인상과 국제 경제의 불황으로 서민 경제의 근간이 되는 소상공인 경제가 악화되고 있다. 특히 빈부 격차가 커지고 있는 현재의 국내 경제에서 지역 간의 경제 차이도 커지고 있다. 따라서, 소상공인과 지역 경제를 활성화하고 발전시킬 수 있는 위치기반의 지역 소상공인 플랫폼의 도입이 필요한 실정이다.With the recent increase in the minimum wage and the recession of the international economy, the economy of small businesses, which is the basis of the common people's economy, is deteriorating. In particular, in the current domestic economy where the gap between the rich and the poor is widening, the economic gap between regions is also increasing. Therefore, it is necessary to introduce a location-based local small business platform that can revitalize and develop the small business and the local economy.

또한, 소상공인들에게 매출에 대한 리스크는 중요한 이슈 중에 하나이다. 자신이 하려는 업종과 위치 등의 상권에 따른 과거의 매출을 확인하고 이후 영업장의 미래 매출을 예측함으로써 소상공인들이 가게를 운영하는데 리스크를 줄일 수 있다. 이를 통해 계획적인 지출을 할 수 있으며 매출에 영향을 주는 부분을 분석할 수 있다.Also, for small business owners, the risk of sales is one of the important issues. It is possible to reduce the risk of small business owners operating a store by checking the past sales according to the business district such as the type of business and location they intend to do and predicting the future sales of the business site. This allows you to make planned expenditures and analyze the parts that affect sales.

그러나, 현 시점에서 영업장의 매출을 예측할 수 있는 마땅한 솔루션이 구비되어 있지 않은 실정이고, 매출을 예측한다고 하여도 정확하지 않아서 영업장 운영에 큰 도움을 주지 못하고 있다.However, at the present time, there is no suitable solution for predicting sales of the business site, and even predicting sales is not accurate, so it does not greatly help in business operation.

대한민국 공개특허공보 제10-2007-0076884호(발명의 명칭: 관능평가를 통한 매출액 예측시스템 및 그 예측시스템의 매출액 예측 방법)Republic of Korea Patent Publication No. 10-2007-0076884 (title of the invention: sales forecasting system through sensory evaluation and sales forecasting method of the forecasting system)

본 발명에 따른 상권 분석 서비스 제공 서버 및 그 방법은 상권을 분석하여 영업자에게 예상 매출 정보를 제공하며 매출에 영향을 미치는 다양한 요인을 분석한 데이터를 제공하여 영업자의 영업을 효율적으로 개선시키고자 한다.The server for providing a commercial area analysis service and its method according to the present invention is intended to efficiently improve the business of the business operator by providing predicted sales information to the business owner by analyzing the business area and by providing data analyzing various factors affecting the sales.

본 발명에 따른 상권 분석 서비스 제공 서버 및 그 방법은 소상공인들이 영업하고자 하는 상권에서 추후 예상 매출이 얼마인지 시계열적으로 예측하여 창업을 할 때 리스크를 줄이고자 한다. The server for providing a commercial area analysis service and its method according to the present invention aims to reduce the risk when starting a business by predicting in a time series how much future sales will be expected in the commercial area where small business owners intend to operate.

본 발명의 일 실시예에 따른 상권 분석 서비스를 제공하는 서버에 있어서, 상권 분석 서비스를 제공하는 프로그램이 저장된 메모리; 및 메모리에 저장된 프로그램을 실행하는 프로세서를 포함하고, 프로세서는 프로그램의 수행에 따라, 영업자 단말로부터 영업장 업종, 위치, 날짜 및 매출을 포함하는 영업정보를 수신하고, 영업정보를 기초로 퍼지 시스템(fuzzy system)을 이용하여 영업장의 예상 매출을 산출하며, 퍼지 시스템은 미리 저장된 영업장 업종, 위치, 날짜 및 매출을 포함하는 과거 영업정보가 섭트랙티브 클러스터링 알고리즘(subtractive clustering algorithm)을 통하여 클러스터링된 클러스터 중심을 기초로 하여 모델링된 것이고, 섭트랙티브 클러스터링 알고리즘은 과거 영업정보를 정규화시켜 데이터 간의 거리를 산출하고, 정규화된 데이터 간의 거리 포텐셜 값의 크기를 기초로 하여 클러스터 중심을 산출하는, 상권 분석 서비스 제공 서버를 제공하고자 한다.A server providing a commercial area analysis service according to an embodiment of the present invention, comprising: a memory storing a program providing a commercial area analysis service; And a processor for executing a program stored in the memory, wherein the processor receives sales information including business type, location, date and sales from the business operator terminal according to the execution of the program, and based on the sales information, a fuzzy system system), and the fuzzy system focuses on clusters in which past sales information including pre-stored business type, location, date, and sales are clustered through a subtractive clustering algorithm. It is modeled on the basis, and the subtractive clustering algorithm normalizes the past sales information to calculate the distance between data, and calculates the cluster center based on the size of the distance potential value between the normalized data. Want to provide.

본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버에 의하여 수행되는 상권 분석 서비스 제공 방법에 있어서, 영업자 단말로부터 영업장 업종, 위치, 날짜 및 매출을 포함하는 영업정보를 수신하는 단계; 및 영업정보를 기초로 퍼지 시스템(fuzzy system)을 이용하여 영업장의 예상 매출을 산출하는 단계를 포함하며, 퍼지 시스템은 미리 저장된 영업장 업종, 위치, 날짜 및 매출을 포함하는 과거 영업정보가 섭트랙티브 클러스터링 알고리즘(subtractive clustering algorithm)을 통하여 클러스터링된 클러스터 중심을 기초로 하여 모델링된 것이고, 섭트랙티브 클러스터링 알고리즘은 과거 영업정보를 정규화시켜 데이터 간의 거리를 산출하고, 정규화된 데이터 간의 거리 포텐셜 값의 크기를 기초로 하여 클러스터 중심을 산출하는, 상권 분석 서비스 제공 방법을 제공하고자 한다.In the commercial area analysis service providing method performed by the commercial area analysis service providing server according to an embodiment of the present invention, the method comprising: receiving business information including a business type, location, date, and sales from a business operator terminal; And calculating the expected sales of the business site using a fuzzy system based on the business information, and the fuzzy system includes the previously stored business information including the business type, location, date and sales of the business site. It is modeled based on the cluster center through a subtractive clustering algorithm, and the subtractive clustering algorithm normalizes past sales information to calculate the distance between data, and calculates the size of the distance potential between normalized data. It is intended to provide a method of providing a commercial area analysis service that calculates the cluster center on the basis.

본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버 및 그 방법은 상권을 분석하여 영업자에게 예상 매출 정보를 제공하며 매출에 영향을 미치는 다양한 요인을 분석한 데이터를 제공하여 영업자의 영업을 효율적으로 개선시킬 수 있다.The server and method for providing a commercial area analysis service according to an embodiment of the present invention provide predicted sales information to the business owner by analyzing the business area, and provide data that analyzes various factors affecting sales to efficiently improve the business of the business operator. I can make it.

본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버 및 그 방법은 소상공인들이 영업하고자 하는 상권에서 추후 예상 매출이 얼마인지 시계열적으로 예측하여 제공함으로써, 소상공인들이 창업을 할 때 리스크를 줄일 수 있다.The server and method for providing a commercial area analysis service according to an embodiment of the present invention can reduce the risk when small business owners start a business by providing a time-series prediction of how much future expected sales will be in a commercial area where small business owners intend to do business.

도 1은 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버의 구성을 보여주는 도면이다.
도 2는 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 방법의 진행 과정을 보여주는 흐름도이다.
도 3은 본 발명의 일 실시예에 따른 섭트랙티브 클러스터링 방법의 진행 과정을 보여주는 흐름도이다.
도 4는 본 발명의 일 실시예에 따른 퍼지 시스템의 구성을 보여주는 도면이다.
도 5는 본 발명의 일 실시예에 따른 예상 매출을 산출하는 프로세스의 일 예를 보여주는 흐름도이다.
도 6은 본 발명의 일 실시예에 따른 영업자 단말 프로그램의 예상 매출 통계 그래프 화면을 보여주는 도면이다.
도 7은 본 발명의 일 실시예에 따른 퍼지 시스템의 퍼지 규칙 개수에 따른 예측 오차를 보여주는 도면이다.1 is a diagram showing the configuration of a commercial area analysis service providing server according to an embodiment of the present invention.
2 is a flowchart illustrating a process of a method for providing a commercial area analysis service according to an embodiment of the present invention.
3 is a flowchart illustrating a process of a subtractive clustering method according to an embodiment of the present invention.
4 is a diagram showing the configuration of a purge system according to an embodiment of the present invention.
5 is a flowchart illustrating an example of a process of calculating expected sales according to an embodiment of the present invention.
6 is a diagram showing a graph of predicted sales statistics of a business operator terminal program according to an embodiment of the present invention.
7 is a diagram illustrating a prediction error according to the number of fuzzy rules in a fuzzy system according to an embodiment of the present invention.

아래에서는 첨부한 도면을 참조하여, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자가 용이하게 실시할 수 있도록 본 발명의 실시예를 상세히 설명한다. 그러나, 본 발명은 여러 가지 상이한 형태로 구현될 수 있으며 여기에서 설명하는 실시예에 한정되지 않는다. 그리고, 도면에서 본 발명을 명확하게 설명하기 위해서 설명과 관계없는 부분은 생략하였으며, 명세서 전체를 통하여 유사한 부분에 대해서는 유사한 도면 부호를 붙였다.Hereinafter, exemplary embodiments of the present invention will be described in detail with reference to the accompanying drawings so that those of ordinary skill in the art may easily implement the present invention. However, the present invention may be implemented in various different forms and is not limited to the embodiments described herein. Further, in the drawings, parts not related to the description are omitted in order to clearly describe the present invention, and similar reference numerals are attached to similar parts throughout the specification.

명세서 전체에서, 어떤 부분이 다른 부분과 "연결"되어 있다고 할 때, 이는 "직접적으로 연결"되어 있는 경우뿐만 아니라, 그 중간에 다른 소자를 사이에 두고 "전기적으로 연결"되어 있는 경우도 포함한다. 또한, 어떤 부분이 어떤 구성요소를 "포함"한다고 할 때, 이는 특별히 반대되는 기재가 없는 한 다른 구성요소를 제외하는 것이 아니라 다른 구성요소를 더 포함할 수 있는 것을 의미한다.Throughout the specification, when a part is said to be "connected" with another part, this includes not only "directly connected" but also "electrically connected" with another element interposed therebetween. . In addition, when a part "includes" a certain component, it means that other components may be further included rather than excluding other components unless specifically stated to the contrary.

이하에서는, 첨부한 도면을 참조하여 본 발명에 따른 상권 분석 서비스 제공 서버 및 그 방법에 대하여 설명하기로 한다.Hereinafter, a commercial area analysis service providing server and a method according to the present invention will be described with reference to the accompanying drawings.

도 1은 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버의 구성을 보여주는 도면이다.1 is a diagram showing the configuration of a commercial area analysis service providing server according to an embodiment of the present invention.

도 1을 참조하면, 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버(100)는 영업자 단말과 통신하여 영업자 단말로부터 제공되는 영업 정보를 기초로 하여 상권 분석 결과를 영업자 단말로 제공하기 위한 서버에 관한 것이다.Referring to FIG. 1, a commercial area analysis service providing server 100 according to an embodiment of the present invention is a server for providing a commercial area analysis result to a business area based on sales information provided from the business operator terminal in communication with a business operator terminal. It is about.

상권 분석 서비스 제공 서버(100)는 통신 모듈(110), 메모리(120), 프로세서(130), 데이터베이스(DB: 140)를 포함할 수 있다.The commercial area analysis service providing server 100 may include a communication module 110, a memory 120, a processor 130, and a database (DB: 140).

통신 모듈(110)은 통신망과 연동하여 상권 분석 서비스 제공 서버(100)에 통신 인터페이스를 제공하는데, 영업자 단말과 데이터를 송수신하는 역할을 수행할 수 있다. 여기서, 통신 모듈(110)은 다른 네트워크 장치와 유무선 연결을 통해 제어 신호 또는 데이터 신호와 같은 신호를 송수신하기 위해 필요한 하드웨어 및 소프트웨어를 포함하는 장치일 수 있다.The communication module 110 provides a communication interface to the commercial area analysis service providing server 100 by interworking with a communication network, and may perform a role of transmitting and receiving data to and from a business operator terminal. Here, the communication module 110 may be a device including hardware and software necessary for transmitting and receiving a signal such as a control signal or a data signal through a wired or wireless connection with another network device.

메모리(120)는 상권 분석 서비스를 제공하기 위한 프로그램이 기록될 수 있다. 또한, 메모리(120)는 프로세서(130)가 처리하는 데이터를 일시적 또는 영구적으로 저장하는 기능을 수행할 수 있다. 여기서, 메모리(120)는 휘발성 저장 매체(volatile storage media) 또는 비휘발성 저장 매체(non-volatile storage media)를 포함할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.In the memory 120, a program for providing a commercial area analysis service may be recorded. In addition, the memory 120 may perform a function of temporarily or permanently storing data processed by the processor 130. Here, the memory 120 may include a volatile storage medium or a non-volatile storage medium, but the scope of the present invention is not limited thereto.

프로세서(130)는 상권 분석 서비스 제공 서버(100)에서 상권 분석 서비스를 제공하기 위한 프로그램이 수행하는 전체 과정을 제어할 수 있다. 프로세서(130)가 수행하는 과정의 각 단계에 대해서는 도2 내지 도 7을 참조하여 후술하기로 한다.The processor 130 may control an entire process performed by a program for providing a commercial area analysis service in the commercial area analysis service providing server 100. Each step of the process performed by the processor 130 will be described later with reference to FIGS. 2 to 7.

여기서, 프로세서(130)는 프로세서(processor)와 같이 데이터를 처리할 수 있는 모든 종류의 장치를 포함할 수 있다. 여기서, '프로세서(processor)'는, 예를 들어 프로그램 내에 포함된 코드 또는 명령으로 표현된 기능을 수행하기 위해 물리적으로 구조화된 회로를 갖는, 하드웨어에 내장된 데이터 처리 장치를 의미할 수 있다. 이와 같이 하드웨어에 내장된 데이터 처리 장치의 일 예로서, 마이크로프로세서(microprocessor), 중앙처리장치(central processing unit: CPU), 프로세서 코어(processor core), 멀티프로세서(multiprocessor), ASIC(application-specific integrated circuit), FPGA(field programmable gate array) 등의 처리 장치를 망라할 수 있으나, 본 발명의 범위가 이에 한정되는 것은 아니다.Here, the processor 130 may include all types of devices capable of processing data, such as a processor. Here, the'processor' may refer to a data processing device embedded in hardware having a circuit physically structured to perform a function represented by a code or instruction included in a program. As an example of a data processing device built into the hardware as described above, a microprocessor, a central processing unit (CPU), a processor core, a multiprocessor, and an application-specific integrated (ASIC) circuit) and processing devices such as field programmable gate arrays (FPGAs), but the scope of the present invention is not limited thereto.

데이터베이스(140)에는 상권 분석 서비스를 제공하기 위한 정보가 저장되어 있을 수 있다. 예를 들어, 후술할 영업장 위치, 업종, 광고비, 매출, 날짜 등을 포함하는 영업 정보 등이 저장될 수 있다.In the database 140, information for providing a commercial area analysis service may be stored. For example, sales information including a location of a business place, type of business, advertisement cost, sales, date, etc. to be described later may be stored.

이하에서는, 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버(100)가 상권 분석 서비스를 제공하는 방법에 대하여 상세하게 설명하기로 한다.Hereinafter, a method of providing the commercial area analysis service providing server 100 according to an embodiment of the present invention will be described in detail.

도 2를 참조하면, 먼저 상권 분석 서비스 제공 서버(100)는 영업자 단말로부터 영업장 업종, 위치, 날짜, 매출을 포함하는 영업정보를 수신하는 단계(S200)를 수행할 수 있다. 여기서 영업정보는 영업자가 영업장을 운영하면서 생성되는 정보로서, 광고비, 광고 효율(매출/광고비*100%) 등을 더 포함할 수 있다.Referring to FIG. 2, first, the commercial area analysis service providing server 100 may perform a step S200 of receiving sales information including a business type, location, date, and sales from a business operator terminal. Here, the sales information is information generated while a business operator operates a business site, and may further include advertisement cost, advertisement efficiency (sales/advertisement cost * 100%), and the like.

단계(S200) 다음으로, 상권 분석 서비스 제공 서버(100)는 영업정보를 기초로 퍼지 시스템(fuzzy system)을 이용하여 영업장의 예상 매출을 산출하는 단계(S210)를 수행할 수 있다.Step S200 Next, the commercial area analysis service providing server 100 may perform a step S210 of calculating the expected sales of the business site using a fuzzy system based on the business information.

여기서, 퍼지 시스템은 미리 저장된 과거의 영업장 업종, 위치, 날짜, 매출을 포함하는 과거 영업정보가 섭트랙티브 클러스터링 알고리즘(subtractive clustering algorithm)을 통하여 클러스터링된 클러스터 중심을 기초로 하여 모델링된 것일 수 있다. 과거 영업정보는 해당 상권에서 다수의 영업장이 운영된 결과로 생성된 정보로서, 광고비, 광고 효율(매출/광고비*100%) 등을 더 포함할 수 있다.Here, the fuzzy system may be modeled based on a cluster center in which past sales information including previously stored past business types, locations, dates, and sales are clustered through a subtractive clustering algorithm. The past business information is information generated as a result of the operation of a number of business sites in the corresponding commercial area, and may further include advertising costs, advertising efficiency (sales/advertising costs * 100%), and the like.

섭트랙티브 클러스터링 알고리즘은 과거 영업정보를 정규화하고, 데이터 간의 거리를 산출하여 정규화된 데이터 간의 거리 포텐셜 값의 크기를 기초로 하여 클러스터 중심을 산출하기 위한 것이다. 바람직하게, 섭트랙티브 클러스터링 알고리즘은 정규화된 데이터 간의 거리 포텐셜 값의 크기가 가장 큰 클러스터 중심을 첫번째 클러스터 중심으로 구하고, 첫번째 클러스터 중심의 영향을 제거한 다음에 데이터 간의 거리 포텐셜 값의 크기가 가장 크거나, 포텐셜 값의 크기가 작더라도 클러스터 중심 간의 거리가 큰 클러스터 중심을 두번째 클러스터 중심으로 구하는 과정을 반복하여, 복수 개의 클러스터 중심을 구하기 위한 것일 수 있다.The subtractive clustering algorithm normalizes past sales information, calculates a distance between data, and calculates a cluster center based on the size of a distance potential value between normalized data. Preferably, the subtractive clustering algorithm finds the center of the cluster with the largest size of the distance potential between normalized data as the center of the first cluster, removes the influence of the center of the first cluster, and then the size of the distance potential between the data is the largest or , Even if the size of the potential value is small, a process of obtaining a cluster center having a large distance between cluster centers as a second cluster center may be repeated to obtain a plurality of cluster centers.

도 3은 본 발명의 일 실시예에 따른 섭트랙티브 클러스터링 방법의 진행 과정을 보여주는 흐름도이다.3 is a flowchart illustrating a process of a subtractive clustering method according to an embodiment of the present invention.

이하에서는 도 3을 참조하여, 섭트랙티브 클러스터링 알고리즘에 대해 구체적으로 설명하기로 한다.Hereinafter, a subtractive clustering algorithm will be described in detail with reference to FIG. 3.

퍼지 시스템에 입출력 데이터가 주어졌을 때, 퍼지 시스템을 모델링하는 방법 중 하나인 데이터 클러스터링 알고리즘은 목적 함수를 최소화하기 위해, 퍼지 시스템을 최적화하는 기법으로서 K-평균(K-means) 알고리즘이나 FCM(Fuzzy C-Means) 알고리즘에 기초하고 있다. K-평균 알고리즘은 클러스터 범위 안의 모든 데이터 점들로부터 클러스터 중심(cluster center)까지의 거리를 제곱한 것들의 합으로 나타내는 성능 지표의 최소화에 기반을 두고 있으며, FCM 알고리즘은 각 데이터의 점과 각 클러스터 중심과의 거리를 기초로 하여 유사도 측정을 기반으로 하는 목적 함수의 최적화방식을 사용한다. When input/output data is given to a fuzzy system, the data clustering algorithm, which is one of the methods of modeling a fuzzy system, is a technique that optimizes the fuzzy system to minimize the objective function, using the K-means algorithm or FCM (Fuzzy It is based on the C-Means) algorithm. The K-means algorithm is based on minimization of a performance index expressed as the sum of the squared distances from all data points in the cluster range to the cluster center, and the FCM algorithm Based on the distance to and based on the similarity measurement, an optimization method of the objective function is used.

또 다른 클러스터링 알고리즘으로는 산(Mountain) 클러스터링 방법이 있다. 산 클러스터링 방법은 데이터 영역을 그리드 선(grid line)으로 분할하고 데이터 값과 그리드 선의 교차점 사이의 포텐셜 값(potential value)을 계산한다. 포텐셜 값은 거리의 함수로 주어지며 계산된 포텐셜 값 중 가장 큰 값을 갖는 그리드 선의 교차점이 첫 번째 클러스터 중심(cluster center)이 된다. 두 번째 클러스터 중심의 선정은 두 번째 클러스터 중심이 첫 번째 클러스터 중심 근처에 결정되는 것을 막기 위해 첫 번째 클러스터 중심의 영향을 제거한 후 두 번째 클러스터 중심을 구하는 방식으로 이루어진다. 이러한 과정은 포텐셜 값이 일정한 값에 도달할 때까지 반복된다. 그러나, 이러한 방법은 차원이 증가함에 따라 계산량이 지수적으로 증가하는 문제를 가지고 있다.Another clustering algorithm is the mountain clustering method. The acid clustering method divides a data area into grid lines and calculates a potential value between a data value and an intersection point of the grid line. The potential value is given as a function of distance, and the intersection of the grid line having the largest value among the calculated potential values becomes the first cluster center. The second cluster center is selected by removing the influence of the first cluster center to prevent the second cluster center from being determined near the first cluster center, and then obtaining the second cluster center. This process is repeated until the potential value reaches a constant value. However, this method has a problem that the amount of computation exponentially increases as the dimension increases.

따라서, 본 발명에서는 위와 같은 문제를 보완하기 위해 데이터 영역을 그리드 선으로 구분하지 않고 주어진 데이터에서 클러스터 중심을 구하는 변형된 알고리즘인 섭트랙티브 클러스터링 알고리즘을 사용한 것이다.Accordingly, in the present invention, in order to compensate for the above problem, a subtractive clustering algorithm, which is a modified algorithm that obtains the cluster center from a given data, is used without dividing the data area with a grid line.

섭트랙티브 클러스터링 알고리즘은 데이터간의 거리의 함수로 주어지는 포텐셜 값이 최대인 점을 첫 번째 클러스터 중심으로 선정하고 첫 번째 클러스터 중심의 영향을 제거한 상태에서 최대 포텐셜 값을 갖는 데이터가 다음 클러스터 중심이 된다. 이러한 과정은 포텐셜 값이 일정한 기준에 도달할 때까지 반복된다.In the subtractive clustering algorithm, the point with the maximum potential value given as a function of the distance between data is selected as the center of the first cluster, and the data with the maximum potential value becomes the center of the next cluster with the influence of the center of the first cluster removed. This process is repeated until the potential value reaches a certain criterion.

입력이 n차원이고 출력이 1차원 공간을 이루고 있는 (n+1)차원 입출력 공간상에서 n개의 데이터

가 주어졌을 때, 섭트랙티브 클러스터링 알고리즘은 다음과 같은 순서로 이루어진다. 첫째, 주어진 데이터를 [0,1]로 정규화 한다(S300). 둘째, 데이터 간의 거리를 구한다. 즉, 각 데이터 간의 거리는 하기 [수학식1]과 같이 주어지는 함수 P로 표현되며 P_i를 포텐셜 값이라 한다. 그리고, 데이터 간의 거리 포텐셜 값이 가장 큰 k(1≤k≤N)번째 클러스터 중심을 산출한다(S310).N data in the (n+1)-dimensional input/output space where the input is n-dimensional and the output is a 1-dimensional space

When is given, the subtractive clustering algorithm is performed in the following order. First, the given data is normalized to [0,1] (S300). Second, find the distance between data. That is, the distance between each data is expressed by a function P given as in [Equation 1] below, and P _i is called a potential value. Then, the k (1≦k≦N)-th cluster center having the largest distance potential between data is calculated (S310).

[수학식 1][Equation 1]

P_i = 데이터 간의 거리 포텐셜 값 P _i = distance potential value between data

N = 입력 데이터 개수N = number of input data

X_i, X_j = 데이터 값X _i , X _j = data value

여기에서 α는

로 주어지며, r_a는 양의 상수로서 r_a밖의 데이터는 포텐셜 값에는 영향을 거의 주지 못하게 된다. 실험을 통해 r_a가 0.3일 때 최적화된 결과를 보여주었다. 즉, r_a가 0.3 일 때, 이를 기초로 생성된 퍼지 규칙을 통해 알고리즘을 수행한 결과 오차율이 최소로 출력 되었다.Where α is

It is given as, and r _a is a positive constant, and data outside of r _a hardly affects the potential value. Through the experiment, the optimized result was shown when r _a was 0.3. That is, when r _a is 0.3, the error rate is output to the minimum as a result of performing the algorithm through the fuzzy rule generated based on this.

구체적으로, 데이터를 정규화 한 후에, 데이터 간의 거리 포텐셜 값이 가장 큰 첫 번째 클러스터 중심을 구한다. N개의 포텐셜 값 중 가장 큰 값을 P₁ ^*라 놓고 이 때의 데이터가 첫 번째 클러스터 중심 X₁ ^*가 된다. 그 다음으로, 첫 번째 클러스터 중심의 영향을 제거한다. 첫 번째 클러스터 중심 근처에는 많은 데이터가 존재하기 때문에 그 영향을 제거하지 않으면 두 번째 클러스터 중심 또한 첫 번째 클러스터 중심 근처에 발생될 가능성이 높기 때문에 하기 [수학식 2]와 같이 첫 번째 클러스터 중심의 영향을 제거한 포텐셜 값 P_i'을 구한다. Specifically, after normalizing the data, the center of the first cluster having the largest distance potential value between the data is obtained. The largest of the N potential values is P ₁ ^* , and the data at this time becomes the center of the first cluster X ₁ ^* . Next, the influence of the center of the first cluster is removed. Since there is a lot of data near the center of the first cluster, if the influence is not removed, the center of the second cluster is likely to occur near the center of the first cluster. The removed potential value P _i 'is calculated.

[수학식2][Equation 2]

여기에서 P_i는 식 [수학식1]에 의해 계산된 값이다. 또한, β는

로 주어지며, r_b는 양의 상수로서 r_a보다 큰 값을 취해 클러스터 중심 근처에 다음 클러스터 중심이 나타나지 않도록 한다. 실험을 통해 r_b값의 설정은 1.5r_a 로 하였을 때 좋은 결과를 보여주었다. 즉, r_b가 1.5r_a일 때, 이를 기초로 생성된 퍼지 규칙을 통해 알고리즘을 수행한 결과 오차율이 최소로 출력 되었다.Here, P _i is a value calculated by Equation [Equation 1]. Also, β is

Is given by, and r _b is a positive constant and takes a value greater than r _a so that the next cluster center does not appear near the cluster center. Through the experiment, the setting of the r _b value showed good results when set to 1.5 r _a . That is, when r _b is 1.5 r _a , the error rate is output to the minimum as a result of performing the algorithm through the fuzzy rule generated based on this.

이를 통해, 첫 번째 클러스터 중심의 영향을 제거한 N개의 포텐셜 값 P_i'중 가장 높은 값을 P₂ ^*라 놓고, 이 때의 데이터가 두 번째 클러스터 중심 X₂ ^*으로서 두 번째 클러스터 중심을 구할 수 있다. Thereby, removal of the influence of the first cluster center N of potential value P _i 'the highest place la P ₂ ^* in the data of this time is a second cluster center X ₂ ^* can obtain the second cluster center .

이러한 과정을 반복하여, k번째 클러스터 중심 X_k ^*가 구해졌을 때, k번째 클러스터 중심의 영향을 제거한 포텐셜 값은 하기 [수학식3]으로 표현할 수 있으며 (k+1)번째 클러스터 중심을 구할 수 있다(S320). When the k-th cluster center X _k ^* is obtained by repeating this process, the potential value from which the influence of the k-th cluster center has been removed can be expressed by the following [Equation 3], and the (k+1)-th cluster center can be obtained. Yes (S320).

[수학식 3][Equation 3]

P_i ^' = k번째 클러스터 중심의 영향을 제거한 포텐셜 값P _i ^' = potential value with the effect of the k-th cluster center removed

P_i = 데이터 간의 거리 포텐셜 값P _i = distance potential value between data

P_k ^*= k 번째 클러스터 중심의 포텐셜 값P _k ^* = potential value of the k-th cluster center

X_i = 데이터X _i = data

β =

, (r_b=양의 상수)β =

, (r _b = positive constant)

그 다음으로,

을 클러스터 중심들

중에서 상호간의 가장 짧은 거리로 정의하였을 때,

이면 클러스터 중심으로 채택하고 위의 과정을 반복한다.

이고

이면 클러스터 중심으로 채택하고,

이고

이면 그 때의

를 0으로 하고 다음으로 높은 포텐셜 값을 선정한 후,

이면 새로운 클러스터 중심으로 채택한다.

이면 위의 과정의 반복을 중단한다. Next,

Cluster centers

Among them, when defined as the shortest distance between each other,

In this case, it is adopted as the cluster center and the above process is repeated.

ego

Adopt the back cluster center,

ego

Back then

Is set to 0 and the next highest potential value is selected,

If so, it is adopted as the center of the new cluster.

If so, the repetition of the above process is stopped.

이 때,

와

은 클러스터 중심 선정의 상하 기준이 되는 상수로서 0부터 1사이의 값이 된다. 즉, P_k ^*를 P₁ ^* 으로 나누어서 첫 번째 포텐셜 값의 크기를 1로 만들었으며, 또한 클러스터 중심 선정의 상하 기준(

,

)을 둠으로써 P_k ^*/P₁ ^* 이

보다 크면 클러스터 중심으로 선정하고

보다 작으면 클러스터 중심으로 선정하지 않으며 그 사이일 경우에는 조건부로 클러스터 중심으로 선정하게 된다(S330). 이러한 조건부를 선정 기준에 넣음으로써 포텐셜 값이 작더라도 클러스터 중심 간의 거리(

)가 크면 클러스터 중심으로 선정되게 된다.At this time,

Wow

Is a constant that is the upper and lower standard of cluster center selection and is a value between 0 and 1. That is, P _k ^* is divided by P ₁ ^* to make the size of the first potential value 1, and the upper and lower criteria for cluster center selection (

,

), P _k ^* /P ₁ ^*

If it is larger, it is selected as the cluster center

If it is smaller than, the cluster center is not selected, and in the case of the interval, the cluster center is conditionally selected (S330). By putting these conditionals into the selection criterion, the distance between cluster centers (

If) is large, it is selected as the cluster center.

위와 같은 과정을 통해 퍼지 시스템에 입출력 데이터가 주어졌을 때 입출력 공간이 (n+1)차원인 M개의 클러스터 중심

을 결정할 수 있으며, 결정된 클러스터 중심을 기초로 하여 퍼지 시스템을 모델링할 수 있는 것이다.When input/output data is given to the fuzzy system through the above process, the input/output space is centered on M clusters with (n+1) dimensions.

Can be determined, and a fuzzy system can be modeled based on the determined cluster center.

도 4는 본 발명의 일 실시예에 따른 퍼지 시스템의 구성을 보여주는 도면이다. 4 is a diagram showing the configuration of a purge system according to an embodiment of the present invention.

이하에서는 도 4 를 참조하여, 입력이 n차원이고 출력이 1차원으로 주어진 N개의 데이터를 클러스터링하여 (n+1)차원을 갖는 M개의 클러스터 중심을 결정하는 과정으로부터 M개의 규칙을 갖는 퍼지 시스템을 모델링하는 과정에 대해 살펴보기로 한다. Hereinafter, referring to FIG. 4, a fuzzy system having M rules is formed from the process of determining the centers of M clusters having (n+1) dimensions by clustering N data given in n-dimensional input and 1-dimensional output. Let's look at the modeling process.

퍼지 시스템 모델링은 맘다니(Mamdani) 퍼지모델 또는 T-S-K(Takagi-Sugeno-Kang) 퍼지모델을 기초로 수행될 수 있다. 맘다니 퍼지 알고리즘으로 모델링할 경우, 모델링 과정은 클러스터 중심을 입출력 공간으로 분할하는 과정과 퍼지규칙으로 매핑하는 과정으로 나누어진다. T-S-K 퍼지 알고리즘으로 모델링할 경우에는 퍼지규칙으로 매핑 하는 과정에 후건부 규칙이 갖는 파라미터의 최적화 과정이 첨가된다.Fuzzy system modeling may be performed based on a Momdani fuzzy model or a Takagi-Sugeno-Kang (T-S-K) fuzzy model. In the case of modeling with the mamdani fuzzy algorithm, the modeling process is divided into a process of dividing the cluster center into input/output spaces and a process of mapping with fuzzy rules. In the case of modeling with the T-S-K fuzzy algorithm, the process of optimizing the parameters of the post-construction rule is added to the process of mapping with the fuzzy rule.

퍼지 시스템이 가지는 핵심적인 특징은 주어진 정보를 퍼지분할(fuzzy partitioning)하는 개념으로서 수치값(crisp number)대신 퍼지집합(fuzzy sets)에 기초하여 동작하는데, 퍼지집합은 단순한 숫자보다 더 많은 정보를 포함하게 된다. 따라서 퍼지집합을 이용함으로써 부정확성에 기초한 정보의 일반화를 이룰 수 있다. A key feature of a fuzzy system is the concept of fuzzy partitioning of given information, which operates based on fuzzy sets instead of a crisis number. Fuzzy sets contain more information than simple numbers. Is done. Therefore, generalization of information based on inaccuracy can be achieved by using fuzzy sets.

퍼지 시스템의 설계 과정은 먼저 입출력 변수들을 결정하고 각 변수들의 공간을 분할하여 적정한 수의 언어라벨을 할당하는 것으로 시작된다. 입출력 변수에 대한 언어라벨이 결정되면, 이를 이용하여 전문가의 지식을 표현하기 위한 퍼지규칙들을 정의하고, 각 언어라벨에는 멤버십함수를 대응시켜 퍼지집합으로 만든다. 이 밖에도 퍼지화, 비퍼지화 그리고 추론방법 등을 결정하여야 한다. The design process of a fuzzy system begins by first determining input/output variables, dividing the space of each variable, and allocating an appropriate number of language labels. When language labels for input/output variables are determined, fuzzy rules for expressing expert knowledge are defined using them, and membership functions are mapped to each language label to form a fuzzy set. In addition, it is necessary to determine the method of fuzzification, defuzzification, and reasoning.

도4를 참조하면, 퍼지 시스템은 퍼지화(fuzzification), 규칙베이스(rule base), 퍼지추론(fuzzy inference), 그리고 비퍼지화(defuzzification)로 구성되어 동작하는 것을 볼 수 있다.Referring to FIG. 4, it can be seen that the fuzzy system is composed of fuzzification, rule base, fuzzy inference, and defuzzification.

퍼지화(fuzzification)는 자연 언어의 모호함과 불확실한 정보를 취급하는데 중요한 역할을 한다. 특히, 자율이동로봇의 입출력 데이터는 항상 수치 데이터이므로 이를 퍼지시스템에서 사용하려면 수치 데이터를 퍼지집합으로 변환해야 한다. 이러한 방법에는 측정된 수치 값이 신뢰성이 있다고 간주될 때 측정치를 퍼지 싱글톤(fuzzy singleton)으로 변환시키는 방법과, 측정된 수치 값이 외란 등의 영향으로 불확실하다고 간주될 때 측정치의 확률적인 특성을 고려하여 가우시안, 이등변삼각형, 사다리꼴 형태의 퍼지집합으로 변환하는 방법이 있다. Fuzzification plays an important role in dealing with ambiguity and uncertain information in natural language. In particular, since the input/output data of the autonomous mobile robot is always numerical data, the numerical data must be converted into a fuzzy set to be used in a fuzzy system. These methods include converting the measured value to a fuzzy singleton when the measured numerical value is considered reliable, and the probabilistic characteristics of the measured value when the measured numerical value is considered to be uncertain due to influences such as disturbance. Considering this, there is a method of converting to a fuzzy set in the form of Gaussian, isosceles triangle, and trapezoid.

규칙베이스(rule base)는 전문가의 경험이나 지식으로부터 퍼지규칙의 작성 및 적용 그리고 퍼지 추론의 기본이 되는 상황들을 포함한다. 퍼지추론(fuzzy inference)은 언어적 형태로 기술된 퍼지규칙을 퍼지시스템에 적용하기 위한 논리적인 실행 과정을 갖는다. 추론이란, 어떤 주어진 사실이나 관계로부터 새로운 사실이나 관계를 유출해 내는 과정을 의미하며 전문가의 지식에 의한 규칙과 추론의 합성 과정에 의해 출력값을 구하는 것이다. 추론 방법에는 최대-최소(max-min)추론과 max-product 추론 등이 있다.The rule base includes situations that are the basis of the creation and application of fuzzy rules and fuzzy inference from the experience and knowledge of experts. Fuzzy inference has a logical execution process to apply fuzzy rules described in linguistic form to a fuzzy system. Reasoning refers to the process of leaking out a new fact or relationship from a given fact or relationship, and obtaining an output value through the synthesis of rules and inferences based on expert knowledge. Inference methods include max-min inference and max-product inference.

비퍼지화(defuzzification)는 퍼지시스템에서 생성된 퍼지값을 실제로 사용하기 위해 수치 값으로 변환하는 기능을 수행한다. 비퍼지화 방법에는 무게중심법(center of gravity method)과 최대평균법(mean of maximum method)등이 사용될 수 있다. Defuzzification performs the function of converting the fuzzy values generated in the fuzzy system into numerical values for actual use. For the defuzzification method, a center of gravity method and a mean of maximum method can be used.

퍼지모델은 크게 두 가지로 분류할 수 있는데, 이는 정보의 표현 형태에 의해 구분된다. 첫 번째는 맘다니 퍼지 모델로서, 모호한 술어로 묘사되는 IF-THEN 언어 규칙들에 기초하여 퍼지추론을 수행한다. 이러한 형태의 퍼지모델은 시스템 동작을 묘사하는 언어 모델(linguistic models)에 기초를 두고서 자연어를 그대로 사용한다. 즉, 규칙의 전건부와 후건부 모두가 퍼지집합으로 표현되어 융통성(flexibility)을 갖게 된다. 한 예로서 퍼지제어기를 생각할 수 있는데, 퍼지제어 규칙은 일반적인 상식에 기초해 제어 전략을 나타내는 언어로서 표현된다.Fuzzy models can be classified into two broad categories, which are classified by the form of information representation. The first is the mamdani fuzzy model, which performs fuzzy inference based on IF-THEN language rules described by ambiguous predicates. This type of fuzzy model uses natural language as it is based on linguistic models that describe system behavior. In other words, both the front and back parts of the rule are expressed in a fuzzy set to have flexibility. As an example, a fuzzy controller can be considered, and the fuzzy control rules are expressed as a language representing a control strategy based on common sense.

두 번째는 T-S-K 퍼지모델로서, 퍼지모델과 비퍼지모델의 혼합 형태를 가진다. 규칙 형태는 퍼지집합으로 구성되는 전건부와 함수 형태로 주어지는 후건부를 갖으며, 수치 정보를 표현할 수 있는 능력을 갖는다. 따라서 정보가 데이터 형태로 주어지면 학습 알고리즘을 통해 맘다니 퍼지모델 보다는 쉽게 시스템 식별(system identification)을 할 수 있다. T-S-K퍼지모델은 맘다니 퍼지모델과는 달리 후건부 소속 함수를 입출력 선형 관계식이나 상수 값으로 표현하는 혼합 퍼지추론 방법으로서 비퍼지화를 추론에 포함시킨 형태이다. 이러한 추론 방법은 입력 공간을 퍼지 분할하여 각각의 퍼지 부공간에 대한 선형 입출력 관계를 할당함으로써 기존의 선형 근사화 방식보다 적은 수의 규칙으로도 비선형 특성을 갖는 입출력 관계를 정확하게 표현할 수 있다는 장점을 가진다.The second is the T-S-K fuzzy model, which has a mixture of fuzzy and non-fuzzy models. The rule form has a pre-determined part composed of a fuzzy set and a post-determined part provided in a function form, and has the ability to express numerical information. Therefore, if information is given in the form of data, system identification can be performed more easily than a fuzzy model using a learning algorithm. Unlike the Mamdani fuzzy model, the T-S-K fuzzy model is a mixed fuzzy reasoning method that expresses the posterior part membership function as an input/output linear relational expression or a constant value. This inference method has the advantage of being able to accurately express input/output relationships with nonlinear characteristics even with fewer rules than the conventional linear approximation method by fuzzy partitioning the input space and allocating linear input/output relationships for each fuzzy subspace.

하기 [수학식4]는 n개의 입력 변수

과 1개의 출력 변수 y_i를 갖는 T-S-K 퍼지모델의 퍼지규칙을 보여주고 있다. [Equation 4] below shows n input variables

It shows the fuzzy rule of the TSK fuzzy model with and one output variable y _i .

[수학식4][Equation 4]

여기에서

는 퍼지 입력 멤버십함수이며, 가우시안 형태일 경우 하기 [수학식5] 로 표현될 수 있다. y_i는 i번째 규칙의 출력을 나타내며,

는 입력 변수들과 상수들의 선형 함수로서 하기 [수학식6]으로 표현될 수 있다.From here

Is a fuzzy input membership function, and in the case of a Gaussian form, it can be expressed by the following [Equation 5]. y _i represents the output of the ith rule,

Is a linear function of input variables and constants, and can be expressed as Equation 6 below.

[수학식5][Equation 5]

[수학식6][Equation 6]

이때 c_ji는 멤버십함수의 중심을, w_ji는 멤버십함수의 폭을 나타내는 변수이며

는 데이터가 주어지면 결정되어야 하는 상수이다. Here, c _ji is the center of the membership function, and w _ji is the variable representing the width of the membership function.

Is a constant that must be determined given data.

M개의 규칙을 갖고 max-product 추론 방법을 사용하였을 때,

번째 전건부 규칙의 정규화한 적합도 μ_i는 식 하기 [수학식7]과 같이 표현되며 퍼지 시스템의 최종 추론값 y^*는 하기 [수학식8]과 같이 표현될 수 있다. When using the max-product inference method with M rules,

The normalized goodness-of-fit μ _i of the first pre-determined rule is expressed as Equation 7 below, and the final inference y ^* of the fuzzy system can be expressed as Equation 8 below.

[수학식7][Equation 7]

[수학식8][Equation 8]

T-S-K 퍼지모델은 후건부 변수가 입력 변수와의 함수로 표현되어졌다는 점에서 맘다니 퍼지모델에 비해 두 가지 측면에서 이점을 갖는다. 첫째, 동작자의 지식이나 경험으로부터 규칙 추출은 주어진 입출력 데이터로부터 선형식의 계수를 추정하는 작업으로 귀결되어 쉽게 퍼지규칙을 생성시킬 수 있게 된다. 둘째, 후건부에 선형식을 사용함으로써 적은 제어규칙으로도 많은 정보를 포함시킬 수 있다는 점이다.The T-S-K fuzzy model has two advantages over the Momdani fuzzy model in that the posterior part variable is expressed as a function of the input variable. First, rule extraction from the operator's knowledge or experience results in a task of estimating the coefficient of the linear equation from the given input/output data, so that fuzzy rules can be easily generated. Second, it is possible to include a lot of information with a small number of control rules by using a linear equation in the posterior part.

한편, 맘다니 퍼지모델로 모델링할 경우, 모델링 과정은 클러스터 중심을 입출력 공간으로 분할하는 과정과 퍼지규칙으로 매핑 하는 과정으로 나누어질 수 있다.On the other hand, when modeling with the mamdani fuzzy model, the modeling process can be divided into a process of dividing the cluster center into an input/output space and a process of mapping with a fuzzy rule.

먼저, 클러스터 중심을 입출력 공간으로 분할하는 과정에 대해 살펴보면, (n+1)차원인 M개의 클러스터 중심

은 입출력 공간으로 분할될 수 있다. 즉, i번째 클러스터 중심 X_i ^* 는 입력 공간이 n차원인 x_i ^*로, 출력 공간이 1차원인 z_i ^*로 분할될 수 있다. First, looking at the process of dividing the cluster center into input/output spaces, the (n+1) dimensional M cluster centers

Can be divided into input/output space. That is, the i-th cluster center X _i ^* may be divided into x _i ^* having an n-dimensional input space and z _i ^* having a 1-dimensional output space.

다음으로, 분할된 입출력 공간을 퍼지규칙으로 매핑한다. 즉, 분할된 입력 공간 x_i ^*는 퍼지규칙의 전건부 멤버십함수의 위치를 결정하며, 분할된 출력 공간 z_i ^*는 출력 멤버십함수의 위치를 결정한다. 연구에서는 [수학식5]와 같은 가우시안 형태의 멤버십함수를 사용하였으며, 폭 w_ji는 경험에 기초하여 하기 [수학식9]를 통해 결정하였다. Next, the divided input/output space is mapped with a fuzzy rule. That is, the divided input space x _i ^* determines the position of the pre-construction unit membership function of the fuzzy rule, and the divided output space z _i ^* determines the position of the output membership function. In the study, a Gaussian-type membership function as shown in [Equation 5] was used, and the width w _ji was determined through [Equation 9] below based on experience.

[수학식9] [Equation 9]

여기에서 M은 클러스터 중심의 수를, r_a는 [수학식1]에서 사용된 값과 동일한 값을, 그리고

는 클러스터 중심의 입력 공간에서 벡터간의 최대거리를 나타낸다.Where M is the number of cluster centers, r _a is the same value as used in [Equation 1], and

Represents the maximum distance between vectors in the cluster-centered input space.

입력 변수가 n차원인 x이고, 출력 변수가 1차원인 z를 갖는 맘다니 퍼지모델에서 퍼지규칙이 [수학식4]와 같은 형태인 하기 [수학식10]으로 주어지고, max-product 추론 방법을 사용하였을 때 i번째 퍼지규칙의 적합도 μ_i는 하기 [수학식11]을 통해서 구할 수 있다. In the Mamdani fuzzy model in which the input variable is n-dimensional x and the output variable is 1-dimensional z, the fuzzy rule is given by [Equation 10] in the form of [Equation 4], and the max-product inference method When is used, the fitness μ _i of the i-th fuzzy rule can be obtained through the following [Equation 11].

[수학식10][Equation 10]

[수학식11][Equation 11]

여기에서 α는 [수학식1]에서 사용된 값과 동일한 값이며 입력 멤버십함수는 [수학식5]와 같은 가우시안 형태이다. Here, α is the same value as the value used in [Equation 1], and the input membership function is in the Gaussian form as in [Equation 5].

최종 출력 y^*는 규칙의 후건부 멤버십함수가 싱글톤 z_i ^* 로 주어지고 단순화된 추론 방법을 사용하면 하기 [수학식12]로 나타낼 수 있다. The final output y ^* can be expressed by the following [Equation 12] when the posterior member membership function of the rule is given as a singleton z _i ^* and a simplified reasoning method is used.

[수학식12][Equation 12]

여기서, 분할된 출력 공간 z_i ^*는 하기 [수학식13]과 같이 상수와 입력 변수들과의 선형 함수가 된다. Here, the divided output space z _i ^* becomes a linear function of constants and input variables as shown in [Equation 13] below.

[수학식13][Equation 13]

파라미터 G_i와 h_i를 결정하기 위해 선형 최소자승법(linear least-squares estimation)을 적용하면, i번째 퍼지규칙의 정규화된 적합도 ρ_i은 식 하기 [수학식14]로, 출력 y는 하기 [수학식15]로 표현된다. When linear least-squares estimation is applied to determine the parameters G _i and h _i , the normalized goodness of fit ρ _i of the i-th fuzzy rule is expressed as Equation 14, and the output y is It is expressed as Equation 15].

[수학식14][Equation 14]

[수학식15][Equation 15]

파라미터 G_i와 h_i의 결정은 N개의 입력 데이터

및 출력 데이터

가 주어지면 하기 [수학식16]을 통해 계산할 수 있다. Determination of parameters G _i and h _i is N input data

And output data

Given is, it can be calculated through the following [Equation 16].

[수학식16][Equation 16]

여기에서 i=1,2,…,M이고 j=1,2,…,N일 때, ρ_ji는 j번째 데이터에서 i번째 규칙의 정규화된 적합도를 나타낸다. Where i=1,2,... ,M and j=1,2,... When ,N, ρ _ji represents the normalized fitness of the i-th rule in the j-th data.

[수학식16]은 모델의 출력과 주어진 데이터의 출력간의 제곱 오차를 최소화시키도록 파라미터를 결정하는 과정으로서, 하기 [수학식17]의 형태로 간단히 나타낼 수 있으며 재귀형 최소자승법(recursive least-squares estimation)이나 의사 역행렬(pseudo inverse)을 이용하여

와

를 구할 수 있다. 단, 입출력 데이터의 수는 결정하고자 하는 파라미터보다 많아야 한다. [Equation 16] is a process of determining a parameter to minimize the squared error between the output of the model and the output of the given data, and can be simply expressed in the form of [Equation 17] and recursive least-squares estimation) or pseudo inverse

Wow

Can be obtained. However, the number of input/output data must be greater than the parameter to be determined.

[수학식17][Equation 17]

G_i와 h_i를 결정하기 위하여

를 최소화하는 방법인 의사 역행렬 방법의 하기 [수학식18]을 사용하여 하기 [수학식19]로 해를 구하였다. To determine G _i and h _i

The solution was obtained by the following [Equation 19] using the following [Equation 18] of the pseudo-inverse matrix method, which is a method of minimizing.

[수학식18][Equation 18]

그러나,

은 많은 계산 시간을 필요로 하며,

의 역행렬이 존재하지 않을 수 있기 때문에 재귀형 최소자승법을 사용하여 해를 구하기도 한다. But,

Takes a lot of computation time,

Since the inverse matrix of may not exist, a recursive least-squares method is used to find the solution.

맘다니 퍼지규칙의 후건부 파라미터 G_i와 h_i가 결정되면, 최종 출력 값 y^*는 하기[수학식20]에 의해 구할 수 있다. When the posterior construction parameters G _i and h _i of the mamdani fuzzy rule are determined, the final output value y ^* can be obtained by the following [Equation 20].

[수학식20][Equation 20]

[수학식20]의 결과는 T-S-K퍼지모델에서의 최종 출력과 일치함을 알 수 있다. 이 연구에서 사용하고 있는 T-S-K 퍼지시스템 모델은 후건부가 전건부 입력변수들의 함수 형태로 표현된다. T-S-K 퍼지모델은 후건부 소속 함수를 입출력 선형 관계식이나 상수 값으로 표현하는 혼합 퍼지 추론 방법으로서 비퍼지화를 추론에 포함시킨 형태이다. 이러한 추론방법은 입력 공간을 퍼지 분할하여 각각의 퍼지 부공간에 대한 선형 입출력 관계를 할당함으로써 기존의 선형근사화 방식보다 적은 수의 규칙으로도 비선형 특성을 갖는 입출력 관계를 나타낼 수 있다는 장점이 있다.It can be seen that the result of [Equation 20] is consistent with the final output in the T-S-K fuzzy model. In the T-S-K fuzzy system model used in this study, the posterior construction part is expressed in the form of a function of the previous construction part input variables. The T-S-K fuzzy model is a mixed fuzzy inference method that expresses the posterior part membership function as an input/output linear relational expression or a constant value. This inference method has the advantage of being able to express input/output relationships having nonlinear characteristics with fewer rules than the conventional linear approximation method by fuzzy partitioning the input space and allocating linear input/output relationships to each fuzzy subspace.

정리하면, 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 방법은 영업정보를 입출력 데이터로 설정하고, 섭트랙티브 클러스터링 알고리즘을 통하여 클러스터링된 클러스터 중심을 기초로 하여, 맘다니 퍼지모델 또는 T-S-K 퍼지모델을 통하여 모델링된 퍼지 시스템을 통하여, 영업장의 예상 매출을 산출할 수 있는 것이다.In summary, the method of providing a commercial area analysis service according to an embodiment of the present invention sets sales information as input/output data, and based on the cluster center clustered through a subtractive clustering algorithm, the Momdani fuzzy model or the TSK fuzzy model Through the fuzzy system modeled through, it is possible to calculate the expected sales of the business site.

도 5는 본 발명의 일 실시예에 따른 예상 매출을 산출하는 프로세스의 일 예를 보여주는 흐름도이다.5 is a flowchart illustrating an example of a process of calculating expected sales according to an embodiment of the present invention.

도 5를 참조하면 일 예로, 영업자는 영업자 단말의 프로그램에서 영업장의 위치, 예측 월, 업종, 광고 비용의 영업정보를 순차로 입력할 수 있다. 상권 분석 서비스 제공 서버는 상기 영업정보를 수신하여, 설계된 퍼지 시스템에 입력하고, 분석 결과로서 미래 예상 매출을 출력할 수 있다. 출력된 예상 매출 정보는 영업자 단말로 전송되어 영업자 단말의 프로그램에서 제공될 수 있다.Referring to FIG. 5, for example, a business operator may sequentially input business information of a location of a business place, a predicted month, a business type, and an advertisement cost in a program of a business operator's terminal. The commercial area analysis service providing server may receive the sales information, input it into a designed fuzzy system, and output expected future sales as an analysis result. The output predicted sales information may be transmitted to the business operator's terminal and provided in a program of the business operator's terminal.

도 6은 본 발명의 일 실시예에 따른 영업자 단말 프로그램의 예상 매출 통계 그래프 화면을 보여주는 도면이다.6 is a diagram showing a graph of predicted sales statistics of a business operator terminal program according to an embodiment of the present invention.

도 6을 참조하면, 마지막 단계로서, 퍼지 시스템으로 출력된 예상 매출은 영업자 단말의 프로그램에서 시계열적인 통계 그래프 데이터로 제공될 수 있는 것이다(S220).Referring to FIG. 6, as a final step, the estimated sales output through the fuzzy system may be provided as time-series statistical graph data in a program of the business operator's terminal (S220).

도 7은 본 발명의 일 실시예에 따른 퍼지 시스템의 퍼지 규칙 개수에 따른 예측 오차를 보여주는 도면이다.7 is a diagram illustrating a prediction error according to the number of fuzzy rules in a fuzzy system according to an embodiment of the present invention.

도 7을 참조하면 퍼지 규칙의 개수에 따라 예측 오차율이 감소하는 것을 볼 수 있다. 예를 들어, 입력 변수가 엽업장의 업종, 지역, 광고비, 매출인 경우에 퍼지 규칙이 5개가 생성되는 경우에는 예측 오차(e^k _min)가 다소 존재하나, 퍼지 규칙이 10개 생성되는 경우에는 예측 오차(e^k _min)가 상대적으로 작은 것을 알 수 있다. Referring to FIG. 7, it can be seen that the prediction error rate decreases according to the number of fuzzy rules. For example, if the input variables are the industry, region, advertising cost, and sales of a foliar farm, if 5 fuzzy rules are generated, there is some prediction error (e ^k _min) , but when 10 fuzzy rules are generated It can be seen that the prediction error (e ^k _min) is relatively small.

이상으로 설명한 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 서버 및 그 방법은 상권을 분석하여 영업자에게 예상 매출 정보를 제공하며 매출에 영향을 미치는 다양한 요인을 분석한 데이터를 제공하여 영업자의 영업을 효율적으로 개선시킬 수 있다.The server and method for providing a commercial area analysis service according to an embodiment of the present invention described above provide predicted sales information to the business owner by analyzing the business area, and provide data obtained by analyzing various factors affecting sales to manage the business of the business area. It can be improved efficiently.

한편, 본 발명의 일 실시예에 따른 상권 분석 서비스 제공 방법은 컴퓨터에 의해 실행되는 프로그램 모듈과 같은 컴퓨터에 의해 실행가능한 명령어를 포함하는 기록 매체의 형태로도 구현될 수 있다. 컴퓨터 판독 가능 매체는 컴퓨터에 의해 액세스될 수 있는 임의의 가용 매체일 수 있고, 휘발성 및 비휘발성 매체, 분리형 및 비분리형 매체를 모두 포함한다. 또한, 컴퓨터 판독가능 매체는 컴퓨터 저장 매체를 포함할 수 있다. 컴퓨터 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보의 저장을 위한 임의의 방법 또는 기술로 구현된 휘발성 및 비휘발성, 분리형 및 비분리형 매체를 모두 포함한다. 본 발명의 방법 및 시스템은 특정 실시예와 관련하여 설명되었지만, 그것들의 구성 요소 또는 동작의 일부 또는 전부는 범용 하드웨어 아키텍쳐를 갖는 컴퓨터 시스템을 사용하여 구현될 수 있다.Meanwhile, the method for providing a commercial area analysis service according to an embodiment of the present invention may be implemented in the form of a recording medium including instructions executable by a computer such as a program module executed by a computer. Computer-readable media can be any available media that can be accessed by a computer, and includes both volatile and nonvolatile media, removable and non-removable media. Further, the computer-readable medium may include a computer storage medium. Computer storage media includes both volatile and nonvolatile, removable and non-removable media implemented in any method or technology for storage of information such as computer readable instructions, data structures, program modules or other data. Although the methods and systems of the present invention have been described in connection with specific embodiments, some or all of their components or operations may be implemented using a computer system having a general-purpose hardware architecture.

이상의 설명은 본 발명의 기술적 사상을 예시적으로 설명한 것에 불과한 것으로서, 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자라면 본 발명의 본질적인 특성에서 벗어나지 않는 범위에서 다양한 수정 및 변형이 가능할 것이다. 따라서, 본 발명에 개시된 실시예는 본 발명의 기술 사상을 한정하기 위한 것이 아니라 설명하기 위한 것이고, 이러한 실시예에 의하여 본 발명의 기술 사상의 범위가 한정되는 것은 아니다. 본 발명의 보호 범위는 아래의 청구범위에 의하여 해석되어야 하며, 그와 동등한 범위 내에 있는 모든 기술적 사상은 본 발명의 권리범위에 포함되는 것으로 해석되어야 할 것이다.The above description is merely illustrative of the technical idea of the present invention, and those of ordinary skill in the art to which the present invention pertains will be able to make various modifications and variations without departing from the essential characteristics of the present invention. Accordingly, the embodiments disclosed in the present invention are not intended to limit the technical idea of the present invention, but to explain the technical idea, and the scope of the technical idea of the present invention is not limited by these embodiments. The scope of protection of the present invention should be interpreted by the claims below, and all technical ideas within the scope equivalent thereto should be interpreted as being included in the scope of the present invention.

100: 상권 분석 서비스 제공 서버
110: 통신 모듈
120: 메모리
130: 프로세서
140: 데이터베이스100: commercial area analysis service providing server
110: communication module
120: memory
130: processor
140: database

Claims

In the commercial district analysis service providing method performed by the commercial district analysis service providing server,
Receiving business information including business type, location, date and sales from the business operator terminal; And
Comprising the step of calculating the expected sales of the business site using a fuzzy system based on the business information,
The fuzzy system is modeled based on a cluster center in which past sales information including pre-stored business type, location, date, and sales are clustered through a subtractive clustering algorithm,
The subtractive clustering algorithm normalizes the past sales information to calculate a distance between data, and calculates a cluster center based on a size of a distance potential value between the normalized data.

The method of claim 1,
The subtractive clustering algorithm,
A method for providing a commercial area analysis service for calculating a distance between normalized data based on the following [Equation 1] and calculating a k (1≦k≦N)-th cluster center having the largest distance potential value between the data.
[Equation 1]

P _i = distance potential value between data
N = number of input data
X _i , X _j = data value
α =

, (r _a = positive constant)

The method of claim 2,
Comprising the step of calculating the k+1-th cluster center (X _k+1 ) from which the calculated k-th cluster center (X _k ^* ) is removed based on the following [Equation 3] .
[Equation 3]

P _i ^' = potential value with the effect of the k-th cluster center removed
P _i = distance potential value between data
P _k ^* = potential value of the k-th cluster center
X _i = data
β =

, (r _b = positive constant)

The method of claim 3,
For the calculated cluster centers, the upper limit criteria for cluster center selection (

, 1>

) And lower limit standard (

,

>

> 0),

this

If it is larger, it is selected as the cluster center,

this

If it is smaller than that, it is not selected as the cluster center, and if it is in between, the method of providing commercial district analysis service is conditionally selected as the cluster center.

The method of claim 2,
r _a = 0.3 people, how to provide commercial analysis service.

The method of claim 3,
r _b = 1.5r _a , how to provide commercial analysis services.

The method of claim 1,
The fuzzy system is a method for providing a commercial area analysis service based on a Momdani fuzzy algorithm.

The method of claim 1,
The fuzzy system is a method for providing a commercial area analysis service based on a TSK (Takagi-Sugeno-Kang) fuzzy algorithm.

In the server that provides a commercial area analysis service,
A memory in which a program providing a commercial area analysis service is stored; And
Comprising a processor for executing a program stored in the memory,
According to the execution of the program, the processor,
Receives sales information including business type, location, date and sales from the business operator terminal, and calculates the expected sales of the business site using a fuzzy system based on the business information,
The fuzzy system is modeled based on a cluster center in which past sales information including pre-stored business type, location, date, and sales are clustered through a subtractive clustering algorithm,
The subtractive clustering algorithm normalizes the past sales information to calculate a distance between data, and calculates a cluster center based on a size of a distance potential value between the normalized data.

The method of claim 9,
The subtractive clustering algorithm,
A commercial area analysis service providing server for calculating a distance between normalized data based on the following [Equation 1] and calculating a k (1≦k≦N)-th cluster center having the largest distance potential value between the data.
[Equation 1]

P _i = distance potential value between data
N = number of input data
X _i , X _j = data
α =

, (r _a = positive constant)

The method of claim 10,
Comprising the step of obtaining the k+1-th cluster center (X _k+1 ) from which the calculated k-th cluster center (X _k ^* ) is removed based on the following [Equation 3].
[Equation 3]

Pi' = potential value with the effect of the k-th cluster center removed
P _i = distance potential value between data
P _k ^* = potential value of the k-th cluster center
X _i = data
β =

, (rb=positive constant)

The method of claim 11,
The method of claim 3,
For the calculated cluster centers, the upper limit criteria for cluster center selection (

, 1>

) And lower limit standard (

,

>

> 0),

this

If it is larger, it is selected as the cluster center,

this

If it is smaller than, it is not selected as the cluster center, and if it is in between, it is conditionally selected as the cluster center.

The method of claim 10,
ra = 0.3 person, server providing commercial analysis service.

The method of claim 11,
rb = 1.5ra, commercial area analysis service providing server.

The method of claim 9,
The fuzzy system is based on a Momdani fuzzy algorithm, a commercial area analysis service providing server.

The method of claim 9,
The fuzzy system is based on a TSK (Takagi-Sugeno-Kang) fuzzy algorithm, a commercial area analysis service providing server.

A computer-readable recording medium recording a program for performing the method according to any one of claims 1 to 8 on a computer.