KR20160040341A

KR20160040341A - Method of analyzing a stream of a business area and system for performing the same

Info

Publication number: KR20160040341A
Application number: KR1020140132998A
Authority: KR
Inventors: 신상림
Original assignee: 키위플 주식회사
Priority date: 2014-10-02
Filing date: 2014-10-02
Publication date: 2016-04-14
Also published as: KR101715737B1

Abstract

Disclosed is a method of analyzing a stream of a business area and a system for performing the same. The system of analyzing a stream of a business area comprises: a web document collection unit which collects a plurality of web documents; a morpheme analysis unit which analyzes a morpheme in the collected web documents; a morpheme classification-storing unit which classifies and stores the analyzed morphemes based on a region; and a determining unit for stream of the business area which determines the region of the business area based on the number of the morpheme stored based on the region. Thus, the stream of the business area is figured out by extracting information from a real time web document such as a social commerce, a concert, an SNS, a blog, a news, thereby the business area for the region may be analyzed based on the stream of time for region information and a region characteristic.

Description

[0001] METHOD OF ANALYZING A STREAM OF A BUSINESS AREA AND SYSTEM FOR PERFORMING THE SAME [0002]

본 발명은 지역 상권 흐름 분석 방법 및 이를 수행하기 위한 시스템에 관한 것으로, 보다 상세하게는 웹 분석을 통한 지역 상권 흐름 분석 방법 및 이를 수행하기 위한 시스템에 관한 것이다.Field of the Invention The present invention relates to a method and an apparatus for analyzing a regional commercial flow, and more particularly, to a method for analyzing a local commercial flow through a web analysis and a system for performing the same.

최근 인터넷 사용이 점차 활발해짐에 따라, 많은 사람들이 특정 웹 사이트의 게시판, 블로그(blog), 커뮤니티(community)인 카페, 페이스북이나 미니홈피와 같은 소셜 네트워크서비스(SNS, Social Network Service) 등의 온라인 매체를 통해서 자신의 의견을 표현하고 있다. 따라서 사용자들은 종종 특정한 정보의 가치를 평가할 때 다른 사용자들이 인터넷 상에 올려놓은 의견 정보를 참조하여 결정한다. Recently, as the use of the Internet becomes more and more active, many people use social networks such as bulletin boards, blogs, community cafes, social network services such as Facebook and mini homepage We express our opinion through online media. Therefore, users often make decisions by referring to the information that other users post on the Internet when evaluating the value of certain information.

예를들어, 인터넷 상의 웹 포털, 업종별 전문 사이트, 개인 블로그 등에는 상품 품평에서 영화 품평까지 다양한 사용자들의 의견이 존재한다. 이러한 사용자들의 의견들은 일반 사용자들이 제품을 구매하거나 영화를 보기 전에 참조하기 위한 자료로 이용할 수 있으며, 마케팅 담당자나 주식 매매자 등이 각 제품 또는 기업에 대한 사용자들의 의견을 알고자 하는 경우에도 사용할 수 있다. For example, there are opinions from various users, from product reviews to movie reviews, on web portals on the Internet, specialized websites for industry, and personal blogs. These users' opinions can be used by general users as references for purchasing products or viewing movies, and even when a marketing person or a stock trader wants to know the opinions of users about each product or company .

한국공개특허 제10-2013-0030434호(명칭: 온라인 상에 게재된 웹 문서 기반 상권분석 서비스 시스템 및 방법)(2013. 03. 27. 자 공개)Korean Patent Laid-Open No. 10-2013-0030434 (Name: Web-based commercial document analysis service system and method published online) (Published on March 23, 2013) 한국공개특허 제10-2012-0035600호(명칭: 상권정보 제공 시스템 및 방법, 그 상권정보 제공장치, 서비스장치 및 상권정보 제공단말)(2012. 04. 16. 자 공개)Korean Patent Laid-Open Publication No. 10-2012-0035600 (name: system and method for providing information on merchandise information, merchandise information providing apparatus, service apparatus and information providing terminal for merchandise information) (disclosed on Apr. 16, 2012) 한국공개특허 제10-2013-0019629호(명칭: 부동산 물건의 유형별 입지 및 상권분석 서비스 제공 시스템, 방법 및 컴퓨터 판독 가능한 기록 매체)(2013. 02. 27. 자 공개)Korean Patent Laid-Open No. 10-2013-0019629 entitled " System and method for locating and analyzing commercial goods by types of real estate goods, and computer-readable recording medium " 한국등록특허 제10-1078344호(명칭: 상권내에서의 업종별 매출 추정 방법)(2011. 10. 25. 자 등록)Korean Registered Patent No. 10-1078344 (Name: How to estimate the sales by industry in the trade) (Registered on October 25, 2011)

이에 본 발명의 기술적 과제는 이러한 점에 착안한 것으로, 본 발명의 목적은 소셜 커머스, 공연, SNS, 블로그, 뉴스와 같은 실시간 웹 문서에서 정보를 추출하는 웹 분석을 통해 시간의 흐름에 따른 지역 상권 흐름을 파악하기 위한 지역 상권 흐름 분석 방법을 제공하는 것이다.SUMMARY OF THE INVENTION Accordingly, the present invention has been made keeping in mind the above problems occurring in the prior art, and it is an object of the present invention to provide a method and apparatus for analyzing a local commercial web site through a web analysis, which extracts information from a real-time web document such as a social commerce, And to provide a method for analyzing local commercial flow to understand the flow.

본 발명의 다른 목적은 상기한 지역 상권 흐름 분석 방법을 수행하기 위한 시스템을 제공하는 것이다.It is another object of the present invention to provide a system for performing the method of analyzing local commercial flow.

상기한 본 발명의 목적을 실현하기 위하여 일실시예에 따른 지역 상권 흐름 분석 방법은, 복수의 웹 문서들을 수집하는 단계; 수집된 웹 문서들에서 형태소 분석을 수행하는 단계; 분석된 형태소들을 지역별로 분류하여 저장하는 단계; 및 상기 지역 기준으로 저장된 상기 형태소의 수를 근거로 상권 지역을 판별하는 단계를 포함한다.In order to accomplish the object of the present invention, a method for analyzing a local merchandise flow according to an exemplary embodiment includes: collecting a plurality of web documents; Performing morpheme analysis on the collected web documents; Classifying and analyzing the analyzed morphemes by region; And determining a commercial area based on the number of morphemes stored on the area basis.

일실시예에서, 상기 형태소는, 관심지점(POI) DB에 등록된 정보에 대응하는 관심지점 형태소 및 상기 관심지점 DB에 등록되지 않은 위치 정보에 대응하는 위치 형태소를 포함할 수 있다. In one embodiment, the morpheme may include a point morpheme corresponding to the information registered in the point of interest (POI) DB and a position morpheme corresponding to the position information not registered in the point of interest DB.

일실시예에서, 상기 분석된 형태소들을 지역별로 분류하여 저장하는 단계는, 분석된 형태소들에서 상기 관심지점 형태소 또는 상기 위치 형태소를 추출하는 단계; 및 지역 기준으로 상기 관심지점 형태소 및 상기 위치 형태소 각각을 분류하여 저장하는 단계를 포함할 수 있다. In one embodiment, the step of classifying and storing the analyzed morphemes by region includes extracting the morpheme of interest or the morpheme in the analyzed morphemes; And classifying and storing each of the interest point morpheme and the position morpheme on an area basis.

일실시예에서, 상기 상권 지역을 판별하는 단계는, 상기 지역 기준으로 저장된 상기 관심지점 형태소의 수와 상기 위치 형태소의 수를 근거로 상기 상권 지역을 판별할 수 있다. In one embodiment, the step of discriminating the commercial area may discriminate the commercial area based on the number of the interest point morpres and the number of the position morpreses stored on the area basis.

일실시예에서, 상기 상권 지역을 판별하는 단계는 지역에 대응하여 상권을 분석하는 지역별 상권분석 단계, 지역 및 기간에 대응하여 상권을 분석하는 기간별 상권분석 단계, 지역에 대응하여 기입된 웹 문서 종류별로 상권을 분석하는 웹 문서 종류별 상권분석 단계 및 관심지점 카테고리별로 상권을 분석하는 관심지점 카테고리별 상권분석 단계 중 하나 이상을 포함할 수 있다. In one embodiment, the step of discriminating the commercial area includes analyzing a commercial area corresponding to the area, analyzing the commercial area according to the area, analyzing the commercial area corresponding to the area and period, analyzing the commercial area according to the period, A commercial document analysis step for analyzing a commercial document, and a commercial document analysis step for analyzing a commercial article according to an interest point category.

일실시예에서, 상기 웹 문서는 소셜커머스, 공연, SNS, 블로그, 뉴스를 포함할 수 있다. In one embodiment, the web document may include a social commerce, a performance, an SNS, a blog, and a news.

상기한 본 발명의 다른 목적을 실현하기 위하여 일실시예에 따른 지역 상권 흐름 분석 시스템은 복수의 웹 문서들을 수집하는 웹 문서 수집부; 수집된 웹 문서들에서 형태소를 분석하는 형태소 분석부; 분석된 형태소들을 지역별로 분류하여 저장하는 형태소 분류-저장부; 및 상기 지역 기준으로 저장된 상기 형태소의 수를 근거로 상권 지역을 판별하는 상권 흐름 판별부를 포함한다. According to another aspect of the present invention, there is provided a regional merchant flow analysis system comprising: a web document collection unit for collecting a plurality of web documents; A morphological analysis unit for analyzing morphemes in the collected web documents; A morpheme classifying and storing unit for classifying and analyzing the analyzed morphemes by region; And a merchandise flow discrimination unit for discriminating a merchandise area based on the number of morphemes stored on the area basis.

일실시예에서, 상기 형태소 분류-저장부는, 분석된 형태소들에서 상기 관심지점 형태소 및 상기 위치 정보에 대응하는 위치 형태소를 추출하는 형태소 추출부; 및 지역 기준으로 상기 관심지점 형태소 및 상기 위치 형태소 각각을 분류하는 형태소 분류부를 포함할 수 있다. In one embodiment, the morpheme classification-storing unit includes: a morpheme extracting unit for extracting the morpheme of interest and the morpheme corresponding to the position information in the analyzed morphemes; And a morpheme classifier for classifying each of the interest point morpheme and the position morpheme on an area basis.

일실시예에서, 지역 상권 흐름 분석 시스템은 상기 형태소 분류부에 의해 분류된 관심지점 형태소를 저장하는 관심지점 형태소 저장 DB; 및 상기 형태소 분류부에 의해 분류된 위치 형태소를 저장하는 위치 형태소 저장 DB를 더 포함할 수 있다. In one embodiment, the local merchant flow analysis system includes an interest point morpheme store DB that stores the interest point morpheme classified by the morpheme classifier; And a location morpheme storage DB for storing location morphemes classified by the morpheme classification unit.

일실시예에서, 상기 상권 흐름 판별부는, 상기 지역 기준으로 상기 관심지점 형태소 저장 DB에 저장된 관심지점 형태소의 수와 상기 위치 형태소 저장 DB에 저장된 위치 형태소의 수를 시간의 흐름에 따른 분석을 통해 상권 지역을 판별할 수 있다. In one embodiment, the merchandise flow determination unit determines the number of points of interest morpresent stored in the interest point morpheme storage DB and the number of position morpheme stores stored in the location morpheme store DB by the regional reference, The area can be identified.

일실시예에서, 상기 상권 흐름 판별부는, 지역에 대응하여 상권을 분석하는 지역별 상권분석 모듈, 지역 및 기간에 대응하여 상권을 분석하는 기간별 상권분석 모듈, 지역에 대응하여 기입된 웹 문서 종류별로 상권을 분석하는 웹 문서 종류별 상권분석 모듈 및 관심지점 카테고리별로 상권을 분석하는 관심지점 카테고리별 상권분석 모듈 중 하나 이상을 포함할 수 있다. In one embodiment, the merchandise flow determination unit may include: a regional commercial analysis module for analyzing a commercial area corresponding to a region; a commercial analysis module for each period to analyze a commercial area corresponding to a region and a period; A commercial document analysis module for each type of web document analyzing the commercial document category, and a commercial document analysis module for each interest category for analyzing commercial articles by the interest point category.

이러한 지역 상권 흐름 분석 방법 및 이를 수행하기 위한 시스템에 의하면, 소셜 커머스, 공연, SNS, 블로그, 뉴스와 같은 실시간 웹 문서에서 정보를 추출하여 저장하고 분석하여 시간의 흐름에 따른 지역 상권 흐름을 파악하므로써, 지역 정보나 지역 특성을 시간의 흐름으로 지역에 대한 상권을 분석할 수 있다. .According to the method of analyzing local commercial flow and the system for performing the same, information is extracted from real-time web documents such as social commerce, performance, SNS, blog, and news, and stored and analyzed, , Regional information or local characteristics can be analyzed over time to analyze commercial areas. .

도 1은 본 발명의 일실시예에 따른 지역 상권 흐름 분석 시스템을 설명하기 위한 블럭도이다.
도 2는 도 1에 도시된 상권 흐름 판별부를 설명하기 위한 블록도이다.
도 3은 본 발명의 일실시예에 따른 지역 상권 흐름 분석 방법을 설명하기 위한 흐름도이다.
도 4는 도 3에 도시된 상권 지역을 판별하는 단계를 설명하기 위한 흐름도이다.
도 5는 도 4에 도시된 지역별 상권분석 단계를 설명하기 위한 흐름도이다.
도 6은 도 4에 도시된 기간별 상권분석 단계를 설명하기 위한 흐름도이다.
도 7은 도 4에 도시된 웹 문서 종류별 상권분석 단계를 설명하기 위한 흐름도이다.
도 8은 도 4에 도시된 관심지점 카테고리별 상권분석 단계를 설명하기 위한 흐름도이다.
도 9는 웹 문서에서 기간별 지역별 추출된 관심지점 형태소와 위치 형태소의 전체 개수를 설명하기 위한 그래프이다.
도 10은 각 지역에 대응하는 관심지점(POI) 형태소의 전체 개수의 예를 설명하기 위한 그래프이다.
도 11은 각 지역의 2D 단위 면적당 관심지점(POI)의 밀도의 예를 설명하기 위한 그래프이다.
도 12는 지역의 3D 단위 면적당 관심지점(POI)의 밀도의 예를 설명하기 위한 그래프이다.
도 13은 단순 상권지수를 설명하기 위한 그래프이다. 특히 표 6에 대응하는 그래프가 도시된다.
도 14는 2D 면적이 반영된 2D 상권지수를 설명하기 위한 그래프이다.
도 15는 3D 면적이 반영된 3D 상권지수를 설명하기 위한 그래프이다. FIG. 1 is a block diagram for explaining a local commercial flow analysis system according to an embodiment of the present invention.
FIG. 2 is a block diagram for explaining the commercial flow determiner shown in FIG. 1. FIG.
FIG. 3 is a flowchart illustrating a method of analyzing a local commercial floor flow according to an exemplary embodiment of the present invention.
FIG. 4 is a flowchart for explaining the step of discriminating the commercial area shown in FIG.
FIG. 5 is a flowchart for explaining the regional merchandise analysis steps shown in FIG.
FIG. 6 is a flow chart for explaining the commercial article analysis step according to the period shown in FIG.
FIG. 7 is a flow chart for explaining the merchandise analysis step for each web document type shown in FIG.
FIG. 8 is a flowchart for explaining a commercial area analysis step for each point of interest category shown in FIG.
9 is a graph for explaining the total number of interest point morphemes and location morphemes extracted by region and region in a web document.
10 is a graph for explaining an example of the total number of points of interest (POI) morphemes corresponding to each area.
11 is a graph for explaining an example of the density of the point of interest (POI) per 2D unit area of each area.
FIG. 12 is a graph for explaining an example of the density of the point of interest (POI) per 3D unit area of the area.
13 is a graph for explaining the simple commercial exponent index. In particular, a graph corresponding to Table 6 is shown.
FIG. 14 is a graph for explaining the 2D commercial index reflecting the 2D area.
FIG. 15 is a graph for explaining the 3D premium index reflecting the 3D area.

이하, 첨부한 도면들을 참조하여, 본 발명을 보다 상세하게 설명하고자 한다. 본 발명은 다양한 변경을 가할 수 있고 여러 가지 형태를 가질 수 있는 바, 특정 실시예들을 도면에 예시하고 본문에 상세하게 설명하고자 한다. 그러나, 이는 본 발명을 특정한 개시 형태에 대해 한정하려는 것이 아니며, 본 발명의 사상 및 기술 범위에 포함되는 모든 변경, 균등물 내지 대체물을 포함하는 것으로 이해되어야 한다. DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS The present invention will now be described in more detail with reference to the accompanying drawings. The present invention is capable of various modifications and various forms, and specific embodiments are illustrated in the drawings and described in detail in the text. It should be understood, however, that the invention is not intended to be limited to the particular forms disclosed, but includes all modifications, equivalents, and alternatives falling within the spirit and scope of the invention.

제1, 제2 등의 용어는 다양한 구성요소들을 설명하는데 사용될 수 있지만, 상기 구성요소들은 상기 용어들에 의해 한정되어서는 안 된다. 상기 용어들은 하나의 구성요소를 다른 구성요소로부터 구별하는 목적으로만 사용된다. 예를들어, 본 발명의 권리 범위를 벗어나지 않으면서 제1 구성요소는 제2 구성요소로 명명될 수 있고, 유사하게 제2 구성요소도 제1 구성요소로 명명될 수 있다. 단수의 표현은 문맥상 명백하게 다르게 뜻하지 않는 한, 복수의 표현을 포함한다. The terms first, second, etc. may be used to describe various components, but the components should not be limited by the terms. The terms are used only for the purpose of distinguishing one component from another. For example, without departing from the scope of the present invention, the first component may be referred to as a second component, and similarly, the second component may also be referred to as a first component. The singular expressions include plural expressions unless the context clearly dictates otherwise.

본 출원에서, "포함하다" 또는 "가지다" 등의 용어는 명세서 상에 기재된 특징, 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것이 존재함을 지정하려는 것이지, 하나 또는 그 이상의 다른 특징들이나 숫자, 단계, 동작, 구성요소, 부분품 또는 이들을 조합한 것들의 존재 또는 부가 가능성을 미리 배제하지 않는 것으로 이해되어야 한다. In this application, the terms "comprises", "having", and the like are used to specify that a feature, a number, a step, an operation, an element, a part or a combination thereof is described in the specification, But do not preclude the presence or addition of one or more other features, integers, steps, operations, components, parts, or combinations thereof.

또한, 다르게 정의되지 않는 한, 기술적이거나 과학적인 용어를 포함해서 여기서 사용되는 모든 용어들은 본 발명이 속하는 기술 분야에서 통상의 지식을 가진 자에 의해 일반적으로 이해되는 것과 동일한 의미를 가지고 있다. 일반적으로 사용되는 사전에 정의되어 있는 것과 같은 용어들은 관련 기술의 문맥 상 가지는 의미와 일치하는 의미를 가지는 것으로 해석되어야 하며, 본 출원에서 명백하게 정의하지 않는 한, 이상적이거나 과도하게 형식적인 의미로 해석되지 않는다. Also, unless otherwise defined, all terms used herein, including technical or scientific terms, have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. Terms such as those defined in commonly used dictionaries are to be interpreted as having a meaning consistent with the contextual meaning of the related art and are to be interpreted as either ideal or overly formal in the sense of the present application Do not.

설명에 앞서, 본 발명에서 언급하는 웹 문서는 온라인 매체 상에 네티즌들에 의해 게재된 본문, 본문에 부가된 댓글, 첨부파일 또는 이미지로서, 특정 지역, 특정 업종에 대한 네티즌의 긍정 또는 부정 등의 의견을 포함한다.Prior to the description, the web document referred to in the present invention includes texts posted by netizens on the online medium, comments added to the texts, attachments or images, and the like, such as affirmation or negation of netizen Include comments.

온라인 매체는 네티즌들에 의해 웹 문서가 발생될 수 있는 웹 사이트(신문 사이트, 방송 사이트, 전문 매장 사이트), 게시판, 블로그, 카페, 쇼셜 네트워크 서비스를 지원하는 사이트 등 다른 사용자간 커뮤니티(community)가 가능한 채널을 의미한다.The online medium is a community between other users, such as web sites (newspaper sites, broadcast sites, specialty store sites), bulletin boards, blogs, cafes, sites that support social network services, etc. where web documents can be generated by netizens Means a possible channel.

도 1은 본 발명의 일실시예에 따른 지역 상권 흐름 분석 시스템을 설명하기 위한 블럭도이다. FIG. 1 is a block diagram for explaining a local commercial flow analysis system according to an embodiment of the present invention.

도 1을 참조하면, 지역 상권 흐름 분석 시스템(100)은 웹 문서 수집부(110), 웹 문서 저장 DB(120), 형태소 분석부(130), 형태소 분류-저장부(140), 관심지점 형태소 저장 DB(150), 위치 형태소 저장 DB(160) 및 상권 흐름 판별부(170)를 포함한다. 도 1에서, 지역 상권 흐름 분석 시스템(100)은 상기 웹 문서 수집부(110), 상기 웹 문서 저장 DB(120), 상기 형태소 분석부(130), 상기 형태소 분류-저장부(140), 상기 관심지점(Point of Interest; POI) 형태소 저장 DB(150), 상기 위치 형태소 저장 DB(160) 및 상기 상권 흐름 판별부(170)로 구성된 것으로 도시하였으나, 이는 설명의 편의를 위해 논리적으로 구분하였을 뿐 하드웨어적으로 구분한 것은 아니다. 1, the local merchant flow analysis system 100 includes a web document collection unit 110, a web document storage DB 120, a morphological analysis unit 130, a morphological classification-storage unit 140, A storage DB 150, a location morpheme storage DB 160, and a merchant flow determination unit 170. 1, the local merchandise flow analysis system 100 includes the web document collection unit 110, the web document storage DB 120, the morpheme analysis unit 130, the morpheme classification-storage unit 140, A point of interest (POI) morpheme storage DB 150, the location morpheme storage DB 160, and the merchandise flow discrimination unit 170. However, this is logically divided for convenience of explanation It is not a hardware distinction.

일반적으로 관심지점(POI) 데이터는 주요 시설물, 역, 공항, 터미널, 호텔 등을 좌표로 전자 수치 지도에 표시하는 데이터이다. 보통 목적지 검색에 사용되는 검색 데이터와 바탕 화면에 표시만 되는 바탕 데이터로 구분할 수 있다. 목표지 검색에서 사용자가 목적지에 대한 정보, 즉 목적지의 주소나 전화번호 또는 정확한 명칭을 알고 있는 경우는 관심지점 데이터를 직접 입력함으로써 직접 목적지를 검색할 수 있다. Point of interest (POI) data is data that displays an electronic digital map with coordinates of major facilities, stations, airports, terminals, hotels, and the like. It can be classified into search data used for destination search and background data only displayed on the desktop. If the user knows the information about the destination, that is, the address or telephone number of the destination or the exact name, the user can directly search for the destination by directly inputting the point of interest data.

본 실시예에서, 관심지점 형태소는 상기한 POI 데이터에 대응하는 텍스트(즉, 문자)를 지칭하고, 위치 형태소는 관심지점 형태소에 등록되지는 않았으나 일반인들이 많이 사용하는 위치를 지칭하는 텍스트이다. In this embodiment, the point of interest morpheme refers to the text (i. E., Character) corresponding to the POI data, and the location morpheme is text that is not registered in the point of interest morpheme but refers to a location commonly used by the public.

상기 웹 문서 수집부(110)는 복수의 웹 문서들을 수집하여 상기 웹 문서 저장 DB(120)에 저장한다. 상기 웹 문서는 소셜커머스, 공연, SNS, 블로그, 뉴스 등을 포함할 수 있다. The web document collection unit 110 collects a plurality of web documents and stores the collected web documents in the web document storage DB 120. The web document may include social commerce, performance, SNS, blog, news, and the like.

상기 형태소 분석부(130)는 상기 웹 문서 저장 DB(120)에 저장된 웹 문서들에서 형태소를 분석한다.The morpheme analysis unit 130 analyzes the morphemes in the web documents stored in the web document DB 120.

상기 형태소 분류-저장부(140)는 형태소 추출부(142) 및 형태소 분류부(144)를 포함하고, 상기 형태소 분석부(130)에서 분석된 형태소들을 지역별로 분류하여 상기 관심지점 형태소 저장 DB(150) 또는 상기 위치 형태소 저장 DB(160)에 저장한다.The morpheme classifying and storing unit 140 includes a morpheme extracting unit 142 and a morpheme sorter 144. The morpheme sorter storing unit 140 classifies morphemes analyzed by the morpheme analyzing unit 130 into regions and stores them in the interest point morpheme storage DB 150) or the location morpheme storage DB (160).

상기 형태소 추출부(142)는 분석된 형태소들에서 관심지점 정보에 대응하는 관심지점 형태소 또는 위치 정보에 대응하는 위치 형태소를 추출하여 상기 형태소 분류부(144)에 제공한다. The morpheme extracting unit 142 extracts a morpheme corresponding to the interest point morpheme or the position information corresponding to the interest point information in the analyzed morphemes and provides the extracted morpheme to the morpheme sorter 144.

상기 형태소 분류부(144)는 상기 형태소 추출부(142)에서 제공되는 관심지점 형태소 또는 위치 형태소를 지역 기준으로 각각 분류하여 상기 관심지점 형태소 저장 DB(150) 또는 상기 위치 형태소 저장 DB(160)에 저장한다.The morpheme classifier 144 classifies the morphemes of interest or position morphemes provided by the morpheme extractor 142 on the basis of regions and stores them in the interest point morpheme storage DB 150 or the location morpheme storage DB 160 .

상기 관심지점 형태소 저장 DB(150)는 상기 형태소 분류부(144)에 의해 분류된 관심지점 형태소를 지역별로 저장한다. The interest point morpheme storage DB 150 stores the interest point morpheme classified by the morpheme classifying unit 144 for each region.

상기 위치 형태소 저장 DB(160)는 상기 형태소 분류부(144)에 의해 분류된 위치 형태소를 지역별로 저장한다. The location morpheme storage DB 160 stores location morphemes classified by the morpheme classification unit 144 for each region.

상기 상권 흐름 판별부(170)는 지역 기준으로 저장된 형태소의 수, 즉 관심지점 형태소의 수 및 위치 형태소의 수를 근거로 상권 지역을 판별한다.The merchandise flow determiner 170 determines the merchant area based on the number of morphemes stored on an area basis, that is, the number of morpheme points of interest and the number of morphemes.

이상에서 설명된 바와 같이, 본 실시예에 따르면, 지역 정보나 지역 특성을 시간의 흐름으로 파악하여 지역에 대한 상권분석이 필요하다는 점에 착안하여, 소셜 커머스, 공연, SNS, 블로그, 뉴스 등과 같은 실시간 웹 문서를 수집하고 수집된 웹 문서에서 위치 정보 및 관심지점(POI)을 추출하고, 지역 기준으로 분류 및 분석한 후, 언급된 횟수를 기반으로 인기 지역을 판별한다. 특히, 추출된 웹 문서의 종류 및 관심지점(POI) 카테고리에 따라 어떤 분야에서 인기가 있는지 판별할 수 있다. As described above, according to the present embodiment, attention is paid to the fact that regional information and regional characteristics are analyzed as a flow of time and regional trade analysis is required. Thus, it is possible to analyze commercials such as social commerce, performances, SNS, We collect real-time web documents, extract location information and points of interest (POIs) from the collected web documents, classify and analyze them on a regional basis, and then identify popular areas based on the number of times mentioned. In particular, it is possible to determine which category is popular depending on the type of the extracted web document and the POI category.

도 2는 도 1에 도시된 상권 흐름 판별부를 설명하기 위한 블록도이다. FIG. 2 is a block diagram for explaining the commercial flow determiner shown in FIG. 1. FIG.

도 2를 참조하면, 상권 흐름 판별부(170)는 지역별 상권분석 모듈(172), 기간별 상권분석 모듈(174), 웹 문서 종류별 상권분석 모듈(176), 관심지점 카테고리별 상권분석 모듈(178) 및 인터페이스 모듈(179)을 포함하고, 지역 기준으로 저장된 형태소의 수를 근거로 상권 지역을 판별한다. 도 2에서, 상권 흐름 판별부(170)는 상기 지역별 상권분석 모듈(172), 상기 기간별 상권분석 모듈(174), 상기 웹 문서 종류별 상권분석 모듈(176), 상기 관심지점 카테고리별 상권분석 모듈(178) 및 상기 인터페이스 모듈(179)로 구성된 것으로 도시하였으나, 이는 설명의 편의를 위해 논리적으로 구분하였을 뿐 하드웨어적으로 구분한 것은 아니다. Referring to FIG. 2, the merchandise flow discrimination unit 170 includes a regional merchandise analysis module 172, a merchandise trade analysis module 174, a merchandise trade analysis module 176 according to a web document type, a merchandise trade analysis module 178, And an interface module 179, and determines a commercial area based on the number of morphemes stored on an area basis. 2, the merchandise flow determiner 170 receives the merchandise trade analysis module 172, the merchandise trade analysis module 174, the web site category analysis module 176, 178 and the interface module 179. However, the interface module 179 is logically divided for the sake of convenience of description, but is not limited to hardware.

상기 지역별 상권분석 모듈(172)은 상기 인터페이스 모듈(179)을 통해 지역 선택을 위한 정보를 단말기(300)에 제공한다. 상기한 지역 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 예를들어, 실시간 또는 주기적으로 지역별 상권분석이 수행되도록 프로그램된 경우 상기한 지역 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 이어, 상기 지역별 상권분석 모듈(172)은 상기한 단말기(300)를 통해 특정 지역이 선택되면, 선택된 지역에 대응하는 관심지점 형태소의 수를 카운트한다. 관심지점 형태소에 가중치를 반영하도록 설정되어 있다면, 상기 지역별 상권분석 모듈(172)은 상기 관심지점 형태소의 수에 가중치를 반영한다. 또한, 상기 지역별 상권분석 모듈(172)은 선택된 지역에 대응하는 위치 형태소의 수를 카운트한다. 위치 형태소에 가중치를 반영하도록 설정되어 있다면, 상기 지역별 상권분석 모듈(172)은 상기 위치 형태소의 수에 가중치를 반영한다. 상기 지역별 상권분석 모듈(172)은 상기 관심지점 형태소의 수와 상기 위치 형태소의 수를 합산한 후, 합산치와 기준치를 서로 비교한다. 상기 합산치가 상기 기준치보다 작은 것으로 체크되면, 상기 지역별 상권분석 모듈(172)은 선택된 지역을 비인기 상권으로 판별한다. 상기 합산치가 상기 기준치보다 큰 것으로 체크되면, 상기 지역별 상권분석 모듈(172)은 선택된 지역을 인기 상권으로 판별한다. 상기 합산치가 상기 기준치와 같은 것으로 체크되면, 상기 지역별 상권분석 모듈(172)은 선택된 지역을 정체 상권으로 판별한다.The regional commercial value analysis module 172 provides the terminal 300 with information for area selection through the interface module 179. [ Providing the information for the area selection to the terminal 300 may be omitted. For example, if the regional commercial analysis is programmed to be performed in real time or periodically, providing the information for the area selection to the terminal 300 may be omitted. Then, when the specific area is selected through the terminal 300, the area-specific commercial value analysis module 172 counts the number of interest point morpresent corresponding to the selected area. If the weights are set to reflect the weights at the points of interest, the regional commercial analysis module 172 reflects the weights to the number of points of interest. In addition, the regional commercial analysis module 172 counts the number of location morpres- sories corresponding to the selected area. If the location morpheme is set to reflect the weight, the regional commercial analysis module 172 reflects the weight to the number of location morphemes. The regional commercial value analysis module 172 adds the number of the morpheme points of interest and the number of the position morphemes, and then compares the sum value and the reference value with each other. If it is checked that the total value is smaller than the reference value, the region-specific commercial analysis module 172 discriminates the selected region as a non-commercial commercial. If it is checked that the total value is larger than the reference value, the region-specific commercial value analysis module 172 determines the selected region as a popular commercial area. If it is checked that the sum value is the same as the reference value, the region-specific commercial area analysis module 172 discriminates the selected area as a congested commercial area.

상기 기간별 상권분석 모듈(174)은 상기 인터페이스 모듈(179)을 통해 지역 선택을 위한 정보를 상기 단말기(300)에 제공한다. 상기한 지역 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 예를들어, 실시간 또는 주기적으로 지역별 상권분석이 수행되도록 프로그램된 경우 상기한 지역 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 이어, 상기 기간별 상권분석 모듈(174)은 상기 단말기(300)를 통해 특정 지역이 선택된 것으로 체크되면, 제1 기간 및 제2 기간 선택을 위한 정보를 상기 단말기(300)에 제공한다. 상기 기간별 상권분석 모듈(174)은 상기 제1 기간 및 상기 제2 기간이 확정된 것으로 체크되면, 선택된 지역과 선택된 제1 기간에 대응하는 형태소의 수를 카운트되어 제1 카운트값을 생성한다. 상기 기간별 상권분석 모듈(174)은 선택된 지역과 선택된 제2 기간에 대응하는 형태소의 수를 카운트하여 제2 카운트값을 생성한다. 이어, 상기 기간별 상권분석 모듈(174)은 상기 제1 카운트값과 상기 제2 카운트값을 비교하여, 제1 카운트값이 제2 카운트값보다 큰 것으로 체크되면, 선택된 지역이 제2 기간 동안 인기상권인 것으로 판별한다. 상기 기간별 상권분석 모듈(174)은 상기 제1 카운트값이 상기 제2 카운트값보다 작은 것으로 체크되면, 선택된 지역이 제1 기간 동안 인기 상권인 것으로 판별한다. 상기 기간별 상권분석 모듈(174)은 상기 제1 카운트값과 상기 제2 카운트값이 같은 것으로 체크되면, 선택된 지역이 제1 기간이나 제2 기간 동안, 정체 상권인 것으로 판별한다.The periodic merchandise analysis module 174 provides the terminal 300 with information for area selection through the interface module 179. [ Providing the information for the area selection to the terminal 300 may be omitted. For example, if the regional commercial analysis is programmed to be performed in real time or periodically, providing the information for the area selection to the terminal 300 may be omitted. Then, the periodic merchandise analysis module 174 provides the terminal 300 with information for selecting the first period and the second period, if the specific region is selected through the terminal 300. [ If it is checked that the first period and the second period are confirmed, the periodic commercial value analysis module 174 counts the number of morpheme corresponding to the selected region and the selected first period to generate a first count value. The periodic commercial analysis module 174 counts the number of morphemes corresponding to the selected region and the selected second period to generate a second count value. If the first count value is greater than the second count value, then the merchandise value analysis module 174 compares the first count value with the second count value. If the first count value is greater than the second count value, . If the first count value is less than the second count value, the periodical commercial value analysis module 174 determines that the selected region is a popular commercial area for the first period. If the first count value and the second count value are checked to be equal to each other, the periodical commercial value analysis module 174 determines that the selected region is a stagnant commercial region for the first period or the second period.

상기 웹 문서 종류별 상권분석 모듈(176)은 상기 인터페이스 모듈(179)을 통해 웹 문서 선택을 위한 정보를 상기 단말기(300)에 제공한다. 상기한 웹 문서 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 예를들어, 실시간 또는 주기적으로 웹 문서 종류별 상권분석이 수행되도록 프로그램된 경우 상기한 웹 문서 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 상기 웹 문서 종류별 상권분석 모듈(176)은 상기 인터페이스 모듈(179)을 통해 상기한 단말기(300)를 통해 특정 웹 문서가 선택된 것으로 체크되면, 선택된 웹 문서에 대응하는 관심지점 형태소의 수를 카운트한다. 관심지점 형태소에 가중치를 반영하도록 설정되어 있다면, 상기 웹 문서 종류별 상권분석 모듈(176)은 상기 관심지점 형태소의 수에 가중치를 반영한다. 이어, 상기 웹 문서 종류별 상권분석 모듈(176)은 선택된 웹 문서에 대응하는 위치 형태소의 수를 카운트한다. 위치 형태소에 가중치를 반영하도록 설정되어 있다면, 상기 웹 문서 종류별 상권분석 모듈(176)은 상기 위치 형태소의 수에 가중치를 반영한다. 상기 웹 문서 종류별 상권분석 모듈(176)은 상기 관심지점 형태소의 수와 상기 위치 형태소의 수를 합산하여 합산치를 생성한다. 상기 웹 문서 종류별 상권분석 모듈(176)은 생성된 합산치와 기 설정된 기준치를 비교한다. 상기 합산치가 상기 기준치보다 작은 것으로 체크되면, 상기 웹 문서 종류별 상권분석 모듈(176)은 선택된 웹 문서에 포함된 지역이 비인기 상권인 것으로 판별한다. 상기 합산치가 상기 기준치보다 큰 것으로 체크되면, 상기 웹 문서 종류별 상권분석 모듈(176)은 선택된 웹 문서에 포함된 지역이 인기 상권인 것으로 판별한다. 상기 합산치가 상기 기준치와 같은 것으로 체크되면, 상기 웹 문서 종류별 상권분석 모듈(176)은 선택된 웹 문서에 포함된 지역이 정체 상권인 것으로 판별한다.The merchandise analysis module 176 for each web document type provides information for selecting a web document to the terminal 300 through the interface module 179. [ Providing the information for selecting the web document to the terminal 300 may be omitted. For example, if it is programmed to perform commercial analysis by web document type in real time or periodically, providing the information for selecting the web document to the terminal 300 may be omitted. If it is checked through the interface module 179 that the specific web document is selected through the terminal 300, the merchandise value analysis module 176 for each web document type counts the number of points of interest corresponding to the selected web document . If the weights are set to reflect the weights at the interest point morpheme, the merchandise value analysis module 176 for each web document type reflects the weights to the number of morpheme points of interest. Then, the commercial document analysis module 176 counts the number of location morphemes corresponding to the selected web document. If the position morpheme is set to reflect the weight, the commodity analysis module 176 for each web document type reflects the weight to the number of position morphemes. The commercial document analysis module 176 for each web document type sums up the number of interest morphemes and the number of location morphemes to generate a sum value. The commercial document analysis module 176 for each web document type compares the generated sum value with predetermined reference values. If it is checked that the total value is smaller than the reference value, the merchandise analysis module 176 for each web document type determines that the area included in the selected web document is a non-merchant merchant. If it is checked that the sum is larger than the reference value, the merchandise analysis module 176 for each web document type determines that the region included in the selected web document is a popular merchandise. If the total value is checked to be equal to the reference value, the merchandise entry analysis module 176 for each web document type determines that the area included in the selected web document is a stadium area.

상기 관심지점 카테고리별 상권분석 모듈(178)은 상기 인터페이스 모듈(179)을 통해 관심지점 카테고리 선택을 위한 정보를 상기 단말기(300)에 제공한다. 상기한 관심지점 카테고리 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 예를들어, 실시간 또는 주기적으로 관심지점 카테고리별 상권분석이 수행되도록 프로그램된 경우 상기한 관심지점 카테고리 선택을 위한 정보를 단말기(300)에 제공하는 것은 생략될 수도 있다. 상기 관심지점 카테고리별 상권분석 모듈(178)은 상기 인터페이스 모듈(179)을 통해 상기한 단말기(300)를 통해 특정 관심지점 카테고리가 선택되면, 선택된 관심지점 카테고리에 대응하는 관심지점 형태소의 수를 카운트한다. 관심지점 형태소에 가중치를 반영하도록 설정되어있다면, 상기 관심지점 카테고리별 상권분석 모듈(178)은 상기 관심지점 형태소의 수에 가중치를 반영한다. 상기 관심지점 카테고리별 상권분석 모듈(178)은 선택된 관심지점 카테고리에 대응하는 위치 형태소의 수를 카운트한다. 위치 형태소에 가중치를 반영하도록 설정되어 있다면, 상기 관심지점 카테고리별 상권분석 모듈(178)은 상기 위치 형태소의 수에 가중치를 반영한다. 상기 관심지점 카테고리별 상권분석 모듈(178)은 상기 관심지점 형태소의 수와 상기 위치 형태소의 수를 합산하고, 상기 합산치와 기준치를 비교한다. 상기 합산치가 상기 기준치보다 작은 것으로 체크되면, 상기 관심지점 카테고리별 상권분석 모듈(178)은 선택된 관심지점 카테고리가 비인기 상권인 것으로 판별한다. 상기 합산치가 상기 기준치보다 큰 것으로 체크되면, 상기 관심지점 카테고리별 상권분석 모듈(178)은 선택된 관심지점 카테고리가 인기 상권인 것으로 판별한다. 상기 합산치가 상기 기준치와 같은 것으로 체크되면, 상기 관심지점 카테고리별 상권분석 모듈(178)은 선택된 관심지점 카테고리가 정체 상권인 것으로 판별한다.The trade point analysis module 178 for each point of interest category provides the terminal 300 with information for selecting a point of interest category via the interface module 179. Providing the terminal 300 with information for the above-mentioned interest point category selection may be omitted. For example, providing the terminal 300 with information for the above-mentioned interest point category selection may be omitted when the commercial analysis is performed to perform the commercial point-by-point category analysis in real time or periodically. When the particular interest point category is selected through the terminal 300 through the interface module 179, the commercial value analysis module 178 for each interest point category counts the number of interest point morpheme corresponding to the selected interest point category do. If the interest point morpheme is set to reflect the weight, the trade point analysis module for each point of interest 178 reflects the weight to the number of points of interest morpheme. The trade point analysis module 178 for each point of interest category counts the number of position morphemes corresponding to the selected point of interest category. If the location morpheme is set to reflect the weight, the trade point analysis module 178 for each point of interest category reflects the weight to the number of location morphemes. The merchandise analysis module 178 for each interest point category sums the number of interest morphemes and the number of position morphemes, and compares the summation value with a reference value. If it is checked that the sum value is smaller than the reference value, the commercial spot analysis module 178 of the interest spot category determines that the selected interest spot category is the non-commercial spot. If it is checked that the summation value is larger than the reference value, the commercial value analysis module 178 for each point of interest category determines that the selected point of interest category is the popular commercial value. If the summation value is checked to be equal to the reference value, the merchandise analysis module 178 for each interested point category determines that the selected interest point category is a congested merchant.

도 3은 본 발명의 일실시예에 따른 지역 상권 흐름 분석 방법을 설명하기 위한 흐름도이다. FIG. 3 is a flowchart illustrating a method of analyzing a local commercial floor flow according to an exemplary embodiment of the present invention.

도 3을 참조하면, 소셜커머스, 공연, SNS, 블로그, 뉴스 등과 같은 웹 문서가 수집되어 저장된다(단계 S100). 상기한 웹 문서는 실시간으로 수집될 수도 있고, 일정 시간마다 주기적으로 수집될 수도 있다. Referring to FIG. 3, Web documents such as social commerce, performances, SNS, blog, news, and the like are collected and stored (step S100). The web document may be collected in real time or may be periodically collected at predetermined intervals.

이어, 수집된 웹 문서에서 형태소가 분석된다(단계 S102). 본 실시예에서, 상기 형태소는 관심지점(POI) DB에 등록된 정보에 대응하는 관심지점 형태소 및 상기 관심지점 DB에 등록되지 않은 위치 정보에 대응하는 위치 형태소를 포함할 수 있다. Next, the morpheme is analyzed in the collected web document (step S102). In this embodiment, the morpheme may include a point morpheme corresponding to the information registered in the point of interest (POI) DB and a position morpheme corresponding to the position information not registered in the point of interest DB.

이어, 분석된 형태소들에서 하나의 형태소가 추출된다(단계 S104). Then, one morpheme is extracted from the analyzed morphemes (step S104).

이어, 단계 S104에서 추출된 형태소가 관심지점 형태소인지의 여부가 체크된다(단계 S110). Then, it is checked whether or not the morpheme extracted in step S104 is the interest point morpheme (step S110).

단계 S110에서 관심지점 형태소인 것으로 체크되면, 지역 기준으로 관심지점 형태소가 분류된다(단계 S112). If it is checked in step S110 that the point of interest is a morpheme of interest, the interest point morpheme is classified on a regional basis (step S112).

이어, 분류된 관심지점 형태소가 관심지점 형태소 DB에 저장된다(단계 S114). Then, the classified interest point morpheme is stored in the interest point morpheme DB (step S114).

한편, 단계 S110에서 관심지점 형태소가 아닌 것으로 체크되면, 위치 형태소인지의 여부가 체크된다(단계 S120). On the other hand, if it is checked in step S110 that the morpheme is not the point of interest, it is checked whether or not the morpheme is a position morpheme (step S120).

단계 S120에서 위치 형태소인 것으로 체크되지 않으면, 단계 S104로 피드백된다. 관심지점 형태소도 아니고 위치 형태소도 아닌 형태소는 인물의 이름을 지칭하는 형태소나 일반적인 물건, 예를들어, 컵, 쟁반, 책상 등과 같은 사물을 지칭하는 형태소 등일 수 있다. If it is not checked that it is a position morpheme in step S120, the process returns to step S104. A morpheme that is neither a point of interest point nor a position morpheme can be a morpheme that refers to the name of a person or a morpheme that refers to a general object, such as a cup, a tray, a desk, and the like.

단계 S120에서 위치 형태소인 것으로 체크되면, 지역 기준으로 위치 형태소가 분류된다(단계 S122). If it is checked in step S120 that the location morpheme is located, the location morpheme is classified on an area basis (step S122).

이어, 분류된 위치 형태소가 위치 형태소 저장 DB에 저장된다(단계 S124). Then, the classified position morpheme is stored in the position morpheme storage DB (step S124).

분석된 형태소들에서 추출되지 않은 잔여 형태소가 존재하는지의 여부가 체크된다(단계 S130). It is checked whether there is a residual morpheme not extracted in the analyzed morphemes (step S130).

단계 S130에서 잔여 형태소가 존재하는 것으로 체크되면, 단계 S104로 피드백된다. If it is checked in step S130 that the residual morpheme exists, the flow returns to step S104.

단계 S130에서 잔여 형태소가 존재하지 않은 것으로 체크되면, 형태소들의 수, 즉 관심지점 형태소들의 수와 위치 형태소들의 수를 근거로 상권 지역이 판별된다(단계 S140). If it is checked in step S130 that the residual morpheme is not present, the commercial area is determined based on the number of morphemes, that is, the number of interest morpres and the number of position morpreses (step S140).

관심지점 형태소들의 수가 많거나 위치 형태소들의 수가 많다는 것은 많은 사람들이 각종 웹 문서를 생성하거나 이를 재생산하는 행위가 많은 것을 의미한다. 이에 따라, 특정 지역의 상권이 인기 상권인지 비인기 상권인지를 판별할 수 있다. 예를들어, 특정 지역에서, 관심지점 형태소들의 수나 위치 형태소들의 수가 기준치보다 많다면 해당 지역은 인기 상권 지역으로 판별될 수 있고, 관심지점 형태소들의 수나 위치 형태소들의 수가 기준치보다 적다면 해당 지역은 비인기 상권 지역으로 판별될 수 있다. A large number of interest point morphemes or a large number of position morphemes means that many people generate and reproduce various web documents. Accordingly, it is possible to determine whether the commercial area of a specific area is a popular commercial area or an unlicensed commercial area. For example, in a particular area, if the number of interest point morphemes or the number of position morphemes is greater than the reference value, then the region can be identified as a popular commercial area, and if the number of interest morphemes or the number of position morphemes is smaller than the reference value, It can be determined as a commercial area.

또한, 일정 기간 동안 특정 지역의 상권 흐름을 파악할 수도 있다. 예를들어, 특정 지역에 대응하여 올해 여름 기간 동안 관심지점 형태소들의 수나 위치 형태소들의 수가 특정 지역에 대응하여 1년 전 여름 기간 동안 관심지점 형태소들의 수나 위치 형태소들의 수 보다 많다면 해당 지역은 인기 상권 지역으로 부상되고 있는 것으로 판별될 수 있다. 반면에, 특정 지역에 대응하여 올해 여름 기간 동안 관심지점 형태소들의 수나 위치 형태소들의 수가 특정 지역에 대응하여 1년 전 여름 기간 동안 관심지점 형태소들의 수나 위치 형태소들의 수 보다 적다면 해당 지역은 인기 상권 지역에서 퇴색되고 있는 것으로 판별될 수 있다.Also, it is possible to grasp the flow of commerce in a specific area for a certain period of time. For example, if the number of interest point morphemes or the number of position morphemes corresponding to a specific region during the summer of this year corresponds to a specific region and is greater than the number of interest morphemes or position morphemes during the summer period of one year ago, It can be judged that it is floating in the area. On the other hand, if the number of interest branch morphemes or the number of location morphemes corresponding to a specific region during the summer of this year corresponds to a specific region and is smaller than the number of interest morphemes or position morphemes during the summer period of one year ago, It can be judged that it is being faded.

도 4는 도 3에 도시된 상권 지역을 판별하는 단계를 설명하기 위한 흐름도이다. FIG. 4 is a flowchart for explaining the step of discriminating the commercial area shown in FIG.

도 3 및 도 4를 참조하면, 지역별 상권분석이 요청되는지의 여부가 체크된다(단계 S210). Referring to FIG. 3 and FIG. 4, it is checked whether regional commercial analysis is requested (step S210).

단계 S210에서 지역별 상권분석이 요청되는 것으로 체크되면, 지역별 상권이 분석된다(단계 S220). 상기한 지역별 상권분석은 후술하는 도 5에서 상세히 설명한다. 본 실시예에서, 지역별 상권분석 요청 여부를 체크하는 것은 실시간 또는 주기적으로 지역별 상권분석이 수행되도록 설정된 프로그램에 의해 이루어질 수 있다. 예를들어, 특정 시간이 도래됨에 따라 지역별 상권분석 요청이 이루어진 것으로 판단되어 상기한 지역별 상권 분석 동작이 이루어질 수 있다. 이에 따라, 무수히 많은 웹 문서들을 실시간으로 또는 일정 주기로 수집하여 수집된 웹 문서들에 포함된 각종 관심지점 형태소 또는 위치 형태소를 추출하고 이를 근거로 지역별 상권분석을 위한 각종 정보들을 저장하므로 과거 누적된 상권분석을 위한 정보들을 이용하여 시간의 흐름에 따른 상권분석 동작을 보다 빠르게 수행할 수 있고, 상권분석의 신뢰도를 높일 수 있다. If it is checked in step S210 that the regional merchandise analysis is requested, the regional merchandise is analyzed (step S220). The regional commercial analysis will be described in detail later with reference to FIG. In this embodiment, checking whether or not a request to analyze a commercial article by region can be made by a program configured to perform regional commercial analysis in real time or periodically. For example, it is determined that a request for analysis of a merchandise by region has been made according to the arrival of a specific time, and the above-described merchandise analysis for each region can be performed. Accordingly, it is possible to collect a large number of web documents in real time or at regular intervals, extract various interest point morphemes or location morphemes contained in the collected web documents, By using the information for analysis, it is possible to perform the commercial analysis operation according to the time, and the reliability of the commercial analysis can be improved.

단계 S210에서 지역별 상권분석이 미요청되는 것으로 체크되면, 기간별 상권분석이 요청되는지의 여부가 체크된다(단계 S230).If it is checked in step S210 that regional commercial analysis is not requested, it is checked whether or not a commercial analysis for each period is requested (step S230).

단계 S230에서 기간별 상권분석이 요청되는 것으로 체크되면, 기간별 상권이 분석된다(단계 S240). 상기한 기간별 상권분석은 후술하는 도 6에서 상세히 설명한다. 본 실시예에서, 기간별 상권분석 요청 여부를 체크하는 것은 실시간 또는 주기적으로 지역별 상권분석이 수행되도록 설정된 프로그램에 의해 이루어질 수 있다. 예를들어, 특정 시간이 도래됨에 따라 기간별 상권분석 요청이 이루어진 것으로 판단되어 상기한 기간별 상권 분석 동작이 이루어질 수 있다. 이에 따라, 무수히 많은 웹 문서들을 실시간으로 또는 일정 주기로 수집하여 수집된 웹 문서들에 포함된 각종 관심지점 형태소 또는 위치 형태소를 추출하고 이를 근거로 기간별 상권분석을 위한 각종 정보들을 저장하므로 과거 누적된 상권분석을 위한 정보들을 이용하여 시간의 흐름에 따른 상권분석 동작을 보다 빠르게 수행할 수 있고, 상권분석의 신뢰도를 높일 수 있다.If it is checked in step S230 that the commercial article analysis by period is requested, the commercial article by period is analyzed (step S240). The above-mentioned period-by-period commercial analysis will be described in detail with reference to FIG. In this embodiment, checking whether or not the request for analysis of the commercial article according to the period is performed can be made by a program configured to perform regional commercial analysis in real time or periodically. For example, it is determined that the request for the analysis of the merchandise for each period is made according to the arrival of the specific time, so that the merchandise analysis operation for each period can be performed. Accordingly, it is possible to collect a large number of web documents in real time or at regular intervals, to extract various interest point morphemes or location morphemes contained in the collected web documents, By using the information for analysis, it is possible to perform the commercial analysis operation according to the time, and the reliability of the commercial analysis can be improved.

단계 S230에서 기간별 상권분석이 미요청되는 것으로 체크되면, 웹 문서 종류별 상권분석이 요청되는지의 여부가 체크된다(단계 S250).If it is checked in step S230 that the commercial article analysis by period is not requested, it is checked whether commercial article analysis by the type of web document is requested (step S250).

단계 S250에서 웹 문서 종류별 상권분석이 요청되는 것으로 체크되면, 웹 문서 종류별 상권이 분석된다(단계 S260). 상기한 웹 문서 종류별 상권분석은 후술하는 도 7에서 상세히 설명한다. 본 실시예에서, 웹 문서 종류별 상권분석 요청 여부를 체크하는 것은 실시간 또는 주기적으로 웹 문서 종류별 상권분석이 수행되도록 설정된 프로그램에 의해 이루어질 수 있다. 예를들어, 특정 시간이 도래됨에 따라 웹 문서 종류별 상권분석 요청이 이루어진 것으로 판단되어 상기한 지역별 상권 분석 동작이 이루어질 수 있다. 이에 따라, 무수히 많은 웹 문서들을 실시간으로 또는 일정 주기로 수집하여 수집된 웹 문서들에 포함된 각종 관심지점 형태소 또는 위치 형태소를 추출하고 이를 근거로 웹 문서 종류별 상권분석을 위한 각종 정보들을 저장하므로 과거 누적된 상권분석을 위한 정보들을 이용하여 시간의 흐름에 따른 상권분석 동작을 보다 빠르게 수행할 수 있고, 상권분석의 신뢰도를 높일 수 있다.If it is checked in step S250 that the commercial document analysis by the type of the web document is requested, the commercial document by the web document type is analyzed (step S260). The above-described analysis of the commercial article according to the type of the web document will be described in detail with reference to FIG. In this embodiment, checking whether or not a commercial document analysis request for each type of web document is checked can be performed by a program configured to perform commercial document analysis for each type of web document in real time or periodically. For example, it is determined that a request for analyzing a piece of merchandise according to a type of a web document is made according to the arrival of a specific time, and the above-described merchandise analysis for each region can be performed. Accordingly, it collects a large number of web documents in real time or at regular intervals, extracts various interest point morphemes or location morphemes included in the collected web documents, and stores various information for commercial analysis according to the types of web documents based on the collected morphemes. By using the information for analyzing the commercial area, it is possible to perform the commercial analysis operation according to the time, and the reliability of the commercial area analysis can be improved.

단계 S250에서 웹 문서 종류별 상권분석이 미요청되는 것으로 체크되면, 관심지점 카테고리별 상권분석이 요청되는지의 여부가 체크된다(단계 S270).If it is checked in step S250 that the commercial document analysis by web document type is not requested, it is checked whether or not the commercial document analysis by interest point category is requested (step S270).

단계 S270에서 관심지점 카테고리별 상권분석이 요청되는 것으로 체크되면, 관심지점 카테고리별 상권이 분석된다(단계 S280). 상기한 관심지점 카테고리별 상권분석은 후술하는 도 8에서 상세히 설명한다. 본 실시예에서, 관심지점 카테고리별 상권분석 요청 여부를 체크하는 것은 실시간 또는 주기적으로 관심지점 카테고리별 상권분석이 수행되도록 설정된 프로그램에 의해 이루어질 수 있다. 예를들어, 특정 시간이 도래됨에 따라 관심지점 카테고리별 상권분석 요청이 이루어진 것으로 판단되어 상기한 지역별 상권 분석 동작이 이루어질 수 있다. 이에 따라, 무수히 많은 웹 문서들을 실시간으로 또는 일정 주기로 수집하여 수집된 웹 문서들에 포함된 각종 관심지점 형태소 또는 위치 형태소를 추출하고 이를 근거로 관심지점 카테고리별 상권분석을 위한 각종 정보들을 저장하므로 과거 누적된 상권분석을 위한 정보들을 이용하여 시간의 흐름에 따른 상권분석 동작을 보다 빠르게 수행할 수 있고, 상권분석의 신뢰도를 높일 수 있다.If it is checked in step S270 that a commercial analysis by interest point category is requested, commercials by interest point category are analyzed (step S280). The commercial analysis by the interest point categories described above will be described in detail in FIG. 8 to be described later. In this embodiment, checking whether or not to request a commercial analysis by interest point category can be performed by a program set to perform commercial analysis by interest point category in real time or periodically. For example, as the specific time comes, it is determined that a request for analysis of a commercial article by category of interest has been made, so that the commercial analysis operation can be performed according to the region. Accordingly, it is possible to collect a large number of web documents in real time or at regular intervals, to extract various interest point morphemes or location morphemes contained in the collected web documents, Using the information for the accumulated commercial analysis, it is possible to perform the commercial analysis operation according to the time, and the reliability of the commercial analysis can be improved.

본 실시예에서, 상권분석은 지역별 상권분석, 기간별 상권분석, 웹 문서 종류별 상권분석, 관심지점 카테고리별 상권분석의 순서로 이루어지는 것을 설명하였으나, 그 순서는 다양하게 변경될 수 있다. 또한, 상술된 상권분석 외에 다양한 다른 방식의 상권분석도 가능하다. In the present embodiment, the commercial analysis is performed in the order of regional commercial analysis, commercial analysis by period, commercial analysis by web document type, and commercial analysis by interest branch category, but the order may be variously changed. In addition to the above-described commercial article analysis, various other types of commercial article analysis are also possible.

또한, 본 실시예에서는, 지역별 상권을 분석하거나, 기간별 상권을 분석하거나, 웹 문서 종류별 상권을 분석하거나 관심지점 카테고리별 상권을 분석하는 단계들이 서로 분리되어 이루어지는 것을 설명하였으나, 상술된 상권을 분석하는 단계들은 서로 혼합되어 사용될 수도 있다. 예를들어, 지역별 상권과 기간별 상권을 분석하거나, 지역별 상권, 기간별 상권 및 웹 문서 종류별 상권을 분석할 수도 있다. 또한, 지역별 상권, 기간별 상권, 웹 문서 종류별 상권 및 관심지점 카테고리별 상권을 분석할 수도 있다. In addition, in the present embodiment, it has been explained that the steps of analyzing a commercial area by region, analyzing a commercial area by period, analyzing a commercial area according to a type of a web document, or analyzing a commercial area by category of interest are separately performed. However, The steps may be used mixed with each other. For example, it is possible to analyze regional commercial areas and periodical commercial areas, and analyze regional commercial areas, commercial areas by period, and commercial areas classified by types of web documents. It is also possible to analyze commercials by region, by period, by web document type, and by point of interest category.

도 5는 도 4에 도시된 지역별 상권분석 단계를 설명하기 위한 흐름도이다.FIG. 5 is a flowchart for explaining the regional merchandise analysis steps shown in FIG.

도 4 및 도 5를 참조하면, 특정 지역이 선택된다(단계 S300).Referring to FIGS. 4 and 5, a specific region is selected (step S300).

이어, 선택된 지역에 대응하는 관심지점 형태소의 수가 카운트된다(단계 S304).Then, the number of points of interest morpheme corresponding to the selected area is counted (step S304).

이어, 관심지점 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S306). 본 실시예에서, 관심지점 형태소에 반영되는 가중치는 관심지점 형태소에 의한 상권분석의 비중과 위치 형태소에 의한 상권분석의 비중을 조율하기 위해 관리자에 의해 다양하게 설정될 수 있다. 예를들어, 위치 형태소에 의한 상권분석의 비중보다 관심지점 형태소에 의한 상권분석의 비중을 높이고자 한다면, 관심지점 형태소에 가중치, 예를들어, 1.1이나 1.5 등을 곱하는 방식으로 관심지점 형태소의 수를 위치 형태소의 수보다 높여 관심지점 형태소에 의한 상권분석의 비중을 높일 수 있다. 한편, 위치 형태소에 의한 상권분석의 비중보다 관심지점 형태소에 의한 상권분석의 비중을 낮추고자 한다면, 관심지점 형태소에 가중치, 예를들어, 0.8이나 0.9 등을 곱하는 방식으로 관심지점 형태소의 수를 위치 형태소의 수보다 낮추어 위치 형태소에 의한 상권분석의 비중을 높일 수 있다.Then, whether or not the weight is reflected in the morpheme of interest is checked (step S306). In this embodiment, the weights reflected in the morpheme of the interest point can be variously set by the manager in order to adjust the weight of the commercial analysis by the interest point morpheme and the weight of the commercial analysis by the position morpheme. For example, if we want to increase the weight of the commercial analysis by the interest branch morpheme rather than the commercial segment analysis by the location morpheme, the number of branch morphemes of interest is multiplied by the weight, for example, 1.1 or 1.5, To the number of location morphemes, it is possible to increase the proportion of commercial analysis by the morpheme of interest. In order to reduce the proportion of commercial analysis by the interest branch morpheme, the number of branch morphemes of interest is multiplied by a weighting factor, for example, 0.8 or 0.9, The number of morphemes can be lowered, and the proportion of commercial analysis by location morphemes can be increased.

단계 S306에서 가중치 반영으로 체크되면, 관심지점 형태소의 수에 가중치가 반영된다(단계 S308).If weights are checked in step S306, the weight is reflected in the number of points of interest morpheme (step S308).

단계 S306에서 가중치 미반영으로 체크되거나 단계 S308에 이어, 선택된 지역에 대응하는 위치 형태소의 수가 카운트된다(단계 S310).In step S306, the number of position morpres- sories corresponding to the selected area is counted (step S310), or after the step S308, the number of position morpres- sories corresponding to the selected area is counted.

위치 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S312). 본 실시예에서, 위치 형태소에 반영되는 가중치는 관심지점 형태소에 의한 상권분석의 비중과 위치 형태소에 의한 상권분석의 비중을 조율하기 위해 관리자에 의해 다양하게 설정될 수 있다. 예를들어, 관심지점 형태소에 의한 상권분석의 비중보다 위치 형태소에 의한 상권분석의 비중을 높이고자 한다면, 위치 형태소에 가중치, 예를들어, 1.1이나 1.5 등을 곱하는 방식으로 위치 형태소의 수를 관심지점 형태소의 수보다 높여 위치 형태소에 의한 상권분석의 비중을 높일 수 있다. 한편, 관심지점 형태소에 의한 상권분석의 비중보다 위치 형태소에 의한 상권분석의 비중을 낮추고자 한다면, 위치 형태소에 가중치, 예를들어, 0.8이나 0.9 등을 곱하는 방식으로 위치 형태소의 수를 관심지점 형태소의 수보다 낮추어 관심지점 형태소에 의한 상권분석의 비중을 높일 수 있다.Whether or not the position morpheme reflects the weight is checked (step S312). In this embodiment, the weights reflected in the location morpheme can be variously set by the administrator to adjust the weight of the commercial analysis by the interest point morpheme and the weight of the commercial analysis by the location morpheme. For example, if we want to increase the proportion of commercial analysis by location morpheme rather than the proportion of commercial analysis by the interest branch morpheme, we can estimate the number of position morphemes by multiplying the position morpheme with weights, for example, 1.1 or 1.5, The number of branch morphemes is higher than that of branch morphemes. In order to reduce the proportion of commercial analysis by location morpheme rather than the proportion of commercial analysis by the interest branch morpheme, the number of position morphemes is multiplied by the weight, eg, 0.8 or 0.9, It is possible to increase the proportion of commercial analysis by the morpheme of interest.

단계 S312에서 위치 형태소의 수에 가중치가 반영된다(단계 S314).In step S312, the weight is reflected in the number of position morphemes (step S314).

단계 S312에서 가중치 미반영으로 체크되거나 단계 S314에 이어, 관심지점 형태소의 수와 위치 형태소의 수가 합산된다(단계 S316).In step S312, the number of interest morpres and the number of position morpreses are added (step S316).

합산치(S)와 기준치(R)가 서로 비교된다(단계 S318).The sum value S and the reference value R are compared with each other (step S318).

단계 S318에서 합산치(S)가 기준치(R)보다 작은 것으로 체크되면, 선택된 지역이 비인기 상권으로 판별된다(단계 S320).If it is checked in step S318 that the sum value S is smaller than the reference value R, the selected area is determined as an unauthorized commercial area (step S320).

단계 S318에서 합산치(S)가 기준치(R)보다 큰 것으로 체크되면, 선택된 지역이 인기 상권으로 판별된다(단계 S322).If it is checked in step S318 that the sum value S is larger than the reference value R, the selected area is determined as a popular commercial area (step S322).

단계 S318에서 합산치(S)가 기준치(R)와 같은 것으로 체크되면, 선택된 지역이 정체 상권으로 판별된다(단계 S324).If it is checked in step S318 that the sum value S is the same as the reference value R, the selected area is determined to be a still image area (step S324).

도 6은 도 4에 도시된 기간별 상권분석 단계를 설명하기 위한 흐름도이다.FIG. 6 is a flow chart for explaining the commercial article analysis step according to the period shown in FIG.

도 4 및 도 6을 참조하면, 특정 지역이 선택된다(단계 S400). Referring to FIGS. 4 and 6, a specific region is selected (step S400).

이어, 제1 기간 및 제2 기간이 각각 선택된다(단계 S404). 본 실시예에서, 제1 기간은 제2 기간보다 과거로 설정될 수 있다. 예를들어, 제2 기간이 2014년 9월이라면, 제1 기간은 제2013년 9월로 설정될 수 있다. 본 실시예에서, 지역 선택을 위한 정보가 제공된 후 기간 선택을 위한 정보가 제공되는 것을 설명하였으나, 하나의 사용자 인터페이스 화면을 통해 두 종류의 정보를 함께 제공할 수도 있다. Next, the first period and the second period are respectively selected (step S404). In this embodiment, the first period may be set to be earlier than the second period. For example, if the second period is September 2014, the first period may be set to September 2013. Although it has been described in the present embodiment that information for period selection is provided after information for area selection is provided, two types of information may be provided together through one user interface screen.

이어, 상기한 단말기를 통해 제1 기간 및 제2 기간이 상권분석을 위해 확정되는지의 여부가 체크된다(단계 S406).Then, it is checked whether the first period and the second period are confirmed for the commercial analysis through the terminal (step S406).

단계 S406에서 제1 기간 및 제2 기간이 확정된 것으로 체크되면, 선택된 지역과 선택된 제1 기간에 대응하는 형태소의 수가 카운트되어 제1 카운트값(C1)이 생성된다(단계 S408).If it is checked in step S406 that the first period and the second period are determined, the number of morpresent corresponding to the selected region and the selected first period is counted to generate the first count value C1 (step S408).

이어, 선택된 지역과 선택된 제2 기간에 대응하는 형태소의 수가 카운트되어 제2 카운트값(C2)이 생성된다(단계 S410).Next, the number of morphemes corresponding to the selected region and the selected second period is counted, and a second count value C2 is generated (Step S410).

제1 카운트값(C1)과 제2 카운트값(C2)이 서로 비교된다(단계 S412).The first count value C1 and the second count value C2 are compared with each other (step S412).

단계 S412에서 제1 카운트값(C1)이 제2 카운트값(C2)보다 큰 것으로 체크되면, 선택된 지역은 제2 기간 동안 인기상권으로 판별된다(단계 S414). 예를들어, 선택된 지역이 강남역이고, 제1 기간이 2013년 9월이고, 제2 기간이 2014년 9월이라면, 강남역 인근 지역은 2013년 9월에 비해 2014년 9월이 인기상권으로 부상되고 있다고 판별될 수 있다. If it is checked in step S412 that the first count value C1 is larger than the second count value C2, the selected area is determined as a popular commercial area for the second period (step S414). For example, if the selected region is Gangnam Station, the first period is September 2013, and the second period is September 2014, the area near Gangnam Station will rise to popular trade in September 2014 compared to September 2013 .

단계 S412에서 제1 카운트값(C1)이 제2 카운트값(C2)보다 작은 것으로 체크되면, 선택된 지역은 제1 기간 동안 인기 상권으로 판별된다(단계 S416). 예를들어, 선택된 지역이 강남역이고, 제1 기간이 2013년 9월이고, 제2 기간이 2014년 9월이라면, 강남역 인근 지역은 2013년 9월에 비해 2014년 9월이 비인기상권으로 하락되고 있다고 판별될 수 있다.If it is checked in step S412 that the first count value C1 is smaller than the second count value C2, the selected area is determined as a popular commercial area for the first period (step S416). For example, if the selected region is Gangnam Station, the first period is September 2013, and the second period is September 2014, the area near Gangnam Station will be lowered to the non-commercial area in September 2014 compared to September 2013 .

단계 S412에서 제1 카운트값(C1)과 제2 카운트값(C2)이 같은 것으로 체크되면, 선택된 지역은 제1 기간이나 제2 기간 동안, 정체 상권으로 판별된다(단계 S418). 예를들어, 선택된 지역이 강남역이고, 제1 기간이 2013년 9월이고, 제2 기간이 2014년 9월이라면, 강남역 인근 지역은 2013년 9월이나 2014년 9월이나 상권의 흐름 변화가 없는 정체 상권을 유지하고 있다고 판별될 수 있다.If it is checked in step S412 that the first count value C1 and the second count value C2 are the same, the selected area is determined to be a congested area for the first period or the second period (step S418). For example, if the selected region is Gangnam Station, the first period is September 2013, and the second period is September 2014, the area near Gangnam Station will be September 2013 or September 2014, It can be judged that it is maintaining the congestion state.

도 7은 도 4에 도시된 웹 문서 종류별 상권분석 단계를 설명하기 위한 흐름도이다.FIG. 7 is a flow chart for explaining the merchandise analysis step for each web document type shown in FIG.

도 4 및 도 7을 참조하면, 특정 웹 문서가 선택된다(단계 S500). 예를들어, 소셜커머스, 공연, SNS, 블로그, 뉴스 등의 웹 문서에서 어느 하나가 선택될 수 있다. 여기서, 부가적으로 특정 지역이 선택될 수도 있다. Referring to FIGS. 4 and 7, a specific web document is selected (step S500). For example, any one of web documents such as social commerce, performance, SNS, blog, and news may be selected. Here, a specific area may be selected additionally.

이어, 선택된 웹 문서에 대응하는 관심지점 형태소의 수가 카운트된다(단계 S504).Then, the number of points of interest morpheme corresponding to the selected web document is counted (step S504).

이어, 관심지점 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S506). 본 실시예에서, 관심지점 형태소에 반영되는 가중치는 도 5에서 설명되었으므로 그 설명은 생략한다. Then, whether or not the weight is reflected in the morpheme of interest is checked (step S506). In this embodiment, the weights reflected in the point of interest morpheme are described in Fig. 5, and a description thereof will be omitted.

단계 S506에서 가중치 반영으로 체크되면, 관심지점 형태소의 수에 가중치가 반영된다(단계 S508).If weights are checked in step S506, weights are reflected in the number of points of interest morphology (step S508).

단계 S506에서 가중치 미반영으로 체크되거나 단계 S508에 이어, 선택된 웹 문서에 대응하는 위치 형태소의 수가 카운트된다(단계 S510).In step S506, the number of position morphemes corresponding to the selected web document is counted (step S510).

위치 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S512). 본 실시예에서, 위치 형태소에 반영되는 가중치는 도 5에서 설명되었으므로 그 설명은 생략한다. Whether or not the position morpheme reflects the weight is checked (step S512). In this embodiment, the weight values reflected in the position morpheme are described in Fig. 5, and a description thereof will be omitted.

단계 S512에서 위치 형태소의 수에 가중치가 반영된다(단계 S514).In step S512, the weight is reflected in the number of position morpreses (step S514).

단계 S512에서 가중치 미반영으로 체크되거나 단계 S514에 이어, 관심지점 형태소의 수와 위치 형태소의 수가 합산되어 합산치(S)가 생성된다(단계 S516).In step S512, the number of points of interest and the number of position morpreses are summed to generate a summation value S (step S516).

단계 S516에서 생성된 합산치(S)와 기 설정된 기준치(R)가 서로 비교된다(단계 S518).The sum value S generated in step S516 is compared with a preset reference value R (step S518).

단계 S518에서 합산치(S)가 기준치(R)보다 작은 것으로 체크되면, 선택된 웹 문서는 비인기 상권으로 판별된다(단계 S520).If it is checked in step S518 that the sum value S is smaller than the reference value R, the selected web document is determined as an unauthorized merchandise (step S520).

단계 S518에서 합산치(S)가 기준치(R)보다 큰 것으로 체크되면, 선택된 웹 문서는 인기 상권으로 판별된다(단계 S522).If it is checked in step S518 that the sum value S is larger than the reference value R, the selected web document is determined to be a popular commercial article (step S522).

단계 S518에서 합산치(S)가 기준치(R)와 같은 것으로 체크되면, 선택된 웹 문서는 정체 상권으로 판별된다(단계 S524).If it is checked in step S518 that the sum value S is the same as the reference value R, the selected web document is judged to be a still image area (step S524).

도 8은 도 4에 도시된 관심지점 카테고리별 상권분석 단계를 설명하기 위한 흐름도이다. FIG. 8 is a flowchart for explaining a commercial area analysis step for each point of interest category shown in FIG.

도 4 및 도 8을 참조하면, 특정 관심지점 카테고리가 선택된다(단계 S600). 본 실시예에서, 관심지점 카테고리는 대분류, 하나의 대분류에서 분류된 중분류, 하나의 중분류에서 분류된 소분류 등으로 구분되어 정의될 수 있다. 예를들어, 대분류 카테고리는 생활편의, 쇼핑 등으로 구분될 수 있다. 중분류 카테고리는 음식점, 대형 유통점 등으로 구분될 수 있다. 소분류 카테고리는 패스트푸드, 백화점 등으로 구분될 수 있다. Referring to Figs. 4 and 8, a specific point of interest category is selected (step S600). In the present embodiment, the interest point category can be defined by being divided into a large category, a middle category classified in one large category, and a small category classified in one middle category. For example, the major category can be divided into life, shopping, and the like. The middle category can be divided into restaurants and large retail stores. The subcategories can be divided into fast food, department stores, etc.

이어, 선택된 관심지점 카테고리에 대응하는 관심지점 형태소의 수가 카운트된다(단계 S604).Then, the number of points of interest morpresent corresponding to the selected interest point category is counted (step S604).

이어, 관심지점 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S606). 본 실시예에서, 관심지점 형태소에 반영되는 가중치는 도 5에서 설명되었으므로 그 설명은 생략한다. 단계 S606에서 가중치 반영으로 체크되면, 관심지점 형태소의 수에 가중치가 반영된다(단계 S608).Then, whether or not the weight is reflected in the morpheme of interest is checked (step S606). In this embodiment, the weights reflected in the point of interest morpheme are described in Fig. 5, and a description thereof will be omitted. If weights are checked in step S606, the weight is reflected in the number of points of interest morpheme (step S608).

단계 S606에서 가중치 미반영으로 체크되거나 단계 S608에 이어, 선택된 관심지점 카테고리에 대응하는 위치 형태소의 수가 카운트된다(단계 S610).In step S606, the number of position morphemes corresponding to the selected interest point category is counted (step S610).

위치 형태소에 가중치를 반영하는지의 여부가 체크된다(단계 S612). 본 실시예에서, 위치 형태소에 반영되는 가중치는 도 5에서 설명되었으므로 그 설명은 생략한다. 단계 S612에서 위치 형태소의 수에 가중치가 반영된다(단계 S614).Whether or not the position morpheme reflects the weight is checked (step S612). In this embodiment, the weight values reflected in the position morpheme are described in Fig. 5, and a description thereof will be omitted. In step S612, the weight is reflected in the number of position morpreses (step S614).

단계 S612에서 가중치 미반영으로 체크되거나 단계 S614에 이어, 관심지점 형태소의 수와 위치 형태소의 수가 합산된다(단계 S616).In step S612, the number of interest morpres and the number of position morpreses are added (step S616).

합산치(S)와 기준치(R)가 서로 비교된다(단계 S618).The sum value S and the reference value R are compared with each other (step S618).

단계 S618에서 합산치(S)가 기준치(R)보다 작은 것으로 체크되면, 선택된 관심지점 카테고리는 비인기 상권으로 판별된다(단계 S620).If it is checked in step S618 that the sum value S is smaller than the reference value R, the selected interest point category is determined as an unauthorized commercial area (step S620).

단계 S618에서 합산치(S)가 기준치(R)보다 큰 것으로 체크되면, 선택된 관심지점 카테고리는 인기 상권으로 판별된다(단계 S622).If it is checked in step S618 that the sum value S is larger than the reference value R, the selected interest point category is determined as a popular commercial area (step S622).

단계 S618에서 합산치(S)가 기준치(R)와 같은 것으로 체크되면, 선택된 관심지점 카테고리는 정체 상권으로 판별된다(단계 S624).If it is checked in step S618 that the sum value S is the same as the reference value R, the selected interest point category is determined to be a congested area (step S624).

이하에서, 본 실시예에 따른 시간에 따른 상권분석을 보다 상세히 설명한다. 설명의 편의를 위해, 서울시내의 중요 상권이라 할 수 있는 강남구 신사동, 영등포, 명동을 지역으로 설정하고, 웹 문서에서 관심지점 형태소와 위치 형태소를 추출하는 기간 역시 2014년 1월부터 2014년 11월까지 추출하는 것을 그 일례로 설명한다. Hereinafter, the commercial analysis according to the present embodiment will be described in more detail. For convenience of explanation, we set Gangnam-gu, Sinsa-dong, Yeongdeungpo, and Myung-dong, which are important commercial areas in Seoul, as the region, and the period of extracting the interest point morpheme and location morpheme from the web document is from January 2014 to November 2014 Extraction will be described as an example.

표 1은 웹 문서에서 기간별 지역별 추출된 관심지점 형태소와 위치 형태소의 전체 개수를 설명하기 위한 표이다. 도 9는 웹 문서에서 기간별 지역별 추출된 관심지점 형태소와 위치 형태소의 전체 개수를 설명하기 위한 그래프이다. 특히, 표 1에 대응하는 그래프가 도시된다. 본 실시예에서, 웹 문서에서 추출되고 지역별 관심지점(POI) 형태소와 위치 형태소의 전체 개수의 예를 나타낸다. Table 1 is a table for describing the total number of interest point morphemes and position morphemes extracted by region and region in web documents. 9 is a graph for explaining the total number of interest point morphemes and location morphemes extracted by region and region in a web document. In particular, the graph corresponding to Table 1 is shown. In the present embodiment, an example of the total number of local morpheme (POI) morphemes and location morphemes extracted from a web document is shown.

[표 1][Table 1]

표 1 및 도 9를 참조하면, 2014년 1월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 3,000 및 5,000이고, 2014년 2월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 3,100 및 4,500이고, 2014년 3월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 3,000 및 5,000이다.Referring to Table 1 and FIG. 9, in January 2014, the total number of points of interest and location morphemes related to Sinsa-dong, Yeongdeungpo and Myung-dong in the web document were 1,000, 3,000 and 5,000, The total number of morpheme and location morphemes related to Sinsa-dong, Yeongdeungpo, and Myung-dong in the document are 1,000, 3,100, and 4,500, respectively. In March 2014, each of the points of interest related to Sinsa-dong, Yeongdeungpo, And the total number of location morphemes is 1,000, 3,000 and 5,000.

또한, 2014년 4월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 2,900 및 5,200이고, 2014년 5월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 2,800 및 5,200이고, 2014년 6월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소와 위치 형태소의 전체 개수는 1,000, 2,700, 5,100이다. Also, in April of 2014, the total number of interest point morphemes and location morphemes related to Sinsa-dong, Yeongdeungpo and Myung-dong in Web documents were 1,000, 2,900 and 5,200, respectively. In May 2014, The total number of morphemes and location morphemes for each interest point related to Sinsa-dong, Yeongdeungpo, and Myung-dong in the web document in June 2014 is 1,000, 2,800 and 5,200, 1,000, 2,700, 5,100.

이러한 방식으로 2014년 7월 내지 11월 각각에 대응하여 웹 문서에서 각 지역과 관련하는 관심지점 형태소와 위치 형태소의 전체 개수가 정리될 수 있다. In this way, the total number of interest point morphemes and location morphemes related to each region in the web document can be arranged corresponding to each July to November, 2014.

도 9를 참조하면, 웹 문서들에서 매월 각 지역에 대응하여 추출된 관심지점 형태소와 위치 형태소의 전체 개수를 근거로 일명 웹 문서 분석지수를 정의할 수 있다. Referring to FIG. 9, a web document analysis index can be defined based on the total number of interest morphemes and location morphemes extracted corresponding to each region in each month in web documents.

도 9에 도시된 바와 같이, 시간이 경과되더라도 웹 문서에서 명동이 가장 많이 히트되고 있고, 영등포가 중간 정도의 수준으로 히트되고 있으며, 신사동이 가장 적은 수준으로 히트되고 있음을 확인할 수 있다. As shown in FIG. 9, even though the time elapses, it can be confirmed that Myungdong is hit the most in a web document, Yeongdeungpo is hit at a medium level, and the sinsomong is hit to the lowest level.

표 2는 표 1에서 각 지역에 대응하는 관심지점(POI) 형태소의 전체 개수의 예를 설명하기 위한 표이다. 도 10은 각 지역에 대응하는 관심지점(POI) 형태소의 전체 개수의 예를 설명하기 위한 그래프이다. 특히, 표 2에 대응하는 그래프가 도시된다. Table 2 is a table for explaining an example of the total number of points of interest (POI) morphemes corresponding to the respective regions in Table 1. 10 is a graph for explaining an example of the total number of points of interest (POI) morphemes corresponding to each area. In particular, the graph corresponding to Table 2 is shown.

[표 2][Table 2]

표 2 및 도 10을 참조하면, 2014년 1월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 500, 400 및 3,500이고, 2014년 2월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 480, 500 및 3,200이고, 2014년 3월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 520, 600 및 3,500이다.Referring to Table 2 and FIG. 10, in January 2014, the total number of interest point spots related to Sinsa-dong, Yeongdeungpo, and Myong-dong in web documents was 500, 400 and 3,500, and in February 2014, , Yeongdeungpo, and Myungdong are 480, 500, and 3,200, respectively. In March 2014, the total number of morpheme points of interest related to Sinsa-dong, Yeongdeungpo, 600 and 3,500, respectively.

또한, 2014년 4월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 479, 650 및 3,200이고, 2014년 5월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 500, 850 및 3,500이고, 2014년 6월에 웹 문서에서 신사동, 영등포, 명동과 관련하는 각각의 관심지점 형태소의 전체 개수는 530, 900 및 3,600이다.Also, in April of 2014, the total number of morpheme points of interest related to Sinsa-dong, Yeongdeungpo, and Myeong-dong were 479, 650 and 3,200, respectively. In May 2014, web documents related to Sinsa-dong, Yeongdeungpo, The total number of each interest point morpheme is 500, 850, and 3,500, and in June 2014, the total number of interest point morphemes related to Sinsa-dong, Yeongdeungpo, and Myung-dong in web documents is 530, 900, and 3,600.

이러한 방식으로 2014년 7월 내지 11월 각각에 대응하여 웹 문서에서 각 지역과 관련하는 관심지점 형태소의 전체 개수가 정리될 수 있다. In this way, the total number of interest point morphemes related to each region in the web document can be arranged corresponding to each July to November, 2014.

도 10에 나타낸 바와 같이, 웹 문서들에서 매월 각 지역에 대응하여 추출된 관심지점 형태소의 전체 개수를 근거로 일명 지역별 POI 지수를 정의할 수 있다. 도 10에 도시된 바와 같이, 시간이 경과되더라도 웹 문서에서 명동이 가장 많이 히트되고 있고 신사동은 가장 적게 히트되고 있음을 확인할 수 있다. 한편, 시간이 경과됨에 따라 영등포는 점차적으로 증가하는 수준으로 히트되고 있음을 확인할 수 있다. As shown in FIG. 10, it is possible to define a local POI index based on the total number of interest point morphemes extracted corresponding to each region of each month in web documents. As shown in FIG. 10, it can be seen that even though the time elapses, Myongdong is the most hits in the web document and the sinsomong is the least hit. On the other hand, as time elapses, Yeongdeungpo is gradually getting hit.

한편, 상권분석시 인기도와 상점의 전체 개수 뿐 아니라 상권의 면적이나 밀도 역시 중요하다. 즉, 일반적으로 상가는 1층뿐만 아니라 2층 이상 또는 지하층에도 배치되므로 상권의 면적이나 밀도를 상권분석시 고려해야 할 변수이다. On the other hand, in analysis of commercial areas, not only the popularity and the total number of shops but also the area and density of commercial areas are also important. In other words, since commercial districts are located not only on the first floor but also on two or more floors or in the basement, the area and density of the commercial area are variables to be considered when analyzing the commercial area.

특히, 평면적인 분석보다는 입체적인 분석이 상권분석의 신뢰도를 높일 수 있다. 입체적인 상권분석을 위해서는 2차원(이하, 2D) 면적을 3차원(이하, 3D) 면적으로 변환하는 작업이 요구된다. In particular, stereoscopic analysis can enhance the reliability of commercial analysis rather than planar analysis. In order to analyze a stereoscopic commercial article, it is required to convert a two-dimensional (hereinafter, 2D) area into a three-dimensional (3D) area.

표 3은 각 지역의 2D 면적과 3D 면적의 예를 나타낸다. 여기서, 면적은 상대적인 크기 비교가 가능하도록 단순히 숫자로만 나타내었다. 또한, 3D 면적은 해당 지역내에 있는 건물 정보 DB를 이용하여 계산될 수 있다. 표 3에서 AOI는 관심영역(area of interest)의 의미한다. Table 3 shows examples of 2D area and 3D area in each area. Here, the area is simply represented by numbers so that relative size comparison is possible. Also, the 3D area can be calculated using the building information DB in the area. In Table 3, AOI means the area of interest.

[표 3][Table 3]

표 3을 참조하면, 2D 면적에서 신사동이 가장 작고 영등포가 중간이며 명동은 가장 넓은 면적인 반면, 3D 면적에서 신사동이 가장 넓고 명동이 중간이며 영등포가 가장 작은 면적인 것을 확인할 수 있다. 즉, 비록 2D 면적에서 명동이 영등포보다 넓은 면적이다. 하지만, 명동에는 비교적 낮은 높이의 건물들이 존재하는 반면 영등포에는 비교적 높은 높이의 건물들이 존재한다. 따라서, 2D 면적에서 영등포와 명동간의 차이보다는 3D 면적에서 영등포와 명동간의 차이는 좁혀진 것을 확인할 수 있다. As shown in Table 3, it can be seen that Sinsa-dong is the smallest in the 2D area, Yeongdeungpo is the middle, and Myeongdong is the widest area, while the 3D area has the widest sinsa-dong, Myongdong is the middle and Yeongdeungpo is the smallest area. In other words, although the area of 2D is larger than that of Yeongdeungpo, Myungdong. However, there are relatively low - height buildings in Myung - dong, while relatively high - height buildings exist in Yeongdeungpo. Therefore, it can be seen that the difference between Yeongdeungpo and Myongdong is narrowed in the 3D area rather than the difference between Yeongdeungpo and Myongdong in 2D area.

표 3에서 각 지역별 2D 면적이나 3D 면적은 시간의 흐름에 따라 고정될 수도 있고, 변경될 수도 있다. 예를들어, 1열전에 비해 현재 건물이 신축되거나 증축되는 경우 2D 면적이나 3D 면적은 시간의 흐름에 따라 증가할 수 있다. 하지만, 본 실시예에서는 설명의 편의를 위해 시간의 2D 면적이나 3D 면적은 고정된 것으로 예시한다. In Table 3, the 2D area or the 3D area for each region may be fixed or changed over time. For example, if the current building is stretched or expanded relative to the previous one, the 2D area or 3D area may increase over time. However, in the present embodiment, for convenience of explanation, the 2D area or the 3D area of time is exemplified as being fixed.

표 4는 각 지역의 2D 단위 면적당 관심지점(POI)의 밀도의 예를 나타낸다. 도 11은 각 지역의 2D 단위 면적당 관심지점(POI)의 밀도의 예를 설명하기 위한 그래프이다. 특히, 표 4에 대응하는 그래프가 도시된다. 본 실시예에서, 표 2에 나타낸 각 지역별 기간별 관심지점 형태소의 전체 개수를 표 3에 나타낸 각 지역별 2D 면적으로 나눈 값을 2D 단위 면적당 관심지점의 밀도로 정의한다. Table 4 shows examples of densities of points of interest (POIs) per 2D area area in each area. 11 is a graph for explaining an example of the density of the point of interest (POI) per 2D unit area of each area. In particular, the graph corresponding to Table 4 is shown. In this embodiment, the total number of interest point morpheme points for each region shown in Table 2 divided by the 2D area for each region shown in Table 3 is defined as the density of the point of interest per 2D unit area.

[표 4][Table 4]

표 4 및 도 11을 참조하면, 2014년 1월 동안 신사동, 영등포, 명동과 관련하는 각각의 2D 단위 면적당 관심지점의 밀도는 0.5, 0.1333, 0.7이고, 2014년 2월 동안 신사동, 영등포, 명동과 관련하는 각각의 2D 단위 면적당 관심지점의 밀도는 0.48, 0.1666, 0.64이고, 2014년 3월 동안 신사동, 영등포, 명동과 관련하는 각각의 2D 단위 면적당 관심지점의 밀도는 0.52, 0.2, 0.7이다.Referring to Table 4 and FIG. 11, the density of the points of interest per each 2D unit area related to Sinsa-dong, Yeongdeungpo, and Myondong during Jan 2014 is 0.5, 0.1333, and 0.7, The density of the points of interest per each 2D unit area is 0.48, 0.1666, and 0.64, and the density of the points of interest per each 2D unit area related to Sinsa-dong, Yeongdeungpo, and Myondong during March 2014 is 0.52, 0.2, and 0.7.

이러한 방식으로 2014년 4월 내지 11월 각각에 대응하여 각 지역의 2D 단위 면적당 관심지점의 밀도가 계산될 수 있다. In this way, the density of points of interest per 2D unit area of each region can be calculated corresponding to April to November 2014 respectively.

도 11에서 확인할 수 있듯이, 시간의 흐름에 따라, 명동에 대응하는 2D 단위 면적당 관심지점의 밀도나 신사동에 대응하는 2D 단위 면적당 관심지점의 밀도는 변화가 없지만, 영등포에 대응하는 2D 단위 면적당 관심지점의 밀도는 점차적으로 증가하는 것을 확인할 수 있다. 11, there is no change in the density of the point of interest per 2D unit area corresponding to the sphere of Myeongdong or the density of the point of interest per 2D unit area corresponding to the sphinx of the soul. However, Is gradually increased.

2D 단위 면적당 관심지점의 밀도가 낮다는 것은 상가의 밀도가 낮다는 의미이므로 상가의 밀도가 낮을수록 상권 경쟁은 낮다는 것을 유추할 수 있다. 한편, 2D 단위 면적당 관심지점의 밀도가 높다는 것은 상가의 밀도가 높다는 의미이므로 상가의 밀도가 높을수록 상권 경쟁은 치열하다는 것을 유추할 수 있다.Since the density of the point of interest per 2D unit area means that the density of the commercial area is low, it can be deduced that the commercial density is low as the density of the commercial area is low. On the other hand, high density of points of interest per 2D unit area means high densities of shopping malls, so it can be inferred that the higher the density of downtown malls, the more fierce the competition is.

표 5는 지역의 3D 단위 면적당 관심지점(POI)의 밀도의 예를 나타낸다. 도 12는 지역의 3D 단위 면적당 관심지점(POI)의 밀도의 예를 설명하기 위한 그래프이다. 특히 표 5에 대응하는 그래프가 도시된다. 본 실시예에서, 표 2에 나타낸 각 지역별 기간별 관심지점 형태소의 전체 개수를 표 3에 나타낸 각 지역별 3D 면적으로 나눈 값을 3D 단위 면적당 관심지점의 밀도로 정의한다. Table 5 shows an example of the density of the point of interest (POI) per 3D area of the area. FIG. 12 is a graph for explaining an example of the density of the point of interest (POI) per 3D unit area of the area. In particular, a graph corresponding to Table 5 is shown. In this embodiment, the total number of morpheme points of interest for each region shown in Table 2 divided by the 3D area for each region shown in Table 3 is defined as the density of the point of interest per 3D unit area.

[표 5][Table 5]

표 5 및 도 12를 참조하면, 2014년 1월 동안 신사동, 영등포, 명동과 관련하는 각각의 3D 단위 면적당 관심지점의 밀도는 0.05, 0.08, 0.5833이고, 2014년 2월 동안 신사동, 영등포, 명동과 관련하는 각각의 3D 단위 면적당 관심지점의 밀도는 0.048, 0.1, 0.5333이고, 2014년 3월 동안 신사동, 영등포, 명동과 관련하는 각각의 3D 단위 면적당 관심지점의 밀도는 0.052, 0.12, 0.5833이다.Referring to Table 5 and FIG. 12, the densities of the points of interest per each 3D unit area related to Sinsa-dong, Yeongdeungpo, and Myondong during January 2014 are 0.05, 0.08 and 0.5833, and in Sinsa-dong, Yeongdeungpo, The densities of the points of interest per each 3D unit area are 0.048, 0.1 and 0.5333, and the densities of the points of interest per each 3D unit area related to Sinsa-dong, Yeongdeungpo, and Myong-dong in March 2014 are 0.052, 0.12 and 0.5833.

이러한 방식으로 2014년 4월 내지 11월 각각에 대응하여 각 지역의 3D 단위 면적당 관심지점의 밀도가 계산될 수 있다. In this way, the density of points of interest per 3D area of each area can be calculated corresponding to April to November 2014 respectively.

도 12에서 확인할 수 있듯이, 시간이 흐르더라도 명동에 대응하는 3D 단위 면적당 관심지점의 밀도는 신사동에 대응하는 3D 단위 면적당 관심지점의 밀도 보다 항상 높은 값을 갖는 것을 확인할 수 있다. 또한, 시간의 흐름에 따라, 명동에 대응하는 3D 단위 면적당 관심지점의 밀도의 변화나 신사동에 대응하는 3D 단위 면적당 관심지점의 밀도의 변화는 큰 변화가 없다는 것을 확인할 수 있다. As can be seen in FIG. 12, it can be seen that the density of the point of interest per 3D unit area corresponding to the sphere of Myeongdong is always higher than the density of the point of interest per 3D unit area corresponding to the sphincter. It can also be seen that there is no significant change in the density of the point of interest per 3D unit area corresponding to the myeongdong, or the density of the point of interest per 3D unit area corresponding to the sphincter, with the passage of time.

하지만, 시간의 흐름에 따라, 영등포에 대응하는 3D 단위 면적당 관심지점의 밀도 는 그 변화는 점차적으로 증가하는 것을 확인할 수 있다. 특히, 2014년 1월 동안에는 3D 단위 면적당 관심지점의 밀도가 신사동과 영등포가 거의 동일하였으나, 시간이 흘러 2014년 11월 동안에는 3D 단위 면적당 관심지점의 밀도가 증가하여 영등포는 신사동에 비해 절대적 우위에 있는 것을 확인할 수 있다. However, it can be seen that as the time passes, the density of the point of interest per 3D unit area corresponding to Yeongdeungpo gradually increases. In particular, during January 2014, the density of points of interest per unit area of 3D was almost the same for Sinsa-dong and Yeongdeungpo, but over time, the density of points of interest per 3D unit area increased during November 2014, .

3D 단위 면적당 관심지점의 밀도가 낮다는 것은 상가의 밀도가 낮다는 의미이므로 상가의 밀도가 낮을수록 상권 경쟁은 낮다는 것을 유추할 수 있다. 한편, 3D 단위 면적당 관심지점의 밀도가 높다는 것은 상가의 밀도가 높다는 의미이므로 상가의 밀도가 높을수록 상권 경쟁은 치열하다는 것을 유추할 수 있다.The low density of the point of interest per 3D unit area means that the density of the commercial area is low, so it can be deduced that the commercial density is low as the density of the commercial area is low. On the other hand, high density of points of interest per 3D unit area means high densities of shopping malls, so it can be inferred that the higher the densities of shopping malls, the more fierce the competition is.

상술된 표들을 토대로 단순 상권지수, 2D 면적이 반영된 2D 상권지수, 3D면적이 반영된 3D 상권지수 각각이 후술하는 표 6, 표 7, 표 8과 같이 계산될 수 있다. Based on the above-described tables, the simple commercial index, the 2D commercial index reflecting the 2D area, and the 3D commercial index reflecting the 3D area can be calculated as shown in Tables 6, 7 and 8, respectively.

표 6은 단순 상권지수를 설명하기 위한 표이다. 도 13은 단순 상권지수를 설명하기 위한 그래프이다. 특히 표 6에 대응하는 그래프가 도시된다. 본 실시예에서, 표 1에 나타낸 각 지역별 기간별 관심지점 형태소와 위치 형태소의 전체 개수를 표 2에 나타낸 각 지역별 관심지점(POI) 형태소의 전체 개수로 나눈 값을 상권지수로 정의한다. Table 6 is a table for explaining simple commercial indexes. 13 is a graph for explaining the simple commercial exponent index. In particular, a graph corresponding to Table 6 is shown. In the present embodiment, a value obtained by dividing the total number of points of interest morpheme and position morpheme by period for each region shown in Table 1 is divided by the total number of POI morphemes shown in Table 2 as a commercial index.

[표 6][Table 6]

표 6 및 도 13을 참조하면, 2014년 1월 동안 신사동, 영등포, 명동에 대응하는 단순 상권지수 각각은 2, 7.5, 1.428이고, 2014년 2월 동안 신사동, 영등포, 명동에 대응하는 단순 상권지수 각각은 2.083, 6.2, 1.406이고, 2014년 3월 동안 신사동, 영등포, 명동에 대응하는 단순 상권지수 각각은 1.923, 5, 1.428이다.Referring to Table 6 and FIG. 13, the simple commodity indexes corresponding to Sinsa-dong, Yeongdeungpo and Myondong in January 2014 are 2, 7.5 and 1.428, respectively. In February 2014, simple commercial indexes corresponding to Sinsa-dong, Yeongdeungpo, While the simple commercial index corresponding to Sinsa-dong, Yeongdeungpo, and Myeong-dong is 1.923, 5, and 1.428, respectively, in March 2014.

이러한 방식으로 2014년 4월 내지 11월 각각에 대응하여 각 지역의 단순 상권지수들이 계산될 수 있다.In this way, simple trade indexes for each region can be calculated corresponding to April to November 2014 respectively.

도 13에서 확인할 수 있듯이, 전체적으로 명동의 단순 상권지수는 상대적으로 낮으면서 변화가 없다는 것을 확인할 수 있다. 하지만, 신사동의 단순 상권지수는 시간이 흐름에 따라 점차적으로 증가하는 것을 확인할 수 있다. 또한, 영등포의 단순 상권지수는 2014년 1월에는 매우 높지만 시간이 흐름에 따라 급격히 낮아지는 것을 확인할 수 있다. As can be seen in FIG. 13, it can be seen that the simple commercial index of Myondong is relatively low overall, and there is no change. However, we can confirm that the simple commercial index of Sinsa-dong gradually increases with time. In addition, Yeongdeungpo's simple commercial index is very high in January 2014, but it shows that it drops sharply over time.

표 7은 2D 면적이 반영된 2D 상권지수를 설명하기 위한 표이다. 도 14는 2D 면적이 반영된 2D 상권지수를 설명하기 위한 그래프이다. 특히 표 7에 대응하는 그래프가 도시된다. 본 실시예에서, 표 1에 나타낸 각 지역별 기간별 관심지점 형태소와 위치 형태소의 전체 개수를 표 2에 나타낸 각 지역별 관심지점(POI) 형태소의 전체 개수로 나눈 후 그 결과값을 표 4의 2D 단위 면적당 관심지점의 밀도로 나눈 값을 상권지수로 정의한다. Table 7 is a table for explaining the 2D commercial index reflecting the 2D area. FIG. 14 is a graph for explaining the 2D commercial index reflecting the 2D area. In particular, a graph corresponding to Table 7 is shown. In this embodiment, the total number of interest point morpheme and position morpheme for each region shown in Table 1 is divided by the total number of POI morphemes shown in Table 2, The value divided by the density of the point of interest is defined as a trade index.

[표 7][Table 7]

표 7 및 도 14를 참조하면, 2014년 1월 동안 신사동, 영등포, 명동에 대응하는 2D 상권지수 각각은 4, 56.25, 2.0408이고, 2014년 2월 동안 신사동, 영등포, 명동에 대응하는 2D 상권지수 각각은 4.34, 37.2, 2.197이고, 2014년 3월 동안 신사동, 영등포, 명동에 대응하는 2D 상권지수 각각은 3.698, 25, 2.040이다.Referring to Table 7 and FIG. 14, the 2D commercial index corresponding to Sinsa-dong, Yeongdeungpo, and Myeong-dong is 4, 56.25, and 2.0408 in January 2014, and the 2D commercial index corresponding to Sinsa-dong, Yeongdeungpo, Respectively, and the 2D commercial index corresponding to Sinsa-dong, Yeongdeungpo, and Myeong-dong is 3.698, 25, and 2.040 in March 2014, respectively.

이러한 방식으로 2014년 4월 내지 11월 각각에 대응하여 각 지역의 2D 상권지수들이 계산될 수 있다.In this manner, 2D commercial indexes for each region can be calculated corresponding to April to November 2014 respectively.

도 14에서 확인할 수 있듯이, 전체적으로 명동의 2D 상권지수는 상대적으로 낮으면서 변화가 없다는 것을 확인할 수 있다. 하지만, 신사동의 2D 상권지수는 시간이 흐름에 따라 점차적으로 증가하는 것을 확인할 수 있다. 또한, 영등포의 2D 상권지수는 2014년 1월에는 매우 높지만 시간이 흐름에 따라 급격히 낮아지는 것을 확인할 수 있다.As can be seen from FIG. 14, the overall 2D commercial index of Myungdong is relatively low, and it can be confirmed that there is no change. However, it can be seen that the 2D commercial index of Sinsa-dong gradually increases with time. In addition, the 2D commercial index of Yeongdeungpo is very high in January 2014, but it shows that it drops sharply over time.

즉, 2D 밀도가 낮을 수록 2D 상권지수가 증가하고, 2D 밀도가 높을수록 2D 상권지수가 감소하는 것을 확인할 수 있다. 2D 밀도가 낮다는 것은 경쟁 업체의 수가 작다는 것을 의미할 수 있고 이에 따라 하나의 상가가 커버하는 영역이 넓다는 것을 의미할 수 있다. That is, the 2D commercial index increases as the 2D density decreases, and the 2D commercial index decreases as the 2D density increases. A low 2D density may mean that the number of competitors is small, which means that the area covered by one shop is wide.

표 8은 3D 면적이 반영된 3D 상권지수를 설명하기 위한 표이다. 도 15는 3D 면적이 반영된 3D 상권지수를 설명하기 위한 그래프이다. 특히 표 8에 대응하는 그래프가 도시된다. 본 실시예에서, 표 1에 나타낸 각 지역별 기간별 관심지점 형태소와 위치 형태소의 전체 개수를 표 2에 나타낸 각 지역별 관심지점(POI) 형태소의 전체 개수로 나눈 후 그 결과값을 표 5의 3D 단위 면적당 관심지점의 밀도로 나눈 값을 상권지수로 정의한다. Table 8 is a table for explaining 3D commercial index reflecting 3D area. FIG. 15 is a graph for explaining the 3D premium index reflecting the 3D area. In particular, the graph corresponding to Table 8 is shown. In the present embodiment, the total number of interest point morpheme and position morpheme per each region shown in Table 1 is divided by the total number of POI morphemes shown in Table 2, The value divided by the density of the point of interest is defined as a trade index.

[표 8][Table 8]

표 8 및 도 15를 참조하면, 2014년 1월 동안 신사동, 영등포, 명동에 대응하는 3D 상권지수 각각은 40, 93.75, 2.4489이고, 2014년 2월 동안 신사동, 영등포, 명동에 대응하는 3D 상권지수 각각은 43.40, 62, 2.636이고, 2014년 3월 동안 신사동, 영등포, 명동에 대응하는 3D 상권지수 각각은 36.98, 41.66, 2.4489이다.Referring to Table 8 and FIG. 15, 3D commercial indexes corresponding to Sinsa-dong, Yeongdeungpo and Myondong are 40, 93.75 and 2.4489, respectively, during January 2014, and 3D commercial indexes corresponding to Sinsa-dong, Yeongdeungpo, Respectively, and the 3D commercial index corresponding to Sinsa-dong, Yeongdeungpo, and Myeong-dong is 36.98, 41.66, and 2.4489 in March 2014, respectively.

이러한 방식으로 2014년 4월 내지 11월 각각에 대응하여 각 지역의 3D 상권지수들이 계산될 수 있다.In this way, 3D commercial indexes for each region can be calculated corresponding to April to November 2014 respectively.

도 15에서 확인할 수 있듯이, 전체적으로 명동의 3D 상권지수는 다른 두 지역이 비해 매우 낮고 변화 역시 없다는 것을 확인할 수 있다. 한편, 영등포의 3D 상권지수는 2014년 1월에는 매우 높지만 시간이 흐름에 따라 급격히 낮아지는 것을 확인할 수 있다. 그럼에도 불구하고 영등포의 3D 상권지수는 명동의 3D 상권지수보다는 높다는 것을 확인할 수 있다. As can be seen in FIG. 15, the overall 3D commercial index of Myondong is very low compared to the other two regions, and it can be seen that there is no change. On the other hand, Yeongdeungpo 3D commercial index is very high in January 2014, but it shows a rapid decline over time. Nevertheless, the 3D commercial index of Yeongdeungpo is higher than the 3D commercial index of Myongdong.

하지만, 신사동의 3D 상권지수는 2014년 1월부터 2014년 3월까지는 명동의 3D 상권지수보다는 높고 영등포의 3D 상권지수보다는 낮지만, 2014년 4월부터는 명동이나 영등포의 3D 상권지수보다 높다는 것을 확인할 수 있다. 즉, 신사동의 3D 상권지수는 시간이 흐름에 따라 점차적으로 증가하여 2014년 11월에는 명동이나 영등포의 3D 상권지수보다 월등히 높다는 것을 확인할 수 있다. However, the 3D commercial index of Sinsa-dong is higher than the 3D commercial index of Myong-dong and the 3D commercial index of Yeongdeungpo from January 2014 to March 2014, but it is higher than the 3D commercial index of Myong-dong and Yeongdeungpo from April 2014 . In other words, the 3D commercial index of Sinsa-dong gradually increases with time, which is much higher than the 3D commercial index of Myeong-dong and Yeongdeungpo in November 2014.

즉, 3D 밀도가 낮을 수록 3D 상권지수가 증가하고, 3D 밀도가 높을수록 3D 상권지수가 감소하는 것을 확인할 수 있다. 3D 밀도가 낮다는 것은 경쟁 업체의 수가 작다는 것을 의미할 수 있고 이에 따라 하나의 상가가 커버하는 영역이 넓다는 것을 의미할 수 있다.That is, the 3D commercial index increases as the 3D density decreases, and the 3D commercial index decreases as the 3D density increases. A low 3D density may mean that the number of competitors is small, which means that the area covered by a single mall is wide.

상술된 사항들을 간략히 정리하면 아래와 같다. The following are briefly summarized as follows.

도 9를 참조하면, 웹 문서에서 언급이 많으면 해당 지역은 인기 상권으로 보인다. 하지만, 도 13을 참조하면, 시계열상의 추이와 관심지점(POI) 밀도 대비 웹 분석으로 상권의 상대적 가치를 알 수 있다. 즉, 웹 분석 결과가 좋아도 관심지점(POI)을 고려했을 때 영등포나 명동은 오히려 상권이 나빠지는 것을 확인할 수 있다. Referring to FIG. 9, if there is a lot of mention in a web document, the area appears to be a popular commercial area. However, referring to FIG. 13, the relative value of the commercial area can be known by the web analysis on the time series and the POI density. In other words, even if the results of web analysis are good, Yeongdeungpo and Myungdong can be seen to be rather bad when the POI is considered.

또한, 도 14를 참조하면, 지역의 넓이 밀도를 고려하면 또 다른 시각에서 분석이 가능하다. 즉, 도 13에서는 상권이 점점 좋아지는 것으로 보이지만 밀도를 고려하면 특히 신사동의 경우 상권이 아주 많이 좋아지지는 않는 것을 확인할 수 있다. Also, referring to FIG. 14, it is possible to analyze from another viewpoint considering the area density of the area. In other words, although the commercial area seems to be getting better in Fig. 13, considering the density, it can be confirmed that the commercial area does not improve very much especially in the case of Sinsa-dong.

또한, 지역의 건물 층까지 고려한 3D 밀도를 고려하면 다시 또 다른 시각의 분석이 가능하다. 즉, 도 14에서는 별로 좋아 보이지 않는 신사동 상권이 도 15를 확인하면 두드러지게 좋아지고 있음을 확인할 수 있다. 이는 고층 건물이 많은 지역으로서 인구밀도가 높고 상대적으로 사용 가능한 지역이 넓기 때문이다. Also, considering the 3D density considering the local building layer, it is possible to analyze another viewpoint again. That is, in FIG. 14, it can be confirmed that the sinsa-dong commercial area which does not look very good is remarkably improved by checking Fig. This is due to the high density of the high-rise buildings and the relatively wide availability of the area.

이상 설명된 본 발명에 따른 실시예는 다양한 컴퓨터 구성요소를 통하여 실행될 수 있는 프로그램 명령어의 형태로 구현되어 컴퓨터 판독 가능한 기록 매체에 기록될 수 있다. 상기 컴퓨터 판독 가능한 기록 매체는 프로그램 명령어, 데이터 파일, 데이터 구조 등을 단독으로 또는 조합하여 포함할 수 있다. 상기 컴퓨터 판독 가능한 기록 매체에 기록되는 프로그램 명령어는 본 발명을 위하여 특별히 설계되고 구성된 것이거나 컴퓨터 소프트웨어 분야의 당업자에게 공지되어 사용 가능한 것일 수 있다. 컴퓨터 판독 가능한 기록 매체의 예에는, 하드디스크, 플로피 디스크 및 자기 테이프와 같은 자기 매체, CD-ROM 및 DVD와 같은 광기록 매체, 플롭티컬 디스크(floptical disk)와 같은 자기-광 매체(magneto-optical medium), 및 ROM, RAM, 플래시 메모리 등과 같은, 프로그램 명령어를 저장하고 실행하도록 특별히 구성된 하드웨어 장치가 포함된다. 프로그램 명령어의 예에는, 컴파일러에 의하여 만들어지는 것과 같은 기계어 코드뿐만 아니라 인터프리터 등을 사용하여 컴퓨터에 의해서 실행될 수 있는 고급 언어 코드도 포함된다. 하드웨어 장치는 본 발명에 따른 처리를 수행하기 위하여 하나 이상의 소프트웨어부로 변경될 수 있으며, 그 역도 마찬가지이다. The embodiments of the present invention described above can be implemented in the form of program instructions that can be executed through various computer components and recorded in a computer-readable recording medium. The computer-readable recording medium may include program commands, data files, data structures, and the like, alone or in combination. The program instructions recorded on the computer-readable recording medium may be those specifically designed and configured for the present invention or may be those known and used by those skilled in the computer software arts. Examples of computer-readable media include magnetic media such as hard disks, floppy disks and magnetic tape, optical recording media such as CD-ROM and DVD, magneto-optical media such as floptical disks, medium, and hardware devices specifically configured to store and execute program instructions, such as ROM, RAM, flash memory, and the like. Examples of program instructions include machine language code, such as those generated by a compiler, as well as high-level language code that can be executed by a computer using an interpreter or the like. The hardware device may be modified into one or more software components for performing the process according to the present invention, and vice versa.

이상에서는 실시예를 참조하여 설명하였지만, 해당 기술 분야의 숙련된 당업자는 하기의 특허 청구의 범위에 기재된 본 발명의 사상 및 영역으로부터 벗어나지 않는 범위 내에서 본 발명을 다양하게 수정 및 변경시킬 수 있음을 이해할 수 있을 것이다. It will be apparent to those skilled in the art that various modifications and variations can be made in the present invention without departing from the spirit or scope of the invention as defined in the appended claims. You will understand.

이상에서 설명한 바와 같이, 본 발명에 따르면, 지역 정보나 지역 특성을 시간의 흐름으로 파악하여 지역에 대한 상권분석이 필요하다는 점에 착안하여, 소셜 커머스, 공연, SNS, 블로그, 뉴스 등과 같은 실시간 웹 문서를 수집하고 수집된 웹 문서에서 위치 정보 및 관심지점(POI)을 추출하고, 지역 기준으로 분류 및 분석하고, 언급된 횟수를 기반으로 인기 지역을 판별한다. 특히, 추출된 웹 문서의 종류 및 관심지점(POI) 카테고리에 따라 어떤 분야에서 인기가 있는지 판별할 수 있다.As described above, according to the present invention, attention is paid to the fact that regional information and regional characteristics are analyzed as a flow of time, and commercial analysis is required for the region. Thus, real-time web information such as social commerce, performance, SNS, It collects documents and extracts location and point of interest (POI) from the collected web documents, classifies and analyzes them on a regional basis, and determines popular areas based on the number of times mentioned. In particular, it is possible to determine which category is popular depending on the type of the extracted web document and the POI category.

100 : 지역 상권 흐름 분석 시스템 110 : 웹 문서 수집부
120 : 웹 문서 저장 DB 130 : 형태소 분석부
140 : 형태소 분류-저장부 142 : 형태소 추출부
144 : 형태소 분류부 150 : 관심지점 형태소 저장 DB
160 : 위치 형태소 저장 DB 170 : 상권 흐름 판별부
172 : 지역별 상권분석 모듈 174 : 기간별 상권분석 모듈
176 : 웹 문서 종류별 상권분석 모듈
178 : 관심지점 카테고리별 상권분석 모듈
179 : 인터페이스 모듈 200 : POI DB
300 : 단말기100: Local commerce flow analysis system 110: Web document collection section
120: web document storage DB 130: morphological analysis unit
140: morpheme classification-storing section 142: morpheme extracting section
144: morpheme classifier 150: store point of interest morpheme DB
160: location morpheme storage DB 170: merchandise flow discrimination unit
172: Regional trade analysis module 174: Regional trade analysis module
176: Commercial analysis module by web document type
178: Commercial analysis module by category of interest
179: Interface module 200: POI DB
300:

Claims

Collecting a plurality of web documents;
Performing morpheme analysis on the collected web documents;
Classifying and analyzing the analyzed morphemes by region; And
And determining a commercial area based on the number of morphemes stored on the area basis.

2. The system according to claim 1, wherein the morpheme includes a point morpheme corresponding to the information registered in the point of interest (POI) DB and a position morpheme corresponding to the position information not registered in the point of interest DB How to analyze commercial flow.

3. The method of claim 2, wherein classifying and storing the analyzed morphemes by region comprises:
Extracting the interest point morpheme or the position morpheme in the analyzed morphemes; And
And classifying and storing each of the interest point morpheme and the position morpheme on a regional basis.

4. The method according to claim 3, wherein the step of discriminating the commercial area comprises: determining the commercial area based on the number of the interest point morpres and the number of the location morpreses stored on the area basis .

5. The method of claim 4, wherein the step of determining the commercial area comprises:
Analyzing the commercial area according to the region, analyzing the commercial area according to the region, analyzing the commercial area according to the region and the period, analyzing the commercial area according to the type of the web document written in correspondence with the region, And analyzing the commercial according to the branch category.

The method of claim 1, wherein the web document includes a social commerce, a performance, an SNS, a blog, and a news.

A web document collection unit for collecting a plurality of web documents;
A morphological analysis unit for analyzing morphemes in the collected web documents;
A morpheme classifying and storing unit for classifying and analyzing the analyzed morphemes by region; And
And a merchandise flow discrimination unit for discriminating a merchandise area based on the number of morphemes stored on the basis of the area standard.

8. The method according to claim 7, wherein the morpheme includes a point morpheme corresponding to the information registered in the point of interest (POI) DB and a position morpheme corresponding to the position information not registered in the point of interest DB Commercial flow analysis system.

9. The apparatus according to claim 8, wherein the morpheme classification-
A morpheme extracting unit for extracting the morpheme of interest and the morpheme corresponding to the position information in the analyzed morphemes; And
And a morpheme classifying unit for classifying each of the interest point morpheme and the position morpheme on an area basis.

10. The method of claim 9,
A point of interest morpheme storage DB storing an interest point morpheme classified by the morphological classification unit; And
And a location morpheme storage DB for storing location morphemes classified by the morpheme classification unit.

11. The apparatus according to claim 10,
Characterized in that the commercial area is determined by analyzing the flow of time on the basis of the number of interest point morphemes stored in the interest point morpheme storage DB and the number of location morpreses stored in the location morpheme store DB Commercial flow analysis system.

12. The apparatus according to claim 11,
A commercial analysis module for analyzing the commercial area according to the region, a commercial analysis module for analyzing the commercial area in correspondence with the region and the period, a commercial analysis module for the web document classified by the type of the web document written corresponding to the region, And a commercial analysis module for each point of interest that analyzes commercials by branch category.