KR101997111B1

KR101997111B1 - Apparatus for calculating convergence rate according to web page analysis based on artificial intelligence and operating method thereof

Info

Publication number: KR101997111B1
Application number: KR1020180002094A
Authority: KR
Inventors: 김관호; 최영철; 정민우; 곽두희; 고정경; 박선영
Original assignee: 인천대학교 산학협력단
Priority date: 2018-01-08
Filing date: 2018-01-08
Publication date: 2019-07-05

Abstract

The present invention relates to an apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis, and to an operating method thereof. Based on stored industrial fusion indices, the invention determines the industrial fusion indices between first industry groups selected for a first region and second industry groups selected for a second region, determines the average of those indices as a regional fusion index, and transmits the calculated regional fusion index to a manager's terminal. By extracting company information across various industry groups and analyzing it with a separate inter-industry relations (input-output) table, the invention provides companies with varied fusion information, so that their convergence activities can contribute to their goals of developing new technology and improving competitiveness.

Description

The present invention relates to an apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis, and to a method of operating the same.

More specifically, the present invention relates to an apparatus that extracts keyword information of the company operating a web page by analyzing that web page and then calculates a fusion index based on the extracted company keyword information and an industrial fusion index, and to an operating method thereof.

Developing new technologies and creating new markets are essential for strengthening corporate competitiveness. To do both effectively, companies need to combine technologies and ideas from different fields through convergence activities. According to the "Survey on the Status of Convergence Activities in Industrial Sites" conducted jointly in 2015 by the Digital Times and the Korea Industrial Technology Association, more than 80% of domestic companies answered that convergence activities are necessary, which shows that companies themselves recognize the importance of convergence. The most common reason companies gave for needing convergence activities was "development of new technologies and new products" (42.5%), and among companies that actually carried out convergence activities, 56.1% reported increased sales, far more than the 5% that reported a decrease. Convergence activities therefore contribute to the new technology development and improved competitiveness that companies aim for.

At present, the government and private organizations provide financial support for developing convergence products and services as a policy to promote convergence among companies, and run programs that strengthen exchange and cooperation between companies in order to open new markets through convergence. However, the biggest obstacle to convergence is the lack of information on convergence-related markets and technologies, and efforts to resolve this are needed.

To provide such information, the Bank of Korea currently publishes an inter-industry relations (input-output) table that gives numerical measures of the linkage between domestic industry groups. Its drawback is that it does not provide additional information such as convergence between companies or between regions. Existing domestic research includes a study on "a scouting system and recommendation algorithm for excellent small and medium-sized manufacturing companies", but because that work is limited to manufacturing companies, it cannot readily provide convergence information for all industry groups.

Therefore, so that companies' convergence activities can contribute to the new technology development and improved competitiveness they aim for, research is needed on a technique that extracts company information across various industry groups, analyzes varied fusion information using a separate inter-industry relations (input-output) table, and provides the analyzed fusion information to companies.

The apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to the present invention, and its operating method, determine, based on stored industrial fusion indices, the industrial fusion indices between first industry groups selected for a first region and second industry groups selected for a second region, determine the average of the determined industrial fusion indices as a regional fusion index, and transmit the calculated regional fusion index to a manager's terminal. In doing so, the invention aims to extract company information across various industry groups, analyze varied fusion information using a separate inter-industry relations (input-output) table, and provide the analyzed fusion information to companies, so that companies' convergence activities can contribute to the new technology development and improved competitiveness they aim for.

An apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to an embodiment of the present invention includes: a company information storage unit that stores company information in which, for each of a plurality of companies, a company address and at least one of a plurality of preset industry groups are matched; an industrial fusion index storage unit that stores industrial fusion indices between the preset industry groups; a per-industry company count unit that, when a first region name and a second region name are input from a manager's terminal, counts for each of the industry groups, based on the company addresses matched to the company information stored in the company information storage unit, a first per-industry company count for the first region corresponding to the first region name and a second per-industry company count for the second region corresponding to the second region name; an industry group selection unit that selects, based on the first per-industry company count, N first industry groups (N being an integer of 1 or more) in descending order of the number of companies located in the first region, and selects, based on the second per-industry company count, M second industry groups (M being an integer of 1 or more) in descending order of the number of companies located in the second region; a fusion index calculation unit that determines, based on the industrial fusion indices, the industrial fusion indices between the first industry groups selected for the first region and the second industry groups selected for the second region, and determines the average of the determined industrial fusion indices as a regional fusion index; and a fusion index transmission unit that transmits the calculated regional fusion index to the manager's terminal.

In addition, an operating method of the apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to an embodiment of the present invention includes: maintaining a company information storage unit that stores company information in which, for each of a plurality of companies, a company address and at least one of a plurality of preset industry groups are matched; maintaining an industrial fusion index storage unit that stores industrial fusion indices between the preset industry groups; when a first region name and a second region name are input from a manager's terminal, counting for each of the industry groups, based on the company addresses matched to the company information stored in the company information storage unit, a first per-industry company count for the first region corresponding to the first region name and a second per-industry company count for the second region corresponding to the second region name; selecting, based on the first per-industry company count, N first industry groups (N being an integer of 1 or more) in descending order of the number of companies located in the first region, and selecting, based on the second per-industry company count, M second industry groups (M being an integer of 1 or more) in descending order of the number of companies located in the second region; determining, based on the industrial fusion indices, the industrial fusion indices between the first industry groups selected for the first region and the second industry groups selected for the second region, and determining the average of the determined industrial fusion indices as a regional fusion index; and transmitting the calculated regional fusion index to the manager's terminal.
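The counting, selection, and averaging steps above can be sketched as follows. This is a minimal illustration under stated assumptions, not the patented implementation: the record layout, the substring test used to decide whether a company lies in a region, the direction of the index lookup (`fusion_index[first][second]`), and all data values are hypothetical.

```python
from collections import Counter
from itertools import product

def top_industries(companies, region_name, n):
    """Return the n industry groups with the most companies in the region."""
    # Assumption: a company is "in" a region if the region name occurs in its address.
    counts = Counter(c["industry"] for c in companies if region_name in c["address"])
    return [industry for industry, _ in counts.most_common(n)]

def regional_fusion_index(companies, fusion_index, region1, region2, n, m):
    """Average the industrial fusion indices over all selected industry pairs."""
    first = top_industries(companies, region1, n)
    second = top_industries(companies, region2, m)
    pairs = list(product(first, second))
    if not pairs:
        return None  # no companies found in one of the regions
    return sum(fusion_index[a][b] for a, b in pairs) / len(pairs)

# Hypothetical company records and index values, for illustration only.
companies = [
    {"address": "Incheon, Address 1", "industry": "agriculture"},
    {"address": "Incheon, Address 2", "industry": "agriculture"},
    {"address": "Incheon, Address 3", "industry": "food"},
    {"address": "Busan, Address 4", "industry": "food"},
    {"address": "Busan, Address 5", "industry": "food"},
    {"address": "Busan, Address 6", "industry": "culture"},
]
fusion_index = {
    "agriculture": {"agriculture": 0.1, "food": 0.4, "culture": 0.0},
    "food": {"agriculture": 0.4, "food": 0.2, "culture": 0.0},
    "culture": {"agriculture": 0.0, "food": 0.1, "culture": 0.0},
}

result = regional_fusion_index(companies, fusion_index, "Incheon", "Busan", n=2, m=1)
print(result)  # average of index[agriculture][food] and index[food][food]
```

With N = 2 and M = 1, the selected pairs are (agriculture, food) and (food, food), so the result is the mean of 0.4 and 0.2.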

The apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to the present invention, and its operating method, determine, based on stored industrial fusion indices, the industrial fusion indices between first industry groups selected for a first region and second industry groups selected for a second region, determine the average of the determined industrial fusion indices as a regional fusion index, and transmit the calculated regional fusion index to a manager's terminal. The invention can thereby extract company information across various industry groups, analyze varied fusion information using a separate inter-industry relations (input-output) table, and provide the analyzed fusion information to companies, so that companies' convergence activities can contribute to the new technology development and improved competitiveness they aim for.

FIG. 1 is a diagram illustrating the structure of an apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to an embodiment of the present invention.
FIG. 2 is a flowchart illustrating an operating method of the apparatus for calculating a fusion index based on artificial-intelligence-based web page analysis according to an embodiment of the present invention.

Hereinafter, embodiments of the present invention are described in detail with reference to the accompanying drawings. This description is not intended to limit the invention to particular embodiments, and should be understood to include all modifications, equivalents, and substitutes falling within the spirit and technical scope of the invention. In describing the drawings, like reference numerals are used for like elements, and unless defined otherwise, all terms used in this specification, including technical and scientific terms, have the same meaning as commonly understood by a person of ordinary skill in the art to which the invention belongs.

FIG. 1 is a diagram illustrating the structure of a fusion index calculation apparatus 100 based on artificial-intelligence-based web page analysis according to an embodiment of the present invention.

Referring to FIG. 1, the fusion index calculation apparatus 100 based on artificial-intelligence-based web page analysis according to an embodiment of the present invention includes a company information storage unit 110, an industry group word storage unit 120, a company information input unit 130, an industrial fusion index storage unit 140, a per-industry company count unit 150, an industry group selection unit 160, a fusion index calculation unit 170, and a fusion index transmission unit 180.

The company information storage unit 110 stores company information in which, for each of a plurality of companies, a company address and at least one of a plurality of preset industry groups are matched. The company information storage unit 110 may additionally store important words matched to each of the companies.

For example, the company information storage unit 110 may store information as shown in Table 1 below.

[Table 1]

| Company information index | Company address | Industry group | Important words |
|---|---|---|---|
| Company information 1 | Address 1 | Agricultural, forestry and fishery products | Word 1, Word 2, Word 3 |
| Company information 2 | Address 2 | Agricultural, forestry and fishery products | Word 1, Word 3 |
| Company information 3 | Address 3 | Mining products | Word 4, Word 5, Word 6 |
| Company information 4 | Address 4 | Food and beverages | Word 7, Word 8 |
| Company information 5 | Address 5 | Culture and other services | Word 9, Word 10, Word 11 |
| Company information 6 | Address 6 | Culture and other services | Word 9, Word 11, Word 12 |
| ... | ... | ... | ... |

The industry group word storage unit 120 may store a plurality of preset industry group word groups, each containing a plurality of preset industry group words to which different feature vectors are assigned. Here, the industry group words in each group may be words whose mutual similarity, calculated from their feature vectors, is at or above a preset reference similarity.

For example, the industry group word storage unit 120 may store the preset industry group word groups, and the feature vectors assigned to their words, as shown in Table 2 below.

[Table 2]

| Industry group word group index | Industry group word | Feature vector |
|---|---|---|
| Industry group word group 1 | Computer | (1, 2, 0, 0, 3) |
| Industry group word group 1 | Mobile phone | (2, 1, -1, -1, 2) |
| Industry group word group 1 | Laptop | (3, 2, 0, -1, 1) |
| Industry group word group 2 | Mackerel | (-1, 1, 4, 3, 1) |
| Industry group word group 2 | Pacific saury | (0, -1, 3, 6, 2) |
| Industry group word group 2 | Anchovy | (-1, -1, 5, 2, -1) |
| ... | ... | ... |

Here, industry group word group 1 may correspond to electrical and electronic equipment, and industry group word group 2 may correspond to agricultural, forestry and fishery products.

Here, the similarity between vectors can be calculated according to Equation 1 below.

$$S = \frac{\sum_{i} A_i B_i}{\sqrt{\sum_{i} A_i^2}\,\sqrt{\sum_{i} B_i^2}} \qquad \text{(Equation 1)}$$

Here, S is the similarity between feature vectors A and B and takes a value between -1 and 1, where a larger value means the feature vectors are more similar; A_i is the i-th component of feature vector A, and B_i is the i-th component of feature vector B.

For example, the similarity between the feature vectors assigned in Table 2 to the industry group words "computer" and "mobile phone" can be calculated as in Equation 2 below.

$$S = \frac{1\cdot 2 + 2\cdot 1 + 0\cdot(-1) + 0\cdot(-1) + 3\cdot 2}{\sqrt{1^2+2^2+0^2+0^2+3^2}\,\sqrt{2^2+1^2+(-1)^2+(-1)^2+2^2}} = \frac{10}{\sqrt{14}\,\sqrt{11}} \approx 0.8058 \qquad \text{(Equation 2)}$$

Likewise, the similarity between the feature vectors assigned in Table 2 to the industry group words "computer" and "mackerel" can be calculated as in Equation 3 below.

$$S = \frac{1\cdot(-1) + 2\cdot 1 + 0\cdot 4 + 0\cdot 3 + 3\cdot 1}{\sqrt{1^2+2^2+0^2+0^2+3^2}\,\sqrt{(-1)^2+1^2+4^2+3^2+1^2}} = \frac{4}{\sqrt{14}\,\sqrt{28}} \approx 0.2020 \qquad \text{(Equation 3)}$$

Comparing the similarity between the feature vectors of "computer" and "mobile phone" (0.8058) with the similarity between the feature vectors of "computer" and "mackerel" (0.2020) shows that the industry group word "computer" is similar to "mobile phone" but not to "mackerel". Whether two industry group words are similar can be decided against a preset reference similarity (for example, 0.5000).

The industry group words stored in the industry group word storage unit 120, as in Table 2, are words set arbitrarily by the manager, and the feature vectors assigned to them may be values assigned so that a given similarity is computed in accordance with similarity criteria between words set by the manager. These similarity criteria may be based on the results of collecting varied information from the web and analyzing the relationships among industry group words through analysis and learning of that information.
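The similarity values quoted above can be checked numerically. The sketch below assumes that Equation 1 is the standard cosine similarity, which is consistent with the stated range of -1 to 1 and with the reported values; the function and variable names are illustrative only.

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Feature vectors from Table 2.
computer = (1, 2, 0, 0, 3)
mobile_phone = (2, 1, -1, -1, 2)
mackerel = (-1, 1, 4, 3, 1)

REFERENCE_SIMILARITY = 0.5  # the example reference similarity from the text

sim_phone = cosine_similarity(computer, mobile_phone)
sim_mackerel = cosine_similarity(computer, mackerel)

print(f"{sim_phone:.4f}", sim_phone >= REFERENCE_SIMILARITY)        # 0.8058 True
print(f"{sim_mackerel:.4f}", sim_mackerel >= REFERENCE_SIMILARITY)  # 0.2020 False
```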

Based on an input access address for a first company's web page, the company information input unit 130 may access that web page, generate company information for the first company using a plurality of first texts present on the web page, and store it in the company information storage unit, thereby updating the company information stored in the company information storage unit 110. To this end, the company information input unit 130 may include a text extraction unit 131, a word extraction unit 132, a company address determination unit 133, an important word selection unit 134, an industry group determination unit 135, and a company information input unit 136.

When the access address of the first company's web page is input, the text extraction unit 131 may access the first company's web page using that address and extract from it the plurality of first texts present on the web page.

According to an embodiment of the present invention, the text extraction unit 131 may extract the plurality of first texts by parsing the HTML (Hypertext Markup Language) code that makes up the first company's web page and extracting the texts inserted through tags associated with text input. If a hyperlink tag is present in the HTML code, the text extraction unit 131 may also access the sub-page linked through that hyperlink tag and extract, from the sub-page's HTML code, the texts inserted through tags associated with text input, so that these are included in the plurality of first texts.

In other words, the text extraction unit 131 extracts the texts inserted through text-related tags in the HTML code of the first company's web page, and if a hyperlink tag such as "<a href>" is present, it accesses the sub-page linked through that tag and also extracts the texts inserted through text-related tags in the sub-page's HTML code. Together these form the plurality of first texts present on the first company's web page.
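The extraction described above can be sketched with Python's standard `html.parser` module. This is a minimal illustration under stated assumptions, not the patented implementation: treating every text node outside `script` and `style` as "text inserted through a text-related tag", the `fetch` callable that returns a page's HTML, the `max_depth` limit, and the example pages are all hypothetical.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin

class TextAndLinkParser(HTMLParser):
    """Collects text nodes and hyperlink targets from one HTML document."""
    SKIPPED = {"script", "style"}  # assumption: these never carry page text

    def __init__(self):
        super().__init__()
        self.texts = []
        self.links = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIPPED:
            self._skip_depth += 1
        elif tag == "a":
            href = dict(attrs).get("href")
            if href:
                self.links.append(href)

    def handle_endtag(self, tag):
        if tag in self.SKIPPED and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        text = data.strip()
        if text and self._skip_depth == 0:
            self.texts.append(text)

def extract_texts(url, fetch, max_depth=1):
    """Return texts from the page at `url` and from the sub-pages it links to."""
    seen, texts = set(), []

    def visit(page_url, depth):
        if page_url in seen:
            return
        seen.add(page_url)
        parser = TextAndLinkParser()
        parser.feed(fetch(page_url))
        texts.extend(parser.texts)
        if depth < max_depth:
            for href in parser.links:
                visit(urljoin(page_url, href), depth + 1)

    visit(url, 0)
    return texts

# Hypothetical in-memory pages stand in for real HTTP requests.
pages = {
    "http://example.com/": (
        "<html><body><h1>Acme</h1>"
        '<a href="/products">Products</a>'
        "<script>var x = 1;</script></body></html>"
    ),
    "http://example.com/products": "<p>Laptops and mobile phones</p>",
}
texts = extract_texts("http://example.com/", pages.__getitem__)
print(texts)
```

Passing `fetch` as a parameter keeps the sketch runnable offline; a real crawler would substitute an HTTP client and add error handling for unreachable links.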

The word extraction unit 132 may perform morphological analysis on the plurality of first texts to extract a plurality of first words from them.

The company address determination unit 133 may determine the first company's address based on the plurality of first words. For example, if consecutive words among the first words indicate regional information (e.g., "Seoul-si", "Yeongdeungpo-gu", "Yeouido-dong"), the company address determination unit 133 may determine those consecutive words to be the first company's address.
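One way to picture the address step is a scan for runs of consecutive region words. The sketch below is an assumption-laden illustration: the region word list (`REGION_WORDS`) is a hypothetical gazetteer, and choosing the longest run when several exist is a design choice made here, not something the text specifies.

```python
def find_address(words, region_words):
    """Return the longest run of consecutive region words, or None if there is none."""
    best, current = [], []
    for word in words:
        if word in region_words:
            current.append(word)
            if len(current) > len(best):
                best = list(current)
        else:
            current = []  # a non-region word ends the current run
    return " ".join(best) if best else None

# Hypothetical gazetteer and word sequence, for illustration only.
REGION_WORDS = {"Seoul-si", "Yeongdeungpo-gu", "Yeouido-dong"}
words = ["headquarters", "Seoul-si", "Yeongdeungpo-gu", "Yeouido-dong", "telephone", "Seoul-si"]

address = find_address(words, REGION_WORDS)
print(address)  # Seoul-si Yeongdeungpo-gu Yeouido-dong
```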

The important word selection unit 134 may select at least one important word from the plurality of first words based on how frequently each word appears on the first company's web page.

Specifically, the important word selection unit 134 selects at least one important word from the first words based on their frequency of appearance on the first company's web page, but only from words that match the industry group words stored in the industry group word storage unit 120. Accordingly, even if the word "pay" appears most often on the first company's web page, the important word selection unit 134 may not select "pay" as an important word if no industry group word "pay" is stored in the industry group word storage unit 120.
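The frequency-based selection with a vocabulary filter can be sketched as follows. The `top_k` cutoff and the sample data are assumptions for illustration; the text only requires that at least one word be selected and that it match a stored industry group word.

```python
from collections import Counter

def select_important_words(words, industry_words, top_k=3):
    """Pick the most frequent words that also appear in the industry vocabulary."""
    counts = Counter(w for w in words if w in industry_words)
    return [word for word, _ in counts.most_common(top_k)]

# Industry group words as in Table 2; the page words are hypothetical.
INDUSTRY_WORDS = {"computer", "mobile phone", "laptop",
                  "mackerel", "Pacific saury", "anchovy"}
page_words = ["pay"] * 5 + ["mobile phone"] * 3 + ["laptop"] * 2 + ["mackerel"]

important = select_important_words(page_words, INDUSTRY_WORDS)
print(important)  # ['mobile phone', 'laptop', 'mackerel']
```

Although "pay" is the most frequent word, it is filtered out because it is not a stored industry group word.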

When the at least one important word has been selected, the industry group determination unit 135 may refer to the industry group word storage unit to look up the feature vector of each important word, calculate the similarity between that feature vector and the feature vector of each industry group word in each of the industry group word groups, and determine at least one of the industry groups for the first company based on the calculated similarities. Here, the industry group determination unit 135 may calculate the average of the similarities between the important words and the industry group words in each industry group word group, and use that average as the similarity for the important words.

For example, if "mobile phone", "laptop", and "mackerel" are selected as the important words on the first company's web page, the industry group determination unit 135 may look up their feature vectors (that is, (2, 1, -1, -1, 2), (3, 2, 0, -1, 1), and (-1, 1, 4, 3, 1)), calculate the similarity between each of these feature vectors and the feature vector of each industry group word in each industry group word group, and determine at least one of the industry groups for the first company based on the calculated similarities. Here, the industry group determination unit 135 may determine the first company's industry group based on the average of the similarities calculated between the feature vectors of "mobile phone", "laptop", and "mackerel" and those of the words in industry group word group 1, and the average of the similarities calculated between the same feature vectors and those of the words in industry group word group 2.
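The group-level comparison in this example can be sketched with the Table 2 vectors. The sketch assumes cosine similarity for Equation 1 and assumes that the group with the highest average similarity is the one chosen; the text says only that the decision is based on the averages. The group labels follow the earlier statement that group 1 may be electrical and electronic equipment and group 2 agricultural, forestry and fishery products.

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

# Feature vectors and word groups from Table 2.
VECTORS = {
    "computer": (1, 2, 0, 0, 3),
    "mobile phone": (2, 1, -1, -1, 2),
    "laptop": (3, 2, 0, -1, 1),
    "mackerel": (-1, 1, 4, 3, 1),
    "Pacific saury": (0, -1, 3, 6, 2),
    "anchovy": (-1, -1, 5, 2, -1),
}
WORD_GROUPS = {
    "Electrical and electronic equipment": ["computer", "mobile phone", "laptop"],
    "Agricultural, forestry and fishery products": ["mackerel", "Pacific saury", "anchovy"],
}

def average_group_similarity(important_words, group_words):
    """Mean similarity over every (important word, group word) pair."""
    sims = [cosine_similarity(VECTORS[w], VECTORS[g])
            for w in important_words for g in group_words]
    return sum(sims) / len(sims)

important_words = ["mobile phone", "laptop", "mackerel"]
averages = {name: average_group_similarity(important_words, members)
            for name, members in WORD_GROUPS.items()}

# Assumption: the industry group with the highest average is selected.
best_group = max(averages, key=averages.get)
print(best_group)  # Electrical and electronic equipment
```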

The company information input unit 136 may generate company information for the first company by matching the company address determined for the first company with the at least one industry group determined for the first company, and then store it in the company information storage unit.

For example, if the first company's address is Address 7, its industry group is determined to be electrical and electronic equipment, and "mobile phone", "laptop", and "mackerel" are selected as the important words on its web page, the company information input unit 136 may update the company information of Table 1 as shown in Table 3 below.

[Table 3]

| Company information index | Company address | Industry group | Important words |
|---|---|---|---|
| Company information 1 | Address 1 | Agricultural, forestry and fishery products | Word 1, Word 2, Word 3 |
| Company information 2 | Address 2 | Agricultural, forestry and fishery products | Word 1, Word 3 |
| Company information 3 | Address 3 | Mining products | Word 4, Word 5, Word 6 |
| Company information 4 | Address 4 | Food and beverages | Word 7, Word 8 |
| Company information 5 | Address 5 | Culture and other services | Word 9, Word 10, Word 11 |
| Company information 6 | Address 6 | Culture and other services | Word 9, Word 11, Word 12 |
| Company information 7 | Address 7 | Electrical and electronic equipment | Mobile phone, laptop, mackerel |
| ... | ... | ... | ... |

Meanwhile, the company information input unit 136 may decline to store, in the company information, any important word whose similarity to the feature vectors of the industry group words in the industry group word group corresponding to the first company's industry group is at or below the preset reference similarity. Accordingly, of "mobile phone", "laptop", and "mackerel", the word "mackerel" may be left out of the first company's company information.
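The filtering of important words can be sketched in the same way. This is an illustration under assumptions: cosine similarity stands in for Equation 1, and each word's similarity to the group is taken as the average over the group's words, which is one possible reading of the passage above.

```python
import math

def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

# Feature vectors from Table 2.
VECTORS = {
    "computer": (1, 2, 0, 0, 3),
    "mobile phone": (2, 1, -1, -1, 2),
    "laptop": (3, 2, 0, -1, 1),
    "mackerel": (-1, 1, 4, 3, 1),
}
GROUP_1 = ["computer", "mobile phone", "laptop"]  # electrical and electronic equipment
REFERENCE_SIMILARITY = 0.5  # example reference similarity from the text

def keep_important_words(words, group_words, reference):
    """Keep only words whose average similarity to the group exceeds the reference."""
    kept = []
    for word in words:
        sims = [cosine_similarity(VECTORS[word], VECTORS[g]) for g in group_words]
        if sum(sims) / len(sims) > reference:
            kept.append(word)
    return kept

kept = keep_important_words(["mobile phone", "laptop", "mackerel"],
                            GROUP_1, REFERENCE_SIMILARITY)
print(kept)  # ['mobile phone', 'laptop']
```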

Next, the industrial fusion composite index storage unit 140 may store the industrial fusion composite indices between the preset plurality of industry groups. Here, the industrial fusion composite index may be, for example, a value obtained by transposing the matrix of the domestic input coefficient table among the input-output tables published by the Bank of Korea and then normalizing the matrix entries so that each row sums to 1.

For example, the industrial fusion composite index storage unit 140 may store information such as that shown in Table 4 below.

Table 4
 | Agriculture, forestry and fisheries products | Minerals | Food and beverages | ... | Culture and other services | Sum
Agriculture, forestry and fisheries products | 0.130024 | 0.000059 | 0.405923 | ... | 0.003776 | 1
Minerals | 0.002392 | 0.000000 | 0.002877 | ... | 0.010724 | 1
Food and beverages | 0.408163 | 0.000272 | 0.17488 | ... | 0.002731 | 1
... | ... | ... | ... | ... | ... | ...
Culture and other services | 0.002662 | 0.000026 | 0.059479 | ... | 0.033729 | 1

Referring to Table 4, agriculture, forestry and fisheries products has an industrial fusion composite index of 0.130024 with itself, 0.000059 with minerals, 0.405923 with food and beverages, and 0.003776 with culture and other services, and its indices across all industries sum to 1. Likewise, culture and other services has an industrial fusion composite index of 0.002662 with agriculture, forestry and fisheries products, 0.000026 with minerals, 0.059479 with food and beverages, and 0.033729 with itself, and its indices across all industries also sum to 1.
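The transpose-then-normalize step that yields a table like Table 4 can be sketched as below. The 3x3 coefficients are hypothetical toy values, not figures from the Bank of Korea tables, and the handling of an all-zero row is an assumption the text does not address.

```python
def fusion_index_matrix(input_coefficients):
    """Transpose a square matrix, then scale each row so that it sums to 1."""
    transposed = [list(column) for column in zip(*input_coefficients)]
    normalized = []
    for row in transposed:
        total = sum(row)
        # Assumption: a row that sums to zero is left as all zeros.
        normalized.append([value / total if total else 0.0 for value in row])
    return normalized

# Hypothetical input coefficients for three industry groups.
toy_coefficients = [
    [0.10, 0.02, 0.30],
    [0.00, 0.05, 0.01],
    [0.20, 0.03, 0.15],
]
index_matrix = fusion_index_matrix(toy_coefficients)
row_sums = [sum(row) for row in index_matrix]
```

After normalization every row sums to 1, which is the property the "Sum" column of Table 4 records.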

When a first region name and a second region name are input from the manager's terminal 190, the industry-specific company counting unit 150 may count, for each of the plurality of industry groups, a first per-industry company count for the first region corresponding to the first region name and a second per-industry company count for the second region corresponding to the second region name, based on the company addresses matched to the company information stored in the company information storage unit. If the first region name and the second region name input from the manager's terminal 190 are identical, the industry-specific company counting unit 150 may obtain the same values for the first and second per-industry company counts. Here, the terminal data receiving unit 155 may receive the first region name and the second region name from the manager's terminal 190.

For example, if companies 1 to 3, which have company information 1 to 3 respectively, are located in the first region corresponding to the first region name, and companies 4 to 6, which have company information 4 to 6 respectively, are located in the second region corresponding to the second region name, the industry-specific company counting unit 150 may count the first per-industry company count as 2 for agriculture, forestry and fisheries products and 1 for minerals in the first region, and the second per-industry company count as 1 for food and beverages and 2 for culture and other services in the second region. If, however, the first region name and the second region name are identical, the industry-specific company counting unit 150 may count both the first and second per-industry company counts as 2 for agriculture, forestry and fisheries products, 1 for minerals, 1 for food and beverages, and 2 for culture and other services.
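The counting example above can be reproduced with a short sketch. The record layout and the "region" field are hypothetical: the region is assumed to have been derived from the company address already, and each company is assumed to match exactly one industry group.

```python
from collections import Counter

AGRI = "Agriculture, forestry and fisheries products"
MINERALS = "Minerals"
FOOD = "Food and beverages"
CULTURE = "Culture and other services"

# Companies 1 to 6 from Table 3, placed in regions as in the example.
companies = [
    {"name": "Company 1", "region": "first region", "industry": AGRI},
    {"name": "Company 2", "region": "first region", "industry": AGRI},
    {"name": "Company 3", "region": "first region", "industry": MINERALS},
    {"name": "Company 4", "region": "second region", "industry": FOOD},
    {"name": "Company 5", "region": "second region", "industry": CULTURE},
    {"name": "Company 6", "region": "second region", "industry": CULTURE},
]

def count_by_industry(records, region_name):
    """Count companies per industry group among those located in the region."""
    return Counter(r["industry"] for r in records if r["region"] == region_name)

first_counts = count_by_industry(companies, "first region")
second_counts = count_by_industry(companies, "second region")
```

Here `first_counts` holds 2 for agriculture, forestry and fisheries products and 1 for minerals, and `second_counts` holds 1 for food and beverages and 2 for culture and other services.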

The industry group selection unit 160 may select, based on the first per-industry company counts, N first industry groups from among the plurality of industry groups in descending order of the number of companies located in the first region, and may select, based on the second per-industry company counts, M second industry groups from among the plurality of industry groups in descending order of the number of companies located in the second region. N and M are integers of 1 or more.

For example, when N and M are both 1, the industry group selection unit 160 may select agriculture, forestry and fisheries products, whose first per-industry company count is 2, as the first industry group for the first region, and may select culture and other services, whose second per-industry company count is 2, as the second industry group for the second region. If, on the other hand, the first region name and the second region name are identical and N and M are both 2, the industry group selection unit 160 may select agriculture, forestry and fisheries products and culture and other services, each with a company count of 2, as both the first and second industry groups.
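A minimal sketch of the top-N selection follows. The text does not say how ties are broken; this sketch keeps tied industry groups in their original order, which is an assumption.

```python
AGRI = "Agriculture, forestry and fisheries products"
MINERALS = "Minerals"
FOOD = "Food and beverages"
CULTURE = "Culture and other services"

def select_top_industries(counts, n):
    """Return the n industry groups with the most companies, largest first."""
    # sorted() is stable, so groups with equal counts keep their input order.
    ranked = sorted(counts.items(), key=lambda item: item[1], reverse=True)
    return [industry for industry, _ in ranked[:n]]

# Counts from the two-region example, with N = M = 1.
first_groups = select_top_industries({AGRI: 2, MINERALS: 1}, 1)
second_groups = select_top_industries({FOOD: 1, CULTURE: 2}, 1)

# Counts from the identical-region example, with N = M = 2.
same_region_groups = select_top_industries(
    {AGRI: 2, MINERALS: 1, FOOD: 1, CULTURE: 2}, 2
)
```

The three results match the selections described in the example: one group per region in the first case, and the two groups with a count of 2 in the identical-region case.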

The fusion index calculation unit 170 may determine, based on the stored industrial fusion composite indices, the industrial fusion composite indices between the first industry groups selected for the first region and the second industry groups selected for the second region, and may determine the average of the determined indices as the regional fusion index. Specifically, the fusion index calculation unit 170 may determine the industrial fusion composite index of each first industry group with respect to each second industry group, determine the industrial fusion composite index of each second industry group with respect to each first industry group, and then determine the average of all the determined indices as the regional fusion index.

For example, if agriculture, forestry and fisheries products is selected as the first industry group for the first region and culture and other services is selected as the second industry group for the second region, the fusion index calculation unit 170 may determine the index of the first industry group (agriculture, forestry and fisheries products) with respect to the second industry group (culture and other services) as 0.003776, determine the index of the second industry group (culture and other services) with respect to the first industry group (agriculture, forestry and fisheries products) as 0.002662, and then determine the average of these indices (that is, 0.003219) as the regional fusion index between the first region and the second region.

If, however, the first region name and the second region name are identical, so that agriculture, forestry and fisheries products and culture and other services are selected as both the first and second industry groups, the fusion index calculation unit 170 may determine the indices of the first industry group agriculture, forestry and fisheries products with respect to the second industry groups agriculture, forestry and fisheries products and culture and other services as 0.130024 and 0.003776; the indices of the first industry group culture and other services with respect to the same two second industry groups as 0.002662 and 0.033729; the indices of the second industry group agriculture, forestry and fisheries products with respect to the two first industry groups as 0.130024 and 0.003776; and the indices of the second industry group culture and other services with respect to the two first industry groups as 0.002662 and 0.033729. It may then determine the average of these eight indices (that is, 0.0425477) as the regional fusion index of the first region (or the second region).
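Both worked examples can be checked with a short sketch. The nested dictionary is an excerpt of Table 4; the function and variable names are hypothetical.

```python
AGRI = "Agriculture, forestry and fisheries products"
CULTURE = "Culture and other services"

# Excerpt of Table 4: INDEX[a][b] is the index of group a with respect to group b.
INDEX = {
    AGRI: {AGRI: 0.130024, CULTURE: 0.003776},
    CULTURE: {AGRI: 0.002662, CULTURE: 0.033729},
}

def regional_fusion_index(first_groups, second_groups, index=INDEX):
    """Average the indices taken in both directions between the two group lists."""
    values = []
    # Each first industry group with respect to each second industry group.
    for a in first_groups:
        for b in second_groups:
            values.append(index[a][b])
    # Each second industry group with respect to each first industry group.
    for b in second_groups:
        for a in first_groups:
            values.append(index[b][a])
    return sum(values) / len(values)

different_regions = regional_fusion_index([AGRI], [CULTURE])
same_region = regional_fusion_index([AGRI, CULTURE], [AGRI, CULTURE])
```

The first call averages 0.003776 and 0.002662, giving 0.003219. The second averages eight values and gives 0.04254775, which the text reports as 0.0425477.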

Meanwhile, the fusion index calculation unit 170 may determine the industrial fusion composite indices between the first industry groups and the second industry groups, and may determine, as the regional fusion index, the average of those indices excluding the indices determined between third industry groups that appear in both the first industry groups and the second industry groups.

For example, if agriculture, forestry and fisheries products and culture and other services are included in both the first industry groups and the second industry groups, the fusion index calculation unit 170 may determine, as the regional fusion index, the average (that is, 0.003219) of the industrial fusion composite indices excluding the index of agriculture, forestry and fisheries products with respect to itself (that is, 0.130024) and the index of culture and other services with respect to itself (that is, 0.033729).
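The exclusion rule can be sketched as a variant of the averaging step. Following the worked example, this sketch assumes that the excluded indices are exactly those of an industry group with respect to itself; returning `None` when nothing remains to average is also an assumption.

```python
AGRI = "Agriculture, forestry and fisheries products"
CULTURE = "Culture and other services"

# Excerpt of Table 4: INDEX[a][b] is the index of group a with respect to group b.
INDEX = {
    AGRI: {AGRI: 0.130024, CULTURE: 0.003776},
    CULTURE: {AGRI: 0.002662, CULTURE: 0.033729},
}

def regional_fusion_index_excluding_overlap(first_groups, second_groups, index=INDEX):
    """Average indices in both directions, skipping a group paired with itself."""
    values = []
    for a in first_groups:
        for b in second_groups:
            if a != b:  # skip the index of an overlapping group with itself
                values.append(index[a][b])
    for b in second_groups:
        for a in first_groups:
            if a != b:
                values.append(index[b][a])
    # Assumption: no index is defined when every pair is excluded.
    return sum(values) / len(values) if values else None

overlap_index = regional_fusion_index_excluding_overlap(
    [AGRI, CULTURE], [AGRI, CULTURE]
)
```

Only 0.003776 and 0.002662 remain after the exclusion (each counted once per direction), so the result is 0.003219, matching the example.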

The fusion index transmission unit 180 may transmit the calculated regional fusion index to the manager's terminal 190.

By extracting company information across multiple industry groups and then analyzing various kinds of fusion information using a separate input-output table, the fusion index computing device 100 based on artificial-intelligence-based web page analysis described above can provide the analyzed fusion information to companies, so that the companies can carry out fusion activities that contribute to their goals of developing new technology and strengthening competitiveness.

In addition, the fusion index computing device 100 described above may calculate a company fusion index between companies and then transmit the calculated company fusion index to the manager's terminal 190.

When a second company name and a third company name are input from the manager's terminal 190, the fusion index calculation unit 170 may calculate, based on the industrial fusion composite indices, a company fusion index between at least one industry group matched to a second company corresponding to the second company name and at least one industry group matched to a third company corresponding to the third company name. Here, the terminal data receiving unit 155 may receive the second company name and the third company name from the manager's terminal 190.

For example, if the industry group of the second company corresponding to the second company name is minerals and the industry group of the third company corresponding to the third company name is food and beverages, the fusion index calculation unit 170 may calculate a company fusion index of 0.001575 between the industry group matched to the second company and the industry group matched to the third company. Here, the company fusion index may be the average of the index of the second company's industry group with respect to the third company's industry group (0.002877) and the index of the third company's industry group with respect to the second company's industry group (0.000272); that is, (0.002877 + 0.000272) / 2 = 0.0015745, or 0.001575 when rounded to six decimal places.

The fusion index transmission unit 180 may transmit the calculated company fusion index (0.001575) to the manager's terminal 190.
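The company fusion index example can be checked in the same way. The dictionary is an excerpt of Table 4, and the function name is hypothetical; the function accepts lists because a company may be matched to more than one industry group.

```python
MINERALS = "Minerals"
FOOD = "Food and beverages"

# Excerpt of Table 4: INDEX[a][b] is the index of group a with respect to group b.
INDEX = {
    MINERALS: {MINERALS: 0.000000, FOOD: 0.002877},
    FOOD: {MINERALS: 0.000272, FOOD: 0.17488},
}

def company_fusion_index(groups_second, groups_third, index=INDEX):
    """Average the indices in both directions between two companies' industry groups."""
    values = []
    for a in groups_second:
        for b in groups_third:
            values.append(index[a][b])  # second company's group toward third's
            values.append(index[b][a])  # third company's group toward second's
    return sum(values) / len(values)

company_index = company_fusion_index([MINERALS], [FOOD])
```

The result is the average of 0.002877 and 0.000272, approximately 0.0015745, which the text reports as 0.001575.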

FIG. 2 is a flowchart illustrating a method of automatically extracting keyword information based on web page analysis.

Referring to FIG. 2, in step S210, a company information storage unit may be maintained that stores, for each of a plurality of companies, company information in which a company address and at least one of a plurality of preset industry groups are matched.

In step S220, an industrial fusion composite index storage unit may be maintained that stores the industrial fusion composite indices between the preset plurality of industry groups.

In step S230, when a first region name and a second region name are input from the manager's terminal, a first per-industry company count for the first region corresponding to the first region name and a second per-industry company count for the second region corresponding to the second region name may be counted for each of the plurality of industry groups, based on the company addresses matched to the company information stored in the company information storage unit.

In step S240, N first industry groups may be selected from among the plurality of industry groups, based on the first per-industry company counts, in descending order of the number of companies located in the first region, and M second industry groups may be selected from among the plurality of industry groups, based on the second per-industry company counts, in descending order of the number of companies located in the second region.

In step S250, the industrial fusion composite indices between the first industry groups selected for the first region and the second industry groups selected for the second region may be determined based on the stored industrial fusion composite indices, and the average of the determined indices may be determined as the regional fusion index.

In step S250, the industrial fusion composite indices between the first industry groups and the second industry groups may be determined, and the average of those indices, excluding the indices determined between third industry groups that appear in both the first industry groups and the second industry groups, may be determined as the regional fusion index.

In step S260, the calculated regional fusion index may be transmitted to the manager's terminal.

The method of automatically extracting keyword information based on web page analysis according to an embodiment of the present invention, as described above, may further perform the steps of: maintaining an industry group word storage unit that stores a plurality of preset industry group word groups, each including a plurality of preset industry group words to which mutually different feature vectors are assigned; when an access address for a web page of a first company is input, accessing the web page of the first company based on the access address and extracting from it a plurality of first texts present on the web page; performing morphological analysis on the plurality of first texts to extract a plurality of first words from them; determining a company address of the first company based on the plurality of first words; selecting at least one important word from among the plurality of first words based on how frequently each first word appears on the web page of the first company; when the at least one important word is selected, referring to the industry group word storage unit to identify the feature vector for the at least one important word, calculating the similarity between that feature vector and the feature vector of each of the industry group words included in each of the industry group word groups, and determining at least one of the plurality of industry groups for the first company based on the calculated similarity; and generating company information for the first company by matching the determined company address and the determined at least one industry group to the first company, and storing the company information in the company information storage unit.

Also, in the method according to an embodiment of the present invention described above, the step of maintaining the company information storage unit may maintain a company information storage unit that stores company information in which important words are additionally matched for each of the plurality of companies, and the company information input step may generate company information for the first company in which the at least one important word is additionally matched, and store it in the company information storage unit.

Also, the method according to an embodiment of the present invention described above may further perform the steps of: when a second company name and a third company name are input from the manager's terminal, calculating, based on the industrial fusion composite indices, a company fusion index between at least one industry group matched to a second company corresponding to the second company name and at least one industry group matched to a third company corresponding to the third company name; and transmitting the calculated company fusion index to the manager's terminal.

The method of operating the fusion index computing device according to an embodiment of the present invention has been described above with reference to FIG. 2. Because this method may correspond to the configuration and operation of the fusion index computing device 100 described with reference to FIG. 1, a more detailed description is omitted.

The method of operating the fusion index computing device according to an embodiment of the present invention may be implemented as a computer program stored in a storage medium so as to be executed in combination with a computer.

The method of operating the fusion index computing device according to an embodiment of the present invention may also be implemented in the form of program instructions executable by various computer means and recorded on a computer-readable medium. The computer-readable medium may include program instructions, data files, data structures, and the like, alone or in combination. The program instructions recorded on the medium may be specially designed and configured for the present invention, or may be known and available to those skilled in computer software. Examples of computer-readable recording media include magnetic media such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROMs and DVDs; magneto-optical media such as floptical disks; and hardware devices specially configured to store and execute program instructions, such as ROM, RAM, and flash memory. Examples of program instructions include not only machine code such as that produced by a compiler, but also high-level language code that can be executed by a computer using an interpreter or the like.

Although the present invention has been described above with reference to specific details such as particular components, and to limited embodiments and drawings, these are provided only to aid a more general understanding of the invention. The present invention is not limited to the above embodiments, and those of ordinary skill in the art to which the invention pertains can make various modifications and variations from this description.

Accordingly, the spirit of the present invention should not be confined to the described embodiments, and not only the claims set out below but also everything equal or equivalent to those claims falls within the scope of the spirit of the present invention.

Claims

A fusion index computing device comprising:
a company information storage unit that stores, for each of a plurality of companies, company information in which a company address and at least one of a plurality of preset industry groups are matched;
an industrial fusion composite index storage unit that stores industrial fusion composite indices between the preset plurality of industry groups, wherein the industrial fusion composite index is a value calculated by transposing the matrix of the domestic input coefficient table among the input-output tables published by the Bank of Korea and then normalizing the values of the matrix entries so that each row sums to 1;
an industry-specific company counting unit that, when a first region name and a second region name are input from a manager's terminal, counts, for each of the plurality of industry groups, a first per-industry company count for a first region corresponding to the first region name and a second per-industry company count for a second region corresponding to the second region name, based on the company addresses matched to the company information stored in the company information storage unit;
an industry group selection unit that selects, based on the first per-industry company counts, N first industry groups (N being an integer of 1 or more) from among the plurality of industry groups in descending order of the number of companies located in the first region, and selects, based on the second per-industry company counts, M second industry groups (M being an integer of 1 or more) from among the plurality of industry groups in descending order of the number of companies located in the second region;
a fusion index calculation unit that determines, based on the industrial fusion composite indices between the plurality of industry groups stored in the industrial fusion composite index storage unit, the industrial fusion composite indices between the first industry groups selected for the first region and the second industry groups selected for the second region, and determines the average of the determined indices as a regional fusion index between the first region and the second region; and
a fusion index transmission unit that transmits the determined regional fusion index to the manager's terminal,
wherein the fusion index calculation unit determines the industrial fusion composite indices between the first industry groups and the second industry groups, and determines, as the regional fusion index between the first region and the second region, the average of those indices excluding the indices determined between third industry groups that appear in both the first industry groups and the second industry groups,
wherein, when a second company name and a third company name are input from the manager's terminal, the fusion index calculation unit further determines, based on the industrial fusion composite indices between the plurality of industry groups stored in the industrial fusion composite index storage unit, the industrial fusion composite indices between at least one industry group matched to a second company corresponding to the second company name and at least one industry group matched to a third company corresponding to the third company name, and determines the average of the determined indices as a company fusion index between the second company and the third company, and
wherein the fusion index transmission unit transmits the determined company fusion index to the manager's terminal.

(Deleted)

The fusion index computing device according to claim 1, further comprising:
An industry group word storage unit storing a plurality of industry group word groups, each including a plurality of predetermined industry group words to which different feature vectors are assigned, wherein the similarity between the feature vectors of the industry group words included in each of the plurality of industry group word groups is equal to or greater than a predetermined reference similarity;
A text extraction unit for accessing, when a connection address for a web page of a first company is input, the web page of the first company based on the connection address, and extracting a plurality of first texts from the web page of the first company;
A word extraction unit for performing morphological analysis on the plurality of first texts to extract a plurality of first words from the plurality of first texts;
An enterprise address determination unit for determining an enterprise address of the first company based on the plurality of first words;
An important word selection unit for selecting at least one important word from among the plurality of first words based on the frequency with which each of the plurality of first words appears on the web page of the first company;
An industry group determination unit for, when the at least one important word is selected, identifying the feature vector of the at least one important word by referring to the industry group word storage unit, calculating the similarity between the feature vector of the at least one important word and the feature vector of each of the industry group words included in each of the plurality of industry group word groups, and determining at least one industry group among the plurality of industry groups for the first company based on the calculated similarity; and
An enterprise information input unit for generating the enterprise information for the first company by matching the determined enterprise address and the determined at least one industry group to the first company.
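
The frequency-based selection of important words can be sketched as follows. This is an illustrative sketch only: a regular-expression tokenizer stands in for the morphological analysis recited in the claim, and the names `extract_words`, `select_important_words`, `top_k`, and `stopwords` are hypothetical rather than taken from the patent.

```python
import re
from collections import Counter


def extract_words(texts):
    """Split each text into lowercase word tokens (stand-in for morphological analysis)."""
    words = []
    for text in texts:
        words.extend(re.findall(r"\w+", text.lower()))
    return words


def select_important_words(words, top_k=3, stopwords=frozenset()):
    """Return the top_k most frequent words, ignoring the given stopwords."""
    counts = Counter(w for w in words if w not in stopwords)
    return [word for word, _ in counts.most_common(top_k)]


# Hypothetical texts extracted from a company web page.
texts = ["Battery cells and battery packs", "Battery recycling for packs"]
words = extract_words(texts)
important = select_important_words(words, top_k=2, stopwords={"and", "for"})
```

Here `important` holds the two words that appear most often on the sample page. A real system for Korean web pages would likely replace the tokenizer with a dedicated morphological analyzer.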

The fusion index computing device of claim 3,
Wherein the enterprise information storage unit stores enterprise information in which important words are further matched to each of the plurality of companies, and
Wherein the enterprise information input unit generates the enterprise information for the first company with the at least one important word further matched to the first company, and stores the enterprise information in the enterprise information storage unit.

delete

A method of operating a fusion index computing device, the method comprising:
Maintaining an enterprise information storage unit that stores enterprise information for each of a plurality of companies, in which at least one industry group among a predetermined plurality of industry groups and an enterprise address are matched to each company;
Maintaining an industrial fusion composite index storage unit that stores the industrial fusion composite indices between the predetermined plurality of industry groups, each industrial fusion composite index being a value obtained by transposing the matrix of the Korean input coefficient table, among the inter-industry relation tables issued by the Bank of Korea, and normalizing the resulting values;
When a first region name and a second region name are input from a terminal of a manager, counting, for each of the plurality of industry groups and based on the enterprise addresses matched in the enterprise information stored in the enterprise information storage unit, a first number of companies located in a first region according to the first region name and a second number of companies located in a second region according to the second region name;
Selecting, based on the first number of companies, N first industry groups (N being an integer of 1 or more) from among the plurality of industry groups in descending order of the number of companies located in the first region, and selecting, based on the second number of companies, M second industry groups (M being an integer of 1 or more) from among the plurality of industry groups in descending order of the number of companies located in the second region;
Determining, based on the industrial fusion composite indices between the plurality of industry groups stored in the industrial fusion composite index storage unit, the industrial fusion composite indices between the first industry groups selected for the first region and the second industry groups selected for the second region, and determining the average value of the determined industrial fusion composite indices as a regional fusion index between the first region and the second region;
Transmitting the determined regional fusion index to the terminal of the manager;
When a second company name and a third company name are input from the terminal of the manager, further determining, based on the stored industrial fusion composite indices, the industrial fusion composite indices between at least one industry group matched to a second company according to the second company name and at least one industry group matched to a third company according to the third company name, and determining the average value of the determined industrial fusion composite indices as an enterprise fusion index between the second company and the third company; and
Transmitting the determined enterprise fusion index to the terminal of the manager,
Wherein the step of determining the regional fusion index comprises determining the industrial fusion composite indices between the first industry groups and the second industry groups, and determining the average value of the industrial fusion composite indices, excluding the composite index, as the regional fusion index between the first region and the second region.
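
The counting, selection, and averaging steps of the regional fusion index can be sketched in Python as below. Everything named here is an assumption for illustration: the record layout with `address` and `industry_groups` keys, the substring test used to decide whether a company is located in a region, and the `index_table` mapping. The exclusion recited in the final clause of the claim is not modeled, because the translated wording does not make clear which composite index is excluded.

```python
from collections import Counter
from itertools import product
from statistics import mean


def top_industry_groups(companies, region, n):
    """Count companies per industry group in a region and return the n largest groups."""
    counts = Counter(
        group
        for company in companies
        if region in company["address"]  # assumed region test: substring of the address
        for group in company["industry_groups"]
    )
    return [group for group, _ in counts.most_common(n)]


def regional_fusion_index(companies, region_a, region_b, n, m, index_table):
    """Average the composite index over all pairs of selected industry groups."""
    first = top_industry_groups(companies, region_a, n)
    second = top_industry_groups(companies, region_b, m)
    return mean(index_table[(a, b)] for a, b in product(first, second))


# Hypothetical enterprise information records.
companies = [
    {"address": "Incheon Yeonsu-gu", "industry_groups": ["electronics"]},
    {"address": "Incheon Namdong-gu", "industry_groups": ["electronics", "chemicals"]},
    {"address": "Seoul Gangnam-gu", "industry_groups": ["software"]},
    {"address": "Seoul Mapo-gu", "industry_groups": ["software", "finance"]},
    {"address": "Seoul Jung-gu", "industry_groups": ["finance"]},
]
# Hypothetical composite indices between industry groups.
index_table = {
    ("electronics", "software"): 0.5,
    ("electronics", "finance"): 0.25,
}

first_groups = top_industry_groups(companies, "Incheon", 1)
index = regional_fusion_index(companies, "Incheon", "Seoul", 1, 2, index_table)
```

In this sample, one industry group is selected for the first region (N = 1) and two for the second (M = 2), so the regional fusion index is the average of two stored composite indices.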

delete

The method according to claim 6, further comprising:
Maintaining an industry group word storage unit that stores a plurality of industry group word groups, each including a plurality of predetermined industry group words to which different feature vectors are assigned, wherein the similarity between the feature vectors of the industry group words included in each of the plurality of industry group word groups is equal to or greater than a predetermined reference similarity;
When a connection address for a web page of a first company is input, accessing the web page of the first company based on the connection address and extracting a plurality of first texts from the web page of the first company;
Performing morphological analysis on the plurality of first texts to extract a plurality of first words from the plurality of first texts;
Determining an enterprise address of the first company based on the plurality of first words;
Selecting at least one important word from among the plurality of first words based on the frequency with which each of the plurality of first words appears on the web page of the first company;
When the at least one important word is selected, identifying the feature vector of the at least one important word by referring to the industry group word storage unit, calculating the similarity between the feature vector of the at least one important word and the feature vector of each of the industry group words included in each of the plurality of industry group word groups, and determining at least one industry group among the plurality of industry groups for the first company based on the calculated similarity; and
Generating enterprise information for the first company by matching the determined enterprise address and the determined at least one industry group to the first company, and storing the generated enterprise information in the enterprise information storage unit.
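
The similarity-based determination of an industry group can be sketched as follows. The claim states only that a similarity between feature vectors is calculated; the use of cosine similarity, the averaging of pairwise similarities per group, the `top_k` cut-off, and the two-dimensional sample vectors are all assumptions chosen for illustration.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def match_industry_groups(important_vectors, group_vectors, top_k=1):
    """Rank industry groups by mean similarity to the important-word vectors.

    Assumes important_vectors and every group's vector list are non-empty.
    """
    scores = {}
    for group, vectors in group_vectors.items():
        sims = [cosine(w, g) for w in important_vectors for g in vectors]
        scores[group] = sum(sims) / len(sims)
    ranked = sorted(scores, key=scores.get, reverse=True)
    return ranked[:top_k]


# Hypothetical feature vectors for one important word and two industry group word groups.
important_vectors = [[1.0, 0.0]]
group_vectors = {
    "electronics": [[1.0, 0.0], [0.8, 0.6]],
    "finance": [[0.0, 1.0]],
}
matched = match_industry_groups(important_vectors, group_vectors)
```

A threshold on the score could be used instead of a fixed `top_k` when a company should be matched to every sufficiently similar industry group.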

The method of claim 8,
Wherein the step of maintaining the enterprise information storage unit comprises maintaining the enterprise information storage unit storing enterprise information in which important words are further matched to each of the plurality of companies, and
Wherein the step of storing in the enterprise information storage unit comprises generating the enterprise information for the first company with the at least one important word further matched to the first company, and storing the enterprise information in the enterprise information storage unit.

delete

A computer-readable recording medium recording a program that causes a computer to perform the method of any one of claims 6, 8, and 9.

A computer program stored in a storage medium for executing, in combination with a computer, the method of any one of claims 6, 8, and 9.