KR20120052144A

KR20120052144A - System and method on navigating niche technological areas using multi dimensional frame

Info

Publication number: KR20120052144A
Application number: KR1020110016472A
Authority: KR
Inventors: 강민수; 구자철; 김철영
Original assignee: (주)광개토연구소; 강민수
Priority date: 2010-11-15
Filing date: 2011-02-24
Publication date: 2012-05-23
Also published as: KR101216116B1; KR101189068B1; KR20120052147A; KR20120052146A; KR101282717B1; KR20120052145A

Abstract

PURPOSE: A system and a method for processing data in a patent information system capable of providing a searching service of a niche technological area using a multi dimensional frame are provided to easily produce a search service of a niche technical area by forming an n dimensional frame based on category information. CONSTITUTION: A cross-domain frame selecting unit(5111) receives selection information for a first patent classification combination using first patent classification and a second classification combination using second classification from a user computer. A cross-domain combination generating unit(5112) uses the first patent classification combination and the second classification combination and constitutes a cross-domain combination having cross-domains. A niche analysis information generating unit(5120) generates niche analysis information into a cross-domain value based on a cross-domain document combination corresponding to a cross-domain.

Description

Information processing method and system of patent information system that provides niche technology area search service using n-dimensional frame {System and Method on Navigating Niche Technological Areas Using Multi Dimensional Frame}

본 발명은 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법 및 그 시스템에 관한 것으로 더욱 자세하게는 특허 분류 등과 같은 분류 정보로 n차원 프레임을 형성하여 용이한 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법 및 그 시스템에 관한 것이다.
The present invention relates to an information processing method and a system of a patent information system that provides a niche technology area search service. More particularly, the present invention provides an easy nich technology area search service by forming an n-dimensional frame with classification information such as patent classification. An information processing method of a patent information system and a system thereof.

특허 정보는 기술 정보, 권리 정보 및 경영 정보의 성격을 가지며, 글로벌한 국제 경쟁에서 그 중요성이 증가 되고 있다. 기술 정보로서 기술 개발 동향, 개별 특허에 적용된 기술적 아이디어를 알 수 있고, 권리 정보로서 개별 특허의 권리 범위, 국내외 권리화 정도를 파악할 수 있으며, 경영 정보로서 경쟁 기업의 기술 개발 동향 등을 파악할 수 있다.Patent information has the characteristics of technical information, rights information and management information, and its importance is increasing in the global international competition. As the technical information, it is possible to know the trend of technology development and technical ideas applied to individual patents, and as the right information, it is possible to grasp the scope of rights of individual patents and the degree of nationalization of domestic and foreign rights.

특허 정보를 입수할 수 있는 방법은 다양하나, 크게 1) 각국 특허청이 제공해 주는 특허 정보 시스템을 이용하는 것과 2) 민간 기업이 개발한 특허 정보 시스템을 이용하는 것, 및 3) 개별 기업이 자사의 목적에 맞게 구축된 특허 정보 시스템을 이용하는 것 등으로 대별할 수 있다. 1)의 방법의 대표적인 예가 대한민국 특허청(www.kipo.go.kr) 및 대한민국 특허정보원(www.kipris.or.kr)이 제공하는 특허 정보 시스템이 있으며, 2)의 대표적인 예로 미국의 www.delphion.com, 일본의 www.patolis.co.jp, 대한민국의 www.wips.co.kr, www.wisdomain.com 등 있다.There are many ways to obtain patent information, but there are largely 1) the use of patent information system provided by each country's patent office, 2) the use of patent information system developed by private companies, and 3) the individual company's purpose. It can be classified into using a patent information system that is properly constructed. A representative example of the method of 1) is a patent information system provided by the Korean Intellectual Property Office (www.kipo.go.kr) and the Korean Patent Information Service (www.kipris.or.kr), and 2) a representative example of the US www.delphion .com, www.patolis.co.jp in Japan, www.wips.co.kr in Korea and www.wisdomain.com.

www.delphion.com 의 운영회사인 톰슨사이언티픽과 같은 각 국가의 민간 기업들은 특허 정보 데이터베이스를 구축해 놓고, 특허 검색 엔진을 통하여 입력된 검색식에 대해 다양한 방식으로 검색 결과를 제공해 주고 있다. 또한, 이들 특허 정보를 기반으로 한 다양한 종류의 분석 소프트웨어가 개발되어 있으며, www.micropatent.com의 운영사(톰슨사이언티픽에 인수 합병되었음)가 개발한 상표명 AUREKA로 보급되는 소프트웨어가 유명하다. 그리고, 세계 각국의 기술 선진 대기업들은 내부에 특허 검색 및 특허 관리 시스템을 구축하여 운영하고 있는 것으로 알려져 있으나, 외부에서 그 기업들의 특허 검색 및 특허 관리 시스템에 대해 접근하기가 어렵다.Private companies in each country, such as Thomson Scientific, the operator of www.delphion.com, have established a database of patent information and provide search results in a variety of ways for search expressions entered through a patent search engine. In addition, various types of analysis software have been developed based on this patent information, and the software distributed under the trade name AUREKA developed by the operator of www.micropatent.com (acquired and acquired by Thomson Scientific) is famous. And, it is known that the technology giants around the world have built and operated patent retrieval and patent management systems inside, but it is difficult to access their patent retrieval and patent management systems from outside.

이러한 특허 검색 및 특허 관리 시스템을 구축하기 위해서는 수십억원 이상의 비용이 지출되는 것이 통상이어서, 국내외 대부분의 중견 및 중소 벤처 기업들은 자체 특허 검색 및 특허 관리 시스템을 구축하고 있지 못하고 있다. 이들 중견 및 중소 벤처 기업들은 특허 전담 조직을 두고 있는 경우도 다수 있으며, 이들이 자사의 특허에 대한 관리 및 자사를 위한 특허 정보를 수집하고 있다. 하지만, 기업 규모가 작거나, 특허에 대한 의식이 크지 않은 기업들은 이러한 특허 전담 조직을 갖추고 있지 못하는 경우가 많다. 이들 기업들은 특허법률사무소나 로펌 등에 의뢰하여 자신의 특허를 관리하고 있는 수준에 거치고 있다.It is common to spend more than billions of dollars to construct such a patent search and patent management system, and most domestic and foreign mid-sized and small venture companies do not have their own patent search and patent management system. Many of these mid- to small- and mid-sized startups have a patent-only organization, which manages their patents and collects patent information for them. However, companies that are small in size or do not have high awareness of patents often do not have a dedicated patent organization. These companies are in charge of managing their patents by requesting patent law firms or law firms.

이들 특허 전담 조직은 www.delphion.com 등과 같은 외부에서 접근 가능한 유무료 온라인 특허 정보 서비스 업체들을 이용하여, 이들이 제공해 주는 플랫폼으로 특허 정보를 수집, 관리하고 있다. 그리고, 기업 명의로 된 특허 출원의 대부분은 그 기업 소속 발명자들에 의해서 이루어 지며, 그 발명자는 자신의 발명과 관련된 국내외의 특허 정보를 수집하는 것이 전세계적인 추세가 된다. 그 이유가 특허 공개 기간의 존재에도 불구하고 전세계의 신기술 정보는 70% 이상이 특허 정보의 형태로 공개되며, 특허 정보가 논문 정보 등에 비해서 상업적으로 이용가능한 아이디어가 많기 때문으로 풀이된다. 하지만, 이들 발명자들의 상당 부분은 특허 검색에 익숙해 있지 않거나, 검색에 익숙한 자라 하더라도 www.delphion.com 등과 같은 외부 특허 정보 서비스 제공업체에 접속하여 특허 정보를 입수 받고 있다. These patent organizations use externally accessible free online patent information service companies such as www.delphion.com to collect and manage patent information on the platform they provide. In addition, most of the patent applications in the name of the company is made by the inventors belonging to the company, the inventors of the global trend to collect domestic and foreign patent information related to their invention. The reason for this is that despite the existence of the patent publication period, more than 70% of the world's new technology information is disclosed in the form of patent information, and because the patent information has many commercially available ideas compared to the paper information and the like. However, many of these inventors, even if they are not used to patent searching or who are familiar with searching, have access to external patent information service providers such as www.delphion.com to obtain patent information.

통상적으로 발명자들은 특정 기술 분야에 대하여 수년 내지 십수년간 지속적으로 연구하는 것이 일반적이므로, 자신 발명한 기술 분야가 크게 변동되지 않는 것이 일반적이다. 그러므로, 자신이 발명한 기술 분야와 직접 또는 간접적으로 관련되는 분야에 관한 국내 또는 해외 각국의 특허 기술 정보는 발명자에게 유익한 정보가 될 것이다. 그러므로, 개별 기업 단위 뿐만 아니라, 그 기업 소속의 발명자 단위로 운영되며, 발명자 단위까지 최적화되어 특정 발명자 만을 위한 차별화된 특허 정보 서비스 시스템의 공급이 요청되어 왔으며, 이와 같은 특허 정보 서비스 시스템은 자체적으로 개발하고 유지할 수 없는 중견 및 중소벤처 기업들에게는 특허 정보에 대한 접근성을 크게 향상시킬 것이며, 자체적으로 개발 유지할 수 있는 기업들에게는 사내 특허 정보 유통의 새로운 플랫폼이 될 것이다.It is common for the inventors to continuously study the specific technical field for several years to several decades, so that the technical field of the invention of the present invention is not greatly changed. Therefore, the patent technical information of the country or foreign countries about the field directly or indirectly related to the technical field invented by him will be useful information to the inventor. Therefore, it has been requested to supply a differentiated patent information service system for a specific inventor, which is operated by the inventor unit of the company as well as an individual company unit, and optimized to the inventor unit. Such a patent information service system has been developed by itself. It will greatly improve access to patent information for medium and small venture companies that cannot be maintained and will be a new platform for in-house patent information distribution for companies that can develop and maintain their own.

AUREKA는 텍스트마이닝 처리를 통해 키워드를 추출하고, 이를 co-occurrence 처리하여 등고선 맵으로 제공해 주는 AUREKA 서비스는 매력적인 등고선 맵으로 빈발 키워드군을 보여주고 있으나, 바다(푸른색으로 채워져 있는 영역) 또는 키워드가 밀집하지 않은 영역에 대한 기술적 의미가 없어(AUREKA에는 그리드(위도/경도, 좌표축)이 없다.) 틈새 기술의 구조적 발견 등과 같은 고부가가치 서비스를 제공해 주지 못하는 문제가 있다.AUREKA extracts keywords through text mining and co-occurrences to provide contour maps. The AUREKA service shows frequent keyword groups with attractive contour maps, but the ocean (area filled with blue) or keywords There is no technical meaning for non-dense areas (AUREKA has no grid (latitude / longitude, coordinate axes)) and there is a problem that it cannot provide high value-added services such as structural discovery of niche technology.

이에 따라, 니치 기술을 조기에 발견하고 선점하기 위해서는 체계적으로 니치 기술 영역이나 틈새 기술 영역의 발견을 지원해 탐색을 지원해 주는 서비스나 시스템의 개발이 절실히 요구되어 왔다.
Accordingly, in order to detect and preempt niche technology early, there is an urgent need for the development of services or systems that support discovery by systematically discovering niche or niche technology areas.

본 발명이 해결하려는 첫번째 기술적 과제는 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법을 제시하는 것이다.The first technical problem to be solved by the present invention is to propose an information processing method of a patent information system that provides a niche technology area search service.

본 발명이 해결하려는 두번째 기술적 과제는 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템을 제시하는 것이다.
The second technical problem to be solved by the present invention is to propose a patent information system that provides a niche technology area search service.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 유무선 네트워크를 통하여 사용자 컴퓨터로 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법에 있어서, 상기 특허 정보 시스템이 (a) 상기 사용자 컴퓨터로부터 제1 특허 분류를 사용하는 제1 특허 분류 집합 및 적어도 하나 이상의 제2 분류를 사용하는 제2 분류 집합에 대한 선택 정보를 입수 받는 단계; (b) 상기 제1 특허 분류 집합 및 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성하는 단계; 및 (c) 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합(교차 영역 집합)을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성하는 단계;를 포함하며, 상기 제1 특허 분류는 IPC, F-term, FI, UPSC, ECLA 중 어느 하나 이상인 것인 것을 특징으로 하는 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템의 정보 처리 방법을 제시한다.In order to achieve the technical problem to be achieved by the present invention, an information processing method of a patent information system that provides a niche technology area search service to a user computer through a wired or wireless network, wherein the patent information system is (a) from the user computer; Receiving selection information about the first patent classification set using the patent classification and the second classification set using the at least one second classification; (b) constructing an intersection region set having at least two intersection regions using the first patent classification set and the second classification set; And (c) generating at least one niche analysis information as at least one cross region value of at least one cross region constituting the cross region set based on a cross region document set (cross region set) corresponding to the cross region. And, wherein the first patent classification is any one or more of IPC, F-term, FI, UPSC, and ECLA, and provides an information processing method of a patent information system providing a nich technology area search service. .

(d) 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시하도록 하는 단계;를 더 포함하는 것이 바람직하다.and (d) displaying at least one of the intersection area value and the intersection area value processing result of processing the intersection area value and the intersection area value on a predetermined basis in the intersection area in the intersection area.

상기 교차 영역 집합은 상기 제1 특허 분류 집합과 상기 제2 분류 집합을 사용하는 매트릭스나 큐브 형태의 교차 영역 프레임인 것이 바람직하다.The cross region set is preferably a cross region frame in the form of a matrix or a cube using the first patent classification set and the second classification set.

상기 교차 영역 문건 집합은 적어도 하나 이상의 타겟 문건 집합의 부분 문건 집합인 것이며, 상기 타겟 문건 집합은 상기 사용자가 지정하거나 생성한 문건 집합이거나, 상기 시스템이 지정하거나 생성한 문건 집합인 것인 것이며, 선택적으로 상기 타겟 문건 집합은 기 설정된 한정 조건을 충족하는 문건들로만 한정될 수 있는 것이며, 상기 한정 조건은 시간 한정, 출원인이나 권리자 한정, 발명자 한정, 특허 평가 점수나 특허 평가 등급의 한정, 출원인, 권리자 또는 발명자의 속성 한정, 특허 평가 지표의 충족 정도의 한정, 특정 서지 사항 속성 포함 여부의 한정, 특허에 대한 측정값의 한정 중 어느 하나 이상인 것이 바람직하다.The cross-domain document set is a partial document set of at least one target document set, and the target document set is a document set specified or created by the user, or a document set specified or generated by the system, and optional As such, the target document set may be limited to documents that satisfy a predetermined limit condition, and the limit condition may be a time limit, an applicant or an owner limit, an inventor limit, a patent evaluation score or a patent evaluation level limit, an applicant, an owner, or It is preferable that it is any one or more of limitation of the attribute of an inventor, limitation of the satisfaction degree of a patent evaluation index, limitation of whether a specific bibliographic attribute is included, or limitation of the measured value about a patent.

상기 제2 분류는 특허 분류 또는 특허 분류와 대응 관계를 가지는 준 특허 분류인 것이며, 상기 준 특허 분류는 IndextoUSPC, CatchWord 중 어느 하나 이상인 것이며, 상기 준 특허 분류는 준 특허 분류 1개당 적어도 하나 이상의 특허 분류가 대응되는 것인 것이 바람직하다.The second classification is a patent classification or a semi-patent classification having a corresponding relationship with the patent classification, the semi-patent classification is any one or more of IndextoUSPC and CatchWord, and the semi-patent classification is at least one patent classification per one semi-patent classification. It is preferable that is corresponded.

상기 (a) 단계에서, 적어도 하나 이상의 하위 특허 분류를 가지는 적어도 하나 이상의 상위 특허 분류를 선택함으로써, 상기 선택된 상위 특허 분류에 대한 상기 하위 특허 분류를 상기 제1 특허 분류 집합으로 처리하는 것인 것이 바람직하다.In the step (a), it is preferable to process the lower patent classification for the selected upper patent classification as the first patent classification set by selecting at least one or more upper patent classifications having at least one lower patent classification. Do.

상기 교차 영역 문건 집합은 타겟 문건 집합 중 상기 제1 특허 분류 집합을 구성하는 제1 특허 분류와 상기 제2 분류 집합을 구성하는 제 2분류의 속성을 동시에 충족하는 문건들로 구성되는 것이며, 상기 속성의 충족은 상기 교차 영역 문건에 상기 제1 특허 분류와 상기 제2 분류의 전부 또는 일부분이 동시에 나타나 있거나, 상기 교차 영역 문건에 나와 있는 특허 분류의 특허 분류 체계 상의 적어도 하나 이상의 상위 특허 분류의 전부 또는 일부분이 상기 제1 특허 분류와 상기 제2 분류에 동시에 나타나는 것인 것이 바람직하다.The cross-domain document set is composed of documents that simultaneously satisfy the attributes of the first patent classification constituting the first patent classification set and the second classification constituting the second classification set among the target document sets. Satisfies all or part of the first patent classification and the second classification simultaneously in the cross-domain document, or all or at least one higher patent classification on the patent classification system of the patent classification in the cross-domain document Preferably, a portion appears simultaneously in the first patent classification and the second classification.

상기 제1 특허 분류 집합은 특허 분류 체계 상에서 어느 하나의 상위 특허 분류에 대하여, 상기 상위 특허 분류의 전체 하위 특허 분류 중 상기 사용자가 선택하는 하위 특허 분류를 포함하여 구성되는 것인 것이 바람직하다.Preferably, the first patent classification set includes a lower patent classification selected by the user among all lower patent classifications of the upper patent classification with respect to any one upper patent classification in the patent classification system.

상기 니치 분석 정보는 상기 사용자가 선택할 수 있는 것이며, 상기 니치 분석 정보는 적어도 하나 이상의 니치 분석 지표나 니치 분석 모델을 사용하는 것이며, 상기 교차 영역값은 상기 니치 분석 지표나 상기 니치 분석 모델의 선택에 따라 변동되는 것인 것이 바람직하다.The niche analysis information may be selected by the user, and the niche analysis information may use at least one niche analysis index or a niche analysis model, and the intersection area value may be used to select the niche analysis index or the niche analysis model. It is preferable to vary according to.

상기 교차 영역 프레임에는 상기 제1 특허 분류 집합을 구성하는 특허 분류와 상기 제2 분류 집합을 구성하는 제2 분류가 나타나는 것이며, 상기 교차 영역 프레임은 상기 제1 특허 분류 집합을 구성하는 특허 분류가 나타나는 제1 프레임축, 상기 제2 분류 집합을 구성하는 제2 분류가 나타나는 제2 프레임축으로 구성되며, 상기 교차 영역이 배치되는 방법은 상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치하는 제1 배치 방법, 제1 특허 집합의 특허 분류에 대응되는 교차 영역 집합이 없는 없거나 제2 분류 집합의 제2 분류에 대응되는 교차 영역 집합이 없는 경우, 교차 영역 집합이 없는 특허 분류나 제2 분류를 제외하고 교차 영역 집합이 존재하는 특허 분류나 제2 분류에 대하여, 상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치하는 제2 배치 방법, 및 상기 제1 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 상기 제2 프레임축에 가까이 배치하고, 상기 제2 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 상기 제1 프레임축에 가까이 배치하는 제3 배치 방법 중 어느 하나의 방법이 적용되는 것인 것이며, 상기 제1 배치 방법, 상기 제2 배치 방법 및 상기 제3 배치 방법은 상기 사용자의 선택이나 상기 시스템의 설정에 따라 선택될 수 있는 것인 것이 바람직하다.In the cross region frame, a patent classification constituting the first patent classification set and a second classification constituting the second classification set appear. The cross region frame indicates a patent classification constituting the first patent classification set. A first frame axis, a second frame axis in which a second classification constituting the second classification set appears, and a method of arranging the intersection areas includes an order in which the first patent classification set is arranged on a patent classification system; A first arrangement method in which a second classification set is arranged based on the order listed on the second classification scheme, there is no cross region set corresponding to the patent classification of the first patent set, or the second classification set corresponds to the second classification of the second classification set In the absence of a cross-section set, the patent for which there is a cross-section set except for the patent classification or the second classification without the cross-section set. A second arrangement method for classifying or classifying the second classification based on the order in which the first set of patent classifications are arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system; and The higher the sum of the crossing area values for all the crossing areas in the first frame axis direction is closer to the second frame axis, the higher the sum of the crossing area values for all the crossing areas in the second frame axis direction is higher. Any one of the third arrangement methods to be disposed close to one frame axis is applied, and the first arrangement method, the second arrangement method, and the third arrangement method are selected by the user or set up of the system. It is preferred that it can be selected according to.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 유무선 네트워크를 통하여 사용자 컴퓨터로 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템에 있어서, 상기 사용자 컴퓨터로부터 제1 특허 분류를 사용하는 제1 특허 분류 집합 및 적어도 하나 이상의 제2 분류를 사용하는 제2 분류 집합에 대한 선택 정보를 입수 받는 교차 영역 프레임 선택부; 상기 제1 특허 분류 집합 및 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성하는 교차 영역 집합 생성부; 및 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합(교차 영역 집합)을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성하는 니치 분석 정보 생성부;를 포함하며, 상기 제1 특허 분류는 IPC, F-term, FI, UPSC, ECLA 중 어느 하나 이상인 것인 것을 특징으로 하는 니치 기술 영역 탐색 서비스를 제공해 주는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a patent information system for providing a niche technology area search service to a user computer through a wired or wireless network, the first patent classification set using a first patent classification from the user computer and A cross region frame selection unit receiving selection information about a second set of classifications using at least one second classification; An intersection region generation generator configured to form an intersection region set having at least two intersection regions using the first patent classification set and the second classification set; And generating niche analysis information for at least one intersecting region constituting the intersecting region set, by generating one or more niche analysis information as an intersecting region value based on a cross region document set (intersecting region set) corresponding to the intersecting region set. And a first patent classification is any one or more of IPC, F-term, FI, UPSC, and ECLA. The patent information system provides a niche technology area search service.

상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시하도록 하는 니치 분석 정보 표시부;를 더 포함하는 것이 바람직하다.The niche analysis information display unit may further include a niche analysis information display unit configured to display at least one of the intersection area value and the intersection area value processing result of processing the intersection area value on a predetermined basis in the intersection area.

제1 특허 분류 집합 및 제2 분류 집합의 선택 이력 및 현재 선택 사항을 관리하는 선택 프레임 관리부;를 더 포함하는 것이 바람직하다.Preferably, the apparatus further includes a selection frame management unit configured to manage selection history and current selection of the first patent classification set and the second classification set.

상기 니치 분석 정보 생성부는 니치 관점별로 니치 분석 정보를 생성하는 니치 관점 분석 정보 생성부; 및 니치 모델별로 니치 분석 정보를 생성하는 니치 모델 분석 정보 생성부; 중 어느 하나 이상을 더 포함하는 것인 것이 바람직하다.The niche analysis information generation unit may include a niche viewpoint analysis information generation unit configured to generate niche analysis information for each niche viewpoint; And a niche model analysis information generator configured to generate niche analysis information for each niche model. It is preferable to further include any one or more of.

상기 교차 영역 집합을 배치하는 교차 영역 집합 배치부;를 더 포함하며, 상기 교차 영역 집합 배치부가 상기 교차 영역 집합 배치부가 상기 교차 영역을 배치하는 방법은The cross region set arrangement unit for arranging the cross region set further comprises, wherein the cross region set arrangement unit the cross region set arrangement unit arranges the cross region

상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치하는 제1 배치 방법, 제1 특허 집합의 특허 분류에 대응되는 교차 영역 집합이 없는 없거나 제2 분류 집합의 제2 분류에 대응되는 교차 영역 집합이 없는 경우, 교차 영역 집합이 없는 특허 분류나 제2 분류를 제외하고 교차 영역 집합이 존재하는 특허 분류나 제2 분류에 대하여, 상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치하는 제2 배치 방법, 및 제1 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 제2 프레임축에 가까이 배치하고, 제2 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 제1 프레임축에 가까이 배치하는 제3 배치 방법 중 어느 하나의 방법 중 어느 하나를 실시하는 것이며, 교차 영역 프레임에는 상기 제1 특허 분류 집합을 구성하는 특허 분류와 상기 제2 분류 집합을 구성하는 제2 분류가 나타나는 것이며, 상기 교차 영역 프레임은 상기 제1 특허 분류 집합을 구성하는 특허 분류가 나타나는 제1 프레임축, 상기 제2 분류 집합을 구성하는 제2 분류가 나타나는 제2 프레임축으로 구성되며, 상기 제1 배치 방법, 상기 제2 배치 방법 및 상기 제3 배치 방법은 상기 사용자의 선택이나 상기 시스템의 설정에 따라 선택될 수 있는 것인 것이 바람직하다.
A first arrangement method for arranging the first patent classification set on the patent classification system and the second classification set on the second classification system on the basis of the first classification method and the patent classification of the first patent set If there is no cross region set or there is no cross region set corresponding to the second classification of the second classification set, the patent classification or the second classification with the cross region set except for the patent classification or the second classification without the cross region set A second arrangement method for arranging, based on the order in which the first set of patent classifications are arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system, and a first frame axis direction. The higher the sum of the crossing area values for all the crossing areas, the closer to the second frame axis, and for all crossing areas in the second frame axis direction. The higher the sum of the difference region values is, the more any one of the third arrangement methods arranged closer to the first frame axis is performed, and in the cross region frame, the patent classification constituting the first patent classification set and the The second classification constituting the second classification set appears, and the cross-region frame indicates the first frame axis in which the patent classification constituting the first patent classification set appears, and the second classification constituting the second classification set appears. It is preferable that it is comprised by a 2nd frame axis, and the said 1st arrangement method, the said 2nd arrangement method, and the said 3rd arrangement method can be selected according to the selection of the user or the setting of the system.

본 발명을 실시하면 다음과 같은 효과가 있다.Implementing the present invention has the following effects.

첫째 니치 기술 영역을 극히 용이하게 탐색할 수 있어 니치 기술 영역의 발견 및 선점의 기회를 향유할 수 있게 된다. 특히, 특정한 문건 집합을 대상으로 다양하게 프레임 축을 변경해 가면서 그 문건 집합을 다각도로 분석해 볼 수 있으며, 다양한 프레임 하에서의 니치 영역을 용이하게 탐색해 갈 수 있다.First, the niche technology area can be explored very easily, thus enjoying the opportunity of discovery and preemption. In particular, by varying the frame axis for a specific document set, the document set can be analyzed from various angles, and the niche area under various frames can be easily explored.

둘째, 다양한 니치 관점별로 니치 평가 요소가 대응되므로, 니치 기술 영역에 대한 체계적인 분석이 가능하다.Second, since niche evaluation factors correspond to various niche perspectives, a systematic analysis of niche technology areas is possible.

셋째, 니치 옵션 처리부를 통하여 니치 옵션을 활용하여 다양한 한정을 할 수 있고, 니치 영역을 정밀하게 탐색해 갈 수 있게 된다.Third, the niche option processing unit enables various limitations by using the niche option, and precisely explores the niche area.

넷째, 니치 기술 영역이 시각적으로 분명하게 드러나게 되며, 탐색의 효율성이 극대화된다. 한편, 니치 기술 영역에 대한 평가값에 칼라 등의 범례를 적용함으로써 시각적인 명쾌함을 누릴 수 있어, 탐색의 우선 순위를 용이하게 정할 수 있다.
Fourth, the niche description area becomes clearly visible and maximizes the efficiency of navigation. On the other hand, by applying a legend such as a color to the evaluation value for the niche description region, visual clarity can be enjoyed, and the priority of search can be easily determined.

도 1은 본 발명의 특허 정보 시스템의 네트워크적 구성에 관한 일실시예적 도면이다.
도 2는 본 발명의 특허 정보 시스템의 구성에 관한 일실시예적 도면이다.
도 3은 본 발명의 데이터 가공부에 관한 일실시예적 도면이다.
도 4는 본 발명의 데이터 가공부에 관한 일실시예적 도면이다.
도 5는 본 발명의 특허 정보 서비스 지원부에 관한 일실시예적 도면이다.
도 6은 본 발명의 특허 분석 정보 생성부에 관한 일실시예적 도면이다.
도 7은 본 발명의 특허 분석 정보 생성부에 관한 일실시예적 도면이다.
도 8은 본 발명의 니치 기술 영역 발견 시스템의 구성에 관한 일 실시예적 도면이다.
도 9는 본 발명의 니치 평가 모델을 생성하는 방법에 대한 일 실시예적 도면이다.
도 10은 본 발명의 니치 평가 모델로 니치 평가값을 생성하는 방법에 대한 일 실시예적 도면이다.
도 11은 본 발명의 니치 평가 모델을 활용하여 니치 기술 영역을 발견하는 시뮬레이션 방법에 대한 일 실시예적 도면이다.
도 12는 본 발명의 니치 기술 영역 발견 시스템이 교차 영역에 니치 평가 요소값으로 교차 영영값을 생성하는 방법에 관한 일 실시예적 도면이다.
도 13은 본 발명의 니치 기술 영역 발견 시스템이 니치 옵션을 사용하여 니치 분석 정보를 생성하는 방법에 관한 일 실시예적 도면이다.
도 14는 본 발명의 니치 기술 영역 발견 시스템이 특허 분류 집합이나 분류 집합을 변경하여 새로운 교차 영역 집합을 구성하고 교차 영영값을 생성하는 방법에 관한 일 실시예적 도면이다.
도 15는 본 발명의 니치 기술 영역 발견 시스템이 니치 옵션을 사용하여 타겟 문건 집합을 한정하여 새롭게 교차 영영값을 생성하는 방법에 관한 일 실시예적 도면이다.
도 16 본 발명의 니치 기술 영역 발견 시스템의 일 실시예적 구현예이다. 본 발명의 니치 기술 영역 탐색 UI가 나타나 있다.
도 17는 제1 특허 분류를 사용하는 제1 특허 분류 집합의 선택 방법에 대한 일 구현예를 보여 주는 도면이다. 도 17에서는 5D107AA 전체가 선택되고 있음을 보여 준다.
도 18은 다른 니치 평가 요소(다른 니치 관점)에 대하여 상기 니치 분석 정보 생성부가 니치 분석 정보를 생성하기 위하여, 니치 관점을 선택하고 있는 모습을 보여 주는 일 실시예적 도면이다.
도 19는 같이 상기 교차 영역 프레임 선택부를 이용하여, 제1 특허 분류 집합 및 제2 분류 집합 중 사용자가 원하는 특허 분류나 분류를 루트(root)로부터 선택해 가능 과정을 보여 주는 개념도이다.
도 20은 가로 프레임이나 세로 프레임 중 어느 하나 이상이 변경되고, 변경된 프레임을 기준으로 니치 관점에 따른 니치 분석 정보가 생성되는 예를 보여 주는 도면이다.
도 21은 특정 국가의 특허에 대하여, 기술 영역 A에 대한 특허 출원수의 시기적 변화를 나타내는 그래프의 일례이다.
도 22는 기술 영역 A, B, C에 대한 특허 출원수의 시기적 변화를 나타내는 그래프의 일례이다.
도 23은 Gradient Boost algorithm 알고리즘에 대한 요약을 설명하는 도면이다.
도 24는 과적합의 경우를 설명하기 위한 보조 도면이다.
도 25는 Friedman(2002)이 새로이 stochastic gradient boosting algorithm을 제안한 알고리즘에 대한 요약을 설명하는 도면이다.
도 26는 니치 평가 모델 생성 과정에 관한 일 실시예적 개념을 설명하는 도면이다.
도 27은 종래 기술의 Aureka 분석 결과 화면에 관한 일실시예적 도면이다. 본 화면에서는 좌표축의 제목, 범례, 구역, 셀(cell)의 기술적 명칭이 없음을 알 수 있다.
도 28은 본 발명의 틈새 영역 발견 분석 결과에 관한 다른 일실시예적 도면이다. 본 화면에서는 각 축, 구역/셀(cell)에 대하여 thermograph 효과가 적용되어 있음을 볼 수 있다.
도 29는 H01B 1/00의 하부에 있는 H01B 1/06의 형제 노드와 H01B 1/06의 하위 특허 분류 노드를 보여 주는 특허 분류 체계의 일 부분에 대한 도면이다.
도 30은 WIPO가 제공하는 IPC에 대한 캐치워드로 이들에 대한 정보에 관한 일실시예적 도면이다.
도 31는 USPTO가 제공하는 USPC에 대한 인덱스에 대한 일실시예적 도면이다.1 is a diagram illustrating an exemplary network configuration of a patent information system of the present invention.
2 is a diagram illustrating an exemplary embodiment of a patent information system of the present invention.
3 is an exemplary diagram of a data processing unit of the present invention.
4 is an exemplary diagram of a data processing unit of the present invention.
5 is an exemplary view of a patent information service support unit of the present invention.
6 is an exemplary view of a patent analysis information generation unit of the present invention.
7 is an exemplary view of a patent analysis information generation unit of the present invention.
FIG. 8 is an exemplary view of the configuration of the niche description region discovery system of the present invention. FIG.
9 is an exemplary diagram for a method of generating a niche evaluation model of the present invention.
10 is an exemplary diagram for a method for generating a niche evaluation value with the niche evaluation model of the present invention.
FIG. 11 is an exemplary diagram for a simulation method of discovering a niche description region using the niche evaluation model of the present invention. FIG.
12 is an exemplary diagram of a method for generating a crossing zero value as a niche evaluation element value in an intersection region by the niche description region discovery system of the present invention.
FIG. 13 is an exemplary diagram of a method for generating niche analysis information using a niche description area discovery system of the present invention.
FIG. 14 is an exemplary diagram illustrating a method for changing a patent classification set or a classification set to form a new cross region set and generating a cross zero value by the niche description region discovery system of the present invention.
FIG. 15 is an exemplary diagram illustrating a method of generating a cross-zero value by defining a target document set using a niche option by the niche description region discovery system of the present invention.
Figure 16 is one embodiment implementation of a niche description region discovery system of the present invention. The niche description region navigation UI of the present invention is shown.
FIG. 17 is a diagram illustrating an embodiment of a method of selecting a first patent classification set using the first patent classification. 17 shows that the entire 5D107AA is being selected.
18 is a diagram illustrating an example in which the niche analysis information generation unit selects a niche viewpoint in order to generate niche analysis information for another niche evaluation element (another niche viewpoint).
19 is a conceptual diagram illustrating a process of selecting a patent classification or classification desired from a root among a first patent classification set and a second classification set by using the cross region frame selection unit.
20 is a diagram illustrating an example in which at least one of a horizontal frame and a vertical frame is changed and niche analysis information according to a niche perspective is generated based on the changed frame.
21 is an example of the graph which shows the temporal change of the patent application number with respect to the technical area A with respect to the patent of a specific country.
FIG. 22 is an example of the graph which shows the temporal change of the patent application number with respect to the technical areas A, B, and C. FIG.
FIG. 23 is a diagram illustrating a summary of the Gradient Boost algorithm algorithm. FIG.
24 is an auxiliary diagram for explaining the case of overfitting.
FIG. 25 is a diagram illustrating a summary of an algorithm proposed by Friedman (2002) to newly propose a stochastic gradient boosting algorithm.
FIG. 26 is a view for explaining an exemplary concept related to a niche evaluation model generation process. FIG.
FIG. 27 is a diagram illustrating an Aureka analysis result screen according to the related art. FIG. In this screen, it can be seen that there is no technical name of the title, legend, zone, and cell of the coordinate axis.
28 is another exemplary diagram illustrating a result of a gap region discovery analysis according to the present invention. In this screen, you can see that the thermograph effect is applied to each axis and zone / cell.
FIG. 29 is a diagram of a portion of a patent classification system showing sibling nodes of H01B 1/06 at the bottom of H01B 1/00 and lower patent classification nodes of H01B 1/06.
30 is an exemplary diagram of information on the catchwords for the IPCs provided by the WIPO.
FIG. 31 is an exemplary diagram of an index for a USPC provided by the USPTO. FIG.

이하, 도면을 참조하면서 더욱 상세하게 설명한다.Hereinafter, it demonstrates in detail, referring drawings.

본 발명의 특허 정보 시스템(10000)은 도 1에서 예시되어 있는 바와 같이, 유무선 네트워크(200)를 통하여 사용자 컴퓨터(100)에게 정보 서비스를 제공한다. 상기 특허 정보 시스템(10000)은 적어도 하나 이상의 연계 시스템(300)과 유무선 네트워크(200)로 연결되어 있을 수 있는데, 상기 연계 시스템(300)의 예는 특허 raw data를 제공하는 각국 특허청 또는 raw data 공급자의 시스템, 기업 정보 제공자의 시스템 등이 그 예가 될 수 있으며, 본 발명의 특허 정보 시스템(10000)의 서비스를 연계하여 제공하는 적어도 하나 이상의 시스템이 그 예가 될 수 있으며, 본 설명에 한정하지 않는다.As illustrated in FIG. 1, the patent information system 10000 of the present invention provides an information service to a user computer 100 through a wired / wireless network 200. The patent information system 10000 may be connected to at least one link system 300 and a wired / wireless network 200. An example of the link system 300 is a patent office or a raw data provider of each country that provides patent raw data. An example may be a system, a system of an enterprise information provider, or the like, and at least one or more systems that provide services by linking the services of the patent information system 10000 of the present invention may be examples.

상기 특허 정보 시스템(10000)은 도 2에서 예시되어 있는 바와 같이, 크게 데이터부(1000) 및 데이터 가공부(2000), 검색 처리부(3100), 분석 정보 생성부(4000), 등을 포함하고 있다. 상기 데이터부(100)는 특허 데이터부(1100), 비특허 데이터부(1200)로 구성되며, 상기 특허 데이터부(1100)는 특허 명세서 파일부(1110)와 특허 DB(1120), 특허 분류 DB(1130) 및 기타 분류 DB부(1140)를 포함하고 있다. As illustrated in FIG. 2, the patent information system 10000 includes a data unit 1000, a data processing unit 2000, a search processing unit 3100, an analysis information generating unit 4000, and the like. . The data unit 100 includes a patent data unit 1100 and a non-patent data unit 1200. The patent data unit 1100 includes a patent specification file unit 1110, a patent DB 1120, and a patent classification DB. 1130 and other classification DB units 1140.

특허 DB(1120)는 모든 특허에 대한 서지 사항과 명세서 본문, 도면 등을 필드별로 관리하며, 상기 명세서 본문을 구성하는 각종 필드(제목, 요약, 선행 기술, 특허 청구 범위, 발명의 상세한 설명 등)에서 추출한 핵심 키워드를 포함하고 있다. 한편, 상기 특허들은 상기 특허들에 대한 선행 기술 문건으로 인용 정보를 더 포함하고 있을 수 있다. 미국 특허 data를 예로 들면, 상기 인용 정보는 Reference에 포함된 정보들로, 미국 특허 문건 번호, 외국 특허 문건 번호, 비특허 문건에 대한 표시자 등이 포함되어 있다. 한편, 특허청 심사관이나 관련자에 의한 선행 기술 조사 정보, 심사관 의견 제출 시 참증 정보 등도 광의의 인용 정보가 된다. 특정 문건에 전방 인용 정보가 있는 경우, 상기 전방 인용 정보에 포함된 문건의 입장에서 보면 상기 특정 문건은 후방 인용 문건이 된다. 특정 문건을 기준으로 전방 인용 정보에 포함된 문건은 부모(parent) 문건이 되며, 상기 부모 문건을 기준으로 보면 상기 특정 문건은 자식(child) 문건이 된다. 이러한 child-parent 관계에 있는 정보를 DB로 처리하는 것은 당업자에게는 자명한 것으로 상세한 설명은 생략한다.The patent DB 1120 manages bibliographic matters of all patents, the specification body, and drawings for each field, and various fields (title, summary, prior art, claims, detailed description, etc.) constituting the specification body. Contains key keywords extracted from. Meanwhile, the patents may further include citation information as prior art documents for the patents. For example, in the US patent data, the citation information is information included in the reference, and includes US patent document numbers, foreign patent document numbers, and indicators for non-patent documents. Meanwhile, information on prior art research by examiners and related parties of patent offices, and information on verification when submitting opinions of examiners are also widely cited information. If there is forward citation information in a specific document, the specific document becomes a backward citation document from the perspective of the document included in the forward citation information. The document included in the forward citation information based on a specific document becomes a parent document, and based on the parent document, the specific document becomes a child document. It is obvious to those skilled in the art to process such information in child-parent relationship with a DB, and detailed description thereof will be omitted.

특허 문건의 서지 사항에는 발행 국가 정보, 각종 일자 정보, 각종 번호 정보, 적어도 하나 이상의 권리자 정보, 적어도 하나 이상의 발명자 정보, 적어도 하나 이상의 특허 분류 정보, 적어도 하나 이상의 우선권 정보 등이 포함되어 있다. 일자에는 출원일, 공개일, 등록일 및 기타 일자들이 있다. 각종 번호 정보에는 출원번호, 공개번호, 등록번호, 원출원 번호, 우선권 주장 번호 등이 있다. 권리자 정보는 출원인, 양수인, 특허권자 등이 있으며, 권리자의 변동이 있고, 변동이 관리되는 경우, 양도인과 양수인에 대한 정보 및 최종 권리자 정보가 있을 수 있다. 우선권 정보에는 우선권 주장 번호, 주장일자, 국가 등의 정보가 포함되어 있다. 한편, 분할출원이나 일부 계속 출원, 계속 출원 등이 있는 경우, 원출원 번호, 원출원일자 등의 정보가 부가되어 있다. 또한, 대표도, 제목, 요약, 색인어 등도 서지 사항에 포함되기도 한다. 한편, 가공된 서지 사항으로는 국내 패밀리 정보(분할출원, 변경출원이나 일부계속출원, 계속출원 관계에 있는 특허 출원)이나, 해외 패밀리 정보(조약 우선권 관계로 연관될 수 있는 출원, 국제출원 등)이 있을 수 있다. 한편, 자연어 처리 등을 통해서 특허 명세서 본문의 텍스트를 본문을 구성하는 각 필드별 또는 필드 통합별로 기 설정된 키워드 추출 방식으로 추출한 핵심 키워드 정보가 더 있을 수 있다. 특허 분류 정보는 공통적인 IPC 이외에도 USPC, FT, FI, ECLA 등과 같이 각 국가별로 자국만의 특별한 특허 분류가 있을 수 있다.Bibliographic details of patent documents include country information, various date information, various number information, at least one owner information, at least one inventor information, at least one patent classification information, at least one priority information, and the like. Dates include application date, publication date, registration date, and other dates. Various number information includes application number, publication number, registration number, original application number, priority claim number, and the like. The owner information includes the applicant, the assignee, the patent owner, and the like, and if there is a change in the owner, and the change is managed, there may be information on the assignor and the assignee and information on the last owner. Priority information includes information such as priority claim number, claim date, and country. On the other hand, when there is a divisional application, partial continuous application, continuous application, or the like, information such as the original application number and the original application date is added. Representatives, titles, summaries, and index terms may also be included in the bibliography. On the other hand, processed bibliographic information includes domestic family information (split application, change application or partial application, patent application that has a continuous application), or foreign family information (applications that may be related to treaty priority relationship, international application, etc.). This can be. On the other hand, there may be further key keyword information extracted by a predetermined keyword extraction method for each field or field integration constituting the body of the text of the patent specification body through natural language processing. In addition to the common IPC, the patent classification information may have a unique patent classification for each country such as USPC, FT, FI, ECLA, etc.

도 29는 H01B 1/00의 하부에 있는 H01B 1/06의 형제 노드와 H01B 1/06의 하위 특허 분류 노드를 보여 주는 특허 분류 체계의 일 부분에 대한 도면이다. 이와 같이 특허 분류는 계층 구조를 가진다. 본 발명의 특허 분류는 특허 분류는 IPC, USPC, FT, FI, ECLA를 포함한다.FIG. 29 is a diagram of a portion of a patent classification system showing sibling nodes of H01B 1/06 at the bottom of H01B 1/00 and lower patent classification nodes of H01B 1/06. Thus, the patent classification has a hierarchical structure. Patent classification of the present invention, the patent classification includes IPC, USPC, FT, FI, ECLA.

인덱스(index)란, 캐치워드(catchword)라고 불리기도 하는 것으로, 단어, 구 또는 절에 적어도 하나 이상의 특허 분류를 대응해 놓은 체계를 말한다. 상기 인덱스 중 대표적인 것으로 IPC를 처리해 놓은 catchword가 있으며, 미국 특허청이 발행하는 index to USPC가 있다. 상기 인덱스도 특허 분류처럼 계층 구조를 가지는 경우가 많다. 상기 인덱스에는 제품명/부품명/요소기술 등에 해당하는 키워드가 내재되어 있는 경우가 많다. 상기 인덱스는 특허 분류를 용이하게 찾아 보기 쉽게 해 놓은 것으로 도 13 및 도 14에 그 예시가 있다. 도 13는 WIPO가 제공하는 IPC에 대한 캐치워드로 이들에 대한 정보는 2010년 3월 현재 http://www.wipo.int/ipc/itos4ipc/ITSupport_and_download_area/20100101/MasterFiles/에 있는 ipcr _ catchwordindex _20100101. zip 파일로 입수 가능하다. 도 13에서 ABACUSES에 대하여 G06C 1/00이 대응되어 있음을 볼 수 있다. 이때, ABACUSES를 대하여 G06C 1/00에 대응되는 인덱스라 명명한다. 이에 따라, G06C 1/00은 Abacuses로 역맵핑 할 수 있다. 한편, 도 13의 내용 중 인덱스 ABARADING에서 알 수 있듯이, 상기 캐치워드 체계는 적어도 1단계 이상의 계층 구조를 가지고 있음을 알 수 있다. ABARADING은 2단계 계층임을 보여 준다.An index, also called a catchword, refers to a system in which at least one patent classification is associated with a word, phrase, or clause. Among the indexes, there is a catchword that processes the IPC, and there is an index to USPC issued by the US Patent and Trademark Office. The index also has a hierarchical structure like a patent classification. In the index, keywords corresponding to product names / part names / element descriptions are often embedded. The index makes the classification of patents easy to find and examples are shown in FIGS. 13 and 14. FIG. 13 is a catchword for IPC provided by WIPO and information about them is ipcr _ catchwordindex _20100101 at http://www.wipo.int/ipc/itos4ipc/ITSupport_and_download_area/20100101/MasterFiles/ as of March 2010 . Available as a zip file. 13, it can be seen that G06C 1/00 corresponds to ABACUSES. In this case, ABACUSES is called an index corresponding to G06C 1/00. Accordingly, G06C 1/00 can be reverse mapped to Abacuses. Meanwhile, as can be seen from the index ABARADING in FIG. 13, it can be seen that the catchword system has at least one hierarchical structure. ABARADING shows that it is a two-tier hierarchy.

도 14는 USPTO가 USPC에 대한 인덱스로 이들에 대한 정보는 2010년 3월 현재 http://www.uspto.gov/web/patents/classification/uspcindex/indextouspc.htm에서 확인할 수 있다. 도 14의 내용 중에서 인덱스 Abrading은 3단계의 계층으로 분류되어 있음을 알 수 있다. 인덱스 Abrading은 USPC 451/38에 대응되고 있음을 알 수 있으며, Abrading의 하위 인덱스에 Class 451 이외의 다른 Class의 특허 분류도 존재함을 알 수 있다.14 is a USPTO index to the USPC information about them can be found at http://www.uspto.gov/web/patents/classification/uspcindex/indextouspc.htm as of March 2010. It can be seen that the index abrading is classified into three levels of hierarchy in the contents of FIG. 14. It can be seen that the index Abrading corresponds to USPC 451/38, and there is a patent classification of a class other than Class 451 in the lower index of Abrading.

상기 데이터 가공부(2000)는 도 3에서 예시되어 있는 바와 같이, 핵심 키워드 생성부와 분류 메타 데이터 생성부 및 목적 특화 데이터 생성부를 포함하고 있다. 니치 발견 서비스를 위해서는 분류 메타 데이터 생성부가 필요하며, 발견된 니치 기술 영역을 분석하기 위해서는 핵심 키워드 생성부 및 목적 특화 데이터 생성부가 필요하게 된다.As illustrated in FIG. 3, the data processor 2000 includes a core keyword generator, a classification metadata generator, and a purpose-specific data generator. For the niche discovery service, a classification metadata generation unit is required, and in order to analyze the found niche description area, a core keyword generation unit and a purpose-specific data generation unit are required.

이하, 도면을 참조하면서 더욱 더 상세하게 설명한다. 도 8은 본 발명의 니치 기술 영역 발견 시스템(5000)의 구성에 관한 일 실시예적 도면이다. 상기 니치 기술 영역 발견 시스템(5000)은 니치 기술 영역 발견을 지원하는 니치 기술 영역 발견 엔진, 니치 평가 모델을 생성하는 니치 평가 모델 생성부(5200) 및 니치 발견 정보를 생성하는 니치 발견 정보 생성부(5300)를 포함하고 있다. 상기 니치 기술 영역 발견 엔진은 니치 기술 영역의 발견을 용이하게 돕는 프레임 정보를 처리하는 n차원 프레임 처리부(5110), 니치 영역에 대한 분석 정보를 생성하는 니치 분석 정보 생성부(5120), 니치 관련된 각종 옵션을 처리하는 니치 옵션 처리부(5130)를 포함하고 있다. Hereinafter, the present invention will be described in more detail with reference to the drawings. 8 is an exemplary diagram of the configuration of the niche description region discovery system 5000 of the present invention. The niche description region discovery system 5000 may include a niche description region discovery engine that supports niche description region discovery, a niche evaluation model generator 5200 for generating a niche evaluation model, and a niche discovery information generation unit for generating niche discovery information ( 5300). The niche description region discovery engine includes an n-dimensional frame processing unit 5110 to process frame information to facilitate discovery of the niche description region, a niche analysis information generation unit 5120 to generate analysis information about the niche region, and various kinds of niche related information. A niche option processor 5130 for processing the option is included.

상기 n차원 프레임 처리부(5110)는 n차원의 프레임에 대하여, 선택된 복수 개의 프레임으로 교차 영역이 생성될 때, 프레임의 선택을 지원하는 교차 영역 프레임 선택부(5111), 상기 교차 영역 프레임 선택부(5111)를 통하여 프레임이 결정되는 경우, 상기 결정되는 프레임으로 교차 영역 집합을 생성하는 교차 영역 집합 생성부(5112), 입수된 타겟 문건 집합에 대하여, 생성되는 교차 영역 집합을 구성하는 교차 영역에 니치 분석 정보를 표시하는 니치 분석 정보 표시부(5113), 사용자에게 선택된 프레임 정보를 관리하는 선택 프레임 관리부(5114)를 포함하고 있다.The n-dimensional frame processing unit 5110 may include an intersection region frame selection unit 5111 and an intersection region frame selection unit that support frame selection when an intersection region is generated with a plurality of selected frames with respect to an n-dimensional frame. When the frame is determined through 5111, a cross region set generation unit 5112 generating a cross region set using the determined frame, and a niche at a cross region constituting the generated cross region set with respect to the obtained target document set. A niche analysis information display unit 5113 for displaying analysis information and a selection frame management unit 5114 for managing frame information selected by the user are included.

상기 니치 분석 정보 생성부(5120)는 교차 영역별로 선택된 니치 관점별 분석 정보를 생성하는 니치 관점 분석 정보 생성부, 교차 영역별로 선택된 니치 모델별로 분석 정보를 생성하는 니치 모델 분석 정보 생성부(5122)를 포함하고 있다. 한편, 상기 니치 분석 정보 생성부(5120)는 상기 사용자의 니치 관점의 선택을 돕기 위해, 니치 분석 종류를 제시할 수 있으며, 이는 본 발명의 니치 분석 종류 제시부(5123)가 담당한다. 그리고, 상기 니치 모델 종류의 선택을 돕는 것은 본 발명의 니치 모델 종류 제시부(5124)가 담당한다.The niche analysis information generation unit 5120 is a niche perspective analysis information generation unit for generating analysis information for each niche perspective selected for each intersection area, and a niche model analysis information generation unit 5122 for generating analysis information for each niche model selected for each intersection area. It includes. On the other hand, the niche analysis information generation unit 5120 may suggest a niche analysis type to assist the user in selecting the niche viewpoint, which is in charge of the niche analysis type presenting unit 5123 of the present invention. In addition, the niche model type presenting unit 5124 of the present invention is responsible for assisting the selection of the niche model type.

상기 니치 옵션 처리부(5130)는 한정 처리를 수행하는 한정 계열의 기능부와 범례 처리부(5135)를 포함한다. 상기 한정 계열의 기능부는 기간을 한정하는 기간 한정부가 있으며, 출원인, 발명자, 대리인 등의 한정을 관리하는 주체 한정부(5132), 기술 분야의 한정을 관리하는 기술 분야 한정부(5133), 개별 특허의 한정을 관리하는 개별 특허 한정부(5134)가 있다. 상기 기간에는 기간의 종류(출원일, 공개일, 등록일, 최선일 등)별로 기간이 관리 될 수 있을 것이다. 출원인의 한정은 출원인명 또는 출원인의 종류별(기업, 대학, 공공기관, 개인) 또는 출원인의 국적별(주소 정보나 우선권 주장 시의 국가 정보로 처리할 수 있다.) 또는 기 설정된 출원인 속성(특허 괴물 등과 같은 다분쟁 발생 출원인, 대기업, 고피인용 출원인 등)별로 한정될 수 있을 것이다. 발명자의 한정은 발명자별, 발명자 국적별, 또는 발명자의 소속 기관의 종류나 속성별로 한정될 수 있을 것이다. 한편, 기술 분야의 한정은 특허 분류나 준 특허분류(Catchword, Index 등)를 사용하는 한정, 키워드를 사용하는 한정 등이 있을 수 있다. 개별 특허에 대한 한정은 서지 사항의 양(청구항 수 등), 서지 사항을 처리한 양(피인용수 등)의 정도나, 기 설정된 속성(특허 자동 평가 등급이나 평가 점수, 분쟁 발생 특허, 표준 특허풀 포함 특허 등)의 양이나 여부 등으로 한정될 수 있을 것이다.The niche option processor 5130 includes a limited series of functional units and a legend processor 5135 to perform a limited process. The functional part of the limited series includes a time limiting part for limiting a period, subject limiting part 5152 for managing limitations of applicants, inventors, agents, etc., technical field limiting part 5133 for managing limitations in the technical field, and individual patents. There is an individual patent limiter 5134 which manages the limitation of. The period may be managed by the type of period (application date, publication date, registration date, best date, etc.). Applicant's limitation may be based on the applicant's name or the applicant's type (company, university, public institution, individual) or nationality of the applicant (can be treated as address information or national information at the time of priority claim) or preset applicant attribute (patent monster) Multi-dispute applicants, such as large companies, applicants for citations, etc.) will be limited to. The inventor may be limited by the inventor, by the nationality of the inventor, or by the type or attribute of the organization to which the inventor belongs. Meanwhile, the limitation of the technical field may include a patent classification, a limitation using a quasi patent classification (Catchword, Index, etc.), a limitation using a keyword, and the like. Restrictions on individual patents may include the amount of bibliographic matters (such as the number of claims), the amount of bibliographic matters processed (such as citations), or the pre-set attributes (automatic patent ratings, evaluation scores, disputed patents, and standard patents). It may be limited to the amount or whether) (including the patent (including pool)).

니치 평가 모델 생성부(5200)는 니치 평가 모델을 생성하기 위하여 복수개의 샘플 문건 집합을 기 설정된 분할 기준으로 분할하는 문건 집합 분할 모듈(5210), 니치 기준 요소값을 생성하는 니치 기준 요소값 생성 모듈(5220), 니치 평가 요소별로 주어진 문건 집합에 대하여 니치 평가 요소값을 생성하는 니치 평가 요소값 생성 모듈(5230), 니치 평가 알고리즘을 생성하는 니치 평가 알고리즘 생성 모듈(5240), 생성된 니치 평가 알고리즘을 검증하는 니치 평가 알고리즘 검증 모듈(5250), 주어진 문건 집합에 대하여 니치 평가값을 생성하는 니치 평가값 생성 모듈(5260)을 포함하고 있다. 니치 평가 모델과 관련된 각종 data는 니치 평가 모델 DB(5280)에 저장된다. 한편, 특정 문건 집합(교차 영역별 문건 집합을 포함한다.)에 대하여 생성되는 상기 니치 평가 요소값은 니치 평가 요소값 DB(5270)에 저장될 수 있다.The niche evaluation model generator 5200 may include a document set dividing module 5210 for dividing a plurality of sample document sets into preset dividing criteria in order to generate a niche evaluation model, and a niche reference element value generating module for generating niche reference element values. (5220), a niche evaluation element value generation module 5230 for generating niche evaluation element values for a given set of documents for each niche evaluation element, a niche evaluation algorithm generation module 5240 for generating a niche evaluation algorithm, and a generated niche evaluation algorithm A niche evaluation algorithm verification module 5250 for verifying the data, and a niche evaluation value generation module 5260 for generating a niche evaluation value for a given set of documents. Various data related to the niche evaluation model are stored in the niche evaluation model DB 5280. Meanwhile, the niche evaluation element value generated for a specific document set (including a cross-sectional document set) may be stored in the niche evaluation element value DB 5270.

상기 니치 발견 정보 생성부(5300)는 니치 분석용 특허 문건 집합을 입수하는 니치 분석용 문건 집합 입수부(5310)를 포함하고 있다. 상기 니치 분석용 문건 집합 입수부(5310)는 사용자 컴퓨터(100)로부터 문건 집합을 입수 받을 수도 있으며, 상기 특허 정보 시스템(10000)으로부터 문건 집합을 입수 받을 수 있다. 입수 받는 문건 집합은 사용자 컴퓨터(100) 또는 특허 정보 시스템(10000)이 검색 엔진부나 DB 쿼리 처리부 등을 이용하여 검색하거나 호출(select)하는 문건 집합일 수도 있으며, 상기 사용자나 시스템이 저장해 놓은 문건 집합 중에서 지정되는 어느 하나 이상이 될 수도 있다. 한편, 상기 니치 발견 정보 생성부(5300)는 니치 발견을 위한 기 설정된 적어도 하나 이상의 시뮬레이션을 수행하는 니치 발견 시뮬레이션 모듈(5320)을 더 포함하고 있을 수 있다. 발견되거나 분석된 니치 기술 영역에 관한 정보는 니치 발견 리포팅 정보 생성부(5330)를 통해서 웹이나, pdf 등의 파일로 리포팅 정보가 생성된다. 한편, 상기 사용자는 본 발명의 니치 기술 발견 UI부(5340)를 통하여 상기 특허 정보 시스템(10000)과 커뮤니케이션 할 수 있다.The niche discovery information generation unit 5300 includes a niche analysis document set acquisition unit 5310 to obtain a niche analysis patent document set. The niche analysis document set acquisition unit 5310 may obtain a document set from the user computer 100, and obtain a document set from the patent information system 10000. The received document set may be a set of documents searched or selected by the user computer 100 or the patent information system 10000 using a search engine unit or a DB query processing unit, or the like. It may be any one or more of the above. Meanwhile, the niche discovery information generation unit 5300 may further include a niche discovery simulation module 5320 that performs at least one preset simulation for niche discovery. The information about the found or analyzed niche description region is generated through the niche discovery reporting information generation unit 5330, and the reporting information is generated in a file such as a web or a pdf. On the other hand, the user can communicate with the patent information system 10000 through the niche technology discovery UI unit 5340 of the present invention.

도 16은 본 발명의 니치 기술 영역 발견 시스템(5000)의 일 실시예적 구현예이다. 상기 도 16에는 본 발명의 니치 기술 발견 UI부(5340)를 구성하는 UI부가 도시되어 있다. 상기 UI부에는 가로축과 세로축의 선택을 지원하는 프레임 선택 UI부(UI110), 니치 관점의 선택을 지원하는 니치 관점 선택 UI부(UI120), 니치 모델의 선택을 지원하는 니치 모델 선택 UI(미도시), 니치 옵션 UI부(UI130) 및 니치 분석 결과를 제공하는 니치 분석 결과 UI(UI140) 및 범례 보기 UI(UI150)가 예시되어 있음을 알 수 있다. 상기 프레임 선택 UI부는 세로 선택부와 가로 선택부를 포함하고 있음을 알 수 있다. 상기 니치 관점 선택 UI부에는 니치 관점을 대분류, 중분류 및 소분류로 구분하여 제공하면서 니치 관점을 선택할 수 있다. 상기 소분류의 니치 관점은 니치 평가 요소에 대응된다. 한편, 적어도 하나 이상의 소분류 니치 관점이 모여 1개의 중분류 니치 관점이 될 수 있는데, 이러한 것을 니치 관점(니치 평가 요소)의 그룹핑이라 하고, 하나의 니치 평가 요소 그룹은 적어도 하나 이상의 니치 평가 요소를 포함한다. 상기 니치 옵션 UI부에는 날짜 기준과 날짜 범위나 연도를 선택할 수 있음을 볼 수 있다.16 is one embodiment implementation of a niche description region discovery system 5000 of the present invention. 16 illustrates a UI unit constituting the niche technology discovery UI unit 5340 of the present invention. The UI unit may include a frame selection UI unit UI110 supporting selection of a horizontal axis and a vertical axis, a niche perspective selection UI unit UI120 supporting selection of a niche perspective, and a niche model selection UI supporting selection of a niche model (not shown) ), The niche option UI unit UI130, and the niche analysis result UI UI140 and the legend viewing UI UI150 providing the niche analysis result are illustrated. It can be seen that the frame selection UI unit includes a vertical selection unit and a horizontal selection unit. The niche perspective selection UI unit may select the niche perspective while providing the niche perspectives into major, middle, and small categories. The above-mentioned niche viewpoints correspond to niche evaluation elements. On the other hand, at least one small classification niche viewpoint may be gathered into one mid-class niche viewpoint, which is called grouping of niche viewpoints (niche evaluation elements), and one niche evaluation element group includes at least one niche evaluation element. . In the niche option UI, it can be seen that a date basis and a date range or year can be selected.

상기 니치 분석 결과 UI부는 n*m(n, m은 자연수)개의 교차 영역을 가지고 있음을 보여 준다. 도 16에서는 세로축(세로 프레임)(UI160)에는 FT 5D107AA이하에 있는 FT들이 나열되어 있으며, 가로축(가로 프레임)(UI170)에는 FT 5D107BB이하에 있는 FT들이 나열되어 있음을 알 수 있다. 이러한 FT들은 각각 세로축과 가로축의 최외곽을 형성하고 있음을 알 수 있다. 한편, 복수 개의 가로축과 복수 개의 세로축이 교차하는 교차 영역에는 니치 분석 정보가 나타나 있음을 볼 수 있다. 도 16에서는 상기 니치 분석 정보로, 각 교차 영역에는 니치 관점이자 니치 평가 요소인 총량 중에서도 권리량 중에서도 출원량에 대한 정보가 표시되어 있음을 알 수 있다. 상기 출원량에 대하여 기 설정된 범례 기준에 따라 색깔 등과 같은 범례 표시가 적용되어 있음도 볼 수 있다. FT 체계에서 테마+영문자2자리(예를 들면5D107AA)를 표기할 때, 5D107AA00과 같이 테마+명문자 2자리+00으로 표기하기도 한다.01부터는 이 코드의 하위가 된다.The niche analysis result shows that the UI unit has n * m (n and m are natural numbers) intersection regions. In FIG. 16, it can be seen that FTs below FT 5D107AA are listed on the vertical axis (vertical frame) UI160, and FTs below FT 5D107BB are listed on the horizontal axis (horizontal frame) UI170. It can be seen that these FTs form the outermost of the vertical axis and the horizontal axis, respectively. On the other hand, it can be seen that the niche analysis information appears in the intersection area where the plurality of horizontal axes and the plurality of vertical axes intersect. In FIG. 16, as the niche analysis information, it can be seen that information on the application amount among the total amount, which is the niche viewpoint and the niche evaluation factor, is displayed in each intersection area. It can also be seen that a legend display such as color is applied according to a preset legend criterion for the application amount. When using the FT system, the theme + English letter 2 digits (for example, 5D107AA) is sometimes referred to as theme + name 2 digits +00, such as 5D107AA00.

상기 니치 분석 결과는 입수되거나 선택된 타겟 문건 집합을 대상으로 하여 처리된다. 상기 니치 기술 영역 발견 시스템(5000)의 정보 처리 방법은 도 플1에 잘 나타나 있다. 상기 니치 기술 영역 발견 시스템(5000)의 교차 영역 프레임 선택부(5111)는 상기 사용자 컴퓨터(100)로부터 제1 특허 분류를 사용하는 제1 특허 분류 집합 및 적어도 하나 이상의 제2 분류를 사용하는 제2 분류 집합에 대한 선택 정보를 입수(Sa11)한다. 도 17는 제1 특허 분류를 사용하는 제1 특허 분류 집합의 선택 방법에 대한 일 구현예를 보여 주고 있다. 상기 사용자는 세로축에 5D107AA01부터 5D107AA20까지의 모든 특허 분류를 선택하고자 하는 경우라면, 상기 사용자는 5D107AA01부터 5D107AA20이 속하는 5D107AA를 선택하면, 상기 교차 영역 프레임 선택부(5111)는 5D107AA의 하위에 있는 5D107AA01부터 5D107AA20까지의 특허 분류 목록을 제공한다. 이때, 상기 사용자가 상위 특허 분류인 5D107AA을 선택하면, 상기 교차 영역 프레임 선택부(5111)는 5D107AA 하에 있는 모든 특허 분류인 5D107AA01부터 5D107AA20까지의 특허 분류가 세로축으로 선택되는 방식으로 정보 처리할 수 있다. The niche analysis result is processed for a target document set obtained or selected. The information processing method of the niche description region discovery system 5000 is well illustrated in FIG. The cross region frame selection unit 5111 of the niche description region discovery system 5000 may include a first patent classification set using a first patent classification and a second classification using at least one second classification from the user computer 100. Obtain selection information for the classification set (Sa11). FIG. 17 illustrates an embodiment of a method of selecting a first set of patent classifications using the first patent classification. If the user wants to select all patent classifications from 5D107AA01 to 5D107AA20 on the vertical axis, when the user selects 5D107AA to which 5D107AA01 to 5D107AA20 belongs, the cross-area frame selection unit 5111 may start from 5D107AA01 below 5D107AA. A list of patent classifications up to 5D107AA20 is provided. In this case, when the user selects the upper patent classification 5D107AA, the cross-region frame selection unit 5111 may process information in such a manner that the patent classifications of 5D107AA01 to 5D107AA20, which are all patent classifications under 5D107AA, are selected on the vertical axis. .

한편, 상기 사용자는 나열된 5D107AA01부터 5D107AA20 중에서 자신이 원하는 적어도 하나 이상의 특허 분류를 선택할 수 있고, 상기 교차 영역 프레임 선택부(5111)는 선택된 특허 분류만으로, 세로축 프레임을 구성할 수 있을 것이다. 동일한 방식으로 가로축을 선택할 수 있을 것이다. 한편, 제3의 축이 새로운 차원으로 필요한 경우, 상기 제3의 축에 해당되는 제3의 프레임도 동등한 방식으로 구성할 수 있을 것이다. 한편, 본 문단에서는 특허 분류를 예시로 들었지만, Catchword나 IndextoUSPC 등과 같은 계층(tree) 구조를 가지는 임의의 정보 구조에서도 본 문단의 발명 사상은 적용될 수 있을 것이다. 즉, 상기 사용자는 적어도 하나 이상의 하위 특허 분류를 가지는 적어도 하나 이상의 상위 특허 분류를 선택함으로써, 상기 선택된 상위 특허 분류에 대한 상기 하위 특허 분류를 상기 제1 특허 분류 집합 또는 제2 분류 집합으로 처리할 수 있을 것이다. 5D107AA01부터 5D107AA20의 특허 분류들이나, 이들 중에서 선택된 적어도 하나 이상의 특허 분류들은 제1 특허 분류 집합을 구성한다. 물론, 제1 특허 분류들이 특허 분류 체계나 기타 분류 체계 상 모두 동일한 형제 관계(sibling)이거나, 공통되는 부모(parent)를 가질 필요는 없을 것이다. 즉, 세로축을 구성하는 특허 분류 중에서 어느 하나 이상은 5D107AA 이하에서 선택되고, 다른 어느 하나 이상은 5H001이하에서 선택될 수 있을 것이다. 상기 특허 분류의 선택은 일본 특허 분류의 FT을 예시로 설명하였지만, 이는 계층 구조를 가지는 다른 특허 분류인 IPC나 USPC, ELCA나 FI에 대해서도 동등하게 적용될 수 있음은 당업자에게 자명할 것이다.On the other hand, the user may select at least one or more patent classifications he wants from 5D107AA01 to 5D107AA20 listed, and the cross region frame selection unit 5111 may configure a vertical axis frame only by the selected patent classification. You can select the horizontal axis in the same way. On the other hand, if a third axis is required as a new dimension, the third frame corresponding to the third axis may be configured in an equivalent manner. Meanwhile, although the patent classification is taken as an example in the present paragraph, the inventive idea of the present paragraph may be applied to any information structure having a tree structure such as Catchword or IndextoUSPC. That is, the user may process the lower patent classification of the selected upper patent classification as the first patent classification set or the second classification set by selecting at least one or more upper patent classifications having at least one lower patent classification. There will be. Patent classifications 5D107AA01 to 5D107AA20, or at least one or more patent classifications selected therefrom, constitute the first patent classification set. Of course, it is not necessary for the first patent classifications to be the same sibling or to have a common parent in the patent classification scheme or other classification schemes. That is, any one or more of the patent classification constituting the vertical axis may be selected from 5D107AA or less, and any other one or more may be selected from 5H001 or less. Although the selection of the patent classification has been described as an example of the FT of the Japanese patent classification, it will be apparent to those skilled in the art that this can be equally applied to other patent classifications having a hierarchical structure, IPC, USPC, ELCA or FI.

한편, 도 17에서 알 수 있듯이, 5D107AA01 이하에는 5D107AA02, 5D107AA02, 5D107AA03과 같은 하위 특허 분류가 더 포함되어 있을 수 있다. 이때, 상기 세로축에 나타날 때, 5D107AA01만 나타나게 할지, 5D107AA01의 하위 특허 분류도 모두 나타나게 할 지는 사용자의 선택에 따라 달라 질 수 있다. 하지만, 정보 처리는 본 발명의 하위 특허 분류 기초 관련 정보 자동 포함 사상에 따라 5D107AA01 이하의 모든 특허 정보는 5D107AA01에 계상되게 된다. 즉, 특정한 특허 문건에 5D107AA02의 표기 있는 경우, 이 표기의 기술 내용은 5D107AA01에도 당연히 포함되므로, 5D107AA01에 대한 특허 정보 분석은 5D107AA01뿐만 아니라 5D107AA02, 5D107AA03, 및 5D107AA04도 함께 포괄하여 수행되어야 한다. 물론, 5D107AA02는 하위 특허 분류가 없으므로, 5D107AA02에 대한 특허 분석 정보에는 5D107AA02만으로 처리함은 당연할 것이다. On the other hand, as can be seen in Figure 17, 5D107AA01 or less may further include lower patent classifications such as 5D107AA02, 5D107AA02, 5D107AA03. In this case, whether to show only 5D107AA01 or all of the lower patent classifications of 5D107AA01 may appear depending on the user's selection. However, in the information processing, all patent information of 5D107AA01 or less is stored in 5D107AA01 according to the idea of automatically including sub-patent classification-related information of the present invention. That is, if a particular patent document has a notation of 5D107AA02, the technical content of this notation is naturally included in 5D107AA01, so that the analysis of patent information on 5D107AA01 should be carried out including 5D107AA02, 5D107AA03, and 5D107AA04 as well as 5D107AA01. Of course, 5D107AA02 does not have a lower patent classification, so it would be natural to treat only 5D107AA02 in patent analysis information for 5D107AA02.

한편, 특허 분류의 표기만으로는 5D107AA01 이하에 5D107AA02, 5D107AA03, 5D107AA04가 있고, 5D107AA05는 5D107AA01와 동등한 레벨에 있다는 것 알 수 없어, 상기 특허 분석 정보의 처리에서 like나 %와 같은 확장자를 활용하는 정보 처리는 특허 분류 타이틀 정보에 도트(dot)가 포함되어 있는 경우에는 불가능하다. 따라서, 본 발명의 데이터 가공부(2000)는 특허 분류 체계를 참조하여 하기 표 1과 같은 구조의 데이터 체계를 생성해 놓는다.On the other hand, the notation of patent classification alone does not indicate that there are 5D107AA02, 5D107AA03, and 5D107AA04 below 5D107AA01, and 5D107AA05 is on the same level as 5D107AA01, so that information processing using an extension such as or% in the processing of the patent analysis information is This is impossible if a dot is included in the patent classification title information. Therefore, the data processing unit 2000 of the present invention generates a data system having a structure as shown in Table 1 below with reference to the patent classification system.

C1C1 C2C2 C3C3 C4C4 C5C5 C6C6 ...... CnCn Self 특허 분류Self patent classification 5D5D 5D1075D107 5D1075D107 5D5D 5D1075D107 5D107AA5D107AA 5D107AA5D107AA 5D5D 5D1075D107 5D107AA5D107AA 5D107AA015D107AA01 5D107AA015D107AA01 5D5D 5D1075D107 5D107AA5D107AA 5D107AA015D107AA01 5D107AA025D107AA02 5D107AA025D107AA02 5D5D 5D1075D107 5D107AA5D107AA 5D107AA015D107AA01 5D107AA035D107AA03 5D107AA035D107AA03 5D5D 5D1075D107 5D107AA5D107AA 5D107AA015D107AA01 5D107AA045D107AA04 5D107AA045D107AA04 5D5D 5D1075D107 5D107AA5D107AA 5D107AA055D107AA05 5D107AA055D107AA05 ...... ...... ...... ...... ...... ...... ...... ......

표 1에서 알 수 있듯이, 자기 자신의 특허 분류가 있을 때, 특허 분류 체계를 참조하여 자신의 상위 특허 분류를 각 계층마다 생성해 놓을 수 있다. 이때, 생성해 놓는 상위 특허 분류는 최상위 특허분류까지 생성해 놓을 수도 있지만, 기 설정된 부분(예를 들면, 확장자 활용 검색이 가능한 수준까지만 생성해 놓을 수도 있다. 위 표 1의 경우에는 C5 계층이 된다.)까지만, 생성해 놓을 수도 있을 것이다. 한편, C3(FT의 경우 테마)와 C1 사이에 임의의 분류 단계를 1개 더 둘 수도 있을 것이며, 이 경우, C1은 반드시 5D가 될 필요는 없을 것이며, 다른 표기가 될 수도 있을 것이다. 즉, n개의 다른 테마가 계층적으로 묶어 하나의 기술군 표기가 될 수 있고, 5D로 시작하는 테마코드를 가지는 기술이 포함되어 있다고 하여 그 기술군 표기가 반드시 5D일 필요는 없을 것이다. 예를 들면, 5D107과 5H001 등이 묶여 Electric Motor Control이라는 기술군명을 가지는 기술군을 형성할 수도 있을 것이다.As shown in Table 1, when there is a patent classification of its own, it is possible to create its own upper patent classification for each layer by referring to the patent classification system. In this case, the generated upper patent classification may be generated up to the highest patent classification, but may be generated only up to a predetermined level (for example, a level capable of searching for the extension utilization. Only up to.) On the other hand, there may be one more arbitrary sorting step between C3 (the theme for FT) and C1, in which case C1 will not necessarily have to be 5D, and may be in other notation. That is, n different themes may be hierarchically grouped into a description of a technology group, and a description having a theme code starting with 5D may not be necessarily 5D. For example, 5D107 and 5H001 may be tied together to form a technology group having a technology group name of Electric Motor Control.

상기 표 1과 같은 계층 구조는 IPC나 USPC 등과 같은 다른 특허 분류에 대해서도 동일하게 생성할 수 있을 것이다. 예를 들어 H01F 1/032가 있는 경우, 하기와 같은 IPC 분류 체계 정보를 활용하여 하기 표 2와 같은 정보를 생성할 수 있을 것이다.The hierarchical structure as shown in Table 1 may be generated similarly for other patent classifications such as IPC or USPC. For example, if there is H01F 1/032, the information as shown in Table 2 may be generated by using the following IPC classification system information.

(1) 섹션 : H 전기 (1) Section: H Electric

(2) 클래스 : H01 기본적 전기소자 (2) Class: H01 Basic Electrical Components

(3) 서브클래스 : H01F 자석 (3) Subclass: H01F Magnet

(4) 메인그룹 : H01F 1/00 자성재료를 특징으로 하는 자석 또는 자성체 (4) Main group: magnets or magnetic bodies characterized by H01F 1/00 magnetic material

(5) 1-도트 서브그룹 : H01F 1/01 ● 무기재료로 된 것 (5) 1-dot subgroups: H01F 1/01 • Inorganic materials

(6) 2-도트 서브그룹 : H01F 1/03 ● ● 보자력에 의해 특징 되는 것 (6) Two-dot subgroups: H01F 1/03 ● characterized by the coercivity

(7) 3-도트 서브그룹 : H01F 1/032 ● ● ● 경질 자성재료의 것(7) 3-dot subgroups: H01F 1/032 ● ● ● of hard magnetic material

C1C1 C2C2 C3C3 C4C4 C5C5 C6C6 C7C7 ...... CnCn Self 특허 분류Self patent classification HH H01H01 H01FH01F H01F 1/00H01F 1/00 H01F 1/01H01F 1/01 H01F 1/03H01F 1/03 H01F 1/032H01F 1/032 H01F 1/032H01F 1/032

이에 따라, 표 3을 참조하면서, 특허 정보 분석에서 하위 특허 분류 기호 정보 자동 포함 사상에 대해서 더욱 더 상세하게 설명한다.Accordingly, referring to Table 3, the idea of automatically including lower patent classification code information in patent information analysis will be described in more detail.

Self 특허 분류Self patent classification Self 특허 분류의 문건수Number of Documents in Self Patent Classification 합산 처리되는 특허 분류의 문건수The number of documents of the sum of patents processed ...... Self 분류가 받은 피인용수Citations Received by Self Classification Self 분류가 받은 피인용수Citations Received by Self Classification 5D107AA015D107AA01 1One 1010 ...... 55 1414 5D107AA025D107AA02 22 22 ...... 44 44 5D107AA035D107AA03 33 33 ...... 33 33 5D107AA045D107AA04 44 44 ...... 22 22 5D107AA055D107AA05 55 55 ...... 1One 1One

표 3에서 알 수 있듯이, 5D107AA02, 5D107AA03, 5D107AA04의 값이 자신들의 상위인 5D107AA01에 합산됨을 알 수 있다. 본 발명의 특허 정보의 분석에서는 특정 특허 분류를 기준으로 분석 정보를 생성할 때, 그 특정 특허 분류의 하위 특허 분류가 표기되어 있는 특허 문건의 정보도 포함시켜 특허 정보를 생성한다.As can be seen in Table 3, it can be seen that the values of 5D107AA02, 5D107AA03, and 5D107AA04 are added to their upper 5D107AA01. In the analysis of patent information of the present invention, when generating analysis information based on a specific patent classification, the patent information is generated by including information of a patent document in which a lower patent classification of the specific patent classification is indicated.

이어, 상기 니치 기술 영역 발견 엔진은 상기 제1 특허 분류 집합 및 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성(Sa12)한다. 상기 예는 5D107AA의 하위에 있는 특허 분류와 과 5D107BB의 하위에 있는 특허 분류로 교차 영역 집합이 구성됨을 보여 주고 있다. 상기 교차 영역은 도 16에서와 같이 특허 분류의 타이틀 정보가 프레임에 표기되는 것이 바람직하나, 출력 특성(5D107AA01)와 같이 특허 분류 타이틀과 전체 특허 분류 코드가 다 보이게 하거나, 출력특성(AA01)과 같이 타이틀 정보와 간략한 형태의 특허 분류 코드가 보이는 방식으로 처리할 수도 있을 것이다.Subsequently, the niche description region discovery engine configures (Sa12) an intersection region set having at least two or more intersection regions using the first patent classification set and the second classification set. The above example shows that a cross-sectional set consists of a patent classification under 5D107AA and a patent classification under 5D107BB. 16, the title information of the patent classification is preferably displayed on the frame as shown in FIG. 16. However, the patent classification title and the entire patent classification code are shown as shown in the output characteristic 5D107AA01, or as shown in the output characteristic AA01. It may be processed in such a way that title information and a short form of patent classification code are shown.

제1 특허 분류를 PCi라 하고, 제2 분류를 Cj하고 할 때, 교차 영역은 (PCi, Cj)로 특정되며, 교차 영역 집합은 (PCi, Cj)의 집합이 된다. 이때, 상기 (PCi, Cj)가 표시되는(나타나거나 배치되는) 방식은 크게 3가지가 있다. 첫째 배치 방법은 상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치되는 것이다. 이 경우, 교차 영역에 값이 없는 공백 셀이 존재할 수 있게 되며, 열 모두가 공백이거나, 행 모두가 공백인 공백열과 공백행이 존재하게 된다. 특히, 입수되는 타겟 문건 집합이 작은 경우에는 프레임을 적절하게 선택하더라도 다수의 공백 셀과 공백 열, 공백 행이 나타나게 된다. 둘째 배치 방법은 제1 특허 집합의 특허 분류에 대응되는 교차 영역 집합이 없는 없거나 제2 분류 집합의 제2 분류에 대응되는 교차 영역 집합이 없는 경우, 교차 영역 집합이 없는 특허 분류나 제2 분류를 제외하고 교차 영역 집합이 존재하는 특허 분류나 제2 분류에 대하여, 상기 제1 특허 분류 집합이 특허 분류 체계 상에서 나열되는 순서 및 상기 제2 분류 집합이 상기 제2 분류 체계 상에서 나열되는 순서를 기준으로 배치되는 것이다. 이 경우, 적어도 공백 열이나 공백 행이 모두 존재할 가능성은 현저하게 떨어지게 된다. 셋째 배치 방법은 상기 제1 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 상기 제2 프레임축에 가까이 배치하고, 상기 제2 프레임축 방향의 모든 교차 영역에 대한 교차 영역값의 합계가 높을수록 상기 제1 프레임축에 가까이 배치되는 것이다. 이 경우, 모서리 쪽으로 갈수록 니치 평가 요소값이나 니치 평가값 등이 높거나 낮은 교차 영역들이 모이게 된다. 상기 제1 배치 방법, 상기 제2 배치 방법 및 상기 제3 배치 방법은 상기 사용자의 선택이나 상기 시스템의 설정에 따라 선택될 수 있을 것이다.When the first patent classification is called PCi and the second classification is Cj, the intersection area is specified as (PCi, Cj), and the intersection area set is a set of (PCi, Cj). At this time, there are three ways in which (PCi, Cj) is displayed (appeared or arranged). The first arrangement method is based on the order in which the first set of patent classifications are arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system. In this case, there may be blank cells without values in the intersection area, and there may be blank columns and blank rows in which all columns are blank or all rows are blank. In particular, when the set of target documents obtained is small, a large number of blank cells, blank columns, and blank rows appear even if the frame is properly selected. The second method of placement may include a patent classification or a second classification having no cross-region set when there is no cross-region set corresponding to the patent classification of the first patent set or no cross-region set corresponding to the second classification of the second classification set. Except for a patent classification or a second classification in which a cross-sectional set exists, based on the order in which the first patent classification set is listed on the patent classification system and the order in which the second classification set is listed on the second classification system. To be deployed. In this case, the possibility of having at least both empty columns or empty rows is significantly reduced. In a third arrangement method, the higher the sum of the intersection area values for all the intersection areas in the first frame axis direction is, the closer the second frame axis is, and the intersection area values for all the intersection areas in the second frame axis direction are obtained. The higher the sum, the closer the first frame axis is. In this case, intersection regions having a higher or lower niche evaluation factor value or a niche evaluation value are collected toward the edge. The first placement method, the second placement method and the third placement method may be selected according to the user's selection or the setting of the system.

이어, 상기 니치 분석 정보 생성부(5120)는 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성(Sa13)한다. 상기 니치 분석 정보 중의 하나는 니치 평가 요소값이며, 니치 평가 요소는 니치 관점이 될 수 있다. 상기 니치 기술 영역 발견 시스템(5000)의 니치 분석 정보 생성부(5120)는 입수된 타겟 문건 집합을 대상으로 하여, 상기 타겟 문건 집합 중 상기 (PCi, Cj)라는 교차 영역에 대응되는 부분 타겟 문건 집합을 대상으로 기 설정된 기 설정된 니치 평가 요소에 대응되는 니치 평가 요소값을 생성한다. 상기 니치 평가 정보의 생성은 상기 니치 분석 정보 생성부(5120)의 니치 관점 분석 정보 생성부가 수행한다. 상기 n차원 프레임 처리부(5110)의 상기 니치 분석 정보 표시부(5113)는 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시(Sa14)한다. 즉, 상기 니치 분석 정보 표시부(5113)는 생성된 니치 분석 정보를 표시하는 기능을 수행한다. 상기 니 도 16은 출원수란 니치 관점에 대응되는 니치 평가 요소에 대하여 각 (PCi, Cj)별로 니치 평가 요소값이 생성되고, 생성된 니치 평가 요소값이 표시되고 있음을 보여 줄 수 있다. 한편, 상기 니치 기술 영역 발견 시스템(5000)의 니치 분석용 문건 집합 입수부(5310)는 검색이나 쿼리 조건 설정 또는 관리하는 문건 집합군에서의 선택 등을 활용하여, 적어도 하나 이상의 문건을 포함하는 타겟 문건 집합을 입수한다.Subsequently, the niche analysis information generation unit 5120 converts one or more niche analysis information into at least one intersection region constituting the intersection region set based on the intersection region document set corresponding to the intersection region as the intersection region value. Create (Sa13). One of the niche analysis information is a niche evaluation element value, and the niche evaluation element may be a niche viewpoint. The niche analysis information generation unit 5120 of the niche description region discovery system 5000 targets the obtained target document set, and sets a partial target document set corresponding to the intersection area of the (PCi, Cj) among the target document sets. A niche evaluation element value corresponding to a preset niche evaluation element is generated. The niche evaluation information is generated by the niche perspective analysis information generator of the niche analysis information generator 5120. The niche analysis information display unit 5113 of the n-dimensional frame processing unit 5110 crosses any one or more of the intersection area value processing result of processing the intersection area value and the intersection area value on a predetermined basis in the intersection area. The area is displayed (Sa14). That is, the niche analysis information display unit 5113 performs a function of displaying the generated niche analysis information. 16 illustrates that the number of niche evaluation elements is generated for each (PCi, Cj) of the niche evaluation elements corresponding to the niche viewpoints, and the generated niche evaluation element values are displayed. On the other hand, the niche analysis document set acquisition unit 5310 of the niche description region discovery system 5000 utilizes a search or query condition setting or selection from a document set group to manage, a target including at least one document Obtain a set of documents.

도 16에서는 (PCi, Cj)에 대하여, 가장 간단한 니치 관점이자 니치 평가 요소인 출원수에 대한 니치 분석 정보가 생성되었지만, 상기 니치 분석 정보는 출원수에 한하지 않고, 하기 표 4내지 표7과 같은 다양한 니치 평가 요소에 대해서도 생성될 수 있을 것이다. 하기, 표 4 내지 표7은 이러한 니치 평가 요소를 보여 주고 있다.In FIG. 16, for (PCi, Cj), the niche analysis information for the number of applications, which is the simplest niche viewpoint and the niche evaluation element, is generated, but the niche analysis information is not limited to the number of applications. The same may be generated for various niche evaluation elements. Tables 4 to 7 below show these niche evaluation elements.

표 4는 양적 속성에 관련된 니치 평가 요소의 예를 보여 주고 있다.Table 4 shows examples of niche evaluation factors related to quantitative attributes.

표 5는 양적 속성에 관련된 니치 평가 요소의 예를 보여 주고 있다.Table 5 shows examples of niche evaluation factors related to quantitative attributes.

표 6은 증감 속성에 관련된 니치 평가 요소의 예를 보여 주고 있다.Table 6 shows examples of niche evaluation factors related to the increase and decrease attributes.

표 7은 신규(도입/참여) 속성에 관련된 니치 평가 요소의 예를 보여 주고 있다.Table 7 shows examples of niche evaluation factors related to new (introduction / participation) attributes.

상기 표 4 내지 표 7에 있어서, C3는 IPC, FI, ECLA의 경우 subclass, FT의 경우 테마, USPC의 경우 class가 되며, C4는 IPC, FI, ECLA의 경우 main gropup, FT의 경우 테마 직하위, USPC의 경우 도트가 없는 subclass가 된다.In Tables 4 to 7, C3 is a subclass for IPC, FI, and ECLA, a theme for FT, and a class for USPC, and C4 is a main gropup for IPC, FI, and ECLA, and a theme under the theme for FT. In case of USPC, it becomes subclass without dot.

도 18은 다른 니치 평가 요소(다른 니치 관점)에 대하여 상기 니치 분석 정보 생성부(5120)가 니치 분석 정보를 생성하기 위하여, 니치 관점을 선택하고 있는 모습을 보여 주는 도면이다. 상기 니치 관점이 선택되고 나면 새로운 분석 결과가 나타나게 된다. 상기 도 18과 같이 상기 니치 기술 영역 발견 시스템(5000)은 다수의 니치 평가 요소에 대하여 니치 평가 요소를 선택할 수 있도록 지원하고, 생성된 니치 평가 요소값을 생성해 상기 사용자에게 보여 줄 수 있다.FIG. 18 is a diagram showing that the niche analysis information generation unit 5120 selects a niche viewpoint for different niche evaluation elements (different niche viewpoints) to generate niche analysis information. Once the niche perspective has been selected, a new analysis result will appear. As illustrated in FIG. 18, the niche description area discovery system 5000 may support selecting a niche evaluation element for a plurality of niche evaluation elements, and generate and display the generated niche evaluation element values to the user.

한편, 상기 니치 기술 영역 발견 시스템(5000)은 니치 옵션을 통하여 타겟 문건 집합을 한정할 수 있으며, 이는 도 13에 잘 나와 있다. 도 13에서 알 수 있듯이, 상기 니치 기술 영역 발견 시스템(5000)은 타겟 문건 집합 입수(Sa41)하고, 상기 사용자 컴퓨터(100)로부터 제1 특허 분류를 사용하는 제1 특허 분류 집합 및 적어도 하나 이상의 제2 분류를 사용하는 제2 분류 집합에 대한 선택 정보를 입수(Sa42)하며, 상기 제1 특허 분류 집합 및 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성(Sa43)하고, 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성(Sa44)하며, 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시(Sa45)한다. 이어, 상기 사용자 컴퓨터(100)로부터 상기 니치 옵션 처리부(5130)를 통하여 기간 한정, 주체 한정, 기술 분야 한정, 개별 특허 한정 중 어느 하나 이상에 대한 니치 옵션 정보를 입수(Sa46)하고, 상기 니치 옵션을 반영하여 교차 영역 문건 집합을 한정(Sa47)하며, 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 한정된 교차 영역 문건 집합을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성(Sa48)하고, 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시(Sa49)한다.Meanwhile, the niche description region discovery system 5000 may define a target document set through a niche option, which is illustrated in FIG. 13. As can be seen in FIG. 13, the niche description region discovery system 5000 obtains a target document set Sa41 and uses a first patent classification set and at least one or more agents using a first patent classification from the user computer 100. Obtain selection information on a second classification set using two classifications (Sa42), and construct a cross-region set having at least two or more intersection areas using the first patent classification set and the second classification set (Sa43). And generating at least one niche analysis information as an intersection area value based on the intersection area document set corresponding to the intersection area with respect to at least one intersection area constituting the intersection area set (Sa44). Display at least one of the intersection area value and the intersection area value processing result obtained by processing the intersection area value on the basis of a predetermined value in (S) a45). Subsequently, the niche option information is obtained from the user computer 100 through the niche option processing unit 5130 for niche option information for any one or more of a period limitation, a subject limitation, a technical field limitation, and an individual patent limitation (Sa46). The set of intersection region documents is defined by reflecting the intersection, and at least one intersection region constituting the intersection region set crosses one or more niche analysis information based on the limited intersection region document set corresponding to the intersection region. An area value is generated (Sa48), and at least one of the intersection area value processing result obtained by processing the intersection area value and the intersection area value on a predetermined basis is displayed on the intersection area (Sa49).

한편, 상기 니치 기술 영역 발견 시스템(5000)은 상기 교차 영역 프레임 선택부(5111)를 통하여 선택된 가로축은 고정한 채 세로축을 다른 특허 분류로 변경하거나, 세로축은 고정한 채, 세로축을 변경하는 기능을 지원한다. 이러한 정보 처리 방법은 도 14에 잘 나와 있다. 상기 니치 기술 영역 발견 시스템(5000)은 타겟 문건 집합 입수(Sa31)하고, 상기 사용자 컴퓨터(100)로부터 제1 특허 분류를 사용하는 제1 특허 분류 집합 및 적어도 하나 이상의 제2 분류를 사용하는 제2 분류 집합에 대한 선택 정보를 입수(Sa32)하며, 상기 제1 특허 분류 집합 및 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성(Sa33)하여, 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성(Sa34)하고, 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시(Sa35)한다. Sa35의 한 형태는 도 16이 될 수 있다. 이때, 도 19에서와 같이 상기 교차 영역 프레임 선택부(5111)를 이용하여, 제1 특허 분류 집합 및 제2 특허 분류 집합 중 어느 하나 이상에 대한 변경 정보 입수(Sa36)할 수 있다. 제1 특허 분류 집합 등을 선택할 때는 다단계로 탐색해 가면서 선택될 수 있으며, 도 19는 이를 잘 보여 준다. 즉, 5D107BB를 선택하기 위해서는 최상위인 FT->Electric Motor Control->5D107->5D107BB를 선택하는 과정을 거처야 한다. 통상적으로 탐색 과정은 주변부를 탐색해 가는 것이 보통이므로, 탐색의 대상이 되는 차원축(프레임)을 변경하려 할 때, 위와 같이 루트(root)로부터 차례로 선택해 가는 것은 불편함을 초래할 수 있을 것이다. 이에, 상기 교차 영역 프레임 선택부(5111)은 선택된 특허 분류나 선택된 분류의 이력(history)을 관리해 주어, 사용자가 특허 분류나 분류를 변경하려 할 때, 직전에 선택된 특허 분류나 분류를 최우선적으로 보여 줄 수도 있다. 이 경우 도 19에서는 맨 위에 맨 우측에 용도가 선택되는 창을 최우선적으로 보여 줄 수도 있을 것이다. 이 경우, 최상위인 FT->Electric Motor Control->5D107->5D107BB로 선택해 가는 수고를 덜 수 있게 된다. 상기 교차 영역 프레임 선택부(5111)는 변경이 발생한 상기 제1 특허 분류 집합 또는 상기 제2 분류 집합을 사용하여 적어도 2개 이상의 교차 영역을 가지는 교차 영역 집합을 구성(Sa37)한다. 상기 니치 분석 정보 생성부(5120)는 상기 교차 영역 집합을 구성하는 적어도 하나 이상의 교차 영역에 대하여, 상기 교차 영역에 대응되는 교차 영역 문건 집합을 기준으로 하나 이상의 니치 분석 정보를 교차 영역값으로 생성(Sa38)하고, 상기 니치 분석 정보 표시부(5113)는 상기 교차 영역에 상기 교차 영역값 및 상기 교차 영역값을 기 설정된 기준으로 처리한 교차 영역값 처리 결과 중 어느 하나 이상을 상기 교차 영역에 표시(Sa39)한다. 도 20은 가로 프레임이나 세로 프레임 중 어느 하나 이상이 변경되고, 변경된 프레임을 기준으로 니치 관점에 따른 니치 분석 정보가 생성되는 예를 보여 주고 있다. 도 20에서는 세로축에 5D107AA이하의 특허 분류와 가로축에 5D107CC 이하의 특허 분류로 프레임이 형성되고 있음을 보여 주고 있다. 프레임이 변경 됨에 따라, (PCi,Cj)가 (PCz,Cw)가 될 수 있으며, 상기 교차 영역 집합 생성부(5112)는 (PCz,Cw)에 대하여 교차 영역 집합을 생성한다. 이 경우 도 16과 도 20를 비교하는 경우, 모든 교차 영역이 바뀌었기 때문에 바뀐 교차 영역에 대하여 니치 관점인 출원수에 대한 니치 평가 요소값을 보여 주고 있음을 알 수 있다.Meanwhile, the niche description region discovery system 5000 supports a function of changing the vertical axis to another patent classification while fixing the horizontal axis selected through the crossing area frame selection unit 5111 or changing the vertical axis while fixing the vertical axis. . This information processing method is well illustrated in FIG. The niche description region discovery system 5000 obtains a target document set Sa31 and uses a first patent classification set using a first patent classification from the user computer 100 and a second using at least one or more second classifications. Obtaining selection information about a classification set (Sa32), and constructing an intersection region set having at least two or more intersection regions by using the first patent classification set and the second classification set (Sa33), and setting the intersection region set. For at least one intersection region constituting a, one or more niche analysis information is generated as an intersection region value based on a set of intersection region documents corresponding to the intersection region (Sa34), and the intersection region value and the intersection region value are generated in the intersection region. Any one or more of the intersection area value processing result of processing the intersection area value based on a preset reference is displayed on the intersection area (Sa35). One form of Sa35 may be FIG. 16. In this case, as shown in FIG. 19, the cross region frame selection unit 5111 may be used to obtain change information (Sa36) for any one or more of the first patent classification set and the second patent classification set. When selecting the first patent classification set and the like, it may be selected while searching in multiple stages, and FIG. 19 illustrates this well. In other words, in order to select 5D107BB, it is necessary to go through FT-> Electric Motor Control-> 5D107-> 5D107BB. In general, since the search process generally searches for the periphery, it may be inconvenient to select from the root in order to change the dimensional axis (frame) to be searched. Accordingly, the cross region frame selection unit 5111 manages the history of the selected patent classification or the selected classification, and when a user attempts to change the patent classification or classification, the selected patent classification or classification is prioritized first. You can also show In this case, FIG. 19 may first show a window in which a use is selected at the top right. In this case, you can save yourself the trouble of choosing FT-> Electric Motor Control-> 5D107-> 5D107BB. The intersection area frame selection unit 5111 configures an intersection area set having at least two or more intersection areas using the first patent classification set or the second classification set in which the change has occurred (Sa37). The niche analysis information generator 5120 may generate at least one niche analysis information as a cross region value for at least one cross region constituting the cross region set based on a cross region document set corresponding to the cross region. The niche analysis information display unit 5113 displays one or more of the intersection area value and the intersection area value processing result of processing the intersection area value and the intersection area value on a predetermined basis in the intersection area in the intersection area (Sa39). )do. FIG. 20 illustrates an example in which at least one of a horizontal frame and a vertical frame is changed and niche analysis information according to a niche perspective is generated based on the changed frame. 20 shows that a frame is formed with a patent classification of 5D107AA or less on the vertical axis and a patent classification of 5D107CC or less on the horizontal axis. As the frame is changed, (PCi, Cj) may be (PCz, Cw), and the cross region set generation unit 5112 generates the cross region set with respect to (PCz, Cw). In this case, when comparing FIG. 16 and FIG. 20, since all the intersection area | regions changed, it turns out that it shows the value of the niche evaluation element with respect to the number of applications which are niche viewpoints about the changed intersection area | region.

하기 표 8은 도 16 내지 도 20에 나타나는 특허 분류 타이틀과 특허 분류 코드를 대응시켜 놓은 참고용 자료이다.Table 8 below is reference data in which a patent classification title and a patent classification code shown in FIGS. 16 to 20 are corresponded.

상기 니치 분석 종류 제시부(5123)는 표 4 내지 표7과 같은 니치 관점을 대분류, 중분류 및 소분류 니치 관점별로 니치 분석 종류를 제시할 수 있다. 복수 개의 니치 관점을 포함하여 생성되는 니치 모델이 있는 경우, 상기 니치 모델 종류의 선택을 돕는 것은 본 발명의 니치 모델 종류 제시부(5124)가 담당한다.The niche analysis type presenting unit 5123 may present a niche analysis type according to the major, middle, and small classification niche viewpoints as shown in Tables 4 to 7. When there is a niche model generated by including a plurality of niche views, the niche model type presenting unit 5124 of the present invention is responsible for assisting in selecting the niche model type.

이어, 도면을 참조하면서 본 발명의 니치 평가 모델 생성 방법에 대해서 설명한다. Next, the niche evaluation model generation method of this invention is demonstrated, referring drawings.

니치 평가 모델이란 특정한 교차 영역 (PCi,Cj) 또는 특정한 조건으로 특정되는 특정 문건 집합이 니치성을 얼마나 가지는 지를 평가하는 니치 점수 내지는 니치 등급을 생성하는 모델을 말한다. 니치 평가 모델이 생성되기 위해서는 독립 변수 및 독립 변수값과 종속 변수 및 종속 변수값이라는 2가지 조건이 선행되어야 한다. 니치 평가 요소는 니치 평가 모델을 생성하기 위한 독립 변수가 될 수 있으며, 상기 특정 문건 집합에 대해서 니치 평가 요소로 니치 평가 요소값이라는 독립 변수값을 생성할 수 있음은 전술한 바와 같다. 니치 평가 모델 수립을 위해서 문제시 되는 것은 종속 변수의 설정이다. 종속 변수는 전체적으로 니치 속성을 반영하는 변수이어야 한다. 니치 평가 요소값이라는 입력 변수값이 있을 때, 이러한 입력 변수값에 반응하는 값으로 니치라는 속성을 전체적으로 잘 표상하는 것이 종속 변수가 된다.The niche evaluation model refers to a model that generates niche scores or niche grades for evaluating how niche the particular set of documents specified by a specific intersection area (PCi, Cj) or a specific condition is. In order to generate a niche evaluation model, two conditions must be preceded: independent and independent variable values and dependent and dependent variable values. As described above, the niche evaluation element may be an independent variable for generating a niche evaluation model, and an independent variable value of the niche evaluation element value may be generated as the niche evaluation element for the specific document set. A problem for establishing a niche evaluation model is the setting of dependent variables. The dependent variable must be a variable that reflects niche attributes as a whole. When there is an input variable value called a niche evaluation element value, the dependent variable is a well-represented attribute of nichi as a value in response to the input variable value.

도 21은 특정 국가의 특허에 대하여, 기술 영역 A에 대한 특허 출원수의 시기적 변화를 나타내는 그래프의 일례이며, 도 22는 기술 영역 A, B, C에 대한 특허 출원수의 시기적 변화를 나타내는 그래프이다. 니치 속성을 가지는 기술 영역이란 도 21 내지 도 22에서 알 수 있듯이, 특정 시점을 기준으로 특허 출원수 등이 가파르게 상승하고 있는 특징이 있으되, 가파른 상승 이후에는 니치가 아닐 것이다. 한편, 특정 기술 영역에 특허 출원수 등이 거의 없는 경우에는 비어 있는 틈새(niche) 기술일 수도 있지만, 쓸모가 적어나 없어 버려진(deserted) 기술일 수도 있다. 본 발명에서 니치성이 큰 기술 영역에 대한 기본적인 가정은 니치성이 큰 기술 영역은 시간의 경과에 따라, 과거의 특정 시점, 현재, 또는 적어도 미래의 특정 시점에 그 기술 영역에 대한 채움(filling)의 속성이 있다는 것이다. 즉, 니치 기술 영역은 시간의 경과에 따라, 그 영역에 대한 특허 출원수가 증가하거나, 출원인 수가 증가하거나, 피 인용수가 증가하거나, 발명자 수가 증가하거나 하는 등 기타 어떠한 형태로든 채움이 발생한다. 따라서, 채움의 대상 또는 채움의 결과로 나타나는 수치값 또는 이러한 수치값을 복합하여 생성하는 수치값이 종속 변수가 될 수 있다. 예를 들면, 5년 기준 출원수 증가량, 5년 기준 출원수 증가율, 5년 기준 출원인수 증가량, 5년 기준 출원인수 증가율 또는 (5년 기준 출원수 증가율)*(5년 기준 출원인수 증가율) 등이 종속 변수의 한 예가 될 수 있다. 더욱 구체적으로는 분할 시점 기준으로 과거 5년 기준의 출원수와 분할 시점 기준 이후의 5년 동안의 출원수가 비교되는 경우, 5년 기준 출원수 증가량 및 5년 기준 출원수 증가율을 구할 수 있게 된다. 이러한 종속 변수가 본 발명의 니치 기준 요소가 되며, 종속 변수값은 니치 기준 요소값이 된다.FIG. 21 is an example of a graph showing a temporal change in the number of patent applications for the technology region A with respect to a patent of a specific country, and FIG. 22 is a graph showing a temporal change in the number of patent applications for the technology regions A, B and C. FIG. . As can be seen from FIGS. 21 to 22, the technical region having the niche attribute has a characteristic in which the number of patent applications and the like rises steeply based on a specific time point, but it will not be a niche after the steep rise. On the other hand, when there are almost no patent applications in a specific technical area, it may be an empty niche technology, but it may be a technology that is discarded due to its uselessness. In the present invention, the basic assumption for the technical region having a high niche is that the technical region having a high niche is filling the technical region at a specific point in the past, present, or at least a certain point in the future as time passes. Is that there is a property of. That is, the niche description region fills in any other form over time, such as an increase in the number of patent applications, an increase in the number of applicants, an increase in the number of inventors, or the like. Accordingly, the dependent variable may be a numerical value resulting from the object of filling or a result of filling, or a numerical value generated by combining these numerical values. For example, five-year application growth, five-year application growth, five-year application growth, five-year application growth, or (five-year application growth) * (five-year application growth), etc. This is an example of a dependent variable. More specifically, when the number of applications in the past five years and the number of applications for five years after the splitting point are compared on the basis of the splitting point, the increase in the number of five-year applications and the increase in the number of five-year applications can be obtained. This dependent variable becomes the niche reference element of the present invention, and the dependent variable value becomes the niche reference element value.

한편, 사용자에게 니치 분석 정보는 특정 과거 시점이 아니라, 상기 니치 기술 영역 발견 시스템(5000)을 사용하는 현재 시점에서 특정 기술 영역이 니치 속성을 얼마나 가지는 지가 관심사일 것이다. 따라서, 니치 평가 모델도 특정 시점을 기준으로 니치성을 평가해 줄 수 있어야 한다. 따라서, 종속 변수나 독립 변수에 있어서, 변수값을 계산할 때, 시점의 선택이 중요하게 된다. 즉, 모델 수립의 특성 상 특정 과거 시점까지의 데이터로 그 특정 과거 시점 이후를 예측하는 속성 상, 모델 수립에서도 시점의 선택은 변수값에 큰 영향을 미치게 된다.Meanwhile, the niche analysis information may be of interest to the user, not how much the specific technology region has the niche attribute at the current time using the niche description region discovery system 5000, not the specific past viewpoint. Therefore, the niche evaluation model should be able to evaluate niche based on a specific time point. Therefore, in the dependent variable or the independent variable, the selection of the viewpoint becomes important when calculating the variable value. That is, due to the nature of model establishment, the data up to a certain past point in time are predicted after the specific past point in time, and thus the selection of the point of view in the model establishment has a great influence on the variable value.

본 발명에서는 상당히 큰 N개의 기술 영역에 대하여, 샘플 문건 집합을 입수하고, 입수된 샘플 문건 집합을 분할한다. 분할하는 방법은 크게 2가지가 있다. 첫째는 N개의 샘플 문건 집합에 공통된 시점 분할 기준을 적용하는 공통 시점 분할 방식이다. 이는, 적어도 하나 이상의 특정 시점을 기준으로 N개의 샘플 문건 집합을 강제 분할하는 방식이다. 예를 들면, N개의 샘플 문건 집합을 공개일이나 최선일(우선권 주장 시 우선일)을 기준으로 2005년 이전과 2006년 이후로 2분할하거나, 5년 단위로 n분할 하는 방법이다. 둘째는 N개의 샘플 문건 집합에 공통된 속성 분할 기준을 적용하는 공통 속성 분할 방식이다. 이 방식은 N개의 샘플 문건 집합에 따라 그 속성 분할 기준을 충족하는 시점을 기준으로 분할하는 방법이다. 예를 들면, 3년 단위 출원 증가율이 최초로 25% 이상인 경우, 최초로 25%가 되는 연도나 그 연도에서 1년 또는 2년을 뺀 연도를 기준으로 2분할 하는 방식이다. 이 경우, 샘플 문건 집합마다 이 속성을 만족하는 시점이 다르므로, 분할되는 기준 연도도 다를 수 있으며, 경우에 따라서는 분할이 되지 않을 수도 있을 것이다. 이에 따라 상기 N개의 표본 문건 집합의 크기는 공통 속성 분할 기준의 적용 시 줄어들 수 있을 것이다(to smaller N). 한편, 출원 증가율을 50%로 하는 등과 같이 속성 기준값을 다르게 가져 가면 N개의 표본 문건 집합의 크기는 공통 속성 분할 기준의 적용 시 훨씬 더 줄어들 수 있을 것이다(much smaller N). 전자의 경우는 제1 유형의 니치 발견 모델에 사용될 수 있다면, 후자의 경우는 제2 유형(급진적 니치 영역)의 발견에 사용될 수 있을 것이다. 물론 이때, 상기 표본의 생성은 줄어든 N개의 표본 문건 집합이 사용될 것이다. 즉, 공통 속성의 종류나 공통 속성 기준값의 만족 정도로 다양한 니치 평가 모델이 생성될 수 있게 된다. 설명의 편의상 니치 평가 모델의 생성에 사용되는 표본 문건 집합의 개수는 N으로 표기한다.In the present invention, a sample document set is obtained, and the obtained sample document set is divided into N large technical areas. There are two ways to divide. The first is a common view splitting scheme that applies a common view splitting criterion to a set of N sample documents. This is a method of forcibly dividing a set of N sample documents based on at least one specific time point. For example, N sample documents can be divided into two parts before 2005 and after 2006, or n times by five years, based on the publication date or the best date (priority date when claiming priority). The second is a common attribute splitting scheme that applies common attribute splitting criteria to N sample document sets. This method divides based on a time point when the attribute splitting criteria are satisfied according to a set of N sample documents. For example, if the three-year application growth rate is more than 25% for the first time, it is divided into two years based on the first year of 25% or the year minus one or two years. In this case, since the time points for satisfying the attribute are different for each sample document set, the reference year to be split may also be different, and in some cases, the split may not be performed. Accordingly, the size of the N sample document set may be reduced when the common attribute splitting criteria are applied (to smaller N). On the other hand, if the attribute reference value is taken differently, such as application growth rate of 50%, the size of the N sample document sets may be much smaller when applying the common attribute splitting criteria (much smaller N). The former case could be used for the first type of niche discovery model, while the latter case could be used for the discovery of the second type (radical niche region). Of course, at this time, the generation of the sample will be used a reduced set of N sample documents. That is, various kinds of niche evaluation models can be generated to the degree of satisfaction of the type of common attribute or the common attribute reference value. For convenience of explanation, the number of sample document sets used to generate the niche evaluation model is denoted by N.

상기 도 22에서 예시 되듯이, 공통 시점 분할 방식보다 공통 속성 분할 방식이 상대적으로 더 타당할 수 있다. 예를 들어, 도 22에서 보듯이, 05년을 기준으로 강제 분할하면, 기술 A에 대해서는 니치성이 충분히 반영될 수 있지만, 기술 C에 대해서는 니치성이 충분히 반영되지 못할 수도 있기 때문이다. 기술 A에 대해서는 05년, 기술 B에 대해서는 04년, 기술 C에 대해서는 01년을 기준으로 분할하는 것이 상대적으로 더 타당할 수 있으며, 샘플 데이터의 낭비를 최소화시킬 수 있을 것이다. As illustrated in FIG. 22, the common attribute splitting scheme may be more appropriate than the common view splitting scheme. For example, as shown in FIG. 22, when the forced division is performed based on 2005, the nicheability can be sufficiently reflected for the technology A, but the nicheity may not be sufficiently reflected for the technology C. FIG. Segmentation based on the year 05 for technology A, year 04 for technology B, and year 01 for technology C may be more reasonable, minimizing waste of sample data.

상기와 같이 N개의 표본 문건 집합에 대하여 분할이 있게 되면, 하나의 표본 문건 집합은 제1 부분 표본 문건 집합(분할 시점 이전에 대응되는 문건 집합)과 적어도 하나 이상의 제2 부분 표본 문건 집합(분할 시점 이후에 대응되는 문건 집합)을 구성하게 된다. 이때, 제1 부분 표본 문건 집합이 어떠한 속성을 가질 때, 상기 분할 시점 이후에 그 기술 영역에서 채움(filling)이 일어나는가를 찾아 내는 것이 니치 평가 모델 생성의 핵심을 이룬다. 이때, 니치 평가 모델의 생성에 제2 부분 표본 문건 집합도 사용될 수 있지만, 본 발명에서는 제1 부분 표본 문건 집합을 사용하여, 니치 평가 모델을 생성하는 것이 특징이다. 제1 부분 표본 문건 집합을 기준으로 할 때, 상기 제2 부분 표본 문건 집합은 채움(filling)이 일어난 결과가 된다. 즉, 상기 니치 평가 모델 생성의 핵심은 빠르게 채워진 제2 부분 표본 문건 집합에 기여하는데 기여한 니치 평가 요소를 찾아내는 것이며, 이러한 니치 평가 요소의 니치 평가 요소값은 제1 부분 표본 문건 집합을 기준으로 생성된다.As described above, when there are partitions for a set of N sample documents, one sample document set includes a first partial sample document set (a set of documents corresponding to a split point) and at least one second partial sample document set (a split time). After that, a corresponding document set) is formed. In this case, when the first partial sample document set has an attribute, finding out whether filling occurs in the description region after the splitting time point constitutes the core of the niche evaluation model generation. In this case, although the second partial sample document set may also be used to generate the niche evaluation model, the present invention is characterized by generating the niche evaluation model using the first partial sample document set. Based on the first partial sample document set, the second partial sample document set is a result of filling. In other words, the key to generating the niche evaluation model is to find the niche evaluation factors that contributed to the second set of rapidly filled sample documents, and the niche evaluation element values of these niche evaluation elements are generated based on the first partial sample document set. .

상기와 같이 각 표본 문건 집합마다 종속 변수값이 적어도 하나 이상의 니치 기준 요소값이 생성되고, 독립 변수값으로 각 표본 문건 집합의 제1 부분 표본 문건 집합을 대상으로 적어도 2 이상의 니치 평가 요소에 대한 니치 평가 요소값이 생성되면, 하기 표 9와 같은 니치 모델 생성을 위한 데이터 값이 결정 나게 된다.As described above, at least one niche reference element value having a dependent variable value is generated for each sample document set, and a niche for at least two niche evaluation elements of the first partial sample document set of each sample document set as an independent variable value. When the evaluation element value is generated, data values for generating a niche model shown in Table 9 are determined.

하기 표 10은 상기 표 9의 구체적인 예이다. 표 10은 표본 문건 집합을 FT 단위로 생성했다. 변수명은 상기 표 4 내지 표 7에서 사용된 것들이다.
Table 10 below is a specific example of Table 9. Table 10 generates a sample document set in FT units. Variable names are those used in Tables 4 to 7.

상기 표 10과 같은 데이터로 상기 니치 기술 영역 발견 시스템(5000)은 니치 평가 모델을 생성할 수 있다. 상기 표 10에서는 최말단 또는 타이틀이 1 dot인 FT별로 표본 문건 집합을 구성했지만, FT의 00 레벨을 기준으로 표본 문건 집합을 구성할 수도 있을 것이다. 한편, FT 이외에도, IPC나 USPC 등과 같은 다른 특허 분류의 기 설정된 레벨(최말단 특허 분류 또는 n dot subgroup(IPC의 경우), n dot subclass(USPC의 경우)별 특허 분류)로도 표본 문건 집합을 생성할 수 있을 것이다. 한편, 상기 표본 문건 집합은 특허 문건 집합이므로, 복수 개의 rising 키워드i에 대하여, 키워드 i를 핵심 키워드로 포함하는 특허 문건 집합을 상기 표본 문건 집합이 될 수도 있을 것이다.The niche description region discovery system 5000 may generate a niche evaluation model based on the data shown in Table 10. In Table 10, although a sample document set is configured for each FT whose end or title is 1 dot, the sample document set may be configured based on the 00 level of the FT. On the other hand, in addition to the FT, a sample document set is generated at a predetermined level (the last patent classification or the n dot subgroup (IPC) and the n dot subclass (USPC) patent classification) of other patent classifications such as IPC or USPC. You can do it. Meanwhile, since the sample document set is a patent document set, the sample document set may be a patent document set including a keyword i as a key keyword for a plurality of rising keywords i.

이하, 도 9를 참조하면서 본 발명 사상을 더욱 더 상세하게 설명한다. 상기 니치 기술 영역 발견 시스템(5000)은 N 개의 표본 문건 집합에 대하여, 상기 N개 각각의 표본 문건 집합을 적어도 하나 이상의 분할 기준을 적용하여 2 이상의 부분으로 분할하여 제1 부분 표본 문건 집합과 적어도 하나 이상의 제2 부분 표본 문건 집합을 구성(SM11)하고, 제1 부분 표본 문건 집합과 상기 제2 부분 표본 문건 집합을 대상으로 니치 기준 요소에 대한 니치 기준 요소값 정보를 생성(SM12)하고, 제1 부분 표본 문건 집합을 대상으로 적어도 2 이상의 니치 평가 요소에 대하여 니치 평가 요소값 정보를 생성(SM13)하며, Hereinafter, the spirit of the present invention will be described in more detail with reference to FIG. 9. The niche description region discovery system 5000 divides each of the N sample document sets into two or more parts by applying at least one partitioning criterion to N sample document sets, and sets the first partial sample document set and at least one or more. Construct a second partial sample document set (SM11), generate niche reference element value information for the niche reference element for the first partial sample document set and the second partial sample document set (SM12), and generate the first partial sample document Generate niche evaluation element value information for at least two niche evaluation elements for the document set (SM13),

상기 니치 기준 요소값을 종속 변수값으로 하고, 상기 니치 평가 요소값을 독립 변수값으로 하여 기 설정된 통계 알고리즘을 적용하여 니치 평가 모델을 생성(SM14)한다.The niche evaluation model is generated by applying a predetermined statistical algorithm using the niche reference element value as the dependent variable value and the niche evaluation element value as the independent variable value (SM14).

이어, 상기 표 10과 같은 종속 변수값과 독립 변수값이 샘플별로 준비되어 있을 때, 상기 니치 기술 영역 발견 시스템(5000)의 니치 평가 모델 생성부(5200)는 기 설정된 적어도 하나 이상의 통계 모델을 적용하여 니치 평가 모델을 생성한다. 상기 통계 모델은 선형 회귀 모델을 사용할 수도 있지만, 선형 회귀 모델은 강건함(robust)이 떨어지고, 예측력이 낮은 문제가 있다. 이에, 비선형 회귀 알고리즘 중에서도 기계 학습 알고리즘을 사용하는 것이 여러 면에서 좋은 니치 평가 모델을 생성할 수 있다. 하기에서는 기계 학습 알고리즘 중에서도 부스팅 알고리즘을 사용하는 것에 대해서 설명한다. 하기 설명에서 당업자는 하기 설명으로 부스팅 알고리즘뿐만 아니라 랜덤 포리스트 등과 같은 다른 기계 학습 알고리즘을 사용할 수도 있을 것이다.Subsequently, when the dependent variable values and the independent variable values as shown in Table 10 are prepared for each sample, the niche evaluation model generator 5200 of the niche description region detection system 5000 applies at least one preset statistical model. To generate a niche evaluation model. Although the statistical model may use a linear regression model, the linear regression model has a problem of low robustness and low predictive power. Therefore, among the nonlinear regression algorithms, the use of machine learning algorithms can generate a good evaluation model in many respects. The following describes using a boosting algorithm among machine learning algorithms. In the following description, those skilled in the art will be able to use not only the boosting algorithm but also other machine learning algorithms, such as a random forest, in the following description.

니치 영역을 탐색하고 이에 영향을 미치는 주요한 변수들을 찾는 문제는 통계학의 한 분야인 기계 학습(Machine Learning)분야에서 보편적으로 연구되는 회귀(Regression)문제이다. 이러한 회귀 문제를 푸는 방법에는 가장 기본적인 최소제곱법을 비롯한 수많은 방법들이 있다. 회귀 문제는 아래의 식과 같이 일반적으로 표현할 수 있는데, 이는 어떤 loss function, ψ이 주어져 있을 때 기대값을 최소화하는 함수를 추정하는 문제로 볼 수 있다.The problem of exploring the niche domain and finding the main variables affecting it is the regression problem that is commonly studied in the field of machine learning, which is a branch of statistics. There are many ways to solve this regression problem, including the most basic least squares method. The regression problem can be generally expressed as the following equation, which is a problem of estimating a function that minimizes the expected value when a certain loss function ψ is given.

전통적으로 추정함수 f에 대하여 모수적인 가정을 하거나 선형 가정을 하고 많은 방법들이 연구되어 왔다. 하지만 현실에는 몇 개의 모수를 가지고 설명하기에는 훨씬 복잡한 현상들이 많고 선형 모형은 극히 제한적으로 적용될 수 밖에 없는 한계점을 가지고 있다.　 따라서 복잡한 현상을 설명하기 위해서는 추정함수 f에 대해서 비모수적이고 비선형적인 가정하에서 분석이 이루어져야 한다. 하지만 이러한 조건하에서의 분석은 모수적, 선형적 분석방법보다 많은 어려움이 따른다. 본 과제에서 분석하는 특허자료 또한 굉장히 방대하고 복잡하기 때문에 이러한 자료를 잘 설명하려면 비모수적이고 비선형적인 모형이 수반되어야 한다. 그래서 그러한 방법들 중 대표적이고 우수한 방법이라고 할 수 있는 부스팅 모형을 적용하여 니치 영역 탐색 작업을 수행하였다.Traditionally, many methods have been studied, making parametric or linear assumptions about the estimation function f. However, in reality there are many more complex phenomena to explain with a few parameters, and the linear model has limitations that can be applied in a very limited way. Therefore, in order to explain complex phenomena, the estimation function f needs to be analyzed under nonparametric and nonlinear assumptions. However, analysis under these conditions is more difficult than parametric and linear analytical methods. The patent data analyzed in this project are also very large and complex, so a good description of these data is accompanied by a nonparametric and nonlinear model. Therefore, the niche area search was performed by applying the boosting model, which is a representative and excellent method among such methods.

부스팅(Boosting) 알고리즘은 Freund와 Schapire(1997)에 의해서 처음으로 분류(Classification) 문제를 다루기 위한 방법으로 제안되었다. 그 후 많은 학자들에 의해 활발한 연구가 진행되어 많은 발전이 이루어졌고 현재는 Friedman이 제안한 Gradient Boost algorithm이 보편적으로 널리 쓰이고 있다. The boosting algorithm was first proposed by Freund and Schapire (1997) as a way to deal with classification problems. Since then, many studies have been conducted by many scholars, and many developments have been made. Currently, Friedman's proposed Gradient Boost algorithm is widely used.

위와 같이 정의되는 목적함수롤 최소로 하는 함수를 찾는 것이 목표인데 이 때 국지적으로 gradient의 반대방향으로 움직이면 목적함수의 값이 가장 크게 감소한다고 할 수 이다 그래서 추정함수, f을 아래와 같은 식으로 업데이트 할 수 있다.The goal is to find a function that minimizes the objective function defined as above. In this case, the value of the objective function decreases the most if the object moves in the opposite direction of the gradient. Can be.

Gradient Boost algorithm 알고리즘은 도 23에 잘 요약되어 있다. 즉 이 알고리즘은 각 단계마다 새로운 함수 g을 계속 규합해서 광대한 머신을 만드는 알고리즘이라고 할 수 있다.The Gradient Boost algorithm algorithm is well summarized in FIG. In other words, it is an algorithm that creates a vast machine by continuously combining new functions g at each step.

초기에 부스팅 알고리즘에는 과적합의 문제가 거의 없는지 알았지만 실제로는 과적합 문제가 존재한다. 과적합은 현재 주어진 자료에 대한 적응력이 너무 높아져서 정작 새로운 자료에 대해서는 적응력이 떨어지는 현상을 말한다. 도 24는 과적합의 경우를 그림으로 잘 표현해 주고 있다. 녹색으로 표시된 분류규칙은 현재 주어진 자료에 아주 잘 적응을 해서 잘못 분류한 경우가 하나도 없다 모든 주어진 자료를 완벽하게 분류하였다. 하지만 이러한 분류 규칙이 새로운 자료에 대해서도 잘 작동하리라는 보장은 없는 것이다. 반면 검정선으로 표시된 분류 규칙은 몇 개의 자료를 잘못 분류하고는 있지만 대부분의 자료를 적절히 잘 분류하고 있음을 볼 수 있다. 검정색으로 표시된 분류 규칙이 훨씬 더 자연스러운 분류 규칙이라고 할 수 있다.Initially, we found that the boosting algorithm had almost no problem of overfitting, but there was actually a problem of overfitting. Overfitting is a phenomenon in which adaptability to current data is so high that it becomes inadequate for new data. Fig. 24 well illustrates the case of overfitting. The classification rules, shown in green, are very well adapted to the current data given and are never misclassified. However, there is no guarantee that these classification rules will work for new data. On the other hand, the classification rule marked with a black line shows that some data are misclassified, but most are classified properly. The classification rule in black is a much more natural classification rule.

따라서 현재 주어진 자료에서 구축하는 모형이 주어진 자료를 너무 따라가지 않도록 제약을 주어 과적합 문제를 해결해야 한다. 그래서 Gradient Boost algorithm에도 shrinkage parameter을 도입하여 이러한 과적합 문제를 해결하게 된다. 그리고 Friedman(2002)이 새로이 stochastic gradient boosting algorithm을 제안하는데 이는 함수 g을 추정할 때 자료 전체를 쓰지 말고 일부만을 사용하여 추정하면 성능이 더 좋아진다는 것이다. 특허 자료의 경우도 자료가 상당히 방대하기 때문에 이 방법을 적용하면 시간도 절약할 수 있을 뿐만 아니라 성능 향상도 도모할 수 있다. 이를 정리하면 도 25과 같이 요약할 수 있다. 함수 g을 적합하는 과정에서 Tree을 쓰는 것이 보통이다. 일반적으로는 한번 분기하는 stump을 사용하지만 변수들 간에 교호작용이 있다고 판단이 되면 분기를 여러 번 하는 Tree을 사용할 수도 있다.Therefore, the problem of overfitting should be solved by restricting the model that is constructed from the current data so that it does not follow the given data too much. Thus, the shrinkage parameter is also introduced in the Gradient Boost algorithm to solve this overfitting problem. Friedman (2002) proposes a new stochastic gradient boosting algorithm, which shows better performance when estimating function g using only a part of the data rather than the entire data. In the case of patent data, the data are quite large, so this method not only saves time but also improves performance. This can be summarized as shown in FIG. 25. It is common to use Tree in the process of fitting function g. In general, you use a stump that branches once, but you can use a tree that branches several times if you determine that there are interactions between variables.

도 25 에 있는 알고리즘을 구현하여 니치 자료에 적합하여 분석을 수행하였다. 알고리즘을 보면 알겠지만 구현할 때 중요한 것 중의 하나가 규합하는 tree의 개수와 과적합을 방지하는 shrinkage parameter의 선택이다. 보통 tree개수가 증가하면 shrinkage parameter는 작아지고 반대로 shrinkage parameter가 커지면 tree개수는 작아지는 경향이 있다. 그래서 보통 부스팅 모형을 적합할 때는 shrinkage parameter을 작은 값으로 고정시키고 tree의 개수를 변화시켜 가면서 최적의 모형을 찾는다. 실제 분석 과정에서는 shrinkage parameter로 0.01을 사용한다.The algorithm shown in FIG. 25 was implemented and analyzed for the niche data. As you can see from the algorithm, one of the important things to implement is the number of trees to match and the choice of shrinkage parameters to prevent overfitting. In general, as the number of trees increases, the shrinkage parameter decreases, whereas as the number of trees increases, the number of trees tends to decrease. So when fitting a boosting model, fix the shrinkage parameter to a small value and change the number of trees to find the optimal model. In the actual analysis process, 0.01 is used as the shrinkage parameter.

도 26는 니치 평가 모델 생성 과정에 관한 일 실시예적 개념을 설명하는 도면이다.FIG. 26 is a view for explaining an exemplary concept related to a niche evaluation model generation process. FIG.

규합하는 tree의 개수에 따라서 수많은 부스팅 모형이 생기기 마련이다. 이런 수많은 모형들 중에서 최적의 모형을 선택할 필요가 있다. 이는 모형 선택의 문제이고 이에 관해서도 수많은 연구들이 수행되고 있고 여러 방법들이 알려져 있다. 그 중 부스팅 모형에 적합하다고 할 수 있는 cross validation 방법을 이용하여 최적의 모형을 선별하였다. 특허 자료에 대해서 5-fold cross validation 방법을 적용하여 최적 모형을 선택하였다. 도 26는 cross validation 방법을 도식적으로 잘 보여주고 있다. 분쟁특허와 비분쟁 특허가 섞여 있는 자료를 5개의 집합으로 나눈 다음 각각의 경우마다 4개의 집합으로 모형을 구축하고 나머지 하나의 집합으로 test error을 구해서 이 오류가 가장 작은 모형을 선택하는 방법이다.Depending on the number of trees to be assembled, many boosting models are created. Among these numerous models, it is necessary to choose the best one. This is a matter of model selection, and numerous studies have been conducted on this and many methods are known. Among them, the optimal model was selected by using the cross validation method, which is suitable for the boosting model. The optimal model was selected by applying 5-fold cross validation method on patent data. 26 illustrates a cross validation method well. After dividing the data with conflict patent and non-dispute patent into 5 sets, each model is constructed with 4 sets and the test set is calculated with the other set to select the model with the smallest error.

상기와 같은 방법으로 니치 평가 모델을 생성한다. 상기 니치 평가 모델은 독립 변수로서 니치 평가 요소값이 제공되면, 종속 변수로서 니치 기준 요소값을 생성해 낸다. 니치 기준 요소값이 5년 기준 출원수의 증가량이고, 최근 5년간의 입수 문건 집합에 대한 니치 평가 요소값이 제공되면, 니치 평가 모델은 5년 기준 출원수 증가량을 제시하므로, 앞으로 5년간의 출원수가 제시될 수 있게 된다.The niche evaluation model is generated as described above. The niche evaluation model generates a niche reference element value as a dependent variable when the niche evaluation element value is provided as an independent variable. If the niche reference element value is an increase in the number of applications for five years, and the niche evaluation element value is provided for the set of acquisition documents in the last five years, the niche evaluation model shows the increase in the number of applications for five years, and therefore the application for the next five years. The number can be presented.

이와 같이 상기 니치 기술 영역 발견 시스템(5000)은 니치 평가의 대상이 되는 지정되거나 입력된 입수 문건 집합을 입수(SM21)하고, 상기 입수 문건 집합에 대하여 적어도 2 이상의 상기 니치 평가 요소에 대하여 상기 니치 평가 요소값 정보를 생성(SM22)하고, 생성된 상기 니치 평가 요소값 정보를 사용하여 상기 니치 평가 모델로 니치 평가값을 생성(SM23)하게 된다.As such, the niche description area discovery system 5000 obtains a designated or input set of acquisition documents that are subject to niche evaluation (SM21), and evaluates the niche for at least two or more of the niche evaluation elements with respect to the set of acquisition documents. The element value information is generated (SM22), and the niche evaluation value is generated by the niche evaluation model (SM23) using the generated niche evaluation element value information.

상기 n차원 프레임이 구성하는 교차 영역에 대응되는 문건 집합을 입수 문건 집합으로 간주하여, 교차 영역별로 니치 평가값을 생성할 수 있게 된다. 니치 평가값의 생성은 본 발명의 니치 평가값 생성 모듈(5260)이 담당한다. 상기 생성된 니치 평가값은 니치 분석 정보의 일종이 되며, 상기 니치 분석 정보 표시부(5113)를 통하여 상기 n차원 프레임에 나타나게 된다. 이때, 상기 타겟 문건 집합에 상기 니치 옵션 처리부(5130)를 통하여 한정이 되게 되면, 교차 영역에 대응되는 문건 집합이 한정되고 되고, 문건 집합에 대응되는 니치 평가 요소값이 변경되기 때문에, 니치 평가값도 변동되게 된다.It is possible to generate a niche evaluation value for each intersection area by considering a set of documents corresponding to the intersection area of the n-dimensional frame as an acquisition document set. Generation of the niche evaluation value is carried out by the niche evaluation value generation module 5260 of the present invention. The generated niche evaluation value becomes a kind of niche analysis information, and is displayed in the n-dimensional frame through the niche analysis information display unit 5113. In this case, when the target document set is limited through the niche option processing unit 5130, the document set corresponding to the intersection area is limited, and the niche evaluation element value corresponding to the document set is changed. Will also fluctuate.

이어, 본 발명의 니치 발견 시뮬레이션 모듈(5320)에 대해 설명한다. 하나의 타겟 문건 집합이 있을 경우, 그 타겟 문건 집합별로 다양한 종류의 교차 영역 집합을 생성할 수 있을 것이다. 타겟 문건 집합에 포함된 최말단 특허 분류의 직상위 특허 분류 또는 상위 특허 분류가 n개일 경우, 2차원 교차 영역 집합만 하더라도 적어도 nC2 개의 교차 영역 집합이 생성될 수 있게 된다. 한편, 니치 옵션 처리부(5130)를 통하는 경우, 더욱 더 다양한 교차 영역 집합이 생성될 수 있으며, 2 종류 이상의 특허 분류가 대응되는 특허 문건일 경우에는 더욱 더 다양한 교차 영역 집합이 생성될 수 있고, CatchWord나 Index가 대응될 수 있는 경우, 더욱 더 다양한 교차 영역 집합이 생성될 수 있다. 타겟 문건 집합이 주어졌을 때, 이러한 다양한 교차 영역 집합의 생성은 시스템적으로 구성될 수 있음도 당연할 것이다.Next, the niche discovery simulation module 5320 of the present invention will be described. If there is a single target document set, it is possible to generate various kinds of cross-region sets for each target document set. When the top patent classification or the top patent classification of the last patent classification included in the target document set is n, at least nC2 cross region sets may be generated even if the two-dimensional cross region set is only. On the other hand, through the niche option processor 5130, a more diverse set of cross regions can be generated, and in the case of a patent document corresponding to two or more types of patent classification, a more diverse set of cross regions can be generated, and CatchWord If more than one Index can be matched, a more diverse set of intersections can be created. Given a set of target documents, it will be appreciated that the generation of these various cross-section sets can be organized systematically.

상기 니치 기술 영역 발견 시스템(5000)은 문건 집합 분할 모듈(5210)을 사용하여 입수한 입수 문건 집합(타겟 문건 집합)을 분할하여 복수 개의 분할 입수 문건 집합을 생성(SM31)하고, 상기 생성된 분할 입수 문건 집합별로 상기 입수 문건 집합에 대하여 적어도 2 이상의 상기 니치 평가 요소에 대하여 상기 니치 평가 요소값 정보를 생성(SM32)하고, 생성된 상기 니치 평가 요소값 정보를 사용하여 상기 니치 평가 모델로 상기 생성된 분할 입수 문건 집합별로 니치 평가값을 생성(SM33)한 다음, 상기 복수 개의 분할 입수 문건 집합별로 생성되는 상기 니치 평가값에서 기 설정된 평가값 이상이거나, 기 설정된 랭킹 이상인 상기 분할 입수 문건 집합 정보를 선별(SM34)한다. 이러한 시뮬레이션은 본 발명의 니치 발견 시뮬레이션 모듈(5320)이 주도적으로 담당한다.The niche description region discovery system 5000 divides the acquired document set (target document set) obtained using the document set dividing module 5210 to generate a plurality of divided acquisition document sets (SM31), and generates the divided The niche evaluation element value information is generated for at least two or more niche evaluation elements of the acquisition document set for each acquisition document set (SM32), and the niche evaluation model is generated using the generated niche evaluation element value information. After generating the niche evaluation value for each of the divided acquisition document sets (SM33), the divided acquisition document set information that is equal to or greater than a predetermined evaluation value or a predetermined ranking is generated from the niche evaluation values generated for each of the plurality of divisional acquisition document sets. Screening (SM34). This simulation is led by the niche discovery simulation module 5320 of the present invention.

이어, 본 발명의 니치 기술 영역 발견 시스템(5000)를 중심으로 하여, 본 발명의 기술 사상을 더욱 더 상세하게 설명한다. 상기 니치 분석용 문건 집합 입수부(5310)은 특허 집합을 입수하는 특허 집합 입수부과 입수한 특허 집합을 한정하는 입수 특허 집합 한정부을 포함하고 있다. 상기 입수 대상 특허 집합 생성부은 대상 특허 집합을 입수하는 대상 특허 집합 입수부과 대상 특허 집합을 한정하는 대상 특허 집합 한정부을 포함하고 있다. 대상 특허 집합 입수부는 검색 엔진에 검색식을 질의하거나, DBMS에 질의식을 질의하거나, 상기 특허 정보 시스템(10000)을 사용하고 있는 사용자가 관리하는 적어도 하나 이상의 특허 문건 또는 특허 문건 ID를 포함하는 특허 문건 집합 또는 적어도 하나 이상의 특허 검색식이나 질의식으로 특허 문건 집합을 호출 또는 생성 또는 선택할 수 있게 된다. Next, with reference to the niche technology region discovery system 5000 of the present invention, the technical idea of the present invention will be described in more detail. The niche analysis document set acquisition unit 5310 includes a patent set acquisition unit for obtaining a patent set and an acquisition patent set limiting unit for limiting the acquired patent set. The acquisition target patent set generation unit includes a target patent set acquisition unit for acquiring the target patent set and a target patent set limitation unit for defining the target patent set. The target patent set obtaining unit queries at least one patent document or patent document ID managed by a user who uses the patent information system 10000 to query a search engine, a query expression to a DBMS, or a DBMS. A set of documents or at least one or more patent search or query expressions can be called, created or selected.

상기 니치 분석용 문건 집합 입수부(5310)은 1) 검색 엔진으로의 검색어 입력 또는 2) 특허 DB(112)를 구성하는 필드를 활용하는 쿼리의 입력, 3) 상기 검색어 또는 상기 쿼리에 대응되는 특허 정보 추출 조건의 입력이나 선택 등으로 생성될 수 있다. 상기 특허 정보 추출 조건은 상기 특허 문건에 포함된 발행국, 시간 범위, 출원인이나 권리자나 양수인, 발명자, 특허 분류, (특허 주제 분류 및 기 설정된 분류 속성 등이 있다면, 이들 중 어느 하나 이상) 중 어느 하나 또는 어느 2 이상의 조합으로 생성되는 것인 것이며, 상기 시간 범위의 기준이 되는 시간은 상기 특허 문건의 우선일, 출원일, 공개일 및 등록일 중 어느 하나 또는 어느 2 이상의 조합을 사용하는 것이며, 상기 특허 분류는 IPC, USPC, FT, FI 및 ECLA 중 어느 하나 또는 어느 2 이상의 조합인 것이며, 상기 특허 주제 분류는 상기 IPC의 catchword, 상기 USPC의 catchword 또는 상기 IPC, USPC, FT, FI 및 ECLA 중 어느 하나 이상을 사용하여 생성되는 주제 분류 중 어느 하나 이상인 것이며, 상기 기 설정된 분류 속성은 상기 출원인이나 상기 권리자나 상기 양수인에 대한 분류 속성, 상기 특허 분류를 구성하는 특허 분류별 분류 속성, 상기 발명자에 대한 분류 속성, 또는 상기 특허 주제 분류별 분류 속성 중 어느 하나 이상이 될 수 있다. 예를 들어, 출원인에 대하여 기업, 대학, 연구 기관, 개인 등의 구분이 있을 때, 대학 속성을 가지는 출원인만으로 한정 지을 수 있을 것이며, 특정 기간 동안 특정 개수 이상의 출원/등록된 특허를 가지는 출원인(다출원인)만으로 한정 지을 수도 있을 것이다.The niche analysis document set acquisition unit 5310 may include 1) inputting a search word to a search engine or 2) input of a query utilizing a field constituting the patent DB 112, 3) a patent corresponding to the search word or the query. It may be generated by input or selection of information extraction conditions. The patent information extraction condition may be any one of the issuing country, time range, applicant, owner or assignee, inventor, patent classification, (if there is a patent subject classification and a predetermined classification attribute, etc.) included in the patent document. Or a combination of any two or more of the above, wherein the time that is the basis of the time range is any one or any combination of two or more of the priority date, application date, publication date, and registration date of the patent document, and the patent classification Is any one or combination of two or more of IPC, USPC, FT, FI, and ECLA, and the patent subject classification is one or more of the catchword of the IPC, the catchword of the USPC, or the IPC, USPC, FT, FI, and ECLA. Is one or more of the subject classifications generated using, and the predetermined classification attribute is the applicant or the owner or the assignee For the classification properties can be, or any one or more of the patent subject by Category classification properties of the classified-patent classification properties, the inventors constituting the category attribute, the patent classification. For example, when there is a distinction between an applicant, a company, a university, a research institute, an individual, etc., the applicant may be limited to only applicants having university attributes, and applicants having a specific number of applied / registered patents for a specific period of time. Applicant) may be limited only.

한편, 상기 니치 분석용 문건 집합 입수부(5310)은 특정한 특허 집합의 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 인용 발생 입수 특허 집합 등을 호출 또는 선택 또는 입수할 수 있다. 입수 대상 특허 집합 생성부이 특정한 조건을 가하여 대상 특허 집합을 한정하는 경우, 상기 조건을 반영하여, 데이터부로부터 대상 특허를 입수해 온다. 상기 특허정보특허 정보 시스템(10000)은 적어도 하나 이상의 카데고리별로 특허 집합을 생성하고 관리할 수 있으며, 상기 특허 집합의 관리는 본 발명의 특허 정보 시스템(10000) 특허 집합 관리부가 수행한다. 상기 특허 정보 시스템(10000) 특허 집합 관리부의 하부에는 출원인별로 특정되는 특허 집합을 관리하는 출원인별 특허 집합 관리부와, 특허 분류별로 특정되는 특허 집합을 관리하는 특허분류별 특허 집합 관리부와 기타 분류나 카테고리별로 특정되는 특허 집합을 관리하는 기타 분류별 특허 집합 관리부등 이 있다. 한편, 사용자들이 생성하는 특허 집합은 회원별 특허 집합 관리부가 수행한다.Meanwhile, the niche analysis document set obtaining unit 5310 calls or selects a forward cited patent set, a back cited patent set, a forward self cited patent set, a back self cited patent set, a citation generation obtained patent set, and the like of a specific patent set. Or can be obtained. When the acquisition target patent set generation unit applies specific conditions to define the target patent set, the target patent is obtained from the data unit by reflecting the above conditions. The patent information patent information system 10000 may generate and manage a patent set for at least one or more categories, and the management of the patent set is performed by the patent information manager 100 of the present invention. Under the patent set management unit of the patent information system 10000, an applicant-specific patent set management unit managing a patent set specific to each applicant, a patent set management unit by patent classification managing a patent set specified by patent classification, and other classifications or categories. There are other classification patent set management departments that manage specific patent sets. On the other hand, the patent set generated by the user is performed by the member patent set management unit.

상기와 같은 특허 문건 집합 및 축이 결정나게 되면, 개념적으로 가로m개 세로 n개로 구성되는 mXn 개의 셀로 구성되는 매트릭스가 생성되게 된다. 상기 (mi,nj) 교차 영역에는 특허 문건 부분 집합이 대응되게 된다. 예를 들면, mi와 nj가 특허 분류인 경우, 상기 (mi,nj)에는 특허 분류 mi와 nj를 동시에 포함하는 상기 선택된 특허 문건 집합의 모든 특허들이 입수되게 되며, 이러한 특허들은 상기 특허 문건 집합의 일부이므로, 상기 특허 문건 부분 집합이 되게 된다. 이때, mi나 nj 중 특허 분류가 있을 때에는 특허 분류의 종류뿐만 아니라, main 특허 분류와 sub 특허 분류의 문제가 발생하게 된다. 특허 문건에는 적어도 하나 이상의 main 특허 분류가 포함되어 있으며, 경우에 따라서는 1개 이상의 sub 특허 분류가 포함되어 있을 수 있게 되므로, 사용될 특허 분류의 범위가 문제시 되게 된다. 이는 옵션으로 제공될 수 있으며, 사용자는 어느 특허 분류까지를 사용할 것인지를 선택할 수 있게 된다. 가급적, main 특허 분류와 sub 특허 분류 모두를 사용하는 것이 더 타당할 것이다. When the patent document set and axis are determined as described above, a matrix consisting of mXn cells consisting of m horizontally n vertically n cells is conceptually generated. The (mi, nj) intersection region corresponds to a subset of patent documents. For example, if mi and nj are patent classifications, (mi, nj) will receive all patents of the selected patent document set that includes patent classifications mi and nj simultaneously, and these patents will be assigned to the patent document set. Part of the patent document. At this time, when there is a patent classification among mi or nj, not only the kind of patent classification but also a problem of main patent classification and sub patent classification occurs. The patent document includes at least one main patent classification, and in some cases, one or more sub patent classifications may be included, so the scope of the patent classification to be used is problematic. This may be provided as an option, and the user may select which patent classification to use. Where possible, it would be more reasonable to use both main and sub patent classifications.

이어, 상기 (mi,nj)교차 영역에 대응되는 특허 문건 부분 집합에 대한 분석이 개시되며, 이는 본 발명의 특허 분석 정보 생성부(4000)가 처리한다. 상기 특허 분석 정보 생성부(4000)는 상기 특허 문건 부분 집합에 대하여 기설정된 평가 정보를 생성한다. 우선적으로 상기 특허 분석 정보 생성부(4000)는 상기 특허 문건 부분 집합에 속하는 개별 특허 단위 및/또는 상기 특허 문건 부분 집합에 속하는 특허 집합 단위별로 청구항수, 최초 (독립)청구항 길이, 페이지수, 도면수, 국내 패밀리수(분할+ US계속/일부계속), 해외 출원 패밀리수, 해외 등록 패밀리수, 국내 우선권 주장수, Large company 여부, 해외 출원인 여부, 권리자의 분쟁 제기 회수, 권리자의 분쟁 피소 회수, 전체 분쟁수, 최근 3년간 분쟁수, 전체 피고수, 최근 3년간 평균 피고수, 연차등록기간, 표준특허풀 포함여부, 양도 발생 여부, 피인용수, Reference수, IPC수, Subclass가 다른 IPC수, USPC수, Class가 다른 USPC수, FT수, Theme가 다른 FT수, (공동)출원인 수, (공동)발명자 수, 발명자 국적수, 권리의 경제적 잔여수명, IPC main group 기준 최근 3년간 US 출원수, IPC main group 기준 최근 3년간 non US 출원수, USPC no dot subgroup 기준 최근 3년간 US, IPC main group 기준 최근 3년간 US 출원수/최근10년간 US출원수, IPC main group 기준 최근 3년간 non US 출원수/ 최근10년간 non US출원수, USPC no dot subgroup 기준 최근 3년간 US출원수/ 최근10년간 US출원수, IPC main group 기준 최근 3년간 기업 비율, IPC main group 기준 최근 3년간 대기업 비율, IPC main group 기준 최근 3년간 대학/연구기관 비율, IPC main group 기준 최근 3년간 개인(not assigned) 비율, IPC main group 기준 최근 3년간 US권리자(권리자 주소 정보 참조) 비율, 출원일 백분위수, 전체 유사 모집단에서의 자사 특허 비율, 후방 인용에서의 자사 특허 비율, 전방 인용에서의 자사 특허 비율, 후방 인용에서의 자사 특허 개수, 전방 인용에서의 자사 특허 개수, 전방 인용 특허군에서의 기업 비율, 전방 인용 특허군에서의 대기업 비율, 전방 인용 특허군에서의 대학/연구기관 비율, 전방 인용 특허군에서의 개인 비율, 전방 인용 특허군에서의 US 권리자 비율, 전방 인용에서의 논문 비율, 전방 인용에서의 non US 특허 비율, 후방 인용 특허군에서의 기업 비율, 후방 인용 특허군에서의 대기업 비율, 후방 인용 특허군에서의 대학/연구기관 비율, 후방 인용 특허군에서의 개인 비율, 후방 인용 특허군에서의 US 권리자 비율, 후방 인용에서의 non US 특허 비율, 전체 유사 모집단에서의 자사 특허 비율, 5년 집중 영역 정도(AI), 5년 평균 특허 품질 지수, 5년 지수 평점, 5년 공동출원인 수, 5년 집중 영역 정도(AI), 5년 평균 특허 품질 지수, 5년 지수 평점, 5년 공동 발명자수, 실시권 설정 개수, 출원인 재무, 5년 국가별 상대 출원량, 5년 국가별 상대 등록량, 5년 국가별 상대 출원인수, 5년 국가별 상대 발명자수, 5년 국가별 특정 출원인 상대 출원량, 5년 출원인 중 기업 비율, 5년 출원인 중 대학 비율, 5년 국가별 특정 출원인 상대 등록량, 5년 국가별 특정 출원인 포함 출원량, 5년 국가별 특정 출원인 포함 등록량, 기술 분야별 rising 키워드 필드별(특허청구범위) 포함수, 5년 국가별 frequency rising 속도, 5년 국가별 network size rising 속도, 핵심 키워드의 특허 청구 범위 포함 개수, 5년 국가별 출원 증감율, 5년 국가별 등록 증감율, 5년 국가별 출원인수 증감율, 5년 국가별 발명자수 증감율, 5년 출원인 중 기업 비율, 5년 출원인 중 대학 비율, 5년 특정 출원인 포함 비율, 특허조회수, 수명 주기상의 위치, 연차등록납부액 등과 같은 니치 기술 영역 평가 요소 중 어느 하나 이상을 계산하거나 입수한다. 상기 니치 기술 영역 평가 요소에는 본 문단에서 예시한 평가 요소들의 변동값(증감율, 증가 속도 등)도 평가 요소가 됨은 물론이다 할 것이다.Subsequently, an analysis of a subset of patent documents corresponding to the (mi, nj) intersection region is disclosed, which is processed by the patent analysis information generating unit 4000 of the present invention. The patent analysis information generation unit 4000 generates preset evaluation information on the patent document subset. First of all, the patent analysis information generating unit 4000 includes the number of claims, the original (independent) claim length, the number of pages, and the drawings for each patent unit belonging to the patent document subset and / or patent set units belonging to the patent document subset. Number of domestic families (divided + US continued / partially), number of overseas filed families, number of family members registered overseas, number of domestic priority claims, large company status, status of foreign applicants, number of disputes filed by right holder, number of disputes filed by right holder, Total number of disputes, number of disputes in the last three years, total number of defendants, average number of defendants in the last three years, annual registration period, inclusion of the standard patent pool, whether transfers have occurred, number of citations, references, IPCs, IPCs with different subclasses , Number of USPCs, number of USPCs with different classes, number of FTs with different themes, number of (co) applicants, number of (co) inventors, number of inventors, economic remaining life of rights, based on IPC main group Number, IPC main gr Based on oup Non-US applications in the last 3 years, USPC no dot subgroup in the last 3 years US, in the IPC main group US applications in the last 3 years / US applications in the last 10 years Non US applications for the past 10 years, USPC no dot subgroups US applications for the past 3 years / US applications for the past 10 years, ratio of companies for the last 3 years based on IPC main group, ratio of large companies for the last 3 years based on IPC main group, IPC main group Based on university / research institutions in the last three years, not assigned ratio in the last three years based on the IPC main group, US right holder (see rights holder's address information) in the last three years, based on the IPC main group Percentage of own patents, Percentage of patents in back citations, Percentages of patents in forward citations, Percentages of patents in back citations, Percentages of patents in forward citations, Percentage of companies in forward citation patents, Forward citation patents Percentage of large companies in books, ratio of universities / research in forward citation patent group, individual ratio in forward citation patent group, US proprietor ratio in forward citation patent group, article ratio in forward citation patent, non US patent in forward citation Ratio, percentage of firms in back-cited patents, ratio of large companies in back-cited patents, ratios of universities / research in back-cited patents, individuals in back-cited patents, ratio of US owners in back-cited patents , Non-US patent ratios in back citations, percentage of own patents in the overall similar population, 5-year focused area (AI), 5-year average patent quality index, 5-year index rating, 5-year joint applicants, 5-year focused area Accuracy (AI), 5-year average patent quality index, 5-year index rating, 5-year co-inventors, number of license settings, applicant finance, 5-year relative filing, 5-year relative registration, 5-year country Relative Applicants, Number of relative inventors by country for 5 years, relative application volume of applicants by country for 5 years, percentage of companies among applicants for 5 years, university ratio among applicants for 5 years, relative registration of applicants by country for 5 years, application including applicants by country for 5 years Volume, 5 years country-specific registration amount, country-specific rising keyword field (patent claims), 5 year country frequency rising rate, 5 year country network size rising rate, key keyword claims included Number, 5 years of application increase / decrease rate, 5 years of registration increase / decrease rate by country, 5 years of applicant increase / decrease rate by country, 5 years of applicant increase / decrease rate, company rate among 5 year applicants, university rate among 5 year applicants, 5 year specific applicants Calculate or obtain any one or more of the assessment criteria for niche technology areas, such as inclusion rates, patent counts, life cycle positions, and annual payments. In the niche description area evaluation element, the change value (rate of increase / decrease rate, increase rate, etc.) of the evaluation elements exemplified in this paragraph will also be an evaluation element.

한편, 도 28은 본 발명의 틈새 영역 발견 분석 결과에 관한 다른 일실시예적 도면으로, 도 10과는 다른 니치 기술 영역 평가 요소값을 사용하여 생성되는 니치 기술 영역 평가 정보에 대하여 각 축, 교차 영역/셀(cell)에 대하여 thermograph 효과가 적용되어 있음을 볼 수 있다.FIG. 28 is a diagram illustrating another embodiment of a niche region discovery analysis result according to the present invention. FIG. 28 is a diagram illustrating niche description region evaluation information generated by using a niche description region evaluation element value different from FIG. It can be seen that the thermograph effect is applied to the / cell.

한편, 상기 교차 영역값에 대한 특허 문건 부분 집합이 정해지는 경우, 상기 특허 문건 부분 집합에 속하는 특허들에 대해 기설정된 분석 정보를 제공해 줄 수 있다. 상기 특허 문건 부분 집합에 특허 문건이 대응되는 경우, 상기 특허 문건 부분 집합에 포함된 특허들에 대해서 정량 분석 정보가 제공될 수 있을 것이며(분석의 축은 시간, 출원인, 발명자, 특허 분류, 국가, 등록 여부 등이 될 수 있을 것이다.), 개별 특허들에 대한 서지 사항 및 전문 정보가 링크 될 수도 있을 것이다.On the other hand, when the patent document subset for the intersection area value is determined, predetermined analysis information may be provided for patents belonging to the patent document subset. If a patent document corresponds to the patent document subset, quantitative analysis information may be provided for the patents included in the patent document subset (the axis of analysis is time, applicant, inventor, patent classification, country, registration). And bibliography and full text of individual patents.

상기와 같은 본 발명 사상은 3 이상의 n차원 매트릭스로 확장할 수 있음은 당업자에게 당연할 것이며, 이 경우, 상기 교차 영역값은 n차원의 큐브 형태가 될 것이다. 한편, 사용자는 2차원까지의 정보만 시각적으로 확인할 수 있으므로, n개의 차원 중 자신이 원하는 차원 2개를 선택하여, 선택된 차원에 대한 상기 니치 기술 평가 정보를 볼 수 있게 된다. 생성된 큐브 data에 있어서, slice and dice 정보 처리를 하여, 원하는 매트릭스를 선택하거나, 선택된 매트릭스에 대하여 축 등을 변동해 보는 인터페이스를 상기 특허 정보 시스템(10000)은 상기 사용자 컴퓨터(100)에 제공할 수 있을 것이다.It will be apparent to those skilled in the art that the inventive concept as described above can be extended to three or more n-dimensional matrices, in which case the intersection area value will be in the form of an n-dimensional cube. On the other hand, since the user can only visually confirm up to two dimensions of information, the user can select two of the desired dimensions among the n dimensions, and can view the niche technology evaluation information on the selected dimension. In the generated cube data, the patent information system 10000 provides the user computer 100 with an interface for processing slice and dice information to select a desired matrix or to change an axis or the like with respect to the selected matrix. Could be.

본 발명 사상에서 예시로 든 계층 구조는 특허 분류뿐만 아니라, 도 13 내지 도14에 예시된 IPC 대응 캐치워드나, USPC 대응 Index에 대해서도 적용될 수 있음은 당연할 것이다.
The hierarchical structure exemplified in the inventive concept may be applied not only to the patent classification but also to the IPC corresponding catchwords illustrated in FIGS. 13 to 14 or the USPC corresponding index.

본 발명은 특허 정보 산업, R&D 지원 사업, 신사업 영역 발굴 등에 다양하게 활용될 수 있다.
The present invention can be used in various ways, such as patent information industry, R & D support business, new business area discovery.

10000 : 특허 정보 시스템
300 : 연계 시스템
200 : 유무선 네트워크
100 : 사용자 컴퓨터
10000 : 특허 정보 시스템
1000 : 데이터부
1100 : 특허 데이터부
1200 : 비특허 데이터부
1300 : 핵심 키워드 DB
1400 : 분류 메타데이터 DB
1700 : 사용자 데이터부
1800 : 기타 DB
1900 : 목적 특화 데이터부
2000 : 데이터 가공부
2100 : 핵심 키워드 생성부
2200 : 분류 메타 데이터 생성부
2300 : 목적 특화 데이터 생성부
2400 : 유사 특허 집합 생성부
2500 : 네트워크 데이터 생성부
3000 : 특허 정보 서비스 지원부
3000 : 검색 처리부
4000 : 특허 분석 정보 생성부
4100 : 분석 대상 입수부
4200 : 특허 분석 처리부
4210 : 분석 지표 DB
4220 : 분석 명령 구문 DB
4300 : 특수 분석 정보 생성부
4400 : 분석 옵션 처리부
4600 : 분석 결과 리포팅부
20000 : 목적 특화 시스템부
5000 : 니치 기술 영역 발견 시스템
6000 : 특허 자동 평가 시스템
7000 : 특허 분쟁 예측 시스템
8000 : 미래 기술 예측 시스템
9000 : 융합 정보 처리 시스템
1110 : 특허 명세서 파일부
1120 : 특허 DB
1130 : 특허분류 DB
1140 : 기타 특허 데이터 DB
1210 : 비특허파일부
1220 : 비특허 DB
1230 : 기업정보 DB
1240 : 기타 비특허 데이터 DB
1310 : 핵심 키워드 메타 데이터 DB
1910 : 출원인 DB
1920 : 규칙 데이터부
1930 : 언어 데이터부
1940 : 표준 특허풀 데이터부
2110 : 키워드 추출부
2120 : 핵심 키워드 선정부
2130 : 구문 키워드 생성부
2140 : 핵심 키워드 메타 정보 생성부
2150 : 제품 키워드 생성부
2210 : 분류 대표 핵심 키워드군 생성부
2220 : 분류간 맵핑 정보 생성부
2230 : 핵심 키워드 계층화부
2240 : 특허 자동 분류부
2310 : 통계 분석용 데이터 생성부
2032 : 인용 분석용 데이터 생성부
2330 : 발견 분석용 데이터 생성부
2410 : 클러스터링 처리부
2510 : 노드 데이터 생성부
2520 : 에지 데이터 생성부
5100 : 니치 기술 영역 발견 엔진
5110 : n차원 프레임 처리부
5111 : 교차 영역 프레임 선택부
5112 : 교차 영역 집합 생성부
5113 : 니치 분석 정보 표시부
5114 : 선택 프레임 관리부
5120 : 니치 분석 정보 생성부
5121 : 니치 관점 분석 정보 생성부
5122 : 니치 모델 분석 정보 생성부
5123 : 니치 분석 종류 제시부
5124 : 니치 모델 종류 제시부
5130 : 니치 옵션 처리부
5131 : 기간 한정부
5132 : 주체 한정부
5133 : 기술 분야 한정부
5134 : 개별 특허 한정부
5135 : 범례 처리부
5200 : 니치 평가 모델 생성부
5210 : 문건 집합 분할 모듈
5220 : 니치 기준 요소값 생성 모듈
5230 : 니치 평가 요소값 생성 모듈
5240 : 니치 평가 알고리즘 생성 모듈
5250 : 니치 평가 알고리즘 검증 모듈
5260 : 니치 평가값 생성 모듈
5270 : 니치 평가 요소값 DB
5280 : 니치 평가 모델 DB
5300 : 니치 발견 정보 생성부
5310 : 니치 분석용 문건 집합 입수부
5320 : 니치 발견 시뮬레이션 모듈
5330 : 니치 발견 리포팅 정보 생성부
5340 : 니치 기술 분쟁 UI부
970 : 단위 융합 요소 생성부
971 : 핵심 키워드 단위 융합 요소 생성부
972 : 특허 분류 단위 융합 요소 생성부
973 : 하이브리드 융합 요소 생성부
974 : 인덱스 단위 융합 요소 생성부
910 : 융합 처리 모집합 입수부
920 : 융합 처리 대상 집합 생성부
921 : 집합 분할부
921-1 : 분할 처리 기준 제공부
921-2 : 분할 처리 기준 입수부
921-3 : 분할 처리 기준 DB
922 : 인용 특허 집합 입수부
923 : 집합 지정 정보 입수부
930 : 융합 처리 기준 정보부
931 : 융합 처리 기준 제공부
932 : 융합 처리 기준 입수부
933 : 융합 처리 기준 생성부
934 : 융합 처리 기준 정책 DB
940 : 융합 정보 처리부
941 : 키워드 융합 정보 처리부
942 : 특허 분류 융합 정부 처리부
943 : 하이브리드 융합 정보 처리부
960 : 융합 정보 배치 생성부
961 : 융합 정보 배치 생성 정책부
962 : 융합 정보 DB
950 : 융합 정보 분석부
951 : 융합 정보 시각화부
952 : 융합 정보 UI부
980 : 공백 융합 후보 발견부
981 : 대등 공백 융합 후보 발견부
982 : 심화 공백 융합 후보 발견부
983 : 중요 공백 융합 후보 발견부10000: Patent Information System
300: linkage system
200: wired and wireless network
100: user computer
10000: Patent Information System
1000: data section
1100: Patent data part
1200: non-patent data portion
1300: Key Keyword DB
1400: classification metadata DB
1700: user data section
1800: other DB
1900: Purpose-specific data section
2000: Data Processing Department
2100: key keyword generator
2200: classification metadata generation unit
2300: purpose-specific data generation unit
2400: Similar patent set generation unit
2500: network data generator
3000: Patent Information Service Support Department
3000: search processing unit
4000: patent analysis information generation unit
4100: acquisition target
4200: Patent Analysis Processing Unit
4210: Analytical Indicators DB
4220: Analysis command syntax DB
4300: special analysis information generation unit
4400: analysis option processing unit
4600: Analysis result reporting unit
20000: Purpose Specialized System
5000: Nich Technology Zone Discovery System
6000: Patent Auto Evaluation System
7000: Patent Dispute Prediction System
8000: Future Technology Prediction System
9000: Convergence Information Processing System
1110: Patent specification file portion
1120: Patent DB
1130: Patent Classification DB
1140: Other Patent Data DB
1210: Non-Patent File Division
1220: Non-patent DB
1230: Company Information DB
1240: other non-patent data DB
1310: Core Keyword Metadata DB
1910: Applicant DB
1920: rule data portion
1930: language data section
1940: Standard Patent Pool Data Section
2110: keyword extraction unit
2120: Key Keyword Selection Unit
2130: syntax keyword generation unit
2140: core keyword meta information generation unit
2150: product keyword generator
2210: classification keyword core keyword group generation unit
2220: mapping information generation unit between classifications
2230 key keyword stratification unit
2240: Patent automatic classification unit
2310: statistical data generation unit
2032: citation analysis data generation unit
2330: data generation unit for discovery analysis
2410: clustering processing unit
2510: node data generator
2520: edge data generator
5100: Nich Technology Zone Discovery Engine
5110: n-dimensional frame processing unit
5111: intersection area frame selection part
5112: intersection region generation unit
5113: niche analysis information display unit
5114: selection frame management unit
5120: niche analysis information generation unit
5121: niche perspective analysis information generation unit
5122: niche model analysis information generation unit
5123: Niche analysis type presentation unit
5124: Niche model type presentation unit
5130: niche option processing unit
5131: Limited Time
5132: subject limited part
5133: limited technical field
5134: Individual Patent Limited
5135 legend processing unit
5200: Niche Evaluation Model Generator
5210: document set split module
5220: niche reference element value generation module
5230: niche evaluation element value generation module
5240: Nich Evaluation Algorithm Generation Module
5250: niche evaluation algorithm verification module
5260: niche evaluation value generation module
5270: Niche evaluation element value DB
5280: Niche Evaluation Model DB
5300: niche discovery information generation unit
5310: Obtaining Document Set for Niche Analysis
5320: Nich Discovery Simulation Module
5330: niche discovery reporting information generator
5340: Nichi Technical Dispute UI Department
970: unit fusion element generation unit
971: Core keyword unit fusion element generator
972: Patent classification unit fusion element generation unit
973: hybrid fusion element generator
974: Index unit fusion element generation unit
910: convergence processing collection acquisition
920: Fusion processing target set generation unit
921: set divider
921-1: Division processing standard provider
921-2: Division processing standard acquisition part
921-3: Partition processing base DB
922: Acquisition of cited patent set
923: collection set information acquisition unit
930: convergence processing reference information
931: Fusion processing standard provider
932: Acquisition of fusion treatment criteria
933: Fusion processing reference generation unit
934: convergence processing criteria policy DB
940: convergence information processing unit
941: Keyword Convergence Information Processing Unit
942: Patent classification convergence government processing unit
943: hybrid convergence information processing unit
960: Fusion information batch generation unit
961: Convergence Information Batch Generation Policy Department
962: Fusion Information DB
950: convergence information analysis unit
951: Convergence Information Visualizer
952: Convergence Information UI
980: blank fusion candidate discovery unit
981: Equal Blank Convergence Candidate Finder
982: Deep Blank Convergence Candidate Finder
983: Critical Space Convergence Candidate Finder

Claims

An information processing method of a patent information system that provides a niche technology area search service to a user computer through a wired or wireless network,
The patent information system
(a) receiving selection information from the user computer about a first patent classification set using a first patent classification and a second classification set using at least one or more second classifications;
(b) constructing an intersection region set having at least two intersection regions using the first patent classification set and the second classification set; And
(c) generating at least one niche analysis information as at least one cross region value of at least one cross region constituting the cross region set based on a cross region document set (cross region set) corresponding to the cross region; Including;
The first patent classification is an IPC, F-term, FI, UPSC, ECLA any one or more of the information processing method of the patent information system providing a niche technology area search service, characterized in that.

The method of claim 1,
(d) displaying at least one of the intersection area value and the intersection area value processing result obtained by processing the intersection area value and the intersection area value on a preset basis in the intersection area; An information processing method of a patent information system providing a nich technology search service.

The method of claim 1,
And wherein the intersection area set is a matrix area or a cube shape intersection area frame using the first patent classification set and the second classification set.

The method of claim 1,
The cross-domain document set is a partial document set of at least one target document set,
The target document set is a document set designated or created by the user, or a document set designated or generated by the system,
Optionally, the target document set may be limited to documents that satisfy a predetermined limitation condition.
The above limitation conditions may include time limitation, applicant or owner limitation, inventor limitation, limitation of patent evaluation score or patent evaluation grade, limitation of attribute of applicant, owner or inventor, satisfaction of patent evaluation index, and inclusion of specific bibliographic attribute. An information processing method of a patent information system providing a niche technology area search service, characterized in that any one or more of limitations and limitations of measured values for a patent.

The method of claim 1,
The second classification is a patent classification or a semi-patent classification having a corresponding relationship with a patent classification,
The quasi-patent classification is at least one of IndextoUSPC and CatchWord,
The semi-patent classification is an information processing method of a patent information system for providing a niche technology area search service, characterized in that at least one patent classification per one semi-patent classification.

The method of claim 1,
In step (a), by selecting at least one or more upper patent classification having at least one lower patent classification, the lower patent classification for the selected upper patent classification to process the first patent classification set. An information processing method of a patent information system that provides a nich technology area search service.

The method according to claim 6,
The cross-domain document set is composed of documents that simultaneously satisfy the attributes of the first patent classification constituting the first patent classification set and the second classification constituting the second classification set among target document sets,
Satisfaction of the attribute may include all or a portion of the first patent classification and the second classification simultaneously displayed in the cross-domain document, or at least one higher patent classification in the patent classification system of the patent classification in the cross-domain document. An information processing method of a patent information system for providing a niche technology area search service, characterized in that all or a part thereof appears simultaneously in the first patent classification and the second classification.

The method of claim 1,
The first patent classification set includes a lower patent classification selected by the user among all lower patent classifications of the upper patent classification with respect to any one upper patent classification in the patent classification system. Information processing method of patent information system that provides technology area search service.

The method of claim 1,
The niche analysis information may be selected by the user,
The niche analysis information is to use at least one niche analysis index or niche analysis model,
And said intersection area value is varied according to selection of said niche analysis index or said niche analysis model.

The method of claim 3, wherein
In the cross region frame, a patent classification constituting the first patent classification set and a second classification constituting the second classification set appear.
The cross region frame includes a first frame axis in which a patent classification constituting the first patent classification set appears, a second frame axis in which a second classification constituting the second classification set appears,
The intersection area is arranged
A first arrangement method for arranging based on the order in which the first set of patent classifications are arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system;
If there is no cross region set corresponding to the patent classification of the first patent set or there is no cross region set corresponding to the second classification of the second classification set, the cross region except for the patent classification or the second classification without the cross region set For a patent classification or a second classification in which a set exists, a second arrangement in which the first patent classification set is arranged based on the order in which the first classification set is arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system. Placement method, and
The higher the sum of the crossing area values for all the crossing areas in the first frame axis direction, the closer to the second frame axis, and the higher the sum of the crossing area values for all the crossing areas in the second frame axis direction. Any one of the third arrangement method to be disposed close to the first frame axis is applied,
The first arrangement method, the second arrangement method and the third arrangement method may be selected according to the user's selection or the setting of the system. How information is processed.

A patent information system that provides niche technology area search service to a user computer through a wired or wireless network,
A cross region frame selection unit for receiving selection information about a first patent classification set using a first patent classification and a second classification set using at least one or more second classifications from the user computer;
An intersection region generation generator configured to form an intersection region set having at least two intersection regions using the first patent classification set and the second classification set; And
A niche analysis information generation unit generating at least one niche analysis information as a cross region value with respect to at least one cross region constituting the cross region set, based on a cross region document set (intersecting region set) corresponding to the cross region. ;;
The first patent classification is a patent information system for providing a niche technology area search service, characterized in that any one or more of IPC, F-term, FI, UPSC, ECLA.

12. The method of claim 11,
And a niche analysis information display unit configured to display, in the intersection area, at least one of the intersection area value and the intersection area value processing result of processing the intersection area value on a predetermined basis in the intersection area. Patent information system providing niche technical area search service.

13. The method of claim 12,
And a selection frame manager for managing selection history and current selection of the first and second classification sets.

The method of claim 1,
The niche analysis information generation unit
A niche viewpoint analysis information generation unit configured to generate niche analysis information for each niche viewpoint; And
A niche model analysis information generator configured to generate niche analysis information for each niche model; Patent information system for providing a niche technology area search service, characterized in that it further comprises any one or more of.

The method of claim 1,
And a cross region set arrangement unit for arranging the cross region set.
Wherein the intersection area set arrangement unit arranged the intersection area set arrangement unit is
A first arrangement method for arranging based on the order in which the first set of patent classifications are arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system;
If there is no cross region set corresponding to the patent classification of the first patent set or there is no cross region set corresponding to the second classification of the second classification set, the cross region except for the patent classification or the second classification without the cross region set For a patent classification or a second classification in which a set exists, a second arrangement in which the first patent classification set is arranged based on the order in which the first classification set is arranged on the patent classification system and the order in which the second classification set is arranged on the second classification system. Placement method, and
The higher the sum of the crossing area values for all the crossing areas in the first frame axis direction, the closer to the second frame axis. The higher the sum of the crossing area values for all the crossing areas in the second frame axis direction, the higher the first frame. To perform any one of the method of any one of the third positioning method to be placed close to the axis,
In the cross region frame, a patent classification constituting the first patent classification set and a second classification constituting the second classification set appear.
The cross region frame includes a first frame axis in which a patent classification constituting the first patent classification set appears, a second frame axis in which a second classification constituting the second classification set appears,
The first arrangement method, the second arrangement method and the third arrangement method may be selected according to the user's selection or the setting of the system. .