KR101271115B1

KR101271115B1 - System, Media, Program and Method on Generating Patent Risk Hedging Prediction Information

Info

Publication number: KR101271115B1
Application number: KR1020110025696A
Authority: KR
Inventors: 강민수
Original assignee: 강민수; (주)광개토연구소
Priority date: 2011-03-23
Filing date: 2011-03-23
Publication date: 2013-06-04
Also published as: KR20120108126A

Abstract

본 발명은 특허 리스크 헤징 예측 정보 생성 방법, 그 방법을 실시하는 시스템, 그 방법을 실시하는 프로그램, 그 프로그램이 기록된 기록 매체에 관한 것이다.
본 발명을 실시하면 시스템적으로 신뢰성 있고 타당성 높은 특허 리스크 헤징 예측 정보를 생성할 수 있다.The present invention relates to a method for generating patent risk hedging prediction information, a system for implementing the method, a program for performing the method, and a recording medium on which the program is recorded.
Implementation of the present invention enables the generation of systemically reliable and relevant patent risk hedging prediction information.

Description

Method for generating patent risk hedging prediction information, a system for implementing the method, a program for implementing the method, and a recording medium recording the program {System, Media, Program and Method on Generating Patent Risk Hedging Prediction Information}

본 발명은 특허 리스크 헤징 예측 정보 생성 방법, 그 방법을 실시하는 시스템, 그 방법을 실시하는 프로그램, 그 프로그램이 기록된 기록 매체에 관한 것으로서, 더욱 더 상세하게는 특허 분쟁 예측 모델을 수립하고, 특허 분쟁 예측 모델값을 사용하여 특허 분쟁 리스크의 헤징에 기여하는 특허 리스크 헤징 예측 정보 생성 방법, 그 방법을 실시하는 시스템, 그 방법을 실시하는 프로그램, 그 프로그램이 기록된 기록 매체에 관한 것이다.The present invention relates to a method for generating patent risk hedging prediction information, a system for implementing the method, a program for executing the method, and a recording medium on which the program is recorded. A method for generating patent risk hedging prediction information that contributes to hedging of patent dispute risk using a dispute prediction model value, a system for implementing the method, a program for performing the method, and a recording medium on which the program is recorded.

특허 전쟁의 시대를 맞아 전세계적으로 특허 분쟁이 급증하고 있다. 종래에는 특허 분쟁이 발생한 분쟁 정보를 DB로 구축하고, 분쟁 정보를 분석해 주거나, 분쟁 발생 정보를 뉴스 레터 등으로 전송해 주거나, 분쟁 정보를 검색할 수 있도록 해 주는 수준에서 그쳤다. 이들 서비스는 과거에 발생한 특허 분쟁에 대한 서비스로 미래에 발생할 분쟁에 대한 예측 정보를 제공해 주지 못하였다. 이에 따라, 분쟁의 원고나 피고가 되는 기업 등은 각자 처해진 현실이나, 제품, 기술 및 특허 포트폴리오가 다른 바, 이들에게 특화된 분쟁 예측 정보를 제공해 주는 서비스의 도입이 절실히 필요하다. 나아가 체계적으로 특허 분쟁 리스크를 헤징할 수 있는 정보를 생성하거나, 크로스 라이센싱 대상자나 대상 특허를 발견하는 특허 정보 처리 방법 및 그 시스템도 필요성이 급증하고 있다.In the era of the patent war, patent disputes are increasing worldwide. In the past, it was only at the level of constructing the dispute information in which the patent dispute occurred as a DB, analyzing the dispute information, transmitting the dispute occurrence information to a newsletter, or searching for the dispute information. These services were for patent disputes that occurred in the past and did not provide predictive information about future disputes. As a result, the disputed plaintiffs and the defendant companies are different from each other, and their products, technologies, and patent portfolios are different. Therefore, there is an urgent need to introduce a service that provides them with specialized dispute prediction information. Furthermore, there is an increasing need for patent information processing methods and systems for generating information capable of systematically hedging the risk of patent disputes, or for discovering cross-licensing targets or target patents.

본 발명이 해결하고자 하는 기술적 과제는 특허 리스크 헤징 예측 정보 생성 방법, 그 방법을 실시하는 시스템, 그 방법을 실시하는 프로그램, 그 프로그램이 기록된 기록 매체를 제시하는 것이다.The technical problem to be solved by the present invention is to present a method for generating patent risk hedging prediction information, a system for implementing the method, a program for implementing the method, and a recording medium on which the program is recorded.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, (A) 적어도 한 종류 이상의 특허 분쟁에 사용된 특허를 포함하는 적어도 하나 이상의 분쟁 특허 집합과 적어도 하나 이상의 비분쟁 특허 집합을 입수하는 단계; (B) 상기 분쟁 특허 집합을 구성하는 적어도 2 이상의 분쟁 특허들과 상기 비분쟁 특허 집합을 구성하는 적어도 2 이상의 비분쟁 특허들에 대하여 기 설정된 적어도 2 이상의 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 단계; 및 (C) 상기 분쟁 특허와 상기 비분쟁 특허에 대하여, 상기 분쟁 예측 요소값을 설명 변수값으로 하고, 상기 분쟁 특허에 부여된 분쟁 특허 부여값과 상기 분쟁 특허 부여값과 다르게 부여되며 상기 비분쟁 특허에 부여된 비분쟁 특허 부여값을 반응 변수값으로 하여 기 설정된 통계학적 처리를 수행하여 기 설정된 적어도 하나 이상의 분쟁 예측 모델을 생성하는 적어도 하나 이상의 분쟁 예측 모델을 수립하는 단계;를 포함하는 것을 특징으로 하는 특허 정보 시스템의 특허 분쟁 예측 모델 생성 방법을 제시한다.In order to achieve the technical problem to be achieved by the present invention, (A) obtaining at least one disputed patent set and at least one non-disputed patent set including patents used in at least one or more kinds of patent disputes; (B) generate a dispute prediction element value for at least two dispute prediction elements preset for at least two dispute patents constituting the dispute patent set and at least two non-dispute patents constituting the non-dispute patent set; Making; And (C) the dispute patent and the non-dispute patent, wherein the dispute prediction element value is an explanatory variable value, and is different from the dispute patent grant value and the dispute patent grant value assigned to the dispute patent and are non-dispute. And establishing at least one dispute prediction model for generating at least one dispute prediction model by performing a predetermined statistical process using the non-dispute patent grant value assigned to the patent as a response variable value. A patent dispute prediction model generation method of a patent information system is presented.

상기 특허 분쟁에 사용된 특허는 사법부에 제기하는 특허 분쟁에 사용된 특허, 행정부에 제기하는 특허 분쟁에 사용된 특허, 특허 침해자에 대한 경고장에 사용된 특허, 특허 침해자에 대한 특허권의 행사에 사용된 특허 및 특허 로열티가 발생한 특허 중 어느 한 종류 이상인 것이 바람직하다.The patents used in the patent disputes are used for patents used in patent disputes filed with the judiciary, patents used in patent disputes filed with the administration, patents used in warnings against patent infringers, and the exercise of patent rights against patent infringers. It is preferable that any one or more types of patents used and patents in which patent royalties have occurred occur.

상기 비분쟁 특허 집합의 크기는 상기 분쟁 특허 집합의 크기보다 같거나, 큰 것인 것이며, 상기 비분쟁 특허 집합은 상기 분쟁 특허 집합의 통계학적 속성 중 어느 하나 이상을 공유하면서 전체 특허 집합에서 추출되는 것인 제1 비분쟁 집합 생성 방법 및 랜덤(random)하게 추출되는 제2 비분쟁 집합 생성 방법 중에서 선택되는 어느 하나의 방법으로 생성되는 것인 것이 바람직하다.The size of the non-disputed patent set is equal to or larger than the size of the disputed patent set, and the non-disputed patent set is extracted from the entire patent set while sharing any one or more of the statistical properties of the disputed patent set. It is preferably generated by any one method selected from the first non-dispute set generating method and the second non-dispute set generating method randomly extracted.

상기 분쟁 예측 요소값은 분쟁 예측 요소별로 상기 분쟁 특허 집합과, 상기 비분쟁 특허 집합을 구성하는 개별 특허 단위로 생성되는 것이며, 상기 분쟁 예측 요소는 상기 개별 특허별 자기 속성 계열, 자타 속성 계열, 분야 속성 계열 중 어느 하나 이상의 속성 계열과 관련되는 것인 것이 바람직하다.The dispute prediction element value is generated in each of the dispute patent set and the individual patent units constituting the non-dispute patent set for each dispute prediction element, and the dispute prediction element includes the self-property series, the other attribute series, and the field attribute for each individual patent. It is preferred to be associated with any one or more of the attribute sequences.

상기 분쟁 예측 요소에는 인용 관련 분쟁 예측 요소를 포함하고 있는 것이며, 상기 인용 관련 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성할 때는 직접 인용, 간접 인용, 잠재 인용 및 사슬 인용 중 어느 하나 이상의 인용을 사용하는 것이며, 상기 직접 인용, 간접 인용, 잠재 인용 및 사슬 인용 중 2 이상을 사용하는 방법은 각 인용의 종류별로 독립적으로 처리하여 기 설정된 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 제1 인용 사용 방법 및 2 이상의 인용을 복합적으로 사용하되, 각 인용의 종류별로 기 설정된 가중치를 부여하여 기 설정된 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 제2 인용 사용 방법 중 어느 하나 이상의 방법을 사용하는 것인 것이 바람직하다.The dispute prediction element includes a citation related dispute prediction element, and when generating a dispute prediction element value for the citation related dispute prediction element, one or more citations of direct citation, indirect citation, latent citation, and chain citation are used. The method of using two or more of the direct citation, indirect citation, latent citation, and chain citation uses a first citation to independently process each citation type to generate a dispute prediction element value for a preset dispute prediction element. Using at least one of a second method of using a method and a second citation using a combination of two or more citations, and assigning a predetermined weight to each citation type to generate a dispute prediction element value for the preset dispute prediction element. Is preferably.

상기 분쟁 예측 요소에는 동일한 종류의 분쟁 예측 요소에 대하여 적어도 기간 설정을 달리하는 분쟁 예측 요소가 포함되어 있는 것이며, 상기 기간 설정은 상기 동일한 종류의 분유에 대하여 적어도 하나 이상이 설정되는 것인 것이 바람직하다.It is preferable that the dispute prediction element includes a dispute prediction element having at least a different period setting for the same kind of dispute prediction element, and the period setting is that at least one or more is set for the same type of milk powder. .

상기 분쟁 예측 요소는 적어도 하나 이상의 인용 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 특허 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 유발자 관점 분쟁 예측 요소 및 적어도 하나 이상의 다분쟁 기술군 관점 분쟁 예측 요소 중에서 선택되는 어느 하나 이상을 포함하는 것인 것이 바람직하다.The dispute prediction element may be selected from at least one citation viewpoint dispute prediction element, at least one multi-dispute patent viewpoint dispute prediction element, at least one multi-dispute cause perspective dispute prediction element, and at least one multi-dispute technology group perspective dispute prediction element It is preferred to include any one or more.

상기 분쟁 특허에 상기 분쟁 특허 부여값을 부여하는 방법은, 상기 분쟁 특허의 속성에 따라 다르게 부여되는 제1 분쟁 특허 부여값 부여 방법 및 상기 분쟁 특허의 속성과 무관하게 분쟁 발생 여부로만 분쟁 특허 부여값을 부여하는 제2 분쟁 특허 부여값 부여 방법 중 어느 하나를 사용하는 것이 바람직하다.The method for assigning the dispute patent grant value to the dispute patent may include a method for granting a first dispute patent value assigned differently according to the attributes of the dispute patent and a dispute patent grant value only if a dispute occurs regardless of the attributes of the dispute patent. It is preferable to use any one of the second dispute patent granting value granting methods for granting.

상기 분쟁 특허의 속성은 복수 분쟁 속성, 공동 피고수 속성, 공동 참여 속성 중 어느 하나 이상인 것인 것이며, 상기 복수 분쟁 속성은 상기 분쟁 특허가 적어도 2 이상의 분쟁에 관련되는 속성인 것이며, 상기 공동 피고 속성은 상기 분쟁 특허가 적어도 2 이상의 피고를 상대로 분쟁을 제기하는 분쟁에 관련되는 속성인 것이며, 상기 공동 참여 속성은 상기 분쟁 특허가 관련된 분쟁에 적어도 하나 이상의 다른 분쟁 특허도 관련되는 속성인 것인 것이 바람직하다.The attribute of the disputed patent is one or more of a plural dispute attribute, a joint defendant attribute, and a joint participation attribute. The plural dispute attribute is an attribute in which the dispute patent is related to at least two disputes. Is an attribute relating to a dispute in which the disputed patent raises a dispute against at least two or more defendants, and wherein the joint participation attribute is an attribute in which at least one or more other disputed patents are also related to the dispute related to the disputed patent. Do.

상기 제1 분쟁 특허 부여값 부여 방법을 채용하는 경우, 상기 분쟁 특허 부여값을 부여하는 방법은 상기 분쟁 특허가 복수 분쟁 속성을 가질 경우에는 상기 분쟁 특허 부여값을 복수 분쟁 속성을 가지지 않을 경우보다 높게 부여하는 것이거나, 상기 분쟁 특허가 공동 피고 속성을 가질 경우에는 상기 분쟁 특허 부여값을 공동 피고 속성을 가지지 않을 경우보다 높게 부여하는 것이거나 상기 분쟁 특허가 공동 참여 속성을 가질 경우에는 상기 분쟁 특허 부여값을 공동 참여 속성을 가지지 않을 경우보다 낮게 부여하는 것인 것이 바람직하다.In the case of adopting the method for granting the first dispute patent granting value, the method for granting the dispute patent granting value is higher than the case where the dispute patent granting value has no plural dispute attribute when the dispute patent has plural dispute attributes. Granting the disputed patent if the disputed patent has the attributes of the joint defendant, or granting the disputed patent grant value higher than without the joint defendant. It is desirable to give the value lower than if it does not have a joint participation attribute.

상기 제1 분쟁 특허 부여값 부여 방법을 채용하는 경우, 상기 분쟁 특허 부여값을 부여하는 방법은 상기 분쟁 특허가 상기 복수 분쟁 속성을 가지는 경우, 상기 분쟁 특허 부여값을 부여하는 방법은 상기 분쟁 특허가 관련되는 전체 분쟁수, 전체 분쟁의 시간별 분포 속성, 상기 전체 분쟁의 피고별 분포 속성 중 어느 하나 이상을 참조하여 부여되는 것인 것이거나, 상기 분쟁 특허가 상기 공동 피고 속성을 가지는 경우, 상기 분쟁 특허 부여값을 부여하는 방법은 상기 분쟁 특허가 관련되는 전체 피고수, 분쟁 당 평균 피고수, 분쟁당 피고수의 통계적 분포 속성 중 어느 하나 이상을 참조하여 부여되는 것인 것이거나, 상기 분쟁 특허가 상기 공동 참여 속성을 가지는 경우, 분쟁 특허 부여값을 부여하는 방법은 상기 분쟁 특허가 관련되는 분쟁에서 상기 분쟁 특허의 평균 지분, 상기 평균 지분의 통계적 분포 속성 중 어느 하나 이상을 참조하여 부여되는 것인 것이 바람직하다.In the case of adopting the method for granting the first dispute patent granting value, the method for granting the dispute patent granting value is the method for granting the dispute patent granting value when the dispute patent has the plurality of dispute attributes. The disputed patent, which is granted by referring to at least one of the total number of related disputes, the time-distributed attribute of all disputes, and the distribution-specific distribution attributes of all disputes, or when the dispute patent has the joint defendant attribute. The method of granting a grant value may be granted with reference to any one or more of the total number of defendants to which the dispute patent relates, the average defendant per dispute, and the statistical distribution attribute of the defendant per dispute. In the case of having a joint participation attribute, a method of assigning a dispute patent granting value may be performed in the dispute involving the dispute patent. Preferably, the patent is granted by referring to any one or more of the average share of the patent and the statistical distribution attribute of the average share.

상기 통계학적 처리는 다중 회귀 분석인 것이 바람직하다.Preferably, the statistical treatment is multiple regression analysis.

상기 다중 회귀 분석은 기계 학습 계열의 비선형 알고리즘을 사용하는 것인 것이 바람직하다.Preferably, the multiple regression analysis uses a nonlinear algorithm of machine learning series.

상기 기계 학습 계열의 비선형 알고리즘은 트리(tree)를 활용한 앙상블 기법을 사용하는 것인 것이 바람직하다.It is preferable that the nonlinear algorithm of the machine learning series uses an ensemble technique using a tree.

상기 분쟁 예측 모델은 상기 분쟁 특허의 속성에 따라 다르게 부여되는 제1 분쟁 특허 부여값 부여 방법별, 사용되는 분쟁 예측 요소군별 및 사용되는 분쟁 특허 집합별 중 어느 하나 이상이 적용되어 생성되는 것인 것이 바람직하다.The dispute prediction model is generated by applying any one or more of the method for granting a first dispute patent grant value, the dispute prediction element group used, and the dispute patent set used differently according to the attributes of the dispute patent. desirable.

사용되는 상기 분쟁 특허 집합은 사용자가 지정하는 속성을 충족하는 분쟁 특허 집합인 것인 것이 바람직하다.The dispute patent set used is preferably a dispute patent set that satisfies an attribute specified by the user.

(D) 상기 분쟁 예측 모델을 사용하여 입수되는 특허 집합에 속하는 특허들에 대하여 특허별로 분쟁 예측 모델값을 생성하는 단계;를 더 실시하며, 상기 입수되는 특허 집합은 전체 특허 집합 또는 기 설정된 특허 집합 또는 사용자가 지정한 특허 집합 또는 상기 사용자가 지정한 특허 집합과 관련된 관련 특허 집합 중 어느 하나 이상인 것이며, 생성된 상기 분쟁 예측 모델값은 상기 특허와 대응하여 임시 또는 영구적으로 저장되는 것인 것이거나, 상기 분쟁 예측 모델값을 요청한 자에게 전송하는 것인 것이 바람직하다.(D) generating a dispute prediction model value for each patent for patents belonging to a patent set obtained using the dispute prediction model, wherein the obtained patent set is a whole patent set or a preset patent set. Or a patent set designated by the user or a related patent set related to the patent set designated by the user, wherein the generated dispute prediction model value is stored temporarily or permanently in response to the patent, or the dispute prediction It is desirable to send the model value to the requestor.

상기 분쟁 예측 모델값을 생성하는 것은 기 설정된 주기를 따르거나 또는 기 설정된 조건의 충족 여부에 따라서 생성되는 것인 것이 바람직하다.The generation of the dispute prediction model value is preferably generated according to a predetermined cycle or in accordance with a predetermined condition.

상기의 어느 한 항의 방법을 실시하는 것을 특징으로 하는 특허 정보 시스템을 제시한다.The patent information system which implements the method of any one of the above is proposed.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기의 어느 한 항의 방법을 실시하는 컴퓨터가 읽을 수 있는 프로그램이 기록된 기록 매체를 제시한다.In order to achieve the technical problem to be achieved by the present invention, there is provided a recording medium on which a computer-readable program for implementing the method of any one of the above is recorded.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기의 어느 한 항의 방법을 실시하는 컴퓨터가 읽을 수 있는 프로그램을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a computer-readable program for implementing the method of any one of the above is proposed.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 분쟁 특허 집합을 구성하는 적어도 2 이상의 분쟁 특허들과 상기 비분쟁 특허 집합을 구성하는 적어도 2 이상의 비분쟁 특허들에 대하여 기 설정된 적어도 2 이상의 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 분쟁 예측 요소값 생성부; 상기 분쟁 특허와 상기 비분쟁 특허에 대하여, 상기 분쟁 예측 요소값을 설명 변수값으로 하고, 상기 분쟁 특허에 부여된 분쟁 특허 부여값과 상기 분쟁 특허 부여값과 다르게 부여되며 상기 비분쟁 특허에 부여된 비분쟁 특허 부여값을 반응 변수값으로 하여 기 설정된 통계학적 처리를 수행하여 기 설정된 적어도 하나 이상의 분쟁 예측 모델값을 생성하는 적어도 하나 이상의 분쟁 예측 모델을 수립하는 분쟁 예측 모델 생성부; 및 상기 분쟁 예측 모델을 사용하여 입수되는 특허 집합에 속하는 특허들에 대하여 특허별로 분쟁 예측 모델값을 생성하는 특허별 분쟁 예측 모델값 생성부;를 포함하여 분쟁 예측 정보를 처리하는 것을 특징으로 하는 특허 분쟁 예측 모델을 생성하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, at least two or more dispute prediction elements which are preset for at least two or more disputed patents constituting the disputed patent set and at least two or more non-disputed patents constituting the non-disputed patent set A dispute prediction element value generation unit configured to generate a dispute prediction element value for the; Regarding the dispute patent and the non-dispute patent, the dispute prediction element value is set as an explanatory variable value, and the dispute patent grant value and the dispute patent grant value assigned to the dispute patent are different from the dispute patent grant value. A dispute prediction model generation unit configured to establish at least one dispute prediction model generating a preset at least one dispute prediction model value by performing a predetermined statistical process using the non-dispute patent grant value as a response variable value; And a dispute prediction model value generation unit for each patent for generating a dispute prediction model value for each patent for patents belonging to a patent set obtained using the dispute prediction model. We present a patent information system that creates a dispute prediction model.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, (a) (a1) (a11) 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 먼저 입수하고, (a12) 상기 자기 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수하거나, (a2) (a21) 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고, (a22) 상기 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 자기 특허 집합을 입수하는 단계; (b) 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수하는 단계; 및 (c) 상기 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 분쟁 예측 정보를 생성하는 단계;를 포함하는 것을 특징으로 하는 특허 정보 시스템의 특허 분쟁 예측 정보 처리 방법을 제시한다.In order to achieve the technical problem to be achieved by the present invention, (a) (a1) (a11) first obtain a self-patent set including at least one patent, (a12) at least one or more having a predetermined relationship with the self-set Obtaining a target patent set or (a2) (a21) first obtaining a target patent set including at least one or more patents, and (a22) obtaining at least one self patent set having a predetermined association with the target set; (b) obtaining at least one dispute prediction model value for each patent for each patent constituting the target patent set; And (c) generating at least one or more dispute prediction information by using the dispute prediction model value for each patent.

상기 자기 특허 집합은 사용자가 생성하거나 관리하는 특허 집합인 사용자 관리 특허 집합이거나 상기 특허 정보 시스템이 생성하거나 관리하는 특허 집합인 시스템 관리 특허 집합인 것이며, 상기 자기 특허 집합은 기 설정된 집합 한정 옵션을 사용하여 한정되는 것이거나, 기 설정된 분할 기준을 적용하여 2 이상의 부분 특허 집합으로 분할 될 수 있는 것인 것이며, 상기 타겟 특허 집합은 사용자가 생성하거나 관리하는 특허 집합인 사용자 관리 특허 집합이거나 상기 특허 정보 시스템이 생성하거나 관리하는 특허 집합인 시스템 관리 특허 집합인 것이며, 상기 타겟 특허 집합은 기 설정된 집합 한정 옵션을 사용하여 한정되는 것이거나, 기 설정된 분할 기준을 적용하여 2 이상의 부분 특허 집합으로 분할 될 수 있는 것인 것이 바람직하다.The self patent set is a user management patent set which is a patent set created or managed by a user, or a system management patent set which is a patent set created or managed by the patent information system, and the self patent set uses a preset set limitation option. It is to be limited to, or to be divided into two or more partial patent set by applying a predetermined division criteria, the target patent set is a user management patent set which is a patent set created or managed by the user or the patent information system This is a system management patent set, which is a patent set that is created or managed, and the target patent set may be defined using a predetermined set limiting option, or may be divided into two or more partial patent sets by applying a predetermined division criterion. It is preferable.

상기 타겟 특허 집합은 상기 자기 특허 집합을 구성하는 자기 특허들과 기 설정된 관련성을 가지는 특허들로 이루어지는 관련 특허 관계인 것이며, 상기 기 설정된 관련성은 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상인 것이 바람직하다.The target patent set is a related patent relationship consisting of patents having a preset relationship with self patents constituting the self patent set, and the preset relationship is a citation-citation patent relationship, a similar patent group relationship in a text mining technique. In the patent classification, it is preferable that at least one of the similar technology patent relations be used.

상기 타겟 특허 집합을 구성하는 타겟 특허들은 2 이상의 자기 특허들과 기 설정된 관련성을 가질 수 있는 것인 것이며, 상기 관련성은 계량화 될 수 있는 것이며, 상기 계량화는 관련성 빈도 및 관련성 강도 중 어느 하나 이상을 포함하여 구성되는 것인 것이 바람직하다.Target patents constituting the target patent set may have a predetermined association with two or more self patents, the association may be quantified, and the quantification includes any one or more of an association frequency and an association strength. It is preferable that it is configured by.

상기 관련성 빈도는 상기 타겟 특허에 대응되는 기 설정된 관련성을 가지는 자기 특허의 개수인 것이며, 상기 관련성 강도는 인용-피인용 특허 관계 정보, 유사 특허군 관계 정보 및 유사 기술 특허 관계 정보 중 어느 하나 이상을 사용하여 생성되는 것인 것이 바람직하다.The relevance frequency is the number of self patents having a predetermined relevance corresponding to the target patent, and the relevance strength is one or more of citation-cited citation relationship information, similar patent group relationship information, and similar technology patent relationship information. It is preferable that it is produced using.

상기 인용-피인용 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 인용-피인용 깊이 정보 및 인용-피인용 종류 정보 중 어느 하나 이상을 사용하여 생성되는 것을 포함하는 것인 것이 바람직하다.Generating the relevance strength with the cited-cited citation relationship information preferably includes using one or more of cited-cited depth information and cited-cited type information.

상기 유사 특허군 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 자기 특허와 상기 타겟 특허의 유사도 정보를 사용하여 생성되는 것인 것이며, 상기 유사도 정보는 상기 자기 특허와 상기 타겟 특허에서 추출되는 추출 키워드, 상기 자기 특허와 상기 타겟 특허의 레퍼런스(reference) 정보에서 생성되는 인용-피인용 정보, 상기 자기 특허와 상기 타겟 특허에서 추출되는 적어도 한 종류 이상의 특허 분류 및 상기 특허 분류가 속하는 특허 분류 체계 상의 상기 특허 분류의 적어도 한 단계 이상의 상위 특허 분류 중 어느 하나 이상의 활용을 포함하여 생성되는 것인 것이 바람직하다.Generating the relevance strength using the similar patent group relationship information is generated by using similarity information between the self patent and the target patent, and the similarity information is an extracted keyword extracted from the self patent and the target patent, Citation-cited information generated from reference information of the own patent and the target patent, at least one or more types of patent classifications extracted from the own patent and the target patent, and the patent on the patent classification system to which the patent classification belongs Preferably at least one step of the classification is generated including the utilization of any one or more of the upper patent classification.

상기 유사 기술 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 자기 특허와 상기 타겟 특허에 포함되어 있는 적어도 한 종류 이상의 특허 분류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상에서의 일치 깊이, 일치 빈도 및 일치 서열 중 어느 하나 이상을 활용하여 생성되는 것인 것이 바람직하다.Generating the relevance strength with the similar technology patent relationship information may correspond to at least one or more types of patent classifications included in the self patent and the target patent, the matching depth, the matching frequency, and the like in the patent classification system to which the patent classification belongs. It is preferably one that is generated utilizing any one or more of the matching sequences.

상기 특허별 분쟁 예측 모델값은 적어도 하나 이상의 인용 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 특허 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 유발자 관점 분쟁 예측 요소 및 적어도 하나 이상의 다분쟁 기술군 관점 분쟁 예측 요소 중에서 선택되는 어느 하나 이상의 분쟁 예측 요소의 분쟁 예측 요소값을 사용하여 생성되는 것인 것이 바람직하다.The dispute prediction model value for each patent may include at least one citation viewpoint dispute prediction element, at least one multi-dispute patent viewpoint dispute prediction element, at least one multi-dispute cause perspective dispute prediction element and at least one multi-dispute technology group perspective dispute prediction. It is preferable that it is generated using the dispute prediction element value of at least one dispute prediction element selected from the elements.

상기 특허별 분쟁 예측 모델값을 입수하는 것은 상기 타겟 특허에 대하여 실시간으로 상기 특허별 분쟁 예측 모델값을 생성하거나, 기 생성된 특허별 분쟁 예측 모델값을 저장하고 있는 특허별 분쟁 예측 모델값 DB에서 호출하는 것인 것이 바람직하다.Obtaining the dispute prediction model value for each patent includes generating a dispute prediction model value for each patent in real time with respect to the target patent or in a patent dispute prediction model value DB for storing a previously generated dispute prediction model value for each patent. It is desirable to call.

상기 생성되는 분쟁 예측 정보는 입수된 상기 자기 집합에 대응되는 타겟 특허별 분쟁 예측 정보, 입수된 상기 자기 집합에 대응되는 적어도 하나 이상의 타겟 특허군별 분쟁 예측 정보, 입수된 상기 자기 집합에 대응되는 타겟 특허 집합에 대한 분쟁 예측 정보, 적어도 하나 이상의 부분 자기 집합에 대응되는 타겟 특허별 분쟁 예측 정보, 적어도 하나 이상의 부분 자기 집합에 대응되는 적어도 하나 이상의 타겟 특허군별 분쟁 예측 정보 중 어느 하나 이상인 것이 바람직하다.The generated dispute prediction information may include dispute prediction information for each target patent corresponding to the acquired self-set, dispute prediction information for at least one target patent group corresponding to the acquired self-set, and target patent corresponding to the acquired self-set At least one of the dispute prediction information for the set, the dispute prediction information for each target patent corresponding to the at least one partial self-set, and the at least one dispute prediction information for the at least one target patent group corresponding to the at least one partial self-set is preferably one or more.

상기 분쟁 예측 정보는 기 설정된 적어도 하나 이상의 분쟁 예측 정보값, 적어도 하나 이상의 분쟁 예측 분석 정보, 적어도 하나 이상의 분쟁 예측 근거 정보 중 어느 하나 이상인 것이 바람직하다.The dispute prediction information may be at least one of preset at least one dispute prediction information value, at least one dispute prediction analysis information, and at least one dispute prediction basis information.

상기 (a12) 단계는 기 설정된 조건을 충족하는 특허만을 상기 타겟 특허로 입수하는 것인 것이거나, 상기 (a22) 단계는 기 설정된 조건을 충족하는 특허만을 자기 특허로 입수하는 것인 것이며, 상기 기 설정된 조건은 상기 사용자가 설정하거나, 상기 특허 정보 시스템이 설정하는 것인 것이 바람직하다.The step (a12) is to obtain only a patent that meets a predetermined condition as the target patent, or the step (a22) is to obtain only a patent that meets a predetermined condition as its own patent. The set condition is preferably set by the user or set by the patent information system.

상기 기 설정된 조건은 권리자 조건, 관련성 조건, 및 개별 특허 속성 조건 중 어느 하나 이상인 것이 바람직하다.Preferably, the preset condition is any one or more of an owner condition, a relevance condition, and an individual patent attribution condition.

상기 타겟 특허 집합을 구성하는 적어도 하나 이상의 타겟 특허 및 상기 타겟 특허 집합에 대하여 기 설정된 분할 기준 또는 기 설정된 선별 기준을 적용한 적어도 하나 이상의 부분 타겟 특허 집합 중 어느 하나 이상은 기 설정된 랭킹을 가지는 것이며, 상기 랭킹은 상기 관련성 빈도, 상기 관련성 강도, 상기 분쟁 예측 모델값 및 사용자 입력 관련성 정보 중 어느 하나 이상을 사용하는 랭킹 생성 규칙에 따라서 생성되는 것인 것이 바람직하다.At least one of at least one target patent constituting the target patent set and at least one partial target patent set to which a predetermined division criterion or a predetermined selection criterion is applied to the target patent set has a predetermined ranking. The ranking is preferably generated according to a ranking generation rule using any one or more of the relevance frequency, the relevance strength, the dispute prediction model value, and user input relevance information.

상기 (a12) 단계의 상기 타겟 특허 집합의 입수는 분할된 상기 부분 자기 특허 집합별로 입수하는 것인 것이며, 이 경우 상기 (c) 단계의 상기 분쟁 예측 정보를 생성하는 것은 상기 부분 자기 특허 집합에 대응되는 부분 타겟 특허 집합별로 생성되는 것인 것이거나 상기 (a22) 단계의 상기 자기 특허 집합의 입수는 분할된 상기 부분 타겟 특허 집합별로 입수하는 것인 것이며, 이 경우 상기 (c) 단계의 상기 분쟁 예측 정보를 생성하는 것은 상기 부분 타겟 특허 집합에 대응되는 부분 자기 특허 집합별로 생성되는 것인 것이 바람직하다.Acquiring the target patent set in the step (a12) is to obtain the divided partial patents, and in this case, generating the dispute prediction information in the step (c) corresponds to the partial self-patent set. Is to be generated for each partial target patent set to be obtained or to obtain the self-patent set in the step (a22) is to obtain for each of the divided partial target patent set, in this case the dispute prediction of the step (c) Generating the information is preferably generated for each partial self patent set corresponding to the partial target patent set.

상기 (a) 단계에서,(a1)의 방법이 선택되는 경우, 상기 (a11) 단계의 상기 자기 특허 집합의 입수는 적어도 2 이상의 자기 특허 집합의 입수인 것이며, 상기 (a12) 단계의 상기 타겟 특허 집합의 입수는 2 이상의 상기 자기 특허 집합별로 입수하는 것인 것이며, 상기 (c) 단계의 상기 분쟁 예측 정보를 생성하는 것은 2 이상의 상기 자기 특허 집합에 대응되는 타겟 특허 집합별로 생성되는 것인 것이거나 (a2)의 방법이 선택되는 경우, 상기 (a21) 단계의 상기 타겟 특허 집합의 입수는 적어도 2 이상의 타겟 특허 집합의 입수인 것이며, 상기 (a22) 단계의 상기 자기 특허 집합의 입수는 2 이상의 상기 타겟 특허 집합별로 입수하는 것인 것이며, 상기 (c) 단계의 상기 분쟁 예측 정보를 생성하는 것은 2 이상의 상기 타겟 특허 집합에 대응되는 자기 특허 집합별로 생성되는 것인 것이 바람직하다.In the step (a), when the method of (a1) is selected, the acquisition of the self-patent set in the step (a11) is the acquisition of at least two self-patent sets, and the target patent of the step (a12) Acquisition of a set is obtained by each of two or more self-patent sets, and generating the dispute prediction information of step (c) is generated by a target patent set corresponding to the two or more self-patent sets. When the method of (a2) is selected, the acquisition of the target patent set in the step (a21) is the acquisition of at least two target patent sets, and the acquisition of the self patent set in the step (a22) is two or more of the It is obtained by the target patent set, and generating the dispute prediction information of the step (c) is generated for each self patent set corresponding to two or more target patent set It would be preferable.

상기 타겟 특허 집합의 입수는 사용자가 생성하거나 선택하는 적어도 하나 이상의 사용자 특허 집합인 것이며, 상기 사용자 특허 집합이 상기 자기 특허 집합과 가지는 관련성은 상기 자기 특허 집합을 구성하는 자기 특허 중 적어도 어느 하나와 상기 사용자 특허 집합을 구성하는 타겟 특허 중 적어도 어느 하나가 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상의 관계를 계산할 수 있는 것인 것이 바람직하다.The acquisition of the target patent set is at least one user patent set generated or selected by a user, and the association of the user patent set with the self patent set is related to at least one of the self patents constituting the self patent set and the user. It is preferable that at least one of the target patents constituting the patent set is capable of calculating any one or more of the citation-citation patent relationship, the similar patent group relationship in the text mining technique, and the similar technology patent relationship in the patent classification.

상기 (c) 단계의 상기 분쟁 예측 정보를 생성하는 것은 자기 특허별로 생성되는 것이거나, 적어도 2 이상의 분할되거나 선별되는 자기 특허 집합별로 생성되는 것이거나, 적어도 2 이상의 분할되거나 선별되는 타겟 특허 집합별로 생성되는 것인 것이며, 상기 자기 특허 집합의 선별이나 분할 또는 상기 타겟 특허 집합의 선별이나 분할은 사용자에 의해서 선별되거나 분할되는 방식 또는 상시 시스템의 기 설정된 선별 기준이나 분할 기준에 의해 분할되는 방식 중 어느 하나 이상이 적용되는 것인 것이 바람직하다.Generating the dispute prediction information of the step (c) is generated for each patent, or for each of at least two divided or selected self patent sets, or for each of at least two divided or selected target patent sets. The selection or division of the self-patent set or the selection or division of the target patent set may be selected or divided by a user or divided by predetermined selection criteria or division criteria of the system at all times. It is preferable that the above is applied.

상기 기 설정된 선별 기준이나 분할 기준은 타겟 특허 집합을 구성하는 타겟 특허의 권리자명, 권리자 속성, 특허 기술 분류, 기 설정된 적어도 하나 이상의 평가값 중 어느 하나 이상인 것인 것이 바람직하다.The predetermined selection criterion or division criterion may be any one or more of an owner name, an attribute of an owner, a patent technology classification, and at least one predetermined evaluation value of a target patent constituting a target patent set.

상기 권리자 속성은 상기 시스템이 지정하는 적어도 하나 이상의 속성 및 상기 사용자가 지정하는 적어도 하나 이상의 속성 중 어느 하나 이상인 것이거나, 상기 평가값은 상기 타겟 특허에 대한 적어도 하나 이상의 질적 평가 요소 중 어느 하나 이상을 사용하여 평가된 질적 평가 요소값인 것이 바람직하다.The rights holder attribute may be any one or more of at least one attribute specified by the system and at least one attribute specified by the user, or the evaluation value may be any one or more of at least one qualitative evaluation element for the target patent. It is preferably a qualitative evaluation factor value evaluated using.

상기 (b) 단계의 특허별 분쟁 예측 모델값의 입수는 입수된 1차 타겟 특허 집합에 대하여 기 설정된 집합 연산을 수행한 후에 생성되는 2차 타겟 특허 집합을 구성하는 2차 타겟 특허를 대상으로 실시되는 것이며, 상기 2차 타겟 특허 집합은 상기 1차 타겟 특허 집합을 구성하는 1차 타겟 특허에 대한 삭제, 상기 1차 타겟 특허 집합에 대한 한정 및 2차 타겟 특허 집합으로의 신규 특허 추가 중 어느 하나 이상이 실시되어 생성되는 것인 것이 바람직하다.Obtaining a dispute prediction model value for each patent in the step (b) is performed on a secondary target patent constituting a secondary target patent set generated after performing a predetermined set operation on the obtained primary target patent set. The secondary target patent set is any one of deletion of the primary target patents constituting the primary target patent set, limitation on the primary target patent set, and addition of a new patent to the secondary target patent set. It is preferable that the above is performed and produced.

상기 (c) 단계의 상기 분쟁 예측 정보의 생성은 상기 사용자로부터 입수 받은 사용자 가중치 정보를 반영하여 생성되는 것인 것이며, 상기 사용자 가중치 정보는 상기 타겟 특허 집합을 구성하는 타겟 특허 중 어느 하나 이상에 대하여 개별 타겟 특허별로 상기 사용자가 설정하는 타겟 특허별 가중치 정보, 타겟 특허의 개별 속성에 설정하는 속성별 가중치 정보, 인용-피인용 관계에 대해 설정하는 인용-피인용 가중치 정보, 텍스트 마이닝 관계에 대해 설정하는 텍스트 마이닝 가중치 정보, 특허 분류상 유사 기술 특허 관계에 대해 설정하는 유사 기술 특허 관계 가중치 정보 중 어느 하나 이상인 것이 바람직하다.The generation of the dispute prediction information in the step (c) is to be generated by reflecting the user weight information received from the user, the user weight information for any one or more of the target patents constituting the target patent set Weight information for each target patent set by the user for each target patent, weight information for each attribute set for individual properties of the target patent, citation-to-cited weight information for the citation-to-cited relationship, and text mining relationship The text mining weight information and the similar technology patent relationship weight information set for the similar technology patent relationship in the patent classification are preferably one or more.

상기 분쟁 예측 모델값은 분쟁 특허와 비분쟁 특허에 대하여, 분쟁 예측 요소값을 설명 변수값으로 하고, 상기 분쟁 특허에 부여된 분쟁 특허 부여값과 상기 분쟁 특허 부여값과 다르게 부여되며 상기 비분쟁 특허에 부여된 비분쟁 특허 부여값을 반응 변수값으로 하여 기 설정된 통계학적 처리를 수행하여 생성되는 것인 것이 바람직하다.The dispute prediction model value is set to a dispute prediction element value as an explanatory variable value for dispute patents and non-dispute patents, and is provided differently from the dispute patent grant value and the dispute patent grant value assigned to the dispute patent, and the non-dispute patent It is preferable that the non-dispute patent grant value given to the response variable value is generated by performing a predetermined statistical process.

상기 통계학적 처리는 트리(tree)를 활용한 앙상블 기법을 사용하는 기계 학습 계열의 알고리즘을 사용하는 것인 것이 바람직하다.Preferably, the statistical processing uses a machine learning sequence algorithm using an ensemble technique using a tree.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기의 어느 한 항의 방법을 실시하는 것을 특징으로 하는 특허 분쟁 예측 정보를 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, there is proposed a patent information system for processing patent dispute prediction information, characterized in that the method of any one of the above.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 입수하는 자기 특허 집합 생성부; 상기 자기 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 생성하거나 입수하는 타겟 특허 집합 생성부; 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수하는 분쟁 예측 모델값 입수부; 및 상기 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 분쟁 예측 정보를 생성하는 분쟁 예측 정보 생성부;를 포함하는 것을 특징으로 하는 특허 분쟁 예측 정보를 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a self-patent set generation unit for obtaining a self-patent set including at least one patent; A target patent set generation unit generating or obtaining at least one target patent set having a predetermined relation with the self set; A dispute prediction model value obtaining unit which obtains at least one dispute prediction model value for each patent for the individual patents constituting the target patent set; And a dispute prediction information generation unit for generating at least one dispute prediction information by using the dispute prediction model value of each patent.

상기 자기 특허 집합을 구성하는 자기 특허와 상기 타겟 특허 집합을 구성하는 타겟 특허 간에서 기 설정된 관련성을 계산하는 다중 관계성 처리 모듈;를 더 포함하며 상기 다중 관계성 처리 모듈이 계산하는 상기 기 설정된 관련성은 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상인 것이 바람직하다.And a multiple relationship processing module that calculates a predetermined relationship between a self patent constituting the self patent set and a target patent constituting the target patent set. Is preferably at least one of a citation-cited patent relationship, a similar patent group relationship on a text mining technique, and a similar technology patent relationship on a patent classification.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, (I) (I1) (I11) 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 먼저 입수하고, (I12) 상기 자기 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수하거나, (I2) (I21) 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고, (I22) 상기 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 자기 특허 집합을 입수하는 단계; (J) 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 라이센싱 예측 모델값을 입수하는 단계; 및 (K) 상기 특허별 라이센싱 예측 모델값을 활용하여 적어도 하나 이상의 라이센싱 예측 정보를 생성하는 단계;를 포함하는 것인 것을 특징으로 하는 특허 정보 시스템의 특허 라이센싱 예측 정보 처리 방법을 제시한다.In order to achieve the technical problem to be achieved by the present invention, (I) (I1) (I11) first obtain a self-patent set including at least one or more patents, (I12) at least one or more having a predetermined relationship with the self-set Obtaining a target patent set or (I2) (I21) first obtaining a target patent set including at least one or more patents, and (I22) obtaining at least one self patent set having a predetermined association with the target set; (J) obtaining at least one license prediction model value for each patent for each patent constituting the target patent set; And (K) generating at least one license prediction information by using the license prediction model value for each patent. The method provides a method for processing patent licensing prediction information of a patent information system.

상기 (I12) 단계에서 상기 타겟 특허 집합은 상기 자기 특허 집합을 구성하는 자기 특허의 최선일을 기준으로 하여 후출원인 것이거나, 상기 (I22) 단계에서 상기 자기 특허 집합은 상기 타겟 특허 집합을 구성하는 타겟 특허의 최선일을 기준으로 하여 선출원인 것인 것이 바람직하다.In the step (I12), the target patent set is a post-application based on the best date of the self patent constituting the self patent set, or in the step (I22), the self patent set is a target constituting the target patent set. It is preferable that it is a prior application on the basis of the best date of a patent.

상기 자기 특허와 상기 타겟 특허는 기 설정된 관련성을 가지는 특허들로 이루어지는 관련 특허 관계인 것이며, 상기 기 설정된 관련성은 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상인 것이며, 상기 타겟 특허 집합을 구성하는 타겟 특허들은 2 이상의 자기 특허들과 기 설정된 관련성을 가질 수 있는 것인 것이며, 상기 관련성은 계량화 될 수 있는 것이며, 상기 계량화는 관련성 빈도 및 관련성 강도 중 어느 하나 이상을 포함하여 구성되는 것인 것이 바람직하다.The self-patent and the target patent are related patent relationships consisting of patents having a predetermined relationship, and the predetermined relationship is a citation-to-citation patent relationship, a similar patent group relationship in a text mining technique, and a similar technology patent relationship in a patent classification. Any one or more of the target patents constituting the target patent set may have a predetermined relationship with two or more self patents, the relevance may be quantified, and the quantification is the relevance frequency and relevance intensity It is preferable that it is comprised including any one or more of these.

상기 인용-피인용 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 인용-피인용 깊이 정보 및 인용-피인용 종류 정보 중 어느 하나 이상을 사용하여 생성되는 것을 포함하는 것이거나 상기 유사 특허군 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 자기 특허와 상기 타겟 특허의 유사도 정보를 사용하여 생성되는 것인 것이거나, 상기 유사 기술 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 자기 특허와 상기 타겟 특허에 포함되어 있는 적어도 한 종류 이상의 특허 분류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상에서의 일치 깊이, 일치 빈도 및 일치 서열 중 어느 하나 이상을 활용하여 생성되는 것이며, 상기 유사도 정보는 상기 자기 특허와 상기 타겟 특허에서 추출되는 추출 키워드, 상기 자기 특허와 상기 타겟 특허의 레퍼런스 정보에서 생성되는 인용-피인용 정보, 상기 자기 특허와 상기 타겟 특허에서 추출되는 적어도 한 종류 이상의 특허 분류 및 상기 특허 분류가 속하는 특허 분류 체계 상의 상기 특허 분류의 적어도 한 단계 이상의 상위 특허 분류 중 어느 하나 이상의 활용을 포함하여 생성되는 것인 것이며, 상기 유사 기술 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 자기 특허와 상기 타겟 특허에 포함되어 있는 적어도 한 종류 이상의 특허 분류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상에서의 일치 깊이, 일치 빈도 및 일치 서열 중 어느 하나 이상을 활용하여 생성되는 것인 것이 바람직하다.Generating the relevance strength with the citation-cited patent relationship information may include generating using at least one of citation-cited depth information and citation-cited citation type information or as the similar patent group relationship information. Generating the relevance strength is generated using similarity information between the self patent and the target patent, or generating the relevance strength with the similar technology patent relationship information is included in the self patent and the target patent. The at least one kind of patent classification, which is generated by utilizing any one or more of the matching depth, the matching frequency and the matching sequence on the patent classification system to which the patent classification belongs, the similarity information is the self-patent and the target Extracted keywords extracted from patents, the self-patent and the target feature Citation-cited information generated from fictitious reference information, at least one or more types of patent classifications extracted from the self-patent and the target patent, and at least one level higher patent classification of the patent classification on the patent classification system to which the patent classification belongs It is generated by using any one or more of the above, and generating the relevance strength with the similar technology patent relationship information is at least one kind of patent classification included in the self-patent and the target patent, the patent Preferably, the classification is generated by utilizing any one or more of the matching depth, the matching frequency, and the matching sequence on the patent classification system to which the classification belongs.

상기 생성되는 라이센싱 예측 정보는 상기 타겟 집합에 대응되는 자기 특허별 라이센싱 예측 정보, 상기 타겟 집합에 대응되는 적어도 하나 이상의 자기 특허군별 라이센싱 예측 정보, 상기 타겟 집합에 대응되는 자기 특허 집합에 대한 라이센싱 예측 정보, 적어도 하나 이상의 부분 타겟 집합에 대응되는 자기 특허별 라이센싱 예측 정보, 적어도 하나 이상의 부분 타겟 집합에 대응되는 적어도 하나 이상의 자기 특허군별 라이센싱 예측 정보 중 어느 하나 이상인 것이 바람직하다.The generated licensing prediction information includes licensing prediction information per self patent corresponding to the target set, licensing prediction information for at least one self patent group corresponding to the target set, and licensing prediction information for the self patent set corresponding to the target set. The at least one sub-target set may be one or more of the licensing prediction information for each patent corresponding to the at least one sub-target set or at least one self-patent licensing prediction information corresponding to the at least one partial target set.

상기 라이센싱 예측 정보는 기 설정된 적어도 하나 이상의 라이센싱 예측 정보값, 적어도 하나 이상의 라이센싱 예측 분석 정보, 적어도 하나 이상의 라이센싱 예측 근거 정보 중 어느 하나 이상인 것이 바람직하다.The licensing prediction information may be at least one of at least one preset license prediction information value, at least one license prediction analysis information, and at least one license prediction basis information.

상기 (I12) 단계의 상기 타겟 특허 집합의 입수는 기 설정된 조건을 충족하는 특허만을 타겟 특허로 입수하는 것인 것이거나, 상기 (I12) 단계의 상기 타겟 특허 집합의 입수는 분할된 상기 부분 자기 특허 집합별로 입수하는 것인 것이거나, 상기 (I12) 단계의 상기 타겟 특허 집합의 입수는 2 이상의 상기 자기 특허 집합별로 입수하는 것인 것이며, 상기 기 설정된 조건은 상기 사용자가 설정하거나, 상기 특허 정보 시스템이 설정하는 것인 것이며, 상기 기 설정된 조건은 권리자 조건, 관련성 조건, 권리자 매출 규모, 권리자 최근 분쟁 피소 정보 및 개별 특허 속성 조건 중 어느 하나 이상인 것인 것이며, 상기 타겟 특허 집합의 입수가 분할된 상기 부분 자기 특허 집합별로 입수하는 것인 경우, 상기 (d) 단계의 상기 라이센싱 예측 정보를 생성하는 것은 상기 부분 자기 특허 집합에 대응되는 부분 타겟 특허 집합별로 생성되는 것인 것이며, 상기 타겟 특허 집합의 입수가 2 이상의 상기 자기 특허 집합별로 입수하는 것인 경우, 상기 (I) 단계의 상기 자기 특허 집합의 입수는 적어도 2 이상의 자기 특허 집합의 입수인 것이며, 상기 (d) 단계의 상기 라이센싱 예측 정보를 생성하는 것은 2 이상의 상기 자기 특허 집합에 대응되는 타겟 특허 집합별로 생성되는 것인 것이 바람직하다.Obtaining the target patent set in the step (I12) is to obtain only a patent that satisfies a predetermined condition as a target patent, or obtaining the target patent set in the step (I12) is the divided partial patent Acquisition for each set or acquisition of the target patent set in the step (I12) is for acquiring for each of two or more self-patent sets, and the preset condition is set by the user or the patent information system. The predetermined condition is any one or more of rights holder conditions, relevance conditions, owner sales volume, rights holder recent dispute suit information and individual patent attribute conditions, the acquisition of the target patent set is divided When obtaining the partial self patent set, generating the licensing prediction information of the step (d) When the partial target patent set corresponding to the partial self patent set is generated and the acquisition of the target patent set is obtained for each of the two or more self patent sets, the acquisition of the self patent set in the step (I) Is the acquisition of at least two self-patent sets, and the generating of the licensing prediction information of step (d) is preferably generated for each target patent set corresponding to the at least two self-patent sets.

상기 (I22) 단계의 상기 자기 특허 집합의 입수는 기 설정된 조건을 충족하는 특허만을 자기 특허로 입수하는 것인 것이거나, 상기 (I22) 단계의 상기 자기 특허 집합의 입수는 분할된 상기 부분 타겟 특허 집합별로 입수하는 것인 것이거나, 상기 (I22) 단계의 상기 자기 특허 집합의 입수는 2 이상의 상기 타겟 특허 집합별로 입수하는 것인 것이며, 상기 기 설정된 조건은 상기 사용자가 설정하거나, 상기 특허 정보 시스템이 설정하는 것인 것이며, 상기 기 설정된 조건은 권리자 조건, 관련성 조건, 권리자 매출 규모, 권리자 최근 분쟁 피소 정보 및 개별 특허 속성 조건 중 어느 하나 이상인 것인 것이며, 상기 자기 특허 집합의 입수가 분할된 상기 부분 타겟 특허 집합별로 입수하는 것인 경우, 상기 (d) 단계의 상기 라이센싱 예측 정보를 생성하는 것은 상기 부분 타겟 특허 집합에 대응되는 부분 자기 특허 집합별로 생성되는 것인 것이며, 상기 자기 특허 집합의 입수가 2 이상의 상기 타겟 특허 집합별로 입수하는 것인 경우, 상기 (I) 단계의 상기 타겟 특허 집합의 입수는 적어도 2 이상의 타겟 특허 집합의 입수인 것이며, 상기 (d) 단계의 상기 라이센싱 예측 정보를 생성하는 것은 2 이상의 상기 타겟 특허 집합에 대응되는 자기 특허 집합별로 생성되는 것인 것이 바람직하다.Acquiring the self-patent set in the step (I22) is to obtain only a patent that satisfies a predetermined condition as a self-patent, or acquiring the self-patent set in the (I22) step is divided partial patents Acquisition of the self-patent set in the step (I22) is obtained by the target patent set of two or more, and the preset condition is set by the user or the patent information system. The predetermined condition is any one or more of rights holder conditions, relevance conditions, owner sales volume, rights holder dispute dispute information and individual patent attribute conditions, and the acquisition of the self-patent set When obtaining by the partial target patent set, generating the licensing prediction information of the step (d) is If the self-patent set corresponding to the partial target patent set corresponding to the partial target patent set is generated, and the acquisition of the self-patent set is obtained for each target patent set of two or more, the acquisition of the target patent set in the step (I) Is the acquisition of at least two or more target patent sets, and the generating of the licensing prediction information of step (d) is preferably generated for each of the own patent sets corresponding to the two or more target patent sets.

상기 자기 특허 집합을 구성하는 자기 특허 및 적어도 하나 이상의 분할 기준 또는 적어도 하나 이상의 선별 기준이 적용된 적어도 하나 이상의 부분 자기 특허 집합 중 어느 하나 이상은 기 설정된 랭킹을 가지는 것이며, 상기 랭킹은 상기 관련성 빈도, 상기 관련성 강도 및 상기 라이센싱 예측 모델값 중 어느 하나 이상을 사용하는 랭킹 생성 규칙에 따라서 생성되는 것인 것이 바람직하다.At least one of self patents constituting the self patent set and at least one partial self patent set to which at least one or more division criteria or at least one or more selection criteria are applied has a predetermined ranking, wherein the ranking is the relevance frequency, the It is preferably generated according to a ranking generation rule using any one or more of the relevance strength and the licensing prediction model value.

상기 (K) 단계의 상기 라이센싱 예측 정보를 생성하는 것은 적어도 2 이상의 분할되거나 선별되는 타겟 특허 집합별로 생성되는 것인 것이거나, 상기 (K) 단계의 상기 라이센싱 예측 정보의 생성은 상기 사용자로부터 입수 받은 사용자 가중치 정보를 반영하여 생성되는 것인 것이며, 상기 타겟 특허 집합의 선별이나 분할은 사용자에 의해서 선별되거나 분할되는 방식 또는 상시 시스템의 기 설정된 선별 기준이나 분할 기준에 의해 분할되는 방식 중 어느 하나 이상이 적용되는 것인 것이며, 상기 기 설정된 선별 기준이나 분할 기준은 타겟 특허 집합을 구성하는 타겟 특허의 권리자명, 권리자 속성, 특허 기술 분류, 기 설정된 적어도 하나 이상의 평가값 중 어느 하나 이상인 것인 것이며, 상기 사용자 가중치 정보는 상기 타겟 특허 집합을 구성하는 타겟 특허 중 어느 하나 이상에 대하여 개별 타겟 특허별로 상기 사용자가 설정하는 타겟 특허별 가중치 정보, 자기 특허의 개별 속성에 설정하는 속성별 가중치 정보, 인용-피인용 관계에 대해 설정하는 인용-피인용 가중치 정보, 텍스트 마이닝 관계에 대해 설정하는 텍스트 마이닝 가중치 정보, 특허 분류상 유사 기술 특허 관계에 대해 설정하는 유사 기술 특허 관계 가중치 정보 중 어느 하나 이상인 것이 바람직하다.The generating of the licensing prediction information of the step (K) may be generated by at least two or more divided or selected target patent sets, or the generation of the licensing prediction information of the (K) step may be obtained from the user. It is generated by reflecting the user weight information, wherein the selection or division of the target patent set is selected or divided by the user or at least one of the methods divided by predetermined selection criteria or division criteria of the system at all times. The predetermined selection criterion or division criterion may be any one or more of an owner's name, an attribute of an owner, a patent technology classification, and at least one predetermined evaluation value of a target patent constituting a target patent set. User weight information is a target constituting the target patent set Weight information for each target patent set by the user for each or more patents of each patent, weight information for each attribute set for individual attributes of own patents, and citation-cited weight information set for a citation-cited relationship , At least one of text mining weight information set for the text mining relationship and similar technology patent relationship weight information set for the similar technology patent relationship in the patent classification.

상기 (J) 단계의 특허별 분쟁 예측 모델값의 입수는 입수된 1차 타겟 특허 집합에 대하여 기 설정된 집합 연산을 수행한 후에 생성되는 2차 타겟 특허 집합을 구성하는 2차 타겟 특허를 대상으로 실시되는 것이며, 상기 2차 타겟 특허 집합은 상기 1차 타겟 특허 집합을 구성하는 1차 타겟 특허에 대한 삭제, 상기 1차 타겟 특허 집합에 대한 한정 및 2차 타겟 특허 집합으로의 신규 특허 추가 중 어느 하나 이상이 실시되어 생성되는 것인 것이 바람직하다.Obtaining the dispute prediction model value for each patent in the step (J) is performed on the secondary target patents constituting the secondary target patent set generated after performing a predetermined set operation on the obtained primary target patent set. The secondary target patent set is any one of deletion of the primary target patents constituting the primary target patent set, limitation on the primary target patent set, and addition of a new patent to the secondary target patent set. It is preferable that the above is performed and produced.

상기 특허별 라이센싱 예측 모델값은 적어도 하나 이상의 인용 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 특허 관점 분쟁 예측 요소, 적어도 하나 이상의 다분쟁 유발자 관점 분쟁 예측 요소 및 적어도 하나 이상의 다분쟁 기술군 관점 분쟁 예측 요소 중에서 선택되는 어느 하나 이상의 분쟁 예측 요소의 분쟁 예측 요소값을 사용하여 생성되는 것인 것이 바람직하다.The licensing prediction model value of each patent may include at least one citation perspective dispute prediction element, at least one multi-dispute patent perspective dispute prediction element, at least one multi-dispute cause perspective dispute prediction element, and at least one multi-conflict technology group perspective dispute prediction. It is preferable that it is generated using the dispute prediction element value of at least one dispute prediction element selected from the elements.

상기 분쟁 예측 모델값은 라이센싱 특허와 비라이센싱 특허에 대하여, 라이센싱 예측 요소값을 설명 변수값으로 하고, 상기 라이센싱 특허에 부여된 라이센싱 특허 부여값과 상기 라이센싱 특허 부여값과 다르게 부여되며 상기 비라이센싱 특허에 부여된 비라이센싱 특허 부여값을 반응 변수값으로 하여 기 설정된 통계학적 처리를 수행하여 생성되는 것인 것이 바람직하다.The dispute prediction model value may be assigned to a licensed prediction element value as an explanatory variable value for a licensed patent and a non-licensed patent, and is different from the licensed patent grant value and the licensed patent grant value granted to the licensed patent. It is preferable that the non-licensing patent grant value assigned to is generated by performing a predetermined statistical process using the response variable value.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기 어느 한 항의 방법을 실시하는 것을 특징으로 하는 특허 라이센싱 예측 정보 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, there is provided a patent information system for processing patent licensing prediction information, characterized in that the method of any one of the above.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기 어느 한 항의 방법을 실시하는 컴퓨터가 읽을 수 있는 프로그램이 기록된 기록 매체를 제시한다.In order to achieve the technical problem to be achieved by the present invention, there is provided a recording medium on which a computer-readable program for implementing the method of any one of the above is recorded.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기 어느 한 항의 방법을 실시하는 컴퓨터가 읽을 수 있는 프로그램을 제시한다.본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 입수하거나 생성하는 자기 특허 집합 생성부; 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 입수하거나 생성하는 타겟 특허 집합 생성부; 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 라이센싱 예측 모델값을 입수하는 분쟁 예측 모델값 입수부; 및 상기 특허별 라이센싱 예측 모델값을 활용하여 적어도 하나 이상의 라이센싱 예측 정보를 생성하는 라이센싱 예측 정보 생성부;를 포함하는 것을 특징으로 하는 특허 라이센싱 예측 정보 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a computer-readable program for implementing the method of any one of the above is proposed. In order to achieve the technical problem to be achieved by the present invention, the present invention includes at least one patent. A self patent set generation unit for obtaining or generating a patent set; A target patent set generation unit for obtaining or generating a target patent set including at least one patent; A dispute prediction model value obtaining unit which obtains at least one license prediction model value for each patent for each patent constituting the target patent set; And a licensing prediction information generation unit for generating at least one license prediction information by using the license prediction model value for each patent.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, (i) (i1) (i11) 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고, (i12) 상기 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 보완 특허 집합을 입수하거나, (i2) (i21) 적어도 하나 이상의 특허를 포함하는 보완 특허 집합을 먼저 입수하고, (i22) 상기 보완 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수하는 단계; (j) 상기 보완 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수하는 단계; 및 (k) 상기 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 리스크 헤징 예측 정보를 생성하는 단계;를 포함하는 것인 것을 특징으로 하는 특허 정보 시스템의 특허 리스크 헤징 예측 정보 처리 방법을 제시한다.In order to achieve the technical problem to be achieved by the present invention, (i) (i1) (i11) first obtain a target patent set including at least one or more patents, (i12) at least one or more of a predetermined relationship with the target set Obtaining a supplementary patent set or (i2) (i21) first obtaining a supplementary patent set including at least one or more patents, and (i22) obtaining at least one target patent set having a predetermined relationship with the supplementary set; (j) obtaining at least one dispute prediction model value for each patent for each patent constituting the complementary patent set; And (k) generating at least one risk hedging prediction information using the dispute prediction model value for each patent. The method provides a method for processing patent risk hedging prediction information of a patent information system.

상기 (i12) 단계에서 상기 보완 특허 집합은 상기 타겟 특허 집합을 구성하는 타겟 특허의 최선일을 기준으로 하여 선출원인 것이거나, 상기 (i22) 단계에서 상기 타겟 특허 집합은 상기 타겟 특허 집합을 구성하는 보완 특허의 최선일을 기준으로 하여 후출원인 것인 것이 바람직하다.In the step (i12), the complementary patent set is a prior application based on the best day of the target patents constituting the target patent set, or in the step (i22), the target patent set is the supplement that constitutes the target patent set. It is preferable that it is a post-application based on the best date of a patent.

상기 타겟 특허 집합은 적어도 하나 이상의 특허를 포함하는 자기 특허 집합과 기 설정된 관련성을 가지는 특허 집합이거나, 상기 자기 특허 집합과 기 설정된 관련성을 가지는 특허 집합의 부분 특허 집합이거나,기 설정된 상기 분쟁 예측 모델값 이상을 가지는 특허들로 구성되는 특허 집합이거나, 기 설정된 상기 분쟁 예측 모델값 이상을 가지는 특허들로 구성되는 특허 집합의 부분 특허 집합이거나, 상기 자기 특허 집합과의 관계에서 기 설정된 분쟁 예측 정보값 이상을 가지는 특허들로 구성되는 특허 집합이거나, 상기 자기 특허 집합과의 관계에서 기 설정된 분쟁 예측 정보값 이상을 가지는 특허들로 구성되는 특허 집합의 부분 특허 집합이거나 상기 자기 특허 집합과의 관계에서 기 설정된 분쟁 예측 정보값 이상을 가지는 특허 집합이거나, 상기 자기 특허 집합과의 관계에서 기 설정된 분쟁 예측 정보값 이상을 가지는 특허 집합의 부분 특허 집합인 것인 것이 바람직하다.The target patent set may be a patent set having a preset association with a self patent set including at least one patent, or a partial patent set of a patent set having a preset association with the self patent set, or more than the preset dispute prediction model value. Patents having a patent set consisting of patents, or a partial patent set of a patent set consisting of patents having more than the predetermined dispute prediction model value, or a patent having a predetermined dispute prediction information value or more in relation to the self-patent set A patent set consisting of patents or partial patent sets of patent sets consisting of patents having a preset dispute prediction information value in relation to the own patent set or a dispute prediction information value set in relation to the own patent set or more Patent set with It is preferable that it is a partial patent set of the patent set having more than a predetermined dispute prediction information value in relation to the own patent set.

상기 타겟 특허와 상기 보완 특허는 기 설정된 관련성을 가지는 특허들로 이루어지는 관련 특허 관계인 것이며, 상기 기 설정된 관련성은 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상인 것이며, 상기 보완 특허 집합을 구성하는 보완 특허들은 2 이상의 타겟 특허들과 기 설정된 관련성을 가질 수 있는 것인 것이며, 상기 관련성은 계량화 될 수 있는 것이며, 상기 계량화는 관련성 빈도 및 관련성 강도 중 어느 하나 이상을 포함하여 구성되는 것인 것이 바람직하다.The target patent and the supplementary patent are related patent relationships consisting of patents having a predetermined relationship, and the predetermined relationship is a citation-cited citation relationship, a similar patent group relationship in a text mining technique, and a similar technology patent relationship in a patent classification. Any one or more of the above, and the supplementary patents constituting the supplementary patent set may have a predetermined association with two or more target patents, the association may be quantified, and the quantification is the association frequency and the intensity of association. It is preferable that it is comprised including any one or more of these.

상기 관련성 빈도는 상기 보완 특허에 대응되는 기 설정된 관련성을 가지는 타겟 특허의 개수인 것이며, 상기 관련성 강도는 인용-피인용 특허 관계 정보, 유사 특허군 관계 정보 및 유사 기술 특허 관계 정보 중 어느 하나 이상을 사용하여 생성되는 것인 것이 바람직하다.The relevance frequency is the number of target patents having a predetermined relevance corresponding to the supplementary patent, and the relevance strength is one or more of citation-cited patent relationship information, similar patent group relationship information, and similar technology patent relationship information. It is preferable that it is produced using.

상기 인용-피인용 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 인용-피인용 깊이 정보 및 인용-피인용 종류 정보 중 어느 하나 이상을 사용하여 생성되는 것을 포함하는 것이거나 상기 유사 특허군 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 타겟 특허와 상기 보완 특허의 유사도 정보를 사용하여 생성되는 것인 것이거나, 상기 유사 기술 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 타겟 특허와 상기 보완 특허에 포함되어 있는 적어도 한 종류 이상의 특허 분류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상에서의 일치 깊이, 일치 빈도 및 일치 서열 중 어느 하나 이상을 활용하여 생성되는 것이며, 상기 유사도 정보는 상기 타겟 특허와 상기 보완 특허에서 추출되는 추출 키워드, 상기 타겟 특허와 상기 보완 특허의 레퍼런스 정보에서 생성되는 인용-피인용 정보, 상기 타겟 특허와 상기 보완 특허에서 추출되는 적어도 한 종류 이상의 특허 분류 및 상기 특허 분류가 속하는 특허 분류 체계 상의 상기 특허 분류의 적어도 한 단계 이상의 상위 특허 분류 중 어느 하나 이상의 활용을 포함하여 생성되는 것인 것이며, 상기 유사 기술 특허 관계 정보로 상기 관련성 강도를 생성하는 것은 상기 타겟 특허와 상기 보완 특허에 포함되어 있는 적어도 한 종류 이상의 특허 분류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상에서의 일치 깊이, 일치 빈도 및 일치 서열 중 어느 하나 이상을 활용하여 생성되는 것인 것이 바람직하다.Generating the relevance strength with the citation-cited patent relationship information may include generating using at least one of citation-cited depth information and citation-cited citation type information or as the similar patent group relationship information. Generating the relevance strength is generated using similarity information between the target patent and the supplementary patent, or generating the relevance strength with the similar technology patent relation information is included in the target patent and the supplementary patent. For at least one or more types of patent classifications, at least one of a matching depth, a matching frequency, and a matching sequence on the patent classification system to which the patent classification belongs, the similarity information is generated using the target patent and the supplement. Extracted keywords extracted from the patent, the target patent and the complementary feature Citation-cited information generated from the reference information, at least one or more types of patents extracted from the target and complementary patents, and at least one or more higher levels of the patent classifications on the patent classification system to which the patent classification belongs It is generated by using any one or more of the above, and generating the relevance strength with the similar technology patent relationship information for the at least one kind of patent classification included in the target patent and the supplementary patent, the patent Preferably, the classification is generated by utilizing any one or more of the matching depth, the matching frequency, and the matching sequence on the patent classification system to which the classification belongs.

상기 생성되는 리스크 헤징 예측 정보는 상기 보완 집합에 대응되는 타겟 특허별 리스크 헤징 예측 정보, 상기 보완 집합에 대응되는 적어도 하나 이상의 타겟 특허군별 리스크 헤징 예측 정보, 상기 보완 집합에 대응되는 타겟 특허 집합에 대한 리스크 헤징 예측 정보, 적어도 하나 이상의 부분 보완 집합에 대응되는 타겟 특허별 리스크 헤징 예측 정보, 적어도 하나 이상의 부분 보완 집합에 대응되는 적어도 하나 이상의 타겟 특허군별 리스크 헤징 예측 정보 중 어느 하나 이상인 것이 바람직하다.The generated risk hedging prediction information includes risk hedging prediction information for each target patent corresponding to the complementary set, at least one risk hedging prediction information for each target patent group corresponding to the supplementary set, and a target patent set corresponding to the supplementary set. The risk hedging prediction information, the risk hedging prediction information for each target patent corresponding to the at least one partial complementary set, and the at least one risk hedging prediction information for the at least one target patent group corresponding to the at least one partial supplement set are preferably one or more.

상기 리스크 헤징 예측 정보는 기 설정된 적어도 하나 이상의 리스크 헤징 예측 정보값, 적어도 하나 이상의 라이센싱 예측 분석 정보, 적어도 하나 이상의 라이센싱 예측 근거 정보 중 어느 하나 이상인 것이 바람직하다.The risk hedging prediction information may be at least one of at least one predetermined risk hedging prediction information value, at least one licensing prediction analysis information, and at least one licensing prediction basis information.

상기 (i12) 단계의 상기 보완 특허 집합의 입수는 기 설정된 조건을 충족하는 특허만을 보완 특허로 입수하는 것인 것이거나, 상기 (i12) 단계의 상기 보완 특허 집합의 입수는 분할된 상기 부분 타겟 특허 집합별로 입수하는 것인 것이거나, 상기 (i12) 단계의 상기 보완 특허 집합의 입수는 2 이상의 상기 타겟 특허 집합별로 입수하는 것인 것이며, 상기 기 설정된 조건은 상기 사용자가 설정하거나, 상기 특허 정보 시스템이 설정하는 것인 것이며, 상기 기 설정된 조건은 권리자 조건, 관련성 조건, 권리자 매출 규모, 권리자 최근 분쟁 피소 정보 및 개별 특허 속성 조건 중 어느 하나 이상인 것인 것이며, 상기 보완 특허 집합의 입수가 분할된 상기 부분 타겟 특허 집합별로 입수하는 것인 경우, 상기 (d) 단계의 상기 리스크 헤징 예측 정보를 생성하는 것은 상기 부분 타겟 특허 집합에 대응되는 부분 보완 특허 집합별로 생성되는 것인 것이며, 상기 보완 특허 집합의 입수가 2 이상의 상기 타겟 특허 집합별로 입수하는 것인 경우, 상기 (i) 단계의 상기 타겟 특허 집합의 입수는 적어도 2 이상의 타겟 특허 집합의 입수인 것이며, 상기 (d) 단계의 상기 리스크 헤징 예측 정보를 생성하는 것은 2 이상의 상기 타겟 특허 집합에 대응되는 보완 특허 집합별로 생성되는 것인 것이 바람직하다.Acquiring the complementary patent set in the step (i12) is to obtain only a patent that satisfies a preset condition, or acquiring the complementary patent set in the (i12) step is divided partial patents Acquired by the set or the acquisition of the complementary patent set in the step (i12) is obtained by the target patent set of two or more, wherein the predetermined condition is set by the user or the patent information system The predetermined condition is any one or more of rights holder conditions, relevance conditions, owner sales size, rights holder dispute dispute information and individual patent attribute conditions, the acquisition of the complementary patent set is divided When obtaining by the partial target patent set, generating the risk hedging prediction information of step (d) Generated by the partial complementary patent set corresponding to the partial target patent set, and the acquisition of the supplementary patent set is obtained by each of the target patent sets of two or more of the target patent set of step (i). The acquisition may be acquisition of at least two or more target patent sets, and the generating of the risk hedging prediction information in step (d) may be generated for each complementary patent set corresponding to the two or more target patent sets.

상기 보완 특허 집합의 입수에서 상기 기 설정된 조건은 시스템 또는 사용자가 선택하거나 설정하는 상기 보완 특허가 충족해야 하는 적어도 하나 이상의 권리자 속성 조건을 충족하는 것인 것이며, 상기 권리자 속성 정보는 권리자 규모 정보, 권리자 기관 유형 정보 및 상기 사용자가 선택하거나 지정하는 적어도 하나 이상의 특수 관계 권리자 군에 포함되는지 여부 중 어느 하나 이상인 것을In the acquisition of the supplementary patent set, the predetermined condition is that the system or user satisfies at least one attribute attribute of the supplementary patent selected or set by the supplementary patent, and the owner attribute information includes rights holder size information and rights holder. One or more of agency type information and whether the user is included in at least one group of related parties selected or designated by the user.

상기 (i22) 단계의 상기 타겟 특허 집합의 입수는 기 설정된 조건을 충족하는 특허만을 타겟 특허로 입수하는 것인 것이거나, 상기 (i22) 단계의 상기 타겟 특허 집합의 입수는 분할된 상기 부분 보완 특허 집합별로 입수하는 것인 것이거나, 상기 (i22) 단계의 상기 타겟 특허 집합의 입수는 2 이상의 상기 보완 특허 집합별로 입수하는 것인 것이며, 상기 기 설정된 조건은 상기 사용자가 설정하거나, 상기 특허 정보 시스템이 설정하는 것인 것이며, 상기 기 설정된 조건은 권리자 조건, 관련성 조건, 권리자 매출 규모, 권리자 최근 분쟁 피소 정보 및 개별 특허 속성 조건 중 어느 하나 이상인 것인 것이며, 상기 타겟 특허 집합의 입수가 분할된 상기 부분 보완 특허 집합별로 입수하는 것인 경우, 상기 (d) 단계의 상기 리스크 헤징 예측 정보를 생성하는 것은 상기 부분 보완 특허 집합에 대응되는 부분 타겟 특허 집합별로 생성되는 것인 것이며, 상기 타겟 특허 집합의 입수가 2 이상의 상기 보완 특허 집합별로 입수하는 것인 경우, 상기 (i) 단계의 상기 보완 특허 집합의 입수는 적어도 2 이상의 보완 특허 집합의 입수인 것이며, 상기 (d) 단계의 상기 리스크 헤징 예측 정보를 생성하는 것은 2 이상의 상기 보완 특허 집합에 대응되는 타겟 특허 집합별로 생성되는 것인 것이 바람직하다.Acquiring the target patent set in the step (i22) is to obtain only a patent that satisfies a predetermined condition as a target patent, or acquisition of the target patent set in the (i22) step is divided partial complementary patent Acquisition of the target patent set in the step (i22) is obtained for each of two or more complementary patent sets, and the preset condition is set by the user or the patent information system. The predetermined condition is any one or more of rights holder conditions, relevance conditions, owner sales volume, rights holder recent dispute suit information and individual patent attribute conditions, the acquisition of the target patent set is divided When obtaining by the partial complementary patent set, generating the risk hedging prediction information of step (d) When the target patent set corresponding to the partial target patent set corresponding to the partial complementary patent set is obtained, and when the acquisition of the target patent set is obtained for each of the complementary patent set of two or more, the complementary patent set of the step (i) The acquisition may be acquisition of at least two complementary patent sets, and the generating of the risk hedging prediction information in step (d) may be generated for each target patent set corresponding to two or more complementary patent sets.

상기 타겟 특허 집합을 구성하는 타겟 특허 및 적어도 하나 이상의 분할 기준 또는 적어도 하나 이상의 선별 기준이 적용된 적어도 하나 이상의 부분 타겟 특허 집합 중 어느 하나 이상은 기 설정된 랭킹을 가지는 것이며, 상기 랭킹은 상기 관련성 빈도, 상기 관련성 강도 및 상기 분쟁 예측 모델값 중 어느 하나 이상을 사용하는 랭킹 생성 규칙에 따라서 생성되는 것인 것이 바람직하다.At least one of a target patent constituting the target patent set and at least one or more partial target patent sets to which at least one or more division criteria or at least one or more selection criteria are applied has a predetermined ranking, wherein the ranking is the relevance frequency, the It is preferably generated according to the ranking generation rule using any one or more of the relevance strength and the dispute prediction model value.

상기 (k) 단계의 상기 리스크 헤징 예측 정보를 생성하는 것은 적어도 2 이상의 분할되거나 선별되는 보완 특허 집합별로 생성되는 것인 것이거나, 상기 (k) 단계의 상기 리스크 헤징 예측 정보의 생성은 상기 사용자로부터 입수 받은 사용자 가중치 정보를 반영하여 생성되는 것인 것이며, 상기 보완 특허 집합의 선별이나 분할은 사용자에 의해서 선별되거나 분할되는 방식 또는 상시 시스템의 기 설정된 선별 기준이나 분할 기준에 의해 분할되는 방식 중 어느 하나 이상이 적용되는 것인 것이며, 상기 기 설정된 선별 기준이나 분할 기준은 보완 특허 집합을 구성하는 보완 특허의 권리자명, 권리자 속성, 특허 기술 분류, 기 설정된 적어도 하나 이상의 평가값 중 어느 하나 이상인 것인 것이며, 상기 사용자 가중치 정보는 상기 보완 특허 집합을 구성하는 보완 특허 중 어느 하나 이상에 대하여 개별 보완 특허별로 상기 사용자가 설정하는 보완 특허별 가중치 정보, 타겟 특허의 개별 속성에 설정하는 속성별 가중치 정보, 인용-피인용 관계에 대해 설정하는 인용-피인용 가중치 정보, 텍스트 마이닝 관계에 대해 설정하는 텍스트 마이닝 가중치 정보, 특허 분류상 유사 기술 특허 관계에 대해 설정하는 유사 기술 특허 관계 가중치 정보 중 어느 하나 이상인 것이 바람직하다.The generating of the risk hedging prediction information in the step (k) may be generated by at least two divided or selected complementary patent sets, or the generation of the risk hedging prediction information in the step (k) may be performed by the user. It is generated by reflecting the received user weight information, and the selection or segmentation of the supplementary patent set is either selected or divided by the user, or divided by the predetermined selection criteria or division criteria of the system at all times. The above is applied to, wherein the predetermined selection criteria or division criteria is any one or more of the name of the holder of the complementary patent, the attribute of the owner, patent technology classification, at least one predetermined evaluation value of the complementary patent set. The user weight information constitutes the supplementary patent set. Weight information for each complementary patent set by the user for each one or more of the supplementary patents, weight information for each attribute set for individual attributes of the target patent, and citation-cited weights set for the citation-cited relationship At least one of information, text mining weight information set for the text mining relationship, and similar technology patent relationship weight information set for the similar technology patent relationship in patent classification is preferable.

상기 (j) 단계의 특허별 분쟁 예측 모델값의 입수는 입수된 1차 보완 특허 집합에 대하여 기 설정된 집합 연산을 수행한 후에 생성되는 2차 보완 특허 집합을 구성하는 2차 보완 특허를 대상으로 실시되는 것이며, 상기 2차 보완 특허 집합은 상기 1차 보완 특허 집합을 구성하는 1차 보완 특허에 대한 삭제, 상기 1차 보완 특허 집합에 대한 한정 및 2차 보완 특허 집합으로의 신규 특허 추가 중 어느 하나 이상이 실시되어 생성되는 것인 것이 바람직하다.Obtaining the dispute prediction model value for each patent in the step (j) is performed on the secondary supplementary patents constituting the secondary supplementary patent set generated after performing a predetermined set operation on the obtained primary supplementary patent set. The secondary supplement patent set is any one of deletion of the primary supplement patent constituting the primary supplement patent set, limitation on the primary supplement patent set, and addition of a new patent to the secondary supplement patent set. It is preferable that the above is performed and produced.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기 중 어느 한 항의 방법을 실시하는 것을 특징으로 하는 특허 리스크 헤징 예측 정보를 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, there is provided a patent information system for processing patent risk hedging prediction information, characterized in that the method of any one of the above.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 상기 어느 한 항의 방법을 실시하는 컴퓨터가 읽을 수 있는 프로그램을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a computer-readable program for implementing the method of any one of the above is proposed.

본 발명이 이루고자 하는 기술적 과제를 달성하기 위하여, 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 입수하거나 생성하는 타겟 특허 집합 생성부; 적어도 하나 이상의 특허를 포함하는 보완 특허 집합을 입수하거나 생성하는 보완 특허 집합 생성부; 상기 보완 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수하는 분쟁 예측 모델값 입수부; 및 상기 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 리스크 헤징 예측 정보를 생성하는 리스크 헤징 예측 정보 생성부;를 포함하는 것을 특징으로 하는 특허 리스크 헤징 예측 정보를 처리하는 특허 정보 시스템을 제시한다.In order to achieve the technical problem to be achieved by the present invention, a target patent set generation unit for obtaining or generating a target patent set including at least one or more patents; A supplementary patent set generating unit for obtaining or generating a supplementary patent set including at least one patent; A dispute prediction model value obtaining unit which obtains at least one dispute prediction model value for each patent for the individual patents constituting the supplementary patent set; And a risk hedging prediction information generation unit for generating at least one risk hedging prediction information using the dispute prediction model value for each patent.

상기 타겟 특허 집합을 구성하는 타겟 특허와 상기 보완 특허 집합을 구성하는 보완 특허 간에서 기 설정된 관련성을 계산하는 다중 관계성 처리 모듈;를 더 포함하며 상기 다중 관계성 처리 모듈이 계산하는 상기 기 설정된 관련성은 인용-피인용 특허 관계, 텍스트 마이닝 기법 상의 유사 특허군 관계, 특허 분류 상 유사 기술 특허 관계 중 어느 하나 이상인 것이 바람직하다.
And a multiple relationship processing module that calculates a predetermined relationship between a target patent constituting the target patent set and a complementary patent constituting the supplementary patent set. Is preferably at least one of a citation-cited patent relationship, a similar patent group relationship on a text mining technique, and a similar technology patent relationship on a patent classification.

본 발명을 실시하면 다음과 같은 효과가 있다.The present invention has the following effects.

첫째, 시스템적으로 신뢰성 있고 타당성 높은 특허 분쟁 예측 모델을 생성할 수 있다.First, it is possible to create a system-reliable and reasonable patent dispute prediction model.

둘째, 시스템적으로 신뢰성 있고 타당성 높은 특허 분쟁 예측 정보를 생성할 수 있다.Secondly, it is possible to generate reliable and valid patent dispute prediction information systematically.

셋째, 시스템적으로 신뢰성 있고 타당성 높은 특허 라이센싱 예측 정보를 생성할 수 있다.Third, it is possible to generate patent licensing prediction information that is reliable and feasible systemically.

넷째, 시스템적으로 신뢰성 있고 타당성 높은 특허 리스크 헤징 정보를 생성할 수 있다.
Fourth, it is possible to generate systemic reliable and reasonable patent risk hedging information.

도 1은 본 발명의 특허 정보 시스템의 사용 환경에 관한 일 실시예적 도면이다.
도 2는 본 발명의 특허 정보 시스템의 일 실시예적 구성에 관한 도면이다.
도 3은 본 발명의 특허 정보 시스템의 데이터부의 일 실시예적 구성에 관한 도면이다.
도 4는 본 발명의 데이터 가공부의 일 실시예적 구성에 관한 도면이다.
도 5는 본 발명의 특허 정보 서비스 지원부의 일 실시예적 구성에 관한 도면이다.
도 6은 본 발명의 특허 분석 정보 생성부의 일 실시예적 구성에 관한 도면이다.
도 7은 본 발명의 옵션 선택부의 일 실시예적 구성에 관한 도면이다.
도 8은 본 발명의 핵심 키워드 DB 의 일 실시예적 구성에 관한 도면이다.
도 9는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 일 실시예적 구성에 관한 도면이다.
도 10은 본 발명의 분쟁 예측 요소별 분쟁 예측 요소값 생성 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 11은 본 발명의 인용 특허 집합의 속성을 설명하기 위한 개념도이다.
도 12은 본 발명의 인용 특허 집합의 속성 중 전방 인용 특허 집합을 설명하기 위한 개념도이다.
도 13은 본 발명의 인용 특허 집합의 속성 중 후방 인용 특허 집합을 설명하기 위한 개념도이다.
도 14는 본 발명의 인용 특허 집합의 속성 중 전방 자기 인용 특허 집합과 후방 자기 인용 특허 집합을 설명하기 위한 개념도이다.
도 15는 본 발명의 인용 특허 집합의 속성 중 인용 발생 입수 특허 집합을 설명하기 위한 개념도이다.
도 16은 입수 특허 집합에 대한 인용 특허 집합으로서의 전방 인용 특허 집합과 후방 인용 특허 집합을 설명하는 개념도이다.
도 17은 입수 특허 집합에 대한 인용 특허 집합으로서의 전방 자기 인용 특허 집합과 후방 자기 인용 특허 집합을 설명하는 개념도이다.
도 18은 입수 특허 집합에 대한 인용 특허 집합으로서의 인용 발생 입수 특허 집합을 설명하는 개념도이다.
도 19는 조건을 가하여 한정된 입수 특허 집합에 대한 인용 특허 집합으로서의 전방 인용 특허 집합과 후방 인용 특허 집합을 설명하는 개념도이다. 전방 인용 특허 집합과 후방 인용 특허 집합도 입수 특허 집합에 한정을 가함에 따라 한정을 가하지 않은 입수 특허 집합에 대한 전방 인용 특허 집합과 후방 인용 특허 집합의 부분 집합이 됨을 보여 준다. 입수 특허 집합에 조건을 가하게 되면, 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합 모두가 한정을 가하지 않은 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합의 부분 집합이 된다.
도 20은 입수 특허 집합에 대한 인용 특허 집합으로서의 전방 인용 특허 집합과 후방 인용 특허 집합에 조건을 가하여 한정된 전방 인용 특허 집합과 한정된 후방 인용 특허 집합을 설명하는 개념도이다. 한정된 전방 인용 특허 집합이나 한정된 후방 인용 특허 집합과 마찬가지로 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합도 한정을 가하는 경우, 한정을 가하지 않은 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합의 부분 집합이 된다.
도 21은 본 발명의 제1종 잠재 인용 특허를 설명하기 위한 일 실시예적 개념도이다.
도 22는 본 발명의 제2종 잠재 인용 특허를 설명하기 위한 일 실시예적 개념도이다.
도 23은 본 발명의 제1종 사슬 인용 특허를 설명하기 위한 일 실시예적 개념도이다.
도 24은 본 발명의 제2종 사슬 인용 특허를 설명하기 위한 일 실시예적 개념도이다.
도 25는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 인용 관점의 분쟁 예측 요소값 생성 방법에 대한 일 실시예적 구성에 관한 도면이다.
도 26은 본 발명의 특허 분쟁 예측 정보 생성 시스템의 분쟁 예측 모델을 생성 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 27은 Gradient Boosting algorithm 알고리즘에 대한 요약을 설명하는 도면이다.
도 28은 Friedman(2002)이 새로이 stochastic gradient boosting algorithm을 제안한 알고리즘에 대한 요약을 설명하는 도면이다.
도 29는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 분쟁 예측 모델 생성 과정에 관한 일 실시예적 개념을 설명하는 도면이다.
도 30은 과적합 개념을 설명하기 위한 보조 도면이다.
도 31은 본 발명의 특허 분쟁 예측 정보 생성 시스템의 분쟁 예측 모델 생성 엔진이 부스팅 알고리즘을 통하여 분쟁 예측 모델을 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 32는 부스팅 알고리즘이 생성하는 스텀프(stump)의 예시에 관한 도면이다.
도 33은 5-fold cross validation 기법을 설명하기 위한 개념도이다.
도 34는 범주형 반응 변수에 대한 스텀프의 예시에 관한 도면이다.
도 35는 연속형 반응 변수에 적용되는 회귀 모델에 대한 스텀프의 예시에 관한 도면이다.
도 36은 본 발명의 특허 분쟁 예측 정보 생성 시스템이 분쟁 예측 모델값 생성하고 저장하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 37은 2 이상의 자기 특허가 2 이상의 타겟 특허와 각각 관계성을 가질 때, 각 관계마다 관계성 정보가 대응될 수 있음을 보여 주는 개념도이다.
도 38은 하나의 자기 특허가, 적어도 하나 이상의 타겟 특허와 기 설정된 관계성을 가질 때, 각 관계성에는 기 설정된 가중치가 부여될 수 있음을 보여 주는 개념도이다.
도 39는 본 발명의 특허 분쟁 예측 정보 생성 시스템이 분쟁 예측 정보를 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 40은 본 발명의 특허 분쟁 예측 정보 생성 시스템이 타겟 특허별 가중치 정보를 생성하고, 가중치 정보를 고려한 분쟁 예측 정보를 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 41은 본 발명의 특허 분쟁 예측 정보 생성 시스템이 생성하는 개별적인 분쟁 예측 정보의 예시적 종류 및 일 실시예적 분쟁 예측 정보 생성 순서를 보여 주는 도면이다.
도 42는 본 발명의 특허 분쟁 예측 정보 생성 시스템이 분할된 자기 특허 집합에 대응되는 타겟 특허 집합을 기준으로 한 분쟁 예측 정보 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 43은 본 발명의 특허 분쟁 예측 정보 생성 시스템이 분할된 타겟 특허 집합을 기준으로 한 분쟁 예측 정보 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 44는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 분쟁 예측 정보 분석 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 45는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 등급 부여 모델에 따른 등급 정보 생성 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 46은 본 발명의 특허 분쟁 예측 정보 생성 시스템의 신규 분쟁 파급 예측 정보 생성 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 47은 본 발명의 특허 분쟁 예측 정보 생성 시스템의 분쟁 정보 알람(alarm) 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 48은 본 발명의 특허 분쟁 예측 정보 생성 시스템의 리스크 헤징 예측 정보를 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 49는 본 발명의 특허 분쟁 예측 정보 생성 시스템의 크로스 라이센싱 예측 정보를 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.
도 50은 본 발명의 특허 라이센싱 예측 정보 생성 시스템의 일 실시예적 구성에 관한 도면이다.
도 51은 본 발명의 특허 라이센싱 예측 정보 생성 시스템이 라이센싱 예측 정보를 생성하는 방법에 관한 일 실시예적 구성에 관한 도면이다.1 is an exemplary diagram of an environment for using a patent information system of the present invention.
FIG. 2 is a diagram showing a configuration of an embodiment of the patent information system of the present invention.
3 is a view of an exemplary configuration of a data unit of the patent information system of the present invention.
4 is a diagram of an exemplary configuration of a data processing unit of the present invention.
5 is a diagram illustrating an exemplary configuration of a patent information service support unit of the present invention.
6 is a diagram illustrating an exemplary configuration of a patent analysis information generation unit of the present invention.
7 is a diagram illustrating an exemplary configuration of an option selector of the present invention.
8 is a diagram illustrating an exemplary configuration of a key keyword DB of the present invention.
9 is a diagram illustrating an exemplary configuration of a system for generating patent dispute prediction information of the present invention.
10 is a diagram illustrating an exemplary configuration of a method for generating a dispute prediction element value for each dispute prediction element according to the present invention.
11 is a conceptual diagram for explaining an attribute of a cited patent set of the present invention.
12 is a conceptual diagram illustrating a forward cited patent set among the attributes of a cited patent set of the present invention.
13 is a conceptual diagram for explaining a back cited patent set among the attributes of a cited patent set of the present invention.
14 is a conceptual diagram illustrating a forward self citation patent set and a rear self citation patent set among attributes of a cited patent set according to the present invention.
15 is a conceptual diagram illustrating a cited generation obtained patent set among attributes of a cited patent set according to the present invention.
16 is a conceptual diagram illustrating a forward cited patent set and a back cited patent set as cited patent sets for the acquired patent set.
17 is a conceptual diagram illustrating a forward self citation patent set and a rear self citation patent set as cited patent sets for an acquired patent set.
18 is a conceptual diagram illustrating a citation generation obtained patent set as a cited patent set for an acquired patent set.
19 is a conceptual diagram illustrating a forward cited patent set and a back cited patent set as cited patent sets for a limited set of acquired patents under conditions. The forward cited patent set and the back cited patent set are also shown to be a subset of the forward cited patent set and the back cited patent set for the unrestricted acquired patent set as the limitation is placed on the acquired patent set. When conditions are placed on an acquired patent set, the forward cited patent set, the back cited patent set, the forward self cited patent set, the back self cited patent set, and the cited generation obtained patent set are all not limited to the forward cited patent set and the back cited set. A subset of the patent set, the forward self-citation patent set, the back self-citation patent set, and the citation generation obtained patent set.
FIG. 20 is a conceptual diagram illustrating a forward citation patent set and a limited backward citation patent set defined by applying conditions to the forward citation patent set and the backward citation patent set as the citation patent set to the obtained patent set. Like the limited forward citation patent set or the limited backward citation patent set, the forward self citation patent set, the back self citation patent set, and the citation generation obtained patent set are also limited, if the limitation is the forward self citation patent set, the rear self citation patent set. And a subset of the citation generation obtained patent set.
FIG. 21 is a conceptual diagram for explaining a first-type latent cited patent of the present invention. FIG.
FIG. 22 is a conceptual diagram for explaining a second kind of latent cited patent of the present invention. FIG.
Figure 23 is an exemplary conceptual diagram for explaining the first type chain citation patent of the present invention.
24 is a conceptual diagram illustrating an example of a chain citation patent of the present invention.
FIG. 25 is a diagram illustrating an embodiment of a method for generating a dispute prediction element value from a citation point of view of a patent dispute prediction information generation system according to the present invention. FIG.
FIG. 26 is a diagram illustrating an exemplary configuration of a method of generating a dispute prediction model of a system for generating patent dispute prediction information according to the present invention. FIG.
27 is a diagram illustrating a summary of the Gradient Boosting algorithm algorithm.
FIG. 28 is a diagram illustrating a summary of an algorithm proposed by Friedman (2002) to newly propose a stochastic gradient boosting algorithm.
29 is a diagram for explaining an exemplary concept of a dispute prediction model generation process of the system for generating patent dispute prediction information according to the present invention.
30 is an auxiliary diagram for explaining an overfit concept.
31 is a diagram illustrating an exemplary configuration of a method of generating a dispute prediction model through a boosting algorithm by a dispute prediction model generation engine of the patent dispute prediction information generating system according to the present invention.
32 is a diagram of an example of the stump generated by the boosting algorithm.
33 is a conceptual diagram illustrating a 5-fold cross validation technique.
FIG. 34 is a diagram of examples of stumps for categorical response variables. FIG.
35 is a diagram of an example of a stump for a regression model applied to a continuous response variable.
36 is a diagram illustrating an exemplary configuration of a method for generating and storing a dispute prediction model value by a system for generating patent dispute prediction information according to the present invention.
37 is a conceptual diagram illustrating that when two or more self patents have a relationship with two or more target patents, relationship information may correspond to each relationship.
FIG. 38 is a conceptual diagram illustrating that when one self-patent has a predetermined relationship with at least one target patent, each relationship may be assigned a predetermined weight.
FIG. 39 is a diagram illustrating an exemplary configuration of a method for generating dispute prediction information by the system for generating patent dispute prediction information according to the present invention. FIG.
40 is a diagram illustrating an exemplary configuration of a method in which a patent dispute prediction information generation system of the present invention generates weight information for each target patent and generates dispute prediction information in consideration of weight information.
FIG. 41 is a diagram illustrating an exemplary type of individual dispute prediction information generated by the patent dispute prediction information generating system of the present invention, and an exemplary sequence of generating dispute prediction information. FIG.
FIG. 42 is a diagram illustrating an embodiment of a method for generating dispute prediction information based on a target patent set corresponding to a divided self patent set by the system for generating patent dispute prediction information according to the present invention.
43 is a diagram illustrating an exemplary configuration of a method for generating dispute prediction information based on a divided target patent set by the system for generating patent dispute prediction information according to the present invention.
44 is a diagram illustrating an embodiment of a method for analyzing dispute prediction information in a system for generating patent dispute prediction information according to the present invention.
45 is a diagram illustrating an embodiment of a method of generating rating information according to a rating model of the patent dispute prediction information generating system of the present invention.
46 is a diagram illustrating an exemplary configuration of a method for generating new dispute propagation prediction information in a system for generating patent dispute prediction information according to the present invention.
FIG. 47 is a diagram illustrating an exemplary configuration of a dispute information alarm method of a patent dispute prediction information generating system according to the present invention. FIG.
48 is a diagram illustrating an exemplary configuration of a method of generating risk hedging prediction information of the patent dispute prediction information generating system of the present invention.
FIG. 49 is a diagram illustrating an embodiment of a method for generating cross-licensing prediction information of a patent dispute prediction information generating system of the present invention. FIG.
50 is a diagram illustrating an exemplary configuration of a system for generating patent licensing prediction information of the present invention.
FIG. 51 is a diagram illustrating an exemplary configuration of a method for generating licensing prediction information by a system for generating patent licensing prediction information according to the present invention. FIG.

이하, 도면을 참조하면서, 본 발명 사상을 상세하게 설명한다.EMBODIMENT OF THE INVENTION Hereinafter, the thought of this invention is demonstrated in detail, referring drawings.

본 발명의 특허 정보 시스템(10000)은 도 1에서 예시되어 있는 바와 같이, 유무선 네트워크(200)를 통하여 사용자 컴퓨터(100)에게 정보 서비스를 제공한다. 상기 특허 정보 시스템(10000)은 적어도 하나 이상의 연계 시스템(300)과 유무선 네트워크(200)로 연결되어 있을 수 있는데, 상기 연계 시스템(300)의 예는 특허 raw data를 제공하는 각국 특허청 또는 raw data 공급자의 시스템, 기업 정보 제공자의 시스템 등이 그 예가 될 수 있으며, 본 발명의 특허 정보 시스템(10000)의 서비스를 연계하여 제공하는 적어도 하나 이상의 시스템이 그 예가 될 수 있으며, 본 설명에 한정하지 않는다.As illustrated in FIG. 1, the patent information system 10000 of the present invention provides an information service to a user computer 100 through a wired / wireless network 200. The patent information system 10000 may be connected to at least one link system 300 and a wired / wireless network 200. An example of the link system 300 is a patent office or a raw data provider of each country that provides patent raw data. An example may be a system, a system of an enterprise information provider, or the like, and at least one or more systems that provide services by linking the services of the patent information system 10000 of the present invention may be examples.

상기 특허 정보 시스템(10000)은 도 2에서 예시되어 있는 바와 같이, 크게 데이터부(1000) 및 데이터 가공부(2000), 검색 처리부(3100)와 회원용 관리부(3200)를 포함하는 특허 정보 서비스 지원부(3000), 특허 분석 정보 생성부(4000), 특허 분쟁 예측 시스템(5000), 특허 라이센싱 예측 시스템(6000) 또는 목적 특화 시스템부(20000) 등을 포함하고 있다. As illustrated in FIG. 2, the patent information system 10000 includes a patent information service support unit including a data unit 1000, a data processing unit 2000, a search processing unit 3100, and a member management unit 3200. And a patent analysis information generation unit 4000, a patent dispute prediction system 5000, a patent licensing prediction system 6000, or a purpose-specific system unit 20000.

상기 데이터부(1000)는 특허 데이터부(1100), 비특허 데이터부(1200), 핵심 키워드 DB(1300), 분류 메타데이터 DB(1400), 사용자 데이터부(1700), 기타 DB(1800), 목적 특화 데이터부(1900) 등 중 어느 하나 이상을 포함한다. 상기 특허 데이터부(1100)는 특허 명세서 파일부(1110)와 특허 DB(1120), 특허 분류 DB(1130), 기타 특허 데이터 DB(1140), 기타 분류 DB부 등을 포함하고 있다. The data unit 1000 may include a patent data unit 1100, a non-patent data unit 1200, a key keyword DB 1300, a classification metadata DB 1400, a user data unit 1700, another DB 1800, One or more of the object-specific data unit 1900 and the like. The patent data unit 1100 includes a patent specification file unit 1110, a patent DB 1120, a patent classification DB 1130, other patent data DB 1140, and other classification DB unit.

상기 특허 DB(1120)는 모든 특허에 대한 서지 사항과 명세서 본문, 도면 등을 필드별로 관리하며, 상기 명세서 본문을 구성하는 각종 필드(제목, 요약, 선행 기술, 특허 청구 범위, 발명의 상세한 설명 등)에서 추출한 핵심 키워드를 포함하고 있다. 한편, 상기 특허들은 상기 특허들에 대한 선행 기술 문건으로 인용 정보를 더 포함하고 있을 수 있다. 미국 특허 data를 예로 들면, 상기 인용 정보는 Reference에 포함된 정보들로, 미국 특허 문건 번호, 외국 특허 문건 번호, 비특허 문건에 대한 표시자 등이 포함되어 있다. 한편, 특허청 심사관이나 관련자에 의한 선행 기술 조사 정보, 심사관 의견 제출 시 참증 정보 등도 광의의 인용 정보가 된다. 특정 문건에 전방 인용 정보가 있는 경우, 상기 전방 인용 정보에 포함된 문건의 입장에서 보면 상기 특정 문건은 후방 인용 문건이 된다. 특정 문건을 기준으로 전방 인용 정보에 포함된 문건은 부모(parent) 문건이 되며, 상기 부모 문건을 기준으로 보면 상기 특정 문건은 자식(child) 문건이 된다. 이러한 child-parent 관계에 있는 정보를 DB로 처리하는 것은 당업자에게는 자명한 것으로 상세한 설명은 생략한다.The patent DB 1120 manages bibliographic matters of all patents, the specification body, and drawings for each field, and various fields (title, summary, prior art, claims, detailed description, etc.) constituting the specification body. ), Including key keywords extracted from Meanwhile, the patents may further include citation information as prior art documents for the patents. Taking the US patent data as an example, the cited information includes information included in the reference, including US patent document numbers, foreign patent document numbers, and indicators for non-patent documents. On the other hand, prior art research information by examiners or related persons of the Patent Office, and information on the examination when submitting the examiner's opinion are broad citation information. If there is forward citation information in a specific document, the specific document is a backward citation document from the viewpoint of the document included in the forward citation information. The document included in the forward citation information based on the specific document becomes the parent document and the specific document becomes the child document based on the parent document. It is obvious to those skilled in the art to process the information in the child-parent relationship with the DB, and a detailed description thereof will be omitted.

특허 문건의 서지 사항에는 발행 국가 정보, 각종 일자 정보, 각종 번호 정보, 적어도 하나 이상의 권리자 정보, 적어도 하나 이상의 발명자 정보, 적어도 하나 이상의 특허 분류 정보, 적어도 하나 이상의 우선권 정보 등이 포함되어 있다. 일자에는 출원일, 공개일, 등록일 및 기타 일자들이 있다. 각종 번호 정보에는 출원번호, 공개번호, 등록번호, 원출원 번호, 우선권 주장 번호 등이 있다. 권리자 정보는 출원인, 양수인, 특허권자 등이 있으며, 권리자의 변동이 있고, 변동이 관리되는 경우, 양도인과 양수인에 대한 정보 및 최종 권리자 정보가 있을 수 있다. 우선권 정보에는 우선권 주장 번호, 주장일자, 국가 등의 정보가 포함되어 있다. 한편, 분할출원이나 일부 계속 출원, 계속 출원 등이 있는 경우, 원출원 번호, 원출원일자 등의 정보가 부가되어 있다. 또한, 대표도, 제목, 요약, 색인어 등도 서지 사항에 포함되기도 한다. 한편, 가공된 서지 사항으로는 국내 패밀리 정보(분할출원, 변경출원이나 일부계속출원, 계속출원 관계에 있는 특허 출원)이나, 해외 패밀리 정보(조약 우선권 관계로 연관될 수 있는 출원, 국제출원 등)이 있을 수 있다. 한편, 자연어 처리 등을 통해서 특허 명세서 본문의 텍스트를 본문을 구성하는 각 필드별 또는 필드 통합별로 기 설정된 키워드 추출 방식으로 추출한 핵심 키워드 정보가 더 있을 수 있다. The bibliography of the patent document includes the issuing country information, various date information, various number information, at least one owner information, at least one inventor information, at least one patent classification information, at least one priority information, and the like. The date includes the filing date, the date of publication, the date of registration, and other dates. The various number information includes an application number, a public number, a registration number, an original number, and a priority claim number. The right holder information includes the applicant, the assignee, the patent owner, etc. If there is a change in the right holder and the change is managed, there may be information about the transferor and the assignee and information about the right holder. Priority information includes information such as priority claim number, date of claim, and country. On the other hand, when there is a split application, partial continuation application, continuation application, etc., information such as an original number, date of the original source is added. In addition, bibliographies may include representations, titles, abstracts, and index words. Meanwhile, the bibliographies processed include domestic family information (such as a split application, a change of application, or a part of a continuation application, a patent application that is related to a continuation application), overseas family information (an application that may be related to a treaty priority relationship, This can be. On the other hand, the text of the patent specification body may be further extracted by a keyword extracting method for each field or combination of fields constituting the main text through natural language processing or the like.

특허 분류 정보는 공통적인 IPC 이외에도 USPC, FT, FI, ECLA 등과 같이 각 국가별로 자국만의 특별한 특허 분류가 있을 수 있다. 이와 같이 특허 분류는 계층 구조를 가진다. 본 발명의 특허 분류는 특허 분류는 IPC, USPC, FT, FI, ECLA를 포함한다. 인덱스(index)란, 캐치워드(catchword)라고 불리기도 하는 것으로, 단어, 구 또는 절에 적어도 하나 이상의 특허 분류를 대응해 놓은 체계를 말한다. 상기 인덱스 중 대표적인 것으로 IPC를 처리해 놓은 catchword가 있으며, 미국 특허청이 발행하는 index to USPC가 있다. 상기 인덱스도 특허 분류처럼 계층 구조를 가지는 경우가 많다. 상기 인덱스에는 제품명/부품명/요소기술 등에 해당하는 키워드가 내재되어 있는 경우가 많다. 상기 인덱스는 특허 분류를 용이하게 찾아 보기 쉽게 해 놓은 것으로 볼 수 있다. WIPO가 제공하는 IPC에 대한 캐치워드에 대한 정보는 2011년 3월 현재 http://www.wipo.int/ipc/itos4ipc/ITSupport_and_download_area/20100101/MasterFiles/에 있는 ipcr_catchwordindex_20100101.zip 파일로 입수 가능하다. 상기 파일에는 ABACUSES에 대하여 G06C 1/00이 대응되어 있음을 볼 수 있다. 이때, ABACUSES를 대하여 G06C 1/00에 대응되는 인덱스라 명명한다. 이에 따라, G06C 1/00은 Abacuses로 역맵핑 할 수 있다. 한편, 상기 파일의 내용 중 인덱스 ABARADING에서 알 수 있듯이, 상기 캐치워드 체계는 적어도 1단계 이상의 계층 구조를 가지고 있음을 알 수 있다. ABARADING은 2단계 계층임을 보여 준다. USPTO가 USPC에 대한 인덱스로 이들에 대한 정보는 2010년 3월 현재 http://www.uspto.gov/web/patents/classification/uspcindex/indextouspc.htm에서 확인할 수 있다. 상리 링크의 내용 중에서 인덱스 Abrading은 3단계의 계층으로 분류되어 있음을 알 수 있다. 인덱스 Abrading은 USPC 451/38에 대응되고 있음을 알 수 있으며, Abrading의 하위 인덱스에 Class 451 이외의 다른 Class의 특허 분류도 존재함을 알 수 있다. 상기 특허 분류와 관련된 일체의 사항은 특허 분류 DB(1130)에 저장된다.In addition to the common IPC, the patent classification information may have a unique patent classification for each country such as USPC, FT, FI, ECLA, etc. Thus, the patent classification has a hierarchical structure. Patent classification of the present invention, the patent classification includes IPC, USPC, FT, FI, ECLA. An index, also called a catchword, refers to a system in which at least one patent classification is associated with a word, phrase, or clause. Among the indexes, there is a catchword that processes the IPC, and there is an index to USPC issued by the US Patent and Trademark Office. The index also has a hierarchical structure like a patent classification. In the index, keywords corresponding to product names / part names / element descriptions are often embedded. It can be seen that the index makes it easy to find the patent classification. Information on catchwords for IPC provided by WIPO is available as ipcr_catchwordindex_20100101.zip at http://www.wipo.int/ipc/itos4ipc/ITSupport_and_download_area/20100101/MasterFiles/ as of March 2011. It can be seen that G06C 1/00 corresponds to ABACUSES in the file. In this case, ABACUSES is called an index corresponding to G06C 1/00. Accordingly, G06C 1/00 can be reverse mapped to Abacuses. On the other hand, as can be seen from the index ABARADING of the contents of the file, it can be seen that the catchword system has a hierarchical structure of at least one step. ABARADING shows that it is a two-tier hierarchy. The USPTO is an index for USPC and information about them is available at http://www.uspto.gov/web/patents/classification/uspcindex/indextouspc.htm as of March 2010. In the content of the uplink, it can be seen that the index Abrading is classified into three levels. It can be seen that the index Abrading corresponds to USPC 451/38, and there is a patent classification of a class other than Class 451 in the lower index of Abrading. Any matters related to the patent classification are stored in the patent classification DB 1130.

핵심 키워드 DB(1300)는 추출된 핵심 키워드에 관한 정보를 저장하고 있다. 특허 문건에서 어떻게 핵심 키워드(공기쌍을 포함한다)를 추출하는지에 대해서는 후술한다. 핵심 키워드 DB(1300)에는 핵심 키워드별 메타 데이터 DB가 있는데, 이에는 핵심 키워드별 메타 정보가 저장된다. 핵심 키워드가 추출될 때는 키워드별로 어떤 특허 문건에서 추출되었는지에 대한 특허 문건 번호와 추출된 필드 및 상기 특정 특허 문건 번호의 특정 필드에서의 빈도(frequency)가 동시에 계산된다. 상기 특허 문건 번호의 서지 사항에는 각종 날짜 정보, 출원인 정보, 발명자 정보, 특허 분류 정보, reference 정보 등과 같은 각종 서지 사항 정보가 대응되게 된다. 따라서, 특정한 핵심 키워드에 대해서는 특허 문건 번호, 서지 사항, 필드 및 빈도가 대응될 수 있으며, 이러한 정보는 핵심 키워드 메타 데이터 DB(1310)의 기본적인 내용이 된다. 한편, 상기와 같은 핵심 키워드 메타 데이터 DB(1310)의 기본적인 내용이 있는 경우, 각 키워드별 증감율 데이터, 연관 분석(장바구니 분석이 한 예가 된다.) 결과 데이터, 및 각 핵심 키워드별 특허 분류 간의 맵핑 확률 데이터 등이 생성될 수 있으며, 이러한 데이터들도 핵심 키워드 메타 데이터 DB(1310)의 내용이 될 수 있다. 상기 핵심 키워드별 특허 분류 간의 맵핑 확률 데이터는 핵심 키워드 Ki가 있을 때, 이 Ki가 적어도 하나 이상의 특허 분류 Cj에 대응되는 확률, P(Cj(Ki))이 상기 맵핑 확률이 된다. Ki가 특허 문건 P1, P2, P3 및 P4에서 나왔고, P1, P2, P3는 C1의 분류가 되고, P4는 C2의 분류가 되는 경우, Ki가 C1에 대응될 확률은 3/4가 되고, C2에 대응될 확률은 1/4이 된다. 즉, 상기 P(Cj(Ki)) =(Ki가 추출된 특허 문건 중에서 Cj 분류를 가지는 특허 문건의 수)}/(Ki가 추출된 특허 문건의 수)가 된다. 한편, 상기 P(Cj(Ki))가 있는 경우, Cj를 대표하는 Ki들의 집합을 추출할 수 있을 것이다. 이때, KI의 Cj에 대한 P(Cj(Ki)) 또는 기설정된 메트릭(metric)이 높은 Ki들이 이러한 집합을 형성할 수 있다. The core keyword DB 1300 stores information about the extracted core keyword. How to extract key keywords (including air pairs) from patent documents will be described later. The core keyword DB 1300 has a metadata key DB for each key keyword, and the meta information for each key keyword is stored therein. When a key keyword is extracted, a patent document number for which patent document is extracted for each keyword, a frequency in the extracted field, and a specific field of the specific patent document number are simultaneously calculated. The bibliographic information of the patent document number corresponds to various bibliographic information such as various date information, applicant information, inventor information, patent classification information, reference information, and the like. Therefore, a patent document number, bibliography, fields, and frequency may correspond to a specific key keyword, and this information becomes the basic content of the key keyword metadata DB 1310. On the other hand, if there is a basic content of the core keyword metadata DB 1310 as described above, the probability of mapping between increase and decrease rate data for each keyword, association analysis (basket analysis is an example) result data, and patent classification for each key keyword Data may be generated, and such data may also be contents of the core keyword metadata DB 1310. The mapping probability data between the patent classifications for each of the key keywords has a key keyword Ki, and the probability that Ki corresponds to at least one or more patent classifications Cj, P (Cj (Ki)), becomes the mapping probability. If Ki comes from patent documents P1, P2, P3 and P4, P1, P2, P3 is classified as C1, P4 is classified as C2, the probability that Ki corresponds to C1 is 3/4, and C2 The probability of matching is 1/4. That is, P (Cj (Ki)) = (the number of patent documents with Cj classification among the patent documents from which Ki was extracted)} / (the number of patent documents from which Ki was extracted). Meanwhile, when P (Cj (Ki)) is present, a set of Kis representing Cj may be extracted. In this case, P (Cj (Ki)) or Kis having a predetermined metric with respect to Cj of the KI may form this set.

특허 분류 DB(1130)를 가공한 것에는 분류 메타 데이터 DB가 있는데, 이에는 특허 분류별 메타 정보가 저장된다. 특허 문건에는 적어도 한 종류 이상 및 적어도 하나 이상의 특허 분류가 대응되어 있으며, 상기 특허 문건 번호의 서지 사항에는 각종 날짜 정보, 출원인 정보, 발명자 정보, 특허 분류 정보, reference 정보 등과 같은 각종 서지 사항 정보가 대응되게 된다. 따라서, 특정한 특허 분류에 대해서는 특허 문건 번호, 서지 사항이 대응될 수 있으며, 이러한 정보는 특허 분류 메타 데이터 DB의 기본적인 내용이 된다. 한편, 상기와 같은 특허 분류 메타 데이터 DB의 기본적인 내용이 있는 경우, 각 특허 분류별 증감율 데이터, 연관 분석(장바구니 분석이 한 예가 된다.) 결과 데이터, 및 각 특허 분류별 특허 분류 간의 맵핑 확률 데이터, 점유율이나 활동율 등과 같은 특허 지표(일종의 메트릭)별로 특허 지표값이 높은 출원인 데이터나 발명자 데이터 등이 생성될 수 있으며, 이러한 데이터들도 특허 분류 메타 데이터 DB의 내용이 될 수 있다.The processed patent classification DB 1130 includes a classification metadata DB, in which meta information for each patent classification is stored. At least one kind and at least one patent classification correspond to a patent document, and the bibliographic information of the patent document number corresponds to various bibliographic information such as various date information, applicant information, inventor information, patent classification information, reference information, and the like. Will be. Therefore, a patent document number and a bibliographic matter may correspond to a specific patent classification, and this information becomes the basic content of the patent classification metadata DB. On the other hand, if there are basic contents of the patent classification metadata DB as described above, the increase / decrease rate data for each patent classification, the association analysis (an example of a shopping cart analysis), the result data, and the mapping probability data, share of the patent classification for each patent classification, Applicant data or inventor data having high patent index values may be generated for each patent index (such as a metric) such as an activity rate, and these data may also be contents of a patent classification metadata DB.

상기 사용자 데이터부(1700)는 상기 특허 정보 시스템(10000)을 사용하는 사용자의 일체의 관리 정보 및 사용자가 생성하거나 관리하는 정보가 저장된다. 한편, 목적 특화 데이터부(1900)에는 출원인 대표명화 정보 등을 포함하는 출원인 DB(1910), 각종 규칙(특허 지표나 분석 명령어 구문 등도 규칙이 된다.) 정보가 저장되는 규칙 데이터부(1920), 자연어 처리를 위한 사전(번역을 위한 대역어 사전, 시소러스 사전 등이 그 예가 된다.), 기타 사전(과학기술용어 사전, 논문에서 추출된 색인어 사전 등이 그 예가 된다.) 등과 같은 정보는 언어 데이터부(1930)에 저장되며, 표준 특허풀에 속하는 특허 등은 표준 특허풀 데이터부(1940)에 저장된다. 기타 상기 목적 특화 데이터부(1900)에는 특화된 목적에 부합하는 각종 데이터가 저장된다.The user data unit 1700 stores all management information of a user who uses the patent information system 10000 and information generated or managed by the user. On the other hand, in the object-specific data unit 1900, the applicant DB 1910 including the applicant representative name information, etc., the rule data unit 1920 which stores various rules (patent indicators or analysis instruction syntax, etc.) information is stored; Information such as dictionaries for natural language processing (bandword dictionaries for translation, thesaurus dictionary, etc.) and other dictionaries (e.g., scientific and technical dictionaries, indexed dictionaries extracted from articles, etc.) 1930, the patents belonging to the standard patent pool and the like are stored in the standard patent pool data unit 1940. In addition, the purpose-specific data unit 1900 stores various types of data corresponding to the purpose.

상기 데이터 가공부(2000)는 도 4에서 예시되어 있는 바와 같이, 특허 문건에서 적어도 하나 이상의 핵심 키워드를 추출하는 핵심 키워드 생성부(2100), 특허 분류에 대한 각종 메타 데이터를 생성하는 분류 메타 데이터 생성부(2200), 목적 특화된 데이터를 생성하는 목적 특화 데이터 생성부(2300), 주어진 특허 문건과 내용적으로 유사한 특허 집합을 생성하는 유사 특허 집합 생성부(2400), 연관 분석 결과가 있는 경우, 연관한 노드(node)들 및 상기 연관된 노드들 간의 관계 정보(에지(edge 정보))를 생성하는 네트워크 데이터 생성부(2500) 등이 있을 수 있다.As illustrated in FIG. 4, the data processing unit 2000 includes a key keyword generator 2100 for extracting at least one key keyword from a patent document, and classification metadata for generating various metadata for patent classification. Unit 2200, object-specific data generation unit 2300 for generating object-specific data, similar patent set generation unit 2400 for generating a patent set that is similar in content to a given patent document, and if there is an association analysis result, the association There may be a network data generation unit 2500 for generating one node and related information (edge information) between the associated nodes.

도 5에서 예시되는 특허 정보 서비스 지원부(3000)는 검색 처리를 수행하기 위한 검색 처리부(3100), 회원 관리와 관련된 회원용 관리부(3200), 상기 특허 정보 시스템(10000)이 제공하는 서비스나 데이터 등을 웹서비스나 SOA 방식 등으로 적어도 하나 이상의 사용자 컴퓨터(100)(본 발명의 사용자 컴퓨터(100)는 PC 등과 같은 개인용 컴퓨터나 휴대용 단말기 뿐만 아니라, 서버나 시스템 등과 같은 것도 본 발명의 특허 정보 시스템(10000)을 사용하는 한 본 발명의 사용자 컴퓨터(100)가 된다.) 플랫폼 서비스를 제공하기 위한 플랫폼 서비스 제공부, 전자 상거래 서비스를 제공하는 전자 상거래 서비스 제공부, 커뮤니티 서비스를 제공하는 커뮤니티 서비스 제공부(8500), 과금 서비스를 제공하는 과금 처리부(8600), 특허 문건 단위로 댓글을 달수 있도록 하고, 상기 댓글을 상기 특허 문건의 발명자나 출원인에게 전송하거나, 또는 상기 댓글을 상기 특허 문건을 오픈하는 사용자 컴퓨터(100)에 제공하는 기능을 수행하는 것을 포함하여 일체의 댓글 서비스를 처리하는 댓글 처리부(8700)를 포함한다. 상기 검색 처리부(3100)는 검색 엔진부(3110) 및 DB 쿼리 처리부(3120) 중 어느 하나 이상을 포함할 수 있다. 통상적인 DBMS는 SQL 검색 등을 지원하므로, 상기 DBMS는 상기 DB 쿼리 처리부(3120)의 기능을 수행할 수 있다.The patent information service support unit 3000 illustrated in FIG. 5 includes a search processor 3100 for performing a search process, a member manager 3200 related to member management, a service or data provided by the patent information system 10000, and the like. At least one or more user computers 100 by using a web service or SOA method (the user computer 100 of the present invention is not only a personal computer or a portable terminal such as a PC, but also a server or a system). 10000), which is the user computer 100 of the present invention.) A platform service provider for providing a platform service, an e-commerce service provider for providing an e-commerce service, and a community service provider for providing a community service. (8500), the billing processing unit (8600) for providing a billing service, to be able to comment in units of patent documents, the comment And a comment processing unit 8700 for processing any comment service, including performing a function of transmitting to the inventor or applicant of the patent document or providing the comment to the user computer 100 opening the patent document. do. The search processor 3100 may include one or more of a search engine 3110 and a DB query processor 3120. Since a typical DBMS supports an SQL search, the DBMS may perform a function of the DB query processor 3120.

도 6은 본 발명의 특허 분석 정보 생성부(4000)에 대한 일 실시예적 도면이다. 상기 특허 분석 정보 생성부(4000)는 입수된 임의의 적어도 하나 이상의 특허 문건 집합 또는 분석 대상 데이터 집합을 입수하고, 입수된 특허 문건 집합 또는 분석 대상 데이터 집합에 대한 기설정된 적어도 하나 이상의 분석 결과를 생성하는 기능을 수행한다. 특허 분석을 위한 특허 문건 집합 또는 분석 대상 데이터 집합의 입수는 분석 대상 입수부(4100)가 처리한다. 분석 결과의 생성은 특허 분석 처리부(4200)가 수행한다. 상기 특허 분석 처리부(4200)는 분석 지표 DB(4210)에 저장되어 있는 적어도 하나 이상의 분석 지표에 대한 분석 지표값을 생성하는 명령어를 입수하거나, 분석 명령 구문 DB(4220)에 저장되어 있는 분석 명령 구문을 입수하고, 이를 입수한 특허 문건 집합 또는 분석 대상 데이터 집합에 적용하여 분석 결과를 생성한다. 한편, 특허 분석에서 각종 옵션이 부가될 수 있는데, 이러한 옵션은 분석 옵션 처!!리부(4300)가 수행한다. 분석 옵션 처!!리부(4300)에는 분석 대상이 되는 특허 문건 집합이나 분석 대상 데이터를 한정하는 데이터 한정 옵션 처리부(4310), 분석 결과 중에 어떤 사항만을 표시할 것인지를 처리하는 표시 옵션 처!!리부(4320), 및 기타 특허 분석을 위한 각종 옵션을 처리하는 기타 옵션 처리부(4330) 및 사용자 컴퓨터(100)로부터 각종 옵션에 대한 선택 정보를 입수하고, 입수된 옵션 선택 정보를 특허 분석 처리부(4200)에 전달하는 옵션 선택부(4340)가 있다. 사용자에 전송되는 분석 결과는 분석 결과 리포팅부(4400)에서 생성된다. 리포팅의 형태는 표, 차트, 다이어그램(네트워크 다이어그램이 그 한 예가 된다.), 또는 기설정된 형식(예를 들면 pdf, 웹 페이지 등)의 문서 중 어느 하나 이상이 될 수 있다. 이들 각각의 생성은 표 생성부(4410), 차트 생성부(4420), 다이어그램 생성부(4430), 리포트 생성부(4440)가 담당한다. 집합 단위 인용 분석(4500)는 적어도 2 이상의 특허 문건 집합을 대상으로 기설정된 적어도 하나 이상의 분석 지표나 분석 명령 구문을 적용하여, 특허 분석 결과를 생성해 낸다. 이에 대해서는 후술한다.6 is an exemplary diagram of a patent analysis information generation unit 4000 of the present invention. The patent analysis information generation unit 4000 obtains any one or more patent document sets or analysis target data sets obtained, and generates at least one preset analysis result for the obtained patent document set or analysis target data set. It performs the function. The acquisition of the patent document set or the analysis target data set for patent analysis is processed by the analysis target acquisition unit 4100. The patent analysis processor 4200 generates the analysis result. The patent analysis processor 4200 obtains an instruction for generating an analysis index value for at least one analysis index stored in the analysis index DB 4210, or an analysis command syntax stored in the analysis command syntax DB 4220. And obtain the result of analysis by applying to the patent document set or the analysis target data set obtained. Meanwhile, various options may be added in the patent analysis, which is performed by the analysis option processing unit 4300. The analysis option processing unit 4300 includes a data limiting option processing unit 4310 for defining a set of patent documents to be analyzed or data to be analyzed, and a display option processing unit for processing only what is displayed in the analysis results. 4320, and other option processing unit 4330 for processing various options for other patent analysis, and selection information for various options from the user computer 100, and the obtained option selection information is obtained from the patent analysis processing unit 4200. There is an option selector 4340 to transmit to. The analysis result transmitted to the user is generated by the analysis result reporting unit 4400. The reporting may be one or more of a table, chart, diagram (network diagram is one example), or a document in a predetermined format (eg, pdf, web page, etc.). Each of these generations is in charge of the table generator 4410, the chart generator 4420, the diagram generator 4430, and the report generator 4440. The set unit citation analysis 4500 applies at least one analysis index or analysis command syntax preset to at least two sets of patent documents to generate a patent analysis result. This will be described later.

도 7에서는 본 발명의 옵션 선택부(4340)에 대한 상세한 구성예를 보여 주고 있다. 옵션 선택부(4340)는 출원일, 공개일, 등록일, 우선일, 최선일 등과 같은 특허 문건의 기간과 관련된 한정을 수행하는 기간 한정부(4340-1), 권리자의 주소 중 국가 정보에 기초하여 국가를 한정하는 국가 한정부(4340-2), 적어도 하나 이상의 출원인에 대한 한정을 처리하는 출원인 한정부(4340-3), 적어도 하나 이상의 발명자에 대한 한정을 수행하는 발명자 한정부(4340-4), 적어도 하나 이상의 특허 분류의 종류별, 특허 분류의 깊이별, 및 특허 분류의 주특허 분류와 부특허분류별 중 어느 하나 이상에 대한 한정을 수행하는 특허 분류 한정부(4341-5), 특정 필드의 속성을 한정하는 필드 속성 한정부(4341-6), 개별 필드의 값에 대한 구체적 한정을 수행하는 개별 필드값 한정부(4341-7), 개별 특허의 구체적 속성에 대한 한정을 수행하는 개별 특허 속성값 한정부(2111-8), 및 기타 인용 깊이, 인용 종류(직접, 간접, 잠재, 사슬 등 중 어느 하나 이상), 및 인용 방향(전방, 후방)에 대한 한정 등과 같은 기타 한정을 수행하는 기타 한정부(4341-9) 등이 있다. 상기 필드에는 서지 사항을 구성하는 필드가 포함되며, 출원인, 발명자, 특허 분류 등이 있을 수 있다. 예를 들어, 출원인 중 기업, 기관, 대학, 개인 중에 대한 한정, 기업이라도 대기업과 중소기업 등과 같은 기업 규모에 기반한 한정이나, 발명자 중에서도 국내 발명자, 해외 발명자 등과 같은 발명자 주소에 기반한 한정 등이 그 예가 된다. 한편, 특허당 피인용 수가 기설정된 값 이상이 되는 출원인에 대한 한정, 최근 5년간 특허 출원 증가율이 25% 이상인 발명자에 대한 한정 등과 같은 것이 개별 필드값에 대한 한정이 된다. 그리고, 피인용수가 10회 이상인 특허, 특허 등급 평가에서 상위 10% 이내에 드는 특허, 또는 분쟁 발생 특허 등과 같은 개별 특허의 속성(피인용수, 특허 등급, 분쟁 발생 여부 등)에 대한 값을 기준으로 한 한정이 개별 특허 속성값 한정이 된다.7 shows a detailed configuration example of the option selector 4340 of the present invention. The option selector 4340 may select a country based on the country information among the owner's address and a time limiter 4340-1 which performs limitations related to the period of the patent document such as the application date, publication date, registration date, priority date, and best date. National limiting section 4340-2, applicant limiting section 4340-3 for processing limitations for at least one or more applicants, inventor limiting section 4340-4 for performing limitations for at least one or more inventors, at least A patent classification limiter 4431-5 which performs a limitation on one or more types of one or more patent classifications, by depth of patent classification, and by main patent classification and subpatent classification of a patent classification, and defines attributes of a specific field. A field attribute defining portion 4431-6 for performing a field, an individual field value defining portion for performing a specific limitation on a value of an individual field 4361-7, and an individual patent attribute value defining portion for performing a limitation on a specific attribute of an individual patent. (2111-8), Miscellaneous constraints 4434-9 that perform other constraints, such as citation depth, one or more of citation types (direct, indirect, latent, chain, etc.), and citation directions (front, back). . The field includes a field constituting a bibliographic matter, and may include an applicant, an inventor, a patent classification, and the like. For example, a company, an institution, a university, an individual among applicants, a limitation based on the size of a company such as a large company and a small and medium company, or a limitation based on an inventor address such as a domestic inventor or an overseas inventor among the inventors. . On the other hand, limitations on the applicants whose number of citations per patent are equal to or greater than a predetermined value, limitations on the inventors whose patent application growth rate is 25% or more in recent 5 years, and the like are limitations on individual field values. And based on the value of the property of the individual patent (such as the number of citations, patent grades, disputes, etc.), such as a patent with 10 or more citations, a patent within the top 10% of a patent rating evaluation, or a disputed patent. One limitation is an individual patent attribute value limitation.

도 8은 본 발명의 핵심 키워드 DB(1300)의 일 실시예적 구성에 관한 도면이다. 핵심 키워드 DB(1300)에는 기술 키워드 DB(131), 제품 키워드 DB(132), 구문 키워드 DB(133) 및 핵심 키워드 메타 데이터 DB(1310)가 있다. 상기 핵심 키워드 DB(1300)는 각 특허 문건별로 추출한 핵심 키워드, 공기쌍 등에 관한 정보가 저장되어 있으며, 상기 제품 키워드 DB(132)에는 제품을 특징 짓는 각종 키워드나 공기쌍이, 구문 키워드 DB(133)에는 기술적 문제(예를 들면, 경량화, 금속 내 수소 함량 감소(to reduce the amount of hydrogen in metal에서 to 액션 + 액션 대상 + 대상의 소재 위치)를 표현하는 구문, 기술적 해결 방법을 표현하는 구문 등과 같이 특정한 속성을 가지는 구문을 표현하는 구문 키워드 집합에 대한 정보가 저장되어 있다. 한편, 상기 핵심 키워드 메타 데이터 DB(1310)에 대해서는 상기에서 설명한 바 이외에도, 키워드 간의 관계성(연관 분석의 결과로 키워드 간의 관계성 정보가 생성될 수 있다.) 정보가 저장되는 키워드 관계성 DB(1311), 특허 분류별로 그 특허 분류를 대표하는 키워드를 모아 놓은 특허 분류별 대표 키워드 DB(1312) 등이 포함되어 있을 수 있다.8 is a diagram illustrating an exemplary configuration of the key keyword DB 1300 of the present invention. The key keyword DB 1300 includes a technology keyword DB 131, a product keyword DB 132, a syntax keyword DB 133, and a key keyword metadata DB 1310. The key keyword DB 1300 stores information on key keywords, air pairs, etc. extracted for each patent document, and the product keyword DB 132 includes various keywords or air pairs that characterize a product, and a phrase keyword DB 133. This includes technical problems (e.g., reducing weight, reducing the amount of hydrogen in metal to to action + action + material location of the object), and syntax for describing technical solutions. Information about a phrase keyword set representing a phrase having a specific attribute is stored in. Meanwhile, the core keyword metadata DB 1310 is not only described above, but also relations between keywords (as a result of association analysis). Relationship information can be generated.) Keyword relation information DB 1311 in which the information is stored, and keywords representing the patent classification for each patent classification. The representative keyword DB 1312 for each patent classification collected may be included.

도 9은 본 발명의 특허 분쟁 예측 정보 생성 시스템(5000)의 일 실시예적 구성을 보여 주는 도면이다.9 is a diagram illustrating an exemplary configuration of a patent dispute prediction information generation system 5000 of the present invention.

상기 특허 분쟁 예측 정보 생성 시스템(5000)은 특허 분쟁 예측 정보를 생성하는 분쟁 예측 엔진(5100), 분쟁에 대한 각종 데이터를 저장하고 있는 분쟁 DB부(5200), 특허 분쟁에 대해서 시스템 및/또는 사용자의 관리 정보를 제어하는 분쟁 예측 관리부(5400), 분쟁 예측 모델을 생성하는 분쟁 예측 모델 엔진과 분쟁 예측 정보를 분석하는 분쟁 예측 정보 분석 엔진(5300)을 포함하고 있을 수 있다.The patent dispute prediction information generating system 5000 includes a dispute prediction engine 5100 for generating patent dispute prediction information, a dispute DB unit 5200 storing various data about disputes, a system and / or a user for patent disputes. The dispute prediction management unit 5400 for controlling the management information of the dispute prediction model engine for generating a dispute prediction model and a dispute prediction information analysis engine 5300 for analyzing the dispute prediction information may be included.

상기 분쟁 예측 엔진(5100)은 상기 사용자 또는 상기 특허 정보 시스템(10000) 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)이 생성하거나 관리하는 적어도 하나 이상의 특허로 구성된 자기 특허를 포함하는 자기 특허 집합을 생성하거나 특정하는 자기 특허 집합 생성부(5110), 상기 자기 특허 집합을 구성하는 자기 특허와 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허로 구성되는 타겟 특허 집합을 생성하거나, 상기 사용자가 생성하거나, 지정하는 타겟 특허로 구성되는 타겟 특허 집합을 입수하는 타겟 특허 집합 생성부(5120), 상기 특허별 분쟁 예측 모델값 생성부(5530)가 생성한 특허별 분쟁 예측 모델값을 입수하거나, 분쟁 예측 모델 생성부(5520)를 통하여 적어도 하나 이상의 지정된 특허의 분쟁 예측 모델값을 생성하도록 제어하여, 생성된 분쟁 예측 모델값을 입수하는 분쟁 예측 모델값 입수부(5130), 기 설정된 적어도 하나 이상의 분쟁 예측 정보를 생성하는 분쟁 예측 정보 생성부(5140)와 생성된 분쟁 예측 정보값을 제공하는 분쟁 예측 정보값 제공부(5150)를 포함하고 있다.The dispute prediction engine 5100 generates a self patent set including a self patent comprising at least one patent generated or managed by the user or the patent information system 10000 or the patent dispute prediction information generation system 5000. Or a self-patent set generation unit 5110 to generate or specify a target patent set composed of at least one target patent having a predetermined relationship with a self-patent constituting the self-patent set, or generated or specified by the user. A target patent set generation unit 5120 for obtaining a target patent set composed of target patents, a dispute prediction model value for each patent generated by the dispute prediction model value generation unit 5530 for each patent, or a dispute prediction model generation unit And control to generate a dispute prediction model value of at least one or more designated patents through 5520 A dispute prediction model value acquisition unit 5130 that obtains a dispute prediction model value, a dispute prediction information generation unit 5140 that generates at least one or more preset dispute prediction information, and a dispute prediction information value that provides the generated dispute prediction information value The provision part 5150 is included.

상기 분쟁 예측 모델 생성 엔진(5500)은 분쟁 예측 요소마다 기 설정된 분쟁 예측 요소값 생성 규칙에 따라 분쟁 예측 요소값을 생성하는 분쟁 예측 요소값 생성부(5510), 기 설정된 통계 알고리즘을 사용하여 적어도 하나 이상의 분쟁 예측 모델을 생성하는 분쟁 예측 모델 생성부(5520), 상기 분쟁 예측 모델로 특허별로 분쟁 예측 모델값을 생성하는 특허별 분쟁 예측 모델값 생성부(5530), 생성된 특허별 분쟁 예측 모델값을 제공하는 분쟁 예측 모델값 제공부(5540)를 포함하고 있을 수 있다.The dispute prediction model generation engine 5500 may use the dispute prediction element value generation unit 5510, which generates a dispute prediction element value according to a dispute prediction element value generation rule set for each dispute prediction element, using at least one predetermined statistical algorithm. Dispute prediction model generation unit 5520 to generate a dispute prediction model described above, a dispute prediction model value generation unit 5530 for each patent to generate a dispute prediction model value for each patent using the dispute prediction model, the generated dispute prediction model value for each patent It may include a dispute prediction model value providing unit 5540 that provides.

한편, 상기 분쟁 예측 요소값 생성부(5510)는 생성되는 분쟁 예측 요소값의 속성에 따라, 인용 계열 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 인용 계열 분쟁 예측 요소값 생성부(5510), 제품이나 기술군 계열 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 제품/기술군 계열 분쟁 예측 요소값 생성부(5510), 권리자나 출원인, 발명자 등 주체 계열 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 주체 계열 분쟁 예측 요소값 생성부(5510), 사용자가 입력하거나 설정하는 적어도 하나 이상의 트롤(Patent Troll), 특허 기술 분류, 특허 집합, 경쟁사나 관계사, 표준 특허 등과 같은 기 설정된 속성을 가지는 특허군 등과 같이 사용자 계열 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성하는 사용자 입력 계열 분쟁 예측 요소값 생성부(5510)가 있다.Meanwhile, the dispute prediction element value generation unit 5510 may generate a dispute prediction element value generation unit 5510 for generating a dispute prediction element value for the quotation sequence dispute prediction element according to the generated attribute of the dispute prediction element value. The dispute prediction element values for the product / technology group dispute prediction element value generation unit 5510 and the owner, the applicant, the inventor, etc., which generate the dispute prediction element values for the product or technology family dispute prediction element, A subject-based dispute prediction element value generation unit 5510 to be generated, at least one or more trolls (patent troll) input or set by the user, patent technology classification, patent set, competitors or affiliates, standard patents, etc. There is a user input sequence dispute prediction element value generator 5510 that generates a dispute prediction element value for a user sequence dispute prediction element, such as a group.

상기 분쟁 DB부(5200)에는 발생된 분쟁에 관한 정보를 저장하는 분쟁 발생 특허 DB(5210)가 있으며, 상기 분쟁 발생 DB부가 저장하는 정보에는 분쟁 고유 식별자 정보, 분쟁 처리 법원 정보, 적어도 하나 이상의 분쟁 대상 특허 정보(특허 번호, 및/또는 상기 특허 번호에 대응되는 서지 정보, 및/또는 명세서 정보, 및/또는 도면 정보, 및/또는 경과 정보 등 상기 특허 번호로 특정될 수 있는 일체의 특허 정보), 적어도 하나 이상의 원고 정보, 적어도 하나 이상의 피고 정보, 적어도 하나 이상의 소송 경과 정보, 소송 결과 정보, 소송 심급 정보 등이 저장되어 있을 수 있다. 한편, 상기 분쟁 DB부(5200)에는 분쟁 예측 요소 및 상기 분쟁 예측 요소별 분쟁 예측 요소값을 생성하는 규칙(SQL 명령문 등) 데이터를 분쟁 예측 요소 DB, 상기 분쟁 예측 모델을 저장하는 분쟁 예측 모델 DB, 특허별로 상기 분쟁 예측 요소값을 저장하는 분쟁 예측 요소값 DB(5220), 특허별로 상기 분쟁 예측 모델값을 저장하는 분쟁 예측 모델값 DB(5230)를 포함하고 있을 수 있다.The dispute DB portion 5200 includes a dispute occurrence patent DB 5210 for storing information on a dispute that has occurred, and the information stored in the dispute occurrence DB portion includes dispute unique identifier information, dispute processing court information, and at least one dispute. Target patent information (any patent information that can be specified by the patent number, such as patent number and / or bibliographic information corresponding to the patent number, and / or specification information, and / or drawing information, and / or progress information) At least one or more manuscript information, at least one or more defendant information, at least one or more litigation progress information, litigation result information, litigation depth information, and the like may be stored. Meanwhile, the dispute DB unit 5200 includes a dispute prediction element DB and a dispute prediction model DB for storing the dispute prediction element DB and rule data for generating the dispute prediction element value for each dispute prediction element (such as an SQL statement) and the dispute prediction model. And a dispute prediction element value DB 5220 for storing the dispute prediction element value for each patent, and a dispute prediction model value DB 5230 for storing the dispute prediction model value for each patent.

상기 분쟁 예측 관리부(5400)에는 특허 분쟁 관련된 데이터를 입수하고, 파싱(parsing) 등의 처리하는 분쟁 데이터 입수부(5410), 사용자가 입력하는 분쟁 관련 data 및 기타 사용자가 입력하는 특허 집합을 관리하는 사용자 입력 특허군 관리부 (5432), 분쟁 관련되어 상기 분쟁 예측 엔진(5100)이 생성하는 분쟁 정보 및, 특허 분쟁 예측과 관련하여 상기 사용자에게 상기 특허 분쟁 예측 정보 생성 시스템(5000)을 사용하도록 하며, 생성된 분쟁 예측 정보를 기 설정된 UI를 통하여 제공하는 분쟁 UI부(5431)를 포함하고 있을 수 있다. 상기 분쟁 예측 관리부(5400)는 상기 특허 분쟁 예측 정보 생성 시스템(5000)을 관리하는 분쟁 예측 시스템 관리부(5420)와 상기 특허 분쟁 예측 정보 생성 시스템(5000)을 사용하는 사용자를 관리하는 분쟁 예측 사용자 관리부(5430)를 더 포함하고 있을 수 있다. 상기 분쟁 예측 시스템 관리부(5420)는 기 설정된 분쟁 예측 정보의 생성을 위한 배치(batch) 처리를 수행하는 분쟁 예측 정보 배치 생성부(5421)를 더 포함하고 있을 수 있다. 상기 배치 처리는 기 설정된 권리자 단위, 기 설정된 특허 분류 단위, 상기 분쟁 예측 관리부(5400)가 관리하고 있는 사용자별 특허군 단위로 기 설정된 주기 또는 기 설정된 조건 성취(분쟁 예측 모델의 갱신 또는 분쟁 예측 요소값의 갱신, 신규 분쟁 데이터의 유입 등)에 따라 기 설정된 분쟁 예측 정보를 생성을 위한 처리를 말한다. 상기 분쟁 예측 사용자 관리부(5430)는 사용자가 입력하는 특허군을 관리하는 사용자 입력 특허군 관리부 (5432)를 더 포함하고 있을 수 있다.The dispute prediction management unit 5400 obtains patent dispute related data and manages a dispute data acquisition unit 5410 which processes parsing and the like, manages dispute related data input by a user, and a patent set input by other users. A user input patent group management unit 5432, dispute information generated by the dispute prediction engine 5100 in relation to a dispute, and the user using the patent dispute prediction information generation system 5000 in connection with a patent dispute prediction, It may include a dispute UI unit 5523 providing the generated dispute prediction information through a predetermined UI. The dispute prediction management unit 5400 is a dispute prediction user management unit managing a dispute prediction system management unit 5520 managing the patent dispute prediction information generation system 5000 and a user who uses the patent dispute prediction information generation system 5000. It may further include 5430. The dispute prediction system manager 5520 may further include a dispute prediction information batch generation unit 5251 which performs a batch process for generating preset dispute prediction information. The batching process may be performed by a predetermined owner unit, a predetermined patent classification unit, or a user-specific patent group managed by the dispute prediction management unit 5400, or a predetermined period or a predetermined condition achievement (update of dispute prediction model or dispute prediction element). Refers to a process for generating preset dispute prediction information according to value update, inflow of new dispute data, etc.). The dispute prediction user manager 5430 may further include a user input patent group manager 5432 for managing a patent group input by a user.

상기 분쟁 예측 정보값 제공부(5150)는 특허 분쟁 정보나 특허 분쟁 예측 정보를 리포트 형식의 문서로 제공하기 위한 리포트를 생성하는 분쟁 정보 리포트 생성부(4440)를 포함하고 있으며, 상기 분쟁 정보 리포트 생성부(4440)가 생성하는 리포트는 분쟁 정보 입출력부를 통하여 사용자에게 전자 우편 등으로 메일링 또는 기 설정된 방식으로 리포팅할 수 있다.The dispute prediction information value providing unit 5150 includes a dispute information report generating unit 4440 that generates a report for providing patent dispute information or patent dispute prediction information in a document form, and generates the dispute information report. The report generated by the unit 4440 may be reported to the user through e-mail or in a predetermined manner through the dispute information input / output unit.

상기 분쟁 예측 정보 분석 엔진(5300)은 주어진 특허 집합을 분할하는 특허 집합 분할부(5310), 특허 공격이 예상되는 특허군 정보를 생성하는 공격 예상 특허군 생성부, 특허 분쟁 리스크를 효과적으로 헤징하기 위한 정보를 생성하는 리스크 헤징 정보 생성부(5330)또는 크로스 라이센싱의 대상 후보 특허군을 발견하는 크로스 라이센싱 정보 생성부(5340), 또는 라이센싱 대상 정보를 생성하는 라이센싱 대상 정보 생성부를 포함하고 있을 수 있다.The dispute prediction information analysis engine 5300 may include a patent set dividing unit 5310 dividing a given patent set, an attack predicting patent group generating unit generating patent group information for which a patent attack is expected, and effectively hedging a patent dispute risk. It may include a risk hedging information generation unit 5330 for generating information, a cross-licensing information generation unit 5340 for finding a candidate patent group for cross licensing, or a licensing target information generation unit for generating licensing target information.

먼저 도 10을 참조하면서, 분쟁 예측 모델의 생성을 설명한다. 상기 분쟁 예측 모델의 생성은 본 발명의 분쟁 예측 모델 생성 엔진(5500)이 수행한다. 상기 분쟁 예측 모델 생성 엔진(5500)은 분쟁 예측 요소별로 분쟁 예측 요소값 생성 규칙 정보를 입수(SL11)하고, 입력되는 상기 특허별 분쟁 예측 요소값을 생성(SL12)하고, 생성된 상기 특허별 분쟁 예측 요소값을 저장(SL13)한다. 상기 분쟁 예측 모델이 생성되기 위해서는 분쟁 예측 요소의 도출이 선행되어야 한다. First, generation of the dispute prediction model will be described with reference to FIG. 10. The generation of the dispute prediction model is performed by the dispute prediction model generation engine 5500 of the present invention. The dispute prediction model generation engine 5500 obtains the dispute prediction element value generation rule information for each dispute prediction element (SL11), generates the input dispute prediction element value for each patent (SL12), and generates the dispute for each patent. The predictive element value is stored (SL13). In order to generate the dispute prediction model, derivation of a dispute prediction element must be preceded.

분쟁 예측 요소의 도출은 분쟁 특허에 대한 분석을 통해서 도출할 수 있다. 특허 분쟁이 일어난 특허(분쟁 대상 특허)들은 www.westlaw.com의 patent practitioner 등과 같은 온라인 사이트 등에서 유료 또는 무료로 입수할 수 있다. 이들 특허 분쟁 관련 데이터를 분석하면 분쟁이 일어난 특허들은 분쟁이 일어나지 않은 특허들과 다음과 같은 관점에서 구분되는 특징을 가진다. 상기 구분되는 특징에 기반하여 분쟁 예측 요소라는 변수 집합을 생성한다.Derivation of dispute prediction factors can be derived through analysis of dispute patents. Patent disputed patents (disputed patents) are available for a fee or for free at online sites such as the patent practitioner at www.westlaw.com. Analyzing these patent dispute-related data, the disputed patents are distinguished from the non-dispute patents in the following aspects. A variable set called a dispute prediction element is generated based on the distinguished features.

첫째, 분쟁 특허들은 인용 관계가 강하다. 분쟁 대상 특허들은 비분쟁 특허들보다 상대적으로 인용을 많이 받거나, 최근에 인용이 증가하는 등과 같은 경향이 있다. 특히, 분쟁 대상 특허와 소송 피고 간에는 강한 인용 관계가 존재하는 경우가 많다. 즉, 분쟁 대상 특허를 피고의 특허가 인용하는 관계가 성립하는 경우가 분쟁이 일어나지 않은 특허를 피고의 특허가 인용하는 관계가 성립하는 경우보다 상대적으로 더 많다. 이러한 인용 관계는 분쟁 대상 특허를 인용하는 후출원 특허들의 권리자(출원인) 정보 등과 같은 서지 사항을 입수하여, 입수된 서지 사항을 분석함으로써 알 수 있게 된다. 인용 관계가 분쟁 대상과 관련하여 중요한 역할을 하는 이유는 1) 특허들이 제품에 반영되는 경향이 강하고, 후출원 권리자가 자신의 제품의 기능(특히, 개선, 개량 기능)을 보호하기 위하여 그 기능과 관련된 특허를 출원하는 경우가 많으며, 2) 후출원 권리자의 특허권자가 선출원 권리자의 특허를 알고 있어 IDS 등으로 공개하거나, 선행 기술 조사, 특허청 심사관의 심사 과정에서 후출원 특허와 관련 있는 특허로 reference 등에 등재되는 경우가 많으며, 3) 선출원 권리자도 자신의 특허를 인용하는 후출원 권리자를 모니터링 하는 경우가 많으며, 4) 후출원 권리자로부터 많은 인용을 받는 선출원 권리자의 특허는 후출원 권리자의 제품에 반영되거나 강한 관련성을 가지는 경우가 많기 때문에(특히, 선출원 권리자의 특허의 청구항이 넓은 권리 범위를 가지는 경우에는 더욱 더 그러하다) 인용 관계는 분쟁과 상당히 강한 관련성을 가진다.First, disputed patents have a strong citation relationship. Patents in dispute tend to receive more citations than non-dispute patents, or to increase citations in recent years. In particular, there is often a strong citation relationship between the disputed patent and litigation defendant. In other words, the relationship between the defendant's patent and the disputed patent is established more than the relationship between the defendant's patent and the disputed patent. This citation relationship can be known by acquiring bibliographic matters such as information on the owner (applicant) of post-patent patents citing the disputed patent, and analyzing the bibliographic matters obtained. Reasons for citations to play an important role in relation to the subject of the dispute include: 1) patents tend to be reflected in the product, and the applicant for the post-applicant is required to protect the function of his / her product (especially to improve or improve). Many applicants apply for related patents, and 2) the patent holder of the post-application owner knows the patent of the pre-applicant owner and discloses it to IDS, etc. In many cases, 3) the pre-applicant also monitors the post-applicant's citation of his patent, and 4) the patent of the pre-applicant's patent, which receives a lot of citations from the post-applicant's patent, is reflected in the product of the post-applicant's Since there are many cases where there is a strong relationship (especially, the claims of the patentee of the first-applicant have broad scope of rights). If there is even more true) incorporated relationship has a dispute with extremely strong relation.

위와 같은 특허 분쟁과 인용과의 관계성에 따라, 다음과 같은 인용 관점 분쟁 예측 요소 후보군을 도출할 수 있다. 선출원 권리자의 개별 특허 관점에서, 개별 특허를 인용하는 후출원 특허를 대상으로 하여 하기 표 1과 같은 인용 관점 분쟁 예측 요소 후보군이 예시적으로 고려될 수 있다. According to the relationship between the patent dispute and the citation as described above, the following candidate viewpoint for citation perspective dispute prediction factor can be derived. In view of the individual patents of the prior applicants, a citation viewpoint dispute prediction element candidate group as shown in Table 1 below may be considered as an example for a post-application patent citing an individual patent.

계열line 계열 codeSeries code 인용 관점 분쟁 예측 요소 후보군Citation perspective dispute prediction factor 비고Remarks 총개수Total number A1A1 총피인용수Total Blood Citations 최근 n년 개수Last n years B1B1 최근 n년간 피인용수(n은 1개 이상이 될 수 있음, 이하 이 표에서는 동일하게 적용)Citations in the last n years (n can be more than one, the same applies in this table below) Self 개수Self count C1C1 자기 인용수, 최근 n년간 자기 인용수Self citations, self citations in the last n years Non-Self 개수Non-Self Count D1D1 비자기 인용수, 최근 n년간 비자기 피인용수Nonmagnetic citations, nonmagnetic citations in the last n years 평균 개수Average number E1E1 청구항 당 A1~D1 모두, 피인용 특허수 당 피인용 권리자수, 피인용 특허수 당 피인용 특허 분류수, 피인용 특허수 당 피인용 권리자 종류수A1 to D1 per claim, the number of citations per citation, the number of citations per citation, and the number of citations per citation. 속성치Attribute value F1F1 피인용 특허의 권리자를 기준으로 한 A1, B1, D1, E1
피인용 특허의 권리자 종류를 기준으로 한 A1, B1, D1, E1
피인용 특허의 권리자 종류별 비율을 기준으로 한 A1, B1, D1, E1
피인용 특허의 특허 분류를 기준으로 한 A1, B1, D1, E1A1, B1, D1, E1 based on the rights holder of the cited patent
A1, B1, D1, E1 based on the type of rights holder of the cited patent
A1, B1, D1, E1 based on the ratio of the right holder of the cited patent
A1, B1, D1, E1 based on patent classification of cited patent 증감량 또는 증감율Increase or decrease rate G1G1 전기간 기준 A1~F1의 증감량 또는 증감율, 특정 기간 기준 A1~F1의 증감량 또는 증감율Increase or decrease rate of A1 ~ F1 for the whole period, increase or decrease rate of A1 ~ F1 for a certain period ReferenceReference H1H1 Reference수, Reference 중 특허수, Reference 중 특허 대비 논문수, Reference 특허 중 외국 특허수, Reference 중 국내 특허수/Reference 중 외국 특허수Number of references, number of patents in reference, number of papers compared to patent in reference, number of foreign patents in reference patent, number of foreign patents in reference / reference in foreign patents

상기 표 1에 대해서 설명한다. 권리자 종류는 기업, 기관, 대학, 개인으로 대별되며, 이는 다시 자국과 타국으로 대별될 수 있으며, 기업은 대기업과 소기업으로 구분될 수 있으며, 기업과 개인은 사적 주체, 기관과 대학은 공적 주체로 대별할 수 있으며, 다출원 주체와 소출원 주체로 대별할 수 있다. 미국 특허청이 발행하는 Maintenance Fee 정보에는 권리자에 대하여 Large Entity 여부에 대한 정보가 있으며, 다른 나라에 대해서는 권리자/출원인의 특허 출원수나 특허 등록수를 기준으로 다출원/다권리 주체와 소출원/소권리 주체로 구분할 수 있다. 기업이나, 기관, 대학의 구분은 출원인/권리자 명(name = organization name + organization type)에 포함된 조직 유형 정보로 파악할 수 있다. 예를 들어, name으로 Samsung Electronics Co. Ltd.가 있는 경우, Samsung Elelctronics은 organization name이 되고, Co. Ltd.는 organization type이 된다. 권리자 종류 비율은 기업 권리자/전체 권리자,(기업+개인)/(기관+대학) 등이 될 수 있다. Self는 개별 특허의 권리자와 피인용 특허의 권리자가 동일한 경우(소위 Self Citation)라는 것이며, Non-Self는 특정 특허의 권리자와 피인용 특허의 권리자가 다른 경우만 고려한다는 것을 말한다. Table 1 will be described. The right holders are classified into corporations, institutions, universities, and individuals, which in turn can be divided into domestic and foreign countries, and corporations can be divided into large and small businesses, and corporations and individuals are private entities, and institutions and universities are public entities. It can be divided roughly, and it can be divided into a multi-application subject and a small-application subject. The Maintenance Fee information issued by the US Patent and Trademark Office contains information on whether a right entity is a large entity, and in other countries, a multi-application / multiple rights entity and a small-application / sub-rights entity based on the number of patent applications or patent registrations of the right holder / applicant. Can be divided into The division of corporations, institutions, and universities can be grasped by organization type information included in the name of the applicant / authority (name = organization name + organization type). For example, name Samsung Electronics Co. Ltd., Samsung Elelctronics becomes the organization name and Co. Ltd. becomes an organization type. The ratio of the right holder type may be a company owner / total owner, (corporate + individual) / (institution + university), etc. Self refers to the case where the owner of an individual patent and the owner of a cited patent are the same (so-called Self Citation), while Non-Self means only considering cases where the owner of a particular patent differs from the owner of the cited patent.

한편, 표기 항목의 수가 너무 많기 때문에 도입하는 "A1, B1, D1, E1"나 A1~D1와 같은 이러한 약식 표기에 대해 설명한다. 피인용 특허의 권리자를 기준으로 한 A1, B1, D1, E1라는 것에서 A1는 피인용 특허의 권리자 총개수, B1는 최근 n년간의 피인용 특허의 권리자 총개수, D1라는 것은 피인용 특허의 권리자 중에서 개별 특허의 권리자와 다른 권리자의 총개수, E1라는 것은 개별 특허의 청구항 수로 A1, B1, D1값을 나눈 값을 말한다. 이와 같이 약식 표기는 "요소부분 + 참조 부분의 범위"로 구성된다. 위 예에서 요소 부분은 "피인용 특허의 권리자"가 되고, 범위 부분은 피인용 특허의 권리자를 기준으로 한 분쟁 예측 요소 후보를 구성할 참조 부분의 범위가 된다.On the other hand, such abbreviated notation as "A1, B1, D1, E1" and A1-D1 introduced because there are too many notation items are demonstrated. In terms of A1, B1, D1, and E1 based on the rights holder of the citation patent, A1 is the total number of right holders of the citation patent, B1 is the total number of right holders of the citation patent for the last n years, and D1 is the right holder of the citation patent. Among them, the total number of rights holders of individual patents and other rights holders, E1, refers to a value obtained by dividing A1, B1, and D1 values by the number of claims of an individual patent. Thus, the shorthand notation consists of "element part + range of reference part". In the above example, the element portion becomes the "owner of the cited patent," and the scope portion is the range of the reference portion that will constitute a candidate for dispute prediction element based on the assignee of the cited patent.

한편, 특허 분류에 대해서는 특허 분류 종류(IPC, USPC, FT, FI, ECLA)별, 특허 분류 계층 구조상에서의 레벨(예를 들면, IPC의 경우, subclass나 main group 등)별, 주특허분류와 부특허분류의 사용 범위별 중 어느 하나 또는 2 이상의 조합별로 분쟁 예측 요소 후보값을 생성할 수 있을 것이다. 예를 들면, 하나의 특허에 USPC와 IPC가 모두 들어 있는 경우, 주 특허 분류만 사용하고, USPC는 class 단위에서 IPC는 subclass 단위 및 main group 단위에서와 같이 총 3개 종류에 대해서 분쟁 예측 요소 후보값을 생성할 수 있을 것이다. 분쟁 예측 요소 후보나 분쟁 예측 요소에 관한 한 특허 분류의 처리에 대해서는 이하 동일하게 적용된다.On the other hand, the classification of patents is based on the types of patent classifications (IPC, USPC, FT, FI, ECLA), the level of the patent classification hierarchy (for example, subclass or main group in the case of IPC), main patent classification and The dispute prediction element candidate value may be generated for each one or a combination of two or more of the scope of use of the subpatent classification. For example, if a patent contains both USPC and IPC, only the main patent classification is used, and USPC uses only the main patent classification, and IPC is the class prediction unit candidate for a total of three categories, such as in the subclass unit and the main group unit. You will be able to generate a value. The same applies to the processing of patent classification as far as dispute prediction element candidates and dispute prediction elements are concerned.

본 발명에서 인용은 크게 4가지 종류가 있다. 첫째는 직접 인용(direct citation)으로 인용 깊이(depth) 1인 인용에 해당한다. 특허 P1의 레퍼런스(인용) 정보에 특허 P2가 있는 경우, P2는 P1의 부모(parent)가 되고, P1은 P2의 자식(child)가 된다. 이 경우, P2는 P1의 직접 인용, P1은 P2의 직접 피인용이 된다. 둘째는 간접 인용(indirect citation)으로 인용 depth 2 이상의 인용에 해당한다. P2의 레퍼런스에 P3가 있는 경우, P3는 P1의 인용 깊이 2의 간접 인용이 된다. 셋째는 잠재 인용(latent citation)이다. 넷째는 사슬 인용(chain citation)이다. 모든 종류의 인용은 후방(forward)와 전방(backward)로 나뉘어 진다. 상기 표 1과 같은 인용 관점의 다양한 분쟁 예측 요소 내지 분쟁 예측 요소 후보에 대한 분쟁 예측 요소값 내지는 분쟁 예측 요소 후보값은 상기 4가지 인용 종류별(상기 4가지 인용 종류별의 하위 종류별(예를 들어, 제1종 잠재 인용, 제2종 잠재 인용, 제1종 사슬 인용, 제2종 사슬 인용 등)을 포함할 수 있다. 이하, 같다.)로 별도로 생성되거나, 상기 4가지 인용 종류 중 어느 하나 이상을 단순 합산하거나, 상기 4가지 인용 종류에 따라 기 설정된 다른 가중치를 부여한 채 합산하는 방법으로 생성될 수 있을 것이다. 물론, 상기 분쟁 예측 요소값 내지 분쟁 예측 요소 후보값은 전방과 후방을 분리하여 생성되는 것이 타당할 것이다.There are four types of citations in the present invention. The first is direct citation, which corresponds to a citation of depth one. When patent P2 is included in the reference (quotation) information of patent P1, P2 becomes a parent of P1, and P1 becomes a child of P2. In this case, P2 is a direct citation of P1 and P1 is a direct citation of P2. The second is indirect citation, which corresponds to citation depth 2 or higher. If there is P3 in the reference of P2, P3 is an indirect citation of the citation depth 2 of P1. Third is latent citation. Fourth is chain citation. All kinds of citations are divided into forward and backward. The dispute prediction element values or the dispute prediction element candidate values for the various dispute prediction elements to the dispute prediction element candidates of the citation viewpoints as shown in Table 1 are classified by the four types of citations (by subtypes according to the four types of citations (for example, 1 kind of late citation, 2 kind of late citation, 1 kind of chain citation, 2 kind chain citation, etc.) It may be generated by a simple summing or adding with different weights set according to the four types of citations. Of course, it may be reasonable that the dispute prediction factor value or the dispute prediction factor candidate value is generated by separating the front and the rear.

또한, 자사 특허를 자사 또는 자사와 기 설정된 밀접 관련(예, 계열사 관계 등)을 맺고 있는 출원인이 인용하는 경우를 협의의 자기 인용(self citation)이라 한다. 자기 인용은 자신과 레퍼런스가 기 설정된 공통 분모를 가지는 경우로, 상기 공통 분모에는 상기의 출원인 공통, 발명자 공통, 특허 분류 공통 등이 있을 수 있다.In addition, self-citation of consultation refers to the case in which a patent is cited by an applicant who has a pre-established close relationship (eg, affiliated relationship, etc.) with the company or the company. Self-citation is a case in which the self and the reference have a predetermined common denominator, and the common denominator may include the applicant common, the inventor common, and the common patent classification.

상기 인용 특허 집합 생성부(5121)는 자기 특허 SSi에 대하여 적어도 하나 이상의 직접 인용 특허를 포함하는 직접 인용 특허 집합, 적어도 하나 이상의 간접 인용 특허를 포함하는 간접 인용 특허 집합, 적어도 하나 이상의 잠재 인용 특허를 포함하는 잠재 인용 특허 집합, 적어도 하나 이상의 사슬 인용 특허를 포함하는 사슬 인용 특허 집합 중 어느 하나 이상을 생성한다.The cited patent set generation unit 5121 may be configured to directly generate a cited patent set including at least one direct cited patent, an indirect cited patent set including at least one indirect cited patent, and at least one potential cited patent for the own patent SSi. Generating at least one of a potential cited patent set comprising, and a chain cited patent set comprising at least one chain cited patent.

본 발명의 직접 인용과 관련된 레퍼런스 정보는 1) Pi의 출원인이나 권리자가 부여한 것(출원인 인용), 2) 심사관이 부여한 것(심사관 인용), 3) 선행 기술 조사를 통해서 부여 된 것(선행 기술 조사 인용), 4) 심사관의 Pi 심사 시 인용 참증으로 부여된 것(참증 인용) 중 어느 하나 이상이 포함된다. 상기 인용 특허 집합 생성부(5121)의 인용 특허 집합의 생성은 이와 같이 간접 인용 정보, 잠재 인용 정보 및 사슬 인용 정보도 상기 1) 내지 4) 종류 중 어느 하나 이상이 고려되어 생성된다.Reference information related to the direct citation of the present invention is 1) given by the applicant or owner of Pi (applicant cited), 2) given by the examiner (cited by the examiner), 3) given through prior art research (prior art research) Citations), 4) any one or more of the citations cited in the examiner's Pi review (consideration citations). In the citation patent set generation unit 5121, the citation patent set generation is generated by considering any one or more types of indirect citation information, latent citation information, and chain citation information.

이어, 도 11내지 도 20을 참조하면서 상기 입수된 자기 특허 집합, 인용 특허 집합, 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합에 대해서 설명한다.Subsequently, the self-patent set, cited patent set, forward cited patent set, back cited patent set, forward cited patent set, back self cited patent set, back cited patent set, and cited generation obtained patent sets are obtained with reference to FIGS. 11 to 20. Explain.

도 11에서 I1 내지 I6 특허는 입수 특허 집합을 구성한다. I1은 P1을 인용하고, I2는 P2를 인용하며, I3는 P3와 P4를 인용하며, I4는 P5와 입수 특허 집합에 속하는 I4를 인용하며, I 4는 I 5를 인용하며, I5는 I6을 인용하고 있다. 한편, I1은 C1에 의해 인용되고 있으며, I2는 C2 내지 C4에 의해 인용되고 있다. 이때, 인용 특허 집합 I1 내지 I6에 대하여, P1 내지 P5 및 I5, I6이 전방 인용 특허 집합이 되며, C1 내지 C4 및 I4와 I5가 후방 인용 특허 집합이 된다. 도 12에서 알 수 있듯이, 인용 특허 집합이 I1 내지 I4로 한정이 되는 경우, 상기 전방 인용 특허 집합은 P1 내지 P5 및 I5가 돈다. 한편, 도 13에서 알 수 있듯이, 인용 특허 집합이 I1 내지 I3로 한정이 되는 경우, 상기 후방 인용 특허 집합은 C1 내지 P4가 된다.In FIG. 11, the I1 to I6 patents constitute a set of obtained patents. I1 cites P1, I2 cites P2, I3 cites P3 and P4, I4 cites P4 and I4 belonging to the patent set, I 4 cites I 5, I5 cites I6 Quoting. On the other hand, I1 is cited by C1, and I2 is cited by C2 to C4. At this time, with respect to the cited patent sets I1 to I6, P1 to P5 and I5 and I6 become forward cited patent sets, and C1 to C4 and I4 and I5 become back cited patent sets. As can be seen from Fig. 12, when the cited patent set is limited to I1 to I4, the forward cited patent set is P1 to P5 and I5. On the other hand, as can be seen in Figure 13, when the cited patent set is limited to I1 to I3, the back cited patent set is C1 to P4.

도 14는 자기 인용 다이어그램의 개념을 보여 주고 있다. I4 내지 I6은 입수 특허 집합에 속하며, I4의 입장에서는 I5가 전방 인용이며, I6은 깊이 2의 전방 인용이며, I5의 입장에서는 I6이 전방 인용이고, I4가 후방 인용이며, I6의 입장에서는 I5가 후방 인용이며, I4가 깊이 2의 후방 인용이 된다. 따라서, I1 내지 I4로 입수 특허 집합이 한정되는 경우, I5가 깊이 1의 전방 자기 인용 특허 집합이 되며, I6은 깊이 2의 전방 자기 인용 특허 집합이 된다. 전방 자기 인용 특허 집합은 상기 I1 내지 I6이 동일한 권리자일 경우, 전방 자기 인용 특허 집합을 구성하는 I5 및/또는 I6은 동일한 권리자의 특허이므로 I5 및/또는 I6과 I4의 관계는 유사한 기술일 가능성이 높으며, I4는 I5 및/또는 I6을 개선하거나 개량한 기술일 가능성이 높으므로, 특정한 권리자의 기술 개발 동향(지속성, 집중 분야) 등을 알 수 있게 된다. 이는 후방 자기 인용 특허 집합의 경우도 마찬가지이다. 즉, 인용 특허 집합에 포함되는 특허에 대하여, 전방 자기 인용 특허 집합과 후방 자기 인용 특허 집합의 관계에 있는 특허들은 유관성이 강하게 되며, 그 권리자의 특허 포트폴리오에 있어서, 연속성, 개량성, 확장성과 포괄성 등을 표상하게 된다. 이는 상기 인용 특허 집합을 권리자의 관점에서 본 것이며, 발명자 단위로 인용 특허 집합을 한정하건, 특허 분류(기술 분야를 대변한다.)별로 인용 특허 집합을 한정하여 볼 경우도 마찬가지일 것이다. 즉, 인용 특허 집합에 있는 특정 발명자의 특허와 인용 발생 입수 특허 집합과 후방 자기 인용 특허 집합의 관계를 가지는 특허 중 동일 발명자의 특허들은 그 발명자를 기준으로 볼 때, 기술 개발이나 연구의 연속성 등을 표상할 수 있다. 이러한 결과를 볼 수 있는 것이 본 발명의 집단 단위 인용 분석의 특유한 효과가 된다.14 illustrates the concept of a self citation diagram. I4 to I6 belong to the set of obtained patents, I5 is forward citation at I4, I6 is forward citation at depth 2, I6 is forward citation, I4 is backward citation, I5 is I5 Is the back citation and I4 is the back citation of depth 2. Therefore, when the obtained patent set is limited to I1 to I4, I5 becomes the forward self citation patent set of depth 1, and I6 becomes the forward self citation patent set of depth 2. In the forward self-cited patent set, if I1 to I6 are the same rights holder, the relationship between I5 and / or I6 and I4 is likely to be a similar technology since I5 and / or I6 constituting the forward self-cited patent set are patents of the same right owner. It is high, and I4 is likely to be a technology that improves or improves I5 and / or I6, so that it is possible to know the trend of technology development (continuity, concentration) of a specific owner. This is also the case for the back self-cited patent set. That is, for the patents included in the cited patent set, the patents related to the forward self cited patent set and the back self cited patent set become strongly related, and in the owner's patent portfolio, continuity, improvement, extensibility and comprehensiveness. Representation of the last name, etc. This is the case in which the cited patent set is viewed from the owner's point of view, and the cited patent set is defined by the inventor unit or the cited patent set by the patent classification (represents the technical field). That is, the patents of the same inventor among the patents of a specific inventor in the cited patent set, the patent generation of the citation generation patent set and the back self-cited patent set represent the continuity of technology development or research, based on the inventors. can do. Viewing these results is a unique effect of the collective unit citation analysis of the present invention.

이어, 인용 발생 입수 특허 집합에 대해서 설명한다. 인용 발생 입수 특허 집합은 입수 특허 집합을 구성하는 특허 중 적어도 1회 이상의 인용을 받은 특허군으로 이루어진다. 상기 인용을 인용 특허 집합을 구성하는 특허 중에서 I1, I2, I4, I5이 인용 발생 입수 특허 집합을 구성한다. 한편, 상기 인용 발생 입수 특허 집합 중에서 특별한 한정 중의 하나인 인용 발생 입수 특허 집합을 구성하는 특허 중 입수 특허 집합의 구성하는 특허를 제외하는 한정을 수행하면, 도 15와 같은 한정된 인용 발생 입수 특허 집합이 생성되게 된다. 만약 I1 내지 I6의 권리자가 동일하고, C1 내지 C4의 출원인이 I1 내지 I6의 권리자와 다르다면, I2는 타 권리자에 의해 가장 많은 인용을 받은 특허가 되어, 인용 발생 입수 특허 집합 중에서도 중요한 특허가 될 가능성이 높으며, 이러한 중요한 특허를 발명한 자도 중요 발명자가 될 가능성이 높다.Next, a citation generation acquisition patent set is demonstrated. The cited generation obtained patent set is composed of a patent group that receives at least one citation of the patents constituting the obtained patent set. Of the patents constituting the cited patent set, I1, I2, I4, and I5 constitute the cited generation obtained patent set. On the other hand, if the restriction to exclude the patent constituting the acquisition patent set of the patents constituting the citation generation acquisition patent set, which is one of the special limitations among the citation generation acquisition patent set, the limited citation generation acquisition patent set as shown in FIG. Will be created. If the right holders of I1 to I6 are the same, and the applicants of C1 to C4 are different from the right holders of I1 to I6, I2 becomes the most cited patent by the other right holders, and becomes an important patent among the cited generation obtained patent sets. There is a high possibility, and the inventor of such an important patent is also likely to become an important inventor.

이어, 도 16 내지 도 20을 참조하면서 상기 입수된 자기 특허 집합, 인용 특허 집합, 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 및 인용 발생 입수 특허 집합에 대해서 설명한다. 도 16에서 IS(Input Set)는 입수 특허 집합을 의미하며, PIS(Parent of Input Set)은 전방 인용 특허 집합을, CIS(Child of Input Set)은 후방 인용 특허 집합을 의미한다. 도 17에서 CPIS(Cross-over Parent of Input Set)은 전방 자기 인용 특허 집합을, CCIS(Cross-over Child of Input Set)은 후방 자기 인용 특허 집합을 의미한다. CPIS(Cross-over Parent of Input Set)와 CCIS(Cross-over Child of Input Set)에 속하는 특허들은 IS에도 속한다. 도 18은 COIS(Citation Occurred IS)으로 인용 발생 입수 특허 집합을 의미한다. IS에 속하는 모든 특허들에 대한 후방 인용 특허 집합과 COIS에 속하는 모든 특허들에 대한 후방 인용 특허 집합은 동일하다는 특징이 있다.Subsequently, the self-patent set, cited patent set, forward cited patent set, back cited patent set, forward self cited patent set, back self cited patent set, and citation generation obtained patent set are obtained with reference to FIGS. 16 to 20. Explain. In FIG. 16, an input set (IS) means a patent set obtained, a parent of input set (PIS) refers to a forward cited patent set, and a child of input set (CIS) refers to a back cited patent set. In FIG. 17, a cross-over parent of input set (CPIS) refers to a front self-cited patent set, and a cross-over child of input set (CCIS) refers to a rear self-cited patent set. Patents belonging to Cross-over Parent of Input Set (CPIS) and Cross-over Child of Input Set (CCIS) also belong to IS. 18 refers to a citation occurrence acquisition patent set as COIS (Citation Occurred IS). The feature of the backward cited patent set for all patents belonging to the IS and the back cited patent set for all patents belonging to the COIS is the same.

도 19 내지 도 20은 한정이 발생하는 경우를 예시하고 있다. 이러한 한정은 직접 인용뿐만 아니라, 간접 인용, 잠재 인용 또는 사슬 인용을 통하여 생성되는 인용 특허 집합에 대해서도 동등하게 적용될 수 있다. 입수 특허 집합 IS에 특정한 한정이 가해 지면 SIS(Specified Input Set)이 생성되게 된다. 상기 SIS에 대한 한정된 전방 인용 특허 집합인 PSIS(Parent of Specified Input Set)과 한정된 후방 인용 특허 집합인 CSIS(Child of Specified Input Set)이 생성되게 된다. 도 20은 입수 특허 집합 IS에 대하여 전방 인용 특허 집합과 후방 인용 특허 집합이 생성되었을 때, 상기 전방 인용 특허 집합과 후방 인용 특허 집합에 대하여 기 설정된 한정을 수행했을 경우, 생성되는 한정된 전방 인용 특허 집합의 일종인 SPIS(Specified Parent of Input Set-전방 인용 특허 집합이 먼저 생성된 후, 그 전방 인용 특허 집합에 한정(specification)을 가함)과 후방 인용 특허 집합의 일종인 SCIS(Specified Child of Input Set)가 생성되게 된다. 19-20 illustrate the case where limitations occur. This limitation is equally applicable to the cited patent sets generated through indirect citations, latent citations or chain citations, as well as direct citations. When a specific restriction is placed on the acquired patent set IS, a SIS (Specified Input Set) is generated. A PSIS (Parent of Specified Input Set), a limited forward cited patent set for the SIS, and a CSIS (Child of Specified Input Set), a limited back cited patent set, are generated. 20 is a type of a limited forward citation patent set generated when the forward citation patent set and the backward citation patent set are generated for the acquired patent set IS, when preset limitations are performed on the forward citation patent set and the backward citation patent set. SPIS (Specified Parent of Input Set) is created first, and then the specification is applied to the forward cited patent set and SCIS (Specified Child of Input Set), a kind of backward cited patent set. Will be.

한정은 본 발명의 옵션 선택부(4340)를 통해서 한정할 수 있으며, 도 7에는 옵션 선택부(4340)의 구성의 일 실시예를 보여 주고 있다. 상기 한정은 기간 한정, 국가 한정, 출원인 한정, 발명자 한정, 특허 분류 한정, 필드 속성 한정, 개별 필드값 한정, 개별 특허 속성값 한정, 기타 한정 등이 있으며, 이들 각각은 본 발명의 기간 한정부(4340-1), 국가 한정부(4340-2), 출원인 한정부(4340-3), 발명자 한정부(4340-4), 특허 분류 한정부(4341-5), 필드 속성 한정부(4341-6), 개별 필드값 한정부(4341-7), 개별 특허 속성값 한정부(2111-8), 기타 한정부(4341-9)가 수행한다. 기간 한정은 from ~ to 로 선택할 수 있으며, 기간의 기준은 출원일, 공개일, 등록일 등 여러 가지가 있을 수 있다. 국가의 한정은 우선권 주장 출원에 나와 있는 국가일 수 있으며(권리자의 국적) 발명자의 국적(발명자 주소 정보에 나옴)일 수도 있으며, 상태는 공개나 등록, 무효, 소멸 등 특허의 각종 현재 상황을 나타낸다. 특허 분류의 경우에는 IPC, USPC, FT, FI, ECLA에 대한 특허 분류 종류의 선택과, 메인 특허 분류인지 서브 특허 분류 인지의 종류의 선택과, 깊이(IPC의 경우 섹션, 클래스, 서브클래스, 메인그룹, 1 dot 서브 그룹, n dot 서브 그룹 등)의 선택 등이 있을 수 있다.한편, 수치값의 선택으로는 공동 발명자수, 공동 출원인 수, 피인용수, 특허 분류 수, 특허 등급, 특허 점수 등과 같은 개별 특허에 대하여 측정하거나, 계산되거나, 입수되는 필드별 수치값에 대한 선택이 있을 수 있다.Limitation may be limited through the option selector 4340 of the present invention, and FIG. 7 illustrates an embodiment of the configuration of the option selector 4340. The above limitations include time limitations, country limitations, applicant limitations, inventor limitations, patent classification limitations, field attribute limitations, individual field value limitations, individual patent attribute value limitations, and other limitations. 4340-1), country limiter 4340-2, applicant limiter 4340-3, inventor limiter 4340-4, patent classification limiter 4431-5, field attribute limiter 4431-6 ), The individual field value limiting unit 4341-7, the individual patent attribute value limiting unit 2111-8, and the other limiting unit 4341-9. The time limit can be selected from to to, and there may be various criteria for the period, such as an application date, a publication date, and a registration date. A country's limitation may be the country listed in the priority claim (the nationality of the rights holder) or the nationality of the inventor (in the inventor's address information), and the status represents various current status of the patent, including disclosure, registration, invalidation, and extinction. . In the case of patent classification, the selection of the patent classification type for IPC, USPC, FT, FI, ECLA, the selection of whether it is the main patent classification or the sub patent classification, and the depth (in the case of IPC, section, class, subclass, main) Group, 1 dot subgroup, n dot subgroup, etc.) On the other hand, the selection of numerical values includes the number of co-inventors, co-applicants, citations, patent classifications, patent grades, patent scores There may be a choice of numerical values for each field to be measured, calculated or obtained for individual patents such as.

한정을 수행하는 방법은 기간 한정, 국가 한정, 권리자 한정, 발명자 한정, 특허 분류 한정 등이 있을 수 있으며, 이는 각각 기간 한정부(4340-1), 국가 한정부(4340-2), 출원인 한정부(4340-3), 발명자 한정부(4340-4) 및 특허 분류 한정부(4341-5)가 수행한다. 출원인 등을 한정할 때, 입수한 특허 집합에 포함된 권리자의 리스트를 보여 주고, 상기 특허 정보 시스템(10000)을 사용하는 사용자가 적어도 하나 이상의 권리자를 선택할 수 있게 한다. 이는 발명자 한정과 특허 분류 한정에서도 마찬가지이다.The method of performing the limitation may include a period limitation, a country limitation, a rights holder limitation, an inventor limitation, a patent classification limitation, and the like, which is a time limitation portion 4340-1, a country limitation portion 4340-2, and an applicant limitation portion. 4340-3, inventor limiter 4340-4, and patent classification limiter 4431-5. When limiting the applicant or the like, a list of rights holders included in the acquired patent set is shown, and a user using the patent information system 10000 can select at least one rights holder. The same holds true for the inventor limitation and the patent classification limitation.

이어, 특수한 한정을 소개한다. 먼저 필드 속성 한정이다. 예를 들어 특정한 권리자가 특허 괴물로 분류되거나, 경쟁사로 분류되는 경우, 특허 괴물이나 경쟁사 등이 특별한 필드 속성이 된다. 이는 특히 특허 괴물로 분류되는 권리자의 특허를 다수 인용하는 경우, 특허 분쟁이나 라이센스 요구 리스크가 커지게 되기 때문에 중요하다. 권리자는 출원인과 같은 필드이며, 출원인 필드 속성에는 특허 괴물이나 경쟁사(이러한 특허 괴물이나 경쟁사는 특정한 사용자별로 다를 수 있으며, 사용자별로 지정이 가능함은 물론이다 할 것이다.)뿐만 아니라, 권리자 속성(대학, 연구 기관, 기업)이나, 권리자 규모(대기업, 중소 기업), 권리자 속성(다출원 기업, 핵심 특허 다수 보유 기업, 특허 품질 지수가 높은 기업 등)과 같은 다양한 속성이 있을 수 있다. 발명자 등도 권리자와 마찬가지로 다양한 속성이 부여 될 수 있음은 당연할 것이다. 이러한 필드 속성의 한정은 본 발명의 필드 속성 한정부(4341-6)가 수행한다. 한편, 필드 속성이 값으로 존재할 때, 이러한 한정은 본 발명은 개별 필드값 한정부(4341-7)가 수행한다. 한편, 특정 특허 집합에 대하여 공동 출원인수 2주체 이상, 공동 발명자수 3인 이상, 특허 분류 개수 2개 이상, 피인용수 5회 이상 등과 같이 개별 필드의 필드값별로 필드값을 한정하여, 한정된 조건을 만족하는 특허군 만을 추출할 수도 있을 것이다. 이러한 한정은 개별 특허 속성값 한정부(4341-8) 가 수행한다. 한편, 기타 한정이 있는 경우 이러한 한정은 본 발명은 기타 한정부(4341-9)가 수행한다. 상기 한정 등은 포괄적으로 수행하는 것은 옵션 선택부(4340)의 기능이다. 상기 한정은 입수 특허 집합과 인용 특허 집합 모두에 대하여 수행할 수 있고, 상기 입수된 자기 특허 집합의 생성이 1차 입수 집합을 입수하고, 입수된 1차 입수 집합을 그대로 입수 특허 집합으로 처리하는 경우, 상기 1차 입수 집합에 대해서도, 상기 인용 특허 집합의 생성이 1차 대상 특허 집합을 생성하는 경우, 상기 1차 대상 특허 집합에 대해서도 한정될 수 있다. 이러한 한정이 수행되는 경우에 대해서는 도 19와 도 20에 잘 나타나 있다. 도 19에서는 입수 특허 집합에 추가적으로 한정하는 경우, 한정된 입수 특허 집합에 대해서 전방 인용 특허 집합도 한정되지 않은 입수 특허 집합에 대한 전방 인용 특허 집합보다 한정되며, 한정된 입수 특허 집합에 대해서 후방 인용 특허 집합도 한정되지 않은 입수 특허 집합에 대한 후방 인용 특허 집합보다 한정됨을 보여 주고 있다. 한편, 도 20은 입수 특허 집합에 대하여 전방 인용 특허 집합과 후방 인용 특허 집합이 생성될 때, 상기 전방 인용 특허 집합과 후방 인용 특허 집합에 한정을 가하여, 한정된 전방 인용 특허 집합과 후방 인용 특허 집합을 생성할 수 있음을 보여 준다. 특수한 한정으로 키워드 한정이 있다. 즉, 개별 특허들마다 핵심 키워드를 추출해 놓은 경우, 특정한 적어도 하나 이상의 키워드를 포함하거나 배제되거나 하는 등의 키워드 관련 조건으로 특허 집합을 한정할 수 있게 된다.Next, special limitations are introduced. The first is field attribute qualification. For example, when a specific owner is classified as a patent monster or a competitor, a patent monster or a competitor is a special field attribute. This is particularly important because there is a high risk of patent disputes or licensing requirements, especially when quoting a large number of patents of right holders classified as patent monsters. The right holder is the same field as the applicant, and the property field of the applicant is not only a patent monster or a competitor (the patent monster or competitor may be different for a specific user and may be designated by a user). There may be a variety of attributes, such as research institutes, firms), owner size (large and small firms), owner attributes (multiple firms, many key patent holders, and companies with high patent quality indices). It will be apparent that the inventor and the like can be given various attributes as well as the owner. This field attribute limitation is performed by the field attribute limiter 4431-6 of the present invention. On the other hand, when the field attribute is present as a value, such limitation is performed by the individual field value limiter 4431-7 in the present invention. On the other hand, for a specific patent set, the field conditions are limited by field values of individual fields such as two or more co-applicants, three or more co-inventors, two or more patent classifications, five or more citations, and so on. Only the patent group that satisfies may be extracted. This restriction is performed by the individual patent attribute value limiter 4431-8. On the other hand, when there are other limitations such limitations are performed by the other limiting portion (4341-9) the present invention. It is a function of the option selector 4340 to perform the above limitations and the like comprehensively. The above limitation may be performed for both the acquired patent set and the cited patent set, and when the generation of the obtained self-patent set obtains the first obtained set and processes the obtained first obtained set as the acquired patent set, Also for the primary acquisition set, when the generation of the cited patent set generates the primary target patent set, the primary target patent set may be limited. The case where this limitation is performed is well illustrated in FIGS. 19 and 20. In FIG. 19, in the case of further limiting to the acquired patent set, the forward cited patent set is also limited than the forward cited patent set for the unrestricted acquired patent set for the limited obtained patent set, and the back cited patent set for the limited acquired patent set. It is shown that it is more limited than the back cited patent set for the unrestricted acquired patent set. Meanwhile, FIG. 20 illustrates that when the forward cited patent set and the back cited patent set are generated with respect to the obtained patent set, the forward cited patent set and the back cited patent set may be limited to generate a limited forward cited patent set and the back cited patent set. Show that there is. Special limitations include keyword limitations. That is, when core keywords are extracted for individual patents, a patent set may be limited to keyword-related conditions such as including or excluding at least one specific keyword.

이어, 도 21 내지 도 22를 참조하면서 잠재 인용을 더욱 더 상세하게 설명한다. 상기 잠재 인용 특허에는 제1 종 잠재 인용 특허와 제2 종 잠재 인용 특허가 있다.Subsequently, the potential citation will be described in more detail with reference to FIGS. 21 to 22. The latent cited patent includes a first latent cited patent and a second latent cited patent.

제1 종 잠재 인용 특허는 자기 특허(SSi)과 공통 부모 특허(예를 들면, P(SSi), SSi의 직접 인용 특허)를 가지는 특허 중에서, 자기 특허와 직접 인용 관계를 가지지 않으면서, 자기 특허의 기 설정된 기준일에 상기 인용 특허가 공개되어 입수 가능성이 있거나, 상기 인용 특허의 기 설성된 기준일에 자기 특허가 공개되어 입수 가능성이 있었던 특허를 말한다. 제1 종 잠재 인용은 도 21에서 LCP1과 LCP4와 같은 특허들이다. 제1 종 잠재 인용 특허 중 전방 제1 종 잠재 인용 특허(LCP1)를 찾기 위해서는 상기 자기 특허가 직접 인용하는 특허 집합(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허들도 그 대상이 될 수도 있다.)을 인용하는 특허 중 그 공개일이 자기 특허의 출원일보다 앞선 것을 찾으면 된다. 제1 종 잠재 인용 특허 중 후방 제1 종 잠재 인용 특허(LCP4)를 찾기 위해서는 상기 자기 특허가 직접 인용하는 특허 집합(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허들도 그 대상이 될 수도 있다.)을 인용하는 특허 중 자기 특허의 공개일 이후에 출원된 특허를 찾으면 된다.The first type of potential cited patent is a patent owned by self patent (SSi) and a common parent patent (for example, P (SSi), a direct cited patent of SSi), without having a direct citation relationship with self patent Refers to a patent in which the cited patent is disclosed and available on a predetermined reference date, or the self-patent is disclosed and available on an established reference date of the cited patent. First class latent citations are patents such as LCP1 and LCP4 in FIG. 21. In order to find the first class latent cited patent (LCP1) among the first type latent cited patents, the patent set directly cited by the self-patent (direct citation with citation depth 1 as well as indirect citation patents with citation depth 2) is the target. Patents cited therein may be found earlier than their filing date. In order to find the late class 1 latent cited patent (LCP4) among the first type of late cited patents, the patent set directly cited by the self-patent (direct citation with citation depth 1 as well as indirect cited patents with citation depth 2) is also included. See patents cited after the publication date of their patents.

제2 종 잠재 인용 특허는 자기 특허(SSi)과 공통 부모 특허(예를 들면, P(SSi), SSi의 직접 인용 특허)를 가지는 특허 중에서, 자기 특허와 직접 인용 관계를 가지지 않으면서, 자기 특허의 기 설정된 기준일에 상기 인용 특허가 공개되지 않아 입수 가능성이 없었거나, 상기 인용 특허의 기 설성된 기준일에 자기 특허가 공개되지 않아 입수 가능성이 없었던 특허를 말한다. 제 2종 잠재 인용 특허는 도 22에서 LCP2와 LCP3와 같은 특허들이다. 제2 종 잠재 인용 특허 중 전방 제2 종 잠재 인용 특허(LCP2)를 찾기 위해서는 상기 자기 특허가 직접 인용하는 특허 집합(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허들도 그 대상이 될 수도 있다.)을 인용하는 특허 중 출원일은 자기 특허의 출원일보다 빠르며, 공개일은 자기 특허의 출원일보다 늦은 것을 찾으면 된다. 제2 종 잠재 인용 특허 중 후방 제2 종 잠재 인용 특허(LCP3)를 찾기 위해서는 상기 자기 특허가 직접 인용하는 특허 집합(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허들도 그 대상이 될 수도 있다.)을 인용하는 특허 중 출원일이 자기 특허의 출원일보다 늦으며, 자기 특허의 공개일일보다 앞선 것을 찾으면 된다.The second type of potential cited patents are self-patented patents that do not have a direct citation relationship with their own patents among patents having self patents (SSi) and common parent patents (for example, P (SSi), direct cited patents of SSi). The patent does not have a possibility of obtaining because the cited patent is not disclosed on a predetermined reference date, or refers to a patent that could not be obtained because its own patent was not disclosed on the established reference date of the cited patent. The second type of potential cited patents are patents such as LCP2 and LCP3 in FIG. 22. In order to find a forward second class latent cited patent (LCP2) among the second latent citation patents, the patent set directly cited by the self-patent (direct citation with citation depth 1 as well as indirect citation patents with citation depth 2) is the target. For patents cited), the filing date is earlier than the filing date of the own patent, and the publication date is later than the filing date of the own patent. In order to find the late class 2 latent cited patent (LCP3) among the second type of late cited patents, the patent set directly cited by the self-patent (direct citation with citation depth 1 as well as indirect cited patents with citation depth 2) is the target. For patents cited), the application date is later than the filing date of the patent and before the publication date of the patent.

상기 기 설정된 기준일은 출원일로 처리하는 것이 바람직할 것이다. 엄격하게 말하면 상기 기준일은 자기 특허의 레퍼런스별로 레퍼런스를 입력하는 날짜(예를 들면, 심사관이 레퍼런스를 입력하는 경우, 그 입력일, 선행 기술 조사 문건의 입력의 경우에는 그 선행 조사 문건 조사일 등이 그 예가 될 것이다. 한편, IDS 제도가 있는 미국의 경우, 레퍼런스마다 제출일 등이 다를 수가 있다.)가 타당하나, 이는 사실상 data적으로 알기 어려운 문제가 있다. 한편, 출원일을 기준일로 할 경우 PCT 국제출원을 통하여 국내 단계로 진입하는 경우 국제출원일이 통상적인 출원일이 되나, 상기 기준일은 국제 출원의 경우에는 국내 단계에서의 출원일(filing date)으로 처리하는 것이 더욱 더 타당할 것이다.It is preferable that the predetermined reference date be treated as an application date. Strictly speaking, the reference date is a date for inputting a reference for each patent reference (e.g., when the examiner inputs a reference, the input date when the examiner inputs a reference, or when the prior art research document is input, the date of the prior research document investigation, etc.). On the other hand, in the United States, where there is an IDS scheme, the submission date may vary for each reference). On the other hand, if the filing date is the reference date, the international filing date becomes the usual filing date when entering the domestic phase through the PCT international application, but the filing date is more likely to be treated as the filing date at the domestic stage in the case of international filing. Would be more reasonable.

이어, 도 23 내지 도 24를 참조하면서 사슬 인용 특허를 더욱 더 상세하게 설명한다. 상기 사슬 인용 특허에는 제1 종 사슬 인용 특허와 제2종 사슬 인용 특허가 있다.Next, the chain citation patent will be described in more detail with reference to FIGS. 23 to 24. The chain citation patent includes a first kind chain citation patent and a second kind chain citation patent.

상기 제1 종 사슬 인용 특허는 자기 특허(SSi)가 직접 인용하는 특허들인 직접 인용 특허(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허 또는 잠재 인용 특허들도 그 대상이 될 수도 있다.) 중에서, 상기 직접 인용 특허가 인용하는 특허와 상기 자기 특허가 공통 부모 특허를 가지는 경우가 있는 특허거나(예를 들면, WCP1), 자기 특허(SSi)를 직접 인용하는 특허들인 직접 인용 특허 중에서, 상기 직접 인용 특허가 인용하는 특허와 상기 자기 특허가 공통 부모 특허를 가지는 경우가 있는 특허(예를 들면, WCP4)를 말한다. 상기 제 1종 사슬 인용 특허는 도 23에 잘 도시되어 있다. The first type chain cited patents may be subject to direct cited patents (direct citation with citation depth 1 as well as indirect cited patents or late cited patents with cited depth 2) which are patents directly cited by SSi. Among the patents cited by the direct-cited patent and the self-patent patent may have a common parent patent (for example, WCP1), or a direct-cited patent which is a patent citing self patent (SSi) directly. And a patent (eg, WCP4) in which the patent cited by the direct-cited patent and the self-patent may have a common parent patent. The first chain citation patent is well illustrated in FIG.

상기 제2 종 사슬 인용 특허는 자기 특허(SSi)가 직접 인용하는 특허(예를 들면, WCP2)(인용 깊이 1인 직접 인용뿐만 아니라 인용 깊이 2인 간접 인용 특허 또는 잠재 인용 특허들도 그 대상이 될 수도 있다.)의 적어도 하나 이상 인용 깊이의 직접 인용 특허(예를 들면, WCP3)와 상기 자기 특허가 공통 부모 특허(예를 들면, P(SSi))를 가지는 경우에 있어서, 상기 자기 특허(SSi)가 직접 인용하는 특허(WCP2) 및 상기 자기 특허(SSi)가 직접 인용하는 특허(WCP2)의 적어도 하나 이상 인용 깊이의 직접 인용 특허(WCP3)를 말한다. 한편, 상기 제2 종 사슬 인용 특허에는 상기 자기 특허(SSi)를 직접 인용하는 특허(예를 들면, WCP5)의 적어도 하나 이상 인용 깊이의 직접 인용 특허(예를 들면, WCP6)와 상기 자기 특허가 공통 부모 특허(예를 들면, P(SSi))를 가지는 경우에 있어서, 상기 자기 특허(SSi)를 직접 인용하는 특허(WCP5) 및 상기 자기 특허(SSi)를 직접 인용하는 특허(WCP5)의 적어도 하나 이상 인용 깊이의 직접 인용 특허(WCP6)를 말한다.The second type chain cited patents include patents which are directly cited by SSi (for example, WCP2) (direct citation having a citation depth of 1 as well as indirect citing patents or potential citing patents having a citation depth of 2). At least one citation depth (e.g., WCP3) and the self-patent having a common parent patent (e.g., P (SSi)). Refers to a directly cited patent (WCP3) of at least one citation depth of a patent (WCP2) directly cited by SSi and a patent (WCP2) directly cited by the self patent (SSi). Meanwhile, the second type chain citation patent includes a direct citation patent (eg, WCP6) and at least one citation depth of at least one citation depth of a patent (eg, WCP5) that directly cites the self patent (SSi). In the case of having a common parent patent (for example, P (SSi)), at least a patent (WCP5) that directly cites the magnetic patent (SSi) and a patent (WCP5) that cites the magnetic patent (SSi) directly. Refers to a direct cited patent (WCP6) of one or more cited depths.

직접 인용 특허만으로 사슬 인용 특허를 찾기 위해서는 자기 특허가 직접 인용하는 특허들과 자기 특허로 사슬 인용 특허 집합을 구성한 다음, 상기 사슬 인용 특허 집합에서 인용-피인용 관계를 가지는 특허를 추출한다. 추출된 특허 중에서 자기 특허보다 출원일이 빠르면 전방, 자기 특허보다 출원일이 늦으면 후방이 된다. 한편, 추출된 특허 중에서, 자기 특허를 인용하거나, 자기 특허가 인용하는 관계가 되는 경우에는 제1종 사슬 인용 특허가 되고, 자기 특허와의 직접적인 인용 관계가 없으면 제 2종이 된다. 간접 인용 특허만으로 상기 사슬 인용 특허를 찾기 위해서는 자기 특허와 간접 인용 특허로 사슬 인용 특허 집합을 구성하여 동일한 정보 처리를 수행하면 된다. 한편, 잠재 인용 특허만으로 상기 사슬 인용 특허를 찾기 위해서는 자기 특허와 간접 인용 특허로 사슬 인용 특허 집합을 구성하여 동일한 정보 처리를 수행하면 된다. 한편, 자기 특허에 대하여, 직접 인용 특허, 간접 인용 특허, 및 잠재 인용 특허 중 어느 2 이상으로 사슬 인용 특허 집합을 구성한 다음에 동일한 정보 처리를 수행할 수도 있을 것이다.In order to find a chain citation patent using only the direct citation patent, a chain citation patent set is composed of the patents citationd directly by the own patent and the self patent, and then a patent having a citation-citation relationship is extracted from the chain citation patent set. Among the extracted patents, the filing date is earlier than the patent, and the filing date is later than the patent. On the other hand, among the extracted patents, when the self-patent is cited or when the self-patent becomes a citation, the first patent is a chain citation patent, and when there is no direct citation relationship with the own patent, the second patent is obtained. In order to find the chain citation patent using only the indirect citation patent, a chain citation patent set may be composed of self patents and indirect citation patents, and the same information processing may be performed. On the other hand, in order to find the chain citation patent with only the potential citation patent, a chain citation patent set may be composed of self patents and indirect citation patents, and the same information processing may be performed. On the other hand, the self-patent may comprise the chain citation patent set of any two or more of direct citation patents, indirect citation patents, and latent citation patents, and then perform the same information processing.

상기 인용 관련된 모든 특허에 있어서, 전방과 후방은 상기 자기 특허와 상기 인용 특허들 간의 출원일을 기준으로 설정하는 것이 타당할 것이나, 최선일(우선권 주장 출원의 경우 우선일이 최선일이 되며, 우선권 주장이 없는 경우에는 통상의 출원일이 최선일이 된다.)을 기준으로 설정할 수도 있을 것이다.In all of the patents related to the above citations, it would be reasonable to set the front and rear on the basis of the filing date between the self-patent and the cited patents, but the best date (in the case of a priority claim application, the priority date is the best, and there is no priority claim) In this case, the normal filing date may be the best date.

본 발명에서의 인용과 관련된 상기 표 1과 같은 각각의 분쟁 예측 요소(후보)에 대한 분쟁 예측 요소(후보)값은 상기 직접 인용, 상기 간접 인용, 상기 잠재 인용, 상기 사슬 인용 중 어느 하나 이상의 인용 종류별로 생성해 놓을 수 있다. 나아가 상기 각각의 분쟁 예측 요소는 간접 인용의 깊이별 및/또는 잠재 인용이나 사슬 인용의 종류별로도 분쟁 예측 요소를 세분화할 수 있으며, 세분화된 분쟁 예측 요소별로 분쟁 예측 요소값을 생성할 수도 있을 것이다. 한편, 세분화하지 않는 경우, 간접 인용의 깊이 및 잠재 인용의 종류 중 실제 분쟁 예측 모델의 생성 시에 사용하는 범위는 상기 사용자 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 관리자가 선택할 수 있을 것이다.The dispute prediction element (candidate) value for each dispute prediction element (candidate) as shown in Table 1 related to the citation in the present invention may be any one or more of the direct citation, the indirect citation, the latent citation, and the chain citation. You can create by type. Furthermore, each dispute prediction element may subdivide the dispute prediction element by the depth of indirect citation and / or by the type of potential citation or chain citation, and may generate the dispute prediction element value by the segmented dispute prediction element. . On the other hand, if not subdivided, the range used in generating the actual dispute prediction model among the depth of the indirect citation and the type of the potential citation may be selected by the user or the administrator of the patent dispute prediction information generation system 5000.

특정 특허 집합(Self Set, SS)이 있을 때, 상기 특허 집합에 포함된 특허가 인용하는 특허군으로 형성되는 특허 집합은 선출원 특허 집합(Parent Set)이 될 수 있고, 특정한 조건을 만족하는 특허(예를 들면 상기 특허 집합에 포함된 특허를 인용하는 특허 중 최근 3년 이내 또는 특정 출원인 또는 특허 분류를 포함하는 특허 등) 등도 선출원 특허 집합(Parent Set)이 될 수 있다. 등록 특허가 이 선출원 특허 집합에 포함된 특허들의 임의의 조합은 선출원 특허군이 된다. 상기 특허 집합에 포함된 특허를 인용하는 특허군 또는 특정한 조건을 만족하면서 상기 특허 집합에 포함된 특허를 인용하는 특허군으로 형성되는 특허 집합은 후출원 특허 집합(Child Set)이 될 수 있다. 상기 후출원 특허 집합에 포함된 특허들의 임의의 조합은 후출원 특허군이 된다. 상기 인용 관점 분쟁 예측 요소값은 상기 특정 특허 집합별, 선출원 특허 집합별, 후출원 특허 집합별로도 생성될 수 있다. B라는 특정 특허 집합이 A라는 특허 집합의 후출원 특허 집합일 경우, A가 Self Set이 되고, B는 Child Set이 되며, B라는 특정 특허 집합이 C라는 특허 집합의 선출원 특허 집합일 경우, C가 Self Set이 되고, B는 Parent Set이 된다. 한편, 인용-피인용 관계의 특성상 B라는 특정 특허 집합이 A라는 특허 집합의 후출원 특허 집합일 경우, 반드시 B의 Parent Set이 A가 되는 것은 아니다. B의 Parent Set의 부분 집합이 A가 되는 것이 보통이다.When there is a specific patent set (Self Set, SS), the patent set formed of the patent group cited by the patent included in the patent set may be a parent patent set (Parent Set), the patent satisfying a specific condition ( For example, patents citing patents included in the patent set within the last three years or patents including a specific applicant or patent classification, etc.) may also be a patent set (Parent Set). Any combination of patents in which a registered patent is included in this set of prior patents becomes a group of prior patents. A patent set formed of a patent group citing a patent included in the patent set or a patent group citing a patent included in the patent set while satisfying specific conditions may be a post-application patent set (Child Set). Any combination of patents included in the post-application patent set is a post-application patent group. The cited viewpoint dispute prediction element value may be generated for each specific patent set, for each prior patent set, or for each subsequent patent set. If the specific patent set B is a post-patent patent set of the patent set A, then A becomes a Self Set, B becomes a Child Set, and if the specific patent set B is a pre-patent patent set of a patent set C, then C Becomes Self Set, and B becomes Parent Set. On the other hand, if the specific patent set B is a post-patent patent set of the patent set A, the parent set of B does not necessarily become A due to the nature of the citation-to-cited relationship. It is common for a subset of B's Parent Set to be A.

도 25는 표 1과 관련하여 인용 관점의 분쟁 예측 요소별 분쟁 예측 요소값을 생성하는 일 실시예적 방법을 도시하고 있다. 상기 분쟁 요소값 생성 엔진은 분쟁 예측 요소값 생성 대상이 되는 특허를 입수(SL21)하고, 입수된 특허의 직접 인용 특허를 입수하고 직접 인용 관련 분쟁 예측 요소값을 생성하거나, 입수된 특허의 간접 인용 특허를 입수하고 간접 인용 관련 분쟁 예측 요소값을 생성하거나, 입수된 특허의 잠재 인용 특허를 입수하고, 잠재 인용 관련 분쟁 예측 요소값을 생성하거나, 입수된 특허의 사슬 인용 특허를 입수하고 사슬 인용 관련 분쟁 예측 요소값을 생성(SL22)하고, 생성된 인용 관련 분쟁 예측 요소값을 저장(SL23)한다.FIG. 25 illustrates one embodiment method for generating a dispute prediction element value for each dispute prediction element in a citation view with respect to Table 1. FIG. The dispute element value generation engine acquires a patent for which a dispute prediction element value is generated (SL21), obtains a directly cited patent of the acquired patent, generates a dispute prediction element value related to the direct citation, or indirectly cites the obtained patent. Obtain a patent and generate a dispute prediction factor value of an indirect citation, obtain a potential citation patent of a received patent, generate a dispute prediction factor value of a late citation, obtain a chain citation patent of the obtained patent, A dispute prediction element value is generated (SL22), and the generated citation related dispute prediction element value is stored (SL23).

둘째, 다수의 분쟁 특허들은 2회 이상 분쟁을 일으키는 등과 같이 분쟁 특허 집합에는 특허 분쟁에 다수 참여하는 다분쟁 유발 특허가 존재한다. 등록된 모든 특허권에 대하여 고르게 분쟁이 유발되는 것이 아니라, 분쟁에 관련되는 분쟁 대상 특허는 소수이며, 특히 몇몇의 다분쟁 유발 특허가 다수의 분쟁을 유발시킨다. 상기 다분쟁 유발 특허는 특허 분쟁 정보를 분석함으로써도 알 수 있다. 특허 분쟁이 발생한 특허를 특허 분쟁별로 입수하여, 특허 분쟁과 관련된 특허 번호를 입수하고, 그 특허 번호 중 기 설정된 분쟁 참여 회수 이상인 것은 다분쟁 유발 특허가 된다. 이러한 분쟁 발생 특허는 제1종 다분쟁 유발 특허가 된다.Second, there is a multi-dispute-induced patent that participates in a number of patent disputes in a set of disputed patents, such as many disputed patents causing more than one dispute. Rather than causing even disputes over all registered patent rights, there are only a few disputed patents involved in a dispute, and in particular, several multi-dispute-induced patents cause a number of disputes. The multi-dispute causing patent can also be known by analyzing patent dispute information. Obtain a patent for which a patent dispute has occurred by patent dispute, obtain a patent number related to a patent dispute, and more than the predetermined number of participations in a dispute among the patent numbers becomes a multi-dispute-inducing patent. Such dispute-producing patents become first-class multi-dispute-inducing patents.

한편, 분쟁 발생은 되지 않았지만, 분쟁 가능성이 높은 특허들도 존재한다. 예를 들면, 특허 괴물 또는 특허 괴물의의 자회사, 유관 회사, 특허 괴물이 관리하는 대상 권리자의 적어도 하나 이상의 특허군들이나, 적어도 하나 이상의 표준 특허풀에 포함된 특허, 다분쟁 유발자가 보유하고 있는 특허군의 특허나 이들에 신규로 편입(편입 여부는 current assignee 정보를 처리함으로 알 수 있다.)된 특허들이 예가 된다. 이러한 예에 속하는 특허를 제2종 다분쟁 유발 특허라 한다.On the other hand, there are patents that have not been disputed but are highly likely to be disputed. For example, at least one or more patent groups of subsidiaries, affiliates, affiliates of patent monsters or patent monsters, or patent holders controlled by patent monsters, patents contained in at least one standard patent pool, or patents held by multi-dispute creators Examples include patents in the group or newly incorporated into them (whether or not they can be found by processing current assignee information). The patents belonging to this example are referred to as a second type multi-dispute inducing patent.

한편, Reissue 특허, 다패밀리량 특허, 다패밀리량 증가 특허, 다피인용 특허, 피인용 증가 속도가 높은 특허 등과 같은 특허를 제3종 다분쟁 유발 특허라 한다.Meanwhile, patents such as a reissue patent, a multi-family amount patent, a multi-family amount increase patent, a multi-citation patent, and a patent with a high increase rate of citation are referred to as a third type multi-dispute-induced patent.

개별 특허를 기준으로 상기 개별 특허 기준 다분쟁 유발 특허 관점 분쟁 예측 요소별로 상기 제1종, 제2종 및/또는 제3종 다분쟁 유발 개별 특허 인지의 여부는 개별 특허 기준 다분쟁 유발 특허 관점 분쟁 예측 요소값을 생성할 수 있다. 개별 특허 기준 다분쟁 유발 특허 관점 분쟁 예측 요소값은 Reissue 특허 해당 여부 등과 같이 해당되는 경우 1, 해당되지 않는 경우에는 0처럼 취급될 수도 있지만, 대부분의 다분쟁 유발 특허 관점 분쟁 예측 요소값은 0과 1이 아닌 값(예를 들어 패밀리량은 정수, 피인용 증가 속도는 유리수값이 될 수도 있다.)이 될 수도 있다. 한편, 개별 특허가 아닌 특허군 단위로 상기 다분쟁 유발 특허 관점 분쟁 예측 요소별로 상기 제1종, 제2종 및/또는 제3종 다분쟁 유발 특허 관점 분쟁 예측 요소값을 생성할 수 있다. 즉, n개의 특허로 이루어진 다분쟁 유발 특허군이 있을 때, 상기 n개의 특허 중에서 상기 다분쟁 유발 특허 관점 분쟁 예측 요소별로 다분쟁 유발 특허 관점 분쟁 예측 요소값을 생성할 수 있다. 이때 생성되는 다분쟁 유발 특허 관점 분쟁 예측 요소값은 카운트(합산 포함)값, 평균값(산술평균, 기하평균 등) 및 기 설정된 함수식값 중 어느 하나가 될 수 있다. 예를 들어 100개의 특허로 이루어진 특허군이 있을 때, Reissue 특허의 개수가 10개일 경우, 10이 카운트값이 될 수 있다. 한편, 동일한 Reissue 특허 10개라도 모집단(n)이 작을수록 그 모집단은 상대적으로 많은 Reissue특허를 포함하게 된다. 즉, Reissue값의 밀도가 높아지게 되며, 높은 밀도는 그 특허군이 분쟁과 관련될 가능성을 높이게 된다. 한편, 상기 밀도는 평균값으로 측정될 수 있을 것이다.Multi-dispute-induced patent perspective disputes based on individual patents Whether the first, second and / or third-class multi-dispute-induced individual patents are identified on a per-prediction basis. Predictive element values may be generated. Per-patent multi-dispute-prone patent perspective Dispute prediction factor values may be treated as 1 if applicable, such as whether a Reissue patent is applicable, or 0 if not applicable. It may also be a value other than 1 (for example, the amount of the family is an integer, the rate of increase may be a rational value). Meanwhile, the first, second and / or third type multi-dispute-prone patent viewpoint dispute prediction element values may be generated for each of the multi-dispute-prone patent viewpoint dispute prediction elements in units of patent groups instead of individual patents. That is, when there is a multi-dispute-induced patent group consisting of n patents, the multi-dispute-induced patent viewpoint dispute prediction element value may be generated for each of the n-patent-induced patent viewpoint dispute prediction elements. In this case, the generated multi-dispute-prone patent viewpoint dispute prediction element value may be any one of a count (including summation) value, an average value (arithmetic mean, geometric mean, etc.) and a predetermined function value. For example, when there is a patent group consisting of 100 patents, when the number of Reissue patents is 10, 10 may be a count value. On the other hand, even if ten of the same Reissue patent is smaller the population (n), the population includes a relatively large number of Reissue patents. In other words, the density of Reissue increases, and the higher density increases the possibility that the patent group is related to the dispute. On the other hand, the density may be measured as an average value.

나아가, 상기 제1종, 제2종 및/또는 제3종 다분쟁 유발 개별 특허별 또는 상기 제1종, 제2종 및/또는 제3종 다분쟁 유발 개별 특허군별로 상기 인용 관점 분쟁 예측 요소 후보값들을 생성할 수 있음은 물론이다 할 것이다.Furthermore, the citation point of view dispute prediction element for each patent of the first, second and / or third kind of multi-dispute-induced patents or for each of the first, second, and / or third kind of multi-dispute-induced individual patent groups. Of course, it is possible to generate candidate values.

상기 다분쟁 유발 특허 관점에서 하기 표 2와 같은 분쟁 예측 요소 후보가 예시적으로 제시될 수 있을 것이다.In view of the multi-dispute-induced patents, dispute prediction element candidates as shown in Table 2 may be exemplarily presented.

계열line 계열 codeSeries code 다분쟁 유발 특허 관점 분쟁 예측 요소 후보군Multi-dispute-Prone Patent Perspective Dispute Prediction Factor Candidates 비고Remarks 총개수Total number A2A2 재분쟁수(분쟁참여수-1), 패밀리수, 권리자 변동수 또는 비발명자 assignee 변동수Re-disputes (participation-1), families, change in owners, or non-inventor assignee changes 최근n년 개수Last n years B2B2 최근 n년간 A2(예, 최근 n년간 재분쟁수, 최근 n년간 패밀리수, 최근 n년간 권리자 변동수, 최근 n년간 비발명자 assignee 변동수)(n은 1개 이상이 될 수 있음, 이하 이 표에서는 동일하게 적용)A2 for the last n years (e.g., number of re-disputes in the last n years, number of families in the last n years, change in the rights holders in the last n years, and change in assignee nonees in the last n years) (n may be one or more, The same applies) 증감량 또는 증감율Increase or decrease rate C2C2 전기간 기준 A2~B2의 증감량, 특정 기간 기준 A2~B2의 증감율The amount of increase and decrease of standard A2-B2 for the whole period, increase and decrease rate of standard A2-B2 for specific period 여부Whether D2D2 Reissue특허 여부, 해외 특허 여부, 트롤 특허 여부Reissue patent status, foreign patent status, troll patent status 서지 사항Bibliography E2E2 출원 청구항수, 등록 청구항 수, 도면수, 페이지수, 특허 분류수, 특정 특허 분류 레벨 기준 이종 특허 분류수, 공동출원인수, 공동발명자수, 등록 소요 기간, 권리 유지 기간, 외국 발명자수, 외국 발명자 비율, 우선권 주장수, 등록 유지 기간No. of Application Claims, No. of Registration Claims, No. of Drawings, No. of Pages, No. of Patents, No. of Heterogeneous Patents Based on Specific Patent Classification Levels, No. of Co-Applications, No. of Co-Inventors, No. of Registrations, No. of Retention Periods, No. of Foreign Inventors, No. of Foreign Inventors Percentage, priority claims, and retention period 표준 또는 소속풀Standard or Membership Pool E2E2 표준 특허풀 포함 여부, 표준 특허풀 포함수, 소속 특허풀 포함수Inclusion of standard patent pool, inclusion of standard patent pool, inclusion of affiliated patent pool 다중 분쟁Multiple disputes F2F2 분쟁참여수, 공동참여분쟁수, 분쟁당 공동참여 특허수, 총피고수, 참여분쟁당 피고수Number of dispute participation, number of joint participation disputes, number of joint participation patents per dispute, total defendants, defendants per participation dispute 다중 분쟁 증감Multiple disputes G2G2 최근 n년 간 F2, 전기간 기준 F2의 증감량, 특정 기간 기준 F2의 증감율F2 in the last n years, F2 increase or decrease in the previous period, F2 increase or decrease in a specific period 분쟁 분포Dispute distribution H2H2 최초 분쟁 연도, 최후 분쟁 연도, 최빈 분쟁 연도First dispute year, last dispute year, least disputed year 경과 정보Progress information I2I2 심판별 심판 제기 여부, 심판별 심판 제기 수, 우선심사청구 여부, 출원일과 심사청구일 차이, 출원일과 등록일 차이Judgment by Judgment, Number of Judgments by Judgment, Request for Priority Examination, Difference between filing date and request, Difference between filing date and registration date

셋째, 다분쟁 유발자가 존재한다. 다분쟁 유발자 정보는 특허 분쟁 정보를 입수하고, 특허 분쟁 정보에 포함된 특허 번호의 권리자 또는 원고(plaintiff)와 관련되어, 기 설정된 빈도(frequency) 또는 기 설정된 증가율 등의 각종 조건을 적용하여 적어도 하나 이상의 다분쟁 유발자를 추출할 수 있다. 분쟁 정보를 분석하여 생성하는 다분쟁 유발자를 제1종 다분쟁 유발자라 한다. 한편, 특허 괴물 또는 적어도 하나 이상의 특허풀에 기 설정된 개수 이상의 특허를 보유하고 있거나 기 설정된 조건(최근 개수, 고증가율)을 충족하는 특허를 보유하고 있는 권리자 등은 제2종 다분쟁 유발자라 한다. 그리고, 권리자들 중에서 그 권리자의 전체 특허 포트폴리오 또는 특정 시기 또는 특정 조건으로 결정되는 특허군의 특허 중에서 Reissue 특허 비중, 다패밀리량 특허 비중, 다패밀리량 증가 특허 비중, 다피인용 특허 비중, 피인용 증가 속도가 높은 특허 비중 등이 높은 특허군을 보유하는 권리자를 제3종 다분쟁 유발 권리자라 한다.Third, there is a multi-dispute inducer. The multi-dispute cause information is obtained by applying various conditions such as a predetermined frequency or a predetermined increase rate in accordance with the patent dispute information and the owner or plaintiff of the patent number included in the patent dispute information. One or more polydisputers can be extracted. The multi-dispute inducer that analyzes and generates the conflict information is called the first multi-dispute inducer. On the other hand, the right holder who owns a patent monster or a predetermined number of patents in at least one or more patent pools, or holds a patent that satisfies a predetermined condition (recent number, high growth rate), etc., is called a second type of multi-dispute inducer. In addition, among the right holders, the proportion of Reissue patents, multi-family patents, multi-family patents, multi-patent patents, and increased citations among the patent holders determined by the owner's entire patent portfolio or at specific times or under certain conditions. The right holder who owns a patent group with a high proportion of high-speed patents is called a third type multi-dispute right holder.

상기 다분쟁 유발자 관점에서 하기 표 3와 같은 분쟁 예측 요소 후보가 예시적으로 제시될 수 있을 것이다.From the viewpoint of the multi-dispute inducer, a dispute prediction factor candidate may be presented as an example.

계열line 계열 codeSeries code 다분쟁 유발자 관점 분쟁 예측 요소 후보군Multi-Dispute Provoker Perspective Conflict Predictor Candidate 비고Remarks 다분쟁 제기자 관련성Multidispute Relevant A3A3 권리자의 분쟁 참여 회수, 권리자의 총 분쟁 참여 특허 보유수, 권리자의 특허 중 분쟁 참여 특허 비율Number of dispute participation patent holders, total number of dispute participation patent holders, ratio of dispute participation patents among patent holders 다분쟁 경험자 관련성Multidispute Experience B3B3 권리자의 분쟁 피소 회수, 권리자의 분쟁 피소 회수 대비 분쟁 제기 회수 비율Number of disputes filed by right holder, number of disputes filed against number of disputes filed by right holder 표준 특허풀 참여자 관련성Relevance of Standard Patent Pool Participants C3C3 권리자의 표준 특허풀 참여 특허수, 권리자의 총 특허 중 표준 특허풀 비참여 특허수, 권리자의 특허 중 표준 특허풀 참여 특허 비율No. of Participants' Standard Patent Pool Participation Patents, No. of Patents Participating in Standard Patent Pool Participants, and Ratio of Participant's Patents in Standard Patent Pool Participants 최근n년 개수Last n years D3D3 최근 n년간 A3~C3A3-C3 in last n years 증감량 또는 증감율Increase or decrease rate E3E3 최근 n년 기준 A3~C3 증감량, 최근 n년 기준 A3~C3 증감율A3 ~ C3 increase / decrease in recent n years, A3 ~ C3 increase / decrease in recent n years 등록된 트롤 관련성Registered Troll Relevance F3F3 권리자가 등록된 트롤인지 여부Whether the owner is a registered troll 권리자 기관 속성Owner Authority Attributes G3G3 대기업 여부, 해외 기업 여부, 권리자의 기업/기관/대학/개인 여부Whether it is a large company, an overseas company, or whether the owner is a company / institution / university / individual 권리자 특허 포트폴리오 속성Owner Patent Portfolio Properties H3H3 전체 출원청구항수, 특허출원 1건당 청구항 수, 발명자 1인당 청구항수, 전체 특허등록건수, 전체 등록청구항수, 특허등록율, 전체 유효특허건수, 유효특허잔존율, 유효특허의 평균잔존 소요기간, 특허 당 인용지수, 기술 영향력 지수, 기술력 지수, 기술진보 측정 지수, Innovation Speed Index(isi), 과학적 연계성, 과학력 지수, 과학적 연계지수, International Knowledge Flow(IKF), 기술 의존도, 기술 자립도, 공동 출원 비율, 공동 출원인수Total number of claimed claims, number of claims per patent application, number of claims per inventor, total number of patents registered, total number of claimed claims, patent registration rate, total number of effective patents, valid patent remaining rate, average duration of valid patents, patents Party Citation Index, Technology Influence Index, Technology Index, Technology Progress Measurement Index, Innovation Speed Index (isi), Scientific Linkage, Science Index, Scientific Linkage Index, International Knowledge Flow (IKF), Technology Dependency, Technology Independence, Co-Application Rate , Co-applicants

n개의 특허로 이루어진 Set가 있을 때, 상기 n개의 특허 중에서 상기 다분쟁 유발자가 권리자로 되어 있는 특허의 카운트값, 평균값 및 기 설정된 함수식값 중 어느 하나 이상을 다분쟁 유발자 관점 분쟁 예측 요소값으로 할 수 있다. 예를 들어 100개의 특허로 이루어진 특허군이 있을 때, AA라는 다분쟁 유발자가 권리자로 된 특허의 개수가 10개일 경우, 10이 카운트값이 될 수 있다. 한편, 동일한 다분쟁 유발자가 권리자인 특허가 10개라도 모집단(n)이 작을수록 그 모집단은 상대적으로 많은 수의 다분쟁 유발자가 권리자로 된 특허를 포함하게 된다. 즉, 다분쟁 유발자 관련 밀도가 높아지게 되며, 높은 밀도는 그 특허군이 분쟁과 관련될 가능성을 높이게 된다. 한편, 상기 밀도는 평균값으로 측정될 수 있을 것이다.When there is a Set consisting of n patents, one or more of the count value, average value, and a predetermined function value of a patent in which the multi-dispute initiator is the owner of the n patents is set as the multi-dispute viewpoint perspective dispute prediction element value. can do. For example, when there is a patent group consisting of 100 patents, if the number of patents in which the multi-dispute-provoker of AA is the owner is 10, 10 may be a count value. On the other hand, even if there are ten patents in which the same multi-dispute inducer is the owner, the smaller the population (n), the larger the number of patents in which the multi-dispute inducer is the owner. In other words, the density of multi-dispute triggerers becomes higher, and the higher density increases the likelihood that the patent group will be related to the dispute. On the other hand, the density may be measured as an average value.

나아가, 상기 제1종, 제2종 및/또는 제3종 다분쟁 유발자의 다분쟁 유발자의 전체 특허로 생성되는 특허군 또는 기 설정된 조건(특정 기간, 특정 특허 분류, 또는 특정 검색어 또는 기타 조건)을 충족하는 특허군 각각에 대하여 상기 인용 관점 분쟁 예측 요소 후보값을 생성해 놓을 수 있을 것이다.Furthermore, a patent group or preset conditions (specific period, specific patent classification, or specific search term or other condition) generated by the entire patent of the multi-dispute inducer of the first, second and / or third multi-dispute inducers For each patent group that satisfies the above, the cited viewpoint dispute prediction element candidate value may be generated.

넷째, 소송이 빈번하게 발생하는 다분쟁 기술군이 존재한다. 또한, 권리자가 자신이 보유하고 있는 특허 포트폴리오를 기준으로 분쟁을 다수 일으키는 기술군과 그렇지 않을 기술군이 존재하며, 기술군마다 권리자의 특허 포트폴리오가 다르며, 권리자의 주관적 중요도가 다를 수 있다. 권리자는 통상적으로 2 이상의 기술군(예를 들면 특허 분류)에 대하여 2 이상의 특허를 보유하고 있을 수 있고, 동일 권리자의 각 기술군에 포함된 특허마다 분쟁 발생 여부, 분쟁 제기 회수 등이 다를 수 있다.Fourth, there is a multi-dispute technology group in which litigation occurs frequently. In addition, there exists a technology group that causes a lot of disputes based on a patent portfolio owned by the rights holder and a technology group that does not. The technology portfolio of the rights holder may be different, and subjective importance of the rights holder may be different. The right holder may normally have two or more patents for two or more technology groups (for example, a patent classification), and the patents included in each technology group of the same rights holder may have different disputes and the number of disputes filed. .

예를 들면, 미국 특허 분쟁 정보에서 분쟁 유발 특허의 특허 분류 중에서 IPC 기준 G06F와 A61K에는 다른 특허 분류에 비하여 다수의 특허 분쟁이 발생함을 알 수 있다. 분쟁 유발 특허에는 적어도 한 종류 이상의 적어도 하나 이상의 특허 분류가 포함되어 있으며, 상기 특허 분류를 처리하여 적어도 하나 이상의 다분쟁 기술군을 추출한다. 상기 다분쟁 기술군은 IPC, USPC 또는 기타 특허 서지 사항에 포함되어 있는 특허 분류가 될 수 있다. 한편, 특허 분류 체계의 계층 구조상 다양한 계층별로의 다분쟁 기술군 정보를 생성할 수 있다. 예를 들어, H04B 7/26라는 특허 분류가 있을 때, H04B 7/26(2 dot Subgroup) 부모는 특허 분류 체계 상 차례로 H04B 7/24(1dot Subgroup), H04B 7/00(Main Group), H04B(Subclass), H04(Class), H(Section)가 된다. 따라서, n dot Subgroup부터 Main Group, Subclass, Class 등과 같이 각 계층별로 분쟁 발생 특허군의 특허 분류를 기준으로 적어도 하나 이상의 최빈 특허 분류를 추출할 수 있을 것이다. 이때, 주특허 분류(Main Classification)만으로 최빈 특허 분류를 추출할 지, 부특허 분류(Sub Classification)을 포함하여 최빈 특허 분류를 추출할 지는 시스템의 설정 또는 사용자의 선택에 따라 달라질 수 있다. 물론, 특허 분류 계층 구조상의 어느 깊이(IPC의 경우 Section에서 n dot Subgroup, USPC의 경우 Class에서 n dot Subgroup 등 특허 분류 체계의 종류마다 다를 수 있다.)를 선택할지도 시스템의 설정 또는 사용자의 선택에 따라 달라질 수 있다. For example, in the US patent dispute information, it can be seen that among the patent classifications of dispute-inducing patents, a number of patent disputes occur in IPC standards G06F and A61K compared to other patent classifications. The dispute-inducing patent includes at least one or more types of at least one patent classification, and the patent classification is processed to extract at least one multi-dispute technology group. The multidispute technology group may be a patent classification included in the IPC, USPC or other patent bibliography. Meanwhile, the multi-dispute technology group information for various hierarchies can be generated in the hierarchical structure of the patent classification system. For example, when there is a patent classification of H04B 7/26, the parents of H04B 7/26 (2 dot Subgroup) are in sequence H04B 7/24 (1dot Subgroup), H04B 7/00 (Main Group), H04B (Subclass), H04 (Class), and H (Section). Therefore, at least one of the most frequent patent classifications may be extracted based on the patent classification of the disputed patent group for each layer such as n dot subgroup, main group, subclass, and class. At this time, whether to extract the most frequent patent classification using only the main patent classification or to extract the most frequent patent classification including the sub classification may vary according to the system setting or the user's selection. Of course, the depth of the patent classification hierarchy (which may be different for each type of patent classification system such as n dot subgroup in Section in the case of IPC and n dot Subgroup in Class in the case of USPC) is also determined by the system setting or user selection. It may vary.

상기 다분쟁 기술군 관점에서 하기 표 4와 같은 분쟁 예측 요소 후보가 예시적으로 제시될 수 있을 것이다.From the perspective of the multi-dispute technology group, a dispute prediction element candidate as shown in Table 4 may be presented as an example.

계열line 계열 codeSeries code 다분쟁 기술군 관점 분쟁 예측 요소 후보군Multi-dispute Technology Group Perspective Conflict Prediction Factor Candidate 비고Remarks 다분쟁 기술군 관련성Multidispute Technology Group Relevance A4A4 특허 분류 기준 분쟁 발생 회수,
특허 분류 기준 분쟁 제기 특허수,
특허 분류 기준 분쟁 제기자수,
특허 분류 기준 분쟁 피고자수,
특허 분류 기준 전체 특허수 대비 분쟁 발생 회수 비율,
특허 분류 기준 전체 특허수 대비 분쟁 제기 특허수 비율
특허 분류 기준 전체 특허수 대비 분쟁 제기자수 비율,
특허 분류 기준 전체 특허수 대비 분쟁 피고자수 비율Number of disputes over patent classification criteria,
Number of patents filed
Patent classifier disputes,
Patent classifier dispute defendants,
Ratio of number of dispute occurrences to total number of patents based on patent classification,
Ratio of disputed patents to total patents based on patent classification
Ratio of disputes to total patents,
Ratio of disputed defendants to total number of patents based on patent classification 권리자 보유 특허군 한정 다분쟁 기술군 관련성Relevant patent group limited multi-dispute technology group relevance B4B4 권리자 보유 특허군 한정 특허 분류 기준 분쟁 발생 회수,
권리자 보유 특허군 한정 특허 분류 기준 분쟁 제기 특허수,
권리자 보유 특허군 한정 특허 분류 기준 분쟁 제기자수,
권리자 보유 특허군 한정 특허 분류 기준 분쟁 피고자수,
권리자 보유 특허군 한정 특허 분류 기준 전체 특허수 대비 분쟁 발생 회수 비율,
권리자 보유 특허군 한정 특허 분류 기준 전체 특허수 대비 분쟁 제기 특허수 비율
권리자 보유 특허군 한정 특허 분류 기준 전체 특허수 대비 분쟁 제기자수 비율,
권리자 보유 특허군 한정 특허 분류 기준 전체 특허수 대비 분쟁 피고자수 비율Number of patent disputes based on patent group-specific patent classification
Number of patents for disputes
Number of patent dispute holders
Number of disputed defendants
The ratio of the number of times a dispute arises out of the total number of patents
Patent Group Based on Patent Group Limited Patent Classification Criteria Ratio of Disputed Patents
Patent holder-limited patent classification criteria The ratio of the number of disputed applicants to the total number of patents,
Patent group-specific patent classification criteria The ratio of disputed defendants to total patents 권리자 기술 분야 포트폴리오 속성Owner Technology Sector Portfolio Attributes C4C4 권리자 특허 분류 기준 점유율,
권리자 특허 분류 기준 집중율,
권리자 특허 분류 기준 활동율,
권리자 특허 분류 기준 H3
권리자 특허 분류 기준 특허당 평균 등록 유지 기간Owner patent classification share,
Owner patent classification criteria concentration rate,
Owner patent classification criteria activity rate,
Owner Patent Classification Criteria H3
Patent Holder Criteria Average Duration of Registration per Patent 최근n년 개수Last n years 최근 n년간 A4~C4A4 to C4 in the last n years 증감량 또는 증감율Increase or decrease rate 최근 n년 기준 A4~C4 증감량, 최근 n년 기준 A4~C4 증감율A4 ~ C4 increase or decrease based on the last n years, A4 ~ C4 increase or decrease based on the last n years 기술군 속성Technology attribute 기술의 경제적 수명The economic life of the technology

상기 표 4에서 특허 분류는 특허 분류 종류(예, IPC, USPC 등)별, 특허 분류 깊이(예, IPC의 경우, class, subclass, main group, n dot subgroup 등)별, 주특허분류별 또는 모든 특허분류별 중 어느 하나 또는 2 이상의 조합이 될 수 있을 것이다. 한편, 상기 표 4에서 권리자 보유 특허군 한정 특허 분류 기준이란, 특정 특허 Pi의 권리자가 보유하고 있는 특허군에 포함된 특허 분류만을 대상으로 생성되는 분쟁 예측 요소를 말한다. 예를 들어 C1 특허 분류의 특허 m개와 C2 특허 분류의 특허 n개를 보유하고 있는 A 권리자가 있을 경우, 상기 B4에 해당하는 분쟁 예측 요소값은 m개와 n개별로 구분지어 처리된다. 상기 A 권리자의 m개의 특허 중에서 2개가 분쟁을 제기한 특허이며, n개의 특허 중에는 분쟁을 제기한 특허가 1개 있는 경우, A 권리자의 권리자 보유 특허군 한정 특허 분류 기준 분쟁 제기 특허수라는 분쟁 예측 요소값은 C1에 대해서는 2, C2에 대해서는 1이 되며, 상기 권리자 보유 특허군 한정 특허 분류 기준 전체 특허수 대비 분쟁 제기 특허수 비율이라는 분쟁 예측 요소값은 C1에 대해서는 C1에 대해서는 2/m이 되며, C2에 대해서는 1/n이 된다.In Table 4, the patent classification is classified by patent classification type (eg, IPC, USPC, etc.), patent classification depth (eg, IPC, class, subclass, main group, n dot subgroup, etc.), main patent classification, or all patents. It may be any one of the classification or a combination of two or more. On the other hand, in Table 4, the patent group based on the patent group limited patent group refers to a dispute prediction element generated only for the patent classification included in the patent group held by the owner of the specific patent Pi. For example, if there is an A owner who holds m patents in the C1 patent classification and n patents in the C2 patent classification, the dispute prediction element values corresponding to B4 are processed separately by m and n. Of the m patents of the right holder A, two are the disputed patents, and if there are one disputed patent among the n patents, the number of disputed patents based on the patent group limited patent classification criteria of the right holders owned by the right holder of the rights holder A The factor value is 2 for C1 and 1 for C2, and the dispute prediction factor value of the number of disputed patents against the total number of patents based on the patent group-limited patent classification is 2 / m for C1 for C1. , C2 is 1 / n.

한편, 상기 분쟁 예측 요소값 생성부(5510)는 사전에 또는 기 설정된 주기별로 또는 기 설정된 조건 성취를 기준으로 특허 DB(1120)에 속하는 모든 특허별로 또는 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합별로 표 1 내지 표 2의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성해 놓을 수 있을 것이다. 한편, 상기 분쟁 예측 요소값 생성부(5510)는 실시간으로 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합별 또는 개별 특허별로 표 1 내지 표 2의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성할 수 있을 것이다. 그리고, 상기 분쟁 예측 요소값 생성부(5510)는 사전에 또는 기 설정된 주기별로 또는 기 설정된 조건 성취를 기준으로 특허 DB(1120)에 속하는 모든 특허 권리자별로 또는 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합에 포함된 특허 권리자별로 표 3의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성해 놓을 수 있을 것이다. 한편, 상기 분쟁 예측 요소값 생성부(5510)는 실시간으로 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합에 포함된 특허 권리자별 또는 개별 특허의 특허 권리자에 대해서 표 3의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성할 수 있을 것이다.On the other hand, the dispute prediction element value generation unit 5510 is at least one patent consisting of at least one patent or for every patent belonging to the patent DB 1120 in advance or for a predetermined period or based on the achievement of a predetermined condition Dispute prediction element values of one or more or all dispute prediction elements of Tables 1 to 2 may be generated for each set. On the other hand, the dispute prediction element value generation unit 5510 is in real time dispute prediction of any one or more or all of the dispute prediction elements of Table 1 to Table 2 by at least one or more patent sets or individual patents composed of at least one or more patents You will be able to create element values. The dispute prediction element value generation unit 5510 may include at least one or more patents configured by all patent owners belonging to the patent DB 1120 or at least one or more patents in advance or at predetermined periods or based on achievement of preset conditions. For each patent owner included in a patent set, dispute prediction element values of one or more or all dispute prediction elements in Table 3 may be generated. On the other hand, the dispute prediction element value generation unit 5510 is any one or more or all of the disputes in Table 3 with respect to patent holders of each patent holder or individual patents included in at least one or more patent sets consisting of at least one patent in real time Dispute prediction element values of the prediction elements may be generated.

또한, 상기 분쟁 예측 요소값 생성부(5510)는 사전에 또는 기 설정된 주기별로 또는 기 설정된 조건 성취를 기준으로 특허 DB(1120)에 속하는 모든 특허의 특허 분류별로 또는 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합에 포함된 특허의 특허 분류별로 또는 적어도 하나 이상의 특허 집합에 포함된 특허의 권리자를 기준으로 한 특허 분류별로 표 4의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성해 놓을 수 있을 것이다. 한편, 상기 분쟁 예측 요소값 생성부(5510)는 실시간으로 적어도 하나 이상의 특허로 구성되는 적어도 하나 이상의 특허 집합에 포함된 특허 분류별 또는 개별 특허의 특허 분류에 대해서 표 4의 어느 하나 이상 또는 전체 분쟁 예측 요소분의 분쟁 예측 요소값을 생성할 수 있을 것이다. 상기 분쟁 예측 요소값 생성부(5510)의 정보 처리의 대상이 되는 것은 기 설정된 특허 분류 종류에 대하여, 상기 특허 분류가 속하는 특허 분류 체계 상의 적어도 하나 이상의 특허 분류 깊이 별로 생성될 수 있을 것이며, 상기 정보 처리는 특허 분류 종류별로 처리될 수도 있으며, 상기 정보 처리는 주특허분류만 사용하거나, 부특허 분류도 함께 사용하여 처리될 수도 있을 것이다. 예를 들면, 미국 특허의 경우, IPC와 USPC가 표기되어 있을 경우, IPC는 subclass 및 main group 단위로 각각 분쟁 예측 요소값을 생성하도록 처리하고, USPC의 경우에는 class 단위로 분쟁 예측 요소값을 생성할 수도 있을 것이다. 이때, IPC의 경우에는 주 특허 분류만 사용하고, USPC의 경우에는 부특허분류도 함께 고려(예를 들면, Pi 특허에 C1 주특허분류와 C2 부특허분류가 있는 경우, C2를 기준으로 분쟁 예측 요소값을 생성할 때, Pi를 정보 처리의 대상으로 처리하는 등)하여 상기 분쟁 예측 요소값을 생성할 수 있을 것이다.In addition, the dispute prediction element value generation unit 5510 may include at least one of patents of all patents belonging to the patent DB 1120 or at least one or more patents in advance or at predetermined periods, or based on achievement of preset conditions. Generate dispute prediction element values for any one or more of Table 4 or for all dispute prediction elements in each of the patent classifications of patents included in one or more patent sets, or by patent classification based on the owners of patents included in at least one patent set. You can do it. On the other hand, the dispute prediction element value generation unit 5510 predicts any one or more or all disputes in Table 4 for patent classification of each patent classification or individual patents included in at least one patent set composed of at least one patent in real time. Dispute prediction factor values for the components may be generated. The object of the information processing of the dispute prediction element value generation unit 5510 may be generated for at least one or more patent classification depths on the patent classification system to which the patent classification belongs, with respect to a preset patent classification type. The processing may be performed by patent classification type, and the information processing may be performed using only the main patent classification or using the subpatent classification together. For example, in the case of US patents, if IPC and USPC are indicated, IPC processes to generate dispute prediction element values in subclass and main group units, and USPC generates dispute prediction element values in class units. You could do it. In this case, only the main patent classification is used in the case of IPC, and the sub-patent classification is also considered in the case of USPC (for example, if there is a C1 main patent classification and a C2 sub patent classification in the Pi patent, the dispute is predicted based on C2). When generating an element value, the dispute prediction element value may be generated by processing Pi as an object of information processing).

한편, 상기 분쟁 예측 요소값 생성부(5510)가 특허 분류 기준으로 특정 특허 분류에 관한 정보 처리할 때, 상기 특정 특허 분류의 하위 특허 분류에 해당하는 특허들을 상기 특정 특허 분류의 정보 처리 시 포함되어 처리될 수 있다. 예를 들면, 상기 분쟁 예측 요소값 생성부(5510)는 H04B 7/24가 다분쟁 기술군일 경우, Main IPC로 H04B 7/24를 포함하는 특허로 된 특허군(물론, H04B 7/24를 Parent 특허 분류로 하는 Child 특허 분류가 있을 경우, 그 Child 특허 분류 모두를 포함하는 특허로 된 특허군을 포함한다. 이는 특허 분류의 계층 구조의 속성상 당연할 것이다.)로 생성되는 특허군 또는 기 설정된 조건(특정 기간, 특정 특허 분류, 또는 특정 검색어 또는 기타 조건)을 충족하는 특허군 각각에 대하여, 기 설정된 분쟁 예측 요소별 분쟁 예측 요소값을 생성한다.On the other hand, when the dispute prediction element value generation unit 5510 processes information about a specific patent classification on a patent classification basis, patents corresponding to lower patent classifications of the specific patent classification are included in the information processing of the specific patent classification. Can be processed. For example, when H04B 7/24 is a multi-dispute technology group, the dispute prediction element value generation unit 5510 may be a parent of a patent group including H04B 7/24 as a main IPC (of course, H04B 7/24 is a parent). If there is a child patent classification as a patent classification, it includes a patent group consisting of patents covering all of the child patent classifications, which would be natural due to the nature of the hierarchy of the patent classification. For each patent group that meets a condition (a specific period, a specific patent classification, or a specific search word or other condition), a dispute prediction element value for each preset dispute prediction element is generated.

분쟁 예측 요소와 분쟁 예측 요소 후보와의 관계에 대해서 설명한다. 분쟁 예측 요소 후보는 본질적으로 분쟁 예측 요소가 된다. 분쟁 예측 요소 후보를 대상으로 분쟁 예측 모델을 생성했을 때, 그 분쟁 예측 모델에 기여하는 바가 없거나, 기 설정된 수준 이하가 되는 분쟁 예측 요소 후보는 버릴 수 있고, 남은 것들이 분쟁 예측 요소가 될 수 있다. 하지만, 기여하는 바가 아주 미미하거나(모델의 생성에서 분쟁 예측 요소별로 모델에 대한 기여도가 순수하게 0인 경우는 통상적으로 없으며, 아주 미미한 경우가 대부분이다.) 기 설정된 수준 이하가 되는 분쟁 예측 요소를 분쟁 예측 모델에 포함시켜도 그러한 분쟁 예측 요소는 기여도(중요도)가 작으므로, 그러한 분쟁 예측 요소가 포함되어 있어도 분쟁 예측 모델값은 거의 달라지지 않는다. 따라서, 모든 분쟁 예측 요소 후보는 분쟁 예측 요소가 될 수 있다. 기여도(중요도)가 미미한 분쟁 예측 요소가 포함되는 경우에는 이러한 분쟁 예측 요소에 대해서도 분쟁 예측 요소값을 생성하고, 모델 생성에 반영해야 하고, 분쟁 예측 모델값을 생성하는데 이러한 분쟁 예측 요소에 대한 분쟁 예측 요소값을 반영해야 하므로, 컴퓨팅 파워의 소모 면에서 비경제적, 비효율적이 되는 면이 있으나, 컴퓨팅 파워가 충분하다면 많은 분쟁 예측 요소를 사용하는 것도 바람직한 방법이 될 수 있다.The relationship between the conflict prediction element and the candidate for the conflict prediction element will be described. Dispute predictor candidates are essentially conflict predictor elements. When a dispute prediction model is generated for a dispute prediction element candidate, a dispute prediction element candidate that does not contribute to the dispute prediction model or falls below a predetermined level may be discarded, and the remaining ones may be a dispute prediction element. However, conflict prediction elements that contribute very little (typically, very rarely, have zero contributions to the model for each conflict prediction element in the generation of the model) or are very minimal. Even if included in the dispute prediction model, such a dispute prediction element has a small contribution (importance), so even if such a dispute prediction element is included, the dispute prediction model value is hardly changed. Thus, all dispute prediction element candidates may be dispute prediction elements. If a conflict prediction element with a low contribution (importance) is included, a dispute prediction element value should be generated for the dispute prediction element, reflected in the model generation, and a dispute prediction model value is generated. Since the element value must be reflected, it may be uneconomical and inefficient in terms of computing power consumption. However, if the computing power is sufficient, it may be a desirable method to use many dispute prediction factors.

상기 표 1 내지 표 4에 나타나는 분쟁 예측 요소에 대하여 사전에 기초적인 사전 통계 분석을 수행할 수 있다. 상기 분쟁 예측 모델 생성 엔진(5500)은 상기 분쟁 예측 요소 중 어느 하나 이상에 대하여, 분쟁 특허 집합과 비분쟁 특허 집합별로 상기 각 특허 집합에 속하는 개별 특허에 대하여 상기 분쟁 예측 요소값에 대한 기 설정된 통계량 및 통계 처리 정보를 생성한다. 상기 통계량은 상기 분쟁 예측 요소값의 평균, 분산, 표준편차, 분포 속성 등에 대한 수치값이며, 상기 통계 처리 정보에는 통계 분석에 대한 비교 그래프 등과 같은 시각화된 정보가 포함될 수 있다. 특정한 분쟁 예측 요소에 있어서, 분쟁 특허 집합에 대한 분쟁 예측 요소값과 비분쟁 특허 집합에 대한 분쟁 예측 요소값이 같거나 극히 유사한 경우에는 이러한 분쟁 예측 요소는 분쟁 예측 모델의 생성에 처음부터 배제시킬 수도 있다.(후보 단계에서 탈락시킴)Dictionary-based prior statistical analysis may be performed on the dispute prediction elements shown in Tables 1 to 4. The dispute prediction model generation engine 5500 is configured with respect to any one or more of the dispute prediction elements, predetermined statistics for the dispute prediction element value for individual patents belonging to each of the patent sets for each dispute patent set and non-dispute patent set; Generate statistical processing information. The statistics are numerical values for the mean, variance, standard deviation, distribution attribute, etc. of the dispute prediction element value, and the statistical processing information may include visualized information such as a comparison graph for statistical analysis. For certain dispute prediction elements, if the value of the dispute prediction element for the disputed patent set and the value of the dispute prediction element for the non-dispute patent set are the same or extremely similar, these dispute prediction elements may be excluded from the beginning in the generation of the dispute prediction model. (Dropped out at the candidate stage)

이어, 상기 분쟁 예측 요소값 생성부(5510)가 생성하는 분쟁 예측 요소값을 사용하여 분쟁 예측 모델을 생성하는 구체적 방법에 대하여 설명한다. 도 26에는 분쟁 예측 모델을 생성하는 일 실시예적 방법을 도시하고 있다. 상기 분쟁 예측 모델 생성 엔진(5500)은 적어도 한 종류 이상의 특허 분쟁에 사용된 특허를 포함하는 적어도 하나 이상의 분쟁 특허 집합과 적어도 하나 이상의 비분쟁 특허 집합을 입수(SL31)하고, 분쟁 특허 집합을 구성하는 적어도 2 이상의 분쟁 특허들과 비분쟁 특허 집합을 구성하는 적어도 2 이상의 비분쟁 특허들에 대하여 기 설정된 적어도 2 이상의 분쟁 예측 요소에 대한 분쟁 예측 요소값을 생성(SL32)하며, 분쟁 특허와 비분쟁 특허에 대하여, 분쟁 예측 요소값을 설명 변수값으로 하고, 분쟁 특허에 부여된 분쟁 특허 부여값과 분쟁 특허 부여값과 다르게 부여되며 비분쟁 특허에 부여된 비분쟁 특허 부여값을 반응 변수값으로 하여 기 설정된 통계학적 처리를 수행하여 기 설정된 적어도 하나 이상의 분쟁 예측 모델을 생성(SL33)한다. 이하, 분설한다. Next, a concrete method of generating a dispute prediction model using the dispute prediction element value generated by the dispute prediction element value generation unit 5510 will be described. FIG. 26 illustrates one embodiment method of generating a dispute prediction model. The dispute prediction model generation engine 5500 acquires (SL31) at least one or more dispute patent sets and at least one non-dispute patent set including patents used in at least one or more types of patent disputes, and constitutes at least one dispute patent set. Generate a dispute prediction element value for at least two dispute prediction elements preset for at least two dispute patents and at least two non-dispute patents constituting a non-dispute patent set (SL32); For example, the dispute prediction element value is set as an explanatory variable value, and the dispute patent grant value assigned to the dispute patent and the dispute patent grant value are different from each other, and the non-dispute patent grant value assigned to the non-dispute patent is set as a response variable value. Statistical processing is performed to generate at least one preset dispute prediction model (SL33). Hereinafter, powdering is performed.

분쟁 예측 모델의 생성을 위해 분쟁 특허 집합과 비분쟁 특허 집합을 생성이 우선적으로 진행된다. 분쟁 특허 집합은 분쟁을 일으킨 특허들로 구성되는 집합이다. 비분쟁 특허 집합은 분쟁을 일으키지 않은 특허들을 말하며, 분쟁을 일으키지 않은 특허들이 대다수이므로, 분쟁 예측 모델 생성에 사용되는 비분쟁 특허 집합은 샘플링을 통하여 별도로 구성된다. 전체 비분쟁 특허에 대하여 비분쟁 특허 집합의 생성을 위한 샘플링은 샘플 크기를 결정한 다음 1) 랜덤 추출, 2) 층화 추출, 3) 또는 기 설정된 통계학적 추출 방법을 사용한다. 비분쟁 특허 집합의 샘플 크기는 클수록 좋으나, 분쟁 특허 집합의 크기와 컴퓨팅 파워 등을 고려하여 결정하나, 분쟁 특허 집합의 크기보다 같거나 크게 설정할 필요가 있다. 층화 추출 방식을 선택할 경우, 비례 층화 추출이 타당한데, 비례 층화 추출은 상기 분쟁 특허군을 구성하는 분쟁 특허의 등록 연도나 출원 연도 등의 시간 분포 정보 및 특허 분류를 기준으로 하는 기술 분포 정보 중 어느 하나 이상을 고려하여 실행될 수 있다. 한편, 비례 층화 추출 시 특허 권리자가 분쟁을 다수 일으킨 경우, 그 권리자의 비분쟁 특허도 다수 추출할 수 있는 등과 같이 상기 분쟁 특허군을 구성하는 분쟁 특허의 권리자별 분쟁 특허수가 고려되는 층화 추출이 일부 실시될 수도 있고, 분쟁 발생 연도별 분쟁 특허 수를 일부 고려하는 층화 추출이 일부 실시될 수도 있을 것이다.In order to generate a dispute prediction model, a dispute patent set and a non-dispute patent set are generated first. A dispute patent set is a set of disputed patents. The non-dispute patent set refers to patents that do not cause a dispute, and since the majority of non-dispute patents are used, the non-disputed patent set used for generating a dispute prediction model is separately configured through sampling. Sampling for generation of a non-dispute patent set for all non-dispute patents determines the sample size and then uses 1) random extraction, 2) stratified extraction, 3) or a predetermined statistical extraction method. The larger the sample size of the non-dispute patent set is, the better, but the decision is made in consideration of the size and computing power of the dispute patent set, but it is necessary to set the same or larger than the size of the dispute patent set. When the stratification extraction method is selected, proportional stratification extraction is appropriate. The proportional stratification extraction is any one of time distribution information such as the registration year or filing year of the disputed patent constituting the disputed patent group, and the technology distribution information based on the patent classification. It may be implemented in consideration of one or more. On the other hand, if the patent owner has caused a large number of disputes during the proportional stratification extraction, the stratification extraction in which the number of disputed patents by the right holders of the disputed patents constituting the disputed patent group is considered, such as extracting a large number of non-disputed patents of the rights holder, is partially. Some stratified extractions may be implemented, taking into account some of the number of disputed patents by year of dispute.

상기 분쟁 예측 모델 생성 엔진(5500)은 상기 분쟁 특허 집합과 상기 비분쟁 특허 집합을 각각 또는 서로 합친 다음 적어도 2 이상으로 분할하여, 분할된 제1 분쟁 특허 집합과 분할된 제1 비분쟁 특허 집합은 분쟁 예측 모델 생성용으로 사용하고, 분할된 제2 분쟁 특허 집합과 분할된 제2 비분쟁 특허 집합은 생성된 분쟁 예측 모델의 검증용으로 사용할 수 있다. 이때, 제1 분쟁 특허 집합과 제1 비분쟁 특허 집합의 크기가 제2 분쟁 특허 집합과 제2 비분쟁 특허 집합보다 큰 것이 바람직하다.The dispute prediction model generation engine 5500 divides the disputed patent set and the non-disputed patent set, respectively, or combines them, and divides them into at least two or more, so that the divided first dispute patent set and the divided first non-disputed patent set are dispute prediction. The model may be used for model generation, and the divided second dispute patent set and the divided second non-dispute patent set may be used for verifying the generated dispute prediction model. At this time, the size of the first dispute patent set and the first non-dispute patent set is preferably larger than the second dispute patent set and the second non-dispute patent set.

상기 분쟁 예측 요소값 생성부(5510)는 상기 분쟁 특허 집합과 상기 비분쟁 특허 집합을 구성하는 분쟁 특허와 비분쟁 특허에 대하여 분쟁 예측 요소별로 분쟁 예측 요소값을 생성하거나 입수한다.The dispute prediction element value generation unit 5510 generates or obtains a dispute prediction element value for each dispute prediction element for the dispute patent and the non-dispute patent constituting the dispute patent set and the non-dispute patent set.

상기 분쟁 예측 모델 생성부(5520)는 분쟁 특허 집합에 속하는 특허에 분쟁 특허 반응 변수값을 할당하고, 비분쟁 특허 집합에 속하는 특허에는 분쟁 특허 반응 변수값과 다른 비분쟁 특허 반응 변수값을 할당한다. 상기 분쟁 특허 반응 변수값을 할당하는 방법은 크게 1) 모든 분쟁 특허에 동일한 값을 대응시키는 방법, 2) 분쟁 특허의 속성에 따라 다른 값을 대응시키는 방법이 있다. 일반적으로 전자의 경우에는 통계 모델은 분류 모델을 사용하고, 후자의 경우에는 회귀 모델을 사용할 수 있으나, 후자의 경우에도 반응 변수의 설계에 따라 분류 모델이 사용될 수도 있다.The dispute prediction model generation unit 5520 allocates a dispute patent response variable value to a patent belonging to the dispute patent set, and assigns a dispute patent response variable value different from the dispute patent response variable value to a patent belonging to the non-dispute patent set. . The method for allocating the dispute patent response variable value is largely divided into 1) a method of matching the same value to all dispute patents, and 2) a method of matching different values according to the attributes of the dispute patent. In the former case, the statistical model uses a classification model, and the latter case uses a regression model. In the latter case, the classification model may be used depending on the design of the response variable.

분쟁 특허의 속성에 따라 다른 반응 변수값을 대응시키는 방법은 1) 분쟁 특허의 분쟁 제기 회수를 반응 변수값으로 하는 방법, 2) 분쟁 특허의 분쟁 제기 회수를 기준으로 적어도 2 이상의 카테고리로 분류하고, 카테고리 값을 반응 변수값으로 하는 방법(예를 들면, 분쟁 제기 1회, 2회 이상으로 2분화 하는 방법이나 분쟁 특허별로 분쟁 회수의 분포를 고려하여 4분위로 4분화 하는 방법 등과 같은 n 분화 방법이 될 수 있다.), 3) 분쟁 특허의 분쟁 제기 회수 기준 n분위와 분쟁 제기 연도 기준 m 분위를 함께 고려하여 n*m개 셀로 구성된 매트릭스를 구성하고(각 셀의 분쟁 특허 개수는 최대한 동일하게 유지한다.) 각 셀별로 반응 변수값을 대응시키는 방법 등이 될 수 있다. 한편, 분쟁 특허 중 2심 이상과 관련된 분쟁 특허는 심급별로 독립된 분쟁으로 보아 분쟁 회수를 증가시키는 방법 또는 심급에 따라 다른 반응 변수값을 대응시키는 방법 등이 채용될 수 있을 것이다.According to the nature of the disputed patent, a method of matching different response variable values is classified into at least two categories based on 1) the number of disputed claims of the disputed patent as a response variable value, and 2) the number of disputed claims of the disputed patent, N differentiation methods such as the method of using the category value as a response variable value (e.g., dividing the dispute into one or two or more disputes, or by dividing into four quartiles by considering the distribution of disputes by dispute patent). 3) A matrix of n * m cells is formed by considering both the n-quartile based on the number of disputes and the m-quartile based on the disputed year of disputed patents (the number of disputed patents in each cell should be as equal as possible). It may be a method of mapping the response variable value to each cell. On the other hand, the disputed patents related to two or more of the disputed patents can be adopted as a method of increasing the number of disputes, or a method of matching different response variable values depending on the level of the dispute as independent disputes by class.

한편, 광의의 분쟁 특허 중에는 준분쟁 특허가 있을 수 있다. 준분쟁 특허는 분쟁에 준하는 특허로 1) 무효 심판이나 ITC 제소 등과 같은 비사법적인 행정적 분쟁과 관련된 제1종 준분쟁 특허, 2) 발송된 경고장에 포함된 특허나, 로열티 지급 요구를 받는 제2종 준분쟁 특허, 3) 로열티 지급 특허나 크로스 라이센스 체결 대상이 된 제3종 준분쟁 특허, 4) 기타 법률상의 권리 행사를 한 제4종 준분쟁 특허 등이 있을 수 있다. 상기 1)과 관련된 준분쟁 특허는 특허청이나 기타의 방법으로 데이터 수집이 상대적으로 용이한 반면, 2) 내지 4)는 사적인 영역에서 발생한 것으로 데이터 수집의 난감함이 있다. 하지만, 2) 내지 4)는 각 사용자별(예를 들면, 특정 기업 A를 위한 A에 특화된) 분쟁 예측 모델을 생성시킬 때는 각 사용자는 자신과 관련된 2) 내지 4)를 알 수 있으므로, 2) 내지 4)를 분쟁 예측 모델의 생성에 활용할 수 있을 것이다. 2) 내지 4)는 기업 맞춤형 분쟁 예측 모델의 생성에 중요하게 활용될 수 있다.Meanwhile, among the broad dispute patents, there may be a sub-dispute patent. Quasi-dispute patents are patents in accordance with disputes, which include: 1) Class I semi-dispute patents relating to non-judicial administrative disputes, such as invalid trials or ITC complaints; Quasi-dispute patents, 3) royalty-patented patents, third-class quasi-dispute patents subject to cross licensing, and 4) other quasi-dispute patents for exercising legal rights. The quasi-dispute patent related to 1) is relatively easy to collect data by the Patent Office or other methods, while 2) to 4) occurred in a private area, which is difficult to collect data. However, 2) to 4), when creating a conflict prediction model for each user (e.g., specialized for A for a particular company A), each user can know 2) to 4) associated with him, To 4) may be used to generate a dispute prediction model. 2) to 4) can be importantly used for generating a company-specific dispute prediction model.

상기 준분쟁 특허에도 분쟁 특허(사법적 분쟁이 발생한 특허)과 동일한 반응 변수값을 대응시키는 방법과 분쟁 특허와 다른 반응 변수값을 대응시키는 방법이 있을 수 있는데, 후자가 상대적으로 더 바람직하다. 반응 변수값 대응의 가장 간단한 일례로, 분쟁 특허와 준분쟁 특허에는 모두 1을, 비분쟁 특허에는 모두 0을 대응하는 방법이 있을 수 있을 것이다.The quasi-dispute patent may also have a method of matching a response variable value identical to a dispute patent (patent in which a judicial dispute has occurred) and a method of matching a response patent value with another response variable value. The latter is more preferable. As the simplest example of response value response, there may be a method of allocating 1 for both disputed and semi-disputed patents and 0 for non-disputed patents.

상기 분쟁 예측 모델 생성부(5520)는 상기와 같이 분쟁 특허와 비분쟁 특허에 대한 설명 변수값인 분쟁 예측 모델별 분쟁 예측 모델값과 반응 변수값이 대응되게 되면, 상기 설명 변수값과 반응 변수값을 대상으로 기 설정된 적어도 하나 이상의 통계적 모델링 기법을 적용하여 분쟁 예측 모델을 생성하게 된다.The dispute prediction model generation unit 5520, when the dispute prediction model value corresponding to the dispute prediction model value and the response variable value corresponding to the explanatory variable value for the dispute patent and non-dispute patent, as described above, the explanatory variable value and the response variable value A conflict prediction model is generated by applying at least one statistical modeling method that is set in advance.

분쟁 특허와 준분쟁 특허에는 반응 변수값으로 모두 1을, 비분쟁 특허에는 모두 0이 대응되는 경우, 분쟁 예측 모델을 생성하는 것은 전형적인 분류(classification) 모델을 생성하는 것이 된다. 상기 분쟁 예측 모델 생성부(5520)가 분류 모델을 생성하는 방법은 다양한 통계적 기법이 동원될 수 있으며, 이러한 통계적 기법은 모두 본 발명에서 채용될 수 있으며, 본 발명의 통계적 처리 방법에 포함되며, 이러한 통계적 기법에 대해 상세히 설명하지 않는다고 하여 본 발명에서 배제하는 것은 아님은 분명할 것이다. If the dispute patent and the semi-dispute patent all correspond to 1 as the response variable value and all the non-dispute patents correspond to 0, generating the dispute prediction model is to generate a typical classification model. The method for generating a classification model by the dispute prediction model generator 5520 may employ various statistical techniques, all of which may be employed in the present invention, and are included in the statistical processing method of the present invention. It will be clear that the lack of a detailed description of the statistical technique does not exclude the present invention.

본 명세서에서는 분류 모델의 생성 방법의 한 예로 기계 학습 기법 트리(tree)를 바탕으로 한 앙상블(ensemble) 기법 중 부스팅(boosting) 기법을 사용하여 분류 모델을 생성하는 방법에 대해 상세하게 설명한다.(앙상블 기법을 사용하는 다른 방법으로 random forest 기법이 있다.) 도 27은 Gradient Boosting algorithm 알고리즘에 대한 요약을 설명하는 도면이며, 도 28은 Friedman(2002)이 새로이 stochastic gradient boosting algorithm을 제안한 알고리즘에 대한 요약을 설명하는 도면이며, 도 29는 본 발명의 특허 분쟁 예측 정보 생성 시스템(5000)의 분쟁 예측 모델 생성 과정에 관한 일 실시예적 개념을 설명하는 도면이다. In this specification, as an example of generating a classification model, a method of generating a classification model using boosting among ensemble techniques based on a machine learning technique tree will be described in detail. Another method using the ensemble technique is the random forest technique. FIG. 27 is a diagram illustrating a summary of the Gradient Boosting algorithm algorithm, and FIG. 28 is a summary of an algorithm proposed by Friedman (2002) for a newly stochastic gradient boosting algorithm. 29 is a view for explaining an exemplary concept of a dispute prediction model generation process of the patent dispute prediction information generation system 5000 of the present invention.

이어, 도 29 및 도 31을 참조하면서, 본 발명의 분쟁 예측 모델 생성부(5520)의 정보 처리 방법을 상세하게 설명한다. 상기 분쟁 예측 모델 생성부(5520)는 분쟁 특허들과 비분쟁 특허들에 대하여, 설명 변수인 분쟁 예측 요소별 분쟁 예측 요소값 생성(SLBO11)하고, 분쟁 특허에 분쟁 특허 대응 반응 변수값 설정하고, 비분쟁 특허에 비분쟁 특허 대응 반응 변수값 설정(SLBO12)한다. 이어, 상기 분쟁 예측 모델 생성부(5520)는 설명 변수들 중 어느 하나 이상에 대한 제1 스텀프를 생성(SLBO13)하고, 생성된 제1 스텀프로 구성된 제1 스텀프 집합으로 분쟁 예측 모델값을 생성(SLBO14)하고, 분쟁 예측 모델값을 검증(SLBO15)한다. 검증 결과 기 설정된 기준을 충족(SLBO16)하지 못하면, 트리 중요도 결정 후 오분류 특허 대상 가중치를 조절(SLBO17)하고, 설명 변수에 대한 제2 스텀프를 생성하고, 제1 스텀프 및 제2 제 2 스텀프로 분쟁 예측 모델값을 생성하고 분쟁 예측 모델값을 검증하고, 검증 결과 기 설정된 기준을 충족할 때까지 스텀프의 생성과, 생성된 스텀프들을 사용하여 분쟁 예측 모델값의 생성 및 분쟁 예측 모델값의 검증을 반복한다. 검증 결과 기 설정된 기준을 충족하는 경우, 그때까지 생성된 스텀프들을 접합하여 분쟁 예측 모델 결정(SLBO18)한다. Next, the information processing method of the dispute prediction model generation part 5520 of this invention is demonstrated in detail, referring FIG. 29 and FIG. The dispute prediction model generation unit 5520 generates a dispute prediction element value for each dispute prediction element, which is an explanatory variable, for dispute patents and non-dispute patents, and sets a dispute patent response response variable value for the dispute patent, Non-dispute patent-response response variable values are set (SLBO12). Subsequently, the dispute prediction model generator 5520 generates a first stump for any one or more of the explanatory variables (SLBO13), and generates a dispute prediction model value using a first set of stumps configured with the generated first stump ( SLBO14) and the dispute prediction model value is verified (SLBO15). If the verification result does not meet the preset criteria (SLBO16), after determining the tree importance, adjust the misclassification patent target weight (SLBO17), generate a second stump for the explanatory variable, and use the first stump and the second second stump. Generate the dispute prediction model value, verify the dispute prediction model value, and generate the stump until the verification result meets the preset criteria, and generate the dispute prediction model value and verify the dispute prediction model value using the generated stumps. Repeat. If the verification result meets the preset criteria, the generated stumps are then joined to determine the dispute prediction model (SLBO18).

상기 분쟁 예측 모델 생성부(5520)는 설명 변수별로 스텀프(stump)를 생성한다. 경우에 따라서 스텀프 말고도 일반적인 트리(tree)를 사용할 수도 있다. 상기 분쟁 예측 모델 생성부(5520)는 분쟁 특허 집합을 구성하는 분쟁 특허와 비분쟁 특허 집합을 구성하는 비분쟁 특허의 설명 변수값을 분석하여 중요한 분쟁 예측 요소가 포함된 적어도 하나 이상의 스텀프를 생성한다. 스텀프는 1) 분쟁 예측 요소, 2) 분기(split) 지점, 3) 좌측 노드 예측값, 4) 우측 노드 예측값, 5) 분기 적용 불가 시의 예측값 등과 같이 5가지 정보를　 포함하여 구성된다. 분기를 생성하는 규칙(rule)은 주어진 손실 함수(Loss Function)를 최소화 하도록 분기 기점과 예측값을 찾도록 하는 것이 바람직하다. 상기의 경우, Loss Function 으로 Deviance나 Exponential loss 함수 등을 사용할 수 있다.The dispute prediction model generator 5520 generates a stump for each explanatory variable. In some cases, you can use generic trees in addition to stumps. The dispute prediction model generation unit 5520 generates at least one stump including an important dispute prediction element by analyzing the values of explanatory variables of the dispute patent constituting the dispute patent set and the non-dispute patent constituting the non-dispute patent set. . The stump includes five pieces of information, such as 1) conflict prediction factor, 2) split point, 3) left node prediction value, 4) right node prediction value, and 5) prediction when branch is not applicable. The rule for generating branches preferably seeks to find branch origins and prediction values to minimize a given loss function. In this case, a loss function or a loss function can be used as a loss function.

도 32는 생성된 스텀프의 일례를 보여 주고 있으며, 하기 표 5는 상기 스텀프 정보의 일례를 보여 주고 있다.32 shows an example of the generated stump, and Table 5 below shows an example of the stump information.

　 SplitVarSplitvar SplitCodePredSplitCodePred LeftNodeLeftnode RightNodeRightNode MissingNodeMissingnode ErrorReductionErrorReduction WeightWeight PredictionPrediction 00 1111 0.50.5 1One 22 33 802.9285468802.9285468 2266622666 0.0001283210.000128321 1One -1-One 0.0073534690.007353469 -1-One -1-One -1-One 00 1332713327 0.0073534690.007353469 22 -1-One -0.010182157-0.010182157 -1-One -1-One -1-One 00 93399339 -0.010182157-0.010182157 33 -1-One 0.0001283210.000128321 -1-One -1-One -1-One 00 2266622666 0.0001283210.000128321

상기 표 5에서 SplitVar는 분기되는 설명 변수(분쟁 예측 요소)를 말하며, 11이란 값은 11번째 설명 변수를 말한다. SplitCodePred는 설명 변수가 분기되는 분기 지점을 말하며, 0.5는 분기 지점값을 말한다. LeftNode는 좌측 노드를 표시하는 것으로 1이 대응되어 있으며, 1로 시작하는 세번째 행은 좌측 분기 시에 대한 정보를 알려 준다. 좌측 분기 시는 X11 <= 0.5인 경우로, 이 경우의 예측값은 0.007353469로 되며, 도 32에서는 0.007353을 표시되어 있다. RightNode는 우측 노드로 네번째 행에 이 경우의 정보값이 나타나 있으며, MissingNode는 분기 적용 불가(missing) 시에 대한 것으로 다섯번째 행에 이 경우의 정보값이 나타나 있다.In Table 5, SplitVar refers to a branching explanatory variable (disput prediction element), and a value of 11 refers to the eleventh explanatory variable. SplitCodePred is the branch point where the explanatory variable is branched, and 0.5 is the branch point value. LeftNode represents the left node, where 1 corresponds, and the third row starting with 1 gives information about the left branch time. In the left branch, X11 <= 0.5, the predicted value in this case is 0.007353469, and 0.007353 is shown in FIG. RightNode is the right node and the information value in this case is shown in the fourth row, and MissingNode is for branching when missing. The information value in this case is shown in the fifth row.

하기 표 6은 상기 표 5의 정보를 포함하여 다수의 스텀프가 생성되는 예시를 보여 준다.Table 6 below shows an example of generating a plurality of stumps, including the information of Table 5.

SplitVarSplitvar SplitCodePredSplitCodePred LeftNodeLeftnode RightNodeRightNode MissingNodeMissingnode ErrorReductionErrorReduction WeightWeight 1111 0.50.5 1One 22 33 802.9285802.9285 2266622666 -1-One 0.0073530.007353 -1-One -1-One -1-One 00 1332713327 -1-One -0.01018-0.01018 -1-One -1-One -1-One 00 93399339 -1-One 0.0001280.000128 -1-One -1-One -1-One 00 2266622666 1111 0.50.5 1One 22 33 773.5689773.5689 2266622666 -1-One 0.0070460.007046 -1-One -1-One -1-One 00 1323813238 -1-One -0.01016-0.01016 -1-One -1-One -1-One 00 94289428 -1-One -0.00011-0.00011 -1-One -1-One -1-One 00 2266622666 1111 0.50.5 1One 22 33 785.034785.034 2266622666 -1-One 0.0070550.007055 -1-One -1-One -1-One 00 1324213242 -1-One -0.0103-0.0103 -1-One -1-One -1-One 00 94249424 -1-One -0.00016-0.00016 -1-One -1-One -1-One 00 2266622666

표 6에서 알 수 있듯이, 4개의 행이 하나의 스텀프에 대응된다. 상기 표 6에서는 설명 변수 11번에 대하여 적어도 3개 이상의 스텀프가 생성되고 있음을 보여 준다. 이와 같이 하나의 설명 변수에 대하여 하나 이상의 스텀프가 생성되는 경우가 다수 있게 된다.As can be seen in Table 6, four rows correspond to one stump. Table 6 shows that at least three stumps are generated for the explanatory variable 11. As described above, one or more stumps are generated for one explanatory variable.

상기 분쟁 예측 모델 생성부(5520)는 최초 스텀프 S1를 적용하여 분쟁 특허의 분쟁 예측 요소값과 비분쟁 특허의 분쟁 예측 요소값을 사용하여 분쟁 특허 집합의 분쟁 특허와 비분쟁 특허 집합의 비분쟁 특허별로 예측값(S1이 적용된 상기 3) 좌측 노드 예측값, 상기 4) 우측 노드 예측값 또는 상기 5) 분기 적용 불가 시의 예측값 중 어느 하나가 예측값이 된다.)을 생성한다. 최초 스텀프 S1은 최초의 분쟁 예측 모델 후보가 된다. 예측값은 모델 생성에 사용된 분쟁 특허와 비분쟁 특허마다 생성될 것이다. 이때, 상기 분쟁 예측 모델 생성부(5520)는 상기 분쟁 특허에 대한 반응 변수값과 분쟁 특허에 대한 예측값, 비분쟁 특허에 대한 반응 변수값과 비분쟁 특허에 대한 예측값을 분석하여 오분류율과 같은 예측 오류 정보를 생성해 낸다. 상기 분쟁 예측 모델 생성부(5520)는 tree의 중요도를 결정 후 잘못 분류한 분쟁 특허와 잘못 분류한 비분쟁 특허에 대하여 가중치(weight)을(강제적으로) 높이는 방법(예를 들면 빈도(frequency)를 높이는 등, 예를 들어 특정 특허 Pi가 분쟁 특허인데, 비분쟁 특허로 분류하면, Pi의 빈도를 1에서 그 이상으로 높임)을 사용하여 분쟁 특허 집합 데이터와 비분쟁 특허 집합 데이터를 재조정한다. 상기 재조정된 분쟁 특허 집합과 비분쟁 특허 집합에 대하여 두번째 스텀프인 S2를 생성해 낸다. 두번째 분쟁 예측 모델 후보는 S1과 S2로 이루어 진다. 이때, 상기 분쟁 예측 모델 생성부(5520)는 S1rhk S2를 적용하여 분쟁 특허와 비분쟁 특허의 예측값을 생성해 낸다. 상기 분쟁 특허와 상기 비분쟁 특허에 대하여, 분쟁 특허의 반응 변수값 및 비분쟁 특허의 반응 변수값과 S1과 S2로 구성되는 두번째 분쟁 예측 모델 후보가 생성한 예측값인 분쟁 예측 모델값을 비교 분석하여 예측 오류 정보를 생성한다. 상기 생성된 예측 오류 정보를 고려하여 분쟁 특허와 비분쟁 특허 데이터를 재조정한다. 상기와 같이 i 번째 스텀프 집합(SSi, ith Stump Set, S1 ~Si의 스텀프로 구성되는 집합으로 i번째 분쟁 예측 모델 후보가 된다.)으로 분쟁 특허 집합과 비분쟁 특허 집합을 구성하는 각 특허 Pi를 대상으로 예측값을 생성하고, 생성된 예측값과 Pi의 반응 변수값을 비교하여 오류 예측 정보를 생성하고, 생성된 오류 예측 정보를 참조하여 상기 분쟁 특허 집합 데이터와 비분쟁 특허 집합 데이터를 재조정하여 S_i ₊₁을 생성하고 i+1번째 스텀프 집합을 사용하는 i+1번째 분쟁 예측 모델 후보가 생성되게 된다.The dispute prediction model generation unit 5520 may apply the first stump S1 and use the dispute prediction element value of the dispute patent and the dispute prediction element value of the non-dispute patent to set the dispute patent of the dispute patent set and the non-dispute patent of the non-dispute patent set. The prediction value S1 is applied to any one of the 3) left node prediction value, the 4) right node prediction value, and 5) the prediction value when the branch is not applicable. The first stump S1 becomes the first dispute prediction model candidate. Prediction values will be generated for each disputed and non-disputed patent used to generate the model. In this case, the dispute prediction model generation unit 5520 analyzes the response variable value for the dispute patent, the prediction value for the dispute patent, the response variable value for the non-dispute patent, and the prediction value for the non-dispute patent, such as a misclassification rate. Produce predictive error information. The dispute prediction model generation unit 5520 determines a method of increasing the weight (forcibly) of weights (for example, frequency) for misclassified dispute patents and misclassified non-disputed patents after determining the importance of the tree. For example, if a specific patent Pi is a disputed patent and is classified as a non-disputed patent, the frequency of Pi is increased from 1 to more), and the disputed patent set data and the non-disputed patent set data are readjusted. A second stump S2 is generated for the readjusted disputed patent set and the non-disputed patent set. The second dispute prediction model candidate consists of S1 and S2. At this time, the dispute prediction model generation unit 5520 generates a prediction value of the dispute patent and non-dispute patent by applying S1rhk S2. Regarding the disputed patent and the non-disputed patent, a comparison between the response variable value of the disputed patent and the response variable value of the non-disputed patent and a dispute prediction model value which is a prediction value generated by the second dispute prediction model candidate consisting of S1 and S2 Generate prediction error information. The dispute patent and non-dispute patent data are readjusted in consideration of the generated prediction error information. As described above, each patent Pi constituting the disputed patent set and the non-disputed patent set is composed of the i-th stump set (SSi, ith Stump Set, a set consisting of the stumps of S1 to Si, and the candidate for the i-th dispute prediction model). to generate a predicted value, and by comparing the response variable value of the generated prediction value and Pi generating the error prediction information, and with reference to the generated error prediction information readjust the dispute Patent data set and non-conflict Patent data set S _i ₊ generating a _first and a (i + 1) th candidate conflict prediction model using the (i + 1) th set are generated stump.

상기와 같은 과정을 통하여 상당히 큰 n개의 스텀프 S1, S2, , Sn로 구성되는 스텀프 집합이 생성된다. 상기 각 스텀프 Si에 대하여, 1) 분쟁 예측 요소, 2) 분기(split) 지점, 3) 좌측 노드 예측값, 4) 우측 노드 예측값, 5) 분기 적용 불가 시의 예측값이 대응되게 된다. n개의 Si별 상기 1) 내지 5)가 분쟁 예측 모델 후보가 된다. Through the above process, a stump set composed of significantly large n stumps S1, S2, and Sn is generated. For each of the stumps Si, 1) a dispute prediction element, 2) a split point, 3) a left node prediction value, 4) a right node prediction value, and 5) a prediction value when branch inapplicability is applied. 1) to 5) for each of n Sis are candidates for dispute prediction model.

상기 분쟁 예측 모델 후보가 있는 경우, 상기 분쟁 예측 모델 후보를 사용하는 방법은 다음과 같다. 상기 분쟁 예측 모델 생성부(5520)는 임의의 특허 Pi에 대하여, Pi의 분쟁 예측 요소별 분쟁 예측 요소값을 상기 분쟁 예측 모델 후보에 입력한다. 상기 분쟁 예측 모델 생성부(5520)는 분쟁 예측 요소별로 분쟁 예측 요소값을 해당되는 적어도 하나 이상의 스텀프(아주 특별한 경우에는 해당되는 스텀프가 없을 수도 있으며, 이 경우 그 분쟁 예측 요소는 모델에서 사용되지 않는 분쟁 예측 요소가 된다.)별로 적용하여 스텀프별로의 예측값을 생성한다. 예측값은 상기 3) 좌측 노드 예측값, 상기 4) 우측 노드 예측값 또는 상기 5) 분기 적용 불가 시의 예측값 중 어느 하나가 된다. 생성된 예측값을 모든 스텀프에 대하여 합산하면 분쟁 예측 모델값이 생성되게 된다. 즉, 상기 분쟁 예측 모델 생성부(5520)는 상기 분쟁 예측 요소 후보인 스텀프 집합에 Pi의 분쟁 예측 요소별 분쟁 예측 요소값을 적용하여 생성되는 모든 예측값을 합산하는 방식으로 Pi별로 분쟁 예측 모델값을 생성한다. 분쟁 특허 반응 변수값을 1로 하고, 비분쟁 특허 반응 변수값을 0으로 할 경우, Pi별 분쟁 예측 모델값은 0에서 1사이 값이 된다. 이때, Pi의 분쟁 예측 모델값이 0.5를 기준으로 0.5 미만이면 Pi를 비분쟁 특허로 예측하는 것으로 하고, 0.5 이상이면 분쟁 특허로 예측하는 것으로 분류할 수 있다. 한편, Pi는 분쟁 특허이거나 비분쟁 특허이므로 Pi에 대응되는 반응 변수값이 참값이 되고, 이 참값과 예측값은 차이가 있게 되는 것이 일반적이다.If there is a dispute prediction model candidate, the method of using the dispute prediction model candidate is as follows. The dispute prediction model generation unit 5520 inputs a dispute prediction element value for each dispute prediction element of Pi into the dispute prediction model candidate for any patent Pi. The dispute prediction model generation unit 5520 may determine a dispute prediction element value for each dispute prediction element at least one or more stumps (in a very special case, there may be no corresponding stump, in which case the dispute prediction element is not used in the model). To generate a prediction value for each stump. The predicted value is any one of the 3) left node predicted value, the 4) right node predicted value, and the 5) branch predictable value. When the generated prediction value is summed for all the stumps, a dispute prediction model value is generated. That is, the dispute prediction model generator 5520 may add a dispute prediction model value for each Pi in such a manner as to add all the predicted values generated by applying the dispute prediction element values for each dispute prediction element of Pi to the stump set which is the dispute prediction element candidate. Create When the dispute patent response variable value is 1 and the non-dispute patent response variable value is 0, the dispute prediction model value for each Pi becomes a value between 0 and 1. At this time, if the dispute prediction model value of Pi is less than 0.5 on the basis of 0.5, Pi may be predicted as a non-dispute patent, and if it is 0.5 or more, it may be classified as predicting a dispute patent. On the other hand, since Pi is a disputed patent or a non-dispute patent, the response variable value corresponding to Pi becomes a true value, and this true value and a predicted value are generally different.

이어, 상기 분쟁 예측 모델 생성부(5520)가 최적 또는 타당성 높은 분쟁 예측 모델을 생성하는 방법에 대해 설명한다. n개의 스텀프를 사용하는 분쟁 예측 모델 후보가 있을 경우, 분쟁 예측 모델을 생성하는 것은 교차 검증(Cross Validation)을 통해서 생성한다. 통상적으로 n이 클수록 분쟁 예측 모델 후보를 생성하는 train set(분쟁 특허와 비분쟁 특허로 구성됨)에서의 예측 성능은 높아지나, n이 과도하게 커지면 과적합(over fitting)이 발생하여 test set(분할된 제2 분쟁 특허 집합과 분할된 제2 비분쟁 특허 집합이 생성된 분쟁 예측 모델 후보의 검증에 사용되면 이들이 test set이 될 수 있다.)에서의 예측 성능이 낮아지는 문제가 있게 된다. 따라서, 적정한 크기의 n을 결정하는 것이 중요하게 된다. 상기 분쟁 예측 모델 생성부(5520)는 생성된 분쟁 예측 모델 후보 M1, M2, , Mi에 대하여, test set에 포함되는 개별 특허 Pi에 대하여 분쟁 예측 모델값 Mi(Pi)를 생성한다. Mi(Pi)란 분쟁 예측 모델 후보 Mi에 Pi의 분쟁 예측 요소별 분쟁 예측 요소값을 적용하여 생성된 예측값을 말한다. 한편, 상기 Pi의 반응 변수값을 Y(Pi)라 한다. 상기 반응 변수값이 O과 1인 경우, Y(Pi)는 0 또는 1이 된다. 이어, 상기 분쟁 예측 모델 생성부(5520)는 test set을 구성하는 모든 특허 Pi에 대하여 Y(Pi), Mi(Pi) 값을 구하고 이 값들을 주어진 손실함수에 대입하여 그 합을 최소로 하는 Mi를 선정한다.Next, a method of generating a dispute prediction model having the optimum or validity by the dispute prediction model generator 5520 will be described. When there are candidate candidates for the conflict prediction model using n stumps, the generation of the conflict prediction model is generated through cross validation. In general, the larger n, the higher the predictive performance of a train set (consisting of disputed patents and non-disputed patents) that generate candidates for conflict prediction models.However, excessively large n results in overfitting, resulting in a test set. If the second dispute patent set and the divided second non-dispute patent set are used to verify the generated dispute prediction model candidates, they may be test sets. Therefore, it is important to determine n of the appropriate size. The dispute prediction model generation unit 5520 generates a dispute prediction model value Mi (Pi) for each patent Pi included in the test set with respect to the generated dispute prediction model candidates M1, M2, and Mi. Mi (Pi) refers to a prediction value generated by applying a dispute prediction element value for each dispute prediction element of Pi to a dispute prediction model candidate Mi. On the other hand, the reaction variable value of Pi is referred to as Y (Pi). When the response variable values are 0 and 1, Y (Pi) is 0 or 1. Subsequently, the dispute prediction model generating unit 5520 obtains Y (Pi) and Mi (Pi) values for all patents Pi constituting the test set, and substitutes these values into a given loss function to minimize Mi. Select.

도 30에는 과적합의 개념이 표현된 개념도를 예시하고 있다. 도 30에서 붉은색 점과 푸른색 점을 구분하는 모델의 생성에서, 녹색 선과 같은 모델을 생성하면, train set에서는 굉장이 잘 구분하지만, 이런 경우 모델 생성에 참여하지 않은 test set에서는 오히려 예측 성능이 떨어지는 문제가 있게 된다. 이때, 검은색 선과 같은 모델을 생성하면 적정한 분류 모델이 되게 된다.30 illustrates a conceptual diagram in which the concept of overfit is expressed. In the generation of a model that distinguishes a red dot from a blue dot in FIG. 30, if a model such as a green line is generated, the train set is distinguished very well. However, in this case, the predictive performance of the test set does not participate in the model generation. There is a problem of falling. At this time, if a model such as a black line is generated, it becomes an appropriate classification model.

이어, shrinkage parameter에 대해서 설명한다. 상기 tree의 중요도를 결정하는 것은 tree 중요도 결정 알고리즘이 수행하는데, tree 중요도 결정 알고리즘에서 나온 tree 중요도 결정값에 shrinkage parameter를 곱하는 등과 같이 shrinkage parameter를 예측 모델 생성에 도입하면 모형의 성능이 좋아지게 된다. 상기 tree 중요도 결정 알고리즘에서 tree 중요도는 상기 분쟁 예측 요소별로 분기 지점을 변동시켜 가면서 교차 검증(Cross validation) 과정을 실행하면서 결정된다.Next, the shrinkage parameter will be described. The tree importance determination algorithm is used to determine the importance of the tree. The performance of the model is improved by introducing the shrinkage parameter into the prediction model such as multiplying the tree importance determination value derived from the tree importance determination algorithm by the shrinkage parameter. In the tree importance determination algorithm, tree importance is determined by performing a cross validation process while varying branch points for each conflict prediction element.

상기 분쟁 예측 모델 생성부(5520)가 분쟁 예측 모델을 생성할 때, 분쟁 특허 집합의 크기가 충분히 큰 경우에는(전체 특허 중 분쟁이 발생하지 않은 특허가 대다수이므로, 비분쟁 특허 집합의 크기는 충분히 크게 할 수 있다.) 상기 분쟁 특허 집합을 분할해도 되지만, 크기가 충분하지 않을 경우에는 n-fold Cross validation 기법을 사용할 수 있다. 도 33은 5-fold Cross validation 기법을 도식적으로 보여 주고 있다. 상기 분쟁 예측 모델 생성부(5520)는 분쟁 특허와 비분쟁 특허가 섞여 있는 자료를 5개의 집합으로 나눈 다음 각각의 경우마다 4개의 집합으로 모형을 구축하고 나머지 하나의 집합으로 test error을 구해서 이 오류가 가장 작은 모형을 선택하는 방법과 같이 n-fold Cross validation 방법을 실시할 수 있다.When the dispute prediction model generation unit 5520 generates the dispute prediction model, if the size of the dispute patent set is large enough (the majority of the patents do not have disputes among all patents, the size of the non-dispute patent set is sufficient). The dispute patent set may be divided. However, if the size is not sufficient, the n-fold cross validation technique may be used. 33 illustrates a 5-fold cross validation scheme. The dispute prediction model generation unit 5520 divides the data containing the disputed patent and the non-dispute patent into five sets, and then builds a model with four sets in each case and obtains a test error with the other set. N-fold Cross validation method can be implemented as the method of selecting the smallest model.

상기에서는 2개의 범주에 대한 분류 모델에 대해서 설명하였다. 한편, n개의 범주(비분쟁, 분쟁, 다분쟁 등)에 대해서도 상기와 같은 분류 모델 방법이 적용될 수 있을 것이다. 분쟁 특허가 분쟁을 제기한 회수를 기준으로 1~기 설정된 값의 범위를 가지는 경우 분쟁으로 분류하고, 기 설정된 값을 초과할 경우에는 다분쟁으로 분류할 수 있을 것이다. 이 경우, 비분쟁에 대하여 반응 변수값을 0, 분쟁에 대해서는 1, 다분쟁에 대해서는 2를 대응시킬 수 있을 것이다. 이때, 상기 2는 숫자적 의미의 2라기 구분되는 범주로서 비분쟁도 아니면, 분쟁과는 다른 별개의 속성을 가지는 분류 집단으로 취급되는 것이 더 타당할 것이며, 2가 아닌 3이나 다른 수치값이 반응 변수값으로 할 수도 있을 것이다. 이 경우 반응 변수를 벡터로 표현할 수도 있다. 이를테면(1, 0, 0)인 자료는 반응변수가 첫 번째 범주에 속하는 자료를(0, 1, 0)인 자료는 반응변수가 두 번째 범주에 속하는 자료를 나타낸다.In the above, the classification model for the two categories has been described. Meanwhile, the classification model method described above may be applied to n categories (non-dispute, dispute, multi-dispute, etc.). Disputes Patents can be classified as disputes if they have a range of 1 to a predetermined value, and if they exceed a predetermined value, they can be classified as multiple disputes. In this case, the response variable value may be 0 for non-dispute, 1 for dispute, and 2 for multidispute. In this case, 2 is a category distinguished by 2 as a numerical meaning, and it may be more appropriate to treat it as a non-dispute or a classification group having an attribute different from the dispute, and 3 or another numerical value other than 2 responds. It could be a variable value. In this case, the response variable may be expressed as a vector. For example, data with (1, 0, 0) represents data in which the response variable is in the first category, and data with (0, 1, 0) represents data in the second category.

이어, 도 34를 참조함녀서, A, B, C 3개의 범주를 대상으로 분류 모델의 생성 방법으로 n개의 분류 모델을 상기 분쟁 예측 모델을 생성하는 방법에 대해서 설명한다. 상기 도 32에서와는 달리 범주가 3개 이상인 경우는 각 범주마다 스텀프를 생성한다. 즉, A범주에 대한 스텀프, B범주에 대한 스텀프, C범주에 대한 스텀프를 각각 생성한다. 그 후 각각의 스텀프로부터 해당 범주에 속할 확률을 계산하여 확률이 가장 큰 범주로 각각의 자료를 할당한다. 예를 들어, 특정 자료 하나를 3개의 스텀프에 대입하여 각각의 예측값이-0.567, 0.456, 0.234 가 나왔다고 하면 범주B에 대한 예측값이 가장 크므로 이 자료는 B범주에 속할 확률이 가장 높다. 따라서 이 자료는 B범주로 분류하게 된다.Next, referring to FIG. 34, a method of generating the dispute prediction model using n classification models as a method of generating a classification model for three categories A, B, and C will be described. Unlike in FIG. 32, when there are three or more categories, a stump is generated for each category. That is, a stump for category A, a stump for category B, and a stump for category C are generated, respectively. Then, from each stump, we calculate the probability of belonging to that category and assign each piece of data to the category with the highest probability. For example, if one particular data is assigned to three stumps, and each of the predictions is -0.567, 0.456, and 0.234, the predictions for category B are the highest, so this data is most likely in category B. Therefore, this data is classified as category B.

이어, 도 35를 참조하면서, 회귀(regression) 모델에 대해 설명한다. 회귀 모델은 반응 변수값이 연속형일 경우에 사용되는 통계 모형이다. 상기 분쟁 특허를 준분쟁 특허와 분쟁 특허로 대별하고, 준분쟁 특허는 분쟁과 관련하여 분쟁 특허에 비해 기 설정되는 중요도(예를 들어, 70%)를 가진다고 가정할 수 있다. 이 경우, 비분쟁 특허에 0, 준분쟁 특허에 0.7, 분쟁 특허에 1이라는 반응 변수값을 할당할 수도 있으며, 비분쟁 특허에 0, 준분쟁 특허에 1.4, 분쟁 특허 2라는 값을 할당할 수도 있을 것이다. 한편, 다분쟁 특허에 대하여, 분쟁마다 1을 할당할 수도 있을 것이며, 이 경우, 비분쟁 특허에 0, 1회 분쟁 제기 특허 1, 2회 분쟁 제기 특허 2, .., n회 분쟁 제기 특허는 n이라는 반응 변수값을 할당할 수 있을 것이다. 한편, 분쟁 회수를 n개의 범주로 구획하고, 각 범주에 대하여 1회 분쟁 제기 특허에 대한 기 설정된 중요도값(2~5회는 2, 6~20회는 3, 20회 초과는 3 등)을 할당하는 경우, 비분쟁 특허에 0, 1회 분쟁 제기 특허 1, 2~5회 분쟁 제기 특허는 2, 6~20회 분쟁 제기 특허는 3, 20회 초과 분쟁 제기 특허는 3이라는 값을 반응 변수값으로 할당할 수 있을 것이다. 한편, 준분쟁 특허에 대해서는 0.7과 같은 값을 할당할 수 있을 것이며, 준분쟁의 종류별로 중요도를 다르게 취급할 경우, 준분쟁 특허의 종류별로 다른 반응 변수값을 할당할 수 있을 것이다. Next, a regression model will be described with reference to FIG. 35. The regression model is a statistical model used when the response values are continuous. The dispute patent can be divided into semi-dispute patents and dispute patents, and it can be assumed that semi-dispute patents have a predetermined importance (for example, 70%) relative to dispute patents in relation to disputes. In this case, a response variable value of 0 may be assigned to the non-dispute patent, 0.7 to the semi-dispute patent, and 1 to the dispute patent. The values of 0 may be assigned to the non-disputed patent, 1.4 to the semi-disputed patent, and 2 may be assigned to the disputed patent 2, respectively. There will be. On the other hand, for multi-dispute patents, one may be assigned to each dispute. In this case, 0, 1-time dispute patents 1, 2-time dispute patents 2, .., n-time dispute dispute patents We can assign the response variable n. On the other hand, the number of disputes is divided into n categories, and for each category, the predetermined importance value for each dispute-patented patent (2 to 5 times, 3 to 6 to 20 times, 3 to 20 times, etc.) If assigned, non-disputed patents have 0, 1 disputed patent 1, 2-5 disputed patents, 2, 6-20 disputed patents 3, 20 disputed patents 3 and 3 You can assign it by value. On the other hand, a semi-disputed patent may be assigned a value equal to 0.7, and if the importance is handled differently according to the type of the semi-disputed patent, different response variable values may be assigned to the types of the semi-disputed patent.

회귀 모델에서 분쟁 예측 모델을 생성하는 방법 또한 도 32에서와 같이 분류문제에서 사용한 것과 동일한 스텀프를 사용할 수 있다. 다만, 이 때 사용되는 손실함수는 이항분포의 우도함수로부터 유도된 deviance 함수가 아닌 정규분포로부터 유도된 최소제곱오차를 사용한다. 그리고 예측값의 의미 또한 달라지는데 분류 문제에서의 예측값은 해당 자료의 반응 변수 값이 1일 확률과 관련되 수치이지만 회귀 모형에서의 예측값은 반응변수 자체에 대한 예측값이라고 할 수 있다. 도 35에서, 설명 변수 X3에 대하여, 이 스텀프는 X3의 값이 18.6을 기준으로 분기 되고, 각 분기에 따른 예측값이 표시되어 있음을 알 수 있다.A method of generating a dispute prediction model from a regression model may also use the same stump as used in the classification problem as shown in FIG. 32. However, the loss function used here uses the least square error derived from the normal distribution, not the deviance function derived from the likelihood function of the binomial distribution. And the meaning of the predicted value is also different. The predicted value in the classification problem is related to the probability that the response variable value of the data is 1, but the predicted value in the regression model is the predicted value of the response variable itself. In Fig. 35, for the explanatory variable X3, it can be seen that the value of X3 is branched based on 18.6, and the predicted value for each branch is displayed.

회귀 문제에서 스텀프를 만드는 방법은 분류 문제와 전혀 다르지 않다. 다만 사용하는 손실 함수만 바뀌게 되고 바뀌 손실함수에 따라서 최적의 스텀프가 생성된다. 분류문제에서는 이항분포의 우도함수에 기반한 손실함수를 사용하고 회귀문제에서는 최소제곱오차를 손실함수로 사용한다. 그리고 예측값의 의미 또한 달라지는데 회귀 문제에서의 예측값은 반응 변수 자체를 예측한 것이고 분류 문제에서의 예측값은 반응 변수가 1을 값을 가질 확률과 관련된다.How to make stumps in regression problems is no different from classification problems. Only the loss function used is changed and the optimal stump is generated according to the changed loss function. In the classification problem, we use the loss function based on the likelihood function of the binomial distribution. In the regression problem, we use the least square error as the loss function. The meaning of the predicted value is also different. The predicted value in the regression problem predicts the response variable itself, and the predicted value in the classification problem is related to the probability that the response variable has a value of 1.

한편, 상기 회귀 모델의 특수한 경우로서, 분쟁 특허만을 대상으로 한 분쟁 특허 중에서의 재분쟁 예측 모델을 생성할 수 있을 것이다. 이 경우, 샘플은 분쟁 특허로만 구성되고, 반응 변수값은 1) 분쟁 특허의 분쟁 제기 회수 또는 2) 재분쟁 제기 회수 또는 3) 분쟁 제기 회수의 범주 처리값 또는 4) 재분쟁 제기 회수의 범주 처리값이 된다. 1), 1회 분쟁 제기 특허는 1, 2회 분쟁 제기 특허는 2, n회 분쟁 제기 특허는 n이 반응 변수값으로 할당 될 수 있으며, 2) 1회 분쟁 제기 특허는 0, 2회 분쟁 제기 특허는 1, n회 분쟁 제기 특허는 n-1이 반응 변수값으로 할당 될 수 있으며, 3)의 예를 든다면, 1회 분쟁 제기 특허는 1, 2회~5회 분쟁 제기 특허는 2, 6~20회 분쟁 제기 특허는 3, 20를 초과하는 분쟁 제기 특허는 4가 반응 변수값으로 할당 될 수 있으며, 4)의 예를 든다면 1회 분쟁 제기 특허는 0, 2회~5회 분쟁 제기 특허는 1, 6~20회 분쟁 제기 특허는 2, 20를 초과하는 분쟁 제기 특허는 3이 반응 변수값으로 할당될 수도 있을 것이다. On the other hand, as a special case of the regression model, it will be possible to generate a re-dispute prediction model among dispute patents targeting only dispute patents. In this case, the sample consists only of the disputed patent, and the response variable value is 1) the number of disputed claims of the disputed patent, or 2) the number of disputed claims, or 3) the category of the disputed claims, or 4) the category of the number of disputed claims. Value. 1), 1 dispute dispute patent is 1, 2 dispute dispute patent is 2, n dispute dispute patent can be assigned as a response variable value n, 2) 1 dispute dispute patent is 0, 2 dispute dispute Patents for 1 and n disputed patents can be assigned a response variable value of n-1, and for example, 3), one disputed patent is 1, 2 to 5 disputed patents are 2, For 6-20 disputed patents, dispute disputed patents exceeding 3, 20 can be assigned as a 4-valued response variable.For example, 4) disputed patents have 0, 2-5 disputes. 1, 6-20 disputed patents may be assigned to 2 or 20 disputed patents, and 3 may be assigned as a response variable.

상기 분쟁 예측 모델 생성부(5520)는 상기와 같은 반응 변수값으로 분쟁 특허 집합을 구성하는 분쟁 특허 Pi의 분쟁 예측 요소별 분쟁 예측 요소값으로 상기 분쟁 예측 모델의 생성에 대한 정보 처리 방법과 동일한 방법으로 재분쟁 예측 모델을 생성한다. 상기 재분쟁 예측 모델로 생성되는 값은 재분쟁 예측 모델값이 된다. 이 재분쟁 예측 모델이 필요한 이유는 사용자가 관심이 있는 특허가 재분쟁을 일으킬 것인지를 판단하거나, 얼마나 많은 분쟁을 일으킬 지를 판단하는데 활용될 수 있을 것이다.The dispute prediction model generating unit 5520 is the same method as the information processing method for generating the dispute prediction model with the dispute prediction element value for each dispute prediction element of the dispute patent Pi constituting the dispute patent set using the response variable values as described above. Create a re-dispute prediction model. The value generated by the re-dispute prediction model becomes a re-dispute prediction model value. The reason for this re-dispute prediction model may be used to determine whether a patent of interest will cause re-dispute or how many disputes it will cause.

한편, 상기에서 상기 분쟁 예측 모델 생성부(5520)는 분쟁의 발생 여부나 분쟁의 발생 회수를 중심으로 하는 분쟁 예측 모델의 생성에 대해 설명하였다. 이어, 상기 분쟁 예측 모델 생성부(5520)의 분쟁 제기 시점을 중심으로 하는 분쟁 시점 예측 모델의 생성 방법에 대해 설명한다. 분쟁 시점의 예측이란 특정 특허 Pi의 분쟁 예측 요소값으로 어느 시점에 분쟁을 제기하느냐를 예측하는 것이다. On the other hand, the dispute prediction model generation unit 5520 described the generation of a dispute prediction model focusing on the occurrence of disputes or the number of occurrences of disputes. Next, a method of generating a dispute point prediction model centered on the dispute point of time of the dispute prediction model generation unit 5520 will be described. Prediction of a dispute is a prediction of when a dispute is raised by the value of a dispute prediction element of a specific patent Pi.

이어, 도 36을 참조하면서, 본 발명의 특허별 분쟁 예측 모델값 생성부(5530)에 대하여 설명한다. 상기 특허별 분쟁 예측 모델값 생성부(5530)는 분쟁 예측 모델값을 생성하려는 특허를 입수(SL41)하고, 입수된 특허의 분쟁 예측 요소값을 생성 또는 입수(SL42)하고, 특허의 분쟁 예측 요소값을 분쟁 예측 모델에 입력하여 특허의 분쟁 예측 모델값을 생성(SL43)하고, 생성된 특허의 분쟁 예측 모델값 저장(SL44)한다. Next, with reference to FIG. 36, the dispute prediction model value generation part 5530 for each patent of this invention is demonstrated. The dispute prediction model value generation unit 5530 for each patent obtains a patent for generating a dispute prediction model value (SL41), generates or obtains a dispute prediction element value of the obtained patent (SL42), and sets a dispute prediction element for a patent. The value is input to the dispute prediction model to generate a dispute prediction model value of the patent (SL43), and the dispute prediction model value storage of the generated patent (SL44).

상기 특허별 분쟁 예측 모델값 생성부(5530)는 Pi에 분쟁 예측 모델을 적용하여 분쟁 예측 모델값을 생성하는 기능을 수행한다. 상기 특허별 분쟁 예측 모델값 생성부(5530)는 분쟁 예측 모델의 생성에서도 기능할 수도 있으며(상기 분쟁 예측 모델 생성부(5520)에서도 Pi를 분쟁 예측 모델 후보를 적용하여 분쟁 예측 모델 후보에 대한 분쟁 예측 모델값을 생성하는 것을 설명한 바 있으며, 이 경우, 상기 특허별 분쟁 예측 모델값 생성부(5530)는 분쟁 예측 모델 생성부(5520)의 하위에서 작동하거나, 상기 분쟁 예측 모델 생성부(5520)가 특허별 분쟁 예측 모델값 생성부(5530)의 기능을 호출하여 사용할 수도 있다.) 분쟁 예측 모델 생성부(5520)와는 독립적으로 상기 분쟁 예측 모델 생성부(5520)가 생성한 분쟁 예측 모델을 사용하여 임의의 입력되는 특허 Pi별로 분쟁 예측 모델값을 생성하는데 사용될 수 있다. 상기 특허별 분쟁 예측 모델값 생성부(5530)는 특허 Pi에 대하여 분쟁 예측 요소값 생성부(5510)가 생성한 분쟁 예측 요소별 분쟁 예측 요소값을 분쟁 예측 모델(n개의 스텀프 집합으로 구성되는 SSn은 분쟁 예측 모델의 한 예가 된다. 다른 통계적 처리 기법을 사용하여 생성되는 분쟁 예측 모델도 본 발명의 분쟁 예측 모델이 된다.)에 적용하여 Pi의 분쟁 예측 모델값을 생성한다. The dispute prediction model value generating unit 5530 for each patent performs a function of generating a dispute prediction model value by applying a dispute prediction model to Pi. The dispute prediction model value generation unit 5530 for each patent may also function in generation of a dispute prediction model (the dispute prediction model generation unit 5520 also applies a dispute prediction model candidate Pi to a dispute for a dispute prediction model candidate. The generation of prediction model values has been described. In this case, the patent-specific dispute prediction model value generation unit 5530 may operate under the dispute prediction model generation unit 5520 or the dispute prediction model generation unit 5520. May be used by calling the function of the dispute prediction model value generation unit 5530 for each patent.) The dispute prediction model generated by the dispute prediction model generation unit 5520 is used independently of the dispute prediction model generation unit 5520. Can be used to generate a dispute prediction model value for each input patent Pi. The dispute prediction model value generation unit 5530 for each patent determines a dispute prediction element value for each dispute prediction element generated by the dispute prediction element value generation unit 5510 with respect to patent Pi (SSn composed of a set of n stumps). Is an example of a dispute prediction model, which is also applied to a dispute prediction model generated using other statistical processing techniques, to generate a dispute prediction model value of Pi.

상기 분쟁 예측 모델값은 기 설정된 후처리를 거친 값이 될 수 있다. 이어, 상기 후처리에 대해서 설명한다. 모형을 구축할 때 사용한 자료(분쟁 특허 집합과 비분쟁 특허 집합)가 전체 모집단으로부터 랜덤하게 추출된 자료라면 상관이 없지만 그렇지 않은 경우에는 통계학적 후처리가 필요하다. 모집단(미국 특허의 경우, 약 400만 건)에서 분쟁 특허는 몇 만 건 정도 밖에 되지 않기 때문에 이러한 모집단에서 랜덤 표본(분쟁 특허 집합+비분쟁 특허 집합)을 사용하게 되면 표본의 크기가 아주 크지 않는 한 그 표본에 포함되는 분쟁 특허의 수가 굉장히 적기 때문에 좋은 모형을 만들기가 어렵게 된다. 샘플에 분쟁 특허를 가능한 한 많이 포함시켜 분석해서 그 특성을 제대로 파악하여야지 좋은 모형을 구축할 수 있으므로, case-control study 방법으로 자료를 분석하게 된다. 특허 자료의 경우 분쟁 특허에 대해서 이미 정보가 수집되어 있기 때문에 이와 대응되는 비분쟁 특허 자료만 임의로 추출하여 case-control study 방법으로 자료를 분석할 수 있다. 이 때 주의해야 할 점은 case-control 표본으로부터 얻은 점수는 원래 모집단에서 알고자 하는 점수가 아니기 때문에 원래 점수로 환원시켜 주는 절차가 필요하며, 이 경우 전체 모집단의 크기를 알아야 한다. 예를 들어, 미국 특허 자료의 경우 전체 모집단의 크기(약 400만 건)를 알기 때문에 Bayes 정리를 써서 쉽게 원래 점수를 구할 수가 있다. Bayes 정리로부터 유도 된 하기 수학식 1을 이용해서 상기 특허별 분쟁 예측 모델값 생성부(5530)는 최종적인 분쟁 예측 모델값(final score)를 구할 수가 있다. 하기 수학식 1 에서 n1은 표본의 분쟁 특허의 개수, n0는 표본의 비분쟁 특허의 개수, N1은 전체 분쟁 특허의 개수, N0는 전체 비분쟁 특허의 개수를 말한다. boosting score는 분쟁 예측 모델 생성부(5520)가 생성한 분쟁 예측 모델에 대하여 Pi에 대응되는 분쟁 예측 요소값을 적용한 분쟁 예측 모델값이 된다.The dispute prediction model value may be a value that has undergone a predetermined post-processing. Next, the post-processing will be described. It does not matter if the data used in constructing the model (disputed patent set and non-disputed patent set) are randomly extracted from the entire population, otherwise statistical post-processing is required. Since there are only tens of thousands of disputed patents in the population (about 4 million US patents), using a random sample (disputed patent set + non-disputed patent set) in these populations results in a very small sample size. The very small number of disputed patents in a sample makes it difficult to create a good model. As many dispute patents are included in the sample as possible and analyzed to understand the characteristics, a good model can be constructed, and the data are analyzed by the case-control study method. In the case of patent data, information on the disputed patent has already been collected, so only the non-disputed patent data corresponding to the patent can be randomly extracted and analyzed by the case-control study method. It is important to note that the score obtained from the case-control sample is not what the original population wants to know, so a procedure is required to reduce the original score. In this case, the size of the entire population must be known. For example, in the case of US patent data, because we know the size of the entire population (about 4 million), we can easily get the original score using Bayes' theorem. The dispute prediction model value generating unit 5530 for each patent may obtain a final dispute prediction model value (final score) using Equation 1 derived from Bayes' theorem. In Equation 1, n1 denotes the number of disputed patents in the sample, n0 denotes the number of non-disputed patents in the sample, N1 denotes the total number of disputed patents, and N0 denotes the total number of non-disputed patents. The boosting score is a dispute prediction model value obtained by applying a dispute prediction element value corresponding to Pi to the dispute prediction model generated by the dispute prediction model generator 5520.

[수학식 1][Equation 1]

상기 수학식 1은 control 표본을 랜덤으로 추출하였을 경우 최종 점수를 구하는 방법이고 control 표본을 층화 추출 하였을 경우에는 식이 달라진다. 각 특허 분류를 기준으로 한 범주마다 그 크기가 다르고 분쟁 예측 모델값의 통계적 성질이 다르기 마련인데 이를 제대로 반영하기 위해서는 비분쟁 특허의 샘플링 시 분쟁 특허의 특허 분류에 대응되게 비분쟁 특허를 층화 추출하는 것이 고려된다. 각 특허 분류 범주마다 표본이 고르게 뽑힐 수 있도록 층화 추출을 이용한다. 층화 추출로 표본이 생성되었을 경우 최종 점수를 구하는 식은 하기 수학식 2와 같다. 여기서 s는 특허 분류 범주를 구분하는 인덱스이다. 예를 들면 n_1s는 s번째 특허 분류 범주에서 표본에 뽑은 분쟁 특허의 개수를 나타낸다.Equation 1 is a method of obtaining a final score when the control sample is randomly extracted, and the equation is different when the control sample is stratified. The size of each category is different and the statistical properties of the dispute prediction model value are different for each category based on each patent classification.To properly reflect this, the non-dispute patents are stratified and extracted to correspond to the patent classification of the disputed patents. Is considered. Stratified extraction is used to ensure that samples are evenly sampled for each patent classification category. When the sample is generated by stratification extraction, the equation for obtaining the final score is shown in Equation 2 below. Where s is an index identifying a patent classification category. For example, n _1s represents the number of disputed patents sampled from the sth patent category.

[수학식 2]&Quot; (2) "

상기와 같이 특허별 분쟁 예측 모델값 생성부(5530)는 특허별로 분쟁 예측 모델값을 생성한다. 한편, 상기 특허별 분쟁 예측 모델값 생성부(5530)는 특허 DB(1120)에 있는 모든 특허에 대하여 상기 분쟁 예측 모델값을 생성할 수 있을 것이며, 상기 특허 분쟁 예측 정보 생성 시스템(5000) 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 사용자가 지정하거나 관리하는 적어도 하나 이상의 특허로 구성되는 특허에 대하여 상기 분쟁 예측 모델값을 생성할 수 있을 것이다. 생성된 분쟁 예측 모델값은 상기 분쟁 예측 모델값 DB(5230)에 저장된다. 본 발명의 분쟁 예측 모델값 제공부(5540)는 외부로부터 적어도 하나 이상의 특허에 대한 분쟁 예측 모델값의 요청을 받는 경우, 상기 특허에 대한 분쟁 예측 모델값을 제공하는 기능을 수행한다. 상기 분쟁 예측 모델값 제공부(5540)는 사용자가 적어도 하나 이상의 특허를 선택하거나, 검색 결과 등으로 적어도 하나 이사의 특허 목록이 제시될 때, 상기 분쟁 예측 모델값 제공부(5540)는 상기 선택된 특허 또는 검색 결과에 분쟁 예측 모델값이 포함되도록 분쟁 예측 모델값을 제공할 수 있을 것이다.As described above, the dispute prediction model value generating unit 5530 for each patent generates a dispute prediction model value for each patent. Meanwhile, the dispute prediction model value generation unit 5530 for each patent may generate the dispute prediction model value for all patents in the patent DB 1120, and the patent dispute prediction information generation system 5000 or the The dispute prediction model value may be generated for a patent composed of at least one patent designated or managed by a user of the patent dispute prediction information generation system 5000. The generated dispute prediction model value is stored in the dispute prediction model value DB 5230. The dispute prediction model value providing unit 5540 of the present invention performs a function of providing a dispute prediction model value for the patent when receiving a request for dispute prediction model value for at least one or more patents from the outside. When the user selects at least one or more patents or the list of patents of at least one director is presented as a search result, the dispute prediction model value providing unit 5540 may select the selected patent. Alternatively, the dispute prediction model value may be provided to include the dispute prediction model value in the search result.

이어, 도면을 참조하면서, 상기 분쟁 예측 모델을 통하여 분쟁 예측 정보를 생성하는 방법에 대해서 설명한다.Next, a method of generating dispute prediction information through the dispute prediction model will be described with reference to the drawings.

상기 분쟁 예측 엔진(5100)은 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 먼저 입수하고 자기 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수하거나, 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고, 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 자기 특허 집합을 입수(SL51)한다. 이어, 상기 분쟁 예측 엔진(5100)은 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수(SL52)하고, 상기 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 분쟁 예측 정보를 생성(SL53)한다. 이하 분설한다. The dispute prediction engine 5100 first obtains a self patent set including at least one patent and obtains at least one target patent set having a predetermined relationship with the self set, or obtains a target patent set including at least one patent. First, at least one self patent set having a predetermined relationship with a target set is obtained (SL51). Subsequently, the dispute prediction engine 5100 obtains at least one dispute prediction model value for each patent for each patent constituting the target patent set (SL52), and utilizes the dispute prediction model value for each patent to at least one or more patents. The dispute prediction information is generated (SL53). Separate it below.

먼저 자기 특허 집합(Self Set, SS)을 정의한다. 자기 특허 집합은 시스템의 기 설정된 입력 또는 사용자가 분쟁 예측 정보의 생성을 위하여 입력하는 입력에 대응되는 적어도 하나 이상의 특허를 포함하는 특허군으로 특정되는 특허 집합을 말한다. 상기 입력은 적어도 하나 이상의 특허 입력, 적어도 하나 이상의 권리자 입력, 적어도 하나 이상의 발명자 입력, 적어도 하나 이상의 특허 분류 입력 및 적어도 하나 이상의 검색식 입력 중 어느 하나 또는 어느 하나 이상의 조합인 것일 수 있다. 상기 검색식은 특허 DB(1120) 상에 저장된 특허를 호출할 수 있는 모든 필드에 대한 필드별, 필드 조합별 검색식을 당연히 포함하며, 특정 필드에 대한 검색 키워드의 입력을 당연히 포함한다. 예시적으로 사용자는 자사의 기술 분야 트리(tree), 국내, 또는 해외 또는 국가별 경쟁사나 유관 기업 트리, 자사 제품과 관련된 기능에 대응되는 기술 분야별 트리, 자사가 관리하는 적어도 하나 이사의 특허군에 대한 트리, 자사가 관심 있는 특허군을 호출할 수 있는 검색식 트리를 생성할 수 있으며, 각 트리의 종단 노드(end node)에는 사용자의 입력 또는 조건 또는 개별 특허 문건들이 대응되어 있을 수 있다. 상기 트리는 적어도 1개 계층 이상의 다단 트리일 수 있음은 당연할 것이다. IPC 계층 구조는 상기 다단 트리의 대표적인 예시가 된다. 상기 SS를 생성하는 것은 본 발명의 자기 특허 집합 생성부(5110)가 수행한다.First, a self-patent set (SS) is defined. The self patent set refers to a patent set specified by a patent group including at least one patent corresponding to a preset input of a system or an input input by a user for generation of dispute prediction information. The input may be any one or combination of at least one patent input, at least one owner input, at least one inventor input, at least one patent classification input, and at least one search input. The search expression naturally includes a search expression for each field and field combination for all fields capable of calling a patent stored on the patent DB 1120, and naturally includes an input of a search keyword for a specific field. By way of example, a user may apply to a company's technology tree, a domestic or foreign or country-specific competitor or affiliated company tree, a technology field tree corresponding to a function related to his product, or at least one director's patent group. A search tree that can call a patent group of interest, and a patent group of interest, and end nodes of each tree may correspond to user inputs or conditions or individual patent documents. It will be appreciated that the tree can be a multi-stage tree of at least one hierarchy. The IPC hierarchy is a representative example of the multistage tree. Generating the SS is performed by the self-patent set generating unit 5110 of the present invention.

자기 특허 집합은 상기 사용자 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)이 생성하거나 선택하거나 지정하는 적어도 하나 이상의 공개 또는 등록 특허로 구성되는 특허 집합이다.상기 분쟁 예측 엔진(5100)의 자기 특허 집합 생성부(5110)는 적어도 하나 이상의 특허로 구성되는 자기 특허를 생성하거나 입수하여 자기 특허 집합을 구성한다. 상기 자기 특허 집합의 생성은 검색 엔진이나 DBMS를 통한 검색을 통해서 생성될 수도 있으며, 사용자가 저장하고 있는 적어도 하나 이상의 특허 집합 또는 상기 특허 집합의 연산 결과로 생성되는 특허 집합에 대한 지정 또는 상기 특허 집합에 포함된 적어도 하나 이상의 특허에 대한 지정을 통하여도 생성될 수 있다. 상기 자기 특허 집합은 상기 사용자의 소속 기관이 보유하는 특허일 필요는 없으며, 사용자가 관심을 가지고 지정하거나 선택하는 특허이면 충분하다. 상기 사용자는 다단계 계층 구조의 트리(tree) 방식으로 자신이 관심을 가지는 특허를 관리할 수 있을 것이다. 상기 트리를 구성하는 각 노드는 노드 명칭을 가지며, 각 노드에는 적어도 하나 이상의 특허들이 대응되어 있다.The self-patent set is a patent set composed of at least one published or registered patent generated, selected, or designated by the user or the patent dispute prediction information generation system 5000. [0047] Self-patent set generation of the dispute prediction engine 5100. The unit 5110 generates or obtains a self-patent composed of at least one patent to form a self-patent set. The generation of the self patent set may be generated through a search through a search engine or a DBMS, and designation of the at least one patent set stored by a user or a patent set generated as a result of the operation of the patent set or the patent set. It may also be generated through designation of at least one patent included in. The self-patent set does not need to be a patent owned by the user's organization, but a patent that the user designates or selects with interest is sufficient. The user may manage patents of interest in a tree of a multi-level hierarchy. Each node constituting the tree has a node name, and at least one patent is associated with each node.

상기 SSi를 도입하는 이유는 다음과 같다. 특허 분쟁은 본질적으로 원고의 특허권과 피고의 기계, 제품, 방법, 조성물(이하, 제품이라 한다)과의 관계의 문제이며, 원고의 특허와 피고의 특허 간의 문제가 아니다. 이에 따라, 피고의 제품을 시스템이 처리할 수 있는 데이터로 투사 또는 투영시키는 것이 필요하게 된다. 이에 따라, 본 시스템의 사용자는 자신이 관심이 있는 제품과 관련된 특허군을 시스템이 이해할 수 있도록 입력할 필요가 있게 된다. 상기 제품과 관련된 특허군의 입력의 예는 1) 자사의 제품의 기능, 구조, 방식, 재질 등 제품에 반영된 기술 중 특허화된 기술이 있는 경우, 그 특허들의 입력, 2) 자사의 특허가 없거나 부족한 경우, 자사의 제품과 관련된 타사의 특허, 3) 자사의 제품과 관련된 타인의 특허, 4) 자사의 제품과 관련된 기술 분야의 특허 분류 입력, 5) 자사의 제품과 관련된 특허를 검색할 수 있는 검색식의 입력 중 어느 하나 이상 또는 어느 하나 이상의 조합의 입력을 통하여 제품과 관련된 특허군을 특정할 수 있게 된다. 시스템은 직접적인 제품이 아닌, 제품이 투영된 특허군으로 특허 분쟁 예측 정보를 생성하게 된다.The reason for introducing the SSi is as follows. A patent dispute is in essence a matter of the relationship between the plaintiff's patent rights and the defendant's machine, product, method, and composition (hereinafter referred to as the product) and not the plaintiff's patent and the defendant's patent. Accordingly, it is necessary to project or project the defendant's product into data that the system can process. Accordingly, the user of the system needs to input the patent group related to the product of interest so that the system can understand. Examples of the input of patent group related to the product include 1) the input of patents if there is a patented technology among the technologies reflected in the product, such as the function, structure, method and material of the product, and 2) no or insufficient patent of the company. Search for retrieval of third-party patents related to our products, 3) patents of others related to our products, 4) entering patent classifications in technical fields related to our products, and 5) patents related to our products. It is possible to specify a patent group associated with a product through the input of any one or more or any combination of one or more of the inputs of the equation. The system will generate patent dispute prediction information from a group of patents on which a product is projected, not a direct product.

상기 사용자가 상기 SS를 설정할 때도, 상기 SSi에 대하여 주관적인 가중치를 설정할 수 있다. 상기 가중치의 설정은 본 발명의 가중치 조절부(5143)에서 제공하는 UI를 통하여 수행한다. 상기 UI는 SS에 속하는 특허 리스트를 제공해 주고, 상기 특허 리스트에 사용자가 가중치나 중요도 또는 중요 등급을 입력할 수 있도록 하는 방식으로 수행한다. 이러한 가중치를 사용자 설정 SSi 가중치라 한다. 상기 사용자 설정 가중치는 상기 SSi로 생성되는 후술하는 TS에도 반영된다.When the user sets the SS, a subjective weight may be set for the SSi. The setting of the weight is performed through a UI provided by the weight adjusting unit 5153 of the present invention. The UI provides a list of patents belonging to the SS, and the user can enter a weight, importance or importance level in the patent list. This weight is referred to as user set SSi weight. The user setting weight is also reflected in a later-described TS generated by the SSi.

자기 특허 집합 SS에 속하는 특허를 SSi라 하자. 상기 SSi 개별 특허에 대하여 타겟 특허 집합(Target Set, TS)를 정의하자. TS는 사용자 또는 시스템이 SSi와 분쟁 예측 정보의 생성 목적으로 생성하거나 지정하거나, 입력하는 방식으로 특정되는 특허 집합을 말한다. 상기 TS를 생성하는 것은 본 발명의 타겟 특허 집합 생성부(5120)가 수행한다. 타겟 특허 집합은 자기 특허 집합에 포함되는 자기 특허와 기 설정된 관련성을 가지는 특허들인 타겟 특허를 포함하는 특허 집합이다. 본 발명의 타겟 특허 집합 생성부(5120)는 자기 특허 집합을 통하여 타겟 특허 집합을 생성하거나, 상기 사용자 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 지정이나 선택을 통하여 타겟 특허 집합을 생성한다. 전자에 대해서 상세히 설명한다. 상기 기 설정된 관련성은 1) 인용-피인용 특허 관계, 2) 텍스트 마이닝 기법 상의 유사 특허군 관계, 3) 특허 분류 상 유사 기술 특허 관계, 4) 사용자자 지정하는 특허로 구성되는 지정 특허군 관계 등이 있을 수 있다.Let SSi be a patent belonging to its own patent set SS. Define a target patent set (TS) for the SSi individual patent. TS refers to a set of patents specified by a user or a system generated, designated, or inputted for the purpose of generating SSi and dispute prediction information. Generating the TS is performed by the target patent set generation unit 5120 of the present invention. The target patent set is a patent set including a target patent which is a patent having a predetermined relationship with the own patent included in the own patent set. The target patent set generation unit 5120 of the present invention generates a target patent set through its own patent set, or generates a target patent set through designation or selection of the user or the patent dispute prediction information generation system 5000. The former will be described in detail. The preset relations include 1) citation-cited patent relations, 2) similar patent group relations in text mining techniques, 3) similar technology patent relations in patent classifications, and 4) designation patent group relations consisting of user-specified patents. This can be.

이어, 상기 분쟁 예측 엔진(5100)이, 기설정된 관계성 정보를 고려한 타겟 특허별 가중치 정보를 활용하여 분쟁 예측 정보를 생성하는 방법에 대해 설명한다. 상기 분쟁 예측 엔진(5100)은 자기 특허 집합을 입수(SL61)하고, 자기 특허 집합을 구성하는 개별 자기 특허별로 기 설정된 관계성을 가지는 타겟 특허를 추출하고 타겟 특허 집합을 생성(SL62)한다. 이어, 상기 분쟁 예측 엔진(5100)은 상기 타겟 특허에 대한 관계성 정보를 입수하거나 생성(SL63)한 다음, 상기 관계성 정보를 고려한 타겟 특허별 가중치 정보를 생성(SL64)하고, 상기 타겟 특허별 가중치 정보를 활용하여, 기 설정된 적어도 하나 이상의 분쟁 예측 정보를 생성(SL65)한다. Next, a method of generating dispute prediction information by using the dispute prediction engine 5100 using weight information for each target patent in consideration of predetermined relationship information will be described. The dispute prediction engine 5100 obtains a self patent set (SL61), extracts a target patent having a predetermined relationship for each individual patent constituting the self patent set, and generates a target patent set (SL62). Subsequently, the dispute prediction engine 5100 obtains or generates relationship information about the target patent (SL63), and then generates weight information for each target patent in consideration of the relationship information (SL64), and then for each target patent. By using the weight information, at least one preset dispute prediction information is generated (SL65).

도 37에는 상기 자기 특허 집합을 구성하는 자기 특허들과 타겟 특허 집합을 구성하는 타겟 특허 간에 기 설정된 관계성이 설정되는 것을 도시해 주고 있다. 상기 관계성은 R(SSi, TSi)로 정의되는데, SSi와 TSi간에 성립되는 값이다. 도 37에서 알 수 있듯이, 하나의 SSi가 적어도 하나 이사의 TSi와 관련될 수 있고, 하나의 TSi가 적어도 하나 이상의 SSi와 관련되어 있을 수 있다. 상기 도관1에서 R(SS1, TS1), R(SS1, TS2)와 R(SS2, TS2)는 다를 수 있으며, 상기 TS2의 관점에서는 2개의 관계성 R(SS1, TS2)와 R(SS2, TS2)을 가진다. 상기 TSi에는 n개의 관계성을 가질 수 있으며, 상기 관계성은 TSi에 대한 가중치인 W(TSi)를 생성하는데 사용될 수 있다. 예를 들어, 상기 관계성이 텍스트 마이닝 상의 유사 특허군 관계일 때, R(SSi, TSi)는 SSi와 TSi간의 핵심 키워드를 기준으로 한 기 설정된 키워드 유사도 함수를 적용한 유사도의 정도가 될 수 있다. 한편, 상기 관계성이 인용일 경우, R(SSi, TSi)는 SSi와 TSi의 인용의 종류가 고려되어 부가되는 인용 관계성 함수값이 될 수 있을 것이다.FIG. 37 shows that a predetermined relationship is established between the self patents constituting the self patent set and the target patents constituting the target patent set. The relationship is defined as R (SSi, TSi), which is a value established between SSi and TSi. As can be seen in FIG. 37, one SSi may be associated with at least one TSi, and one TSi may be associated with at least one SSi. In conduit 1, R (SS1, TS1), R (SS1, TS2) and R (SS2, TS2) may be different, and in terms of TS2, two relations R (SS1, TS2) and R (SS2, TS2) ) The TSi may have n relationships, and the relationship may be used to generate W (TSi), which is a weight for TSi. For example, when the relationship is a similar patent group relationship in text mining, R (SSi, TSi) may be a degree of similarity to which a preset keyword similarity function based on a key keyword between SSi and TSi is applied. On the other hand, if the relationship is a citation, R (SSi, TSi) may be a citation relationship function value added considering the type of citation of SSi and TSi.

상기 인용-피인용 관계는 자기 특허와 타겟 특허가 1) 직접 인용, 2) 인용 깊이 n(n>1) 간접 인용, 3) 잠재 인용, 4) 사슬 인용 중 어느 하나 이상의 관계를 가지는 것이며, 이들에 대해서는 전술하였다. 개별 특허 Pi가 있는 경우, 각 인용 종류별로 기 설정된 다른 인용 가중치(Citation Weight)가 제공될 수 있다. 텍스트 마이닝 기법 상의 유사 특허군 관계는 자기 특허와 타겟 특허가 추출된 키워드를 기준으로 기 설정된 키워드 유사성을 가지는 것이며, 분류 상 유사 기술 특허 관계는 자기 특허와 타겟 특허가 특허 분류 체계 상에서 기 설정된 유사 기술 관계를 가지는 것을 말한다.The citation-cited relationship is a self-patent and a target patent having at least one relationship of 1) direct citation, 2) citation depth n (n> 1) indirect citation, 3) latent citation, and 4) chain citation. As described above. If there is an individual patent Pi, a different citation weight preset for each citation type may be provided. The similar patent group relationship in the text mining technique has the keyword similarity set based on the keyword from which the own patent and the target patent are extracted, and the similar technology patent relationship in the classification is the similar technology that the own patent and the target patent are preset in the patent classification system. To have a relationship.

본 발명의 인용 특허 집합 생성부(5121)는 자기 특허 집합에 속하는 Pi에 대하여, 상기 Pi의 인용 종류별로 상기 Pi가 인용하는 선행 특허들로 인용 특허 집합을 생성하며, 상기 인용 특허 집합은 타겟 특허 집합의 한 예가 되거나 타겟 특허 집합의 부분 집합이 된다. 상기 인용 특허 집합을 생성할 때, 자기 특허 집합에 속하는 Pj 특허가 상기 인용 특허 집합에 포함될 수 있으며, 이 경우 인용 특허 집합에 Pj를 포함시킬 지의 여부는 타겟 특허 집합 생성 정책에 따른다. 한편, 상기 사용자는 상기 타겟 특허 집합 생성부(5120)가 타겟 특허 집합을 생성할 때, 특정 출원인(예를 들면 사용자가 A 기업에 속하는 경우, A 기업 또는 A 기업과 특정한 관계를 맺고 있는 출원인 등)의 특허 등과 같은 기 설정된 속성을 가지는 특허들을 배제시킬 수 있을 것이다. 물론, 상기 특정 출원인의 특허의 배제는 생성된 타겟 특허 집합에 대한 상기 사용자의 후처리(타겟 특허의 삭제, 추가 등)를 통해서 처리될 수도 있다.The cited patent set generation unit 5121 of the present invention generates a cited patent set from the preceding patents cited by the Pi for each Pi cited type, for the Pi belonging to its own patent set, and the cited patent set is a target patent. An example of a set or a subset of a target patent set. When generating the cited patent set, the Pj patent belonging to the own patent set may be included in the cited patent set. In this case, whether to include Pj in the cited patent set depends on the target patent set generation policy. On the other hand, when the target patent set generation unit 5120 generates the target patent set, the user is a specific applicant (for example, if the user belongs to company A, applicants having a specific relationship with company A or company A, etc. Patents having predetermined attributes such as patents) may be excluded. Of course, the exclusion of the patent of the particular applicant may be processed through the post-processing (deleting, adding, etc.) of the user to the generated target patent set.

이때, 인용 특허 집합인 TS에 속하는 특허인 TSi에는 가중치가 서로 다른 TSi를 가질 수 있다. 인용 특허 집합을 구성하는 인용 특허들은 1) 인용 종류 기반 인용 가중치, 2) 중복 기반 인용 가중치, 3) 간접 인용의 경우 인용 깊이 기반 인용 가중치, 4) 인용 개수 반비례 가중치, 5) 인용 진부화 가중치를 가진다. 인용 종류 기반 인용 가중치의 부여는 직접 인용에 가장 큰 가중치가 부여되고, 잠재 인용의 세부 종류별로 다른 가중치를 부여하고, 간접 인용의 경우, 인용의 깊이가 클수록 더 낮은 가중치를 부여하는 방식으로 부여될 수 있을 것이다. 상기 4) 인용 개수 반비례 가중치는 SSi에 레퍼런스 수가 많으면 많을수록 특정 레퍼런스의 상대적인 가중치를 낮게 가져가는 것이다. 최근 레퍼런스에 대한 남용(가급적 많은 특허나 논문 등의 자료를 레퍼런스에 추가하는 것)이 일어나고 있으며, 레퍼런스 개수가 너무 많을 경우, 사실상 레퍼런스의 역할을 수행하지 못하는 레퍼런스가 포함되어 있을 확률이 높다. 상기 인용 개수 반비례 가중치는 "상수/(레퍼런스 수 - 기 설정된 정수)"가 될 수 있으며, 이 식은 레퍼런스 수가 기 설정된 정수를 초과할 때만 적용된다. SSi와 TSi간의 출원일이 차이가 많이 날수록 동일한 특허 분류를 가지더라도 내용상 관련성이 강도가 떨어질 가능성이 클 수 있다. 따라서, TSi에 대하여 TSi와 SSi의 출원일의 차이값에 반비례 관계를 가지는 진부화 함수를 고려한 인용 진부화 가중치를 부가할 수 있다. 인용 진부화 가중치는 "상수/f(SSi의 출원일과 TSi의 출원일간의 경과 일수)"가 될 수 있다At this time, TSi, which is a patent belonging to TS, which is a cited patent set, may have TSi having different weights. The cited patents that make up the cited patent set have 1) citation weights based on citation type, 2) weight citations based on duplicates, 3) citation depth based citation weights for indirect citations, 4) inverse citation counts, and 5) citation obsolescence weights. . Quotation type-based citation weighting can be given in such a way that the direct weight is given the largest weight, different weights are given for each detail type of potential citation, and in the case of indirect citation, the weight is lower as the citation depth is larger. Could be. 4) The inverse weight of the citation count is that the larger the number of references in SSi, the lower the relative weight of a specific reference. There has been a recent abuse of references (adding as many patents and articles as possible), and if the number of references is too high, it is likely that there are references that do not actually act as references. The quoted inverse weight may be "constant / (reference number-preset integer)", which is applied only when the reference number exceeds a preset integer. As the date of filing between SSi and TSi increases, there is a high possibility that the relevance of content may be reduced even if the same patent classification is used. Therefore, the cited obsolescence weight may be added to TSi in consideration of an obsolescence function having an inverse relationship to the difference between the application date of TSi and SSi. The citation obsolescence weight may be "constant / f (the number of days of application of SSi and the date of application of TSi)"

한편, 서로 다른 n개의 SSi가 동일한 TSi를 인용할 수 있으므로, 이 경우 TSi의 중복 인용에 따른 가중치는 n이 되게 된다. 한편, SSi의 입장에서 자신이 직접 인용하는 인용 깊이 1인 parent 특허와 그 parent가 인용하는 인용 깊이 n(n>1)인 grand parent 특허를 동등한 가중치로 취급할 수는 없게 된다. 이때, 인용 깊이 n이 커질수록 지수적, 기하급수적 또는 산술급수적으로 감쇄하는 감쇄 팩터(factor)를 부가하여 인용 깊이에 따른 TSi의 가중치를 조절한다. 한편, TSi들 중에서 최근에 인용된 경우에 가중치를 더 부여하거나, 그 권리자가 트롤이나 다분쟁 권리자 또는 기타 사용자의 등록된 경쟁자인 경우 등과 같이 권리자의 속성에 따라 가중치를 더 부여하거나, 표준 특허풀에 소속된 특허이거나, 분쟁 발생 특허이거나 하는 등 그 특허가 기 설정된 속성을 가지는 경우 가중치를 더 부여할 수도 있을 것이다. 시스템에 의한 TS의 가중치의 설정은 본 발명의 가중치 조절부(5143)에서 수행한다.On the other hand, since n different SSi may quote the same TSi, in this case, the weight according to the duplicate citation of TSi is n. On the other hand, SSi cannot directly treat the parent patent of citation depth 1 and the grand parent patent of citation depth n (n> 1) cited by the parent as equal weights. At this time, as the citation depth n increases, a weighting factor of TSi according to the citation depth is adjusted by adding a decay factor that decays exponentially, exponentially, or arithmetic exponentially. On the other hand, additional weights are assigned in the case of the recently cited among TSi, the weights are further assigned according to the property of the owner, such as when the owner is a registered competitor of a troll, multi-dispute or other user, or standard patent pool If the patent has a predetermined attribute, such as a patent belonging to or a disputed patent, it may be given a weight more. The weighting of the TS by the system is performed by the weight adjusting unit 5103 of the present invention.

SSi에 기반한 가중치에 대해서 설명한다. 사용자는 자신이 설정한 SSi 특허별로 자신의 주관적인 가중치인 사용자 설정 SSi 가중치를 부여할 수 있다. 인용 특허 집합 또는 유사 특허군 등으로 생성되는 TSi들은 SSi에 기반하기 때문에, SSi에 부여된 가중치는 TSi에도 부가될 수 있게 된다. 즉, 가중치가 아주 높은 특정한 SSi가 인용하는 특허군인 TSi는 가중치가 낮은 SSj가 인용하는 특허군인 TSj보다 더 높은 가중치를 가지도록 해야 한다. 따라서, TS를 구성하는 TSi들에는 SSi에 기반한 가중치인 사용자 설정 SSi 가중치 정보가 부가되어 있는 것이 더욱 바람직하다.The weight based on SSi is demonstrated. The user may assign a user set SSi weight which is a subjective weight for each SSi patent set by the user. Since TSi generated by cited patent sets or similar patent groups are based on SSi, weights assigned to SSi can be added to TSi. That is, TSi, the patent group cited by a particular SSi having a very high weight, should have a higher weight than TSj, a patent group cited by a low SSj. Therefore, it is more preferable that user set SSi weight information, which is a weight based on SSi, is added to TSi constituting the TS.

상기 TS의 대표적인 다른 예는 상기 SSi에 속하는 특허들에 대하여 텍스트 마이닝 기법을 통하여 생성되는 유사 특허군이다. 이에 대해 설명한다. 우선적으로 모든 개별 특허에 대하여 적어도 하나 이상의 키워드 후보를 추출한 다음, 키워드 후보 중에서 핵심 키워드를 추출하고 저장한 다음, 임의의 특허i에 대응되는 핵심 키워드군과 유사성이 높은 핵심 키워드를 가지는 적어도 하나 이상의 특허를 추출하여 유사 특허군을 생성한다. 상기 키워드 후보에는 키워드 쌍(공기쌍), 특허 분류 코드가 포함되어 키워드처럼 처리될 수 있다. 유사 특허군의 생성은 본 발명의 유사 특허군 생성부(5122)가 담당한다. 상기 유사 특허군 생성부(5122)는 SSi에 대하여 SSi를 대상으로 추출한 핵심 키워드 집합을 재료로 하여 상기 SSi와 유사한 선출원 유사 특허군을 추출한다. 상기 SSi와 선출원 유사 특허군 TS(SSi)을 생성한다. 상기 SSi와 TSi(SSi)간에는 유사도 점수를 가진다. 한편 SS를 구성하는 다른 특허(SSj)들에 대해서도 TS(SSj)를 생성하며, TS를 구성하는 TSi는 빈도(SSi와 i가 아닌 적어도 하나 이상의 j에 대응되는 SSj 모두 동일한 TSi를 유사 특허로 가질 수 있으며, 이 경우에는 TSi의 빈도는 2 이상이 되게 된다.)와 빈도수와 동일한 개수의 유사도를 가질 수 있다.Another representative example of the TS is a similar patent group generated through a text mining technique for patents belonging to the SSi. This will be described. At least one keyword candidate is first extracted for all individual patents, and then a key keyword is extracted and stored from the keyword candidates, and then at least one or more patents having a key keyword with high similarity to the key keyword group corresponding to any patent i. To generate a similar patent group. The keyword candidate includes a keyword pair (air pair) and a patent classification code, and may be processed like a keyword. The generation of the similar patent group is in charge of the similar patent group generation unit 5122 of the present invention. The similar patent group generation unit 5122 extracts a similar patent application group similar to the SSi from the key keyword set extracted for the SSi as the material. Generate the SSi and the similar patent group TS (SSi). The SSi and the TSi (SSi) has a similarity score. On the other hand, TS (SSj) is also generated for the other patents SSj constituting the SS, and TSi constituting the TS may have the same TSi as the similar patent for both SSj corresponding to at least one or more j instead of SSi and i. In this case, the frequency of TSi may be 2 or more.) And the same number of similarities as the frequency.

특히, 특허 분류의 경우, 복수 개의 특허 분류 각각이 핵심 키워드처럼 사용될 수 있을 것이다. 한편, 주 특허 분류에 가중치를 더 주는 방식으로 특허 분류마다 다른 가중치를 부여할 수 있을 것이다. 한편, 특허 분류 중 깊이가 깊은 n dot subclass 등에서 n이 클수록 특허 분류 체계 상에서 깊이가 깊은데, 이 깊은 특허 분류일수록 더 높은 가중치를 부여할 수 있다. 한편, H04B 7/26라는 특허 분류가 있을 때, H04B 7/26(2 dot Subgroup)의 부모에 해당하는 H04B 7/24(1dot Subgroup), H04B 7/00(Main Group) 등도 명시되어 나와있지는 않지만, 특허 명세서 상의 기술을 표상하는 것이므로, 핵심 키워드처럼 취급할 수 있을 것이다. 물론, H04B 7/26에 대한 가중치가 높고, H04B 7/24는 낮으며, H04B 7/00(Main Group)는 가장 낮게 설정할 수 있을 것이다. 나아가, 텍스트 마이닝 기법으로 유사 특허군을 생성할 때, 레퍼런스 정보(직접 인용, 간접 인용, 잠재 인용과 관련된 특허 번호)도 핵심 키워드처럼 취급하여, 유사 특허군의 생성에 활용 될 수 있을 것이다. 이 경우, 각 인용 종류별, 인용 깊이별로 상기 인용 관계 특허별로 같거나 다른 가중치가 부여될 수 있을 것이다.In particular, in the case of patent classification, each of the plurality of patent classifications may be used as a key keyword. On the other hand, it is possible to give a different weight for each patent classification in a manner that further weights the main patent classification. On the other hand, the deeper n deep in the patent classification system, such as n deep sub n dot subclass of the patent classification, the deeper patent classification can be assigned a higher weight. Meanwhile, when there is a patent classification of H04B 7/26, H04B 7/24 (1dot Subgroup), H04B 7/00 (Main Group), etc., which are parents of H04B 7/26 (2 dot Subgroup), are not specified. , Because it represents the technology on the patent specification, it can be treated like a key keyword. Of course, the weight for H04B 7/26 is high, H04B 7/24 is low, and H04B 7/00 (Main Group) may be set to the lowest. Furthermore, when generating similar patent groups by text mining techniques, reference information (patent numbers related to direct citations, indirect citations, and potential citations) may be treated as key keywords and used in generating similar patent groups. In this case, the same or different weights may be given for each citation type and citation depth for each citation relationship patent.

유사 특허군을 생성하는 방법은 클러스터링 방법과 검색 엔진을 이용하는 방법이 있다. 클러스터링은 핵심 키워드 간의 거리에 기반하여 유사 특허군을 생성하는 방법이며, 가장 간단한 방법으로 K-Means 알고리즘이 있다. 한편, 검색 엔진은 자체 내에 랭킹 알고리즘을 포함하고 있기 때문에, 입력된 n개의 핵심 키워드의 입력이 있는 경우, 검색 대상 특허 문건 중에서 n개의 핵심 키워드를 다수 포함하고 있는 특허 문건을 출력해 주는데, 이 중 일정 개수 또는 일정 스코어 이상이 되는 특허군을 유사 특허군으로 처리할 수 있다. 이때, 각 핵심 키워드는 특허 명세서 상의 필드별로 다른 가중치를 가지도록 처리될 수 있음은 물론일 것이며(예를 들어, 발명의 명칭에 나온 핵심 키워드에 가중치를 가장 높게 주고, 특허 청구 범위에 나온 핵심 키워드에 대해서도 상대적으로 높은 가중치를 부여하며, 발명의 상세한 설명에만 나온 핵심 키워드에는 가장 낮은 가중치를 부여할 수 있을 것이다.), 검색 엔진이 키워드별 가중치를 입수 받을 수 있는 경우라면, 입력되는 n개의 핵심 키워드에 가중치를 부여하여 검색 엔진에 질의할 수 있을 것이다. 특히, 가중치가 포함된 n개의 핵심 키워드로 쿼리를 생성하고, 생성된 쿼리를 핵심 키워드 필드(추출된 핵심 키워드만을 모아 놓은 필드)에 질의하면 빠른 속도로 유사 특허군을 생성할 수 있을 것이다. The method of generating a similar patent group includes a clustering method and a method using a search engine. Clustering is a method of generating a similar patent group based on the distance between key keywords. The simplest method is the K-Means algorithm. On the other hand, since the search engine includes a ranking algorithm in itself, when there are n key keywords input, the search engine outputs a patent document including a plurality of n key keywords among the searched patent documents. Patent groups that become a certain number or a predetermined score or more can be treated as a similar patent group. At this time, each of the key keywords may be processed to have a different weight for each field in the patent specification (for example, to give the highest weight to the key keywords in the name of the invention, the key keywords in the claims For example, if the search engine is able to obtain the weight for each keyword, then the n cores are entered. You can query the search engine by weighting the keywords. In particular, if a query is generated with n key keywords including weights, and the generated query is queried into a key keyword field (a field collecting only extracted key keywords), a similar patent group may be rapidly generated.

이어, 핵심 키워드를 생성하는 방법에 대해서 설명한다. 상기 키워드는 특허 명세서에 포함된 텍스트를 처리하여 생성할 수 있으며, 이는 본 발명의 상기 데이터 가공부(2000)의 핵심 키워드 생성부(2100)가 담당한다. 상기 핵심 키워드 생성부(2100)는 특허 명세서를 구성하는 각종 필드별에 대응된 구나 문장에서 키워드를 추출한다. 공기쌍의 추출은 근접 거리(1개의 문장 내 등 기 설정된 텀 간의 거리 기준을 충족하는 거리)에 있는 텀간의 조합을 통하여 추출한다. 상기 필드는 발명의 명칭, 특허 청구 범위, 요약, 발명의 상세한 설명, 산업상 이용 가능성, 효과, 종래의 기술(배경 기술) 등 특허 명세서를 구성하는 각종 필드 중 어느 하나 이상일 수 있다. 상기 핵심 키워드 생성부(2100)는 상기 필드에서 추출한 n개의 키워드에 대하여 핵심 키워드 집합을 생성한다. 상기 핵심 키워드 생성 시, 상기 핵심 키워드 생성부(2100)는 동의어 처리, 시소러스 처리 등을 수행하여 실질적으로 같거나, 동등하거나, 대등한 의미를 지니는 텀들은 묶어서 대표 텀으로 처리하여 핵심 키워드를 선정하는 것이 바람직하다. 한편, 상기 대표 텀으로 처리할 때, 사전 내지는 기계 번역기를 활용하여 1개 특허 문건에 나오는 2 이상의 언어에 대한 동의어, 시소러스 처리를 수행하는 것이 바람직할 것이다. 한편, 상기 대표 텀 내지는 상기 추출된 핵심 키워드는 상기 사전 내지는 기계 번역기를 통하여 적어도 하나 이상의 언어로 번역해 놓는 것이 더욱 바람직할 것이다. 자연어 처리를 통하여 키워드(통상적으로 이 기술 분야에서는 키워드를 텀(term)이라 부른다.) 또는 공기쌍을 추출하는 것은 자연어 처리 기술 분야에서 공지의 기술에 속한다. 추출된 n 개의 키워드(공기쌍 포함함은 당연할 것이다.)에 대하여, 기 설정된 핵심 키워드 선정 알고리즘을 적용하여 상기 특허 명세서를 대표할 수 있는 핵심 키워드 집합(핵심 공기쌍 집합이 선택적으로 포함됨은 당연할 것이다.)을 선정한다. 가장 빈번하게 사용되는 알고리즘은 TF(Term Frequency)와 IDF(Inverse Document Frequency)를 사용하는 것이다. Next, a method of generating key keywords will be described. The keyword may be generated by processing the text included in the patent specification, which is in charge of the key keyword generator 2100 of the data processor 2000 of the present invention. The key keyword generator 2100 extracts a keyword from a phrase or sentence corresponding to each field of the patent specification. The extraction of the air pair is extracted through the combination between the terms in the proximity distance (the distance satisfying the distance standard between the registered terms in one sentence). The field may be any one or more of various fields constituting the patent specification, such as the name of the invention, the claims, the summary, the detailed description of the invention, industrial applicability, effects, and the prior art (background art). The key keyword generator 2100 generates a key keyword set for the n keywords extracted from the field. When generating the key keyword, the key keyword generator 2100 performs synonym processing, thesaurus processing, and the like to select the key keywords by grouping the terms having substantially the same, equivalent, or equivalent meaning as a representative term. It is preferable. Meanwhile, when processing with the representative term, it may be desirable to perform synonyms and thesaurus processing for two or more languages in one patent document by using a dictionary or a machine translator. On the other hand, it is more preferable that the representative term or the extracted key keywords are translated into at least one language through the dictionary or machine translator. Extracting a keyword (commonly referred to in this technical field as a term) or an air pair through natural language processing belongs to techniques known in the field of natural language processing technology. For the extracted n keywords (including air pairs, it is natural), a core keyword set (a core air pair set is selectively included to represent the patent specification by applying a predetermined key keyword selection algorithm) Will be selected). The most frequently used algorithms use term frequency (TF) and inverse document frequency (IDF).

자연어 처리 기술 분야에서는 상기 TF와 IDF를 변수로 한 다양한 함수식이 공개되어 있으며, 필드별 가중치 등 기타의 복잡한 산식을 정책적으로 적용할 수 있음은 당연하다 할 것이다. 이때, 상기 좁은 의미의 키워드만으로 구성되는 핵심 키워드 집합, 상기 공기쌍만으로 구성되는 핵심 키워드 집합을 분리하여 생성하거나, 상기 좁은 의미의 키워드 및 상기 공기쌍을 동등하게 상기 알고리즘으로 처리한다. 상기 핵심 키워드 선정 알고리즘이 상기 공기쌍을 상기 좁은 의미의 핵심 키워드와 같이 처리하면, n개의 키워드 쌍에 대하여, 핵심 키워드 쌍 집합을 생성할 수 있다. In the field of natural language processing technology, various functional expressions using TF and IDF as variables are disclosed, and it is natural that other complex equations such as weights for each field can be applied in a policy manner. At this time, the core keyword set composed of only the narrow meaning keywords and the key keyword set composed only of the air pairs are generated separately or the keywords having the narrow meaning and the air pair are equally processed by the algorithm. When the key keyword selection algorithm processes the air pair together with the narrow key keyword, a key keyword pair set may be generated for n keyword pairs.

하기 수학식 3은 추출된 키워드에 대하여 핵심 키워드를 추출하는 알고리즘에 사용되는 수식의 일례이다.Equation 3 below is an example of an equation used in an algorithm for extracting a key keyword with respect to the extracted keyword.

[수학식 3]&Quot; (3) "

tf : term frequency로 키워드(색인어)가 현재 문서에 출현한 빈도 수 tf: term frequency The frequency at which the keyword (index) appears in the current document.

N : 전체 문서의 개수 N: the total number of documents

df : document frequency로 키워드가 출현한 문서 수 df: the number of documents in which the keyword appears in document frequency

slope : 기울기(임의의 상수값, 조정가능)slope: slope (arbitrary value, adjustable)

ut : 전체 문서집합에서의 unique termsut: unique terms for the entire document set

pivot : 평균 문헌 길이(ut/N)pivot: average length of literature (ut / N)

uf : 해당문서의 ut uf: ut of the document

Fwi : 필드별 가중치 Fwi: weight by field

한편, 상기 핵심 키워드 생성부(2100)가 2개 이상의 단어/어휘/어절로 구성되는 새로운 텀(term)을 추출할 때, 상기 새로운 텀이 기술적으로 의미를 가지는 텀인지를 판단하기 어려운 경우가 다수 있다. 이때, google.com과 같은 외부 검색 엔진을 활용하여 새로운 텀이 기술적 의미를 가지는 것인지를 결정하는 방법이 있을 수 있다. 상기 핵심 키워드 생성부(2100)는 상기 추출된 새로운 텀을 따옴표 처리(exact match를 처리하는 google.com의 검색어 처리 방식) 등과 같은 기 설정된 적어도 하나 이상의 처리를 수행한 다음, 상기 google.com과 같은 외부 검색 서비스 시스템에 전송하고, 상기 외부 검색 서비스 시스템으로부터 검색 결과를 전송 받고, 상기 검색 결과를 분석하여 기 설정된 기준이 충족되는 경우, 상기 추출된 새로운 텀을 정상적인 텀으로 처리한다. 상기 검색 결과의 분석은 검색 결과 수(hit 수, 그 질의어에 매칭되는 검색 결과가 몇 개인지를 가르쳐 줌)를 측정하는 것이며, 기 설정된 기준의 예로 영어의 경우, 1,000개 이상, 다른 언어의 경우에는 100개 이상 등이 될 수 있을 것이다. 예를 들어, 2010년 3월 현재 google.com에 "patent informatics"와 "patent informatics services"를 각각 질의하면, 67,300과 279개의 Results가 나오는데, 이 경우에는 "patent informatics"는 텀으로 취급하고, "patent informatics services"는 새로운 텀으로 취급하지 않을 수 있을 것이다. 한편, google.com등과 같은 검색 엔진이 아니라, wikipedia.org와 같은 용어에 대한 설명을 제공하는 시스템에 질의하여 그 term에 대한 설명이 존재하면, 새로운 텀으로 취급할 수 있다.On the other hand, when the core keyword generator 2100 extracts a new term consisting of two or more words / vocabulary / words, it is often difficult to determine whether the new term is a term having technical meaning. have. In this case, there may be a method of determining whether a new term has a technical meaning by using an external search engine such as google.com. The core keyword generation unit 2100 performs at least one or more predetermined processing such as quotation processing (queries processing method of google.com that processes an exact match) and the like, and then extracts the new term. When the search result is transmitted to an external search service system, the search result is received from the external search service system, and the search result is satisfied, a predetermined criterion is satisfied, and the extracted new term is processed as a normal term. The analysis of the search results measures the number of search results (the number of hits, which teaches how many search results match the query), and examples of the preset criteria are 1,000 or more in English and other languages. It could be more than 100 and so on. For example, as of March 2010, querying google.com for "patent informatics" and "patent informatics services", respectively, yields 67,300 and 279 results, in which case "patent informatics" is treated as a term. patent informatics services "may not be treated as a new term. On the other hand, you can query a system that provides a description of a term, such as wikipedia.org, rather than a search engine such as google.com, and treat the term as a new term if it exists.

상기와 같은 핵심 키워드 선정 알고리즘을 통하면, 1개의 특허 문건에 대응되는 1개 이상의 핵심 키워드 집합(Keyword Set, 1개 이상의 핵심 빈도 유지 핵심 키워드 조합 집합이 선택적으로 포함될 수 있음은 당연할 것이다.) 즉, i번째 특허 문건 Pi에 대응되는 핵심 키워드 집합 KS(Pi)={K1(Pi), K2(Pi), ,,,, Ki(Pi), Kj(Pi),,,,Kn(Pi)}을 얻게 된다. 상기에서 i, j, n은 정수이며, Kn(Pi)란 i번째 특허 문건 Pi에서 선정된 n번째 핵심 키워드를 말한다. 1개의 특허 문건에 복수개의 핵심 키워드 집합이 대응될 수 있는데, 그 이유는 1) 특정 필드(예를 들면 특허 청구 범위 또는 요약), 2) 필드별로 다른 가중치를 부여, 3) 2 이상의 핵심 키워드 선정 알고리즘 사용, 4) IDF 계산의 기준 범위, 5) 텀 추출 방식 등의 다양한 이유가 있을 수 있다.) 상기 i번째 특허 문건 Pi에 대응되는 핵심 키워드 집합 KS(Pi)={K1(Pi), K2(Pi), ,,,, Ki(Pi), Kj(Pi),,,,Kn(Pi)}은 Pi 또는 Pi에 대응되는 키(key)값을 기준으로 핵심 키워드 DB(1300)에 저장될 수 있다.Through the above-described key keyword selection algorithm, it will be obvious that one or more key keyword sets corresponding to one patent document may be selectively included. That is, the key keyword set KS (Pi) = {K1 (Pi), K2 (Pi), ,,,, Ki (Pi), Kj (Pi) ,,,, Kn (Pi) corresponding to the i th patent document Pi } You get In the above description, i, j, n are integers, and Kn (Pi) refers to the nth key keyword selected from the i th patent document Pi. A set of key keywords can be mapped to a single patent document, because: 1) specific fields (for example, claims or summaries), 2) different weights for each field, 3) selection of two or more key keywords There may be various reasons, such as the use of an algorithm, 4) reference range of IDF calculation, 5) term extraction method, etc.) Core keyword set KS (Pi) = {K1 (Pi), K2 corresponding to the i th patent document Pi. (Pi), ,,,, Ki (Pi), Kj (Pi) ,,,, Kn (Pi)} may be stored in the core keyword DB 1300 based on Pi or a key value corresponding to Pi. Can be.

한편, 본 발명의 핵심 키워드 메타 정보 생성부(2140)는 핵심 키워드별로 기 설정된 적어도 하나 이상의 메타 데이터를 생성하며, 생성된 핵심 키워드 메타 정보는 핵심 키워드 메타 정보 DB에 저장된다. 상기 핵심 키워드 메타 정보 생성부(2140)가 생성하는 핵심 키워드별 메타 데이터는 크게 2가지이다. Meanwhile, the core keyword meta information generation unit 2140 of the present invention generates at least one or more preset metadata for each of the core keywords, and the generated core keyword meta information is stored in the core keyword meta information DB. The key keyword meta data generated by the key keyword meta information generation unit 2140 is largely two types.

첫번째는 핵심 키워드간의 연관 정보이다. 모든 핵심 키워드는 상기 핵심 키워드가 나온 문건 Pi가 대응되게 된다. 따라서, 적어도 2 이상의 특허로 생성되는 특허군이 있을 때(상기 특허군의 예는 전체 특허, 일정 기간 동안의 전체 특허, 특정 특허 분류별 특허, 특정 권리자별 특허, 특정 발명자별 특허 등으로 구성될 수 있다.) 특허 문건 Pi에 대응되는 핵심 키워드 집합 KS(Pi)을 상기 특허군 단위로 연관 분석(association analysis, 예를 들면 소위 장바구니 분석)으로 처리하면, 핵심 키워드 간의 연관 정보가 생성되게 되며, 핵심 키워드 간의 연관 정보는 핵심 키워드 네트워크로 시각화 될 수 있다. 상기 핵심 키워드 네트워크를 본 발명의 네트워크 분석부(4700)가 분석할 수 있으며, 시각화는 본 발명의 데이터 시각화부(4700)이 수행할 수 있다.The first is association information between key keywords. All the key keywords are to be associated with the document Pi from which the key keywords appear. Therefore, when there is a patent group generated by at least two patents (an example of the patent group may be composed of all patents, all patents for a certain period, patents by specific patent classification, patents by specific rights holder, patents by specific inventors, etc.). If the key keyword set KS (Pi) corresponding to the patent document Pi is processed by the association group (association analysis, for example, so-called shopping cart analysis), the association information between the key keywords is generated. Association information between keywords can be visualized into the core keyword network. The core keyword network may be analyzed by the network analyzer 4700 of the present invention, and the visualization may be performed by the data visualizer 4700 of the present invention.

두번째는 핵심 키워드별 대응 서지 정보이다. 특허 문건 Pi에 대응되는 핵심 키워드 집합 KS(Pi)은 모두 Pi의 모든 서지 사항 요소(출원일, 출원인, 발명자, 특허 분류, 레퍼런스 등등)이나, Pi에 대응되는 표 1 내지 표 4의 분쟁 예측 요소별 분쟁 예측 요소값이 대응되게 된다. 모든 분석 정보가 대응되게 된다. 따라서, 적어도 2 이상의 특허로 생성되는 특허군이 있을 때(상기 특허군의 예는 전체 특허, 일정 기간별 전체 특허, 특정 특허 분류별 특허, 특정 권리자별 특허, 특정 발명자별 특허 등으로 구성될 수 있다.) 상기 특허군을 기준으로 개별 핵심적인 키워드마다 적어도 하나 이상의 특허 문건이 대응될 수 있게 된다. 따라서, 상기 개별적인 핵심 키워드별로 상기 핵심 키워드를 자신의 핵심 키워드로 하는 상기 특허군의 특허 문건들을 대상으로 한 정량 분석 정보(출원수, 등록수, 출원수 증가량, 등록수 증가량, 기타 특허 집합들에 대한 임의의 정량 분석 정보)를 상기 개별적인 핵심 키워드의 메타 데이터가 될 수 있다. 상기와 같은 핵심 키워드별 메타 데이터를 활용하여 rising하는 핵심 키워드 등을 찾을 수 있게 되며, 상기 핵심 키워드 네트워크를 통합하여 활용하면, 상기 rising 하는 핵심 키워드를 중심으로 하는 핵심 키워드 네트워크를 알 수 있게 된다. 한편, 상기 핵심 키워드 네트워크를 구성하는 핵심 키워드 간의 에지(edge)에는 2개의 핵심 키워드가 대응되므로, 상기 2개의 핵심 키워드를 동시에 포함하는 특허 문건을 검색할 수 있게 된다.(검색 엔진이 Pi별로 추출한 핵심 키워드만을 별도의 검색 가능하도록 하기 위하여 핵심 키워드를 검색 인덱스에 포함시킬 수 있을 것이다.)The second is the corresponding bibliographic information for each key keyword. The core keyword set KS (Pi) corresponding to the patent document Pi is all of the bibliographic elements of the Pi (application date, applicant, inventor, patent classification, reference, etc.) or by the dispute prediction elements of Tables 1 to 4 corresponding to Pi. The dispute prediction element values are matched. All analysis information will be matched. Therefore, when there is a patent group generated by at least two or more patents (an example of the patent group may be composed of all patents, all patents for a certain period, patents by specific patent classification, patents by specific rights holders, patents by specific inventors, and the like. ) Based on the patent group, at least one or more patent documents can be corresponded to each key keyword. Accordingly, quantitative analysis information (applications, registrations, application increase, registration increase, other patent sets) for the patent documents of the patent group for which the core keywords are the core keywords of the respective core keywords. Quantitative analysis information) may be metadata of the individual key keywords. It is possible to find a rising key keyword and the like using the meta data for each of the key keywords, and by integrating and utilizing the key keyword network, a key keyword network centered on the rising key keyword can be found. On the other hand, since two core keywords correspond to an edge between the core keywords constituting the core keyword network, a patent document including the two core keywords can be searched at the same time. Core keywords can be included in the search index so that only key keywords can be searched separately.)

Pi에 대응되는 핵심 키워드 집합 KS(Pi)을 구성하는 개별 키워드를 상기 특허군 단위로 연관 분석(association analysis, 예를 들면 소위 장바구니 분석)으로 처리하면, 핵심 키워드 간의 연관 정보가 생성되게 되며, 핵심 키워드 간의 연관 정보는 핵심 키워드 네트워크로 시각화 될 수 있다.When individual keywords constituting the core keyword set KS (Pi) corresponding to Pi are processed by association analysis (eg, so-called shopping cart analysis) in units of the patent group, association information between key keywords is generated. Association information between keywords can be visualized into the core keyword network.

한편, 상기 타겟 특허 집합 생성부(5120)의 유사 기술 특허군 생성부(5123)가 특허 분류를 활용하여 유사 특허군을 생성 방법을 설명한다. 자기 특허 SSi에 n개의 IPC와 m개의 고유 특허 분류(USPC를 예로 들어 설명한다)가 있는 경우, 상기 n개의 IPC와 상기 m개의 고유 특허 분류를 활용하여 유사 특허군을 생성한다. 상기 유사 기술 특허군 생성부(5123)는 IPC1 내지 IPCn별로 주특허 분류 또는 부특허 분류를 포함하여 이 IPC를 가지는 TS(IPC1),TS(IPCn)을 추출한다. 추출된 TS(IPCi)의 크기(TS(IPC1)에 속하는 특허의 개수가 크기가 된다.)를 s(TS(IPC1))라고 할 때, TS(IPCi)에 속하는 TSi(IPCi)에 대해서 1/s(TS(IPCi))의 가중치를 할당한다. 한편, TS(IPC1),TS(IPCn) 중 동일한 특허가 중복되어 있는 경우, 즉, SSi에서 2개 이상의 IPC가 있고, TS에 속하는 TSi 중에서 상기 2개 이상의 IPC가 일치하는 특허가 있는 경우, 일치되는 IPCi와 적어도 하나 이상의 IPCj에 대해서 1/s(TS(IPCi and IPCj))를 할당할 수 있다. TS(IPCi and IPCj)란 IPCi와 IPCj를 함께 가지는 TSi로 생성되는 TS를 말한다. 한편, 동일한 IPC는 아니더라도 기술 분류 체계 상 근접한 IPC의 경우에는 일정 수준 이상의 유사성을 가질 수 있다. 상기 유사 기술 특허군 생성부(5123)는 특허 분류 체계를 참조하여 SSi의 IPCi의 직상위 특허 분류 등과 같은 적어도 하나 이상의 상위 특허 분류를 추출하고, 상기 상위 특허 분류를 포함하는 특허 문건에 대해서는 상기 상위 특허 분류를 포함하는 TS의 크기에 반비례하는 가중치를 대응시킬 수 있을 것이다. 한편, 상기 유사 기술 특허군 생성부(5123)는 특허 분류 체계를 참조하여 SSi의 IPCi의 적어도 하나 이상의 하위 특허 분류를 추출하고, 상기 추출된 하위 특허 분류를 포함하는 특허 문건에 대해서는 상기 하위 특허 분류를 포함하는 TS의 크기에 반비례하는 가중치를 대응시킬 수 있을 것이다. Meanwhile, the method of generating a similar patent group using the patent classification by the similar technology patent group generation unit 5123 of the target patent set generation unit 5120 will be described. If the own patent SSi has n IPCs and m unique patent classifications (to be described using USPC as an example), a similar patent group is generated using the n IPCs and the m unique patent classifications. The similar technology patent group generating unit 5123 extracts TS (IPC1) and TS (IPCn) having this IPC, including the main patent classification or the subpatent classification by IPC1 to IPCn. When the size of the extracted TS (IPCi) (the number of patents belonging to TS (IPC1) becomes size) is s (TS (IPC1)), 1 / for TSi (IPCi) belonging to TS (IPCi) The weight of s (TS (IPCi)) is assigned. On the other hand, if the same patent among TS (IPC1) and TS (IPCn) overlaps, that is, if there are two or more IPCs in SSi, and there is a patent in which the two or more IPCs match among TSi belonging to TS, 1 / s (TS (IPCi and IPCj)) may be allocated to the IPCi and at least one IPCj. TS (IPCi and IPCj) refers to TS generated by TSi having both IPCi and IPCj. On the other hand, even if the IPC is not the same IPC in the technical classification scheme may have a similar level or more than a certain level. The similar technology patent group generation unit 5123 extracts at least one or more upper patent classifications, such as upper-level patent classifications of the IPCi of SSi, with reference to the patent classification system, and for the patent documents including the upper patent classifications. Weights that are inversely proportional to the size of the TS including the patent classification may be mapped. Meanwhile, the similar technology patent group generation unit 5123 extracts at least one lower patent classification of the IPCi of SSi with reference to the patent classification system, and the lower patent classification for the patent document including the extracted lower patent classification. The weight may be inversely proportional to the size of the TS that includes a.

한편, SSi와 TSi간의 출원일이 차이가 많이 날수록 동일한 특허 분류를 가지더라도 내용상 비유사할 가능성이 클 수 있다. 따라서, 상기 가중치에 출원일의 차이값에 반비례 관계를 가지는 진부화 함수를 고려한 진부화 가중치를 곱할 수 있다. 진부화 가중치는 상수/f(SSi의 출원일과 TSi의 출원일간의 경과 일수)가 될 수 있다.On the other hand, the more the application date between the SSi and TSi is more likely to be similar in content even if they have the same patent classification. Therefore, the weight may be multiplied by the obsolescence weight in consideration of the obsolescence function having an inverse relationship to the difference in the filing date. The obsolescence weight may be a constant / f (the number of days elapsed between the filing date of SSi and the filing date of TSi).

한편, 주특허 분류 이외의 부특허 분류를 활용한 유사 특허군 생성 시, SSi 나 TSi에 다수의 특허 분류가 포함되어 있는 경우, 특허 분류 개수가 많을수록 SSi의 특허 분류와 TSi의 특허 분류가 일치한다고 하더라도 기술의 내용상 일치할 확률이 낮아지게 된다. 따라서, 특허 분류의 개수에 따르는 희석율이 도입이 필요하게 된다. 희석율은 각 가중치에 곱하여 진다. 희석율은 SSi의 특허 분류 수에 반비례하며 및 TSi의 특허 분류수에 반비례한다. 한편, 희석율의 적용 시 일치하는 특허 분류가 주특허 분류에서 일치하는 경우, 주특허 분류 일치 가중치가 더 부여 될 수 있다. 즉, SSi의 주특허 분류와 TSi의 특허 분류가 일치하거나, SSi의 특허 분류와 TSi의 주특허 분류와 일치할 경우 분류 일치 가중치가 추가적으로 더 부여될 수 있다. 상기와 같은 방법으로 특허 분류를 활용하여 생성되는 TSi별로 기 설정된 가중치가 부여될 수 있을 것이다. USPC, FT, FI, ECLA와 같은 고유 특허 분류에 대해서도 IPC와 동일하게 처리한다. SSi에 대응되는 TSi가 2 종류 이상의 특허 분류와 관련되어 있을 때는 TSi의 가중치는 각 특허 분류 종류별로 합산할 수 있을 것이다.On the other hand, when generating similar patent groups using sub-patent classifications other than the main patent classification, when SSi or TSi includes a plurality of patent classifications, the larger the number of patent classifications, the more the patent classification of SSi and the patent classification of TSi match. Even so, the probability of coincidence is low. Therefore, the dilution rate according to the number of patent classifications needs to be introduced. The dilution rate is multiplied by each weight. Dilution rate is inversely proportional to SSi's patent classification number and inversely proportional to TSi's patent classification number. On the other hand, if the matching patent classification in the main patent classification when the dilution rate is applied, the main patent classification matching weight may be further given. That is, when the main patent classification of SSi and the patent classification of TSi coincide with each other, or when the patent classification of SSi and the main patent classification of TSi coincide with each other, a classification matching weight may be additionally given. In the above manner, a preset weight may be given for each TSi generated by using a patent classification. Proprietary patent classifications such as USPC, FT, FI, and ECLA are treated the same as IPC. When TSi corresponding to SSi is associated with two or more types of patent classifications, the weight of TSi may be summed for each patent classification type.

상기와 같은 방법으로 SS에 속하는 개별 특허별로 유사 특허군을 생성할 수 있다. 생성되는 유사 특허의 개별 특허에는 유사도 점수가 배정되어 있을 수 있다. 따라서, SSi 전체 특허의 관점에서 유사 특허군 집합을 생성하는 것은 1) SS에 속하는 개별 특허별로 생성되는 유사 특허군 중에서 기 설정된 유사도 점수 이상이 되는 특허들로 생성하는 방법, 2) SS에 속하는 개별 특허별로 유사도 점수를 포함하는 유사 특허군을 생성하고, 생성된 유사 특허군을 합하고, 합친 전체 유사 특허군 중에서 개별 특허를 기준으로 유사도의 합계 점수(2회 이상 나타나는 유사 특허는 2개 이상의 유사도 점수를 가짐)를 기준으로 기 설정된 개수나 비율(SS의 특허 개수 대비 3~5배 등)의 유사 특허군 집합을 생성할 수 있다. 이와 같은 과정을 통하여 유사 특허군 집합에 포함되는 특허마다 유사도 합계 점수가 대응되게 된다. 상기 유사도 점수의 생성도 일종의 TSi에 대한 가중치가 되므로, 상기 유사도 점수의 설정은 본 발명의 가중치 조절부(5143)에서 수행한다.In this manner, a similar patent group may be generated for each patent belonging to the SS. Similar patents may be assigned to individual patents of similar patents that are created. Accordingly, generating a set of similar patent groups in terms of all SSi patents includes: 1) a method for generating patents having a predetermined similarity score or more among similar patent groups generated for each individual patent belonging to the SSi, and 2) individual belonging to the SS. Create a similar patent group including similarity scores for each patent, add the similar patent groups generated, and add the total score of similarity based on the individual patents among the total similar patent groups combined (the similar patents appearing more than once for two or more similarity scores) Based on the number) can be generated a set of similar patent group of a predetermined number or ratio (3 to 5 times the number of patents of the SS, etc.). Through this process, the similarity total scores correspond to each patent included in the similar patent group set. Since the generation of the similarity score also becomes a weight for a kind of TSi, the setting of the similarity score is performed by the weight adjusting unit 5203 of the present invention.

상기 유사 특허군 집합에서 선출원 유사 특허군 집합을 생성할 수 있다. SSi가 다수 있을 때 이들의 특허 출원일이 범위(range)를 가질 수 있기 때문에, SS의 출원일을 일의적으로 특정하기 어려운 문제가 발생하여, 선출원 유사 특허군과 후출원 유사 특허군을 특정하기 어려워지는 문제가 발생한다. 선출원 유사 특허군 집합을 생성하는 방법은 1)SS에 속하는 개별 특허에서 유사 특허군을 생성할 때, 상기 개별 특허의 출원일보다 선출원인 유사 특허들로 유사 특허군을 생성하는 방법, 2)SS에 속하는 특허들의 출원인의 중앙값을 기준으로 전체 유사 특허군 중에서 중앙값보다 빠른 출원일을 가지는 경우, 그러한 속성을 가지는 특허군을 선출원 유사 특허군으로 하고, 그렇지 않은 속성을 가지는 특허군을 후출원 유사 특허군으로 처리하는 방법이 있을 수 있다. In the set of similar patent groups, a previous set of similar patent groups may be generated. Since there may be a range of patent filing dates when there are a plurality of SSi, a problem arises that it is difficult to uniquely specify the filing date of the SS, which makes it difficult to specify a prior patent group and a similar patent group. A problem arises. The method of generating a set of similar patent groups of the first application includes: 1) a method of generating a similar patent group with similar patents that are earlier than the filing date of the individual patents when generating a similar patent group from individual patents belonging to the SS; Based on the median of the applicant's median of the patents belonging to the patent group, the group of patents having such an attribute is regarded as a pre-applied similar patent group, and the patent group having an attribute as a post-applied pseudo-patent group. There may be a way to do it.

상기 선출원 유사 특허군 집합을 생성하는 방법과 마찬가지로 후출원 유사 특허군 집합을 생성할 수 있음은 상기 설명으로 자명할 것이다.It will be apparent from the above description that the post-appearance similar patent group set may be generated similarly to the method of generating the prior application similar patent group set.

상기 TS의 대표적인 또 다른 예는 사용자가 구성하는 사용자 구성 특허 집합이다. 상기 선출원/후출원 유사 특허군 집합이나, 선출원/후출원 인용 특허 집합 등에 대해서도 기간, 속성(권리자 속성, 특허 속성, 기술 분야 속성 등) 또는 검색어나 검색식 등으로 한정을 수행할 수 있지만, 본질적으로 일반적 특허 집합이다. 이에, 사용자는 자신의 검색식이나, 자신이 관리하는 특허 집합이나, 자신의 경쟁사의 특허나 유관사의 특허 등과 같은 임의의 특허 집합 한정 수단을 부가하여 자신만의 특허 집합을 생성할 수 있다. 예를 들면, 자신을 공격해온 A라는 권리자가 있을 때, A의 특허 포트폴리오에 속하는 특허 또는 공격해 온 특허 또는 공격해 온 특허의 유사 특허군에 속하는 특허 등의 특정된 특허 집합의 선출원 인용 특허 집합은 사용자 구성 특허 집합의 예가 될 수 있다. 사용자 구성 특허 집합을 생성하는 다른 방법은 상기 인용 특허 집합 및/또는 유사 특허군 집합 및/또는 사용자가 입력 또는 생성하는 특허 집합에 속하는 특허들 중에서 선별하는 방식으로 생성하는 것이다.Another representative example of the TS is a user configuration patent set configured by a user. It is possible to limit the term of the preceding patent set or the similar patent group set, the patent set of the first patent application or the post-patent application, and the like by term, attribute (property attribute, patent attribute, technical field attribute, etc.), search term or search expression, and the like. As a general patent set. Thus, the user can create his own patent set by adding his or her own search formula, patent set managed by him, or any patent set limiting means such as his competitor's patent or related company's patent. For example, when there is an owner of A who has attacked himself, a prior patent cited set of specified patent sets, such as a patent belonging to A's patent portfolio or a patent belonging to an attacking patent or a similar patent group of an attacking patent. May be an example of a user configured patent set. Another method of generating a user-configured patent set is to generate a selected method from among the cited patent set and / or similar patent group set and / or patents belonging to a patent set inputted or generated by a user.

한편, 사용자 구성 특허 집합을 구성할 때, 상기 사용자는 상기 TSi에 대하여 주관적인 가중치를 설정할 수도 있을 것이다. 이러한 가중치는 사용자 설정 TSi 가중치라 한다. 상기 가중치의 설정은 본 발명의 가중치 조절부(5143)에서 제공하는 UI를 통하여 수행한다. 상기 UI는 TS에 속하는 특허 리스트를 제공해 주고, 상기 특허 리스트에 사용자가 가중치나 중요도 또는 중요 등급을 입력할 수 있도록 하는 방식으로 수행한다.On the other hand, when constructing a user-configured patent set, the user may set a subjective weight for the TSi. This weight is referred to as a user set TSi weight. The setting of the weight is performed through a UI provided by the weight adjusting unit 5153 of the present invention. The UI provides a list of patents belonging to the TS, and the user can input a weight, importance, or important grade to the patent list.

상기 TS의 대표적인 다른 예는 시스템이 제공하는 시스템 구성 특허 집합이다. 시스템은 SSi가 속하는 최빈 특허 분류 등과 같이 SS 또는 SSi를 대표하는 속성에 따라 SSi가 인용하는 선출원 특허들로 구성되는 협의의 선출원 인용 특허 집합이다. 한편, 상기 시스템은 최근 출원 또는 등록량이 급증하는 특허 집합 또는 대학이나 연구 기관의 특허, 특정 권리자(매각 예정 기업 등)의 특허 집합 등과 같이 시스템의 관리자 설정하는 다양한 특허 집합을 사용자에게 제시하여 선택 받거나, 디폴트로 처리하여 시스템 구성 특허 집합을 생성할 수 있게 된다.Another representative example of the TS is a system configuration patent set provided by the system. The system is a set of negotiated prior-cited patents consisting of pre-patent patents cited by SSi according to attributes representing SS or SSi, such as the most frequent patent classification to which SSi belongs. On the other hand, the system is selected by presenting to the user various patent sets that are set by the administrator of the system, such as patent sets of recent applications or registrations, or patents of universities or research institutes, patent sets of specific rights holders (to be sold, etc.) By default, the system configuration patent set can be generated.

이어, SS와 TS가 특정되는 경우, 상기 시스템의 정보 처리 방법에 대해서 설명한다. 먼저 분쟁 예측 방법에 대해 설명한다.Next, when the SS and the TS are specified, the information processing method of the system will be described. First, the dispute prediction method will be described.

타겟 특허 집합 생성부(5120)에 의해 TS가 결정되면, TS에 속하는 모든 TSi에 대하여 상기 분쟁 예측 모델값 입수부(5130)는 분쟁 예측 모델 Sg에 대한 분쟁 예측 모델값인 Sg(TSi)를 입수하고, 저장한다. 상기 분쟁 예측 정보 생성부(5140)는 TS에 속하는 모든 TSi에 대한 Sg(TSi)값을 사용할 수도 있지만, 4분위수 등과 같은 n분위수 중 기 설정된 높은 분위에 속하는 Sg(TSi)들만을 처리 대상으로 하거나, Sg(TSi)값이 기 설정된 기준 이상이 것들만을 처리 대상으로 하는 등, 처리 대상에 대한 한정을 수행할 수 있다. 이러한 처리 대상의 한정은 시스템이 설정하거나, 사용자가 설정할 수 있을 것이다. When the TS is determined by the target patent set generation unit 5120, the dispute prediction model value acquisition unit 5130 obtains Sg (TSi), which is the dispute prediction model value for the dispute prediction model Sg, for all TSi belonging to the TS. Save it. The dispute prediction information generating unit 5140 may use Sg (TSi) values for all TSi belonging to TS, but may process only Sg (TSi) belonging to a predetermined high quartile among n quartiles, etc. For example, the Sg (TSi) value may be limited to a processing target such that only those having a predetermined reference value or more are processed. The limitation of the processing target may be set by the system or set by the user.

상기 분쟁 예측 정보 생성부(5140)는 상기 분쟁 예측 모델값 입수부(5130)가 입수한 Sg(TSi)에 대하여 다중 관계성 처리 모듈(5141)을 통하여 상기 Sg(TSi)에 기 설정된 가중치를 부가한다. Sg(TSi)에 부과되는 가중치는 TSi에 부과되는 가중치와 동일한 기원을 가진다. TSi에 부과되는 가중치는 1) SSi와의 관련성 가중치, 2) 중복 가중치, 3) 사용자가 SSi에 부여한 주관적 가중치, 4) TSi에 대하여 사용자가 부여한 주관적 가중치 중 어느 하나 이상을 사용하여 결정된다. TSi에 부과되는 가중치를 W(TSi)라 한다. 상기 TSi에 부과되는 가중치는 상기 TSi와 SSi와의 관계에도 영향을 받는다. 따라서, 상기 W(TSi)는 W(TSi)=W(TSi, SSi)가 될 수 있다. 상기 W(TSi, SSi)에는 사용자가 SSi에 부과하는 주관적인 가중치, SSi와 관련되는 TSi별 관련성 정도(상기 관련성 정도에는 적어도 2 이상의 SSi들이 하나의 TSi와 관련될 때(즉, SSi 기준으로 동일한 TSi에서 중복이 발생될 때)의 관련성을 포함한다.), TSi에 대하여 사용자가 부과하는 주관적 가중치가 고려되는 가중치 관계식이다.The dispute prediction information generation unit 5140 adds a predetermined weight to Sg (TSi) through the multiple relationship processing module 5251 with respect to Sg (TSi) obtained by the dispute prediction model value acquisition unit 5130. do. The weight imposed on Sg (TSi) has the same origin as the weight imposed on TSi. The weight imposed on TSi is determined using one or more of 1) relevance weight with SSi, 2) overlapping weight, 3) subjective weight assigned by SSi to user, and 4) subjective weight assigned by user to TSi. The weight imposed on TSi is called W (TSi). The weight imposed on the TSi is also affected by the relationship between the TSi and the SSi. Therefore, W (TSi) may be W (TSi) = W (TSi, SSi). W (TSi, SSi) includes the subjective weights that the user imposes on SSi, and the degree of relevance for each TSi related to SSi (when at least two or more SSi are associated with one TSi (i.e., the same TSi based on SSi) ), A weight relation that takes into account the subjective weight imposed by the user on TSi.

상기 가중치와 관련된 정보의 처리는 본 발명의 가중치 조절부(5143)에서 제공하는 UI를 통하여 수행한다. 상기 사용자는 상기 가중치 조절부(5143)를 통하여 개별 특허 SSi나 TSi에 주관적 가중치를 부여할 수 있으며, 관련성의 종류나 세부적인 관련성별로 자신이 설정하는 가중치를 부여하거나 시스템이 부여한 가중치를 조절할 수 있으며, 상기 1) 내지 4)의 각 가중치 종류별로 각 가중치 종류의 적용 여부나 각 가중치 종류별로의 상대적인 가중치를 설정(예를 들면, 1)에 50%, 2)에 30%, 3)에 10%, 4)에 10% 등을 부여하는 것 등)하거나, 시스템이 상기 1) 내지 4)에 부여한 가중치를 조절할 수 있을 것이다.Processing of the information related to the weight is performed through the UI provided by the weight adjusting unit 5503 of the present invention. The user may assign subjective weights to individual patents SSi or TSi through the weight adjusting unit 5503, and may assign weights set by the user according to the type of relevance or the detailed relevance or adjust the weights assigned by the system. For each weighting type of 1) to 4), whether each weighting type is applied or the relative weighting for each weighting type is set (for example, 50% in 1), 30% in 2), and 10% in 3). , 4), 10%, etc.), or the weight assigned by the system to 1) to 4).

상기 분쟁 예측 정보값 생성 모듈(5142)은 상기 Sg(TSi)와 상기 W(TSi)를 처리하여 TSi별로 분쟁 예측 정보 wf(Sg(TSi), W(TSi))를 생성한다. 여기서 wf는 처리식이며, wf의 간단한 예는 Sg(TSi)*W(TSi)이며, 가장 간단한 예를 가중치를 전혀 적용 않은 Sg(TSi)가 될 수 있다. 이와 같이 상기 분쟁 예측 정보 생성부(5140)는 타겟 특허 집합이 결정되면, 타겟 특허 집합의 TSi에 대하여 wf(Sg(TSi), W(TSi))를 생성한다. 상기 분쟁 예측 정보값 제공부(5150)는 상기 wf(Sg(TSi), W(TSi))를 요청자에게 제공한다. 상기 TSi별로 생성되는 wf(Sg(TSi), W(TSi))가 TS를 기준으로 한 SS 입장의 분쟁 예측 정보의 일례가 된다. 이어, 상기 분쟁 예측 정보 생성부(5140)는 타겟 특허 집합 전체에 대하여 적어도 하나 이상의 집합적 분쟁 예측 정보를 생성할 수 있다. 상기 집합적 분쟁 예측 정보 생성 함수를 f(TS)라 한다. f(TS)의 한 예는 TSi별 wf(Sg(TSi), W(TSi))의 총합, TSi별 wf(Sg(TSi), W(TSi))의 평균, 기 설정된 분위수나 기 설정된 상위%의 TSi별 wf(Sg(TSi), W(TSi))의 총합이나 평균, TSi별 wf(Sg(TSi), W(TSi))의 분포에 대한 기 설정된 통계 처리값, 상기 통계 처리값을 기 설정된 등급 부여 모델에 적용한 등급값 등이 f(TS)의 예가 될 수 있다.The dispute prediction information value generation module 5514 processes the Sg (TSi) and the W (TSi) to generate dispute prediction information wf (Sg (TSi), W (TSi)) for each TSi. Here, wf is a processing formula, and a simple example of wf is Sg (TSi) * W (TSi), and the simplest example may be Sg (TSi) without any weight applied thereto. As such, when the target patent set is determined, the dispute prediction information generation unit 5140 generates wf (Sg (TSi), W (TSi)) for TSi of the target patent set. The dispute prediction information value provider 5150 provides the wf (Sg (TSi), W (TSi)) to the requester. Wf (Sg (TSi), W (TSi)) generated for each TSi is an example of dispute prediction information of the SS position based on the TS. Subsequently, the dispute prediction information generation unit 5140 may generate at least one collective dispute prediction information for the entire target patent set. The collective dispute prediction information generating function is called f (TS). An example of f (TS) is the sum of wf (Sg (TSi), W (TSi)) by TSi, the average of wf (Sg (TSi), W (TSi)) by TSi, the preset quantile or the preset upper percent. Pre-set statistical processing values for the sum or average of wf (Sg (TSi), W (TSi)) per TSi, the distribution of wf (Sg (TSi), W (TSi)) per TSi, and the statistical processing value. An example of f (TS) may be a rating value applied to the set rating model.

도 38는 상기 분쟁 예측 정보값 생성 모듈(5142)이 1개의 특허에 대하여 분쟁 예측 정보 값을 생성하는 과정에 대한 모식도이다. 상기 도 38에서는 SSi가 n개의 TSi들과 각각 W(SSi, TSi)의 가중치를 가지는 것을 보여 주고 있다. 이 경우, 상기 분쟁 예측 정보값 생성 모듈(5142)은 상기 개별 특허 SSi에 대하여 하기와 같은 수식의 분쟁 예측 정보를 생성할 수 있다.38 is a schematic diagram of a process in which the dispute prediction information value generation module 5514 generates a dispute prediction information value for one patent. In FIG. 38, SSi has n TSi and a weight of W (SSi, TSi), respectively. In this case, the dispute prediction information value generation module 5514 may generate dispute prediction information of the following formula for the individual patent SSi.

wf(Sg(TSi), W(TSi))(SSi) = wf (Sg (TSi), W (TSi)) (SSi) =

{Sg(TS1)*W(SSi,TS1)++ Sg(TSi) *W(SSi,TSi)++ Sg(TSn) *W(SSi,TSn)}/{W(SSi,TS1)++W(SSi,TSi)++ W(SSi,TSn)}{Sg (TS1) * W (SSi, TS1) ++ Sg (TSi) * W (SSi, TSi) ++ Sg (TSn) * W (SSi, TSn)} / {W (SSi, TS1) ++ W (SSi, TSi) ++ W (SSi, TSn)}

한편, 상기 분쟁 예측 정보는 n개의 개별 특허 SSi로 이루어진 SS 전체에 대하여 하기 수식과 같은 분쟁 예측 정보를 생성할 수 있다.On the other hand, the dispute prediction information may generate dispute prediction information such as the following formula for the entire SS consisting of n individual patent SSi.

wf(Sg(TSi), W(TSi))(SS) = 1-{1-wf(Sg(TSi), W(TSi))(SS1)}{1-wf(Sg(TSi), W(TSi))(SSi)} {1-wf(Sg(TSi), W(TSi))(SSn)} wf (Sg (TSi), W (TSi)) (SS) = 1- {1-wf (Sg (TSi), W (TSi)) (SS1)} {1-wf (Sg (TSi), W (TSi )) (SSi)} {1-wf (Sg (TSi), W (TSi)) (SSn)}

도 41은 본 발명의 분쟁 예측 정보 생성부(5140)가 생성하는 분쟁 예측 정보의 종류나 순서에 대한 일 실시예적 도면이다. 상기 분쟁 예측 정보 생성부(5140)은 먼저 개별적인 자기 특허별 분쟁 예측 정보를 생성(SL71)할 수 있다. 즉, 하나의 SSi에 대한 타겟 특허 집합을 생성한 다음, SSi에 대한 분쟁 예측 정보로서 TS를 구성하는 개별 TSi별 또는 TS별 분쟁 예측 정보를 생성할 수 있다. 한편, 상기 분쟁 예측 정보 생성부(5140)는 적어도 하나 이상이 자기 특허로 구성되는 자기 특허 집합별로 기설정된 분쟁 예측 정보를 생성(SL72)할 수 있다. 한편, 상기 분쟁 예측 정보 생성부(5140)는 개별적인 SSi에 대해서건 SS에 대해서건 구성되는 TS를 구성하는 타겟 특허별로 기설정된 분쟁 예측 정보를 생성(SL73)할 수 있다. 그리고, 상기 분쟁 예측 정보 생성부(5140)는 개별적인 SSi에 대해서건 SS에 대해서건 구성되는 타겟 특허 집합별로 기설정된 분쟁 예측 정보를 생성(SL74)할 수 있다. 41 is a diagram illustrating the type and order of dispute prediction information generated by the dispute prediction information generation unit 5140 of the present invention. The dispute prediction information generation unit 5140 may first generate dispute prediction information for each self patent (SL71). That is, the target patent set for one SSi may be generated and then individual TSi or TS-specific dispute prediction information constituting the TS may be generated as the dispute prediction information for the SSi. On the other hand, the dispute prediction information generation unit 5140 may generate the dispute prediction information predetermined for each of a set of self-patents of which at least one is composed of own patents (SL72). Meanwhile, the dispute prediction information generation unit 5140 may generate dispute prediction information preset for each target patent constituting the TS configured for each SSi or SS (SL73). In addition, the dispute prediction information generation unit 5140 may generate dispute prediction information preset for each target patent set configured for individual SSi or SS (SL74).

이어, 도 42 내지 도 43을 참조하면서 집합 분할 개념을 적용한 분쟁 예측 정보 생성 방법에 대해 설명한다. 도 42는 본 발명의 특허 집합 분할부(5310)를 통하여 자기 특허 집합을 분할한 다음, 분할된 자기 특허 집합에 대한 분쟁 예측 정보를 생성하는 일 실시예적 방법을 도시하고 있다. 상기 분쟁 예측 엔진(5100)은 자기 특허 집합을 입수(SL81)하고, 상기 특허 집합 분할부(5310)를 통하여, 자기 특허 집합에 대한 분할 기준을 입수(SL82)하고, 분할된 자기 특허 집합에 대응되는 타겟 특허 집합 생성(SL83)한 다음, 분할된 자기 특허 집합에 대응되는 타겟 특허 집합을 기준으로 한 분쟁 예측 정보를 생성(SL84)한다.Next, the dispute prediction information generation method to which the aggregation concept is applied will be described with reference to FIGS. 42 to 43. 42 illustrates an exemplary method of dividing a self-patent set through the patent set divider 5310 of the present invention and then generating dispute prediction information for the divided self-patent set. The dispute prediction engine 5100 obtains a self patent set (SL81), obtains a division criterion for the self patent set (SL82) through the patent set divider 5310, and corresponds to the divided self patent set. After generating the target patent set SL83, dispute prediction information based on the target patent set corresponding to the divided self patent set is generated (SL84).

도 43은 본 발명의 특허 집합 분할부(5310)를 통하여 타겟 특허 집합을 분할한 다음, 분할된 타겟 특허 집합에 대한 분쟁 예측 정보를 생성하는 일 실시예적 방법을 도시하고 있다. 상기 분쟁 예측 엔진(5100)은 자기 특허 집합을 입수(SL91)하고, 상기 자기 특허 집합에 대응되는 타겟 특허 집합을 생성(SL92)하고, 타겟 특허별 분쟁 예측 모델값을 생성(SL93)한다. 이어, 상기 상기 특허 집합 분할부(5310)를 통하여, 분할 기준을 입수한 후 상기 타겟 특허 집합을 분할(SL94)하고, 분할된 상기 타겟 특허 집합을 기준으로 한 분쟁 예측 정보 생성(SL95)를 생성한다. 이하, 분설한다. FIG. 43 illustrates an exemplary method of dividing a target patent set through the patent set dividing unit 5310 of the present invention and then generating dispute prediction information for the divided target patent set. The dispute prediction engine 5100 obtains a self patent set (SL91), generates a target patent set corresponding to the self patent set (SL92), and generates a dispute prediction model value for each target patent (SL93). Subsequently, through the patent set dividing unit 5310, after obtaining a dividing criterion, the target patent set is divided (SL94), and dispute prediction information generation (SL95) is generated based on the divided target patent set. do. Hereinafter, powdering is performed.

상기 TS는 타겟 특허 집합이며, 상기 타겟 특허 집합은 적어도 하나 이상의 분할 기준을 적용하여 적어도 하나 이상의 유닛으로 분할할 수 있다. 상기 분할은 본 발명의 분쟁 예측 정보 분석 엔진(5300)의 특허 집합 분할부(5310)가 수행할 수 있다. 유닛으로 분리된 i번째 TS를 TSui라 하자. 이 경우 상기 분쟁 예측 정보 생성부(5140)는 TSui별로 wf(Sg(TSui), W(TSui))나 f(TSui)의 값이 생성될 수 있을 것이다. TS를 분할하는 방법은 1) 권리자별, 2) 기술군(IPC나, USPC를 활용할 수 있음)별, 3) 기간별, 4) 발명자별, 5) 국가별, 6) 권리자 속성별, 7) 적어도 하나 이상의 특정 키워드 포함 여부별, 8) 기타 특허 집합을 분할하거나 한정할 수 있는 조건별, 9) 특정된 속성을 가지는 특허군별, 및/또는 10) 상기 1) 내지 9)의 방법을 조합한 것들이 있을 수 있다. 상기 6)의 예는(1) 등록된 트롤 속성,(2) 다분쟁 유발자 등이 될 수 있다. 상기 9)의 특정된 속성의 예는(1) TS 중 SS와 유사성이 높은 특허군,(2) 사용자 설정 SSi 가중치가 기 설정된 랭킹, 수준, 범위 이상 높은 SSi에 대응되는 TSi들로 구성되는 특허군 등이 될 수 있다.The TS is a target patent set, and the target patent set may be divided into at least one unit by applying at least one or more division criteria. The division may be performed by the patent set division unit 5310 of the dispute prediction information analysis engine 5300 of the present invention. Let the i-th TS divided into units be TSui. In this case, the dispute prediction information generation unit 5140 may generate a value of wf (Sg (TSui), W (TSui)) or f (TSui) for each TSui. How to segment TS is 1) by owner, 2) by technology group (may use IPC or USPC), 3) by period, 4) by inventor, 5) by country, 6) by owner attribute, 7) at least By combination of one or more specific keywords, 8) by conditions that can divide or define other patent sets, 9) by patent groups with specified attributes, and / or 10) by combining the methods of 1) to 9) above. There may be. Examples of 6) may be (1) registered troll attributes, (2) multidispute triggers, and the like. Examples of the specified attribute of 9) include (1) a patent group having a high similarity to SS among TS, and (2) a patent consisting of TSi corresponding to SSi whose user-set SSi weight is higher than a predetermined ranking, level, and range. Military and the like.

한편, 상기 SS는 자기 특허 집합이며, 상기 자기 특허 집합도 상기 타겟 특허 집합처럼 분할 방법을 통하여 하나 이상의 유닛으로 분할할 수 있을 것이다. 상기 분할된 i번째 SS를 SSui라 하며, 상기 SSui별로 생성되는 타겟 특허 집합인 TS(SSui)를 생성할 수 있을 것이며, 상기 TS(SSui)에 대하여 TS(SSui)별로 wf(Sg(TS(SSui)), W(TS(SSui)))나 f(TS(SSui))를 생성할 수 있을 것이다. 한편, 상기 TS(SSui)도 타겟 특허 집합이 일종이므로, 상기 분할 방법을 통하여 분할 할 수 있으며, 분할된 TSui(SSui)에 대하여도 wf(Sg(TSui(SSui)), W(TSui(SSui)))나 f(TSui(SSui))를 생성할 수 있을 것이다.Meanwhile, the SS is a self-patent set, and the self-patent set may be divided into one or more units through a division method as the target patent set. The divided i-th SS may be called SSui, and a TS (SSui) which is a target patent set generated for each SSui may be generated, and wf (Sg (TS (SSui)) for each TS (SSui) with respect to the TS (SSui). )), W (TS (SSui))) or f (TS (SSui)). On the other hand, the TS (SSui) is also a target patent set is a kind, can be divided by the partitioning method, and also for the divided TSui (SSui) wf (Sg (TSui (SSui)), W (TSui (SSui)) )) Or f (TSui (SSui)).

본 발명의 상기 분쟁 예측 정보 분석 엔진(5300)은 분쟁 예측 정보를 분석한다. 도 44에는 본 발명의 분쟁 예측 정보 분석 엔진(5300)의 정보 처리 방법에 대한 일 실시예적 구성이 나와 있다. 상기 분쟁 예측 정보 분석 엔진(5300)은 타겟 특허별 분쟁 예측 모델값 정보를 생성(SL101)한 다음, 자기 특허, 자기 특허 집합, 분할 자기 특허 집합 각각에 대한 타겟 특허, 타겟 특허 집합, 분할 타겟 특허 집합을 기준으로 한 분쟁 예측 모델값 또는 분쟁 예측 정보값을 생성(SL102)한다. 이어, 상기 분쟁 예측 정보 분석 엔진(5300)은 각 기준별 분쟁 예측 모델값 또는 분쟁 예측 정보값의 랭킹, 분포, 권리자별, 권리자 종류별, 발명자별, 특허 분류별 정량 분석을 수행(SL103)하거나, 타겟 특허의 서지 사항이나 분석 지표별 정량 분석을 수행(SL104)하여, 정량 분석 수행 결과값을 생성해 낸다.The dispute prediction information analysis engine 5300 of the present invention analyzes the dispute prediction information. FIG. 44 shows an exemplary configuration of an information processing method of the dispute prediction information analysis engine 5300 of the present invention. The dispute prediction information analysis engine 5300 generates the dispute prediction model value information for each target patent (SL101), and then target patents, target patent sets, and split target patents for respective patents, own patent sets, and divided self patent sets. The dispute prediction model value or the dispute prediction information value based on the set is generated (SL102). Subsequently, the dispute prediction information analysis engine 5300 performs quantitative analysis of ranking, distribution, right holder, right holder type, inventor, and patent classification of each dispute prediction model value or dispute prediction information value for each criterion (SL103), or a target. Quantitative analysis is performed for each bibliographic item or analytical index of the patent (SL104) to generate a quantitative analysis result.

상기 분쟁 예측 정보 분석 엔진(5300)의 분석 대상이 되는 분쟁 예측 정보는, SSi, TSi, wf(Sg(TSi), W(TSi)), f(TS)가 된다. 특히, TSi에는 서지 사항값과 상기 표 1 내지 표 4의 분쟁 예측 요소별 분쟁 예측 요소값이 있으며, 상기 서지 사항값과 분쟁 예측 요소값과 wf(Sg(TSi), W(TSi))은 다양한 정량 분석의 대상이 될 수 있다. 예를 들면, TS에 포함된 권리자별 TSi에 대한 wf(Sg(TSi), W(TSi))값으로, 특정 권리자가 SS에 얼마나 큰 분쟁 예측 정보값을 가지는 지를 분석할 수 있으며, 특정 권리자의 TSi에 대하여 가장 wf(Sg(TSi), W(TSi))값이 큰 TSi를 생성하는 등과 같이 wf(Sg(TSi), W(TSi))별 랭킹을 생성할 수 있을 것이다. 이를 통해서 SS를 소유하거나 SS와 관련 있는 제품을 생산하는 기업을 공격할 가능성이 높은 특허들 및 권리자를 효과적으로 선별할 수 있게 된다. 한편, 기 설정된 wf(Sg(TSi), W(TSi)) 이상을 가지는 TSi를 분석하여 최빈 발명자를 찾아, 최빈 발명자별 wf(Sg(TSi), W(TSi)) 정보를 생성할 수도 있을 것이다. 한편, wf(Sg(TSi), W(TSi))가 기 설정된 기준 이상이 되는 TSi 중에서 개인 권리자나 중소기업의 TSi를 추출할 수도 있을 것이다. The dispute prediction information to be analyzed by the dispute prediction information analysis engine 5300 is SSi, TSi, wf (Sg (TSi), W (TSi)), f (TS). In particular, TSi has a bibliographic value and a dispute prediction element value for each dispute prediction element of Tables 1 to 4, wherein the bibliographic value, dispute prediction element value, and wf (Sg (TSi), W (TSi)) are various. May be subject to quantitative analysis. For example, with the wf (Sg (TSi), W (TSi)) value for TSi for each rights holder included in TS, it is possible to analyze how much dispute prediction information value a specific rights holder has in SS, A ranking of wf (Sg (TSi), W (TSi)) may be generated, such as generating TSi having the largest wf (Sg (TSi), W (TSi)) value with respect to TSi. This effectively identifies patents and rights holders who are likely to attack companies that own the SS or produce products related to the SS. On the other hand, by analyzing the TSi having a predetermined wf (Sg (TSi), W (TSi)) or more to find the least inventor, it is also possible to generate wf (Sg (TSi), W (TSi)) information for each of the least inventors . On the other hand, the TSi of the individual owner or SME may be extracted from TSi whose wf (Sg (TSi), W (TSi)) is greater than or equal to a predetermined standard.

분쟁 예측 함수인 f에 대하여 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 정보값 또는 분쟁 예측 함수값이 나오면 분쟁 예측 함수값을 어떻게 구조화시켜 분쟁 예측 정보를 생성할 것인지는 본 발명의 분쟁 예측 정보 생성부(5140)에서 수행하며, 생성된 분쟁 예측 모델은 분쟁 예측 모델이 사용할 수 있다. 한편, 분쟁 예측 결과를 어떻게 출력할 것인지는 분쟁 예측 결과 출력부에서 수행한다. 실제 pdf 형태의 리포트나 전자 우편용 리포트는 본 발명의 특허 분쟁 정보 리포트 생성부(4440)가 담당하여 생성한다.When the dispute prediction function value or the dispute prediction function value is generated for each f (TS) or f (TSui) for f, which is a dispute prediction function, how to structure the dispute prediction function value to generate dispute prediction information is the dispute prediction information of the present invention. The dispute prediction model, which is performed by the generation unit 5140, may be used by the dispute prediction model. Meanwhile, how to output the dispute prediction result is performed by the dispute prediction result output unit. The actual report in pdf form or e-mail report is generated by the patent dispute information report generator 4440 of the present invention.

이어, 도 45를 참조하면서, 상기 분쟁 예측 엔진(5100)이 분쟁 예측 정보값에 기반하여 기 설정된 등급 정보를 생성하는 방법을 설명한다. 상기 분쟁 예측 엔진(5100)은 자기 특허, 자기 특허 집합, 분할 자기 특허 집합 각각에 대한 타겟 특허, 타겟 특허 집합, 분할 타겟 특허 집합을 기준으로 한 분쟁 예측 모델값 또는 분쟁 예측 정보값을 생성(SL111)한 다음, 등급 부여 모델에 따른 등급 정보를 생성(SL112)한다. 적어도 하나 이상의 집합적 분쟁 예측 정보 생성 함수인 f에 대하여 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 정보값이나 분쟁 예측 함수값이 나오면, 기 설정된 등급 부여 모델에 따라, 분쟁 예측 등급을 결정할 수 있다. 등급 부여 모델은 분쟁 예측 함수값 구간별로 등급을 부여하거나, 분쟁 예측 함수값의 분포를 고려하여, 분포에 기반한 등급 구간을 정하고, 정해진 등급 구간으로 예측 등급을 부여할 수 있을 것이다. 점수값이 있을 때, 이를 n개의 등급으로 나누는 모델은 다수가 있으며, 이는 당업자에게 당연할 것이다.Next, a method of generating, by the dispute prediction engine 5100, predetermined grade information based on the dispute prediction information value will be described with reference to FIG. 45. The dispute prediction engine 5100 generates a dispute prediction model value or a dispute prediction information value based on a target patent, a target patent set, and a split target patent set for each of a self patent, a self patent set, and a divided self patent set (SL111). ), And generates grade information according to the graded model (SL112). If a dispute prediction information value or a dispute prediction function value is generated for each of f (TS) or f (TSui) with respect to f, which is at least one collective dispute prediction information generating function, the dispute prediction grade is determined according to a preset rating model. Can be. The grading model may assign a rating to each dispute prediction function value section, or determine a rating section based on the distribution in consideration of the distribution of the dispute prediction function value, and give a prediction rating to the determined rating section. When there are score values, there are a number of models that divide them into n classes, which will be apparent to those skilled in the art.

이어, 특허 분쟁 예측 정보 생성 시스템(5000)의 모델을 업그레이드하는 방법에 대해 설명한다. 시간이 경과할수록 분쟁 발생 특허들이 추가되는 경우, 추가된 분쟁 발생 특허들을 포함시켜서 기존 분쟁 예측 요소 또는 새로운 분쟁 예측 요소 집합으로 분쟁 예측 모델을 생성하고, 생성된 분쟁 예측 모델로 분쟁 예측 정보 생성할 수 있게 된다. 즉, 시간의 경과에 따라 Pi별 분쟁 예측 요소값이 달라지게 되고, 분쟁 예측 요소값이 달라지면, 분쟁 예측 모델이 달라지게 되고, 분쟁 예측 모델값과 분쟁 예측 정보값도 달라지게 된다. 한편, 상기 특허 분쟁 예측 정보 생성 시스템(5000)은 상기 특허 분쟁 예측 정보 생성 시스템(5000)이 관리하는 적어도 하나 이상의 특허 집합과 상기 사용자가 관리하는 적어도 하나 이상의 특허 집합을 대상으로, 적어도 하나 이상의 분쟁 예측 모델값과 분쟁 예측 정보값을 기 설정된 주기 또는 기 설정된 조건에 따라 배치 생성할 수 있게 된다.Next, a method of upgrading the model of the patent dispute prediction information generation system 5000 will be described. If dispute-prone patents are added over time, a dispute prediction model can be generated from the existing dispute prediction element or a new dispute prediction element set by including the additional dispute occurrence patents, and the dispute prediction information can be generated using the generated dispute prediction model. Will be. In other words, the dispute prediction element value for each Pi changes over time, and if the dispute prediction element value is different, the dispute prediction model is different, and the dispute prediction model value and the dispute prediction information value are also different. The patent dispute prediction information generation system 5000 may predict at least one or more disputes regarding at least one patent set managed by the patent dispute prediction information generation system 5000 and at least one patent set managed by the user. It is possible to batch generate the model value and the dispute prediction information value according to a predetermined cycle or a predetermined condition.

이어, 도 46 내지 도 47을 참조하면서, 본 발명의 분쟁 파급 예측 엔진(5600)의 정보 처리 방법을 설명한다. 상기 분쟁 파급 예측 엔진(5600)은 신규한 개별 분쟁 특허 또는 신규한 분쟁 특허 집합을 입수(SL121)한 다음, 입수된 개별 분쟁 특허별로 또는 신규한 분쟁 특허 집합별로 타겟 특허 집합을 생성(SL122)한다. 이어, 상기 분쟁 파급 예측 엔진(5600)은 상기 타겟 특허 집합을 대상으로 권리자 분석을 포함한 기 설정된 정량 분석을 수행함(SL123)한다. 상기 분쟁 파급 예측 엔진(5600)은 사용자로부터 적어도 하나 이상의 자기 특허 집합 또는 자기 특허 집합을 생성할 수 있는 조건을 입수하여 등록(SL131)한 다음, 주기적 또는 기 설정된 조건 충족 시 자기 특허 집합별 또는 자기 특허 집합을 구성하는 자기 특허별로 분쟁 예측 정보를 생성(SL132)한다. 이어, 상기 분쟁 파급 예측 엔진(5600)은 생성된 분쟁 예측 정보를 사용자에게 리포팅(SL133)한다. 이하, 분설한다.Next, with reference to FIGS. 46-47, the information processing method of the dispute propagation prediction engine 5600 of this invention is demonstrated. The dispute propagation prediction engine 5600 obtains a new individual dispute patent or a new dispute patent set (SL121), and then generates a target patent set for each individual dispute patent obtained or for each new dispute patent set (SL122). . Subsequently, the dispute propagation prediction engine 5600 performs a predetermined quantitative analysis including an owner analysis on the target patent set (SL123). The dispute propagation prediction engine 5600 obtains and registers (SL131) a condition for generating at least one self-patent set or self-patent set from a user, and then sets a self-patent set or a self-patent when a periodic or preset condition is satisfied. Dispute prediction information is generated for each patent of the patent set (SL132). Subsequently, the dispute propagation prediction engine 5600 reports the generated dispute prediction information to the user (SL133). Hereinafter, powdering is performed.

상기 분쟁 파급 예측 엔진(5600)은 적어도 하나 이상의 신규한 분쟁 특허가 입수되는 경우, 신규 발생 분쟁 분석부(5610)를 통하여 상기 분쟁 특허를 분석하여, 상기 신규한 분쟁 특허에 관한 분석 정보를 생성하거나, 또는 신규 발생 분쟁 파급 예측부(5620)를 통하여 상기 신규한 분쟁 특허에 관한 분쟁 파급 예측 정보를 생성하거나, 분쟁 파급 알림부(5630)를 통하여 상기 신규한 분쟁 특허에 관한 알림 처리를 수행한다.When at least one new dispute patent is obtained, the dispute spread prediction engine 5600 analyzes the dispute patent through a new dispute analysis unit 5610 to generate analysis information regarding the new dispute patent, or Alternatively, the newly generated dispute propagation prediction unit 5620 may generate dispute propagation prediction information about the new dispute patent, or may perform notification processing on the new dispute patent through the dispute propagation notification unit 5630.

먼저 상기 신규 발생 분쟁 분석부(5610)가 분석하는 정보는 상기 신규한 분쟁 특허가 분쟁을 일으킨 회수 및 회수의 시계열적 분포, 분쟁 당사자(피고 등)의 중복 여부 및 상기 신규한 분쟁 특허가 포함된 분쟁들에서 분쟁 당사자별 분쟁의 시계열적 분포 등을 분석한다. 한편, 기 설정된 기간 단위로 신규한 분쟁 특허에 대한 기 설정된 정량 분석 처리할 수 있다. 상기 정량 분석 처리는 권리자별, 분쟁 제기자별, 분쟁 특허별, 피고별, 발명자별, 특허 분류별, 분쟁 제기 일자별 정량 분석 정보를 생성하는 것을 포함한다.First, the information analyzed by the newly generated dispute analysis unit 5610 includes a time series distribution of the number and the number of times the new dispute patent caused a dispute, whether a dispute party (defendant, etc.) overlaps, and the new dispute patent is included. Analyze the time series distribution of disputes between parties in disputes. On the other hand, a predetermined quantitative analysis process for a new disputed patent on a predetermined period basis. The quantitative analysis process includes generating quantitative analysis information by right holder, dispute claimant, dispute patent, defendant, inventor, patent classification, and dispute filing date.

상기 신규 발생 분쟁 파급 예측부(5620)는 신규한 분쟁 특허 각각을 자기 특허(SSi)처럼 취급하여 하여, 본 발명의 상기 타겟 특허 집합 생성부(5120)를 이용하여, 상기 자기 특허의 출원일(최선일 또는 우선일 기준이어도 된다.) 기준으로 후출원 특허로 구성되는 TS를 생성한다. 상기 생성된 TS에 대하여 TSi의 권리자 분석을 포함한 기 설정된 정량 분석 처리를 수행한다. 한편, 상기 신규 발생 분쟁 파급 예측부(5620)는 2 이상의 신규한 분쟁 특허들로 이루어진 신규 분쟁 특허 집합을 자기 특허 집합을 구성하고, 상기 자기 특허 집합을 대상으로 상기 타겟 특허 집합 생성부(5120)를 이용하여, 상기 자기 특허 집합의 SSi의 출원일(최선일 또는 우선일 기준이어도 된다.) 기준으로 후출원이 되는 TSi(SSi)를 생성하고, 생성된 TSi(SSi)로 TS를 구성한다. 상기 생성된 TS에 대하여 TSi의 권리자 분석을 포함한 기 설정된 정량 분석 처리를 수행한다. 이때, 상기 타겟 특허 집합을 구성하는 TSi에 대해서는 본 발명의 기 설정된 분쟁 예측 모델값 및 가중치가 부여되어 있음은 물론이다 할 것이며, 상기 분쟁 예측 모델값과 상기 가중치는 본 발명이 정량 분석에 활용됨은 당연하다 할 것이다.The new occurrence dispute spreading unit 5620 treats each new disputed patent as a self patent (SSi), and uses the target patent set generation unit 5120 of the present invention to apply for the own patent application (the best date). Or a priority date.) A TS composed of a post-application patent is generated as a reference. A predetermined quantitative analysis process is performed on the generated TS, including an owner analysis of TSi. On the other hand, the new occurrence dispute spreading unit 5620 constitutes a self-patent set of a new dispute patent set consisting of two or more new disputed patents, and the target patent set generation unit 5120 for the self-patent set. By using, a TSi (SSi) which is a post-application is generated on the basis of the filing date of the SSi of the self-patent set (which may be the best date or the priority date), and the TS is composed of the generated TSi (SSi). A predetermined quantitative analysis process is performed on the generated TS, including an owner analysis of TSi. At this time, the TSi constituting the target patent set is given that the predetermined dispute prediction model value and weight of the present invention are given, and the dispute prediction model value and the weight are used for the quantitative analysis of the present invention. It will be natural.

이어, 사기 분쟁 파급 알림부(5630)를 사용하여 사용자별로 alert 서비스를 제공하는 방법에 대해서 설명한다. 사용자는 적어도 하나 이상의 SS를 설정할 수 있으며, 자신과 관련된 경쟁사 등의 권리자, 자신의 관심 특허, 자신의 관심 기술군(특허 분류로 등록) 등을 등록해 놓을 수 있다. 이때, 신규 특허 데이터의 발생 또는 권리자 변동 등과 같이 전체 특허 데이터에서 변동이 일어나거나, 새로운 트롤의 추가, 표준 특허풀의 추가 등이 발생하거나, 주기적으로 분쟁 발생 특허가 추가되는 경우, 상기 SS마다의 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 함수값이 변동되게 된다. 이때, 특정 시점(t=t1)에서의 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 함수값과 다른 시점(t=t2)에서의 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 함수값에서 기 설정된 값 이상이 변동되는 경우, 상기 시스템은 상기 사용자에게 변동의 발생 등에 대한 정보를 제공해 줄 수 있게 된다. 예를 들어, 최근 1달간에 특정 트롤이 그 트롤의 자회사로 적어도 하나 이상의 타인으로부터 n개의 특허를 매입(current assignee 정보로 알 수 있음)하는 event가 발생한 경우, 그 n개의 특허의 권리자 변동에 따라 현시점(t=t2)에서의 SS에 대한 상기 f(TS) 또는 f(TSui)별로 분쟁 예측 함수값에서 기 설정된 값 이상이 변동이 발생하여, 사용자에게 특허 분쟁 위험성이 현격하게 증가할 수도 있게 된다. 물론, 상기 시스템은 n개의 특허가 있을 때, 상기 특허에 대한 기 설정된 정량 분석(권리자 분석, 기술 분야 분석, 시계열 분석 등)을 수행해 줄 수 있음은 물론일 것이며, n개의 특허가 인용하는 parent 특허 집합을 입수해 줄 수 있을 것이며, parent 특허 집합에 대해서도 기 설정된 정량 분석을 수행해 줄 수 있을 것이다.Next, a method of providing an alert service for each user using the fraud dispute spreading notification unit 5630 will be described. The user may set at least one SS, and may register rights holders, such as competitors associated with themselves, own interest patents, own interest technology groups (registered in a patent classification), and the like. In this case, when a change occurs in the entire patent data such as the generation of new patent data or a change in the owner, or when a new troll is added, a standard patent pool is added, or a dispute-prone patent is added periodically, The dispute prediction function value is changed for each of f (TS) or f (TSui). At this time, the dispute prediction function for each f (TS) or f (TSui) at a specific time point (t = t1) and the dispute prediction function for each f (TS) or f (TSui) at a different time point (t = t2) If the value is greater than or equal to the preset value, the system can provide information about the occurrence of the change to the user. For example, in the last month, when a troll has an event where a troll is a subsidiary of the troll and has purchased n patents from at least one other person (which can be identified by current assignee information), More than a predetermined value is changed in the dispute prediction function value for each f (TS) or f (TSui) for SS at the present time (t = t2), so that the risk of patent disputes may increase significantly for the user. . Of course, the system may perform a predetermined quantitative analysis (author analysis, technical field analysis, time series analysis, etc.) for the patent when there are n patents, and parent patents cited by the n patents A set can be obtained, and a predetermined quantitative analysis can be performed for the parent patent set.

이어, 도 48을 참조하면서, 본 발명의 특허 분쟁 예측 정보 생성 시스템(5000)을 활용하여 리스크 헤징 예측 정보 생성에 대해 설명한다. 상기 리스크 헤징 예측 정보의 생성은 본 발명의 리스크 헤징 정보 생성부가 담당한다. 도 48에는 상기 리스크 헤징 정보 생성부의 정보 처리 방법의 일 실시예적 구성이 나와 있다. 상기 리스크 헤징 정보 생성부는 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 보완 특허 집합을 입수하거나, 적어도 하나 이상의 특허를 포함하는 보완 특허 집합을 먼저 입수하고 보완 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수(SL141)하고, 상기 보완 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수(SL142)하고, 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 리스크 헤징 예측 정보를 생성(SL143)한다. 이하, 더욱 더 상세하게 설명한다.Next, with reference to FIG. 48, the risk hedging prediction information generation is demonstrated using the patent dispute prediction information generation system 5000 of this invention. The risk hedging information generation unit of the present invention is responsible for generating the risk hedging prediction information. 48 is a diagram showing the configuration of an information processing method of the risk hedging information generation unit. The risk hedging information generation unit first obtains a target patent set including at least one patent and obtains at least one complementary patent set having a predetermined relationship with the target set, or first obtains a complementary patent set including at least one patent. And acquire at least one or more target patent sets having a predetermined relation with the complementary set (SL141), obtain at least one dispute prediction model value for each patent for each individual patent constituting the supplementary patent set (SL142), and for each patent. At least one risk hedging prediction information is generated using the dispute prediction model value (SL143). It will be described in more detail below.

리스크 헤징 예측 정보에는 상기 SS와 관련된 TS가 있을 때, 상기 TS 전체 또는 분할된 적어도 하나 이상의 분할 TS에 대하여 특허 공격을 예상할 수 있는 보완 특허 집합의 일종인 리스크 헤징 특허 집합(Risk Hedging Set, RHS)를 생성하는 것과 RHS 중 이들 중 SS 또는 SS의 생성자가 지정하는 기 설정된 관계를 가지는 보완 특허 집합의 일종인 크로스 라이센싱 특허 집합(Cross Licensing Set, CLS)를 생성하는 것이 있다. 먼저 상기 리스크 헤징 정보 생성부가 RHS를 생성하는 방법에 대해서 설명한다. 상기 리스크 헤징 정보 생성부는 상기 TS 전체 또는 적어도 하나 이사의 분할 TS에 대한 사용자 또는 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 선택 정보를 입수한다. 상기 분할 TS는 상기 TS 중에서 권리자별 또는 특정 속성을 가지는 권리자별(트롤, 경쟁사, 분쟁 제기자, 분쟁 경험자 등)로 된 분할 TS일 수 있으며, 이들 중에서 분할 TS를 대상으로 특히 상기 wf(Sg(TSi), W(TSi))가 높은 분할 TSi를 다수 포함하고 있는 분할 TS나, 분할 TS를 대상으로 상기 f(TS)가 높은 분할 TS일 수 있다.In the risk hedging prediction information, when there is a TS related to the SS, a risk hedging set (RHS) is a kind of complementary patent set that can predict a patent attack on the whole TS or at least one divided TS. ) And cross-licensing set (CLS), which is a kind of complementary patent set having a predetermined relationship designated by SS or SS generator among them. First, the method of generating the RHS by the risk hedging information generation unit will be described. The risk hedging information generation unit obtains selection information of the user or the patent dispute prediction information generation system 5000 for the whole TS or the divided TS of at least one director. The divided TS may be divided TSs according to the right holders or the right holders having a specific attribute (troll, competitor, dispute claimant, dispute experienced person, etc.) among the TSs, and among them, the wf (Sg ( TSi) and W (TSi)) may be divided TSs containing a large number of divided TSi, or divided TSs having a high f (TS) for the divided TSs.

상기 리스크 헤징 정보 생성부는 상기 TS 또는 분할 TS(이하, 재투입 TS라 한다.)를 SS처럼 취급할 수 있다. 상기 재투입 TS를 마치 자기 특허 집합처럼 처리하면, 상기 타겟 특허 집합 생성부(5120)는 상기 재투입 TS에 대한 타겟 특허 집합인 TS(재투입 TS)를 생성하게 된다. 상기 분쟁 예측 요소값 생성부(5510)는 상기 TS(재투입 TS)를 구성하는 타겟 특허인 TSi(재투입 TS)에 대하여, 상기 표 1 내지 표 4의 분쟁 예측 요소별 분쟁 예측 요소값을 생성한다. 상기 분쟁 예측 모델값 입수부(5130)는 상기 생성된 상기 TSi(재투입 TS)의 분쟁 예측 요소값에 대한 분쟁 예측 모델값을 입수한다. 상기 분쟁 예측 정보 생성부(5140)는 상기 TSi(재투입 TS)에 대한 기 설정된 일체의 정보를 생성한다. 상기 분쟁 예측 정보 생성부(5140)가 생성하는 정보는 wf(Sg(TSi(재투입 TS)), W(TSi(재투입 TS))) 또는 f(TS(재투입 TS))일 수 있다.The risk hedging information generation unit may treat the TS or the divided TS (hereinafter referred to as re-insert TS) as SS. When the re-entry TS is treated like a self-patent set, the target patent set generating unit 5120 generates a TS (re-entry TS) which is a target patent set for the re-entry TS. The dispute prediction element value generator 5510 generates a dispute prediction element value for each dispute prediction element of Tables 1 to 4 with respect to TSi (re-entry TS), which is a target patent constituting the TS (re-entry TS). do. The dispute prediction model value acquisition unit 5130 obtains a dispute prediction model value for the generated dispute prediction element value of the TSi (re-entry TS). The dispute prediction information generation unit 5140 generates all predetermined information about the TSi (re-entry TS). The information generated by the dispute prediction information generator 5140 may be wf (Sg (TSi (re-input TS)), W (TSi (re-intake TS))) or f (TS (re-intake TS)).

상기 리스크 헤징 정보 생성부는 TSi(재투입 TS) 중 wf(Sg(TSi(재투입 TS)), W(TSi(재투입 TS)))가 기 설정된 수준 이상이 되는 특허나, wf(Sg(TSi(재투입 TS)), W(TSi(재투입 TS)))가 기 설정된 수준 이상이 되면서 그 출원인이 개인, 중소기업, 대학, 기관이 되거나, 기 설정된 리스크 헤징 속성을 가지는 권리자의 특허인 보완 특허 후보군 내지는 보완 특허군을 선별한다. 리스크 헤징 속성을 SS 또는 SS의 생성자의 입장에서 리스크를 헤징할 수 있는 속성으로, 자신을 공격할 가능성이 있는 자를 반격할 수 있거나 기타 특허 분쟁 리스크를 줄일 수 있는 속성을 말하며, 상기 속성이 강할수록 강한 보완 특허가 된다. 만약 SS에 특허 분쟁 위험이 높은 특정 권리자의 특허로 구성되는 TS가 특정한 특허군 A로부터 분쟁 위험 가능성이 높다면, 상기 특정한 특허군 A는 SS의 입장에서는 TS로부터의 리스크를 헤징할 가능성이 높은 보완 특허군이 된다. 따라서, 상기와 같은 보완 특허군 A를 발견하는 것이 상기 리스크 헤징 정보 생성부가 수행하는 정보 처리이다.The risk hedging information generation unit has a patent in which wf (Sg (TSi (re-input TS)), W (TSi (re-intake TS))) of TSi (re-input TS) is equal to or higher than a predetermined level, or wf (Sg (TSi). Complementary patents that are patents of right holders whose (re-entry TS)), W (TSi (re-entry TS))) are above the predetermined level and the applicant becomes an individual, SME, university, institution, or has a predetermined risk hedging attribute Candidate or complementary patent groups are selected. The risk hedging attribute is an attribute that can hedge risk from the point of view of the SS or the creator of the SS. The risk hedging attribute refers to an attribute capable of countering a potential attacker or reducing the risk of other patent disputes. It becomes a strong complementary patent. If a TS composed of patents of a specific right holder with a high risk of patent disputes in the SS has a high risk of dispute from a particular patent group A, the particular patent group A is a supplement that is likely to hedge the risk from the TS for the SS. It becomes a patent group. Therefore, finding the supplemental patent group A as described above is information processing performed by the risk hedging information generating unit.

상기 분할 TS는 상기 특허 집합 분할부(5310)에서 수행한다. 상기 특정 권리자의 분할 TS가 있는 경우, 상기 리스크 헤징 정보 생성부는 상기 특허 집합 분할부(5310)로 하여금 상기 특정 권리자의 분할 TS에 대하여 다양한 분할 정책에 따른 적어도 하나 이상의 재분할 TS를 생성하도록 제어한다. 상기 재분할 TS별로 상기 리스크 헤징 정보 생성부는 TSi(재분할 TS) 중 wf(Sg(TSi(재분할 TS)), W(TSi(재분할 TS)))가 기 설정된 수준 이상이 되는 특허나, wf(Sg(TSi(재분할 TS)), W(TSi(재분할 TS)))가 기 설정된 수준 이상이 되면서 그 출원인이 개인, 중소기업, 대학, 기관이 되거나, 기 설정된 리스크 헤징 속성을 가지는 권리자의 특허를 선별한다. 상기 리스크 헤징 정보 생성부가 여러 개의 재분할 TS를 생성하고, 생성된 재분할 TS에 대한 특허군 A를 발견하는 것은 일종의 시뮬레이션일 수 있다. 한편, 상기 재분할은 상기 특정 권리자의 분할 TS를 구성하는 분할 TSi에 대하여 클러스터링을 적용하여, n개의 클러스터링 된 재분할 TS를 구성하거나, 상기 분할 TSi를 특허 분류별로 나누어 여러 개의 재분할 TS를 생성할 수 있을 것이다. 상기 재분할 TS별로 상기 리스크 헤징 정보 생성부가 정보 처리하는 것은 분할 TS의 권리자에 대한 반격을 고려할 때, 상기 분할 TS라는 넓은 범위를 대상으로 한 반격 가능한 특허군 A를 찾을 수도 있지만, 여러 개의 재분할 TS라는 좁은 범위를 대상으로 한 특화되면서도 강력한 반격 가능한 특허군 B를 찾을 수도 있기 때문이다.The division TS is performed by the patent set division unit 5310. If there is a divided TS of the specific rights holder, the risk hedging information generation unit controls the patent set dividing unit 5310 to generate at least one redivided TS according to various division policies for the divided TS of the specific rights holder. The risk hedging information generation unit for each of the repartitioned TSs includes a patent in which wf (Sg (TSi (repartitioned TS)) and W (TSi (repartitioned TS))) of TSi (repartitioned TS) becomes a predetermined level or higher, or wf (Sg ( As TSi (Redivision TS), W (TSi (Redivision TS)) is above the predetermined level, the applicant becomes an individual, a small business, a university, an institution, or selects a patent of a right holder having a predetermined risk hedging attribute. It may be a kind of simulation that the risk hedging information generation unit generates a plurality of repartitioned TSs and discovers the patent group A for the generated repartitioned TSs. On the other hand, the redistribution may apply clustering to the divided TSi constituting the divided TS of the specific rights holder, to configure n clustered redistributed TSs, or to divide the divided TSi by patent classification to generate a plurality of redistributed TSs. will be. The information processing by the risk hedging information generation unit for each of the repartitioned TS may find counterattackable patent group A targeting a wide range of the divided TS, in consideration of the counterattack against the owner of the divided TS. This is because they can find a specialized but powerful counterattack group B that targets a narrow range.

이어, 도 49를 참조하면서, 본 발명의 크로스 라이센싱 정보 생성부(5340)의 정보 처리 방법에 대해서 설명한다. 상기 크로스 라이센싱 정보 생성부(5340)는 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 보완 특허 집합을 입수하거나, 적어도 하나 이상의 특허를 포함하는 보완 특허 집합을 먼저 입수하고 보완 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수(SL151)한 다음, 상기 보완 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 분쟁 예측 모델값을 입수(SL152)하고, 특허별 분쟁 예측 모델값을 활용하여 적어도 하나 이상의 크로스 라이센싱 예측 정보를 생성(SL153)한다. 이하, 더욱 더 상세히 설명한다. Next, with reference to FIG. 49, the information processing method of the cross licensing information generation part 5340 of this invention is demonstrated. The cross-licensing information generation unit 5340 first obtains a target patent set including at least one or more patents, and obtains at least one complementary patent set having a predetermined relationship with the target set, or includes a complementary patent including at least one patent A set is first obtained, and at least one target patent set having a predetermined relationship with the complementary set is obtained (SL151), and then, at least one dispute prediction model value for each patent is obtained for the individual patents constituting the supplementary patent set (SL152). In operation S153, at least one cross-licensing prediction information is generated by using a dispute prediction model value for each patent. It will be described in more detail below.

상기 크로스 라이센싱 특허군은 본 발명의 크로스 라이센싱 정보 생성부(5340)가 생성한다. 상기 크로스 라이센싱 정보 생성부(5340)는 상기 분할 TS라는 넓은 범위를 대상으로 한 반격 가능한 특허군 A나 여러 개의 재분할 TS라는 좁은 범위를 대상으로 한 특화되면서도 강력한 반격 가능한 특허군 B 중에서 상기 SS에 포함되거나, SS를 생성하는 자가 지정하는 권리자(가장 대표적인 것이 자사, 계열사, 하청사, 원청사나 협력사, 특별한 관계를 맺고 있는 개인, 기관, 대학 등이 될 수 있다.)의 특허를 추출한다. 한편, 상기 크로스 라이센싱 정보 생성부(5340)는 상기 분할 TS에 대하여, TS(분할 TS)의 생성 시, 상기 SS 나, SS를 생성하는 자가 지정하는 권리자(가장 대표적인 것이 자사, 계열사, 하청사, 원청사나 협력사, 특별한 관계를 맺고 있는 개인, 기관, 대학 등이 될 수 있다.)의 특허를 한정하여 자신이 원하는 속성을 가지는 특허만을 대상으로 한정하여 상기 타겟 특허 집합 생성부(5120)를 통하여 TS(분할 TS)를 생성할 수 있다. 한편, 상기 크로스 라이센싱 정보 생성부(5340)는 상기 분쟁 예측 정보 생성부(5140)를 제어하여 상기 TS(분할 TS)를 구성하는 개별 특허인 TSi(분할 TS)에 대하여 wf(Sg(TSi(분할 TS)), W(TSi(분할 TS)))나, f(TS(분할 TS))를 생성하도록 한다. 통상적으로 SS와 TS(분할 TS)와는 일치하는 특허가 없을 가능성이 크나, SS를 구성하는 특허가 다수 있고, SS를 구성하는 특허들의 출원일이 넓게 분포되어 있으며, 일치하는 특허가 발생할 가능성도 배제하지 못한다.The cross licensing patent group is generated by the cross licensing information generation unit 5340 of the present invention. The cross-licensing information generation unit 5340 is included in the SS from the patent group A that can counterattack a wide range of the divided TS or the patent group B that is capable of counterattacking a narrow range of several subdivision TSs. Or extract the patent of the right holder (the most representative one can be company, subsidiary, subcontractor, original government or subcontractor, individual, institution, university, etc.) with special relationship. On the other hand, the cross-licensing information generation unit 5340 may be assigned to the divided TS, the SS (or the subsidiary, subcontractors, etc.) designated by the person generating the SS when the TS (divisional TS) is generated And the patent of the original government office, a partner company, an individual, an institution, a university, etc. having a special relationship. Only a patent having a desired attribute can be applied to the target patent set generation unit 5120. (Split TS) can be generated. On the other hand, the cross-licensing information generation unit 5340 controls the dispute prediction information generation unit 5140 with respect to wf (Sg (TSi (divisional division)) for TSi (divisional TS) which is an individual patent constituting the TS (divisional TS). TS)), W (TSi (split TS)) or f (TS (split TS)). In general, there is a high possibility that there is no patent matching the SS and the TS (split TS), but there are many patents constituting the SS, and a wide range of filing dates of the patents constituting the SS are not excluded. can not do it.

이어, 본 발명의 특허 분쟁 예측 정보 생성 시스템(5000)을 구성하는 각 엔진과 DB 및 기능부를 활용하는 응용 시스템의 구성 및 그 응용 시스템의 정보 처리 방법에 대해서 설명한다. 대표적인 응용 시스템으로 특허 라이센싱 예측 모델 생성 시스템이 있다. 도 50에는 특허 라이센싱 예측 정보 생성 시스템(6000)의 일 실시예적 구성이 제시되어 있으며, 도 51에는 특허 라이센싱 예측 정보 생성 시스템(6000)의 정보 처리 방법에 대한 일 실시예적 구성이 제시되어 있다.Next, a description will be given of a configuration of an application system utilizing each engine, a DB, and a functional unit constituting the patent dispute prediction information generation system 5000 of the present invention, and an information processing method of the application system. A representative application system is a patent licensing prediction model generation system. 50 illustrates an exemplary configuration of a patent licensing prediction information generating system 6000, and FIG. 51 illustrates an exemplary configuration of an information processing method of the patent licensing prediction information generating system 6000.

먼저 특허 라이센싱 예측 정보 생성 시스템(6000)부터 설명한다. 특허 분쟁이 발생하는 경우, 라이센싱으로 분쟁이 종료되는 경우가 많으며, 라이센싱 협상이 실패한 경우, 분쟁으로 비화되는 경우도 많다. 따라서, 특허 분쟁은 라이센싱과 동전의 양면 관계에 있다. 따라서, 특허 분쟁 예측 정보 생성 시스템(5000)의 기능을 대부분 그대로 활용하면, 특허 라이센싱 예측 정보 생성 시스템(6000)을 구성할 수 있게 되며, 대부분의 기능 모듈은 특허 라이센싱 예측 정보 생성 시스템(6000)의 경우와 특허 분쟁 예측 정보 생성 시스템(5000)의 경우에서 동일하게 작동한다. 다만, 특허 라이센싱 예측 정보 생성 시스템(6000)의 경우에는 특허 분쟁 예측 정보 생성 시스템(5000)과는 달리 상기 타겟 특허 집합 생성부(5120) 는 기 설정된 관련성을 가진 유사 특허군을 생성할 때, 자기 특허 집합의 SSi보다 후출원인 TSi를 대상으로 타겟 특허 집합을 생성한다는 큰 차이가 있다.First, the patent licensing prediction information generating system 6000 will be described. When a patent dispute occurs, the dispute is often terminated by licensing, and when a licensing negotiation fails, it is often turned into a dispute. Thus, patent disputes are a two-sided relationship between licensing and coins. Therefore, if most of the functions of the patent dispute prediction information generation system 5000 are utilized as it is, the patent licensing prediction information generation system 6000 can be configured, and most of the functional modules of the patent licensing prediction information generation system 6000 are used. The same works in the case of the patent dispute prediction information generation system 5000. However, in the case of the patent licensing prediction information generation system 6000, unlike the patent dispute prediction information generation system 5000, when the target patent set generation unit 5120 generates a similar patent group having a predetermined relationship, There is a big difference that a target patent set is generated for TSi, which is a later application than SSi of a patent set.

특허 분쟁 예측 정보 생성 시스템(5000)과 특허 라이센싱 예측 정보 생성 시스템(6000)의 관계에서 분쟁 예측 요소는 라이센싱 예측 요소에 대응되며, 분쟁 예측 모델은 라이센싱 예측 모델에 대응될 수 있으며, 분쟁 DB부(5200)는 라이센싱 DB부(6200), 분쟁 예측 정보 분석 엔진(5300)은 라이센싱 예측 정보 분석 엔진(6300), 분쟁 예측 관리부(5400)는 라이센싱 예측 관리부(6400), 분쟁 예측 모델 생성 엔진(5500)은 라이센싱 예측 모델 생성 엔진(6500)에 대응되며, 분쟁 예측 엔진(5100)은 라이센싱 예측 엔진(6100)에, 분쟁 예측 모델값 입수부(5130)는 라이센싱 예측 모델값 입수부(6130), 분쟁 예측 정보 생성부(5140)는 라이센싱 예측 정보 생성부(6140), 분쟁 예측 정보값 생성 모듈(5142)은 라이센싱 예측 정보값 생성 모듈(6142), 분쟁 예측 정보값 제공부(5150)는 라이센싱 예측 정보값 제공부(6150), 분쟁 예측 요소값 DB(5220)는 라이센싱 예측 요소값 DB(6220), 분쟁 예측 모델값 DB(5230)는 라이센싱 예측 모델값 DB(6230), 분쟁 예측 시스템 관리부(5420)는 라이센싱 예측 시스템 관리부(6420), 분쟁 예측 정보 배치 생성부(5421)는 라이센싱 예측 정보 배치 생성부(6421), 분쟁 예측 사용자 관리부(5430)는 라이센싱 예측 사용자 관리부(6430), 분쟁 UI부(5431)는 라이센싱 UI부(6431)로, 분쟁 예측 요소값 생성부(5510)는 라이센싱 예측 요소값 생성부(6510), 분쟁 예측 모델 생성부(5520)는 라이센싱 예측 모델 생성부(6520), 특허별 분쟁 예측 모델값 생성부(5530)는 특허별 라이센싱 예측 모델값 생성부, 분쟁 예측 모델값 제공부(5540)는 라이센싱 예측 모델값 제공부(6540), 공격 예상 정보 생성부는 라이센싱 예상 정보 생성부가 대응되므로, 특허 라이센싱 예측 정보 생성 시스템(6000)의 구성 및 정보 처리 방법에 대한 설명은 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 설명에서 상기의 대응되는 용어로 변경하는 것으로 당업자에게 충분할 것이다. In the relationship between the patent dispute prediction information generation system 5000 and the patent licensing prediction information generation system 6000, the dispute prediction element may correspond to the licensing prediction element, and the dispute prediction model may correspond to the licensing prediction model, and the dispute DB unit ( 5200 is a licensing DB unit 6200, a dispute prediction information analysis engine 5300 is a licensing prediction information analysis engine 6300, a dispute prediction management unit 5400 is a licensing prediction management unit 6400, a dispute prediction model generation engine 5500 Corresponds to the licensing prediction model generation engine 6500, the dispute prediction engine 5100 is a licensing prediction engine 6100, the dispute prediction model value acquisition unit 5130 is a licensing prediction model value acquisition unit 6130, dispute prediction The information generation unit 5140 is a licensing prediction information generation unit 6140, the dispute prediction information value generation module 5152 is a licensing prediction information value generation module 6162, and the dispute prediction information value providing unit 5150 is licensing prediction information. The value providing unit 6150, the dispute prediction element value DB 5220, the licensing prediction element value DB 6220, the dispute prediction model value DB 5230, the licensing prediction model value DB 6230, and the dispute prediction system manager 5520. The licensing prediction system manager 6620, the dispute prediction information batch generator 5241, the licensing prediction information batch generator 6421, and the dispute prediction user manager 5430 are the licensing prediction user manager 6630, and the dispute UI unit 5431. ) Is a licensing UI unit 6431, the dispute prediction element value generator 5510 is a licensing prediction element value generator 6510, and the dispute prediction model generator 5520 is a licensing prediction model generator 6520, by patent. The dispute prediction model value generating unit 5530 corresponds to a licensing prediction model value generation unit for each patent, the dispute prediction model value providing unit 5540 corresponds to the licensing prediction model value providing unit 6540, and the attack prediction information generation unit corresponds to the licensing prediction information generation unit. Patent licensing prediction information Description of the configuration and the information processing method of the system (6000) is sufficient to one of ordinary skill in the art that changes in the description of the patent conflict prediction information generating system (5000) with the corresponding terms are described above.

한편, 특허 라이센싱 예측 정보 생성 시스템(6000)에서도 특허 분쟁 예측 정보 생성 시스템(5000)의 분쟁 발생 특허 DB (5210)(6210), 분쟁 데이터 입수부(5410)(6410)는 동일하게 사용되며, 자기 특허 집합 생성부(5110)(6110), 타겟 특허 집합 생성부(5120)(6120), 인용 특허 집합 생성부 (5121)(6121), 유사 특허군 생성부(5122)(6122), 유사 기술 특허군 생성부(5123)(6123), 타겟 집합 입수부(5124)(6124), 옵션 처리부(6125)(5125), 다중 관계성 처리 모듈(5141)(6141), 가중치 조절부(5143)(6143)도 동일하게 사용된다. 특히, 상기 특허 라이센싱 예측 정보 생성 시스템(6000)이 라이센싱 예측 요소와 라이센싱 예측 모델을 생성할 때는 분쟁 발생 특허와 비분쟁 발생 특허를 사용하여 생성하므로, 분쟁 발생 특허 DB (5210)(6210)와 분쟁 데이터 입수부(5410)(6410)는 특허 분쟁 예측 정보 생성 시스템(5000)과 동일한 명칭으로 유지되게 된다. 본 문단에서는 특허 라이센싱 예측 정보 생성 시스템(6000)과 특허 분쟁 예측 정보 생성 시스템(5000)에서 명칭 변경 없이 동일하게 사용되는 구성 요소들에 대해서는 도면 부호를 병기하였다.Meanwhile, in the patent licensing prediction information generation system 6000, the dispute occurrence patent DB 5210 and 6210 and the dispute data acquisition unit 5410 and 6410 of the patent dispute prediction information generation system 5000 are used in the same manner. Patent set generation unit 5110 and 6110, target patent set generation unit 5120 and 6120, cited patent set generation unit 5121 and 6121, similar patent group generation unit 5122 and 6122, and similar technology patents Group generation unit 5123 (6123), target set acquisition unit (5124) (6124), option processing unit (6125) (5125), multiple relationship processing module (5141) (6141), weight adjusting unit (5143) (6143) ) Is used equally. In particular, when the patent licensing prediction information generation system 6000 generates a licensing prediction element and a licensing prediction model, the patent licensing prediction information generation system 6000 generates a dispute using a dispute-prone patent and a non-dispute-prone patent, and thus disputes with the dispute-prone patent DB 5210 and 6210. The data acquisition units 5410 and 6410 are maintained under the same names as the patent dispute prediction information generation system 5000. In this paragraph, the same reference numerals are used for the components used in the patent licensing prediction information generating system 6000 and the patent dispute prediction information generating system 5000 without changing their names.

도 51에서 제시되는 바와 같이, 상기 특허 라이센싱 예측 정보 생성 시스템(6000)은 적어도 하나 이상의 특허를 포함하는 자기 특허 집합을 먼저 입수하고 자기 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 타겟 특허 집합을 입수하거나, 적어도 하나 이상의 특허를 포함하는 타겟 특허 집합을 먼저 입수하고, 타겟 집합과 기 설정된 관련성을 가지는 적어도 하나 이상의 자기 특허 집합을 입수(SL161)하고, 상기 타겟 특허 집합을 구성하는 개별 특허에 대하여 적어도 하나 이상의 특허별 라이센싱 예측 모델값을 입수(SL16)하고, 특허별 라이센싱 예측 모델값을 활용하여 적어도 하나 이상의 라이센싱 예측 정보를 생성(SL163)한다. 상기 상기 특허 라이센싱 예측 정보 생성 시스템(6000)의 라이센싱 예측 정보의 생성은 상기 특허 분쟁 예측 정보 생성 시스템(5000)의 분쟁 예측 정보의 생성 방법을 준용하며, 준용되는 개별적인 라이센싱 예측 정보 생성 방법은 상기에서 상술한 분쟁 예측 정보의 생성 방법을 통하여 당업자에게 자명할 것이므로, 상세한 설명은 생략한다.As shown in FIG. 51, the patent licensing prediction information generating system 6000 first obtains a self-patent set including at least one or more patents, and obtains at least one or more target patent sets having a predetermined association with the self-set, First, obtain a target patent set including at least one patent, obtain at least one self patent set having a predetermined relationship with the target set (SL161), and at least one or more patents for individual patents constituting the target patent set. Each license prediction model value is obtained (SL16), and at least one license prediction information is generated (SL163) using the license prediction model value for each patent. The generation of the licensing prediction information of the patent licensing prediction information generating system 6000 applies the method of generating dispute prediction information of the patent dispute prediction information generating system 5000, and the individual licensing prediction information generating method applied mutatis mutandis is described above. Since it will be apparent to those skilled in the art through the above-described method of generating dispute prediction information, a detailed description thereof will be omitted.

이어, 상기 특허 분쟁 정보 처리의 핵심이 되는 요소 기술 중의 하나인 집단 단위 인용 분석 정보 생성 방법에 대하여 도면을 참조하면서 상술한다.Next, the method of generating the group unit citation analysis information which is one of the element technologies which are the core of the patent dispute information processing will be described in detail with reference to the drawings.

도 6에는 본 발명의 집합 단위 인용 분석(4500)에 대한 일 실시예적 구성이 제시되어 있다. 상기 집합 단위 인용 분석(4500)는 적어도 2 이상의 복수 개의 특허 정보로 구성되는 특허 집합을 입수하여 입수 특허 집합을 생성하는 입수 특허 집합 생성부(4510)와 상기 입수된 자기 특허 집합에 포함된 개별 특허를 처리하여 적어도 하나 이상의 인용 특허 집합을 생성하는 대상 특허 집합 생성부(4520)와 상기 인용 특허 집합에 대하여 기 설정된 적어도 하나 이상의 인용 분석 목적별에 따른 인용 분석을 처리하는 인용 분석부(4530);를 포함하고 있다.6 illustrates one exemplary configuration for an aggregated unit citation analysis 4500 of the present invention. The aggregated unit citation analysis 4500 acquires a patent set composed of at least two or more pieces of patent information and generates an acquired patent set, and an individual patent included in the obtained self-patent set. A target patent set generation unit 4520 for generating at least one cited patent set by processing a citation analysis unit 4530 for processing a citation analysis based on at least one cited analysis purpose set for the cited patent set; It includes.

입수 특허 집합 생성부(4510)는 특허 집합을 입수하는 특허 집합 입수부(4511)과 입수한 특허 집합을 한정하는 입수 특허 집합 한정부(4512)를 포함하고 있다. 상기 입수 대상 특허 집합 생성부(4520)은 대상 특허 집합을 입수하는 대상 특허 집합 입수부(4522)과 대상 특허 집합을 한정하는 대상 특허 집합 한정부(4521)를 포함하고 있다. 대상 특허 집합 입수부(4522)가 입수하는 특허 집합은 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 인용 발생 입수 특허 집합 등이 있을 수 있다. 입수 대상 특허 집합 생성부(4520)이 특정한 조건을 가하여 대상 특허 집합을 한정하는 경우, 상기 조건을 반영하여, 데이터부(1000)로부터 대상 특허를 입수해 온다. 입수된 대상 특허군으로 상기 인용 특허 집합 생성부(4523)은 인용 특허 집합을 생성한다. 상기 인용 분석부(4530)는 인용 분석 목적을 선택하는 인용 분석 목적 선택부(4531)과 선택된 목적으로 인용 분석을 수행하는 인용 분석 실행부(4532)를 포함하고 있다. 한편, 상기 특허 정보 시스템(10000)은 적어도 하나 이상의 카데고리별로 특허 집합을 생성하고 관리할 수 있으며, 상기 특허 집합의 관리는 본 발명의 시스템 특허 집합 관리부이 수행한다. 상기 시스템 특허 집합 관리부의 하부에는 출원인별로 특정되는 특허 집합을 관리하는 출원인별 특허 집합 관리부와, 특허 분류별로 특정되는 특허 집합을 관리하는 특허분류별 특허 집합 관리부와 기타 분류나 카테고리별로 특정되는 특허 집합을 관리하는 기타 분류별 특허 집합 관리부등이 있다.The acquisition patent set generation unit 4510 includes a patent set acquisition unit 4511 for acquiring a patent set and an acquisition patent set limitation unit 4512 for limiting the acquired patent sets. The acquisition target patent set generation unit 4520 includes a target patent set acquisition unit 4522 for acquiring the target patent set and a target patent set limitation unit 4451 for defining the target patent set. The patent set obtained by the target patent set obtaining unit 4522 may include a forward cited patent set, a back cited patent set, a front self cited patent set, a back self cited patent set, a citation generation obtained patent set, and the like. When the acquisition target patent set generation unit 4520 defines a target patent set by applying specific conditions, the target patent is obtained from the data unit 1000 by reflecting the above conditions. The cited patent set generation unit 4523 generates a cited patent set as the obtained target patent group. The citation analysis unit 4530 includes a citation analysis purpose selection unit 4531 for selecting a citation analysis purpose and a citation analysis execution unit 4452 for performing citation analysis for the selected purpose. Meanwhile, the patent information system 10000 may generate and manage patent sets for at least one or more categories, and the patent set management unit of the present invention manages the patent sets. At the bottom of the system patent set management unit, an applicant-specific patent set management unit managing a patent set specific to each applicant, a patent set management unit by patent classification managing a patent set specified by patent classification, and a patent set specified by other classifications or categories There are other category-specific patent set management departments to manage.

한편, 사용자들이 생성하는 특허 집합은 회원별 특허 집합 관리부가 수행한다.On the other hand, the patent set generated by the user is performed by the member patent set management unit.

특허 데이터부(1100)는 도 2에서 예시되어 있듯이, 특허 명세서 파일부(1110), 특허 DB(1120), 특허 분류 DB(1130) 등을 포함하고 있다. 특허 DB(1120)부는 모든 특허에 대한 서지 사항과 명세서 본문, 도면 등을 필드별로 관리하며, 상기 명세서 본문을 구성하는 각종 필드(제목, 요약, 선행 기술, 특허 청구 범위, 발명의 상세한 설명 등)에서 추출한 핵심 키워드를 포함하고 있다. 한편, 상기 특허들은 상기 특허들에 대한 선행 기술 문건으로 인용 정보를 더 포함하고 있을 수 있다. 미국 특허 data를 예로 들면, 상기 인용 정보는 Reference에 포함된 정보들로, 미국 특허 문건 번호, 외국 특허 문건 번호, 비분쟁 특허 문건에 대한 표시자 등이 포함되어 있다. 한편, 특허청 심사관이나 관련자에 의한 선행 기술 조사 정보, 심사관 의견 제출 시 참증 정보 등도 광의의 인용 정보가 된다. 특정 문건에 전방 인용 정보가 있는 경우, 상기 전방 인용 정보에 포함된 문건의 입장에서 보면 상기 특정 문건은 후방 인용 문건이 된다. 특정 문건을 기준으로 전방 인용 정보에 포함된 문건은 부모(parent) 문건이 되며, 상기 부모 문건을 기준으로 보면 상기 특정 문건은 자식(child) 문건이 된다. 이러한 child-parent 관계에 있는 정보를 DB로 처리하는 것은 당업자에게는 자명한 것으로 상세한 설명은 생략한다.As illustrated in FIG. 2, the patent data unit 1100 includes a patent specification file unit 1110, a patent DB 1120, a patent classification DB 1130, and the like. The patent DB 1120 manages bibliographic matters of all patents, the specification body, and drawings for each field, and various fields constituting the specification body (title, summary, prior art, claims, detailed description of the invention, etc.). Contains key keywords extracted from. Meanwhile, the patents may further include citation information as prior art documents for the patents. For example, in the US patent data, the citation information is information included in the reference, and includes a US patent document number, a foreign patent document number, and an indicator of a non-disputed patent document. On the other hand, prior art research information by examiners or related persons of the Patent Office, and information on the examination when submitting the examiner's opinion are broad citation information. If there is forward citation information in a specific document, the specific document is a backward citation document from the viewpoint of the document included in the forward citation information. The document included in the forward citation information based on the specific document becomes the parent document and the specific document becomes the child document based on the parent document. It is obvious to those skilled in the art to process the information in the child-parent relationship with the DB, and a detailed description thereof will be omitted.

특허 문건의 서지 사항에는 발행 국가 정보, 각종 일자 정보, 각종 번호 정보, 적어도 하나 이상의 권리자 정보, 적어도 하나 이상의 발명자 정보, 적어도 하나 이상의 특허 분류 정보, 적어도 하나 이상의 우선권 정보 등이 포함되어 있다. 일자에는 출원일, 공개일, 등록일 및 기타 일자들이 있다. 각종 번호 정보에는 출원번호, 공개번호, 등록번호, 원출원 번호, 우선권 주장 번호 등이 있다. 권리자 정보는 출원인, 양수인, 특허권자 등이 있으며, 권리자의 변동이 있고, 변동이 관리되는 경우, 양도인과 양수인에 대한 정보 및 최종 권리자 정보가 있을 수 있다. 우선권 정보에는 우선권 주장 번호, 주장일자, 국가 등의 정보가 포함되어 있다. 한편, 분할출원이나 일부 계속 출원, 계속 출원 등이 있는 경우, 원출원 번호, 원출원일자 등의 정보가 부가되어 있다. 또한, 대표도, 제목, 요약, 색인어 등도 서지 사항에 포함되기도 한다. 한편, 가공된 서지 사항으로는 국내 패밀리 정보(분할출원, 변경출원이나 일부계속출원, 계속출원 관계에 있는 특허 출원)이나, 해외 패밀리 정보(조약 우선권 관계로 연관될 수 있는 출원, 국제출원 등)이 있을 수 있다. 한편, 자연어 처리 등을 통해서 특허 명세서 본문의 텍스트를 본문을 구성하는 각 필드별 또는 필드 통합별로 기 설정된 키워드 추출 방식으로 추출한 핵심 키워드 정보가 더 있을 수 있다. 특허 분류 정보는 공통적인 IPC 이외에도 USPC, FT, FI, ECLA 등과 같이 각 국가별로 자국만의 특별한 특허 분류가 있을 수 있다.The bibliography of the patent document includes the issuing country information, various date information, various number information, at least one owner information, at least one inventor information, at least one patent classification information, at least one priority information, and the like. The date includes the filing date, the date of publication, the date of registration, and other dates. The various number information includes an application number, a public number, a registration number, an original number, and a priority claim number. The right holder information includes the applicant, the assignee, the patent owner, etc. If there is a change in the right holder and the change is managed, there may be information about the transferor and the assignee and information about the right holder. Priority information includes information such as priority claim number, date of claim, and country. On the other hand, when there is a split application, partial continuation application, continuation application, etc., information such as an original number, date of the original source is added. In addition, bibliographies may include representations, titles, abstracts, and index words. Meanwhile, the bibliographies processed include domestic family information (such as a split application, a change of application, or a part of a continuation application, a patent application that is related to a continuation application), overseas family information (an application that may be related to a treaty priority relationship, This can be. On the other hand, the text of the patent specification body may be further extracted by a keyword extracting method for each field or combination of fields constituting the main text through natural language processing or the like. In addition to the common IPC, the patent classification information may have a unique patent classification for each country such as USPC, FT, FI, ECLA, etc.

이어, 집합 단위 인용 분석(4500)의 정보 처리 방법을 더욱 더 자세히 설명한다.Next, the information processing method of the aggregated unit citation analysis 4500 will be described in more detail.

상기 입수 특허 집합 생성부(4510)은 적어도 2 이상의 복수 개의 특허 정보로 구성되는 입수 특허 집합을 입수 받고, 상기 대상 특허 집합 생성부(4520)는 상기 입수 특허 문건 집합에 포함된 개별 특허를 처리하여 적어도 하나 이상의 대상 특허 집합을 생성하고, 상기 인용 분석부(4530)는 상기 대상 특허 집합에 대하여 기 설정된 적어도 하나 이상의 인용 분석을 처리한다. The acquisition patent set generation unit 4510 receives an acquisition patent set consisting of at least two or more pieces of patent information, and the target patent set generation unit 4520 processes individual patents included in the acquisition patent document set. Generate at least one target patent set, and the citation analysis unit 4530 processes at least one citation analysis preset for the target patent set.

한편, 도 20에서 알 수 있듯이, 상기 집합 단위 인용 분석(4500)는 입수 특허 집합 한정부(4512) 또는 옵션 선택부(4340)를 통하여 입수 특허 집합 또는 대상 특허 집합에 대한 한정을 수행하고, 상기 인용 분석 목적 선택부(4531)를 통하여 인용 분석 목적에 대한 선택 정보를 입수하고, 상기 인용 특허 집합 생성부(4523)를 통하여 한정된 특허 집합을 대상으로 인용 분석의 대상이 되는 인용 특허 집합 생성하고, 인용 분석 목적 선택부(4531)과 인용 분석 실행부(4532)를 통하여 인용 분석 대상 집합에 포함된 특허들을 대상으로 인용 분석 목적에 따른 분석 정보 생성한다.On the other hand, as can be seen in Figure 20, the set unit citation analysis 4500 is limited to the acquisition patent set or the target patent set through the acquisition patent set limiting unit 4512 or option selection unit 4340, Obtain selection information on the purpose of citation analysis through the citation analysis purpose selection unit 4531, and generate a citation patent set that is the target of citation analysis for a limited patent set through the citation patent set generation unit 4523, The citation analysis purpose selection unit 4531 and the citation analysis execution unit 4532 generate analysis information according to the citation analysis purpose for the patents included in the citation analysis target set.

본 발명의 인용 분석부(4530)는 인용 분석 목적 선택부(4531)를 활용하여 사용자로부터 인용 목적에 대한 선택을 입수 받는다. 인용 분석 목적은 인용 분석 결과에 나타나는 주요 정보가 무엇에 관한 것인지를 나타낸다. 인용 분석 목적의 예시로는 총량, 출원인/권리자, 발명자, 특허 분류, 개별 특허가 있을 수 있다. 상기 인용 분석 실행부(4532)는 인용과 관련된 기 설정된 특허 집합이 있는 경우, 상기 특허 집합에서의 특허 총량, 출원인/권리자별 특허 총량이나 시기별 특허 총량, 특허 분류별 특허 총량이나 시기별 특허 총량 등과 같은 기 설정된 적어도 하나 이상의 정량 분석을 수행할 수 있다. 상기 인용 분석 실행부(4532)에는 정량 분석을 수행하는 정량 분석부를 포함하고 있다. 상기 정량 분석은 필드별로 수행할 수도 있다. 한편, 상기 정량 분석은 시계열 분석을 포함하고 있을 수 있으며, 필드별 시계열 분석도 상기 정량 분석에 포함된다. The citation analysis unit 4530 of the present invention obtains a selection for a citation purpose from a user by using the citation analysis purpose selection unit 4531. The purpose of citation analysis is to indicate what key information appears in the citation analysis results. Examples of citation analysis purposes may include total amount, applicant / owner, inventor, patent classification, and individual patent. When there is a preset patent set related to the citation, the citation analysis execution unit 4532 may include the total patent amount in the patent set, the total amount of patents by applicant or right holder, the total amount of patents by time, the total amount of patents by patent classification or the total amount of patents by time, etc. At least one or more of the same preset quantitative analysis may be performed. The citation analysis execution unit 4532 includes a quantitative analysis unit that performs quantitative analysis. The quantitative analysis may be performed for each field. Meanwhile, the quantitative analysis may include time series analysis, and the time series analysis for each field is also included in the quantitative analysis.

인용 방향은 전방 또는 후방 또는 인용 발생 입수 특허 집합 자체 중 어느 하나가 된다. 한편, 전방 인용 특허 집합이나 후방 인용 특허 집합은 중복을 허용하는 것인 타당하나 특수한 경우에는 중복을 허용하지 않는 선택을 사용자가 할 수 있다. 즉, 입수 특허 집합에 속하는 n개의 특허(I1,,,, ,In)가 동일한 특허 Pi를 인용하고 있는 경우, Pi는 n회의 가중치를 가지는 것으로 취급해야 함이 더 타당하다. 통상적으로 집합 연산(합집합 연산)을 수행하면, 중복되는 원소(Pi들)는 중복이 제거되어 1회로 취급되는데, 인용 분석의 취지상 이렇게 되는 것은 타당하지 않다. 따라서, 각 Pi들은 빈도(frequency) n을 유지하면서 처리되어야 한다.The citation direction can be either forward or backward or the citation generating patent set itself. On the other hand, the forward cited patent set or the back cited patent set is reasonable to allow duplication, but in a special case, the user may select not to allow duplication. That is, when n patents (I1 ,,,,, In) belonging to the acquired patent set refer to the same patent Pi, it is more appropriate that Pi should be treated as having a weight of n times. Typically, when performing a set operation (set operation), overlapping elements (Pi) are treated as one time by eliminating duplication, but for the purpose of citation analysis, this is not valid. Therefore, each Pi must be processed while maintaining frequency n.

한편, 한정이 있거나, 한정이 없는 상기 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 인용 발생 입수 특허 집합 등을 구성하는 특허군에 대하여 rising 분석(증감율 분석)을 수행할 수 있다. rising 분석이란, 개별적인 출원인별, 발명자별, 특허 분류별, 키워드별(키워드 쌍을 포함한다), 개별 문건별 또는 공동출원인별, 공동 발명자별, 특허 분류쌍별, 키워드 쌍별로 빈도 등에 대한 시계열 분석을 수행할 수 있게 된다. 한편, 상기 한정이 있거나, 한정이 없는 상기 전방 인용 특허 집합, 후방 인용 특허 집합, 전방 자기 인용 특허 집합, 후방 자기 인용 특허 집합, 인용 발생 입수 특허 집합 등을 구성하는 특허군에 대하여 신규 진입 분석을 수행할 수 있다. 신규 진입 분석이란, 특정한 cutoff(threshold)인 특정 시점을 정해 놓고, 그 시점 이전에는 없고 그 시점 이후에만 발생한 개별적인 출원인, 발명자, 특허 분류, 키워드, 개별 문건 또는 공동출원인, 공동 발명자, 특허 분류쌍, 키워드쌍을 추출할 수 있을 것이다. 한편, 신규 진입 분석의 일종으로 특정한 cutoff를 기준으로 출원인, 발명자, 특허 분류, 키워드, 개별 문건 또는 공동출원인, 공동 발명자, 특허 분류쌍, 키워드쌍의 빈도의 rising speed가 큰 출원인, 발명자, 특허 분류, 키워드, 개별 문건 또는 공동출원인, 공동 발명자, 특허 분류쌍, 키워드쌍을 추출할 수 있을 것이다.On the other hand, rising analysis (increase / decrease rate analysis) for the patent group constituting the forward-cited patent set, the back-cited patent set, the forward self-cited patent set, the back-self-cited patent set, the citation generation obtained patent set, etc., with or without limitation. ) Can be performed. The rising analysis is time series analysis of individual applicants, inventors, patent classifications, keywords (including keyword pairs), individual documents or co-applicants, joint inventors, patent classification pairs, and frequency by keyword pairs. You can do it. On the other hand, new entry analysis is performed on the patent group constituting the above-mentioned or unrestricted forward cited patent set, back cited patent set, forward self cited patent set, back self cited patent set, cited generation obtained patent set, and the like. Can be done. A new entry analysis is defined by a specific cutoff (threshold) at a specific point in time, and includes individual applicants, inventors, patent classifications, keywords, individual documents or co-applicants, co-inventors, patent classification pairs, You will be able to extract keyword pairs. Meanwhile, as a type of new entry analysis, applicants, inventors, patent classifications, keywords, individual documents or co-applicants, co-inventors, patent classification pairs, and applicants, inventors, and patent classifications having high rising speeds of keyword pairs based on specific cutoffs. , Keywords, individual documents or co-applicants, common inventors, patent classification pairs, keyword pairs.

한편, 인용 분석의 결과에 나오는 모든 수치마다에는 그 수치가 나온 조건이 할당될 수 있으므로(모든 조건은 SQO문으로 한정할 수 있다.) 그 수치에 클릭을 하면, 그 수치에 대응되는 특허 문건들을 호출할 수 있을 것임은 당업자에게 용이할 것이다. 이렇게 되면, 분석 수치에 대응되는 특허 문건을 입수 특허 집합으로 하여 다시 인용 분석을 수행할 수 있게 된다.On the other hand, every number in the results of the citation analysis can be assigned a condition that the number appears in (all conditions can be limited to SQO statements). It will be easy for one skilled in the art to be able to call. In this case, the citation analysis can be performed again using the patent document corresponding to the analysis value as the acquired patent set.

한편, 상기 네트워크 분석부(4700)는 공동 피고간의 연관 관계(association)을 네트워크를 분석할 수 있으며, 상기 데이터 시각화부(4710)는 공동 피고간의 네트워크를 시각화시킬 수 있게 된다. 예를 들어, 특허 분쟁i에 대하여, 피고i와 적어도 하나 이상의 피고j가 공동 피고가 되는 경우나, 분쟁 특허i에 대하여 피고i와 적어도 하나 이상의 피고j가 공동 피고가 되는 경우, 적어도 하나 이상의 특허 분쟁i를 포함하는 특허 분쟁 그룹별 또는 적어도 하나 이상의 분쟁 특허i를 포함하는 분쟁 특허 그룹별로 공동 피고간의 네트워크 분석을 수행할 수 있게 된다. 한편, 상기 네트워크 분석부(4700)는 분쟁 특허i마다 또는 특허 분쟁i마다 원고가 있으므로, 적어도 하나 이상의 특허 분쟁i를 포함하는 특허 분쟁 그룹별 또는 적어도 하나 이상의 분쟁 특허i를 포함하는 분쟁 특허 그룹별로 원고와 공동 피고를 포함하여 네트워크 분석을 수행할 수 있다. 이때, 원고와 피고 간의 관계는 방향성을 가지고, 공동 피고 간의 관계는 방향성이 없이 처리할 수 있게 된다. 이 경우, 특허 분쟁 그룹별 또는 분쟁 특허 그룹별로 네트워크 분석을 수행할 때, 원고의 개수가 가장 많게 되므로, 원고가 중심에 서게 된다. 상기 네트워크 분석부는 임의의 연관 관계를 네트워크 분석을 수행할 수 있다. 상기 네트워크 분석부(4700)가 분석하는 상기 연관 관계는 임의의 특허 집합을 기준으로, 동일한 특허 문건에 포함되는 다수의 핵심 키워드가 있을 경우나, 적어도 2 이상의 특허 분류가 있는 경우나, 적어도 2 이상의 발명자가 있는 경우나, 적어도 2 이상의 출원인이 있는 경우나, 적어도 2 이상의 인용 특허 문건이 있는 경우에도 동일하게 적용될 수 있다. 이와 같이, 단일한 대상에 2 이상의 대상이 동시에 존재하는 경우, 상기 단일한 대상이 n개 있을 때, 동시에 존재하는 상기 2 이상의 대상은 연관 관계를 가지게 되며, 네트워크 분석의 대상이 된다.Meanwhile, the network analyzer 4700 may analyze a network for an association between the joint defendants, and the data visualization unit 4710 may visualize the network between the joint defendants. For example, for patent dispute i, if defendant i and at least one defendant j are joint defendants or if dispute i and at least one defendant j are joint defendants, at least one patent It is possible to perform a network analysis between joint defendants by a patent dispute group including dispute i or by a dispute patent group including at least one dispute patent i. Meanwhile, since the network analysis unit 4700 has a manuscript for each dispute patent i or a patent dispute i, each network dispute unit 4700 includes a patent dispute group including at least one patent dispute i or a dispute patent group including at least one dispute patent i. Network analysis can be performed, including plaintiffs and joint defendants. At this time, the relationship between the plaintiff and the defendant is directional, and the relationship between the co-defendants can be processed without the directivity. In this case, when the network analysis is performed for each patent dispute group or dispute patent group, the number of the manuscripts is the largest, so the manuscript is at the center. The network analyzer may perform network analysis on any relation. The association analyzed by the network analyzer 4700 may be based on any patent set, when there are a plurality of key keywords included in the same patent document, when there are at least two or more patent classifications, or at least two or more patents. The same applies to the case where there is an inventor, at least two applicants, or at least two cited patent documents. As such, when two or more objects exist in a single object at the same time, when there are n single objects, the two or more objects that exist at the same time have an association relationship, and are subject to network analysis.

본 발명은 특허 정보 산업, 특허 정보 분석 산업, 법률 산업, 특허 컨설팅 산업, R&D 등에 광범위하게 활용될 수 있다.The present invention can be widely used in the patent information industry, patent information analysis industry, legal industry, patent consulting industry, R & D and the like.

100 : 사용자 컴퓨터
1000 : 데이터부
10000 : 특허 정보 시스템
1100 : 특허 데이터부
1110 : 특허 명세서 파일부
1120 : 특허 DB
1130 : 특허분류 DB
1140 : 기타 특허 데이터 DB
1200 : 비특허 데이터부
1210 : 비특허 파일부
1220 : 비특허 DB
1230 : 기업정보 DB
1240 : 기타 비특허 데이터 DB
1300 : 핵심 키워드 DB
131-1 : 기술 키워드 공기쌍 DB
131 : 기술 키워드 DB
1310 : 핵심 키워드 메타 데이터 DB
1311 : 키워드 관계성 DB
1312 : 특허 분류별 대표 키워드 DB
132-1 : 제품 키워드 공기쌍 DB
132 : 제품 키워드 DB
133-1 : 키워드 구문 공기쌍 DB
133-2 : 구문 키워드 공기쌍 DB
133-3 : 패턴별 구문 DB
133 : 구문 키워드 DB
1400 : 분류 메타데이터 DB
1700 : 사용자 데이터부
1800 : 기타 DB
1900 : 목적 특화 데이터부
1910 : 출원인 DB
1920 : 규칙 데이터부
1930 : 언어 데이터부
1940 : 표준 특허풀 데이터부
200 : 유무선 네트워크
2000 : 데이터 가공부
20000 : 목적 특화 시스템부
2032 : 인용 분석용 데이터 생성부
2100 : 핵심 키워드 생성부
21000 : 니치 기술 영역 발견 시스템
2110 : 키워드 추출부
2120 : 핵심 키워드 선정부
2130 : 구문 키워드 생성부
2140 : 핵심 키워드 메타 정보 생성부
2150 : 제품 키워드 생성부
2200 : 분류 메타 데이터 생성부
22000 : 특허 자동 평가 시스템
2210 : 분류 대표 핵심 키워드군 생성부
2220 : 분류간 맵핑 정보 생성부
2230 : 핵심 키워드 계층화부
2240 : 특허 자동 분류부
2300 : 목적 특화 데이터 생성부
23000 : 미래 기술 예측 시스템
2310 : 통계 분석용 데이터 생성부
2330 : 발견 분석용 데이터 생성부
2400 : 유사 특허 집합 생성부
24000 : 융합 정보 처리 시스템
2410 : 클러스터링 처리부
2500 : 네트워크 데이터 생성부
2510 : 노드 데이터 생성부
2520 : 에지 데이터 생성부
300 : 연계 시스템
3000 : 특허 정보 서비스 지원부
3100 : 검색 처리부
3110 : 검색 엔진부
3120 : DB 쿼리 처리부
3200 : 회원용 관리부
3210 : 회원 data부
3211 : 회원 문건 DB
3212 : 회원 맵핑 정보 DB
3213 : 회원 과금 DB
3220 : 회원 디렉토리 생성부
3230 : 회원 플래그 생성부
3240 : 회원 맵핑 정보 생성부
3241 : 회원 키워드 맵핑 정보 생성부
3242 : 회원 출원인 맵핑 정보 생성부
3300 : 플랫폼 서비스 제공부
3310 : 웹서비스 제공부
3311 : 웹서비스 계정 관리부
3312 : 웹서비스 인터페이스부
4000 : 특허 분석 정보 생성부
4100 : 분석 대상 입수부
4200 : 특허 분석 처리부
4210 : 분석 지표 DB
4220 : 분석 명령 구문 DB
4300 : 분석 옵션 처리부
4310 : 데이터 한정 옵션 처리부
4310 : 비교 분석부
4320 : What If 분석부
4320 : 표시 옵션 처리부
4330 : 기타 옵션 처리부
4330 : 편입/협의 분석부
4340 : 연관 분석부
4340 : 옵션 선택부
4341-1 : 기간 한정부
4341-2 : 국가 한정부
4341-3 : 출원인 한정부
4341-4 : 발명자 한정부
4341-5 : 특허 분류 한정부
4341-6 : 필드 속성 한정부
4341-7 : 개별 필드값 한정부
4341-8 : 개별 특허 속성값 한정부
4341-9 : 기타 한정부
4350 : Risk/기회 분석부
4360 : 2X2 분석부
4370 : 오픈이노베이션 대상 발견부
4400 : 분석 결과 리포팅부
4410 : 표 생성부
4420 : 차트 생성부
4430 : 다이어그램 생성부
4440 : 리포트 생성부
4500 : 집합 단위 인용 분석부
4510 : 입수 특허 집합 생성부
4511 : 특허 집합 입수부
4512 : 입수 특허 집합 한정부
4520 : 대상 특허 집합 생성부
4521 : 대상 특허 집합 한정부
4522-1 : 인용 특허 집합 입수부
4522-2 : 피인용 특허 집합 입수부
4522 : 대상 특허 집합 입수부
4523 : 인용 특허 집합 생성부
4530 : 인용 분석부
4531 : 인용 분석 목적 선택부
4532 : 인용 분석 실행부
4600 : 분석 결과 리포팅부
4700 : 네트워크 분석부
4710 : 데이터 시각화부
5000 : 특허 분쟁 예측 정보 생성 시스템
5100 : 분쟁 예측 엔진
5110 : 자기 특허 집합 생성부
5120 : 타겟 특허 집합 생성부
5121 : 인용 특허 집합 생성부
5122 : 유사 특허군 생성부
5123 : 유사 기술 특허군 생성부
5124 : 타겟 집합 입수부
5125 : 옵션 처리부
5130 : 분쟁 예측 모델값 입수부
5140 : 분쟁 예측 정보 생성부
5141 : 다중 관계성 처리 모듈
5142 : 분쟁 예측 정보값 생성 모듈
5143 : 가중치 조절부
5150 : 분쟁 예측 정보값 제공부
5200 : 분쟁 DB부
5210 : 분쟁 발생 특허 DB
5220 : 분쟁 예측 요소값 DB
5230 : 분쟁 예측 모델값 DB
5300 : 분쟁 예측 정보 분석 엔진
5310 : 특허 집합 분할부
5320 : 공격 예상군 정보 생성부
5330 : 리스크 헤징 정보 생성부
5340 : 크로스 라이센싱 정보 생성부
5400 : 분쟁 예측 관리부
5410 : 분쟁 데이터 입수부
5420 : 분쟁 예측 시스템 관리부
5421 : 분쟁 예측 정보 배치 생성부
5430 : 분쟁 예측 사용자 관리부
5431 : 분쟁 UI부
5432 : 사용자 입력 특허군 관리부
5500 : 분쟁 예측 모델 생성 엔진
5510 : 분쟁 예측 요소값 생성부
5520 : 분쟁 예측 모델 생성부
5530 : 특허별 분쟁 예측 모델값 생성부
5540 : 분쟁 예측 모델값 제공부
5600 : 분쟁 파급 예측 엔진
5610 : 신규 발생 분쟁 분석부
5620 : 신규 발생 분쟁 파급 예측부
5630 : 분쟁 파급 알림부
6000 : 특허 라이센싱 예측 정보 생성 시스템
6100 : 라이센싱 예측 엔진
6110 : 자기 특허 집합 생성부
6120 : 타겟 특허 집합 생성부
6121 : 인용 특허 집합 생성부
6122 : 유사 특허군 생성부
6123 : 유사 기술 특허군 생성부
6124 : 타겟 집합 입수부
6125 : 옵션 처리부
6130 : 라이센싱 예측 모델값 입수부
6140 : 라이센싱 예측 정보 생성부
6141 : 다중 관계성 처리 모듈
6142 : 라이센싱 예측 정보값 생성 모듈
6143 : 가중치 조절부
6150 : 라이센싱 예측 정보값 제공부
6200 : 라이센싱 DB부
6210 : 분쟁 발생 특허 DB
6220 : 라이센싱 예측 요소값 DB
6230 : 라이센싱 예측 모델값 DB
6300 : 라이센싱 예측 정보 분석 엔진
6310 : 특허 집합 분할부
6400 : 라이센싱 예측 관리부
6410 : 분쟁 데이터 입수부
6420 : 라이센싱 예측 시스템 관리부
6421 : 라이센싱 예측 정보 배치 생성부
6430 : 라이센싱 예측 사용자 관리부
6431 : 라이센싱 UI부
6432 : 사용자 입력 특허군 관리부
6500 : 라이센싱 예측 모델 생성 엔진
6510 : 라이센싱 예측 요소값 생성부
6520 : 라이센싱 예측 모델 생성부
6530 : 특허별 라이센싱 예측 모델값 생성부
6540 : 라이센싱 예측 모델값 제공부
8400 : 전자상거래 서비스 제공부
8410 : 전자상거래 대상 DB
8420 : 전자상거래 처리부
8430 : 오픈 마켓 생성부
8500 : 커뮤니티 서비스 제공부
8600 : 과금 처리부
8700 : 댓글 처리부
8710 : 댓글 수신부
8720 : 댓글 저장부
8730 : 동일성 판단부
8740 : 댓글 전달부
8800 : 번역 처리부100: user computer
1000: data section
10000: Patent Information System
1100: Patent data part
1110: Patent specification file portion
1120: Patent DB
1130: Patent Classification DB
1140: Other Patent Data DB
1200: non-patent data portion
1210: non-patent file portion
1220: Non-patent DB
1230: Company Information DB
1240: other non-patent data DB
1300: Key Keyword DB
131-1: Technical Keyword Air Pair DB
131: Technical Keyword DB
1310: Core Keyword Metadata DB
1311: Keyword Relationship DB
1312: representative keyword DB by patent classification
132-1: Product Keyword Air Pair DB
132: Product Keyword DB
133-1: Keyword Syntax Air Pair DB
133-2: Syntax Keyword Air Pair DB
133-3: Syntax DB by Pattern
133: syntax keyword DB
1400: classification metadata DB
1700: user data section
1800: other DB
1900: Purpose-specific data section
1910: Applicant DB
1920: rule data portion
1930: language data section
1940: Standard Patent Pool Data Section
200: wired and wireless network
2000: Data Processing Department
20000: Purpose Specialized System
2032: citation analysis data generation unit
2100: key keyword generator
21000: Nich Technology Zone Discovery System
2110: keyword extraction unit
2120: Key Keyword Selection Unit
2130: syntax keyword generation unit
2140: core keyword meta information generation unit
2150: product keyword generator
2200: classification metadata generation unit
22000: Patent Auto Evaluation System
2210: classification keyword core keyword group generation unit
2220: mapping information generation unit between classifications
2230 key keyword stratification unit
2240: Patent automatic classification unit
2300: purpose-specific data generation unit
23000: Future Technology Prediction System
2310: statistical data generation unit
2330: data generation unit for discovery analysis
2400: Similar patent set generation unit
24000: Convergence Information Processing System
2410: clustering processing unit
2500: network data generator
2510: node data generator
2520: edge data generator
300: linkage system
3000: Patent Information Service Support Department
3100: search processing unit
3110: search engine unit
3120: DB query processing unit
3200: member management unit
3210: Member data
3211: Membership Document DB
3212: Member mapping information DB
3213: Member Billing DB
3220: member directory generator
3230 member flag generator
3240: Member mapping information generation unit
3241: member keyword mapping information generation unit
3242: Member Applicant Mapping Information Generator
3300: platform service provider
3310: Web service provider
3311: Web service account management unit
3312: Web service interface unit
4000: patent analysis information generation unit
4100: acquisition target
4200: Patent Analysis Processing Unit
4210: Analytical Indicators DB
4220: Analysis command syntax DB
4300: analysis option processing unit
4310: data limited option processing unit
4310: Comparative Analysis
4320: what if analysis unit
4320: display option processing unit
4330: other option processing unit
4330: Transfer / Consultation Analysis Department
4340: association analysis
4340: Option selector
4341-1: Limited Time
4341-2: State Limited
4341-3: Applicant Limited
4341-4: Inventor Limited
4341-5: Patent Classification Limited
4341-6: Field Attribute Qualifier
4341-7: Individual field value limiter
4341-8: Individual patent attribute value limiter
4341-9: Other Limited
4350: Risk / Opportunity Analysis Department
4360: 2x2 analysis unit
4370: Open Innovation Target Discovery Department
4400: Analysis result reporting unit
4410: table generator
4420: chart generator
4430: diagram generator
4440: report generation unit
4500: set unit citation analysis unit
4510: acquisition patent set generation unit
4511: patent set acquisition department
4512: acquisition patent set limited part
4520: target patent set generation unit
4521: limited target patent set
4522-1: Acquisition of cited patent set
4522-2: Obtained citation patent set
4522: acquisition of target patent set
4523: cited patent set generation unit
4530: citation analysis unit
4531: Citation Analysis Purpose Selection Unit
4532: Citation Analysis Execution Unit
4600: Analysis result reporting unit
4700: network analysis unit
4710: data visualization
5000: Patent dispute prediction information generation system
5100: Dispute Prediction Engine
5110: self-patent set generation unit
5120: target patent set generation unit
5121: Citation Patent Set Generator
5122: similar patent group generation unit
5123: similar technology patent group generation unit
5124: target set acquisition unit
5125: Option processing unit
5130: obtain a dispute prediction model value
5140: dispute prediction information generation unit
5141: Multiple Relationship Processing Module
5142: dispute prediction information value generation module
5143: weight control unit
5150: dispute prediction information value provider
5200: dispute DB
5210: Dispute Generation Patent DB
5220: dispute prediction element value DB
5230: dispute prediction model value DB
5300: Dispute Prediction Information Analysis Engine
5310: Patent Set Division
5320: attack group information generation unit
5330 risk hedging information generation unit
5340: cross-licensing information generation unit
5400: Dispute Prediction Management
5410: Obtaining Dispute Data
5420: Dispute Prediction System Management Department
5421: Dispute prediction information batch generation unit
5430: Dispute prediction user management
5431: Dispute UI Department
5432: User input patent group management unit
5500: Conflict Prediction Model Generation Engine
5510: dispute prediction element value generation unit
5520: dispute prediction model generator
5530: dispute prediction model value generation unit for each patent
5540: dispute prediction model value provider
5600: Dispute Ripple Prediction Engine
5610: New occurrence dispute analysis
5620: New Outbreak Dispute Prediction Unit
5630: Dispute Ripple Notification Unit
6000: Patent Licensing Prediction Information Generation System
6100: Licensing Prediction Engine
6110: self-patent set generation unit
6120: target patent set generation unit
6121: Citation Patent Set Generator
6122: similar patent group generation unit
6123: similar technology patent group generation unit
6124: target set acquisition unit
6125: Option processing unit
6130: license prediction model value acquisition
6140: licensing prediction information generation unit
6141: Multiple Relationship Processing Module
6142: licensing prediction information value generation module
6143: weight control unit
6150: licensing prediction information provider
6200: Licensing DB Department
6210: Dispute Generation Patent DB
6220: Licensing prediction element value DB
6230: licensing prediction model value DB
6300: Licensing Prediction Information Analysis Engine
6310: Patent Set Division
6400: Licensing Prediction Management
6410: Get Dispute Data
6420: Licensing Prediction System Management
6421: licensing prediction information batch generation unit
6430: Licensing Prediction User Management
6431: Licensing UI
6432: User input patent group management unit
6500: Licensing Prediction Model Generation Engine
6510: licensing prediction element value generator
6520: licensing prediction model generation unit
6530: license prediction model value generation unit by patent
6540: licensing prediction model value provider
8400: e-commerce service provider
8410: DB for e-commerce
8420: Electronic Commerce Processing Department
8430: Open Market Generator
8500: Community Service Provider
8600: Billing processing unit
8700: Comment processing unit
8710: Comment Receiving Unit
8720: Comment storage unit
8730: Identity identification unit
8740: Comment delivery unit
8800: Translation Processing Unit

Claims

(i) (i1) (i11) first obtain a target patent set comprising at least one or more patents, and (i12) obtain at least one complementary patent set having a predetermined association with the target set, or (i2) (i21 A) first obtaining a complementary patent set including at least one patent, and (i22) obtaining at least one target patent set having a predetermined relationship with the supplement set;
(j) obtaining at least one dispute prediction model value for each patent for each patent constituting the complementary patent set; And
(k) generating at least one risk hedging prediction information by using the dispute prediction model value for each patent; patent risk hedging prediction information processing method of a patent information system comprising a.

The method of claim 1,
In the step (i12), the complementary patent set is a prior application based on the best day of the target patents constituting the target patent set, or in the step (i22), the target patent set is the supplement that constitutes the target patent set. A patent risk hedging prediction information processing method of a patent information system, characterized in that it is a post-application based on the best date of the patent.

The method of claim 1,
The target patent set is
A patent set having a predetermined relationship with a self patent set including at least one patent, or a partial patent set of a patent set having a predetermined relationship with the self patent set,
A patent set consisting of patents having a preset dispute prediction model value or more, or a partial patent set of a patent set consisting of patents having a predetermined dispute prediction model value or more,
A patent set consisting of patents having at least a predetermined dispute prediction information value in relation to the self patent set, or a portion of a patent set consisting of patents having at least a predetermined dispute prediction information value in relationship with the own patent set. Patent set
A patent set having at least a predetermined dispute prediction information value in relation to the own patent set or a partial patent set of a patent set having at least a predetermined dispute prediction information value in relation to the own patent set Patent risk hedging prediction information processing method of information system.

The method of claim 1,
The target patent and the complementary patent is a related patent relationship consisting of patents having a predetermined relationship,
The preset relation may be at least one of a citation-cited patent relationship, a similar patent group relationship in a text mining technique, and a similar technology patent relationship in a patent classification.
Complementary patents constituting the complementary patent set are those that can have a predetermined relationship with two or more target patents,
The relevance can be quantified, wherein the quantification is configured to include any one or more of the relevance frequency and relevance strength patent risk hedging prediction information processing method of a patent information system.

5. The method of claim 4,
The relevance frequency is the number of target patents having a predetermined relevance corresponding to the supplementary patent, and the relevance strength is one or more of citation-cited patent relationship information, similar patent group relationship information, and similar technology patent relationship information. Patent risk hedging prediction information processing method of a patent information system characterized in that it is generated using.

6. The method of claim 5,
Generating the relevance strength with the citation-cited patent relationship information includes generating using one or more of citation-cited depth information and citation-cited type information.
Generating the relevance strength with the similar patent group relationship information is generated by using similarity information between the target patent and the supplementary patent,
Generating the relevance strength with the similar technology patent relationship information may correspond to at least one or more types of patent classifications included in the target patent and the supplementary patent, the matching depth, the matching frequency, and the like in the patent classification system to which the patent classification belongs. Is generated using any one or more of the matching sequences,
The similarity information may include at least one type of extracted keyword extracted from the target patent and the supplementary patent, citation-cited information generated from the reference information of the target patent and the supplementary patent, and at least one kind extracted from the target patent and the supplementary patent. It is generated by including at least one of the patent classification and at least one or more of the higher patent classification of the patent classification in the patent classification system to which the patent classification belongs,
Generating the relevance strength with the similar technology patent relationship information may correspond to at least one or more types of patent classifications included in the target patent and the supplementary patent, the matching depth, the matching frequency, and the like in the patent classification system to which the patent classification belongs. A patent risk hedging prediction information processing method of a patent information system, characterized in that it is generated utilizing any one or more of the matching sequence.

The method of claim 1,
The generated risk hedging prediction information
Risk hedging prediction information for each target patent corresponding to the complementary set, at least one risk hedging prediction information for each target patent group corresponding to the supplementary set, risk hedging prediction information for the target patent set corresponding to the supplementary set, and at least one or more Patent risk hedging prediction information of the patent information system, characterized in that any one or more of the risk hedging prediction information for each target patent corresponding to the partial complementary set, and at least one or more risk hedging prediction information for each target patent group corresponding to the at least one partial complementary set. Treatment method.

8. The method of claim 7,
The risk hedging prediction information is any one or more of the preset at least one risk hedging prediction information value, at least one licensing prediction analysis information, at least one licensing prediction basis information processing of the patent risk hedging prediction information of the patent information system Way.

The method of claim 1,
Acquiring the complementary patent set in the step (i12) is to obtain only a patent that meets a predetermined condition as a supplementary patent,
Acquiring the complementary patent set in the step (i12) is to obtain the divided partial target patent set, or
Acquiring the complementary patent set of the step (i12) is to obtain for each of the two or more target patent set,
The preset condition is set by the user or set by the patent information system, and the preset condition is any one or more of an owner condition, a relevance condition, an owner's sales amount, an owner's recent dispute complaint information, and an individual patent attribute condition. Will be
When the acquisition of the supplementary patent set is to be obtained for each of the divided partial target patent sets, generating the risk hedging prediction information in the step (k) may be a partial supplement patent set corresponding to the partial target patent set. Is generated by
When the acquisition of the supplementary patent set is obtained for each of the two or more target patent sets, the acquisition of the target patent set in the step (i) is that of the at least two or more target patent sets, wherein the (k) step And generating the risk hedging prediction information is generated for each complementary patent set corresponding to two or more target patent sets.

The method of claim 9,
The predetermined condition in the acquisition of the supplementary patent set is that the patent information system or at least one right attribute attribute condition that the supplementary patent selected or set by the user must satisfy is satisfied,
The rights holder attribute information is any one or more of owner size information, rights holder type information, and whether the user is included in at least one group of related rights holders selected or designated by the user. Way.

The method of claim 1,
Obtaining the target patent set in the step (i22) is to obtain only a patent that meets a predetermined condition as a target patent,
Obtaining the target patent set in the step (i22) is to obtain by the divided partial complementary patent set, or
Obtaining the target patent set in the step (i22) is to obtain for each of two or more complementary patent set,
The preset condition is set by the user or set by the patent information system, and the preset condition is any one or more of an owner condition, a relevance condition, an owner's sales amount, an owner's recent dispute complaint information, and an individual patent attribute condition. Will be
When the acquisition of the target patent set is to be obtained for each of the divided partial complementary patent sets, the generating of the risk hedging prediction information in the step (k) may include partial target patent sets corresponding to the partial supplement patent sets. Is generated by
When the acquisition of the target patent set is obtained for each of two or more complementary patent sets, the acquisition of the supplementary patent set in step (i) is that of at least two or more complementary patent sets, and the (k) step Generating the risk hedging prediction information is generated for each target patent set corresponding to two or more of the complementary patent sets.

The method of claim 1,
At least one of a target patent constituting the target patent set and at least one or more partial target patent sets to which at least one or more division criteria or at least one or more selection criteria are applied has a predetermined ranking,
And said ranking is generated according to a ranking generation rule using at least one of said association frequency, said association strength, and said dispute prediction model value.

The method of claim 1,
Generating the risk hedging prediction information of step (k) is generated by at least two divided or selected complementary patent sets,
The generation of the risk hedging prediction information in the step (k) is to be generated by reflecting the user weight information received from the user,
The selection or division of the complementary patent set is one or more of a method of selecting or dividing by the user or a method of dividing by a predetermined selection criterion or a division criterion of the system at all times. Or the dividing criterion is at least one of the name of the holder of the supplementary patent, the attribute of the owner, the classification of the patent technology, and at least one predetermined evaluation value constituting the supplementary patent set.
The user weight information includes weight information for each supplementary patent set by the user for each supplementary patent with respect to any one or more of the supplementary patents constituting the supplementary patent set, weight information for each attribute set in an individual attribute of a target patent, and citation- Citation-cited weight information set for the citation relationship, text mining weight information set for the text mining relationship, and similar technology patent relationship weight information set for the similar technology patent relationship in the patent classification. Patent risk hedging prediction information processing method of a patent information system.

The method of claim 1,
Obtaining the dispute prediction model value for each patent in the step (j) is performed on the secondary supplementary patents constituting the secondary supplementary patent set generated after performing a predetermined set operation on the obtained primary supplementary patent set. Will be
The secondary supplementary patent set is performed by one or more of deletion of the primary supplementary patent constituting the primary supplementary patent set, limitation on the primary supplementary patent set, and addition of new patents to the secondary supplementary patent set. And a patent risk hedging prediction information processing method of a patent information system, characterized in that it is generated.

The method of claim 1,
The dispute prediction model value for each patent is
Any one or more disputes selected from at least one citation perspective dispute prediction element, at least one multi-conflict patent perspective dispute prediction element, at least one multi-dispute cause perspective dispute prediction element and at least one multi-conflict technology family viewpoint dispute prediction element A method for processing patent risk hedging prediction information of a patent information system, characterized in that it is generated using a dispute prediction element value of the prediction element.

The method of claim 1,
The dispute prediction model value may be assigned to a licensed prediction element value as an explanatory variable value for a licensed patent and a non-licensed patent, and is different from the licensed patent grant value and the licensed patent grant value granted to the licensed patent. And a non-licensing patent grant value assigned as a response variable value to generate a predetermined statistical processing, wherein the patent risk hedging prediction information processing method of the patent information system is generated.

17. The method of claim 16,
The statistical processing is a patent risk hedging prediction information processing method of a patent information system, characterized in that for using the algorithm of the machine learning series using an ensemble technique using a tree.

A target patent set generation unit for obtaining or generating a target patent set including at least one patent;
A supplementary patent set generating unit for obtaining or generating a supplementary patent set including at least one patent;
A dispute prediction model value obtaining unit which obtains at least one dispute prediction model value for each patent for the individual patents constituting the supplementary patent set; And
And a risk prediction information generation unit for generating at least one risk hedging prediction information by using the dispute prediction model value for each patent.

19. The method of claim 18,
And a multiple relationship processing module for calculating a predetermined relationship between a target patent constituting the target patent set and a supplementary patent constituting the supplementary patent set.
The predetermined relationship calculated by the multi-relationship processing module is any one or more of a citation-cited patent relationship, a similar patent group relationship in a text mining technique, and a similar technology patent relationship in a patent classification. Patent information system to process.

A patent information system for processing patent risk hedging prediction information, comprising performing the method of any one of claims 1 to 17.

18. A recording medium having recorded thereon a computer readable program for performing the method of any one of claims 1 to 17.

delete